Question about the current implementation of mobility regularity-aware loss #6

mengcz13 · 2022-12-28T06:13:29Z

Line 165 in 93e6837

gloss += dloss_alpha * dl

Is the current implementation of mobility regularity-aware loss correct? Now both $L_d$ and $L_p$ are directly added to the loss for calculating policy gradient, but they won't produce any gradient on the input (generated sequences, since they are discrete values).

I guess the correct way is to add them to the reward instead. Is that right?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about the current implementation of mobility regularity-aware loss #6

Question about the current implementation of mobility regularity-aware loss #6

mengcz13 commented Dec 28, 2022

Question about the current implementation of mobility regularity-aware loss #6

Question about the current implementation of mobility regularity-aware loss #6

Comments

mengcz13 commented Dec 28, 2022