Temporal Difference TD Learning

Dopamine reward prediction errors reflect hidden-state inference across time

Midbrain dopamine neurons signal reward prediction error (RPE), or actual minus expected reward. The temporal difference (TD) learning model has been a cornerstone in ...

Princeton University

The Effects of Uncertainty on TD Learning

Substantial evidence suggests that the phasic activities of dopamine (DA) neurons in the primate midbrain represent a temporal difference (TD) error in the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Dopamine reward prediction errors reflect hidden-state inference across time

The Effects of Uncertainty on TD Learning

Trending now