Midbrain dopamine neurons signal reward prediction error (RPE), or actual minus expected reward. The temporal difference (TD) learning model has been a cornerstone in ...
Substantial evidence suggests that the phasic activities of dopamine (DA) neurons in the primate midbrain represent a temporal difference (TD) error in the ...