I asked GPT, and it says that some methods update the value function at every step, while others only update it at steps that were not exploration steps. I know these two approaches are quite different, yet both seem to work. Why is that? What is the essential distinction between them? Any opinions are welcome, and any discussion would be great!
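
To make the question concrete, here is a minimal sketch of how I understand the two variants, assuming a tabular TD(0)-style backup. The function name `td0_update` and the `explored` and `skip_exploratory` flags are hypothetical names chosen for illustration, not from any library, and this may not match exactly what GPT had in mind.

```python
def td0_update(values, state, next_state, reward, explored,
               skip_exploratory, alpha=0.1, gamma=1.0):
    """Apply one tabular TD(0) backup in place; return True if it ran.

    explored: True if the action taken in `state` was a random
        (exploratory) one rather than the greedy one.
    skip_exploratory: hypothetical switch between the two variants
        described in the question.
    """
    if skip_exploratory and explored:
        # Variant 2: do not learn from exploratory steps.
        return False
    # Variant 1 (or a greedy step): standard TD(0) backup.
    td_error = reward + gamma * values[next_state] - values[state]
    values[state] += alpha * td_error
    return True


# The same exploratory transition s0 -> s1 under both variants.
v_all = {"s0": 0.0, "s1": 1.0}
v_skip = {"s0": 0.0, "s1": 1.0}
td0_update(v_all, "s0", "s1", reward=0.0, explored=True, skip_exploratory=False)
td0_update(v_skip, "s0", "s1", reward=0.0, explored=True, skip_exploratory=True)
```

On this exploratory transition, the first variant moves the value of `s0` toward the value of `s1`, while the second leaves `s0` unchanged. On a greedy step the two behave identically.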

