Differential replay of reward and punishment paths predicts approach and avoidance - Xiaoke Robot (小柯机器人) - ScienceNet (科学网)

2023-04-11 16:23

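Jessica McFadyen and colleagues at University College London, UK, found that differential replay of reward and punishment paths predicts approach and avoidance. The study was published online in Nature Neuroscience on April 5, 2023.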

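Using magnetoencephalography (MEG), the researchers studied replay in human participants while they planned to either approach or avoid an uncertain environment containing paths that led to reward or punishment. During planning, they found evidence for forward sequential replay, with rapid state-to-state transitions from 20 to 90 ms. Relative to aversive paths, replay of rewarding paths was boosted before a decision to avoid and attenuated before a decision to approach.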

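A trial-by-trial bias toward replaying prospective punishing paths predicted irrational decisions to approach riskier environments, an effect that was more pronounced in participants with higher trait anxiety. The findings indicate a coupling of replay with planned behavior, in which replay prioritizes an online representation of the worst-case scenario for approaching or avoiding.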

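By way of background, neural replay is implicated in planning, where states relevant to a task goal are rapidly reactivated in sequence. It has remained unclear whether, during planning, replay relates to an actual prospective choice.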

Appendix: original English text

Title: Differential replay of reward and punishment paths predicts approach and avoidance

Author: Jessica McFadyen, Yunzhe Liu, Raymond J. Dolan

Issue&Volume: 2023-04-05

Abstract: Neural replay is implicated in planning, where states relevant to a task goal are rapidly reactivated in sequence. It remains unclear whether, during planning, replay relates to an actual prospective choice. Here, using magnetoencephalography (MEG), we studied replay in human participants while they planned to either approach or avoid an uncertain environment containing paths leading to reward or punishment. We find evidence for forward sequential replay during planning, with rapid state-to-state transitions from 20 to 90 ms. Replay of rewarding paths was boosted, relative to aversive paths, before a decision to avoid and attenuated before a decision to approach. A trial-by-trial bias toward replaying prospective punishing paths predicted irrational decisions to approach riskier environments, an effect more pronounced in participants with higher trait anxiety. The findings indicate a coupling of replay with planned behavior, where replay prioritizes an online representation of a worst-case scenario for approaching or avoiding.

DOI: 10.1038/s41593-023-01287-7

Source: https://www.nature.com/articles/s41593-023-01287-7

Nature Neuroscience: founded in 1998 and part of the Springer Nature publishing group. Latest IF: 28.771
Official website: https://www.nature.com/neuro/
Submission link: https://mts-nn.nature.com/cgi-bin/main.plex

Article in this issue: Nature Neuroscience, published online
