Should athletes get the COVID-19 vaccine first?

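A robot can choose the left route or the right route, with rewards of +1 and +2 respectively.
A simple example: the robot has to decide between the left path and the right path. With a small γ it goes left (living in the moment); with a large γ it goes right (delayed gratification). You can even compute the critical value of γ at which the choice flips.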
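To make the critical point concrete, here is a minimal sketch. The rewards (+1 left, +2 right) come from the example; the step counts are not given in the text, so `steps_left=1` and `steps_right=3` are assumptions chosen purely for illustration, as are the function names. It also assumes a reward reached after `n` steps is discounted by γ^n.

```python
def discounted_return(reward: float, steps: int, gamma: float) -> float:
    """Value of a single reward received after `steps` moves."""
    return (gamma ** steps) * reward


def critical_gamma(r_left: float, steps_left: int,
                   r_right: float, steps_right: int) -> float:
    """Gamma at which both routes are worth the same.

    Solves r_left * g**steps_left == r_right * g**steps_right for g.
    Assumes the right route is longer and pays more.
    """
    if steps_right <= steps_left or r_right <= r_left:
        raise ValueError("expected the right route to be longer and pay more")
    return (r_left / r_right) ** (1.0 / (steps_right - steps_left))


def best_route(gamma: float, r_left: float = 1.0, steps_left: int = 1,
               r_right: float = 2.0, steps_right: int = 3) -> str:
    """Pick the route with the higher discounted return (step counts are assumed)."""
    left = discounted_return(r_left, steps_left, gamma)
    right = discounted_return(r_right, steps_right, gamma)
    return "left" if left >= right else "right"


if __name__ == "__main__":
    g_star = critical_gamma(1.0, 1, 2.0, 3)
    print(f"critical gamma: {g_star:.4f}")
    for g in (0.5, 0.9):
        print(g, best_route(g))
```

Under these assumed step counts the routes tie when γ² = 1/2, so a robot with γ below roughly 0.71 goes left and one above it goes right. Different path lengths shift the threshold but not the shape of the argument.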


Hi there! I’m Jiayu Liu, currently an engineering manager at Airbnb China, located in Beijing.
