对于构建强化学习代理,我们将使用如下所示的 OpenAI Gym 包 –
import gym
env = gym.make('CartPole-v0')
for _ in range(20):
observation = env.reset()
for i in range(100):
env.render()
print(observation)
action = env.action_space.sample()
observation, reward, done, info = env.step(action)
if done:
print("Episode finished after {} timesteps".format(i+1))
break
观察小推车可以平衡。


![晴川云Minecraft Wiki教程:初始资源[ ],晴川云](https://baike.qcidc.com/wp-content/uploads/2025/09/20250919082155456-u_2150650237_2232205033fm_253fmt_autoapp_138f_JPEG.jpeg)







暂无评论内容