AI with Python: Building a Learning Agent
To build a reinforcement learning agent, we will use the OpenAI Gym package, as shown below:
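import gym

# Create the CartPole environment. This example uses the classic Gym API,
# where reset() returns an observation and step() returns four values.
env = gym.make('CartPole-v0')

for _ in range(20):  # run 20 episodes
    observation = env.reset()  # start a new episode
    for i in range(100):  # at most 100 timesteps per episode
        env.render()  # draw the current state of the cart and pole
        print(observation)
        action = env.action_space.sample()  # pick a random action
        observation, reward, done, info = env.step(action)
        if done:  # the pole fell over or the cart left the track
            print("Episode finished after {} timesteps".format(i+1))
            break

env.close()  # release the rendering window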

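When you run this, you can watch the cart trying to keep the pole balanced. Because the actions here are sampled at random, the agent is not learning yet, and each episode usually ends well before the 100-timestep limit, once the pole tips too far or the cart leaves the track.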
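To make the agent-environment loop above easier to experiment with, here is a minimal sketch that does not need Gym installed. It assumes the standard cart-pole equations of motion with commonly used constants. The names TinyCartPole, random_policy, lean_policy, and run_episode are hypothetical and are not part of Gym. It is an illustration of the reset/step/done cycle, not a replacement for the real environment.

```python
import math
import random


class TinyCartPole:
    """Hypothetical stand-in for a cart-pole environment (not part of Gym)."""

    GRAVITY, CART_MASS, POLE_MASS = 9.8, 1.0, 0.1
    HALF_LENGTH, FORCE, DT = 0.5, 10.0, 0.02  # assumed constants
    X_LIMIT, THETA_LIMIT = 2.4, math.radians(12)

    def __init__(self, seed=None):
        self.rng = random.Random(seed)
        self.state = None

    def reset(self):
        # Start near upright with small random perturbations.
        self.state = [self.rng.uniform(-0.05, 0.05) for _ in range(4)]
        return tuple(self.state)

    def step(self, action):
        x, x_dot, theta, theta_dot = self.state
        force = self.FORCE if action == 1 else -self.FORCE
        total_mass = self.CART_MASS + self.POLE_MASS
        pole_ml = self.POLE_MASS * self.HALF_LENGTH
        sin_t, cos_t = math.sin(theta), math.cos(theta)
        temp = (force + pole_ml * theta_dot ** 2 * sin_t) / total_mass
        theta_acc = (self.GRAVITY * sin_t - cos_t * temp) / (
            self.HALF_LENGTH
            * (4.0 / 3.0 - self.POLE_MASS * cos_t ** 2 / total_mass)
        )
        x_acc = temp - pole_ml * theta_acc * cos_t / total_mass
        # Explicit Euler integration over one timestep.
        x += self.DT * x_dot
        x_dot += self.DT * x_acc
        theta += self.DT * theta_dot
        theta_dot += self.DT * theta_acc
        self.state = [x, x_dot, theta, theta_dot]
        # The episode ends when the cart or the pole leaves its allowed range.
        done = abs(x) > self.X_LIMIT or abs(theta) > self.THETA_LIMIT
        return tuple(self.state), 1.0, done, {}


def random_policy(observation, rng):
    # Same idea as action_space.sample(): push left (0) or right (1) at random.
    return rng.randrange(2)


def lean_policy(observation, rng):
    # Hand-written rule: push the cart toward the side the pole is leaning.
    return 1 if observation[2] > 0 else 0


def run_episode(env, policy, rng, max_steps=200):
    """Run one episode and return the number of timesteps it lasted."""
    observation = env.reset()
    for t in range(max_steps):
        action = policy(observation, rng)
        observation, reward, done, info = env.step(action)
        if done:
            return t + 1
    return max_steps


if __name__ == "__main__":
    rng = random.Random(0)
    env = TinyCartPole(seed=0)
    for name, policy in [("random", random_policy), ("lean", lean_policy)]:
        lengths = [run_episode(env, policy, rng) for _ in range(20)]
        print(name, "average timesteps:", sum(lengths) / len(lengths))
```

Neither policy learns anything. The fixed rule in lean_policy only shows how the choice of action affects episode length. A reinforcement learning agent would instead adjust its policy from the rewards returned by step().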