AI/ML, Robotics

What do policy and policy search mean?

Policy – The agent’s action selection is modeled as a map called the “policy”. A (stochastic) policy π gives the probability of taking action a when in state s, where S is the set of states and A is the set of actions.

$$\pi : S \times A \rightarrow [0, 1]$$

$$\pi(a \mid s) = P(a_t = a \mid s_t = s)$$
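To make the definition concrete, here is a minimal sketch of a tabular stochastic policy in Python. The state names, action names, preference values, and the softmax parameterization are all assumptions chosen for illustration; the definition above only requires that, for each state, the probabilities over actions lie in [0, 1] and sum to 1.

```python
import math
import random


def softmax_policy(preferences):
    """Turn per-state action preferences into probabilities pi(a|s).

    `preferences` maps state -> {action: real-valued preference}.
    The softmax is one possible parameterization, not the only one.
    """
    policy = {}
    for state, prefs in preferences.items():
        m = max(prefs.values())  # subtract the max for numerical stability
        exps = {a: math.exp(p - m) for a, p in prefs.items()}
        total = sum(exps.values())
        policy[state] = {a: e / total for a, e in exps.items()}
    return policy


def sample_action(policy, state, rng=random):
    """Draw an action a with probability pi(a|s) for the given state."""
    actions = list(policy[state])
    weights = [policy[state][a] for a in actions]
    return rng.choices(actions, weights=weights, k=1)[0]


# Hypothetical two-state, two-action example.
preferences = {
    "s0": {"left": 0.0, "right": 1.0},
    "s1": {"left": 2.0, "right": 0.0},
}
pi = softmax_policy(preferences)
action = sample_action(pi, "s0", random.Random(0))
```

Here `pi["s0"]["right"]` plays the role of π(right | s0), and `sample_action` is how an agent following this policy would pick its next action.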

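Policy Search – A family of reinforcement learning methods that learn a policy by searching directly in (some subset of) the policy space, rather than deriving the policy from a learned value function.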
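As a rough illustration of what searching directly in policy space can look like, the sketch below runs random-perturbation hill climbing over the parameters of a small stochastic policy. Everything here is an assumption made for the example: the one-step task, the reward table, the logistic parameterization, and the choice of hill climbing itself, which is only one simple instance of policy search and not necessarily one of the methods covered in the tutorial.

```python
import math
import random

# Hypothetical one-step task: reward for each (state, action) pair.
# Both states are assumed to be equally likely.
REWARDS = {("s0", 0): 0.0, ("s0", 1): 1.0, ("s1", 0): 1.0, ("s1", 1): 0.0}
STATES = ["s0", "s1"]


def prob_action1(theta, state):
    """pi(a=1 | s) under a logistic parameterization with one logit per state."""
    return 1.0 / (1.0 + math.exp(-theta[state]))


def expected_return(theta):
    """Exact expected reward of the policy defined by theta."""
    total = 0.0
    for s in STATES:
        p1 = prob_action1(theta, s)
        total += (1.0 - p1) * REWARDS[(s, 0)] + p1 * REWARDS[(s, 1)]
    return total / len(STATES)


def hill_climb(iterations=200, step=0.5, seed=0):
    """Perturb the policy parameters at random; keep only improvements."""
    rng = random.Random(seed)
    theta = {s: 0.0 for s in STATES}  # start from the uniform policy
    best = expected_return(theta)
    for _ in range(iterations):
        candidate = {s: theta[s] + rng.gauss(0.0, step) for s in STATES}
        value = expected_return(candidate)
        if value > best:
            theta, best = candidate, value
    return theta, best


theta, best = hill_climb()
```

The search never estimates a value function: it only evaluates whole policies and moves toward better ones. The "subset of the policy space" being searched is the set of policies expressible by the two logits in `theta`.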

The picture below presents some policy search methods; the material is from the policy search tutorial at ICML 2015.

[Figure: policy_search_class (classification of policy search methods)]
