
Most RL work assumes the action space is discrete; what about continuous actions?


It is true that most RL work has considered discrete action spaces, but this was usually done for convenience, not as an essential limitation of the ideas; and there are exceptions. Nevertheless, it is often not obvious how to extend RL methods to continuous, or even large discrete, action spaces. The key problem is that RL methods typically involve a max or sum over elements of the action space, which is not feasible if the space is large or infinite. The natural approach is to replace the enumeration of actions with a sample of them, and average (just as we replace the enumeration of possible next states with a sample of them, and average). This requires either a very special structure for the action-value function, or else a stored representation of the best known policy. Actor-critic methods are one approach. With no attempt to be exhaustive, some of the earlier RL research with continuous actions includes:

• Williams, R.J. (1992). Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8, 229-256.
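To make the sampling idea concrete, here is a minimal sketch in the actor-critic spirit described above. It assumes a one-step "continuous-armed bandit" with reward r(a) = -(a - 2.0)^2, a Gaussian policy with a learnable mean and fixed standard deviation as the stored policy, and a scalar running baseline standing in for the critic; none of these specifics come from the original answer, they are just illustrative choices. The point is that no max over actions is ever taken: an action is sampled from the stored policy and the policy parameter is nudged along the policy gradient.

```python
import numpy as np

rng = np.random.default_rng(0)

TARGET = 2.0          # unknown optimal action of the hypothetical bandit
SIGMA = 0.5           # fixed exploration noise of the Gaussian policy
ALPHA_ACTOR = 0.05    # actor step size
ALPHA_CRITIC = 0.1    # critic (baseline) step size

mu = 0.0              # stored policy parameter: mean of the Gaussian
baseline = 0.0        # critic: running estimate of expected reward

for step in range(2000):
    # Sample a continuous action from the policy instead of enumerating actions.
    a = rng.normal(mu, SIGMA)

    # One-step reward from the hypothetical environment.
    r = -(a - TARGET) ** 2

    # "Advantage": how much better the sampled action did than the critic expected.
    delta = r - baseline

    # Critic update: move the baseline toward the observed reward.
    baseline += ALPHA_CRITIC * delta

    # Actor update: policy-gradient step, using
    # d/d(mu) log N(a | mu, sigma^2) = (a - mu) / sigma^2.
    mu += ALPHA_ACTOR * delta * (a - mu) / SIGMA ** 2

print(f"learned mean action: {mu:.3f} (target {TARGET})")
```

In a full actor-critic method the same structure carries over to sequential problems: the baseline becomes a learned state-value function, the Gaussian mean (and possibly the variance) becomes a function of the state, and the one-step advantage is replaced by a TD error.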
