Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Are there examples of using reinforcement learning for text classification?

Imagine a binary classification problem like sentiment analysis. Since we have the labels, cant we use the gap between actual - predicted as reward for RL ?

I wish to try Reinforcement Learning for Classification Problems

like image 696
Anuj Gupta Avatar asked Jun 20 '17 12:06

Anuj Gupta


1 Answers

Interesting thought! According to my knowledge it can be done.

  1. Imitation Learning - On a high level it is observing sample trajectories performed by the agent in the environment and use it to predict the policy given a particular stat configuration. I prefer Probabilistic Graphical Models for the prediction since I have more interpretability in the model. I have implemented a similar algorithm from the research paper: http://homes.soic.indiana.edu/natarasr/Papers/ijcai11_imitation_learning.pdf

  2. Inverse Reinforcement Learning - Again a similar method developed by Andrew Ng from Stanford to find the reward function from sample trajectories, and the reward function can be used to frame the desirable actions. http://ai.stanford.edu/~ang/papers/icml00-irl.pdf

like image 76
vikky 2405 Avatar answered Oct 06 '22 06:10

vikky 2405