
Python libraries for on-line machine learning MDP

I am trying to devise an iterative Markov decision process (MDP) agent in Python with the following characteristics:

  • observable state
    • I handle a potential 'unknown' state by reserving part of the state space for answering query-type moves made by the agent (the state at t+1 identifies the previous query, or zero if the previous move was not a query, along with the embedded result vector); this space is padded with 0s to a fixed length so the state frame stays aligned regardless of which query was answered (their data lengths may vary)
  • actions that may not always be available at all states
  • reward function may change over time
  • policy convergence should be incremental and computed only per move

So the basic idea is that the MDP should make its best optimized move at T using its current probability model (and since it's probabilistic, the move it makes is expectedly stochastic, implying possible randomness), couple the new input state at T+1 with the reward from the previous move at T, and re-evaluate the model. Convergence must not be permanent, since the reward may modulate or the available actions could change.
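To make the requirements concrete, here is a minimal sketch of a per-move tabular Q-learning agent covering the points above: action selection is restricted to whatever actions are currently available, and a constant step size means the value estimates keep tracking a reward function that drifts over time. All class and method names here are hypothetical, not from any existing library:

```python
import random
from collections import defaultdict

class OnlineQAgent:
    """Sketch of an incremental, per-move agent (hypothetical, not a library API)."""

    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.q = defaultdict(float)  # (state, action) -> estimated value
        self.alpha = alpha           # constant step size: never fully converges,
                                     # so it can track a changing reward function
        self.gamma = gamma           # discount factor
        self.epsilon = epsilon       # exploration rate

    def choose(self, state, available_actions):
        # Epsilon-greedy, restricted to the actions legal in this state.
        if random.random() < self.epsilon:
            return random.choice(available_actions)
        return max(available_actions, key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state, next_available):
        # One incremental backup per observed transition -- no full policy solve.
        best_next = max((self.q[(next_state, a)] for a in next_available),
                        default=0.0)
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])
```

The loop at run time would be: `a = agent.choose(s, avail)`, act, observe `r` and `s'`, then `agent.update(s, a, r, s', avail')` before the next move.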

What I'd like to know is whether there are any current Python libraries (preferably cross-platform, as I necessarily change environments between Windoze and Linux) that can already do this sort of thing (or may support it with suitable customization, e.g. derived-class support that allows redefining, say, the reward method with one's own).

I'm finding that information about on-line, per-move MDP learning is rather scarce. Most uses of MDPs that I can find seem to focus on solving the entire policy as a preprocessing step.

Brian Jack asked Feb 05 '12


2 Answers

I am a grad student doing a lot of MCMC work in Python, and to my knowledge nothing implements MDPs directly. The closest thing I am aware of is PyMC. Digging around the documentation provided this, which gives some advice on extending their classes. They definitely don't have rewards, etc., available out of the box.

If you're serious about developing something good, you might consider extending and subclassing the PyMC classes to create your decision processes; then you could get it included in the next update of PyMC and help out lots of future folks.

ely answered Sep 22 '22

Here is a Python toolbox for MDPs.

Caveat: it's for vanilla textbook MDPs, not for partially observable MDPs (POMDPs) or any kind of non-stationarity in rewards.

Second caveat: I found the documentation to be really lacking. You have to look at the Python code if you want to know what it implements, or you can take a quick look at the documentation for the similar toolbox they have for MATLAB.
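For contrast with the question's per-move requirement, a "vanilla textbook MDP" of the kind this toolbox targets is solved offline, computing the whole policy up front. A sketch with made-up two-state, two-action dynamics in plain NumPy (not the toolbox's own API):

```python
import numpy as np

# Hypothetical dynamics: P[a][s][s'] = transition probability, R[s][a] = reward.
P = np.array([[[0.8, 0.2], [0.1, 0.9]],   # action 0
              [[0.5, 0.5], [0.3, 0.7]]])  # action 1
R = np.array([[1.0, 0.0],
              [0.0, 2.0]])

gamma = 0.9
V = np.zeros(2)
for _ in range(500):                        # value iteration to convergence
    Q = R + gamma * np.einsum('ast,t->sa', P, V)  # Q[s][a] backup over s'
    V = Q.max(axis=1)

policy = Q.argmax(axis=1)                   # entire policy fixed before acting
```

This is exactly the "solve the policy as a preprocessing step" pattern the question wants to avoid: the model P and R must be fully known and stationary before the agent makes a single move.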

kitchenette answered Sep 23 '22