I have implemented the value iteration algorithm for a simple Markov decision process (Wikipedia) in Python. To keep the structure (states, actions, transitions, rewards) of the particular Markov process and to iterate over it, I have used the following data structures:
a dictionary mapping each state to the actions available in it:
SA = {'state A': {'action 1', 'action 2', ...}, ...}
a dictionary for transition probabilities:
T = {('state A', 'action 1'): {'state B': probability, ...}, ...}
a dictionary for rewards:
R = {('state A', 'action 1'): {'state B': reward, ...}, ...}
My question is: is this the right approach? What are the most suitable data structures (in Python) for an MDP?
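For concreteness, a tiny two-state instance in these structures might look like this (the state and action names are made up):

# Hypothetical two-state MDP in the structures above.
SA = {
    'state A': {'action 1', 'action 2'},
    'state B': {'action 1'},
}

# T[(s, a)][s2] = probability of reaching s2 after taking a in s
T = {
    ('state A', 'action 1'): {'state A': 0.3, 'state B': 0.7},
    ('state A', 'action 2'): {'state B': 1.0},
    ('state B', 'action 1'): {'state A': 1.0},
}

# R[(s, a)][s2] = reward received for that transition
R = {
    ('state A', 'action 1'): {'state A': 0.0, 'state B': 1.0},
    ('state A', 'action 2'): {'state B': 0.5},
    ('state B', 'action 1'): {'state A': 0.0},
}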
An MDP is defined by (S, A, P, R, γ), where S is the set of states, A the set of actions, P the transition probabilities, R the rewards, and γ the discount factor. It is essentially an MRP (Markov reward process) with actions.
In mathematics, a Markov decision process (MDP) is a discrete-time stochastic control process. It provides a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker.
The two classical algorithms used to determine the optimal policy for an MDP are the policy iteration algorithm and the value iteration algorithm, which is presented below [8].
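A rough sketch of value iteration over the dictionary structures from the question could look like the following (gamma, the discount factor, and theta, the convergence threshold, are assumed parameters; this is illustrative rather than the listing cited above):

def value_iteration(SA, T, R, gamma=0.9, theta=1e-6):
    # Start with a value of zero for every state.
    V = {s: 0.0 for s in SA}
    while True:
        delta = 0.0
        for s, actions in SA.items():
            # Bellman optimality backup: best expected return over actions.
            v_new = max(
                sum(p * (R[(s, a)][s2] + gamma * V[s2])
                    for s2, p in T[(s, a)].items())
                for a in actions
            )
            delta = max(delta, abs(v_new - V[s]))
            V[s] = v_new
        # Stop once no state value changes by more than theta.
        if delta < theta:
            return V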
I have implemented Markov decision processes in Python before and found the following code useful.
http://aima.cs.berkeley.edu/python/mdp.html
This code is taken from Artificial Intelligence: A Modern Approach by Stuart Russell and Peter Norvig.
Whether a data structure is suitable or not mostly depends on what you do with the data. You mention that you want to iterate over the process, so optimize your data structure for this purpose.
Transitions in Markov processes are often modeled by matrix multiplications. The transition probabilities P_a(s1, s2) and the rewards R_a(s1, s2) could be described by (potentially sparse) matrices P_a and R_a indexed by the states. I think this would have a few advantages, for example vectorizing the value-iteration update and reusing sparse linear-algebra routines.
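A sketch of that representation with NumPy (the states are assumed to be enumerated 0..n-1 and the actions 0..m-1; the numbers are illustrative):

import numpy as np

n_states, n_actions = 2, 2

# P[a, s, s2] = probability of moving from s to s2 under action a;
# R[a, s, s2] = reward for that transition. Dense arrays here, but
# scipy.sparse matrices work for large, mostly-empty MDPs.
P = np.zeros((n_actions, n_states, n_states))
R = np.zeros((n_actions, n_states, n_states))

P[0, 0] = [0.3, 0.7]      # action 0 from state 0
P[0, 1] = [1.0, 0.0]      # action 0 from state 1
P[1] = np.eye(n_states)   # action 1 keeps the current state
R[0, 0, 1] = 1.0          # reward for the 0 -> 1 transition under action 0

gamma = 0.9
V = np.zeros(n_states)
for _ in range(100):
    # One Bellman backup for all states at once:
    # Q[a, s] = sum over s2 of P[a, s, s2] * (R[a, s, s2] + gamma * V[s2])
    Q = (P * (R + gamma * V)).sum(axis=2)
    V = Q.max(axis=0)

With this layout a full sweep of value iteration is a handful of vectorized array operations instead of nested Python loops.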