What is the difference between markov chain models and hidden markov model? I've read in Wikipedia, but couldn't understand the differences.

To explain by example, I'll use an example from natural language processing. Imagine you want to know the probability of this sentence: I enjoy coffee In a Markov model, you could estimate its probability by calculating: <pre class="prettyprint"><code>P(WORD = I) x P(WORD = enjoy | PREVIOUS_WORD = I) x P(word = coffee| PREVIOUS_WORD = enjoy) </code></pre> Now, imagine we wanted to know the parts-of-speech tags of this sentence, that is, if a word is a past tense verb, a noun, etc. We did not observe any parts-of-speech tags in that sentence, but we assume they are there. Thus, we calculate what's the probability of the parts-of-speech tag sequence. In our case, the actual sequence is: <blockquote> PRP-VBP-NN </blockquote> (where PRP=“Personal Pronoun”, VBP=“Verb, non-3rd person singular present”, NN=“Noun, singular or mass”. See https://cs.nyu.edu/grishman/jet/guide/PennPOS.html for complete notation of Penn POS tagging) But wait! This is a sequence that we can apply a Markov model to. But we call it hidden, since the parts-of-speech sequence is never directly observed. Of course in practice, we will calculate many such sequences and we'd like to find the hidden sequence that best explains our observation (e.g. we are more likely to see words such as 'the', 'this', generated from the determiner (DET) tag) The best explanation I have ever encountered is in a paper from 1989 by Lawrence R. Rabiner: http://www.cs.ubc.ca/~murphyk/Bayes/rabiner.pdf

What is the difference between markov chains and hidden markov model?

2 Answers

To explain by example, I'll use an example from natural language processing. Imagine you want to know the probability of this sentence:

I enjoy coffee

In a Markov model, you could estimate its probability by calculating:

P(WORD = I) x P(WORD = enjoy | PREVIOUS_WORD = I) x P(word = coffee| PREVIOUS_WORD = enjoy)

Now, imagine we wanted to know the parts-of-speech tags of this sentence, that is, if a word is a past tense verb, a noun, etc.

We did not observe any parts-of-speech tags in that sentence, but we assume they are there. Thus, we calculate what's the probability of the parts-of-speech tag sequence. In our case, the actual sequence is:

PRP-VBP-NN

(where PRP=“Personal Pronoun”, VBP=“Verb, non-3rd person singular present”, NN=“Noun, singular or mass”. See https://cs.nyu.edu/grishman/jet/guide/PennPOS.html for complete notation of Penn POS tagging)

But wait! This is a sequence that we can apply a Markov model to. But we call it hidden, since the parts-of-speech sequence is never directly observed. Of course in practice, we will calculate many such sequences and we'd like to find the hidden sequence that best explains our observation (e.g. we are more likely to see words such as 'the', 'this', generated from the determiner (DET) tag)

The best explanation I have ever encountered is in a paper from 1989 by Lawrence R. Rabiner: http://www.cs.ubc.ca/~murphyk/Bayes/rabiner.pdf

168

answered Oct 10 '22 18:10

matt

Markov model is a state machine with the state changes being probabilities. In a hidden Markov model, you don't know the probabilities, but you know the outcomes.

For example, when you flip a coin, you can get the probabilities, but, if you couldn't see the flips and someone moves one of five fingers with each coin flip, you could take the finger movements and use a hidden Markov model to get the best guess of coin flips.

answered Oct 10 '22 20:10

TechEffigy

Related questions
                            
                                Hidden Markov in PyMC3
                            
                                Hidden Markov Model Multiple Observation values for each state
                            
                                Exact Hidden Markov Model training algorithm
                            
                                How do I have to train a HMM with Baum-Welch and multiple observations?
                            
                                Hidden Markov Model predicting next observation
                            
                                What is the difference between K-means clustering and vector quantization?
                            
                                What machine learning algorithm is appropriate for predicting one time-series from another?
                            
                                Prediction step for time series using continuous hidden Markov models
                            
                                Unsupervised HMM training in NLTK
                            
                                Viterbi training or Baum-Welch algorithm to estimate the transition and emission probabilities?
                            
                                How to find the most likely sequences of hidden states for a Hidden Markov Model
                            
                                hidden markov model thresholding
                            
                                Any Matlab functions out there for handling Hidden Markov Models with continuous observation variables?
                            
                                Finding the top - k viterbi paths in HMM
                            
                                Hidden Markov Models [closed]
                            
                                Decoding sequences in a GaussianHMM
                            
                                Hidden Markov Models with C++ [closed]
                            
                                Issue in training hidden markov model and usage for classification
                            
                                Hidden Markov Model for multiple observed variables
                            
                                Fitting a scikits.learn.hmm.GaussianHMM to variable length training sequences

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

What is the difference between markov chains and hidden markov model?

Tags:

markov-chains

hidden-markov-models

markov

good_evening

People also ask

2 Answers

matt

TechEffigy

Recent Activity

Donate For Us