I am having problems interpreting the results of the mi.plugin() (or mi.empirical()) function from the entropy package. As far as I understand, an MI of 0 tells you that the two variables you are comparing are completely independent, and as MI increases, the association between the two variables is increasingly non-random.
Why, then, do I get a value of 0 when running the following in R (using the {entropy} package):
mi.plugin( rbind( c(1, 2, 3), c(1, 2, 3) ) )
when I'm comparing two vectors that are exactly the same?
I assume my confusion is based on a theoretical misunderstanding on my part; can someone tell me where I've gone wrong?
Thanks in advance.
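A likely explanation, sketched below on the assumption that mi.plugin() treats its argument as a two-dimensional table of joint bin frequencies rather than as two raw data vectors: the rows of rbind(c(1, 2, 3), c(1, 2, 3)) are proportional, so the joint distribution they encode is exactly the product of its marginals, which is the definition of independence, and the MI of that table is 0. Tabulating the two vectors into a contingency table first gives a non-zero value.
library(entropy)

x <- c(1, 2, 3)
y <- c(1, 2, 3)

# Passed directly, rbind(x, y) is read as a 2x3 joint frequency table; its rows
# are proportional, so the implied joint distribution equals the product of its
# marginals and the mutual information is 0.
mi.plugin(rbind(x, y))     # 0

# Tabulating the raw vectors first builds the joint distribution of the data
# itself; identical vectors then share all of their information.
mi.plugin(table(x, y))     # log(3) = 1.098612 nats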
The mutual information can also be calculated as the KL divergence between the joint probability distribution and the product of the marginal probabilities for each variable. — Page 57, Pattern Recognition and Machine Learning, 2006. This can be stated formally as follows: I(X ; Y) = KL(p(X, Y) || p(X) * p(Y))
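A minimal base-R sketch of that identity, applied to the matrix from the question (nothing beyond base R is assumed):
joint <- rbind(c(1, 2, 3), c(1, 2, 3))
joint <- joint / sum(joint)       # normalize into a joint distribution
px <- rowSums(joint)              # marginal of the first variable
py <- colSums(joint)              # marginal of the second variable
indep <- outer(px, py)            # product of the marginals p(X) * p(Y)

# KL(p(X, Y) || p(X) * p(Y)); every cell is positive here, so 0 * log(0) never arises.
sum(joint * log(joint / indep))   # 0, because joint == indep cell by cell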
Entropy is given by the formula H = −∑ p_i log(p_i), where p_i is the probability of character number i showing up in a stream of characters of the given "script". The entropy ranges from 0 to Inf.
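A quick sketch of that formula in base R (the probability vector is just an assumed example):
p <- c(1, 2, 3) / 6     # assumed character probabilities, summing to 1
-sum(p * log(p))        # H = -sum(p_i * log(p_i)) = 1.011404 nats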
Mutual information (MI) is often used as a generalized correlation measure. It is not clear how much MI adds beyond standard (robust) correlation measures or regression-model-based association measures.
Mutual information is one of many quantities that measure how much one random variable tells us about another. It is a dimensionless quantity with (generally) units of bits, and can be thought of as the reduction in uncertainty about one random variable given knowledge of another.
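That "reduction in uncertainty" reading corresponds to the identity I(X ; Y) = H(X) − H(X | Y) = H(X) + H(Y) − H(X, Y). A small sketch of it, assuming the entropy() and condentropy() estimators from the infotheo package recommended below:
library(infotheo)

x <- c(1, 2, 3)
y <- c(1, 2, 3)

entropy(x) - condentropy(x, y)                        # H(X) - H(X | Y)
entropy(x) + entropy(y) - entropy(data.frame(x, y))   # H(X) + H(Y) - H(X, Y)
mutinformation(x, y)                                  # all three equal log(3) = 1.098612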
Use mutinformation(x, y) from package infotheo.
> mutinformation(c(1, 2, 3), c(1, 2, 3) )
[1] 1.098612
> mutinformation(seq(1:5),seq(1:5))
[1] 1.609438
and the normalized mutual information will be 1.
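infotheo itself does not seem to ship a normalization helper, but one common convention (of several) divides the MI by the geometric mean of the two entropies, which gives 1 here by construction. A hedged sketch:
library(infotheo)

x <- c(1, 2, 3)
y <- c(1, 2, 3)

# NMI = I(X ; Y) / sqrt(H(X) * H(Y)); for identical vectors I = H(X) = H(Y), so NMI = 1.
mutinformation(x, y) / sqrt(entropy(x) * entropy(y))   # 1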