Is there a way to print a trained decision tree in scikit-learn? I want to train a decision tree for my thesis and I want to put the picture of the tree in the thesis. Is that possible?


Is it possible to print the decision tree in scikit-learn?

2 Answers

There is a function that exports the tree to Graphviz format: http://scikit-learn.org/stable/modules/generated/sklearn.tree.export_graphviz.html

From the online docs:

>>> from sklearn.datasets import load_iris
>>> from sklearn import tree
>>>
>>> clf = tree.DecisionTreeClassifier()
>>> iris = load_iris()
>>>
>>> clf = clf.fit(iris.data, iris.target)
>>> tree.export_graphviz(clf,
...     out_file='tree.dot')
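
For a thesis figure, the exported tree is easier to read when its nodes carry feature and class names. The following is a minimal sketch, assuming a scikit-learn version in which export_graphviz accepts out_file=None and returns the DOT source as a string; random_state=0 is only there to make the tree reproducible and is not part of the original example.

```python
from sklearn import tree
from sklearn.datasets import load_iris

iris = load_iris()
clf = tree.DecisionTreeClassifier(random_state=0).fit(iris.data, iris.target)

# With out_file=None, export_graphviz returns the DOT source instead of writing a file.
dot_source = tree.export_graphviz(
    clf,
    out_file=None,
    feature_names=iris.feature_names,  # label splits with column names
    class_names=iris.target_names,     # label nodes with the majority class
    filled=True,                       # colour nodes by majority class
    rounded=True,                      # rounded node boxes
)

# Show the first line of the DOT source as a quick sanity check.
print(dot_source.splitlines()[0])
```

The string can be saved to tree.dot and rendered from the command line with Graphviz, for example `dot -Tpdf tree.dot -o tree.pdf`.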

Then you can load this using Graphviz, or, if you have pydot installed, you can render it more directly: http://scikit-learn.org/stable/modules/tree.html

>>> from io import StringIO  # sklearn.externals.six is gone from recent scikit-learn releases
>>> import pydot 
>>> dot_data = StringIO() 
>>> tree.export_graphviz(clf, out_file=dot_data) 
>>> graph = pydot.graph_from_dot_data(dot_data.getvalue()) 
>>> graph.write_pdf("iris.pdf")

This writes the rendered tree to iris.pdf. I can't display the image here, so follow the link to see an SVG version of it: http://scikit-learn.org/stable/_images/iris.svg
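
If a plain-text view of the tree is enough, it can be printed without Graphviz at all. This is a sketch, assuming scikit-learn 0.21 or later, where sklearn.tree.export_text is available; max_depth=2 and random_state=0 are illustrative choices to keep the output short and reproducible.

```python
from sklearn.datasets import load_iris
from sklearn.tree import DecisionTreeClassifier, export_text

iris = load_iris()
clf = DecisionTreeClassifier(max_depth=2, random_state=0).fit(iris.data, iris.target)

# export_text returns the decision rules as an indented, human-readable string.
rules = export_text(clf, feature_names=iris.feature_names)
print(rules)
```

The same module also provides plot_tree, which draws the tree with matplotlib instead of Graphviz.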

Update

The behaviour seems to have changed since I first answered this question: pydot.graph_from_dot_data now returns a list, so you get this error:

AttributeError: 'list' object has no attribute 'write_pdf'

When you see this, it's worth printing the object to inspect it; most likely what you want is its first element:

graph[0].write_pdf("iris.pdf")

Thanks to @NickBraunagel for the comment
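
Code that has to run against both the older and the newer behaviour can normalise the return value first. The helper below is a hypothetical sketch (first_graph is not part of pydot); it only assumes that the result is either a single graph object or a list of them, as described in the update above.

```python
def first_graph(result):
    """Return one graph, whether `result` is a single graph or a list of graphs."""
    if isinstance(result, (list, tuple)):
        if not result:
            raise ValueError("no graphs were parsed from the DOT data")
        return result[0]
    return result

# Intended use with the objects from the answer above (needs pydot and Graphviz):
#   graph = first_graph(pydot.graph_from_dot_data(dot_data.getvalue()))
#   graph.write_pdf("iris.pdf")

# Stand-in values show the behaviour without requiring pydot.
print(first_graph(["g0", "g1"]))
print(first_graph("g0"))
```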


answered Oct 02 '22 00:10

EdChum

Although I'm late to the game, the step-by-step instructions below may be useful for others who want to display decision tree output:

Install the necessary modules:

1. Install Graphviz. I used conda's package (recommended over pip install graphviz, because the pip package doesn't include the actual Graphviz executables).
2. Install pydot via pip (pip install pydot).
3. Add the Graphviz directory containing the executables (e.g. dot.exe) to your PATH environment variable.
4. Run EdChum's code above (note: graph is a list containing the pydot.Dot object):

from sklearn.datasets import load_iris
from sklearn import tree
from io import StringIO  # sklearn.externals.six is gone from recent scikit-learn releases
import pydot 

clf = tree.DecisionTreeClassifier()
iris = load_iris()
clf = clf.fit(iris.data, iris.target)

dot_data = StringIO() 
tree.export_graphviz(clf, out_file=dot_data) 
graph = pydot.graph_from_dot_data(dot_data.getvalue()) 

graph[0].write_pdf("iris.pdf")  # must access graph's first element

You'll now find iris.pdf in the current working directory.
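
If write_pdf fails, a common cause is that the Graphviz executables from steps 1 and 3 are not on PATH. A quick way to check, sketched here with only the standard library (the messages are illustrative):

```python
import shutil

# shutil.which searches PATH the same way the shell does and returns None if nothing is found.
dot_path = shutil.which("dot")

if dot_path is None:
    print("Graphviz 'dot' was not found on PATH; pydot will not be able to render the tree.")
else:
    print(f"Graphviz 'dot' found at: {dot_path}")
```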

answered Oct 02 '22 00:10

NickBraunagel


Tags:

python

scikit-learn

Jack Twain
