Python import statement semantics

Tags:

python

python-import

I'm having difficulty understanding the import statement and its variations.

Suppose I'm using the lxml module for scraping websites.

The following examples show...

from lxml.html import parse
parse( 'http://somesite' )

...Google's Python style guide prefers the basic import statement, to preserve namespaces.

I'd prefer to do that, but when I try this:

import lxml
lxml.html.parse( 'http://somesite' )

...then I get the following error message:

AttributeError: 'module' object has no attribute 'html'

Can anyone help me understand what is going on? I'd much prefer to use modules within their namespaces, but need some assistance understanding the semantics.

888

asked Oct 26 '12 20:10

Travis Leleu

2 Answers

import lxml.html as LH
doc = LH.parse('http://somesite')

lxml.html is a module. When you import lxml, the html module is not imported into the lxml namespace. This is a developer's decision. Some packages automatically import some modules, some don't. In this case, you have to do it yourself with import lxml.html.

import lxml.html as LH imports the html module and binds it to the name LH in the current module's namespace. So you can access the parse function with LH.parse.
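As a quick check of these binding rules, here is a minimal sketch that uses the standard library's xml.etree.ElementTree in place of lxml.html (an assumption made only so the snippet runs without lxml installed or a network connection). It suggests that the plain dotted import keeps the full namespace, which is what the question asks for, while the "as" form just adds a shorter name for the same module object.

```python
# Plain dotted import: binds the top-level name `xml` and loads the
# submodules, so the fully qualified name works afterwards.
import xml.etree.ElementTree

# Aliased import: binds the name `ET` directly to the submodule.
import xml.etree.ElementTree as ET

# Use the fully qualified name, namespace preserved.
root = xml.etree.ElementTree.fromstring("<doc><item>hello</item></doc>")

# Both names refer to the very same module object.
same_module = ET is xml.etree.ElementTree
item_text = root.find("item").text
print(same_module, item_text)
```

With lxml installed, the equivalent would be import lxml.html followed by lxml.html.parse(...), keeping the lxml namespace visible at the call site.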

If you want to delve deeper into when a package (like lxml) imports modules (like lxml.html) automatically, open a terminal and type

In [16]: import lxml

In [17]: lxml
Out[17]: <module 'lxml' from '/usr/lib/python2.7/dist-packages/lxml/__init__.pyc'>

Here you see the path to the lxml package's __init__.py file. If you look at its contents, you will find it is empty, so no submodules are imported. If you look in numpy's __init__.py, you see lots of code, among which is

import linalg
import fft
import polynomial
import random
import ctypeslib
import ma

These are all submodules which are imported into the numpy namespace. So from a user's perspective, import numpy automatically gives you access to numpy.linalg, numpy.fft, etc.
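Instead of reading __init__.py by hand, one could also ask Python which submodules a package has already bound as attributes. The sketch below is only an illustration: submodule_status is a hypothetical helper name, and the standard library's json package stands in for lxml or numpy so that the snippet runs anywhere.

```python
import importlib
import pkgutil


def submodule_status(package_name):
    """Map each submodule found on disk to whether the package already exposes it.

    Hypothetical helper for illustration; it is not part of any library.
    """
    package = importlib.import_module(package_name)
    status = {}
    # pkgutil.iter_modules lists the modules found in the package's directories
    # without importing them.
    for info in pkgutil.iter_modules(package.__path__):
        # True means something (usually __init__.py) has already imported it.
        status[info.name] = hasattr(package, info.name)
    return status


status = submodule_status("json")
for name, loaded in sorted(status.items()):
    print(name, "already imported" if loaded else "needs an explicit import")
```

Any submodule reported as needing an explicit import is in the same position as lxml.html: it exists on disk, but it only becomes an attribute of the package once something imports it.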

177

answered Oct 15 '22 15:10

unutbu

Let's take an example of a package pkg with two modules in it, a.py and b.py:

--pkg
   |
   | -- a.py
   |
   | -- b.py
   |
   | -- __init__.py

In __init__.py, you import a.py but not b.py:

from . import a   # explicit relative import; on Python 2 this could be written as a bare "import a"

So if you open your terminal and do:

>>> import pkg
>>> pkg.a
>>> pkg.b
AttributeError: 'module' object has no attribute 'b'

As you can see, because we imported a.py in pkg's __init__.py, we were able to access it as an attribute of pkg, but b is not there. To access the latter, we should use:

>>> import pkg.b   # OR: from pkg import b
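The whole experiment can be reproduced in one script. This is a minimal sketch under a few assumptions: the package is written to a temporary directory, it is named demo_pkg rather than pkg (a hypothetical name chosen only to avoid clashing with anything already installed), and its __init__.py uses the explicit relative form "from . import a".

```python
import importlib
import sys
import tempfile
from pathlib import Path

with tempfile.TemporaryDirectory() as tmp:
    # Build the layout from the example: a.py, b.py and __init__.py.
    pkg_dir = Path(tmp) / "demo_pkg"
    pkg_dir.mkdir()
    (pkg_dir / "a.py").write_text("NAME = 'a'\n")
    (pkg_dir / "b.py").write_text("NAME = 'b'\n")
    # __init__.py imports a but not b.
    (pkg_dir / "__init__.py").write_text("from . import a\n")

    sys.path.insert(0, tmp)
    importlib.invalidate_caches()
    try:
        import demo_pkg

        has_a_before = hasattr(demo_pkg, "a")  # imported by __init__.py
        has_b_before = hasattr(demo_pkg, "b")  # nobody has imported b yet

        import demo_pkg.b  # the explicit import binds b on the package

        has_b_after = hasattr(demo_pkg, "b")
    finally:
        sys.path.remove(tmp)

print(has_a_before, has_b_before, has_b_after)
```

The attribute b appears on the package only after the explicit import, which mirrors the lxml.html situation in the question.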

HTH,

answered Oct 15 '22 14:10

mouad
