I just stumbled across this unexpected behavior in python (both 2.7 and 3.x): <pre class="prettyprint"><code>>>> import re as regexp >>> regexp <module 're' from '.../re.py'> >>> from regexp import search Traceback (most recent call last): File "<stdin>", line 1, in <module> ImportError: No module named 'regexp' </code></pre> Of course <code>from re import search</code> succeeds, just as it would have before I created the alias. But why can't I use the alias <code>regexp</code>, which is now a known module, as a source for importing names? This sets you up for a nasty surprise whenever there are multiple variants of a module: Say I am still using Python 2 and I want to use the C version of <code>pickle</code>, <code>cPickle</code>. If I then try to import a name from <code>pickle</code>, it will be fetched from the simple <code>pickle</code> module (and I won't notice since it doesn't throw an error!) <pre class="prettyprint"><code>>>> import cPickle as pickle >>> from pickle import dump >>> import inspect >>> inspect.getsourcefile(dump) '.../python2.7/pickle.py' # Expected cPickle.dump </code></pre> Oops! Poking around I see that <code>sys.modules</code> includes the real module name (<code>re</code> or <code>cPickle</code>, but not the alias <code>regexp</code> or <code>pickle</code>. That explains how the second import fails, but not why python module name resolution works this way, i.e. what the rules and rationale are for doing it this way. Note: This was marked as a duplicate of a question that has nothing to do with module aliasing: aliasing is not even mentioned in the question (which is about importing submodules from a package) or the top answers. While the answers to that question provide information relevant to this question, the questions themselves are not even similar IMHO.

<h3>In short:</h3> You can think of the loading process in that way: You can load a module into your program, in the form of a variable. You can name the variable for using the module whatever you want. But, the loading process, is based on the name of the module's file, not "module variables". <hr> <h3>Long version:</h3> <code>import re</code> creates a global variable named <code>re</code> that serves as the "module portal", in the way it provides the ability to use the module operations. Most alike, <code>import re as regex</code> creates such a "portal" under the variable named <code>regex</code>. But, when looking to create such portal and load the module functionality into it, the importer does not use such references. Instead, it looks for the module in your python <code>\Lib</code> directory, or your current working directory, as a file named <code>re.py</code> (or whatever is the name of the module you import). The <code>import</code> instructions does not address variables, but files, like <code>#include<stdio.h></code> in C. They have their "own syntax", and set of instructions, as ruled by the interpreter structure, which is, to that case, the interpretation of <code>re</code> as a file name rather than a variable and <code>as</code> for ruling the name of the module "portal". That is why <code>regex</code> is an operation alias for the portal for <code>re</code>, but not an importation alias for the module (for that purpose you'll have to use the name of the file). <ul> <li>I have used terms like "module portal" and "operation alias" since I have not found any standard terms for these. Most of the modules and importer mechanics is related to the interpreter implementation. In CPython (where the usage of the C API is common among developers), for example, <code>create_module</code> creates modules for the importer (in the form of <code>PyObject</code>s) using the provided specifications for the module, and the <code>PyModule_NewObject</code> and <code>PyModule_New</code> functions for the module instance creation that bears the module attributes. These can be viewed in the C API modules decumentation.</li> <li>When I mentioned the term "portal" as a way to reference the variable created by the <code>import</code> statement, I meant to refer to it as a static portal, not a dynamic one. A change in the module file will not reflect in a running program that already imported it (as long as it didn't reload it), as it will load a copy of the module and use it, rather than asking the module file for the operations when encountering need.</li> </ul> <hr> Here is pretty much how the variable loading goes realtime: <pre class="prettyprint"><code>>>> import re >>> re <module 're' from 'C:\\Programs\\Python35\\lib\\re.py'> >>> import re as regex >>> regex <module 're' from 'C:\\Programs\\Python35\\lib\\re.py'> </code></pre> You can see that <code>re</code> is the module referenced, and it was loaded from the file <code>C:\Programs\Python35\lib\re.py</code> (may change depending on where your python is installed).

You cannot treat the module name in import statements as variables. If that was the case, surely your initial import would fail because <code>re</code> is not yet a declared variable. Basically the import statement is semantic sugar; it is a statement of its own with its own rules. One such rule is this: The written module name is understood as if it was a string. That is, it does not lookup a variable with the name <code>re</code>, instead it uses the string value <code>'re'</code> directly as the sought after module name. It then searches for a module/package (file) with this name and does the import. This is the only situation (Edit: Well, see the discussion in the comments...) in the language where this behavior is seen, which is the cause of the confusion. Consider this alternative syntax, which is much more in line with the rest of the Python language: <pre class="prettyprint"><code>import 're' # Or alternatively module_name = 're' import module_name </code></pre> Here, variable expansion is assumed in the import statement. As we know this is not the syntax which was actually chosen for the import statement. One can discuss which syntax is the better one, but the above is definitely more harmonious with the rest of the language syntax.

When from import is used python tries to look in the from file to import what you have requested. This might make it clearer. <pre class="prettyprint"><code>import re as regexp from regexp import search </code></pre> This essentially asks python to look in a file called 'regexp' which it can't find. This is why the alias won't work.

Why can't I import from a module alias?

Tags:

python

I just stumbled across this unexpected behavior in python (both 2.7 and 3.x):

>>> import re as regexp
>>> regexp
<module 're' from '.../re.py'>
>>> from regexp import search
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ImportError: No module named 'regexp'

Of course from re import search succeeds, just as it would have before I created the alias. But why can't I use the alias regexp, which is now a known module, as a source for importing names?

This sets you up for a nasty surprise whenever there are multiple variants of a module: Say I am still using Python 2 and I want to use the C version of pickle, cPickle. If I then try to import a name from pickle, it will be fetched from the simple pickle module (and I won't notice since it doesn't throw an error!)

>>> import cPickle as pickle
>>> from pickle import dump
>>> import inspect
>>> inspect.getsourcefile(dump)
'.../python2.7/pickle.py'    # Expected cPickle.dump

Oops!

Poking around I see that sys.modules includes the real module name (re or cPickle, but not the alias regexp or pickle. That explains how the second import fails, but not why python module name resolution works this way, i.e. what the rules and rationale are for doing it this way.

Note: This was marked as a duplicate of a question that has nothing to do with module aliasing: aliasing is not even mentioned in the question (which is about importing submodules from a package) or the top answers. While the answers to that question provide information relevant to this question, the questions themselves are not even similar IMHO.

380

asked Nov 26 '16 21:11

alexis

4 Answers

In short:

You can think of the loading process in that way:

You can load a module into your program, in the form of a variable. You can name the variable for using the module whatever you want. But, the loading process, is based on the name of the module's file, not "module variables".

Long version:

import re creates a global variable named re that serves as the "module portal", in the way it provides the ability to use the module operations.

Most alike, import re as regex creates such a "portal" under the variable named regex.

But, when looking to create such portal and load the module functionality into it, the importer does not use such references. Instead, it looks for the module in your python \Lib directory, or your current working directory, as a file named re.py (or whatever is the name of the module you import).

The import instructions does not address variables, but files, like #include<stdio.h> in C. They have their "own syntax", and set of instructions, as ruled by the interpreter structure, which is, to that case, the interpretation of re as a file name rather than a variable and as for ruling the name of the module "portal".

That is why regex is an operation alias for the portal for re, but not an importation alias for the module (for that purpose you'll have to use the name of the file).

I have used terms like "module portal" and "operation alias" since I have not found any standard terms for these. Most of the modules and importer mechanics is related to the interpreter implementation. In CPython (where the usage of the C API is common among developers), for example, create_module creates modules for the importer (in the form of PyObjects) using the provided specifications for the module, and the PyModule_NewObject and PyModule_New functions for the module instance creation that bears the module attributes. These can be viewed in the C API modules decumentation.
When I mentioned the term "portal" as a way to reference the variable created by the import statement, I meant to refer to it as a static portal, not a dynamic one. A change in the module file will not reflect in a running program that already imported it (as long as it didn't reload it), as it will load a copy of the module and use it, rather than asking the module file for the operations when encountering need.

Here is pretty much how the variable loading goes realtime:

>>> import re
>>> re
<module 're' from 'C:\\Programs\\Python35\\lib\\re.py'>
>>> import re as regex
>>> regex
<module 're' from 'C:\\Programs\\Python35\\lib\\re.py'>

You can see that re is the module referenced, and it was loaded from the file C:\Programs\Python35\lib\re.py (may change depending on where your python is installed).

answered Oct 20 '22 18:10

Uriel

You cannot treat the module name in import statements as variables. If that was the case, surely your initial import would fail because re is not yet a declared variable. Basically the import statement is semantic sugar; it is a statement of its own with its own rules.

One such rule is this: The written module name is understood as if it was a string. That is, it does not lookup a variable with the name re, instead it uses the string value 're' directly as the sought after module name. It then searches for a module/package (file) with this name and does the import.

This is the only situation (Edit: Well, see the discussion in the comments...) in the language where this behavior is seen, which is the cause of the confusion. Consider this alternative syntax, which is much more in line with the rest of the Python language:

import 're'
# Or alternatively
module_name = 're'
import module_name

Here, variable expansion is assumed in the import statement. As we know this is not the syntax which was actually chosen for the import statement. One can discuss which syntax is the better one, but the above is definitely more harmonious with the rest of the language syntax.

answered Oct 20 '22 16:10

jmd_dk

To get a definite answer on this you'll have to ask the designers themselves but, I think you're asking the wrong question.

The question shouldn't be: Why is it done this way?" but, it should be, what would be the benefit of doing it the way you're asking? Surely it can be done but why should it?

As is the import statement is dead simple and very intuitive, you give it a file name, it tries to finds load it up. You even get fancy as and from but, the concept is simply, you write filenames and you let it be.

What would obfuscating it and making it harder understand achieve, the only achievement is making things arguably more complex.

Python has a history of looking for the rationale behind changes to its design, people asking why aren't function objects subclassable will get a "Why should they?" reply; this behavior doesn't really have a use-case. As is, the import is simple, intuitive and reminiscent of including/using files in other languages.

answered Oct 20 '22 18:10

Dimitris Fasarakis Hilliard

When from import is used python tries to look in the from file to import what you have requested. This might make it clearer.

import re as regexp

from regexp import search

This essentially asks python to look in a file called 'regexp' which it can't find. This is why the alias won't work.

answered Oct 20 '22 17:10

Byron Filer

Related questions
                            
                                How to manually pass source of bzip2 install for Python install?
                            
                                More compact ElementTree or lxml Namespaces
                            
                                Cross-correlation of non-periodic function with NumPy
                            
                                How to construct regex for this text [closed]
                            
                                Inheritance troubles. - Python
                            
                                Does Matplotlib support Qt5?
                            
                                Convert unicode cyrillic symbols to string in python [duplicate]
                            
                                Python pandas print value where column = X and row = Y
                            
                                Cumulative sum but conditionally excluding earlier rows
                            
                                Pandas group cumsum with condition
                            
                                Shuffle a list and return a copy
                            
                                Install tkinter in python3.6 on Ubuntu
                            
                                Best way to add attributes to a Python function
                            
                                Create block diagonal numpy array from a given numpy array
                            
                                TypeError: only integer scalar arrays can be converted to a scalar index
                            
                                Learning Python for a .NET developer [closed]
                            
                                integrating 2D samples on a rectangular grid using SciPy
                            
                                BeautifulSoup: what's the difference between 'lxml' and 'html.parser' and 'html5lib' parsers?
                            
                                subprocess "TypeError: a bytes-like object is required, not 'str'"
                            
                                reshaping a tensor with padding in pytorch

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With