Difference between exec behavior when module is imported or not

Tags: python, python-import, exec

I am running the following two programs. Importantly, imagine that there is a mymodule.py file in the directory where both of these programs are located.

The first:

exec('''import sys
import os
os.chdir('/') 
sys.path = []
import mymodule''', {})

The second:

import mymodule
exec('''import sys
import os
os.chdir('/') 
sys.path = []
import mymodule''', {})

The first snippet raises ImportError as expected (after all, the directory where mymodule is located is not in path). The second snippet, however, does not, even though mymodule is also not in its path and the environment I am giving it is empty.

My question is: why?

asked Feb 27 '18 19:02

Dmitry Torba

2 Answers

According to The import system - The module cache,

The first place checked during import search is sys.modules. This mapping serves as a cache of all modules that have been previously imported, including the intermediate paths. So if foo.bar.baz was previously imported, sys.modules will contain entries for foo, foo.bar, and foo.bar.baz. Each key will have as its value the corresponding module object.

During import, the module name is looked up in sys.modules and if present, the associated value is the module satisfying the import, and the process completes. However, if the value is None, then a ModuleNotFoundError is raised. If the module name is missing, Python will continue searching for the module.

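The second snippet successfully imports mymodule on its first line. By the time the exec() code runs, the module is already cached in sys.modules, so the import inside exec() is satisfied from the cache and sys.path is never searched.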
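A minimal sketch of this behaviour, assuming a throwaway mymodule.py written to a temporary directory (the file contents and the VALUE name are made up for illustration). It shows that the import inside exec() succeeds with an empty sys.path while the cache entry exists, and fails once the entry is removed:

```python
import importlib
import os
import sys
import tempfile

saved_path = list(sys.path)

with tempfile.TemporaryDirectory() as tmpdir:
    # Hypothetical stand-in for the mymodule.py from the question.
    with open(os.path.join(tmpdir, "mymodule.py"), "w") as f:
        f.write("VALUE = 42\n")

    sys.path.insert(0, tmpdir)
    importlib.invalidate_caches()
    import mymodule  # first import: found via sys.path, then cached

    try:
        sys.path = []  # nothing left to search

        namespace = {}
        exec("import mymodule", namespace)  # served from sys.modules
        same_object = namespace["mymodule"] is sys.modules["mymodule"]

        del sys.modules["mymodule"]  # drop the cache entry
        try:
            exec("import mymodule", {})
            failed_without_cache = False
        except ImportError:
            failed_without_cache = True
    finally:
        sys.path = saved_path  # restore the search path

print(same_object, failed_without_cache)
```

The empty dict passed to exec() only controls which names the executed code can see. sys.modules is interpreter-wide state, so it is shared regardless of the namespace.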

answered Oct 10 '22 08:10

falsetru

This has nothing to do with exec(), and is a simple misunderstanding about what is available on your sys.path when running a script, and when Python looks for files to load.

You state:

I am running the following two programs. Importantly, imagine that there is a mymodule.py file in the directory where both of these programs are located.

[...]

The second snippet, however, does not, even though mymodule is also not in its path

The module is on its path. The directory your script is located in is added at the start of the module search path when the script starts. See Command line:

<script>

Execute the Python code contained in script, which must be a filesystem path (absolute or relative) referring to either a Python file, a directory containing a __main__.py file, or a zipfile containing a __main__.py file.

[...]

If the script name refers directly to a Python file, the directory containing that file is added to the start of sys.path, and the file is executed as the __main__ module.

Bold emphasis mine.

So, mymodule.py, which you state is located in the same directory as the scripts you are running, is on the path.
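One way to check this yourself is sketched below. It assumes a throwaway script (show_path.py, a made-up name) that prints sys.path[0], run in a child interpreter. PYTHONSAFEPATH is removed from the child's environment because, on Python 3.11 and later, that setting suppresses this sys.path entry:

```python
import os
import subprocess
import sys
import tempfile

with tempfile.TemporaryDirectory() as script_dir:
    # Hypothetical script that reports the first search-path entry.
    script = os.path.join(script_dir, "show_path.py")
    with open(script, "w") as f:
        f.write("import sys\nprint(sys.path[0])\n")

    # Copy the environment without PYTHONSAFEPATH (if it is set at all).
    env = {k: v for k, v in os.environ.items() if k != "PYTHONSAFEPATH"}

    result = subprocess.run(
        [sys.executable, script],
        capture_output=True,
        text=True,
        check=True,
        env=env,
    )
    first_entry = result.stdout.strip()

    # Compare resolved paths so symlinked temp directories still match.
    matches = os.path.realpath(first_entry) == os.path.realpath(script_dir)

print(matches)
```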

Once loaded, modules stay loaded. import <module> will only look at the module search path if there is not already a module in sys.modules by that name. It doesn't matter if you use exec or not to do the import.

From the import statement documentation:

The basic import statement (no from clause) is executed in two steps:

1. find a module, loading and initializing it if necessary

2. define a name or names in the local namespace for the scope where the import statement occurs.

The "if necessary" part is the important bit: a module that is already loaded is not searched for or initialized again.
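A small sketch of the "if necessary" behaviour, assuming a made-up module named counting_module whose body creates a fresh object each time it runs. A second import returns the same module without re-running the body. Only an explicit importlib.reload() executes it again:

```python
import importlib
import os
import sys
import tempfile

with tempfile.TemporaryDirectory() as tmpdir:
    # Hypothetical module: TOKEN is a new object on every execution of the body.
    with open(os.path.join(tmpdir, "counting_module.py"), "w") as f:
        f.write("TOKEN = object()\n")

    sys.path.insert(0, tmpdir)
    importlib.invalidate_caches()
    try:
        first = importlib.import_module("counting_module")
        token = first.TOKEN

        # Second import: the cached module object comes back, body not re-run.
        second = importlib.import_module("counting_module")
        reused = second is first and second.TOKEN is token

        # reload() is the explicit way to execute the module body again.
        importlib.reload(first)
        reran = first.TOKEN is not token
    finally:
        sys.path.remove(tmpdir)

print(reused, reran)
```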

Further, from The import system:

The import statement combines two operations; it searches for the named module, then it binds the results of that search to a name in the local scope.

[...]

When a module is first imported, Python searches for the module and if found, it creates a module object, initializing it.

and from The module cache:

The first place checked during import search is sys.modules. This mapping serves as a cache of all modules that have been previously imported, including the intermediate paths. So if foo.bar.baz was previously imported, sys.modules will contain entries for foo, foo.bar, and foo.bar.baz. Each key will have as its value the corresponding module object.

During import, the module name is looked up in sys.modules and if present, the associated value is the module satisfying the import, and the process completes.

So by the time your exec() code runs, the first import mymodule has already succeeded and sys.modules['mymodule'] exists. The second import mymodule finds that object, and the search ends.
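The same lookup has a negative case, which the module cache documentation quoted in the other answer mentions: a None entry in sys.modules makes the import fail without any search. A minimal sketch, using a made-up module name:

```python
import importlib
import sys

name = "hypothetical_blocked_module"  # made-up name, used only for this demo

sys.modules[name] = None  # a None entry marks the name as unavailable
try:
    try:
        importlib.import_module(name)
        blocked = False
    except ModuleNotFoundError:
        blocked = True
finally:
    del sys.modules[name]  # leave the cache as we found it

print(blocked)
```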

answered Oct 10 '22 09:10

Martijn Pieters
