Importing modules: __main__ vs import as module

Tags: python, python-module, module, python-import

To preface, I think I may have figured out how to get this code working (based on "Changing module variables after import"), but my question is really about why the following behavior occurs, so I can understand what not to do in the future.

I have three files. The first is mod1.py:

# mod1.py

import mod2

var1 = None

def func1A():
    global var1
    var1 = 'A'
    mod2.func2()

def func1B():
    global var1
    print(var1)

if __name__ == '__main__':
    func1A()

Next I have mod2.py:

# mod2.py

import mod1

def func2():
    mod1.func1B()

Finally I have driver.py:

# driver.py

import mod1

if __name__ == '__main__':
    mod1.func1A()

If I execute the command python mod1.py then the output is None. Based on the link I referenced above, it seems that there is some distinction between mod1.py being imported as __main__ and mod1.py being imported from mod2.py. Therefore, I created driver.py. If I execute the command python driver.py then I get the expected output: A. I sort of see the difference, but I don't really see the mechanism or the reason for it. How and why does this happen? It seems counterintuitive that the same module would exist twice. If I execute python mod1.py, would it be possible to access the variables in the __main__ version of mod1.py instead of the variables in the version imported by mod2.py?


asked Nov 01 '12 16:11

Brendan

1 Answer

The __name__ variable always contains the name of the module, except when the file has been loaded into the interpreter as a script. In that case the variable is set to the string '__main__' instead.

After all, the script is then run as the main file of the whole program, and everything else is a module imported directly or indirectly by that main file. By testing the __name__ variable, you can thus detect whether a file has been imported as a module or was run directly.
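As a small illustration of that test, here is a minimal sketch. The helper describe() is a hypothetical name invented for this example, and json merely stands in for "some imported module":

```python
import json  # any imported module works; json is just a convenient stdlib choice


def describe(name):
    """Say how a file whose __name__ equals `name` was loaded (hypothetical helper)."""
    if name == "__main__":
        return "run as a script"
    return "imported as module %r" % name


# An imported module reports its own name...
print(describe(json.__name__))
# ...while the file started with `python somefile.py` sees '__main__'.
print(describe("__main__"))
```

The same comparison is what the `if __name__ == '__main__':` guard in mod1.py and driver.py relies on.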

Internally, each module is given a namespace dictionary, which is stored as part of the module's metadata in sys.modules. The main file, the executed script, is stored in that same structure under the key '__main__'.

But when you import a file as a module, Python first looks in sys.modules to see if that module has already been imported. So, import mod1 means that we first look in sys.modules for a 'mod1' entry. Only if mod1 isn't there yet does Python create a new module structure with its own namespace and run the file to fill it.
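The lookup in sys.modules can be observed directly. The sketch below registers a hand-made module object under a made-up name (demo_cached_mod is an assumption for this example, not a real package) and then imports it; because the name is already in sys.modules, the import statement simply hands back that object:

```python
import sys
import types

# Create a module object by hand and register it under a hypothetical name.
fake = types.ModuleType("demo_cached_mod")
fake.value = 1
sys.modules["demo_cached_mod"] = fake

# No demo_cached_mod.py exists anywhere; the import succeeds anyway because
# the name is found in sys.modules before any file search takes place.
import demo_cached_mod

same_object = demo_cached_mod is fake
print(same_object)
```

This is also why a second `import mod1` elsewhere in a program does not re-run mod1.py: the entry is already there.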

So, if you run mod1.py as the main file and it is later imported as a Python module, the same file gets two entries in sys.modules: first as '__main__', then later as 'mod1'. These two namespaces are completely separate. func1A runs in the '__main__' copy, so your global var1 = 'A' is stored in sys.modules['__main__'], but mod2 calls func1B from the 'mod1' copy, which looks in sys.modules['mod1'] for var1, where it is still None.
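To see that one file can back two independent namespaces, the following sketch loads a single source file twice under different names with importlib. The file contents, the helper load_as(), and the name 'pretend_main' are all assumptions made up for the illustration; it imitates the effect rather than reproducing the exact `python mod1.py` startup:

```python
import importlib.util
import os
import tempfile

# A cut-down stand-in for mod1.py: one global and a function that rebinds it.
SOURCE = "var1 = None\n\ndef set_var1():\n    global var1\n    var1 = 'A'\n"


def load_as(name, path):
    """Execute the file at `path` into a fresh module object called `name`."""
    spec = importlib.util.spec_from_file_location(name, path)
    module = importlib.util.module_from_spec(spec)
    spec.loader.exec_module(module)
    return module


with tempfile.TemporaryDirectory() as folder:
    path = os.path.join(folder, "mod1.py")
    with open(path, "w") as handle:
        handle.write(SOURCE)
    as_script = load_as("pretend_main", path)  # plays the role of '__main__'
    as_module = load_as("mod1", path)          # plays the role of 'mod1'

# Rebinding the global through one copy leaves the other copy untouched.
as_script.set_var1()
print(as_script.var1)   # 'A'
print(as_module.var1)   # None
```

Each module object has its own globals dictionary, so set_var1() only ever writes to the namespace of the copy it was defined in.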

But when you use python driver.py, driver.py becomes the '__main__' main file of the program, and mod1 will be imported just once into the sys.modules['mod1'] structure. This time round, func1A stores var1 in the sys.modules['mod1'] structure, and that's what func1B will find.
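Both runs can be reproduced side by side. The sketch below writes the three files from the question into a temporary folder (using var1 = None and print(var1) so the code runs on a current interpreter, which is an adaptation) and executes each entry point in a child process; run_script() is a hypothetical helper:

```python
import os
import subprocess
import sys
import tempfile

FILES = {
    "mod1.py": (
        "import mod2\n\nvar1 = None\n\n"
        "def func1A():\n    global var1\n    var1 = 'A'\n    mod2.func2()\n\n"
        "def func1B():\n    print(var1)\n\n"
        "if __name__ == '__main__':\n    func1A()\n"
    ),
    "mod2.py": "import mod1\n\ndef func2():\n    mod1.func1B()\n",
    "driver.py": "import mod1\n\nif __name__ == '__main__':\n    mod1.func1A()\n",
}


def run_script(folder, script):
    """Run `script` from `folder` in a child interpreter and return its stdout."""
    result = subprocess.run(
        [sys.executable, os.path.join(folder, script)],
        cwd=folder, capture_output=True, text=True, check=True,
    )
    return result.stdout.strip()


with tempfile.TemporaryDirectory() as folder:
    for name, source in FILES.items():
        with open(os.path.join(folder, name), "w") as handle:
            handle.write(source)
    direct_output = run_script(folder, "mod1.py")    # mod1.py is '__main__'
    driver_output = run_script(folder, "driver.py")  # mod1 is imported only once

print(direct_output)   # None
print(driver_output)   # A
```

The first run loads mod1.py twice (as '__main__' and as 'mod1'), the second only once, which matches the outputs reported in the question.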

answered Oct 21 '22 06:10

Martijn Pieters
