
How do you organise a python project that contains multiple packages so that each file in a package can still be run individually?

Tags: python, import, pycharm, python-2.7, packages

TL;DR

Here's an example repository that is set up as described in the first diagram (below): https://github.com/Poddster/package_problems

If you could please make it look like the second diagram in terms of project organisation and can still run the following commands, then you've answered the question:

    $ git clone https://github.com/Poddster/package_problems.git
    $ cd package_problems
    <do your magic here>

    $ nosetests

    $ ./my_tool/my_tool.py
    $ ./my_tool/t.py
    $ ./my_tool/d.py

    (or for the above commands, $ cd ./my_tool/ && ./my_tool.py is also acceptable)

Alternatively: Give me a different project structure that allows me to group together related files ('package'), run all of the files individually, import the files into other files in the same package, and import the packages/files into other package's files.

Current situation

I have a bunch of Python files. Most of them are useful when called from the command line, i.e. they use argparse and if __name__ == "__main__" to do useful things.

Currently I have this directory structure, and everything is working fine:

    .
    ├── config.txt
    ├── docs/
    │   ├── ...
    ├── my_tool.py
    ├── a.py
    ├── b.py
    ├── c.py
    ├── d.py
    ├── e.py
    ├── README.md
    ├── tests
    │   ├── __init__.py
    │   ├── a.py
    │   ├── b.py
    │   ├── c.py
    │   ├── d.py
    │   └── e.py
    └── resources
        ├── ...

Some of the scripts import things from other scripts to do their work. But no script is merely a library; they are all invokable, e.g. I could invoke ./my_tool.py, ./a.py, ./b.py, ./c.py etc. and they would do useful things for the user.

"my_tool.py" is the main script that leverages all of the other scripts.

What I want to happen

However I want to change the way the project is organised. The project itself represents an entire program useable by the user, and will be distributed as such, but I know that parts of it will be useful in different projects later so I want to try and encapsulate the current files into a package. In the immediate future I will also add other packages to this same project.

To facilitate this I've decided to re-organise the project to something like the following:

    .
    ├── config.txt
    ├── docs/
    │   ├── ...
    ├── my_tool
    │   ├── __init__.py
    │   ├── my_tool.py
    │   ├── a.py
    │   ├── b.py
    │   ├── c.py
    │   ├── d.py
    │   ├── e.py
    │   └── tests
    │       ├── __init__.py
    │       ├── a.py
    │       ├── b.py
    │       ├── c.py
    │       ├── d.py
    │       └── e.py
    ├── package2
    │   ├── __init__.py
    │   ├── my_second_package.py
    │   ├── ...
    ├── README.md
    └── resources
        ├── ...

However, I can't figure out a project organisation that satisfies the following criteria:

1. All of the scripts are invokable on the command line (either as my_tool/a.py or cd my_tool && a.py)
2. The tests actually run :)
3. Files in package2 can do import my_tool

The main problem is with the import statements used by the packages and the tests.

Currently, all of the packages, including the tests, simply do import <module> and it's resolved correctly. But when jiggering things around it doesn't work.

Note that supporting py2.7 is a requirement so all of the files have from __future__ import absolute_import, ... at the top.

What I've tried, and the disastrous results

1

If I move the files around as shown above, but leave all of the import statements as they currently are:

1. $ ./my_tool/*.py works and they all run properly
2. $ nosetests run from the top directory doesn't work. The tests fail to import the package's scripts.
3. pycharm highlights import statements in red when editing those files :(

2

If I then change the test scripts to do:

from my_tool import x

1. $ ./my_tool/*.py still works and they all run properly
2. $ nosetests run from the top directory doesn't work. The tests can import the correct scripts, but the imports in the scripts themselves fail when the test scripts import them.
3. pycharm still highlights import statements in red in the main scripts :(

3

If I keep the same structure and change everything to be from my_tool import then:

1. $ ./my_tool/*.py results in ImportErrors
2. $ nosetests runs everything ok.
3. pycharm doesn't complain about anything

An example of item 1:

    Traceback (most recent call last):
      File "./my_tool/a.py", line 34, in <module>
        from my_tool import b
    ImportError: cannot import name b

4

I also tried from . import x but that just ends up with ValueError: Attempted relative import in non-package for the direct running of scripts.

Looking at some other SO answers:

I can't just use python -m pkg.tests.core_test as

a) I don't have __main__.py. I guess I could have one?
b) I want to be able to run all of the scripts, not just main?

I've tried:

    if __name__ == '__main__' and __package__ is None:
        from os import sys, path
        sys.path.append(path.dirname(path.dirname(path.abspath(__file__))))

but it didn't help.

I also tried:

    __package__ = "my_tool"
    from . import b

But received:

SystemError: Parent module 'loading_tool' not loaded, cannot perform relative import

Adding import my_tool before from . import b just leads back to ImportError: cannot import name b

Fix?

What's the correct set of magical incantations and directory layout to make all of this work?


asked Sep 16 '16 10:09

Pod

2 Answers

Once you move to your desired configuration, the absolute imports you are using to load the modules that are specific to my_tool no longer work.

You need three modifications after you create the my_tool subdirectory and move the files into it:

1. Create my_tool/__init__.py. (You seem to already do this but I wanted to mention it for completeness.)
2. In the files directly under my_tool: change the import statements to load the modules from the current package. So in my_tool.py change:
```
import c
import d
import k
import s
```
to:
```
from . import c
from . import d
from . import k
from . import s
```
You need to make a similar change to all your other files. (You mention having tried setting __package__ and then doing a relative import but setting __package__ is not needed.)
3. In the files located in my_tool/tests: change the import statements that import the code you want to test to relative imports that load from one package up in the hierarchy. So in test_my_tool.py change:
```
import my_tool 
```
to:
```
from .. import my_tool 
```
Similarly for all the other test files.

With the modifications above, I can run modules directly:

    $ python -m my_tool.my_tool
    C!
    D!
    F!
    V!
    K!
    T!
    S!
    my_tool!
    my_tool main!
    |main tool!||detected||tar edit!||installed||keys||LOL||ssl connect||parse ASN.1||config|

    $ python -m my_tool.k
    F!
    V!
    K!
    K main!
    |keys||LOL||ssl connect||parse ASN.1|

and I can run tests:

    $ nosetests
    ........
    ----------------------------------------------------------------------
    Ran 8 tests in 0.006s

    OK

Note that I can run the above both with Python 2.7 and Python 3.
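To make the difference between the two ways of launching a module concrete, here is a minimal, self-contained sketch. It is not taken from the example repository: the package name demo_pkg and the modules helper and tool are hypothetical stand-ins for my_tool and its files. It writes a tiny package with a relative import into a temporary directory, then runs the same file once with -m from the parent directory and once directly as a script.

```python
import os
import subprocess
import sys
import tempfile
import textwrap


def build_demo_package(root):
    """Create root/demo_pkg with two modules (all names are hypothetical)."""
    pkg = os.path.join(root, "demo_pkg")
    os.mkdir(pkg)
    # An empty __init__.py marks the directory as a package.
    open(os.path.join(pkg, "__init__.py"), "w").close()
    with open(os.path.join(pkg, "helper.py"), "w") as f:
        f.write("def greet():\n    return 'hello from helper'\n")
    with open(os.path.join(pkg, "tool.py"), "w") as f:
        f.write(textwrap.dedent("""\
            from . import helper

            if __name__ == "__main__":
                print(helper.greet())
            """))
    return pkg


def run(args, cwd):
    """Run the current interpreter with args; return (exit code, stdout)."""
    proc = subprocess.Popen([sys.executable] + args, cwd=cwd,
                            stdout=subprocess.PIPE, stderr=subprocess.PIPE)
    out, _ = proc.communicate()
    return proc.returncode, out.decode().strip()


root = tempfile.mkdtemp()
build_demo_package(root)

# Launched with -m from the directory that contains the package:
# the module knows its parent package, so the relative import resolves.
as_module = run(["-m", "demo_pkg.tool"], cwd=root)

# Launched as a plain script: there is no parent package, so the
# relative import is expected to fail and the exit code is non-zero.
as_script = run([os.path.join("demo_pkg", "tool.py")], cwd=root)

print(as_module)
print(as_script[0] != 0)
```

Under these assumptions the -m invocation should succeed while the direct invocation fails on the relative import, which is the same behaviour the question describes in attempt 4.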

Rather than make the various modules under my_tool be directly executable, I suggest using a proper setup.py file to declare entry points and let setup.py create these entry points when the package is installed. Since you intend to distribute this code, you should use a setup.py to formally package it anyway.

1. Modify the modules that can be invoked from the command line so that, taking my_tool/my_tool.py as example, instead of this:

    if __name__ == "__main__":
        print("my_tool main!")
        print(do_something())

You have:

    def main():
        print("my_tool main!")
        print(do_something())

    if __name__ == "__main__":
        main()

2. Create a setup.py file that contains the proper entry_points. For instance:
```
from setuptools import setup, find_packages

setup(
    name="my_tool",
    version="0.1.0",
    packages=find_packages(),
    entry_points={
        'console_scripts': [
            'my_tool = my_tool.my_tool:main'
        ],
    },
    author="",
    author_email="",
    description="Does stuff.",
    license="MIT",
    keywords=[],
    url="",
    classifiers=[
    ],
)
```
The file above instructs setuptools to create a script named my_tool that invokes the main function in the module my_tool.my_tool. On my system, once the package is installed, the script is located at /usr/local/bin/my_tool. It produces the same output as running python -m my_tool.my_tool, which I've shown above.
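Since the question asks for every module to stay runnable, the same idea can be extended to one entry point per module. The following is only a sketch under assumptions: the module names come from the question's directory diagram, each module is assumed to expose a main() function as in step 1, and the console_scripts helper and the "my_tool-a" style command names are my own invention, not something setuptools prescribes.

```python
# Hypothetical module names, taken from the question's directory layout.
SCRIPT_MODULES = ["my_tool", "a", "b", "c", "d", "e"]


def console_scripts(package, modules):
    """Build 'command = package.module:main' entries, one per module.

    The module named like the package keeps the bare command name;
    the others get a 'package-module' prefix so that short names such
    as 'a' or 'b' do not end up on the PATH.
    """
    entries = []
    for module in modules:
        if module == package:
            command = package
        else:
            command = "{0}-{1}".format(package, module)
        entries.append("{0} = {1}.{2}:main".format(command, package, module))
    return entries


# This dict is what would be passed as setup(entry_points=...).
entry_points = {"console_scripts": console_scripts("my_tool", SCRIPT_MODULES)}

for line in entry_points["console_scripts"]:
    print(line)
```

The resulting dictionary can replace the hand-written entry_points argument in the setup.py above. Whether prefixed command names are desirable is a design choice; the point is only that each main() can get its own installed command.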

answered Oct 09 '22 02:10

Louis

Point 1

I believe it's working, so I won't comment on it.

Point 2

I have always kept tests at the same level as my_tool, not below it, but they should work if you do this at the top of each test file (before importing my_tool or any other py file in the same directory):

    import os
    import sys

    sys.path.insert(0, os.path.abspath(__file__).rsplit(os.sep, 2)[0])

Point 3

In my_second_package.py do this at the top (before importing my_tool)

    import os
    import sys

    sys.path.insert(0,
                    os.path.abspath(__file__).rsplit(os.sep, 2)[0] + os.sep
                    + 'my_tool')
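It may help to see which directory each of the two snippets above actually puts on sys.path. The sketch below evaluates the same expressions on example paths instead of __file__; the /proj prefix and the helper name two_levels_up are made up for illustration, while the file names follow the question's layout.

```python
import os


def two_levels_up(path):
    """Drop the last two components of the absolute path, as rsplit(os.sep, 2)[0] does."""
    return os.path.abspath(path).rsplit(os.sep, 2)[0]


# Hypothetical locations following the question's target layout.
test_file = os.path.join(os.sep, "proj", "my_tool", "tests", "a.py")
pkg2_file = os.path.join(os.sep, "proj", "package2", "my_second_package.py")

# Point 2: evaluated from a file inside my_tool/tests.
point2_entry = two_levels_up(test_file)

# Point 3: evaluated from package2/my_second_package.py.
point3_entry = two_levels_up(pkg2_file) + os.sep + "my_tool"

# The rsplit expression is equivalent to applying dirname twice.
same_as_dirname = os.path.dirname(os.path.dirname(os.path.abspath(test_file)))

print(point2_entry)
print(point3_entry)
print(same_as_dirname == point2_entry)
```

Both expressions resolve to the my_tool directory itself rather than its parent. As far as I can tell, that means the flat import b style keeps working, and import my_tool then picks up the module my_tool/my_tool.py rather than the package, which is worth keeping in mind when mixing this approach with package-style imports.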

Best regards,

answered Oct 09 '22 03:10

jmatos
