Get only the code out of Jupyter Notebook

Tags:

Is there a solution to pull out all the code of the notebook? For example, if I wanted to generate a source file of my notebook "source.py" that contained all the code in the code cells of the notebook, is that possible?

Thanks!

355

asked Jan 24 '19 15:01

J DOe

2 Answers

nbconvert

You can use the command line tool nbconvert to convert the ipynb file to various other formats. The easiest way to convert it to a .py file is:

jupyter nbconvert --no-prompt --to script notebook_name.ipynb

It outputs only the code and comments without the markdown, input and output prompts. There is also --stdout option.

nbconvert documentation

jq

But you can also just parse the JSON of the notebook using jq:

jq -j '
  .cells
  | map( select(.cell_type == "code") | .source + ["\n\n"] )
  | .[][]
  ' \
  notebook.ipynb > source.py

jq homepage
Jupyter Notebook format

104

answered Sep 20 '22 05:09

Neeraz Lakkapragada

Since the notebook format is JSON it's relatively easy to extract just the text content of only the code cells. The task is made even easier when you use the Python API for working with notebook files.

The following will get you the code on standard output. You can handle it in other ways similarly easily. Bear in mind code source may not have a terminating newline.

from nbformat import read, NO_CONVERT

with open("Some Notebook.ipynb") as fp:
    notebook = read(fp, NO_CONVERT)
cells = notebook['cells']
code_cells = [c for c in cells if c['cell_type'] == 'code']
for cell in code_cells:
    print(cell['source'])

Notebook nodes are a little more flexible than dictionaries, though, and allow attribute (.name) access to fields as well as subscripting (['name']). As a typing-challenged person I find it preferable to write

cells = notebook.cells
code_cells = [c for c in cells if c.cell_type == 'code']

for cell in code_cells:
    print(cell.source)

In answering this question I became aware that the nbformat library has been unbundled, and can therefore be installed with pip without the rest of Jupyter.

answered Sep 21 '22 05:09

holdenweb

Related questions
                            
                                AttributeError: 'list' object has no attribute 'replace' when trying to remove character
                            
                                Pandas groupby two columns then get dict for values
                            
                                Running Python script via systemd fails to load module
                            
                                Python Error on Google Cloud Install. How do I properly set the environment variable?
                            
                                can not convert column type from object to str in python dataframe
                            
                                Can't open Jupyter notebook with Anaconda
                            
                                Change static folder from config in Flask
                            
                                Virtualenv not compatible with this system or executable
                            
                                Reading settings in spider scrapy
                            
                                Python: convert datedelta to int value of time difference
                            
                                'numpy.float64' object has no attribute 'translate' Inserting value to Mysql in Python
                            
                                Python: Find count of the elements of one list in another list
                            
                                How to activate virtual environment from Windows 10 command prompt?
                            
                                how to replace multiple values with one value python
                            
                                Drop columns in a pandas dataframe based on the % of null values
                            
                                Keep running a python script on AWS EC2 even if CLI session is closed
                            
                                Why does the **kwargs mapping compare equal with a differently ordered OrderedDict?
                            
                                The QuerySet value for an exact lookup must be limited to one result using slicing-Django
                            
                                Cannot import cv2 on PyCharm
                            
                                TypeError: fit_transform() missing 1 required positional argument: 'X'

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Get only the code out of Jupyter Notebook

Tags:

python

jupyter-notebook