Jupyter (IPython) notebook: Convert an HTML notebook to ipynb

1 Answers

I recently used BeautifulSoup and JSON to convert html notebook to ipynb. the trick is to look at the JSON schema of a notebook and emulate that. The code selects only input code cells and markdown cells

here is my code

from bs4 import BeautifulSoup import json import urllib.request url = 'http://nbviewer.jupyter.org/url/jakevdp.github.com/downloads/notebooks/XKCD_plots.ipynb' response = urllib.request.urlopen(url) #  for local html file # response = open("/Users/note/jupyter/notebook.html") text = response.read()  soup = BeautifulSoup(text, 'lxml') # see some of the html print(soup.div) dictionary = {'nbformat': 4, 'nbformat_minor': 1, 'cells': [], 'metadata': {}} for d in soup.findAll("div"):     if 'class' in d.attrs.keys():         for clas in d.attrs["class"]:             if clas in ["text_cell_render", "input_area"]:                 # code cell                 if clas == "input_area":                     cell = {}                     cell['metadata'] = {}                     cell['outputs'] = []                     cell['source'] = [d.get_text()]                     cell['execution_count'] = None                     cell['cell_type'] = 'code'                     dictionary['cells'].append(cell)                  else:                     cell = {}                     cell['metadata'] = {}                      cell['source'] = [d.decode_contents()]                     cell['cell_type'] = 'markdown'                     dictionary['cells'].append(cell) open('notebook.ipynb', 'w').write(json.dumps(dictionary))

here is part of print(soup.div) output

div class="container"> <div class="navbar-header"> <button class="navbar-toggle collapsed" data-target=".navbar-collapse" data-toggle="collapse" type="button"> <span class="sr-only">Toggle navigation</span> <i class="fa fa-bars"></i> </button> <a class="navbar-brand" href="/"> <img src="/static/img/nav_logo.svg?v=479cefe8d932fb14a67b93911b97d70f" width="159"/> </a> </div> <div class="collapse navbar-collapse"> <ul class="nav navbar-nav navbar-right"> <li> <a class="active" href="http://jupyter.org">JUPYTER</a> </li> <li> <a href="/faq" title="FAQ"> <span>FAQ</span>

A screen shot of the resulting ipynb file, loaded on my local jupyter and after running all the cells

enter image description here

193

answered Sep 23 '22 18:09

sgDysregulation

Related questions
                            
                                "Too many indexers" with DataFrame.loc
                            
                                Airbnb Airflow vs Apache Nifi [closed]
                            
                                Does get_or_create() have to save right away? (Django)
                            
                                Commit in git only if tests pass
                            
                                Why does pandas apply calculate twice
                            
                                How to use gettext with python >3.6 f-strings
                            
                                Nodejs: Where or How to write complicated business logic?
                            
                                Numpy quirk: Apply function to all pairs of two 1D arrays, to get one 2D array
                            
                                Cyclic module dependencies and relative imports in Python
                            
                                Pip install forked github-repo
                            
                                How is the feature score(/importance) in the XGBoost package calculated?
                            
                                Angle between points?
                            
                                Python's equivalent for R's dput() function
                            
                                Python >=3.5: Checking type annotation at runtime
                            
                                104, 'Connection reset by peer' socket error, or When does closing a socket result in a RST rather than FIN?
                            
                                Plotting results of Pandas GroupBy
                            
                                Closest equivalent of a factor variable in Python Pandas
                            
                                What's the difference between pandas ACF and statsmodel ACF?
                            
                                PHP equivalent of Python's __name__ == "__main__"?
                            
                                How to Reduce the time taken to load a pickle file in python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Jupyter (IPython) notebook: Convert an HTML notebook to ipynb

Tags:

python

ipython

jupyter-notebook

jupyter

nbconvert

foglerit

People also ask

1 Answers

sgDysregulation

Recent Activity

Donate For Us