Data Jupyter is an interactive web application that simplifies data processing on MongoDB databases. Its main interface is modelled after the popular Jupyter Notebook, which makes it familiar to many data analysts. Data Jupyter also supports data traceability through an integrated checkpoint system and by enforcing a modular style of data processing.
- Choose a demo dataset (e.g. nobel_prizes_incorrect) and press Select.
- Add a new cell in the editor by clicking the + icon.
- In the new cell, type
      show = col.find()
  to make the Data Browser show all documents in the collection.
- Click the single arrow icon to run the cell.
- The data will be displayed on the Raw tab.
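The query in the steps above can be narrowed with a filter document. The following is a minimal sketch, assuming that `col` behaves like a PyMongo collection, whose `find()` accepts an optional filter. `FakeCollection`, the sample documents, and their field names are hypothetical stand-ins used only so the snippet runs without a MongoDB server; they are not part of Data Jupyter.

```python
# Hypothetical in-memory stand-in for the `col` object, so that this sketch
# runs without a MongoDB server. Only simple equality filters are supported.
class FakeCollection:
    def __init__(self, documents):
        self._documents = list(documents)

    def find(self, query=None):
        """Return the documents whose fields equal every key/value in `query`."""
        query = query or {}
        return [
            doc for doc in self._documents
            if all(doc.get(key) == value for key, value in query.items())
        ]


# Illustrative sample documents; the field names are assumptions.
col = FakeCollection([
    {"year": 1903, "category": "physics", "laureate": "Marie Curie"},
    {"year": 1911, "category": "chemistry", "laureate": "Marie Curie"},
    {"year": 1921, "category": "physics", "laureate": "Albert Einstein"},
])

# An empty query matches every document, as in the step above.
show = col.find()

# An equality filter keeps only the matching documents.
physics_only = col.find({"category": "physics"})
```

In a real notebook cell only the assignment to `show` would be needed, since the collection already exists there.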
Data traceability is an important aspect of data analytics. Data Jupyter supports it through an integrated checkpoint system.
- Click on the icon to the right.
- Select the checkpoint “6/15/2021, 1:09:23 PM” and click Load.
- The notebook state will be restored to that checkpoint.
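The idea behind saving and restoring checkpoints can be illustrated with a small sketch. It is not Data Jupyter's implementation; `CheckpointStore`, its methods, and the shape of the notebook state are hypothetical names chosen for illustration.

```python
import copy
from datetime import datetime


class CheckpointStore:
    """Hypothetical helper that keeps labelled snapshots of a notebook state."""

    def __init__(self):
        self._snapshots = {}

    def save(self, state, label=None):
        # Use the current timestamp as the label when none is given.
        label = label or datetime.now().isoformat()
        # Deep-copy so later edits to `state` do not alter the snapshot.
        self._snapshots[label] = copy.deepcopy(state)
        return label

    def load(self, label):
        # Return a copy so the stored snapshot itself stays untouched.
        return copy.deepcopy(self._snapshots[label])


store = CheckpointStore()
notebook = {"cells": ["show = col.find()"]}

# Take a checkpoint, then keep editing the notebook.
label = store.save(notebook, label="before-filter")
notebook["cells"].append("show = col.find({'category': 'physics'})")

# Loading the checkpoint gives back the state at the time it was saved.
restored = store.load(label)
```

Copying on both save and load keeps each checkpoint independent of later edits, which is what makes an earlier state reproducible.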