Data Jupyter

Data Jupyter is an interactive web application for simplifying the task of data processing on MongoDB databases. The main interface of Data Jupyter is modelled after the popular Jupyter Notebook, making it familiar to many data analysts. Data Jupyter also offers support for data traceability through its integrated checkpoint system and by enforcing a modular style of data processing.

Loading a dataset

Choose a demo dataset (e.g. nobel_prizes_incorrect) and press Select.

Add a new cell in the editor by clicking the + icon.

In the new cell, type show = col.find() to make the Data Browser show all documents in the collection.

Click the single arrow icon to run the cell.

The data will be displayed on the Raw tab.

Loading a checkpoint

Data traceability is a very important aspect in data analytics. Data Jupyter supported this by integrating a checkpoint system.

Click on the icon to the right.

Select the checkpoint “6/15/2021, 1:09:23 PM” and click Load.

The notebook state will be restored to that checkpoint.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
backend		backend
frontend		frontend
.gitignore		.gitignore
Procfile		Procfile
README.md		README.md
requirements.txt		requirements.txt
wsgi.py		wsgi.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Jupyter

Loading a dataset

Loading a checkpoint

About

Releases 3

Packages

Languages

alvin-yang68/Data-Jupyter

Folders and files

Latest commit

History

Repository files navigation

Data Jupyter

Loading a dataset

Loading a checkpoint

About

Resources

Stars

Watchers

Forks

Releases 3

Packages 0

Languages

Packages