Skip to content

A demo web app for interactive data processing of MongoDB databases.

Notifications You must be signed in to change notification settings

alvin-yang68/Data-Jupyter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

63 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Jupyter

Data Jupyter is an interactive web application for simplifying the task of data processing on MongoDB databases. The main interface of Data Jupyter is modelled after the popular Jupyter Notebook, making it familiar to many data analysts. Data Jupyter also offers support for data traceability through its integrated checkpoint system and by enforcing a modular style of data processing.

Loading a dataset

  1. Choose a demo dataset (e.g. nobel_prizes_incorrect) and press Select.

image

  1. Add a new cell in the editor by clicking the + icon.

image

  1. In the new cell, type show = col.find() to make the Data Browser show all documents in the collection.

image

  1. Click the single arrow icon to run the cell.

image

  1. The data will be displayed on the Raw tab.

image

Loading a checkpoint

Data traceability is a very important aspect in data analytics. Data Jupyter supported this by integrating a checkpoint system.

  1. Click on the icon to the right.

image

  1. Select the checkpoint “6/15/2021, 1:09:23 PM” and click Load.

image

  1. The notebook state will be restored to that checkpoint.

image