JupyterDays Boston is a 2 day event on March 17-18. Sponsored by
Here's a great summary of the first day by @DainaBouquin:
http://altbibl.io/gazette/jupyterdays-day-1/
- 9:00AM to 9:30AM: Registration and Breakfast
- 9:30AM to 9:40AM: Welcome from the Organizers - Andrew Odewahn (O'Reilly Media), Chris Erdmann (Harvard-Smithsonian Center for Astrophysics), Ana Ruvalcaba (Project Jupyter)
- 9:40AM to 10:30AM: Keynote - Matthias Bussonnier — From Data Gathering to Publishing
- 10:30AM to 11:00AM: Thorin Tabor — GenePattern Notebooks: Jupyter for Bioinformatic Research
- 11:00AM to 12:00PM: Birds of a Feather Discussions
- 12:00PM to 1:00PM: Lunch
- 1:00PM to 1:30PM: Faras Sadek, Yasha Iravantchi, Diana Zhang and Demba Ba — Wearable Signal Processing Using Docker Notebook Containers on AWS
- 1:30PM to 2:00PM: Jeremy Freeman - Binder
- 2:00PM to 2:30PM: Safia Abdalla - Kooking with Kernels
- 2:30PM to 2:45PM: Break
- 2:45PM to 3:15PM: Elaine Angelino and Sam Lau - Jupyter in UC Berkeley's Data Science Education Program
- 3:15PM to 3:45PM: Chelsea Douglas - Data Visualization with Plotly in Jupyter
- 3:45PM to 4:00PM: Break
- 4:00PM to 5:30PM: Jupyter in Education Panel — Paco Nathan, Rahul Dave, Elaine Angelino, Demba Ba, Eni Mustafaraj, Allen Downey
- 9:00AM to 9:30AM: Registration and Breakfast
- 9:30AM to 10:45 Jonas Rosland -- Introduction to Docker
- 10:45AM to 12:30PM Kyle Kelley and Ash Wilson -- Running JupyterHub on Docker
- 12:30PM to 1:30PM Lunch
- 1:30PM to 2:30PM Laurent Gautier - Polyglot Data Analysis (Visually Demonstrated) With Python and R
- 2:30PM to 3:30PM Brian Keegan -- Data Mining and Network Analysis from Wikipedia Data
- 3:30PM to 4:30PM Peter Bull -- Data Science is Software
- 4:30PM to 5:00PM Closing Remarks
Project Jupyter is an engaged and respectful community made up of people from all over the world. Your involvement helps us to further our mission and create an open platform that serves a broad range of communities, from research and education to journalism, industry and beyond.
At Project Jupyter and JupyterDay, we are committed to providing a harassment-free conference experience for everyone, regardless of gender, sexual orientation, disability, physical appearance, body size, race, or religion. Conference organizers will not tolerate harassment of participants in any form. Sexual language and imagery is not appropriate for any part of the conference venue, including talks.
For the full code of conduct and how to report an incident, see code of conduct.
The event is being held at:
Harvard Law School
1585 Massachusetts Ave
Cambridge, MA 02138
Milstein East A & B
We will be using the hash tag #JupyterDays
for the event. Please tweet liberally during the event!
Allen Downey is a Professor at Olin College and the author of a series of books including Think Python, Think Bayes, and Think Stats, all published by O’Reilly Media and available under free licenses from Green Tea Press.
Ash Wilson builds developer tools, SDKs, and a content management system on the Developer Experience team at Rackspace. His interests include programming languages, IDEs, chat bots, continuous integration and delivery, configuration management, and Docker.
Brian is a scientist of network, data, and social computing and a research associate at the Harvard Business School’s HBX online learning platform. His research analyzes the structure and dynamics of large-scale online collaboration and communication networks. He received his Ph.D. in Media, Technology, & Society from Northwestern University and S.B. degrees in Mechanical Engineering and Science, Technology, & Society from MIT.
Chelsea Douglas is the lead support engineer at Plotly. She is developing Plotly's python API and javascript library, and is also an avid Jupyter Notebook user!
Daina Boiquin is the Assistant Head Librarian at the Harvard-Smithsonian Center for Astrophysics in Cambridge, MA. Her work focuses primarily on how computational techniques and thinking can be applied to socio-technical and historical questions about how libraries and archives can contribute to the creation of new knowledge. Some specific topics of interest include open data, metadata, data visualization, machine learning and digital technologies development.
Demba Ba is Professor of Electrical Engineering and Bioengineering in the School of Engineering and Applied Sciences at Harvard University. He has close to 10 years of experience, in industry and academia, developing novel algorithms for extracting useful information from data. In his research, he develops mathematical and computational tools that leverage the inherent structure in data from artificial and natural systems. His group has developed a new, Jupyter-based course on Signal Processing at Harvard that addresses several challenges.
Elaine Angelino is a Miller Fellow at the University of California, Berkeley, where her faculty host is Michael I. Jordan. She completed her A.B. in applied mathematics, M.A. in systems biology, and Ph.D. in computer science at Harvard. Her recent research is at the intersection of machine learning and systems.
Eni Mustafaraj is an Assistant Professor of Computer Science at Wellesley College. She has introduced the use of Jupyter notebooks in the CS1 course at Wellesley, and they are now used extensively during class time. Jupyter notebooks are also the central component of her elective course on Data Science.
Jeremy Freeman is a neuroscientist using computation to understand the brain. He is the group leader at Janelia, co-organizer of CodeNeuro, and the creator of the popular binder service for launching Notebooks and other Docker-based environments.
Jonas Rosland is a Developer Advocate at EMC {code}, loves tech, food, wine, craft beers and geeky things. He blogs at jonasrosland.com.
Kyle Kelley is a programmer, sometimes mathematician, oftentimes ops. He works for Rackspace, where he both fosters Developer Experience and works on open source to benefit communities, open source projects, and the companies that rely on and support the tooling. As part of this, he is a core contributor to IPython and the Jupyter project.
Laurent Gautier is a Senior Investigator at Novartis Institutes for BioMedical Research. He is a long-time user of R and original core member of the bioconductor project, the lead of the Python-R bridge (up to 60,000 downloads/month!), and he wrote the first Julia-R bridge.
Matthias Bussonnier is a post-doctoral scholar who received his PhD in Biophysics at Institut Curie (Paris, France) after a training in fundamental Physics at ENS Cachan (France). Matthias develops tools for modern computational research across disciplines, with an emphasis on high-level languages, literate computing, and reproducible research. Matthias has been a core developer of the IPython and Jupyter Project team since 2011 and is bringing real-time collaboration to scientific tools.
Paco Nathan is the Director of the Learning Group at O'Reilly Media. He has led several data teams in industry, evangelized many popular open source projects (Spark, Mesos, Cascading, and been teaching with Jupyter notebooks for a couple of years, and loving it! http://liber118.com/pxn/
Peter is a co-founder at DrivenData, whose mission is to bring the power of data science to the social sector. Recently he has worked on projects in smart school budgeting, predicting trends in women’s healthcare, and improving public services by using novel data sources. He earned his master’s in Computational Science and Engineering from Harvard in 2014. Previously he worked as a software engineer at Microsoft and earned a BA in philosophy from Yale University.
Rahul Dave is a Computational Scientist at the Astrophysics Data System (ADS) at Harvard University in Cambridge, MA. He is also Chief Wrangler at Nareau Inc, a consultancy focussed on managing large data sets, finding patterns in and computing on data, visualizing data and results of computation, and making these available over the web. He is also interested in collaborative software, especially as it pertains to personal and group information management, and its potential to create new, more democratic and responsible forms of media. In the recent past, he was a computational scientist at Harvard’s Initiative in Innovative Computing. He is a cosmologist by training, and a developer by vocation. He is an avid climber and skier, and a mountain person at heart.
Safia is an energetic software engineer with an interest in data science for social good, intelligent interfaces, and great coffee. She is the founder and fearless leader of dsfa, a startup aimed at providing data science services to small businesses. She is also the organizer of PyData Chicago, where she brings an awesome community of developers and data scientists to build amazing open source software. In her free time, she enjoys working out, contributing to open-source software, and teaching people how to be awesome with code.
Thorin Tabor is the developer of the GenePattern Notebook environment, which uses Jupyter Notebooks to capture, automate and share workflows used in cancer research, as well as other genomic analyses.
GenePattern Notebook is an extension that allows Jupyter to communicate with the open source GenePattern environment. This environment wraps hundreds of different software tools for analyzing gene expression data, and makes these tools available in a user-friendly interface.