dtu.prosys

About this project 🚀

This project intends to be a didactic tool to teach spectroscopy and chemometrics in the context of fermentation technology. During my studies, I often felt that many courses were theory-based only due to the limited access to real-world data. For this reason, I have decided to distribute the data I generated during my studies hoping to improve the learning experience of future students.

This module contains:

Training data (spectra of different samples and the glucose concentration).
Fermentation spectra (spectra measured in real-time every minute).
Fermentation HPLC data (measured off-line every hour).
Common preprocessing operations used in chemometrics.
Workflow to train partial least squares (PLS) models.
Plotting functions for time-series and spectral data.

These functions can be used as a starting point for the, but more advanced users are encouradged to explore other packages to play with this data (e.g., scikit-learn, or scipy).

About the data 📈

This project provides two datasets (a training and a validation set). Both data sets were recorded at the Technical University of Denmark, at the PROSYS research center (department of Chemical and Biochemical engineering) during 2019. More information about the dataset can be found in the following article Transforming data to information: A parallel hybrid model for real-time state estimation in lignocellulosic ethanol fermentation

The training set

The training set contains the spectra of 20 semi-synthetic samples and their reference glucose concentration measured with high performance liquid chromatography (HPLC). The spectra were measured using attenuated total refractance mid infrared (ATR-MIR) spectroscopy.

Validation set

The validation contains spectra measured every minute during a lignocellulose to ethanol fermentation. These spectra were collected in real-time using the same ATR-MIR instrument, connected to a flow-cell. Moreover, the extracellular concentrations of glucose, xylose, ethanol, furfural and acetic acid were also measured every hour using HPLC.

Installation 💻

Dependencies

This project is build targetting Python >= 3.7 to ensure compatibility with Google Colab.

User installation `pip`

pip install -U dtuprosys

Quick start 🏁

A complete example of can be found in the example.ipynb. The raw data can be conviniently accessed using as the following commands:

from dtuprosys.datasets import load_train_data, load_fermentation_data

to access the training data

train_spectra, train_hplc = load_train_data()

to access the validation data:

fermentation_spectra, fermentation_hplc = load_fermentation_data()

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.github		.github
.vscode		.vscode
docs		docs
fermentools		fermentools
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Pipfile		Pipfile
Pipfile.lock		Pipfile.lock
README.md		README.md
example.ipynb		example.ipynb
mechanistic.ipynb		mechanistic.ipynb
pyproject.toml		pyproject.toml
real_time.ipynb		real_time.ipynb
requirements.txt		requirements.txt
setup.py		setup.py
test.ipynb		test.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

dtu.prosys

About this project 🚀

About the data 📈

The training set

Validation set

Installation 💻

Dependencies

User installation `pip`

Quick start 🏁

About

Releases 6

Packages

Languages

License

paucablop/fermentools

Folders and files

Latest commit

History

Repository files navigation

dtu.prosys

About this project 🚀

About the data 📈

The training set

Validation set

Installation 💻

Dependencies

User installation pip

Quick start 🏁

About

Resources

License

Stars

Watchers

Forks

Releases 6

Packages 0

Languages

User installation `pip`

Packages