bio-datascience/ProDG

Python version License: MIT

ProDG: Prokaryotic Data Generator

ProDG is a Python library for generating synthetic data based on different probability distributions. It's specifically designed for prokaryotic data but can be used for any zero-inflated compositional data.
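For readers unfamiliar with the term, the sketch below builds a small zero-inflated compositional table using only NumPy and pandas. It is illustrative and does not call ProDG. The helper names `make_zero_inflated_counts` and `to_relative_abundance`, the negative binomial parameters, and the zero probability are all assumptions chosen for this example, not values taken from the library.

```python
import numpy as np
import pandas as pd


def make_zero_inflated_counts(n_taxa=100, n_samples=4, zero_prob=0.6, seed=0):
    """Toy zero-inflated count table (hypothetical helper, not part of ProDG)."""
    rng = np.random.default_rng(seed)
    # Overdispersed counts drawn from a negative binomial distribution
    counts = rng.negative_binomial(n=2, p=0.05, size=(n_taxa, n_samples))
    # Excess zeros: each entry is set to zero with probability zero_prob
    mask = rng.random((n_taxa, n_samples)) < zero_prob
    counts[mask] = 0
    index = [f"Species{i + 1}" for i in range(n_taxa)]
    columns = [f"Sample{j + 1}" for j in range(n_samples)]
    return pd.DataFrame(counts, index=index, columns=columns)


def to_relative_abundance(counts):
    """Divide each sample (column) by its total so that it sums to 1."""
    return counts / counts.sum(axis=0)


counts = make_zero_inflated_counts()
composition = to_relative_abundance(counts)
```

Here `counts` holds the raw table with many zeros, and `composition` holds the same table rescaled so that every sample column sums to 1, which is what "compositional" refers to.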

Installation

To get ProDG, clone the repository:

git clone https://github.com/Vlasovets/microbe-data-gen.git

Usage

Here is a basic example of how to use ProDG:

import numpy as np
import pandas as pd

from prodg import DataGenerator

# Create sample data: rows are bacterial species, columns are samples
df = pd.DataFrame(np.random.randint(0, 100, size=(100, 4)), columns=['Sample1', 'Sample2', 'Sample3', 'Sample4'])

# Create a generator instance
prodg = DataGenerator()

# Fit the models to the data
prodg.fit(df)

# Generate new data
synthetic_data = prodg.generate(df)

# Print the synthetic data
print(synthetic_data)
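One way to sanity-check generated data is to compare simple per-sample summaries of the original and synthetic tables. The sketch below assumes the synthetic output is a pandas DataFrame laid out like the input, which this README does not state. `summarize_table` is a hypothetical helper, not part of ProDG, and both tables here are random stand-ins so that the snippet runs without ProDG installed.

```python
import numpy as np
import pandas as pd


def summarize_table(df):
    """Per-sample zero fraction and total count (hypothetical helper, not part of ProDG)."""
    return pd.DataFrame({
        "zero_fraction": (df == 0).mean(axis=0),
        "total": df.sum(axis=0),
    })


# Stand-in tables; in practice these would be the input and the generated data
rng = np.random.default_rng(0)
cols = ["Sample1", "Sample2", "Sample3", "Sample4"]
original = pd.DataFrame(rng.integers(0, 100, size=(100, 4)), columns=cols)
stand_in = pd.DataFrame(rng.integers(0, 100, size=(100, 4)), columns=cols)

# Side-by-side summary, one row per sample
comparison = summarize_table(original).join(
    summarize_table(stand_in), lsuffix="_original", rsuffix="_synthetic"
)
print(comparison)
```

Large gaps between the `_original` and `_synthetic` columns would suggest that the generated data does not reproduce the sparsity or sequencing depth of the input.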

License

ProDG is licensed under the MIT License.
