Skip to content

shashwat0105/Bioinformatics-Drug-Discovery

Repository files navigation

Bioinformatics-Drug-Discovery

  • Collecting the data(from the ChEMBL database) and preprocessing for SARS coronavirus 3C-like proteinase.
  • Exploratory analysis(chemical space analysis in this case) via the Lipinski descriptors to evaluate the druglikeness of a compound.
  • Calculated bioactivity fingerprint descriptors from the PaDEL descriptor and stored them in a data frame for further model building.
  • Regression model building with random forest, scatter plot between experimental and predicted pIC50 values.
  • Comparison of several machine learning models using lazypredict, visualisation and deployment(currently under process).