The ESMFold Protein Structure Predictor is a web application that uses the ESM-2 language model to predict the 3D structure of proteins based on their amino acid sequences. This tool allows users to input protein sequences in text format or upload a FASTA file and visualize the predicted protein structure in 3D. It also provides pLDDT scores, which are confidence measures for the predicted structure.
- Protein Sequence Input: Accepts protein sequences via text input or FASTA file upload.
- Structure Prediction: Predicts protein structures using the ESMFold API.
- 3D Visualization: Visualizes the predicted protein structure using Py3Dmol.
- pLDDT Scores: Extracts and displays pLDDT scores, indicating the confidence of the predicted structure.
- Download Option: Allows users to download the predicted PDB file.
- Python 3.7 or higher
- Streamlit
- Requests
- Urllib3
- Py3Dmol
- Clone the repository:
git clone https://github.com/yourusername/esmfold-predictor.git cd esmfold-predictor
- Create and activate a virtual environment:
python3 -m venv venv source venv/bin/activate # On Windows, use `venv\Scripts\activate`
- Install the required packages:
pip install -r requirements.txt
- Run the Streamlit application:
streamlit run esm.py
- Open your web browser and navigate to
http://localhost:8501
.