👈 Back to Home Page

Protein Representation

📘 Related paper reading list: awesome-protein-representation-learning

Protein language model

| Title | Pub.&Year | Notes |
| --- | --- | --- |
| TAPE Evaluating Protein Transfer Learning with TAPE | NIPS '2019 | |
| ESM-1b Biological structure and function emerge from scaling unsupervised learning to 250 million protein sequences | PNAS '2020 | |
| ProtTrans ProtTrans: Toward Understanding the Language of Life Through Self-Supervised Learning | TPAMI '2021 | |
| MSA Transformer | | |
| [Survey] Learning functional properties of proteins with language models | NMI '2022 | |
| [Benchmark] PEER: A Comprehensive and Multi-Task Benchmark for Protein Sequence Understanding | NIPS '2022 | |
| Large language models generate functional protein sequences across diverse families | NBT '2022 | |
| ProtGPT2 ProtGPT2 is a deep unsupervised language model for protein design | NC '2022 | |
| [Survey] A Survey on Protein Representation Learning: Retrospect and Prospect | Arxiv '2023 | paper list |
| xTrimoPGLM xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein | 2023 | |
| SaProt SaProt: Protein Language Modeling with Structure-aware Vocabulary | ICLR '2024 | code |
| [Benchmark] Feature Reuse and Scaling: Understanding Transfer Learning with Protein Language Models | ICML '2024 | |
| ESM All-Atom: Multi-scale Protein Language Model for Unified Molecular Modeling | ICML '2024 | |
| AMPLIFY Protein Language Models: Is Scaling Necessary? | BioRxiv '2024 | |
| Ginkgo-AA A Protein Sequence LLM Trained on 2 Billion Proprietary Sequences | blog | 650M |

👆 Back to Top

Structure representation

| Title | Pub.&Year | Notes |
| --- | --- | --- |

👆 Back to Top