Here we have collected papers with the theme of "AI for crystalline materials" that have appeared at top machine learning conferences and journals (ICML, ICLR, NeurIPS, AAAI, NPJ, NC, etc.) in recent years. See https://arxiv.org/abs/2408.08044 for details. We will keep this page updated.
Method | Paper |
---|---|
SchNet | Schnet: A continuous-filter convolutional neural network for modeling quantum interactions (NeurIPS2017) [Paper][Code] |
CGCNN | Crystal graph convolutional neural networks for an accurate and interpretable prediction of material properties (Physical Review Letters, 2018) [Paper][Code] |
MEGNET | Graph networks as a universal machine learning framework for molecules and crystals (Chemistry of Materials, 2019) [Paper][Code] |
GATGNN | Graph convolutional neural networks with global attention for improved materials property prediction (Physical Chemistry Chemical Physics, 2020) [Paper][Code] |
ALIGNN | Atomistic line graph neural network for improved materials property predictions (npj Computational Materials, 2021) [Paper][Code] |
ECN | Equivariant networks for crystal structures (NeurIPS2022) [Paper][Code] |
PotNet | Efficient Approximations of Complete Interatomic Potentials for Crystal Property Prediction (ICML2023) [Paper][Code] |
CrysGNN | Crysgnn: Distilling pre-trained knowledge to enhance property prediction for crystalline materials (AAAI2023) [Paper][Code] |
ETGNN | A general tensor prediction framework based on graph neural networks (The Journal of Physical Chemistry Letters, 2023) [Paper] |
DTNet | Dielectric tensor prediction for inorganic materials using latent information from preferred potential (npj Computational Materials, 2024) [Paper][Code] |
GMTNet | A Space Group Symmetry Informed Network for O(3) Equivariant Crystal Tensor Prediction (ICML2024) [Paper][Code] |
CEGANN | CEGANN: Crystal Edge Graph Attention Neural Network for multiscale classification of materials environment (npj Computational Materials, 2023) [Paper][Code] |
ComFormer | Complete and Efficient Graph Transformers for Crystal Material Property Prediction (ICLR2024) [Paper][Code] |
Crystalformer | Crystalformer: infinitely connected attention for periodic structure encoding (ICLR2024) [Paper][Code] |
Crystalformer | Conformal Crystal Graph Transformer with Robust Encoding of Periodic Invariance (AAAI2024) [Paper] |
E(3)NN | Direct prediction of phonon density of states with Euclidean neural networks (Advanced Science, 2021) [Paper][Code] |
DOSTransformer | Density of States Prediction of Crystalline Materials via Prompt-guided Multi-Modal Transformer (NeurIPS2023) [Paper][Code] |
Matformer | Periodic Graph Transformers for Crystal Material Property Prediction (NeurIPS2022) [Paper][Code] |
CrysDiff | A Diffusion-Based Pre-training Framework for Crystal Property Prediction (AAAI2024) [Paper] |
MOFTransformer | A multi-modal pre-training transformer for universal transfer learning in metal-organic frameworks (Nature Machine Intelligence, 2023) [Paper][Code] |
Uni-MOF | A comprehensive transformer-based approach for high-accuracy gas adsorption predictions in metal-organic frameworks (Nature Communications, 2024) [Paper][Code] |
SODNet | Learning Superconductivity from Ordered and Disordered Material Structures (NeurIPS2024) [Paper][Code] |
ECSG | Predicting thermodynamic stability of inorganic compounds using ensemble machine learning based on electron configuration (Nature Communications, 2025) [Paper][Code] |
Method | Paper |
---|---|
G-SchNet | Symmetry-adapted generation of 3d point sets for the targeted discovery of molecules (NeurIPS2019) [Paper][Code] |
CDVAE | Crystal Diffusion Variational Autoencoder for Periodic Material Generation (ICLR2022) [Paper][Code] |
Con-CDVAE | Con-CDVAE: A method for the conditional generation of crystal structures (Computational Materials Today, 2024) [Paper][Code] |
Cond-CDVAE | Deep learning generative model for crystal structure prediction (npj Computational Materials, 2024) [Paper][Code] |
LCOMs | Latent Conservative Objective Models for Data-Driven Crystal Structure Prediction (NeurIPS2023 Workshop) [Paper] |
DiffCSP | Crystal structure prediction by joint equivariant diffusion on lattices and fractional coordinates (NeurIPS2023) [Paper][Code] |
DiffCSP-SC | Learning Superconductivity from Ordered and Disordered Material Structures (NeurIPS2024) [Paper][Code] |
EquiCSP | Equivariant Diffusion for Crystal Structure Prediction (ICML2024) [Paper][Code] |
GemsDiff | Vector Field Oriented Diffusion Model for Crystal Material Generation (AAAI2024) [Paper][Code] |
SyMat | Towards symmetry-aware generation of periodic materials (NeurIPS2023) [Paper][Code] |
EMPNN | Equivariant Message Passing Neural Network for Crystal Material Discovery (AAAI2023) [Paper][Code] |
UniMat | Scalable Diffusion for Materials Generation (ICLR2024) [Paper][Code] |
PGCGM | Physics guided deep learning for generative design of crystal materials with symmetry constraints (npj Computational Materials, 2023) [Paper][Code] |
CubicGAN | High-throughput discovery of novel cubic crystal materials using deep generative neural networks (Advanced Science, 2021) [Paper][Code] |
PCVAE | PCVAE: A Physics-informed Neural Network for Determining the Symmetry and Geometry of Crystals (IJCNN2023) [Paper][Code] |
DiffCSP++ | Space Group Constrained Crystal Generation (ICLR2024) [Paper][Code] |
FlowMM | FlowMM: Generating Materials with Riemannian Flow Matching (ICML2024) [Paper][Code] |
Govindarajan | Behavioral Cloning for Crystal Design (ICLR2023 Workshop) [Paper][Code] |
CHGFlowNet | Hierarchical GFlownet for Crystal Structure Generation (NeurIPS2023 Workshop) [Paper] |
LM-CM,LM-AC | Language models can generate molecules, materials, and protein binding sites directly in three dimensions as xyz, cif, and pdb files (Arxiv, 2023) [Paper][Code] |
CrystaLLM | Crystal structure generation with autoregressive large language modeling (Nature Communications, 2024) [Paper][Code] |
CrystalFormer | Space Group Informed Transformer for Crystalline Materials Generation (Arxiv, 2024) [Paper][Code] |
SLI2Cry | An invertible, invariant crystal representation for inverse design of solid-state materials using generative deep learning (Nature Communications, 2023) [Paper][Code] |
Gruver | Fine-Tuned Language Models Generate Stable Inorganic Materials as Text (ICLR2024) [Paper][Code] |
FlowLLM | FlowLLM: Flow Matching for Material Generation with Large Language Models as Base Distributions (NeurIPS2024) [Paper][Code] |
Mat2Seq | Invariant Tokenization of Crystalline Materials for Language Model Enabled Generation (NeurIPS2024) [Paper] |
GenMS | Generative Hierarchical Materials Search (NeurIPS2024) [Paper] |
ChemReasoner | CHEMREASONER: Heuristic Search over a Large Language Model’s Knowledge Space using Quantum-Chemical Feedback (ICML2024) [Paper] [Code] |
a²c | Predicting emergence of crystals from amorphous precursors with deep learning potentials (Nature Computational Science, 2024) [Paper][Code] |
MatterGen | A generative model for inorganic materials design (Nature, 2025) [Paper][Code] |
Method | Paper |
---|---|
- | Insightful classification of crystal structures using deep learning (Nature Communications, 2018) [Paper] |
- | Advanced steel microstructural classification by deep learning methods (Scientific Reports, 2018) [Paper] |
- | Neural network for nanoscience scanning electron microscope image recognition (Scientific Reports, 2017) [Paper] |
- | Deep Learning-Assisted Quantification of Atomic Dopants and Defects in 2D Materials (Advanced Science, 2021) [Paper] |
- | Classification of crystal structure using a convolutional neural network (IUCrJ,2017) [Paper] |
- | Synthesis, optical imaging, and absorption spectroscopy data for 179072 metal oxides (Scientific Data, 2019) [Paper] |
- | Adaptively driven X-ray diffraction guided by machine learning for autonomous phase identification (npj Computational Materials, 2023) [Paper] [Code] |
- | Automated classification of big X-ray diffraction data using deep learning models (npj Computational Materials, 2023) [Paper] [Code] |
XRD-AutoAnalyzer | Integrated analysis of X-ray diffraction patterns and pair distribution functions for machine-learned phase identification (npj Computational Materials, 2024) [Paper] [Code] |
CrystalNet | Towards end-to-end structure determination from x-ray diffraction data using deep learning (npj Computational Materials, 2024) [Paper] [Code] |
Method | Paper |
---|---|
BPNN | Generalized neural-network representation of high-dimensional potential-energy surfaces (Physical Review Letters, 2007) [Paper] |
- | Gaussian approximation potentials: The accuracy of quantum mechanics, without the electrons (Physical Review Letters, 2010) [Paper] |
NequIP | E (3)-equivariant graph neural networks for data-efficient and accurate interatomic potentials (Nature Communications, 2022) [Paper][Code] |
CHGNet | CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling (Nature Machine Intelligence, 2023) [Paper][Code] |
Cormorant | Cormorant: Covariant molecular neural networks (NeurIPS2019) [Paper][Code] |
MACE | MACE: Higher order equivariant message passing neural networks for fast and accurate force fields (NeurIPS2022) [Paper][Code] |
DimeNet | Directional Message Passing for Molecular Graphs (ICLR2020) [Paper][Code] |
M3GNet | A universal graph deep learning interatomic potential for the periodic table (Nature Computational Science, 2022) [Paper][Code] |
- | Injecting domain knowledge from empirical interatomic potentials to neural networks for predicting material properties (NeurIPS2022) [Paper][Code] |
CHGNet | CHGNet as a pretrained universal neural network potential for charge-informed atomistic modelling (Nature Machine Intelligence, 2023) [Paper][Code] |
- | Forces are not Enough: Benchmark and Critical Evaluation for Machine Learning Force Fields with Molecular Simulations (Transactions on Machine Learning Research, 2023) [Paper] |
DeepRelax | Scalable crystal structure relaxation using an iteration-free deep generative model with uncertainty quantification (Nature Communications, 2024) [Paper] [Code] |
Dataset | Description | URL |
---|---|---|
Materials Project | Materials Project encompasses over 120,000 materials, each accompanied by a comprehensive specification of its crystal structure and important physical properties. | Materials Project |
JARVIS-DFT | JARVIS-DFT encompasses data for approximately 40,000 materials and includes around one million calculated properties. | JARVIS-DFT |
OQMD | OQMD is a repository of thermodynamic and structural properties of inorganic materials, derived from high-throughput DFT calculations. | OQMD |
Perov-5 | Perov-5 is a specialized dataset of perovskite crystal materials, containing 18,928 different perovskite materials. | Perov-5 |
Carbon-24 | Carbon-24 is a specialized dataset of carbon materials, containing over 10,000 different carbon structures. | Carbon-24 |
Crystallography Open Database | Crystallography Open Database is a crystallography database that specializes in collecting and storing crystal structure information for inorganic compounds, small organic molecules, metal-organic compounds, and minerals. | Crystallography Open Database |
Raman Open Database | Raman Open Database is an open database that specializes in collecting and storing Raman spectroscopy data. | Raman Open Database |
Inorganic Crystal Structure Database | Inorganic Crystal Structure Database is the world's largest database for completely identified inorganic crystal structures. | Inorganic Crystal Structure Database |
Open Catalyst Project | The goal of Open Catalyst Project is to utilize artificial intelligence to simulate and discover new catalysts for renewable energy storage. | Open Catalyst Project |
Python Materials Genomics | Python Materials Genomics is a robust, open-source Python library for materials analysis, offering a range of modules for handling crystal structures, band structures, phase diagrams, and material properties. | Python Materials Genomics |
MatBench | MatBench is a benchmark suite in the field of materials science, designed to evaluate and compare the performance of various ML models. | MatBench |
M² Hub | M² Hub is a machine learning toolkit for materials discovery research that covers the entire workflow. | M² Hub |
Phonon DOS Dataset | Phonon DOS Dataset contains approximately 1,500 crystalline materials whose phonon DOS is calculated from DFPT. | Phonon DOS Dataset |
Carolina Materials Database | CMD primarily consists of ternary and quaternary materials generated by some AI methods. | Carolina Materials Database |
Alexandria Database | Alexandria Database includes a large quantity of hypothetical crystal structures generated by ML methods or other algorithmic methodologies. | Alexandria Database |
Materials Project Trajectory Dataset | MPtrj contains 1,580,395 atomic configurations, corresponding energies, 7,944,833 magnetic moments, 49,295,660 forces, and 14,223,555 stress values. | Materials Project Trajectory Dataset |
Quantum MOF | QMOF is a dataset of over 20K metal-organic frameworks and coordination polymers derived from DFT. | Quantum MOF |
Open Materials 2024 | OMat24 contains over 110 million DFT calculations focused on structural and compositional diversity. | Open Materials 2024 |
SuperCon3D | SuperCon3D contains 1,578 superconductor materials (includes 83 distinct elements), each with both Tc and crystal structure data. | SuperCon3D |
Atomly | The Atomly database provides an extensive collection of material data generated through high-throughput first-principles calculations. This includes 320,000 inorganic crystal structures, 310,000 bandgap and density of states profiles, 12,000 dielectric constant tensors, and 16,000 mechanical tensors. | Atomly |