Recently, the advancement of artificial intelligence (AI) has revolutionized automation in various software domains, including software security. AI-driven security approaches, particularly those leveraging machine learning or deep learning, hold promise in automating security workflows. They could reduce manual efforts, which can be integrated into DevOps to ensure uninterrupted delivery speed and align with the DevSecOps paradigm simultaneously.
We identified 12 security tasks associated with the DevSecOps process and reviewed current AI-driven security approaches. Through this analysis, we uncovered 15 challenges faced by these approaches and outlined potential opportunities for future research.
π¦ Comprehensive resources for our AI for DevSecOps survey, authored by Michael Fu, Jirat Pasuksmit, and Chakkrit Tantithamthavorn
π©βπ§ Please let us know if you notice any mistakes or have any suggestions!
π If you find this resource helpful, please consider to star this repository and cite our survey paper:
@article{fu2024ai,
title={AI for DevSecOps: A Landscape and Future Opportunities},
author={Fu, Michael and Pasuksmit, Jirat and Tantithamthavorn, Chakkrit},
journal={arXiv preprint arXiv:2404.04839},
year={2024}
}
- π [January-16-2025] Just Accepted Version available on ACM Digital Library π
- π [December-11-2024] Our paper has been accepted for publication in the ACM Transactions on Software Engineering and Methodology (TOSEM)!
- π [August-23-2024] First revision of our AI4DevSecOps survey is completed
- π [April-07-2024] Our AI4DevSecOps survey (v1) is available on arXiv π
We welcome contributions from the community!
If you have a valuable resource, tool, or idea related to AI for DevSecOps, please submit your PR using the following template.
π₯ This is a great opportunity to share and promote your work, research, or projects with a wider audience!
# Description: [Briefly describe your contribution, its purpose, and relevance.]
# Type of Contribution
- [ ] Research Paper
- [ ] Dataset
- [ ] Tool/Library
- [ ] Tutorial/Guide
- [ ] Other (please specify):
# Paper/Resource Link: [Provide the link here]
We will review and merge your contribution if it is appropriate and relevant to our project.
Thank you for helping us improve Awesome-AI4DevSecOps!
Current Landscape of AI-Driven Security Appoaches in DevSecOps (Section 4 in our paper)
-
Threat Modeling
- No Relevant Publications Identified Using Our Defined Search Strategy
-
Impact Analysis
- No Relevant Publications Identified Using Our Defined Search Strategy
- Software Vulnerability Detection (SVD)
Recurrent Neural Network (RNN)
- Automatic feature learning for predicting vulnerable software components (TSE, 2018) π
- Automated vulnerability detection in source code using deep representation learning (ICMLA, 2018) π
- Vuldeepecker: A deep learning-based system for vulnerability detection (NDSS, 2018) π
- Vuldeelocator: a deep learning-based fine-grained vulnerability detector (TDSC, 2021) π
- VUDENC: vulnerability detection with deep learning on a natural codebase for Python (IST, 2022) π
Text Convolutional Neural Network (TextCNN)
- A software vulnerability detection method based on deep learning with complex network analysis and subgraph partition (IST, 2023) π
Graph Neural Network (GNN)
- Devign: Effective vulnerability identification by learning comprehensive program semantics via graph neural networks (NeurIPS, 2019) π
- Bgnn4vd: Constructing bidirectional graph neural-network for vulnerability detection (IST, 2021) π
- Deep learning based vulnerability detection: Are we there yet (TSE, 2021) π
- Vulnerability detection with fine-grained interpretations (FSE, 2021) π
- LineVD: Statement-level vulnerability detection using graph neural networks (MSR, 2022) π
- mVulPreter: A Multi-Granularity Vulnerability Detection System With Interpretations (TDSC, 2022) π
- VulChecker: Graph-based Vulnerability Localization in Source Code (USENIX, 2022) π
- CPVD: Cross Project Vulnerability Detection Based On Graph Attention Network And Domain Adaptation (TSE, 2023) π
- DeepVD: Toward Class-Separation Features for Neural Network Vulnerability Detection (ICSE, 2023) π
- Learning Program Semantics for Vulnerability Detection via Vulnerability-Specific Inter-procedural Slicing (FSE, 2023) π
- SedSVD: Statement-level software vulnerability detection based on Relational Graph Convolutional Network with subgraph embedding (IST, 2023) π
Node2Vec
- Enhancing Deep Learning-based Vulnerability Detection by Building Behavior Graph Model (ICSE, 2023) π
Pre-trained Code Language Model (CLM) (Transformers)
LM + GNN
Benchmark | Year | Granularity | Programming Language | Real-World | Synthesis |
---|---|---|---|---|---|
Firefox | 2013 | File | C, C++ | β | |
Android | 2014 | File | Java | β | |
Draper | 2018 | Function | C, C++ | β | β |
Vuldeepecker | 2018 | Code Gadget | C, C++ | β | β |
Du et al. | 2019 | Function | C, C++ | β | |
Devign | 2019 | Function | C, C++ | β | |
FUNDED | 2020 | Function | C, Java, Swift, PHP | β | β |
Big-Vul | 2020 | Function/Line | C, C++ | β | |
Reveal | 2021 | Function | C, C++ | β | |
Cao et al. | 2021 | Function | C, C++ | β | |
D2A | 2021 | Function | C, C++ | β | |
Deepwukong | 2021 | Function | C, C++ | β | β |
Vuldeelocator | 2021 | Line | C, C++ | β | β |
VulCNN | 2022 | Function | C, C++ | β | β |
VUDENC | 2022 | Token | Python | β | |
DeepVD | 2023 | Function | C, C++ | β | |
VulChecker | 2023 | Instruction | C, C++ | β |
- Software Vulnerability Classification (SVC)
Machine Learning (ML)
RNN
Text Recurrent Convolutional Neural Network (TextRCNN)
- DeKeDVer: A deep learning-based multi-type software vulnerability classification framework using vulnerability description and source code (IST, 2023) π
Vanilla Transformer
- Towards Vulnerability Types Classification Using Pure Self-Attention: A Common Weakness Enumeration Based Approach (CSE, 2021) π
Pre-trained Language Model (LM) (Transformers)
CLM
CLM + RNN
- Fine-grained commit-level vulnerability type prediction by CWE tree structure (ICSE, 2023) π
Benchmark | Year | Granularity | Programming Language | Real-World | Synthesis |
---|---|---|---|---|---|
ΞΌVulDeePecker | 2019 | Code Gadget | C, C++ | β | β |
TreeVul | 2023 | Commit | C, C++, Java, and Python | β |
- Automated Vulnerability Repair (AVR)
ML
- Sqlifix: Learning based approach to fix sql injection vulnerabilities in source code (SANER, 2021) π
CNN
- Coconut: combining context-aware neural translation models using ensemble for program repair (ISSTA, 2020) π
RNN
Tree-based RNN
- Dlfix: Context-based code transformation learning for automated program repair (ICSE, 2020) π
GNN
- Hoppity: Learning graph transformations to detect and fix bugs in programs (ICLR, 2020) π
Vanilla Transformer
- A syntax-guided edit decoder for neural program repair (FSE, 2021) π
- Neural transfer learning for repairing security vulnerabilities in c code (TSE, 2022) π
- Seqtrans: automatic vulnerability fix via sequence to sequence learning (TSE, 2022) π
- Tare: Type-aware neural program repair (ICSE, 2023) π
CLM
- Cure: Code-aware neural machine translation for automatic program repair (ICSE, 2021) π
- Applying codebert for automated program repair of java simple bugs (MSR, 2021) π
- Tfix: Learning to fix coding errors with a text-to-text transformer (PMLR, 2021) π
- VulRepair: a T5-based automated software vulnerability repair (FSE, 2022) π
- Improving automated program repair with domain adaptation (TOSEM, 2022) π
- Vision Transformer-Inspired Automated Vulnerability Repair (TOSEM, 2023) π
- Enhancing Code Language Models for Program Repair by Curricular Fine-tuning Framework (ICSME, 2023) π
- Pre-trained model-based automated software vulnerability repair: How far are we? (TDSC, 2023) π
- Examining zero-shot vulnerability repair with large language models (SP, 2023) π
- Inferfix: End-to-end program repair with llms (FSE, 2023) π
- Unifying Defect Prediction, Categorization, and Repair by Multi-Task Deep Learning (ASE, 2023) π
Benchmark | Year | Programming Language | Real-World | Synthesis |
---|---|---|---|---|
Defects4J | 2014 | Java | β | |
ManyBugs | 2015 | C | β | |
BugAID | 2016 | JavaScript | β | |
QuixBugs | 2017 | Java, Python | β | |
CodeFlaws | 2017 | C | β | |
Bugs.jar | 2018 | Java | β | |
SequenceR | 2019 | Java | β | |
Bugs2Fix | 2019 | Java | β | |
ManySStuBs4J | 2020 | Java | β | |
Hoppity | 2020 | JavaScript | β | |
CodeXGLUE | 2021 | Java | β | β |
TFix | 2021 | JavaScript | β | |
VRepair | 2022 | C, C++ | β | |
Namavar et al. | 2022 | JavaScript | β | |
Pearce et al. | 2023 | C, C++ | β | β |
Function-SStuBs4J | 2023 | Java | β | |
InferFix | 2023 | Java, C# | β |
- Security Tools in IDEs
LM-based Security Tool
- AIBugHunter: A Practical tool for predicting, classifying and repairing software vulnerabilities (EMSE, 2023) π
-
Dependency Management
- No Relevant Publications Identified Using Our Defined Search Strategy
-
CI/CD Secure Pipelines
ML
- Improving missing issue-commit link recovery using positive and unlabeled data (ASE, 2017) π
- MULTI: Multi-objective effort-aware just-in-time software defect prediction (IST, 2018) π
- Class imbalance evolution and verification latency in just-in-time software defect prediction (ICSE, 2019) π
- Fine-grained just-in-time defect prediction (JSS, 2019) π
- Effort-aware semi-supervised just-in-time defect prediction (IST, 2020) π
- Just-in-time defect identification and localization: A two-phase framework (TSE, 2020) π
- Adapting bug prediction models to predict reverted commits at Wayfair (FSE, 2020) π
- JITLine: A simpler, better, faster, finer-grained just-in-time defect prediction (MSR, 2021) π
- Enhancing just-in-time defect prediction using change request-based metrics (SANER, 2021) π
Explainable AI (XAI) For ML
- Pyexplainer: Explaining the predictions of just-in-time defect models (ASE, 2021) π
RNN
Tree-based RNN
- Lessons learned from using a deep tree-based model for software defect prediction in practice (MSR, 2019) π
Vanilla Transformer
- Deep just-in-time defect localization (TSE, 2021) π
LM
- BTLink: automatic link recovery between issues and commits based on pre-trained BERT model (EMSE, 2023) π
CLM
- EALink: An Efficient and Accurate Pre-trained Framework for Issue-Commit Link Recovery (ASE, 2023) π
ML-based Just-In-Time (JIT) Software Defect Prediction (SDP) Tool
ML-based Change Analysis Tool
- Rex: Preventing bugs and misconfiguration in large services using correlated change analysis (USENIX, 2020) π
Benchmark | Year | Granularity | Programming Language | Real-World | Synthesis |
---|---|---|---|---|---|
PROMISE | 2007 | Commit | Java | β | |
Kamei et al. | 2012 | Commit | C, C++, Java, JavaScript, Perl | β | |
Qt & OpenStack | 2018 | Commit/Line | C++, Python | β | |
Cabral et al. | 2019 | Commit/File | Java, JavaScript, Python | β | |
Yan et al. | 2020 | Commit/File | Java | β | |
Wattanakriengkrai et al. | 2020 | Commit | Java | β | |
Suh | 2020 | Commit/File | JavaScript, PHP | β |
-
Configuration Validation
ML
- Tuning configuration of apache spark on public clouds by combining multi-objective optimization and performance prediction model (JSS, 2021) π
- KGSecConfig: A Knowledge Graph Based Approach for Secured Container Orchestrator Configuration (SANER, 2022) π
- CoMSA: A Modeling-Driven Sampling Approach for Configuration Performance Testing (ASE, 2023) π
Feed-Forward Neural Network (FFNN)
- DeepPerf: Performance prediction for configurable software with deep sparse neural network (ICSE, 2019) π
Generative Adversarial Network (GAN)
-
Infrastructure Scanning
ML
Word2Vec-CBOW (Continuous Bag of Words)
- FindICI: Using machine learning to detect linguistic inconsistencies between code and natural language descriptions in infrastructure-as-code (EMSE, 2022) π
Benchmark | Year | Real-World | Synthesis |
---|---|---|---|
Rahman and Williams | 2018 | β | |
Rahman and Williams | 2019 | β | |
Dalla et al. | 2021 | β | |
Borovits et al. | 2022 | β |
- Log Analysis & Anomaly Detection
ML
- An anomaly detection system based on variable N-gram features and one-class SVM (IST, 2017) π
- Anomaly detection and diagnosis for cloud services: Practical experiments and lessons learned (JSS, 2018) π
- Adaptive performance anomaly detection in distributed systems using online svms (TDSC, 2018) π
- Log-based anomaly detection with robust feature extraction and online learning (TIFS, 2021) π
- Try with Simpler--An Evaluation of Improved Principal Component Analysis in Log-based Anomaly Detection (TOSEM, 2023) π
- On the effectiveness of log representation for log-based anomaly detection (EMSE, 2023) π
RNN
- Deeplog: Anomaly detection and diagnosis from system logs through deep learning (CCS, 2017) π
- Robust log-based anomaly detection on unstable log data (FSE, 2019) π
- Loganomaly: Unsupervised detection of sequential and quantitative anomalies in unstructured logs (IJCAI, 2019) π
- Anomaly detection in operating system logs with deep learning-based sentiment analysis (TDSC, 2020) π
- SwissLog: Robust anomaly detection and localization for interleaved unstructured logs (TDSC, 2022) π
- DeepSyslog: Deep Anomaly Detection on Syslog Using Sentence Embedding and Metadata (TIFS, 2022) π
- LogOnline: A Semi-Supervised Log-Based Anomaly Detector Aided with Online Learning Mechanism (ASE, 2023) π
- On the effectiveness of log representation for log-based anomaly detection (EMSE, 2023) π
RNN-based AutoEncoder (AE)
GNN
- LogGraph: Log Event Graph Learning Aided Robust Fine-Grained Anomaly Diagnosis (TDSC, 2023) π
Vanilla Transformer
- Log-based anomaly detection without log parsing (ASE, 2021) π
XAI For Deep Learning (DL)
Conditional Diffusion Model
- Maat: Performance Metric Anomaly Anticipation for Cloud Services with Conditional Diffusion (ASE, 2023) π
Benchmark | Year | Real-World | Synthesis |
---|---|---|---|
Yahoo! Webscope | 2006 | β | β |
BGL | 2007 | β | |
HDFS | 2009 | β | |
ADFA-LD | 2013 | β | |
SDS | 2015 | β | |
UNSW-NB15 | 2015 | β | |
OpenStack | 2017 | β | |
Microsoft | 2019 | β | |
LogHub | 2020 | β | |
Studiawan et al. | 2020 | β | |
Yang et al. | 2023 | β |
- Cyber-Physical Systems
ML
RNN + GNN
- Digital Twin-based Anomaly Detection with Curriculum Learning in Cyber-physical Systems (TOSEM, 2023) π
GAN
- Digital twin-based anomaly detection in cyber-physical systems (ICST, 2021) π
Variational AutoEncoder (VAE)
- From Point-wise to Group-wise: A Fast and Accurate Microservice Trace Anomaly Detection Approach (FSE, 2023) π
Vanilla Transformer
- Twin Graph-Based Anomaly Detection via Attentive Multi-Modal Learning for Microservice System (ASE, 2023) π
LM + RNN
- KDDT: Knowledge Distillation-Empowered Digital Twin for Anomaly Detection (FSE, 2023) π
Benchmark | Year | Real-World | Synthesis |
---|---|---|---|
Gas Pipeline Dataset | 2015 | β | |
SWaT | 2016 | β | |
WADI | 2017 | β | |
BATADAL | 2018 | β | |
MSDS | 2023 | β |
DevOps Step | Identified Security Task | Themes of Challenges |
---|---|---|
Plan | Threat Modeling | - |
Software Impact Analysis | - | |
Development | Software Vulnerability Detection | C1-1 - Data Imbalance |
C4 - Cross Project | ||
C5 - MBU Vulnerabilities | ||
C6 - Data Quality | ||
Software Vulnerability Classification | C1-2 - Data Imbalance | |
C7 - Incompleted CWE Tree | ||
Automated Vulnerability Repair | C2-1 - Model Explainability | |
C8 - Sequence Length and Computing Resource | ||
C9 - Loss of Pre-Trained Knowledge | ||
C10 - Automated Repair on Real-World Scenarios | ||
Security Tools in IDEs | C3-1 - Lack of AI Security Tooling in IDEs | |
Code Commit | CI/CD Secure Pipelines | C2-2 - Model Explainability |
C3-2 - Lack of AI Security Tooling in CI/CD | ||
C11 - The Use of RNNs | ||
Build, Test, and Deployment | Configuration Validation | C12 - Complex Feature Space |
Infrastructure Scanning | C3-3 - Lack of AI Security Tooling for Infrastructure Scanning | |
C13 - Manual Feature Engineering | ||
Operation and Monitoring | Log Analysis and Anomaly Detection | C2-3 - Model Explainability |
C14 - Normality Drift for Zero-Positive Anomaly Detection | ||
Cyber-Physical Systems | C15 - Monitoring Multiple Cyber-Attacks Simultaneously |
Identified 15 Research Directions of AI-Driven Security Approach in DevSecOps (Section 5 in our paper)
DevOps Step | Identified Security Task | Research Opportunity |
---|---|---|
Plan | Threat Modeling | - |
Software Impact Analysis | - | |
Development | Software Vulnerability Detection | R1-1 - Data augmentation and logit adjustment |
R4 - Evaluate cross-project SVD with diverse CWE-IDs | ||
R5 - Evaluate SVD on MBU vulnerabilities | ||
R6 - Address data inaccuracy from automatic data collection. | ||
Software Vulnerability Classification | R1-2 - Meta-learning and LLMs | |
R7 - Develop advanced tree-based SVC | ||
Automated Vulnerability Repair | R2-1 - Evidence-based explainable AI (XAI) | |
R8 - Explore transformer variants that can process longer sequences | ||
R9 - Explore different training paradigms during fine-tuning | ||
R10 - Address limitations of LLMs | ||
Security Tools in IDEs | R3-1 - AI tool deployment and comprehensive tool evaluation | |
Code Commit | CI/CD Secure Pipelines | R2-2 - Explainable AI (XAI) for DL Models |
R3-2 - AI tool deployment in CI/CD pipelines | ||
R11 - Explore LMs and LLMs | ||
Build, Test, and Deployment | Configuration Validation | R12 - Explore transformers for tabular data |
Infrastructure Scanning | R3-3 - AI tool deployment and post-deployment evaluation | |
R13 - Explore DL-based techniques | ||
Operation and Monitoring | Log Analysis and Anomaly Detection | R2-3 - Explainable AI (XAI) for ML Models |
R14 - Enhance normality drift detection | ||
Cyber-Physical Systems | R15 - Distributed anomaly detection and multi-agent systems |