Skip to content

Python Project to Automate Collection of Snippets with Leaked Secrets in Code to Build a Dataset for ML Trainning.

License

Notifications You must be signed in to change notification settings

Anti-Malware-Alliance/secret-harvest

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Secret Harvester

Secret-Harvest

Python Project to Automate Collection of Snippets with Leaked Secrets in Code, and Snippets Clean of Secrets to Build a Dataset for ML Trainning.

Problem

Secret Sprawl in Code Repositories, Artifacts, Logs and Documentation is an increasing issue. Current Secret Scanning Solutions have a high False positive Ratio, making the solutions noisy. Machine Learning Models can be Developed to aid in reducing False Positive from Secrets that are been scanned.

Further Datasets are required for Researchers to work and Develop ML Models around Secret Detection.

This Python tooling aims to automate and maintain such datasets.

Solution

About

Python Project to Automate Collection of Snippets with Leaked Secrets in Code to Build a Dataset for ML Trainning.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages