Joern Benchmarks

A repository for running Joern against known benchmarks.

Usage

$ sbt stage
$ ./joern-benchmarks --help
joern-benchmark v0.0.1
Usage: joern-benchmark [options] benchmark frontend

A benchmarking suite for Joern
  -h, --help
  --version                Prints the version
  benchmark                The benchmark to run. Available [SECURIBENCH_MICRO,SECURIBENCH_MICRO_JS,ICHNAEA,THORAT,BUGS_IN_PY,DEFECTS4J]
  frontend                 The frontend to use. Available [JAVASRC,JAVA,JSSRC,PYSRC,SEMGREP,CODEQL]
  -d, --dataset-dir <value>
                           The dataset directory where benchmarks will be initialized and executed. Default is `./workspace`.
  -o, --output <value>     The output directory to write results to. Default is `./results`.
  -f, --format <value>     The output format to write results as. Default is MD. Available [JSON,CSV,MD]
  --disable-semantics      Disables the user-defined semantics for Joern data-flows. Has no effect on non-Joern frontends.
  -k, --max-call-depth <value>
                           The max call depth `k` for the data-flow engine. Has no effect on non-Joern frontends. Default is 5.
  -i, --iterations <value>
                           The number of iterations for a given benchmark. Default is 1.
  -w, --whole-program      Enables whole program analysis. Off by default.
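For example, a single run of SECURIBENCH_MICRO with the JAVASRC frontend, writing CSV results to a custom directory, could look like the following (the directories here are illustrative, not required defaults):

$ ./joern-benchmarks SECURIBENCH_MICRO JAVASRC -d ./workspace -o ./results -f CSV -i 3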

Example of testing for various values of k:

for k in {0..5}; do ./joern-benchmarks DEFECTS4J JAVA -f CSV -J-Xmx8G -i 1 -k $k; done 

Data-Flow Benchmarks

The benchmark naming convention is <BENCHMARK>_<FRONTEND>, e.g. OWASP_JAVA runs OWASP using the jimple2cpg frontend (JVM bytecode).
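For instance, SECURIBENCH_MICRO_JAVA corresponds to the following invocation, with the benchmark and frontend passed as the two positional arguments shown in the help output above:

$ ./joern-benchmarks SECURIBENCH_MICRO JAVA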

Benchmark           Enabled Frontends
SECURIBENCH_MICRO   JAVASRC, JAVA, SEMGREP, CODEQL
ICHNAEA             JSSRC, SEMGREP
THORAT              PYSRC, SEMGREP, CODEQL

Joern

Joern's open-source data-flow engine is enabled whenever a Joern frontend is selected, e.g. JAVA, PYSRC, etc.
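The engine-specific flags only apply in this case. An illustrative run that lowers the call depth, disables the user-defined semantics, and enables whole-program analysis (simply a combination of the flags documented above) would be:

$ ./joern-benchmarks THORAT PYSRC -k 2 --disable-semantics -w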

Semgrep

If SEMGREP is selected, a local installation of Semgrep is required; semgrep scan is used to run the analysis. Custom rules specific to benchmarks can be found under src/main/resources/semgrep.
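The harness invokes Semgrep itself; a roughly equivalent manual scan against a checked-out benchmark project (the target path here is hypothetical) would be:

$ semgrep scan --config src/main/resources/semgrep --json /path/to/benchmark-project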

Note: Only results with data-flow traces are counted as findings.

CodeQL

If CODEQL is selected, a local installation of the CodeQL CLI is required; codeql is used to create the database and run the scans. Custom rules specific to benchmarks can be found under src/main/resources/codeql.
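As with Semgrep, the harness drives the CodeQL CLI automatically; a rough manual equivalent for a Java benchmark (database name and project path hypothetical) is:

$ codeql database create ql-db --language=java --source-root /path/to/benchmark-project
$ codeql database analyze ql-db src/main/resources/codeql --format=sarifv2.1.0 --output=results.sarif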

Notes

The benchmarks have been successfully tested against the following versions of the target software:

  • Joern v4.0.119
  • Semgrep v1.93.0
  • CodeQL v2.19.2
