Skip to content

Latest commit

 

History

History
154 lines (89 loc) · 10.1 KB

datasets.md

File metadata and controls

154 lines (89 loc) · 10.1 KB

Image Forgery Datasets

Image Tampering Datasets, Image Manipulations Datasets... A repo intend to collect most of datasets (training and evaluation) for the image forgery detection and localization.

图像篡改检测定位数据集收集

The most updated content may go to https://github.com/greatzh/Image-Forgery-Datasets-List

Summary

Dataset name Image Format Post-processing Forgery types Real/Forged Images Train/Test Images Download Paper Year
CASIA v1.0 Splicing, copy move, removal 800/921
CASIA v2.0 TIFF, JPEG Yes Splicing, copy move, removal 7200/5123 715/100 http://forensics.idealtest.org/#/ CASIA Image Tampering Detection Evaluation Database 2013
Columbia Color TIFF No Splicing 183/180 125/45 Columbia Uncompressed Image Splicing Detection Evaluation Dataset: https://www.ee.columbia.edu/ln/dvmm/downloads/authsplcuncmp/ Detecting Image Splicing using Geometry Invariants and Camera Characteristics Consistency 2006
Columbia Gray bmp No Splicing 933/912 1845, 933 Authentic, 912 Spliced, 128 x 128 https://www.ee.columbia.edu/ln/dvmm/downloads/AuthSplicedDataSet/AuthSplicedDataSet.htm A data set of authentic and spliced image blocks 2004
NIST 2016(Nimble Challenge 2016 (NC16)), NC17, MFC18 JPEG Yes Splicing, copy move, removal 184/50 NC17: https://www.nist.gov/itl/iad/mig/nimble-challenge-2017-evaluation; MFC18: https://www.nist.gov/itl/iad/mig/media-forensics-challenge-2018 MFC Datasets: Large-Scale Benchmark Datasets for Media Forensic Challenge Evaluation 2019
Fantastic Reality JPEG No Splicing 12000/1000 http://zefirus.org/MAG The Point Where Reality Meets Fantasy: Mixed Adversarial Generators for Image Splice Detection NeurIPS '19
Carvalho 100 http://www.ic.unicamp.br/tjose/files/database-tifs-small-resolution.zip (BROKEN) Exposing digital image forgeries by illumination color classification 2013
Realistic Tampering TIFF object insertion , removal. 220 https://pkorus.pl/downloads/dataset-realistic-tampering Multi-scale analysis strategies in prnu-based tampering localization TIFS '16
COVERAGE TIFF No Copy move 100 pairs, 400 x486 https://github.com/wenbihan/coverage COVERAGE — A novel database for copy-move forgery detection ICIP '16
CoMoFoD Yes Copy move 260 sets https://www.vcl.fer.hr/comofod/ CoMoFoD — New database for copy-move forgery detection 2013
MICC F220/F2000 copy move 2200 A SIFT-Based Forensic Method for Copy–Move Attack Detection and Transformation Recovery TIFS '11
FAU/Manip Yes Copy move 48 http://www5.cs.fau.de/research/data/image-manipulation/index.html An Evaluation of Popular Copy-Move Forgery Detection Approaches TIFS '12
Dresden The Dresden Image Database for Benchmarking Digital Image Forensics
GRIP Copy move http://www.grip.unina.it/download/prog/CMFD/ Efficient dense-field copy-move forgery detection TIFS '15
IMD2020 Yes Splicing, copy move, removal 35000/35000 2010 http://staff.utia.cas.cz/novozada/db/ An evaluation of popular copy-move forgery detection approaches WACV '20
DEFACTO Splicing, copy move, removal -/229000
PS-Battle
In the Wild

Details

CASIA v2

Fantasatic Reality

IMD2020

Paper: IMD2020: A Large-Scale Annotated Dataset Tailored for Detecting Manipulated Images

Project Page: http://staff.utia.cas.cz/novozada/db/

Download: http://staff.utia.cas.cz/novozada/db/

NC16 Splicing

Appear in: [1]P. Zhuang, H. Li, S. Tan, B. Li, and J. Huang, “Image Tampering Localization Using a Dense Fully Convolutional Network,” Ieee T Inf Foren Sec, vol. 16, pp. 2986–2999, 2021, doi: 10.1109/tifs.2021.3070444.

  1. Artificial PS dataset with post-processing on boundary (PS-boundary)

  2. Artificial PS dataset with arbitrary post-processing (PS-arbitrary)

  3. NIST 2016 dataset (NIST-2016)

Image Splicing

Train

Dresden(16961 images)

Paper: The 'Dresden Image Database' for benchmarking digital image forensics SAC '10

Official: http://forensics.inf.tu-dresden.de/dresden_image_database/ (Cannot be ACCESSED)

Vision(34427 images)

Socrates(8742 images)

FODB (23106 images)

Kaggle (2750 images)

Test

Columbia (363 images, 180 spliced)

https://www.ee.columbia.edu/ln/dvmm/newDownloads.htm

Columbia Uncompressed Image Splicing DetectionColumbia Uncompressed Image Splicing Detection: https://www.dropbox.com/sh/786qv3yhvc7s9ki/AACbEEzGPrD3_y38bpWHzgdqa?dl=0

Columbia Image Splicing Detection Evaluation Dataset: https://www.dropbox.com/s/bo10et4p1zg08aj/ImSpliceDataset.rar?dl=0

Carvalho/DSO (200 images, 100 spliced)

Realistic Tampering (RT)/Korus (440 images, 220 spliced)

Description: This dataset contains 220 realistic forgeries created by hand in modern photo-editing software (GIMP and Affinity Photo) and covers various challenging tampering scenarios involving both object insertion and removal...

Citation: Pawel Korus and Jiwu Huang. Multi-scale analysis strategiesin prnu-based tampering localization. Trans. Info. For. Sec.,12(4):809–824, apr 2017.

Page: https://pkorus.pl/downloads/dataset-realistic-tampering

Download: realistic-tampering-dataset.zip via google drive (1.7 GB) you'll need request access

Copy Move

GRIP

CPH

GRIP

CMH

CoMoFoD

Small image category database: 512 x 512, 200 image sets, 40 images per transformation type, total number of images with post-processed images = 10400. Large image category database (please contact us for downloading): 3000 x 2000, 60 image sets, 10/20 images per transformation type, total number of images with postprocessed images = 3120.

Removal

Image Inpainting

Synthetic and Manipulated Overhead Imagery For Forensics Research

https://stresearch.github.io/synthetic-overhead-dataset/

DeepFake

Useful Link

  1. Information Forensics and Security
  2. Forensics Dataset
  3. Pattern Recognition Lab, FAU, https://lme.tf.fau.de/category/dataset/
Name Paper Project Download Type Remarks
Columbia
CASIA
MICC
IMD
CoMoFoD
COVERAGE
GRIP
FAU