This project focuses on detecting cracks caused by alkali-aggregate reaction (AAR), which are subtle and challenging to identify. While CNNs are effective for crack detection, their performance declines with microscale cracks, and AAR detection remains under-explored. Building on prior work with limited models like Inception V3, this study explores the performance of pre-trained CNNs in binary classification of AAR cracks.
- Nathan Kabas Kuroiwa
- Amélie Menoud
- Lisa Van de Panne
The project is divided into the following notebooks:
- "Step_0_Data Augmentation.ipynb": The data augmentation experiments we performed before choosing the strategy for our final model (see the sketch after this list).
- "Step_1_Model_archi.ipynb": The search for the best model architecture, comparing the losses, AUCs, and F1-scores of the candidates.
- "Step_2_Pretraining_and_training.ipynb": Experiments comparing a pretrained and a non-pretrained GoogLeNet model, the best-performing architecture from Step 1.
- "Step_3_Hyperparameters_tuning_1.ipynb" and "Step_3_Hyperparameters_tuning_2.ipynb": Our trials with different learning rates and learning-rate scheduling strategies. We compared a constant LR, cosine LR schedulers, and ReduceLROnPlateau, which reduces the LR when a metric has stopped improving (see the sketch after this list).
- "Step_4_Results.ipynb": The results of our best model with the best hyperparameters and data augmentation from the previous notebooks. The results are visualized with ROC curves, the gradient method, and per-wall breakdowns; the final accuracies, F1-scores, and losses are also reported.
- "Vit Transformer.ipynb": Our experiments with Vision Transformers (ViT), using vit-tiny and vit-b-16, and their results.
We used Google Colab for the project, and all notebooks expect a path to a Google Drive folder that contains the data (the link to this drive is provided only to the IMOS lab).
To run the code, change the path to the data at the beginning of each notebook. The data also needs to be prepared: manually place the clean and cracked images in folders called "Clean" and "Cracked" for each split so that the dataloader can find them. When training, the saving path should also be changed, and lastly, the path used to load the models should be updated as well.
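
For reference, the dataloaders expect an ImageFolder-style layout for each split. Below is the expected structure and a minimal loading sketch; the root path and split folder names are placeholders, so adapt them to your Drive.

```
<data_root>/
    train/
        Clean/
        Cracked/
    val/
        Clean/
        Cracked/
    test/
        Clean/
        Cracked/
```

```python
from torch.utils.data import DataLoader
from torchvision import datasets, transforms

data_root = "/content/drive/MyDrive/data"   # placeholder: change to your path
train_set = datasets.ImageFolder(
    f"{data_root}/train",                   # must contain Clean/ and Cracked/
    transform=transforms.ToTensor(),
)
train_loader = DataLoader(train_set, batch_size=32, shuffle=True)
# ImageFolder assigns labels alphabetically: Clean -> 0, Cracked -> 1.
```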
The weights for the final model are provided in this GitHub repository as the file 'GoogLeNet_trial_1_best.pth' and can be used in "Step_4_Results.ipynb" to reproduce our final results.
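
A minimal sketch of loading these weights, assuming the .pth file stores a state dict (the model configuration, e.g. aux_logits, must match how the checkpoint was saved, so adjust it if load_state_dict reports mismatched keys):

```python
import torch
from torch import nn
from torchvision import models

model = models.googlenet(weights=None, init_weights=False)
model.fc = nn.Linear(model.fc.in_features, 2)   # 2 classes: Clean / Cracked
state = torch.load("GoogLeNet_trial_1_best.pth", map_location="cpu")
model.load_state_dict(state)
model.eval()
```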
- Three txt files containing the details of our final predictions are provided (training_set.txt, validation_set.txt, test_set.txt).
- requirements.txt: file containing all external libraries used.