
Medical Image transfer learning using RadImageNet convolutional feature extraction. CS231n final project, Stanford.






BACON: Breast and Acl COnvolutional Networks

Highly performant breast lesion malignancy detection and ACL tear detection models built using transfer-learning of large CNNs. Interpretable model decisions powered by Grad-CAM.

Related Article: Towards Optimal Convolutional Transfer Learning Architectures for Downstream Medical Classification Tasks

(Arxiv Link Pending)

Main Scripts

The grid-search script is the most comprehensive wrapper script for our analysis. With --method runall, it manages grid searching for optimal architectures by passing cross-products of hyperparameters and architecture choices to main(). With --method summarize --filter key value, it produces a results/results.csv file containing all metrics for each hyperparameter combination where key = value (e.g. with --filter epochs 10, we retrieve all results for experiments run with --epochs 10). Note that each row in results.csv corresponds to the 'best' (checkpointed) epoch from model training, as selected by highest validation AUC. With --method visualize, it produces a set of visualizations in results/acl/, results/breast/, and results/overall/, which provide box plots and experimental scatter plots comparing the effect of hyperparameter choices on metrics.
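The cross-product behaviour of --method runall can be illustrated with a minimal sketch. This is not the project's code: the search space below is hypothetical (flag names mirror the CLI examples in this README, and "Linear" is an assumed --clf value), and run_experiment is a placeholder standing in for the real main().

```python
from itertools import product

# Hypothetical search space; values are illustrative, not the grid used in the project.
search_space = {
    "data_dir": ["breast", "acl"],
    "database": ["ImageNet", "RadImageNet"],
    "clf": ["Linear", "NonLinear", "ConvSkip"],
    "lr": [1e-3, 5e-4],
}


def iter_configs(space):
    """Yield one dict per element of the cross-product of all option lists."""
    keys = list(space)
    for values in product(*(space[k] for k in keys)):
        yield dict(zip(keys, values))


def run_experiment(config):
    """Placeholder for the real training entry point; it does no training."""
    return {"config": config, "status": "not run (sketch only)"}


configs = list(iter_configs(search_space))
results = [run_experiment(c) for c in configs]
print(f"{len(configs)} experiments in the grid")
```

Keeping each configuration as a plain dict makes it easy to log the exact hyperparameters next to the metrics of each run.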

Example (also triggerable as a sequence with ./

# Run all experiments defined in loops
python --method runall --verbose

# Summarize all epoch=10 experiments into a single CSV
python --method summarize --verbose --filter epochs 10

# Create visualizations from the summarized results
python --method visualize --verbose

The training script handles dataloader setup and device setup, and serves as the point of contact for users to trigger new experiments from the CLI (and for the grid-search script to start its experiments).

Examples (Best Breast Model and Best ACL Model):

$ python --data_dir breast --database ImageNet --backbone_model_name ResNet50 --clf ConvSkip --structure unfreezetop5 --verbose --dropout_prob 0.5 --fc_hidden_size_ratio 1.0 --num_filters 16 --kernel_size 2 --epoch 30 --batch_size 64 --lr_decay_method cosine --amp --lr 5e-4

$ python --data_dir acl --database ImageNet --backbone_model_name ResNet50 --clf ConvSkip --structure unfreezetop5 --verbose --dropout_prob 0.5 --fc_hidden_size_ratio 0.5 --num_filters 16 --kernel_size 4 --epoch 30 --batch_size 64 --lr_decay_method cosine --amp --lr 1e-3

The Grad-CAM script handles all Grad-CAM logic for generating and visualizing Grad-CAM heatmaps to interpret model results.
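For readers unfamiliar with Grad-CAM, the core computation can be sketched in plain Python. This is only an illustration of the standard technique on nested lists, not the repository's implementation, which presumably operates on PyTorch tensors; the function name grad_cam and the toy inputs are assumptions.

```python
def grad_cam(activations, gradients):
    """Compute a Grad-CAM heatmap for one convolutional layer.

    Both arguments are nested lists shaped [channels][height][width]:
    the feature maps and the gradients of the target class score
    with respect to those feature maps.
    """
    n_channels = len(activations)
    height, width = len(activations[0]), len(activations[0][0])

    # Channel weights: average of the gradients over all spatial positions.
    weights = [
        sum(sum(row) for row in gradients[k]) / (height * width)
        for k in range(n_channels)
    ]

    # Weighted sum of the feature maps, followed by ReLU.
    heatmap = [
        [
            max(0.0, sum(weights[k] * activations[k][i][j] for k in range(n_channels)))
            for j in range(width)
        ]
        for i in range(height)
    ]

    # Scale to [0, 1] for display; an all-zero map is left untouched.
    peak = max(max(row) for row in heatmap)
    if peak > 0:
        heatmap = [[v / peak for v in row] for row in heatmap]
    return heatmap


# Toy input: 2 channels with 2x2 feature maps (made-up numbers).
acts = [[[1.0, 0.0], [0.0, 2.0]], [[0.0, 3.0], [1.0, 0.0]]]
grads = [[[1.0, 1.0], [1.0, 1.0]], [[-1.0, -1.0], [-1.0, -1.0]]]
cam = grad_cam(acts, grads)
```

In practice the heatmap is upsampled to the input resolution and overlaid on the image; that step is omitted here.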


$ python --data_dir breast --database ImageNet --backbone_model_name ResNet50 --clf ConvSkip --structure unfreezetop5 --verbose --dropout_prob 0.5 --fc_hidden_size_ratio 1.0 --num_filters 16 --kernel_size 2 --epoch 30 --batch_size 64 --lr_decay_method cosine --amp --lr 5e-4 --image_index 0

The prediction script is a simple script for producing predictions/preds_{MODEL_PARAM_STR}.csv files with all the test predictions for a particular model.


$ python --data_dir breast --database ImageNet --backbone_model_name ResNet50 --clf ConvSkip --structure unfreezetop5 --verbose --dropout_prob 0.5 --fc_hidden_size_ratio 1.0 --num_filters 16 --kernel_size 2 --epoch 30 --batch_size 64 --lr_decay_method cosine --amp --lr 5e-4

Source Code

src/ contains the source code for argument parsing, dataloader setup, model architecture building in PyTorch, and other utils.

Other Important Directories

data/ contains the data for all of the downstream classification tasks. We focus primarily on data/breast/ and data/acl/. Each of these subdirectories contains the folders dataframe/, images/, and models/. dataframe/ contains the five-fold splits used by RadImageNet, as well as the combined, re-split 75/15/10 train/val/test splits, stratified on the target, that we generate and use. Each row contains a label and an image path, which points to an image in images/. models/ contains training histories (performance metrics throughout training) as well as checkpointed models, though much of this is not uploaded to GitHub due to file size constraints.
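The 75/15/10 split stratified on the target can be sketched as follows. This is a standard-library illustration under assumptions, not the project's data prep code: the function name, the seed, and the toy image paths are all hypothetical.

```python
import random
from collections import defaultdict


def stratified_split(rows, fractions=(0.75, 0.15, 0.10), seed=0):
    """Split (image_path, label) rows into train/val/test within each label."""
    by_label = defaultdict(list)
    for row in rows:
        by_label[row[1]].append(row)

    rng = random.Random(seed)
    train, val, test = [], [], []
    for label in sorted(by_label):
        group = by_label[label]
        rng.shuffle(group)
        n_train = round(fractions[0] * len(group))
        n_val = round(fractions[1] * len(group))
        train += group[:n_train]
        val += group[n_train:n_train + n_val]
        test += group[n_train + n_val:]  # the remainder goes to test
    return train, val, test


# Toy dataset: 200 made-up rows with balanced binary labels.
rows = [(f"images/img_{i}.png", i % 2) for i in range(200)]
train, val, test = stratified_split(rows)
print(len(train), len(val), len(test))
```

Splitting within each label keeps the class ratio the same across train, val, and test, and because every row is assigned to exactly one split, no image appears in more than one.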

logs/ is used for TensorBoard logging, and should also be mostly empty on GitHub.

predictions/ contains predictions for our best breast and ACL models, as well as their less performant RadImageNet initialized counterparts.

results/ contains gridsearch and unfreezing experiment results and visualizations.
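As a rough sketch of how a results.csv row might be derived (one row per experiment, taken from the epoch with the highest validation AUC, and kept only when key = value), consider the following. The record layout, the field names, and the metric values are assumptions made up for illustration; they are not the project's actual schema or results.

```python
def best_epoch(history):
    """Return the epoch record with the highest validation AUC."""
    return max(history, key=lambda epoch: epoch["val_auc"])


def summarize(experiments, key, value):
    """Keep experiments whose config matches key = value; emit one row each."""
    rows = []
    for exp in experiments:
        # Compare as strings because CLI filter values arrive as strings.
        if str(exp["config"].get(key)) == str(value):
            rows.append({**exp["config"], **best_epoch(exp["history"])})
    return rows


# Made-up experiment records, for illustration only.
experiments = [
    {"config": {"epochs": 10, "lr": 1e-3},
     "history": [{"epoch": 1, "val_auc": 0.71}, {"epoch": 2, "val_auc": 0.78}]},
    {"config": {"epochs": 30, "lr": 5e-4},
     "history": [{"epoch": 1, "val_auc": 0.80}]},
]
rows = summarize(experiments, "epochs", "10")
```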

tflow_replicated_expts/ contains debugged code from the original RadImageNet repo, used to compare results for our Linear baseline models.

====== Internal Usage for Authors =======

Updates History:

PyTorch v3 had fixes to Caffe preprocessing, train dataloader shuffling (especially important for ACL), and a handful of other fixes.

PyTorch v4 architecture removes the softmax from the classifier appended to the backbone, relying instead on SoftmaxLoss so that we don't do a double softmax. This massively improves breast performance.
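Why a double softmax hurts can be seen in a small standard-library sketch (an illustration only, not the project's code). After the first softmax, every value lies in [0, 1], so a second softmax can never produce a confident prediction: for two classes the loss cannot drop below ln(1 + e^-1), roughly 0.313, however large the logits are.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_loss(scores, target):
    """Softmax followed by negative log-likelihood of the target class."""
    return -math.log(softmax(scores)[target])


logits = [8.0, -8.0]  # a confident, correct prediction for class 0

single = softmax_loss(logits, 0)           # loss applied to raw logits
double = softmax_loss(softmax(logits), 0)  # model already applied a softmax

print(f"loss with one softmax:   {single:.6f}")
print(f"loss with two softmaxes: {double:.6f}")
```

With the extra softmax the loss stays stuck near its floor even for a correct, confident prediction, so the gradients keep pushing logits apart without the loss being able to reflect it.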

After Refactor May 29:

Example usage: python --data_dir acl --database RadImageNet --backbone_model_name ResNet50 --clf NonLinear --structure freezeall --verbose --dropout_prob 0.5 --fc_hidden_size_ratio 0.5 --num_filters 8 --kernel_size 2 --epoch 5 --batch_size 64

See the argument parser for the full list of arguments. One module handles training the models and defines the Backbone and Classifier layers; another validates arguments and provides functions for loading data; the entry-point script parses arguments, sets the device, and iterates through training and validation folds.

====== 05/31: ======

Aditri added Convolutions with Skip Connections as an option. Daniel added data prep options to run against full train/val/test splits, and re-split and aggregated the data to ensure no leakage.

Daniel added LR scheduling, more dynamic model checkpointing for all hyperparameters.

Daniel added linting.

TBD: Daniel adding SWA, a script for running a vast grid of experiments and summarizing them into an overall results/results.csv, and visualizations for the report.

