Code used in Meisler and Gabrieli 2022
This repository includes the code used to preprocess the neuroimaging data, segment white matter tracts, perform tractometry, and compare cohort characteristics. The preprocessing and white matter segmentation is agnostic to the data set. That is, it can be run on any BIDs compliant dataset with T1w and diffusion data. The tractometry and cohort characteristic code is tailored to work with a Healthy Brain Network (HBN) phenotypic query, but can be adapted to work with a custom file.
Meisler, S.L. and Gabrieli, J.D.E "A Large-Scale Investigation of White Matter Microstructural Associations with Reading Ability." NeuroImage (2022): 118909. If using this code, please also cite relevant papers to the software and methods employed here. See our paper or the software-specific documentation (bottom of this README) for these references.,
- BIDS-compliant dataset with at least T1w and DWI images
- Singularity (used to compile and run Docker Images)
- QSIPrep docker container (
singularity build qsiprep.simg docker://pennbbl/qsiprep:0.13.0RC1
) - TractSeg docker container (
singularity build tractseg.simg docker://wasserth/tractseg_container:master
) - Combined MrTrix 3.0.3 (
singularity build mrtrix.simg docker://mrtrix3/mrtrix3:3.0.3
) - FSL 6.0.4 (
singularity build fsl.simg docker://brainlife/fsl:6.0.4-patched
) - If using your own single-shelled DTI data (not HBN), then you will also need mrtrix3tissue (
singularity build mrtrix3t.simg docker://kaitj/mrtrix3tissue:v5.2.9
) - Singularity 3.6.3 was used in this study. The images above were used in this study, but more recent versions of these software may introduce improvements that should be used in future research.
- QSIPrep docker container (
- SLURM job scheduler, used for parallelizing jobs. If you uses SGE/PBS to schedule jobs, the scripts can be adapted using tips from this webpage: https://www.msi.umn.edu/slurm/pbs-conversion
- Python environment with Jupyter capabilities and the following dependencies: numpy, scipy, scikit-learn, pandas, os, glob, matplotlib, json, filecmp, nilearn, fslpy, pingouin, statsmodels, and seaborn
- FreeSurfer license (https://surfer.nmr.mgh.harvard.edu/fswiki/License)
- Data Usage Agreement (if working with Healthy Brain Network data), used to access neuroimaging and phenotypic data
- Amazon Web Services (AWS) Command Line version 2 (or any version with s3 capabilities), used to download neuroimaging data (https://docs.aws.amazon.com/cli/latest/userguide/install-cliv2.html)
- Obtain a Data Usage Agreement: http://fcon_1000.projects.nitrc.org/indi/cmi_healthy_brain_network/Pheno_Access.html#DUA, https://data.healthybrainnetwork.org/login/request-account/
- In your terminal, navigate in to the directory where you want your BIDS HBN data to live. Then, download the neuroimaging data with:
aws s3 cp s3://fcp-indi/data/Projects/HBN/MRI/Site-RU/ $path/to/HBN_dir --exclude "derivatives/*" --recursive
, where$path/to/HBN_dir
should be replaced by the path to your directory. This downloads the raw data from the Rutgers University site as used in our study, excluding preprocessed derivatives. You can changeRU
to look at other sites (for exampleCBIC
for the Cornell site). - To prepare the HBN data for QSIPrep, we need to update the DWI JSON files so the DWI fieldmaps can be assosciated with the DWI NIFTI files. This involves:
- Rename the fieldmap files to match BIDs conventions. In your BIDs code directory, download
rename_fmaps.sh
, update the variablebids
in the first line to direct to your BIDs directory, and run the code. - Add "IntendedFor" fields in the JSON to associate fieldmaps with NIFTIs. Download
add_intended_for.ipynb
to your BIDS code directory, update the variablebids
in the first line to direct to your BIDs directory, and run the code/notebook.
- Rename the fieldmap files to match BIDs conventions. In your BIDs code directory, download
- Open a fieldmap JSON file to make sure this field has been added, and that the name convention matches BIDS specification (https://bids-specification.readthedocs.io/en/stable/04-modality-specific-files/01-magnetic-resonance-imaging-data.html).
sub-<label>[_ses-<label>][_acq-<label>][_ce-<label>]_dir-<label>[_run-<index>]_epi.json
sub-<label>[_ses-<label>][_acq-<label>][_ce-<label>]_dir-<label>[_run-<index>]_epi.nii[.gz]
- Using the HBN Loris portal (https://data.healthybrainnetwork.org/main.php) make a phenotypic query with at least the following fields: "Basic_Demos", "TOWRE", "EHQ", and "Clinician Diagnoses"
- Download the
code/QSIPrep
folder to your BIDS code folder. In it, add your FreeSurfer license aslicense.txt
(case-sensitive). - In the
submit_job_array.sh
script, udpate the variablebids
in the beginning of the script to direct to your BIDS directory. Additionally, in the last line of the script, you can update the parameter after%
to limit how many jobs can be active at a time. We set this to 100 as a default, but you can alter this or delete it to not set a limit. - In the
ss_qsiprep.sh
script, change the SBATCH header to match your desired parameters (e.g. memory usage, time-to-wall, etc). Then update the variableIMG
to the path of your QSIPrep singularity image. Set the variablescratch
to where you want intermediate files to be stored during the QSIPrep workflow. Review the command at the bottom of the script to make sure you understand how QSIPrep is being run and change any parameters you would like. - Make sure Singularity is loaded in your environment.
- In terminal, navigate to the QSIPrep code folder, and run
./submit_job_array.sh
to begin QSIPrep.
- Download the
code/TractSeg
folder to your BIDS code folder. - In the
submit_job_array.sh
script, udpate the variablebids
in the beginning of the script to direct to your BIDS directory. Additionally, in the last line of the script, you can update the parameter after%
to limit how many jobs can be active at a time. We set this to 100 as a default, but you can alter this or delete it to not set a limit. - If using your own data, change the
single_multi
(either 'single' or 'multi') andflip_x
variables ('yes' or 'no') to match your data. That is, single shell data only has one non-0 b-value, and sometimes, the x-gradient needs to be flipped to correct its orientation. Theflip_x
may be hard to know apriori, but typically Siemens data need it flipped. If unsure, leave it as 'no', and if tracts do not look right, try changing it.- In single shell data, we instead use MrTrix3tissue to estimate the three tissue response functions.
- In the
ss_tractseg.sh
script, update the SBATCH header as needed to accomodate your memory usage needs, and update the first few variables to the paths to your singularity containers. - Make sure Singularity is loaded in your environment.
- In terminal, navigate to the TractSeg code folder, and run
./submit_job_array.sh
to begin TractSeg.
- Save your HBN phenotypic data as
HBN_query.csv
and place it in your BIDS code directory. - Using the numbered jupyter notebooks, create the dataframe with all relevant data, and then run the statistical models. Be sure to read the comments in the notebooks.
- You should see a file called
subjects.txt
in your TractSeg derivatives folder. Open it and update the/PATH/TO/BIDS
portion of lines 1 and 3 (beginning withtracometry_path
andplot_3d
) to direct to your BIDS directory. Feel free to rename this file something more informative (e.g.towre_group_difference.txt
) - Based on the analysis you want to run (correlation vs group difference) and your desired nuisance regressors, update line 28 in accordance with the directions written in the body of
subjects.txt
. - Copy the tractometry model input (output of
5_tract_profiles.ipynb
) and paste it tosubjects.txt
(or whatever you named it) under line 28. - Navigate terminal to your TractSeg derivatives folder and run the model:
singularity exec -e -B /PATH/TO/BIDS /PATH/TO/tractseg.simg plot_tractometry_results -i /PATH/TO/subjects.txt -o /PATH/TO/output
-
- If making a 3D Plot, make sure you include the options
--plot3D pval
and--tracking_format trk
to the command
- If making a 3D Plot, make sure you include the options
Questions? Feel free to either open an issue in this repository or email Steven Meisler (smeisler@g.harvard.edu) with any problems, suggestions, or feedback!
- Healthy Brain Network Data Portal (http://fcon_1000.projects.nitrc.org/indi/cmi_healthy_brain_network/index.html)
- TractSeg Website (https://github.com/MIC-DKFZ/TractSeg)
- QSIPrep Website (https://qsiprep.readthedocs.io/en/latest/)
MIT License
Copyright (c) 2021 Steven Meisler
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.