This repository contains Matlab code for detecting cells in both 3D image volumes and 2D images.
You can find further details about how we apply the methods in this repo to analyze mm-scale brain volumes in the following paper:
Dyer, Eva L., et al. "Quantifying mesoscale neuroanatomy using X-ray microtomography", eNeuro 25 September 2017, 4 (5) ENEURO.0195-17.2017 [Link to Paper].
If you use any of the code or datasets in this repo, please cite this paper. Please direct any questions to Eva Dyer at evadyer{at}gatech{dot}edu.
The input to our cell detection algorithm is a "probability map" that encodes the probability that each voxel/pixel corresponds to a cell body. The output is a list of the position of all detected cells as well as a map of the cells that is the same size as the input image or volume.
The main functions used for cell detection in the 3D and 2D case are:
- [Centroids, Nmap] = OMP_ProbMap(Probmap,ptr,presid,startsz,dilatesz,kmax); (3D)
- [Centroids,Nmap] = OMP_ProbMap2D(Probmap,ptr,presid,startsz,dilatesz,kmax); (2D)
The parameters that you need to set:
- ptr is the probability threshold - all probabilities less than ptr will be set to zero
- presid is the stopping criterion - when the correlation between the cell template and the probmap is less than presid, the algorithm terminates
- startsz is the size of the spherical/circular template used (set this to be the average radius of cells)
- dilatesz is the size of the dilation template used to remove cells after they are detected (usually set to 1 or 2)
- kmax is the maximum number of cells you want to detect (a way to stop the algorithm early)
You must supply a probability map as input for the cell detection algorithm. There are a few options depending on the complexity of your data.
-
Gausian Mixture Model: We supply a simple unsupervised method for computing a cell probability map which uses a Gaussian Mixture Model. This can provide a reasonable output when the color (pixel intensities) of the cells is different from the background in the image.
-
Ilastik pixel classifier: When an intensity-based classifier is not sufficient for your data, you can compute a probability map with Ilastik. The ilastik team provides great documentation on their website for creating a Pixel Classifier. You can also see an example of a Pixel Classifier in our demo data (see below for a link to download our data and classifiers).
-
Use the image: In images where the intensity of the foreground (cell) is significantly different from the background, you can simply use the image, but you need to rescale your image first so the background pixels have values close to zero and cell pixels have values close to one.
If you want to run any of the demos in this repository, please download the data from DropBox.
To begin, run the following script:
script_cellfinder
This script will run the main cell finding routine OMP_ProbMap.m on the test data in the celldetect folder. The output includes two variables: (1) Centroids_out, a 10x4 matrix with the position (x,y,z) first 10 detected cells centroids (columns 1-3) and the correlation value between all detected cells and theinput probability map (column 4). (2) Nmap_out, a 200x200x100 matrix with all of the detected cells labeled with a unique ID (and the remaining volume labeled w/ zeros).
To find more cells in the volume, set kmax to a larger number. The variable kmax controls the maximum number of iterations of the algorithms (and constrains the maximum number of detected cells).
To find the top 100 cells, call the greedy sphere finder method again (this will take a few minutes).
kmax = 100;
[Centroids_out,Nmap_out] = OMP_ProbMap(Probmap,ptr,presid,startsz,dilatesz,kmax);
figure;
for i=1:size(Prob,3)
imagesc(Prob(:,:,i)-(Nmap(:,:,i)~=0)),
pause,
end
To compute a cell probability map using a GMM, run our demo on X-ray data:
script_runGMMseg
To test our methods on 2D image data, run the following script:
script_run2Dcelldetect
This script will output the centroids and map of the detected cell bodies (each cell is labeled with a unique ID). If you use the test data (from the Allen Institute Reference Atlas) provided in this example, please cite the following paper:
Lein, E.S. et al. (2007) Genome-wide atlas of gene expression in the adult mouse brain, Nature 445: 168-176. doi: 10.1038/nature05453.
- OMP_ProbMap.m: This is the main function used for cell detection, as it implements our greedy sphere finding approach described in Dyer et al. 2017. This algorithm takes a 3D probability map (the same size as the image data) as its input and returns the centroids and confidence value (between 0-1) of all detected cell bodies.
- compute3dvec.m: This function places an input 3D template (vec) at a fixed position (which_loc) in a bounding box of width = Lbox*2 + 1.
- convn_fft.m: This computes a n-dimensional convolution in the Fourier domain (uses fft rather than spatial convolution to reduce complexity).
- create_synth_dict.m: This function creates a collection of spherical templates of different sizes. The output is a dictionary of template vectors, of size (Lbox^3 x length(radii)), where Lbox = box_radius*2 +1 and radii is an input to the function which contains a vector of different sphere sizes.
Contributors:
- Eva Dyer (@evadyer, Georgia Tech)
- Will Gray Roncal (Hopkins APL)
- TJ LaGrow (@tjlagrow, Georgia Tech)