| Literature DB >> 34647039 |
Kayvan Bijari1, Gema Valera2, Hernán López-Schier2, Giorgio A Ascoli1,3.
Abstract
We present a protocol to characterize the morphological properties of individual neurons reconstructed from microscopic imaging. We first describe a simple procedure to extract relevant morphological features from digital tracings of neural arbors. Then, we provide detailed steps on classification, clustering, and statistical analysis of the traced cells based on morphological features. We illustrate the pipeline design using specific examples from zebrafish anatomy. Our approach can be readily applied and generalized to the characterization of axonal, dendritic, or glial geometry. For complete context and scientific motivation for the studies and datasets used here, refer to Valera et al. (2021).Entities:
Keywords: Bioinformatics; Cell Biology; Computer sciences; Microscopy; Neuroscience
Mesh:
Year: 2021 PMID: 34647039 PMCID: PMC8496329 DOI: 10.1016/j.xpro.2021.100867
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1Sample visualized neuron from the zebrafish data along with the corresponding rows of its SWC file
Figure 2Neuronal quantification
(A) Graphical User Interface of the L-Measure software (open on the default ‘specificity’ tab).
(B) Sample output file produced by L-Measure.
Description of L-Measure outputs
| Core function (brief description) | Relevant statistics to consider | Reasoning behind the chosen statistic |
|---|---|---|
| N_bifs (number of bifurcations) | Total_sum | Returns the total number of bifurcations |
| N_branch (number of branches) | Total_sum | Returns the total number of branches |
| Width (neuronal width) | L-Measure returns the same value for Min, Max, and Ave | Horizontal extent (x-coordinate) containing 95% of all tracing points |
| Height (neuronal height) | L-Measure returns the same value for Min, Max, and Ave | Vertical extent (y-coordinate) containing 95% of all tracing points |
| Depth (neuronal depth) | L-Measure returns the same value for Min, Max, and Ave | Depth (z-coordinate) containing 95% of all tracing points |
| Length (total arborization length) | Total_sum | Returns the total length summed across all compartments |
| EucDistance (maximum Euclidean distance from soma to the tips) | Ave and max are relevant, we used Max | Maximum straight distance encompassing the whole neuron |
| PathDistance (path distance of a compartment) | Ave and max are relevant, we used Max | Maximum geodesic distance from soma to tips |
| Branch_Order (order of the branch with respect to the soma) | Ave and max are relevant, we used Max | Maximum number of bifurcations from soma to tips |
| Contraction (ratio between Euclidean distance of a branch and its path length) | Ave | Average tortuosity across all branches |
| Fragmentation (total number of compartments that constitute a branch between two bifurcation points) | All are relevant, we used Total_sum | Total number of compartments from all of the branches |
| Partition_asymmetry (average over all bifurcations of sub-trees) | Ave | Topological tree asymmetry measured from all bifurcation points |
| Bif_ampl_local (angle between the first two bifurcation compartments) | All are relevant, we used Ave | Average over all bifurcations of the angle between the first two daughter compartments |
| Bif_ampl_remote (angle between, current plane of bifurcation and previous plane of bifurcation) | All are relevant, we used Ave | Average over all bifurcations of the angle between the following bifurcations or tips |
| Fractal_Dim (slope of linear fit of regression line obtained from the plot of path vs. Euclidean distances) | All are relevant, we used Ave | Average space occupancy measured from all branches |
Figure 3K-means clustering results
(A) Elbow curve to determine the optimal number of clusters.
(B) Scatter plot of the neurons based on their first two principal components (PC1 and PC2) and color-coded clusters (each color represents a cluster).
(C) Distribution of different cluster assignments found by K-means algorithm.
Figure 4Gaussian Mixture Model (GMM) clustering results
(A) Bayesian Information Criterion (BIC) and Akaike Information Criterion (AIC) scores to determine the optimal number of clusters.
(B) Scatter plot of the neurons based on their first two principal components (PC1 and PC2) and color-coded clusters (each color represents a cluster).
(C) Distribution of different cluster assignments found by GMM.
Figure 5Distribution of the neurons based on their principal components (PC1 and PC2) and their ground truth labels
(A) Scatter plot based on different neuromast labels (A: anterior, L: lateral, T: trunk, D: dorsal; numbers associated with the labels indicate closeness of the neuron to the head of the animal, with 1 being the closest and 6 being furthest).
(B) Scatter plot based on different tuning labels (u: unknown, r: rostral, c: caudal).
(C) Scatter plot based on different region labels (trunk, tail, posterior lateral line, dorsal lateral line, and anterior lateral line).
(D) Scatter plot based on different hemisphere labels (right and left).
Figure 6Visualization of the neurons based on K-means results (color-coded clusters) and their ground truth labels (shapes)
For label meanings, see Figure 5 legend.
Figure 7Visualization of the neurons based on GMM results (color-coded groups) and their ground truth labels (shapes)
For label meanings, see Figure 5 legend.
Figure 8Supervised analysis results
(A) Feature importance of the data.
(B) Classification accuracy of logistic regression, decision tree, K-nearest-neighbor (K-NN), and multilayer perceptron (MLP) using all features and just the top feature. For more information on the morphological features see step 7 and for details on feature importance see step 16.
Figure 9Density analysis
Density plot for features ‘Contraction’ (branch tortuosity) (A) and ‘EucDistance’ (maximum straight distance from soma to tips) (B) in the entire dataset as well as in selected sub-class labels (ALL and PLL).
Figure 10Statistical analysis of persistence diagram vectors (PDVs) relative to lateral lines (LL)
The script first calculates the pairwise arccosine distances between PDVs of ‘within’ and ‘across’ populations with respect to class labels (ALL and PLL), and then performs their statistical comparison. Bar plot shows the average of ‘within’ and ‘across’ distances with error bars indicating standard deviations.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Neuronal reconstructions | NeuroMorpho.Org | RRID:SCR_002145 |
| Source code | RRID:SCR_021638 | |
| Python 3.8 | python.org/downloads/ | RRID:SCR_008394 |
| L-Measure | cng.gmu.edu:8080/Lm | RRID:SCR_003487 |
| SciPy 1.6.0 | scipy.org | RRID:SCR_008058 |
| scikit-learn 0.24.1 | scikit-learn.org | RRID:SCR_002577 |
| Pandas 1.2.2 | pandas.pydata.org | RRID:SCR_018214 |
| NumPy 1.20.1 | numpy.org | RRID:SCR_008633 |
| matplotlib 3.3.4 | matplotlib.org | RRID:SCR_008624 |