| Literature DB >> 32188430 |
Yu Toyoshima1, Stephen Wu2,3, Manami Kanamori1, Hirofumi Sato1, Moon Sun Jang1, Suzu Oe4, Yuko Murakami4, Takayuki Teramoto4, Chanhyun Park1, Yuishi Iwasaki5, Takeshi Ishihara4, Ryo Yoshida6,7, Yuichi Iino8.
Abstract
BACKGROUND: Annotation of cell identity is an essential process in neuroscience that allows comparison of cells, including that of neural activities across different animals. In Caenorhabditis elegans, although unique identities have been assigned to all neurons, the number of annotatable neurons in an intact animal has been limited due to the lack of quantitative information on the location and identity of neurons.Entities:
Keywords: Biological resources; Caenorhabditis elegans; Cell-specific promoters; Computational method; Large dataset for cell positions; Neuron identification; Volumetric images; Whole-brain activity imaging
Year: 2020 PMID: 32188430 PMCID: PMC7081613 DOI: 10.1186/s12915-020-0745-2
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.431
Fig. 1Outline of the neuron ID dataset. a The expression pattern of cell-specific promoter tax-4p (modified from WormAtlas) and an example image of the strain JN3006 in which the landmark fluorescent protein was expressed by tax-4p. The maximum intensity projection of the right side of a representative animal is shown. b The list of the cell-specific promoters and the number of animals used in the neuron ID dataset. c The number of the detected and the identified nuclei in each animal. d The names of identified cells and the number of animals (“counts”) in which the cells were identified. The names and the positions of all identified cells in each animal were summarized in Figshare Dataset S1 [23] and Additional file 1: Table S1
Fig. 2Variations of cell positions. a Visualization of the variation of cell positions. The ellipsoid indicates the mean and the covariance of the positions of the cells. Cells in the right half of the body are shown. The colors are assigned randomly for visualization. In the case of the cells whose covariance cannot be calculated, the median of other covariance was used for visualization and shown in gray color. A/P means anterior-posterior, D/V means dorsal-ventral, and L/R means left-right directions. b Minimum distance (Euclid distance of centers of nearest ellipsoids) and the shortest axis length of the ellipsoids (equal to the twice of the smallest standard deviation) for each cell. The line shows where the minimum distance equals the shortest axis length. c Variation of the relative position of cell pairs is shown as a heat map. The red box and red dotted box indicate clusters of less varying cell pairs in the lateral ganglion and pharynx, respectively. For visualization, the variations were divided by their median value, and the color axis was truncated at 5 (the colors for cell pairs whose variation is larger than 5 are the same as the color for cell pairs whose variation is 5). d The error rate of the naive estimation method is visualized with cell positions in 3D. In the naive estimation method, the posterior probability of assignments was calculated for the respective cells in the respective animals based on the mixture of the Gaussian distributions. The name of the cell was estimated as the name of the Gaussian which had the largest probability for the cell. The error rates were calculated for each ground-truth cell. The hot color indicates that the error rate is high
Fig. 4An automatic annotation method and evaluation. a The outline of the atlas generation method. b The outline of the automatic annotation method. The schemes of bipartite graph matching and majority voting are shown. c Error rates of the automatic annotation method for the animals in the neuron ID dataset. The names of the cells were estimated based on their positions. The error rate was calculated as 1 – (Ncorrect)/(Nannotated) for each animal, where Nannotated is the number of human-annotated cells (ground truth) and Ncorrect is the number of cells whose annotation by the algorithm was correct. Cells un-annotated by human were not included in the calculation of error rate. The rank R indicates that it is considered correct if the correct annotation appeared in the top R estimations by the algorithm. The error rates were evaluated by cross-validation, and mean ± standard deviation over well-annotated six animals is shown. d Error rates of the automatic annotation method for the strain JN3039 that expresses the fluorescent landmarks. The names of the cells were estimated based on their positions with or without the expression of landmark promoters. Mean ± standard deviation over 15 animals is shown. e The automatic annotation method was integrated in the graphical user interface roiedit3d that enables feedback between automatic and manual annotations. f The effect of manual correction on the error rate of automatic annotation. A wrong annotation of a cell in rank 1 estimation for JN3039 (see Fig. 4d) was corrected, and the automatic annotation method was performed by using the correction information. This step was repeated sequentially
Fig. 3Optimal combination of the cell-specific promoters increases the number of identified cells in an animal. a Number of positive cells and stability of expression of cell-specific promoters. A cell was counted as positive for a promoter if the cell expresses the landmark fluorescent protein in at least one animal. The positive ratio is a ratio of positive (expressing) cells over the total number of the cells (= number of tested animals). Stability of expression was calculated as an average of the positive ratio over the cells in which at least one cell is positive for the promoter. b Number of positive cells and sparseness of the expression pattern (see the “Methods” section) of the cell-specific promoters. Note that, for visibility, only the several labels are shown in a and b. Fully labeled panels are shown in Additional file 12: Figure S8. c Visualization of the optimal combination of the cell-specific promoters. The cells in the right half of the body are shown. d A part of c is zoomed for comparison with e. e An example fluorescent image of JN3039 strain and annotated cell names. See Additional file 14: Figure S9 for enlarged image. f The number of the detected and the identified nuclei in each animal. g The names of identified cells and the identification ratio, which is a ratio of identified cells over the total number of the cells. The solid lines are guide for visualization
Strain list used in this study
| Strain | Genotype | Used in |
|---|---|---|
| Bristol N2 | Figure | |
| JN3000 | Figs. | |
| JN3001 | Figs. | |
| JN3002 | Figs. | |
| JN3003 | Figs. | |
| JN3004 | Figs. | |
| JN3005 | Figs. | |
| JN3006 | Figs. | |
| JN3007 | Figs. | |
| JN3008 | Figs. | |
| JN3009 | Figs. | |
| JN3010 | Figs. | |
| JN3011 | Figs. | |
| JN3012 | Figs. | |
| JN3013 | Figs. | |
| JN3014 | Figs. | |
| JN3015 | Figs. | |
| JN3016 | Figs. | |
| JN3017 | Figs. | |
| JN3018 | Figs. | |
| JN3019 | Figs. | |
| JN3020 | Figs. | |
| JN3021 | Figs. | |
| JN3022 | Figs. | |
| JN3023 | Figs. | |
| JN3024 | Figs. | |
| JN3025 | Figs. | |
| JN3026 | Figs. | |
| JN3027 | Figs. | |
| JN3028 | Figs. | |
| JN3029 | Figs. | |
| JN3030 | Figs. | |
| JN3031 | Figs. | |
| JN3032 | Figs. | |
| JN3033 | Figs. | |
| JN2101 | Figs. | |
| JN3035 | Figs. | |
| JN3036 | Figs. | |
| JN3038 | Figs. | |
| JN3039 | Figs. | |
| JN3041 | Fig. | |
| ZIM945 | Figure | |
| ZIM1048 | Figure |