| Literature DB >> 35614082 |
Jay Devine1,2,3, Marta Vidal-García1,2,3, Wei Liu1,2,3, Amanda Neves4, Lucas D Lo Vercio1,2,3, Rebecca M Green5, Heather A Richbourg6, Marta Marchini1,2,3, Colton M Unger2,7, Audrey C Nickle8,9, Bethany Radford1,10, Nathan M Young6, Paula N Gonzalez11, Robert E Schuler12, Alejandro Bugacov12, Campbell Rolian2,13, Christopher J Percival14, Trevor Williams15, Lee Niswander16, Anne L Calof17,18, Arthur D Lander18,19, Axel Visel20,21,22, Frank R Jirik1,2,10, James M Cheverud23, Ophir D Klein24,25,26, Ramon Y Birnbaum27, Amy E Merrill8,9, Rebecca R Ackermann28,29, Daniel Graf30,31, Myriam Hemberger1,10, Wendy Dean1,3, Nils D Forkert1,32, Stephen A Murray33, Henrik Westerberg34, Ralph S Marcucio6, Benedikt Hallgrímsson35,36,37.
Abstract
Complex morphological traits are the product of many genes with transient or lasting developmental effects that interact in anatomical context. Mouse models are a key resource for disentangling such effects, because they offer myriad tools for manipulating the genome in a controlled environment. Unfortunately, phenotypic data are often obtained using laboratory-specific protocols, resulting in self-contained datasets that are difficult to relate to one another for larger scale analyses. To enable meta-analyses of morphological variation, particularly in the craniofacial complex and brain, we created MusMorph, a database of standardized mouse morphology data spanning numerous genotypes and developmental stages, including E10.5, E11.5, E14.5, E15.5, E18.5, and adulthood. To standardize data collection, we implemented an atlas-based phenotyping pipeline that combines techniques from image registration, deep learning, and morphometrics. Alongside stage-specific atlases, we provide aligned micro-computed tomography images, dense anatomical landmarks, and segmentations (if available) for each specimen (N = 10,056). Our workflow is open-source to encourage transparency and reproducible data collection. The MusMorph data and scripts are available on FaceBase ( www.facebase.org , https://doi.org/10.25550/3-HXMC ) and GitHub ( https://github.com/jaydevine/MusMorph ).Entities:
Mesh:
Year: 2022 PMID: 35614082 PMCID: PMC9133120 DOI: 10.1038/s41597-022-01338-x
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Schematic overview of the phenotyping pipeline. Specimens were staged, prepared (fixed/stored), stained, and imaged with different but standardized lab-specific protocols. While the E10.5, E11.5, E14.5, and adult specimens were obtained in Calgary, the E15.5 and E18.5 specimens were acquired from the IMPC. To account for differences in image acquisition (e.g., intensity artifacts, image resolution and dimensions, and position), each image was subjected to a series of preprocessing steps. Next, each preprocessed image was non-linearly registered to a stage-specific reference atlas with a detailed set of landmarks and segmentations. We recovered deformation fields, landmarks, and segmentations (if available) for each specimen. To optimize the landmark predictions of poorly registered specimens, as measured by cross-correlation similarity, a downstream neural network was used.
Summary of imaging data.
| Source | Stage Segmentations | Anatomy | X | Y | Z | Resolution (mm) | N | Landmarks | |
|---|---|---|---|---|---|---|---|---|---|
| Calgary | E10.5 | Head | 220 | 295 | 350 | 0.012 | 434 | ✓ | × |
| Calgary | E11.5 | Head | 502 | 503 | 390 | 0.012 | 531 | ✓ | × |
| Calgary | E14.5 | Head; Body | 486 | 567 | 723 | 0.027 | 84 (84) | ✓ | × |
| IMPC | E15.5 | Head; Body | 486 | 567 | 723 | 0.027 | 1426 | ✓ | ✓ |
| IMPC | E18.5 | Head; Body | 293 | 414 | 667 | 0.054 | 1657 | ✓ | × |
| Calgary | Adult | Skull | 642 | 586 | 979 | 0.035 | 6000 (154) | ✓ | ✓ |
Source is where the image was acquired. Stage is the age of the specimen at sacrifice. Anatomy is the labelled and scanned anatomy. X, Y, and Z are the voxel lengths of each atlas axis. Resolution is the isotropic resolution of each scan. N is the sample size, with the number of scans awaiting publication of primary research in parentheses. Landmarks and segmentations indicate the presence (✓) or absence (×) of labels on the stage-specific atlas.
Fig. 2Embryo reference atlases. Sagittal cross-sections of the E10.5 (www.facebase.org/id/6-F00W), E11.5 (www.facebase.org/id/6-F012), E14.5 (www.facebase.org/id/6-F016), E15.5 (www.facebase.org/id/6-F6SE), and E18.5 (www.facebase.org/id/6-F6T4) atlas volumes are shown to display the stained internal anatomy. Each head surface was labelled with a dense landmark configuration to capture global and local aspects of morphology. Lateral, superior, and anterior views of each head isosurface are shown. The equivalent semi-landmark patches (small, color-coded points) were interpolated between a set of sparse homologous landmarks (large, red points). They can be slid and resampled for morphometric analyses.
Fig. 3Adult reference atlas. Cranium (top), mandible (middle), and endocast (bottom) surfaces were segmented from the skull atlas (www.facebase.org/id/6-F6VC), then labelled with a dense landmark configuration to capture global and local aspects of morphology. Lateral, superior, and anterior views of each segmentation isosurface are shown. There are sparse landmarks (red) as well as surface (blue) and curve (green) semi-landmarks that can be slid and resampled for morphometric analyses.
Summary of metadata identifiers.
| Identifier | Description |
|---|---|
| Biosample | The name of the specimen, which corresponds to the image and label names. |
| Strain | The background strain of the specimen. |
| Strain_MGI_ID | The MGI ID for the strain. |
| Strain_Type | An attribute of strain that describes whether it is inbred or outbred and lab-derived or wild-derived. |
| Gene | The gene symbol as provided by MGI. |
| Gene_MGI_ID | The MGI ID for the gene. |
| Zygosity | Whether the specimen is homozygous, heterozygous, wildtype, or otherwise (e.g., flox/null) for a given gene mutation. |
| Genotype | A concatenation of the gene symbol and zygosity symbol. |
| Anatomy | The region of anatomy that has been scanned and labelled. |
| Treatment | An environmental effect that the specimen has been treated with. |
| Experimental Group | An identifier derived from genotype that denotes whether the specimen is a control or mutant. |
| Sex | The sex of the specimen. |
| Stage | The age of the specimen in days, either embryonic (E) or postnatal (PN). |
| Life_Phase | An identifier derived from stage that indicates life phase (e.g., gestation vs. adulthood). |
| Dataset | The published or unpublished study (see Study_Metadata.csv) the sample is associated with. |
| Availability | Whether the images and phenotypic data are available or pending publication of a primary research article. |
Fig. 4Summary of metadata. (a) Distribution of sex, strain type, and genotype for the embryo datasets. (b) Distribution of sex, strain type, and genotype for the adult dataset. (c) Sample sizes of each developmental stage included in the database. All “NA” specimens are mature or middle-aged adults. (d) Left: Example landmarks and segmentations of the adult skull and endocast (brain). Middle/Right: Morphological analyses, such as PCA and allometry regressions, that one might perform with a dense landmark dataset. Each color in the plot represents a different mouse genotype. (e) Left: Slice visualization of a non-linear deformation grid. Middle/Right: Morphological analyses, such as statistical parametric mapping, that one might perform with a deformation field. The t values show significant (p < 0.05) voxel-wise differences in form (i.e., volume shrinkage) in Ghrhr homozygous mutants relative to wild type, whereas the variance heatmap shows voxel-wise variances in Ghrhr mutants.
Fig. 5Validation of adult crania test set. (a) Left: Regression of automated-manual Euclidean distances (error) on cross-correlation, a measure of the final target-reference image similarity. Right: Boxplots showing the distribution of cross-correlation values within each developmental stage. (b) Correlation of automated and manual PC scores. Left: Baseline automated PC correlations. Right: Optimized automated PC correlations. (c) Mean shape deviations between the automated and manual datasets. Red arrows indicate error prone areas.
Fig. 6Principal Component Analysis of stage-specific shape data. The mean shape (center) was deformed to the minimum (left) and maximum (right) extremes of PC1. Every morph is shown with anterior and lateral views. Each row represents a different developmental stage, ranging from E10.5 to adulthood.
| Measurement(s) | mouse anatomy • brain morphology • transgenic mouse models • developmental stage • craniofacial region |
| Technology Type(s) | micro-computed tomography • image registration • anatomical landmarks • micro-computed tomography • gene knockout |
| Factor Type(s) | strain • genotype • treatment • sex • stage |
| Sample Characteristic - Organism | Mus musculus |