Literature DB >> 32423919

The Ancestral Caenorhabditis elegans Cuticle Suppresses rol-1.

Luke M Noble1, Asif Miah2, Taniya Kaur2, Matthew V Rockman3.   

Abstract

Genetic background commonly modifies the effects of mutations. We discovered that worms mutant for the canonical rol-1 gene, identified by Brenner in 1974, do not roll in the genetic background of the wild strain CB4856. Using linkage mapping, association analysis and gene editing, we determined that N2 carries an insertion in the collagen gene col-182 that acts as a recessive enhancer of rol-1 rolling. From population and comparative genomics, we infer the insertion is derived in N2 and related laboratory lines, likely arising during the domestication of Caenorhabditis elegans, and breaking a conserved protein. The ancestral version of col-182 also modifies the phenotypes of four other classical cuticle mutant alleles, and the effects of natural genetic variation on worm shape and locomotion. These results underscore the importance of genetic background and the serendipity of Brenner's choice of strain.
Copyright © 2020 Noble et al.

Entities:  

Keywords:  collagen; cuticle; genetic background; genetic interaction

Mesh:

Substances:

Year:  2020        PMID: 32423919      PMCID: PMC7341120          DOI: 10.1534/g3.120.401336

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Since Morgan’s first white-eyed fly, forward genetics has been one of our most powerful tools for discovering biological mechanisms. In 1974, Sydney Brenner introduced geneticists to C. elegans, an experimental organism with properties ideal for probing the molecular basis of development and neurobiology (Brenner 1974). Brenner began by isolating mutants with conspicuous effects under the evocative nomenclature of Dumpy, Squat, Long, Blistered, and Roller phenotypes, including a single allele of . This mutation causes helical twisting of the adult worm’s cuticle, which manifests most obviously as sinusoidal motion along the short axis of locomoting animals and consequent gyration on the uniform surface of an agar plate. , and several other genes from Brenner’s first screen, opened the door not only to linkage mapping in C. elegans but also to decades of productive work on the worm cuticle. The cuticle is a complex structure, made primarily of cross-linked collagens generated anew with each larval molt (Page and Johnstone 2007). It plays an integral structural role as both barrier and morphological scaffold for muscle attachment. Epistasis analysis of collagens and collagen-modifying enzymes represents a landmark example of the power of transmission genetics to reveal molecular and developmental mechanisms (Higgins and Hirsh 1977; Cox ; Kramer and Johnson 1993; McMahon ). Brenner’s original screen, and the vast majority of subsequent research in C. elegans, took place in the genetic context of the inbred reference strain, N2. Over the past decade, researchers have discovered that N2 evolved during its adaptation to laboratory conditions, and that wild isolates of C. elegans differ from the lab strain in diverse and substantive ways (Hodgkin and Doniach 1997; de Bono and Bargmann 1998; McGrath ; Duveau and Félix 2012; Andersen ; Sterken ; Large ; Gimond ). Critically, the effects of mutations are often modified by genetic background. This kind of background dependence both complicates experimental analyses and underlies important genetic phenomena such as variable penetrance of Mendelian diseases in humans (Summers 1996; Scriver and Waters 1999; Dipple and McCabe 2000; Gibson and Dworkin 2004; Paaby and Rockman 2014; Paaby and Gibson 2016). While using a allele as a visible marker for genetic mapping experiments, we discovered that its rolling phenotype is substantially suppressed by a wild strain background. We mapped the major-effect locus responsible for this suppression, finding that N2 carries a derived insertion in , a collagen gene with no known mutational effects. The ancestral allele of this collagen, found in all wild isolates of C. elegans and highly conserved among Caenorhabditis species, also modifies the effects of other canonical cuticle mutants, including those with Blistered and Squat phenotypes. These results underscore the importance of genetic background and the serendipity of Brenner’s choice of strain.

Materials and Methods

Strains

All experiments were carried out at 20° with NGM-agarose plates and OP50-1 E. coli for food, unless otherwise noted. We used the following strains: BE8: () V, BE13: () II, BE22: () II, BE44: () X, BE93: () II, BE108: () II, CB61: () I, CB91: () II, CB224: () V, CB768: () II, CB769: () II, CB1166: () IV, CB2070: () () II, CB4856: Hawaiian wild type, COP1834: () X, EG7993: [eft-3p::TdTomato::H2B] X, EG8951: [eft-3p::GFP::NLS + NeoR] X, N2: laboratory wild type, QG2797: () II; ajIR6 [X, CB4856 > N2] X, QG2798: () II; [eft-3p::TdTomato::H2B] [eft-3p::GFP::NLS + NeoR] X, QG2804: () II, QG2952: () II; () X, QG2954: () V; () X, QG2953: () II; () X, QG2955: () IV; () X, QG2956: () () II; () X, QG2957: () II; () X, QG2958: () II; () X, QG2960: () I; () X, QG2961: () II; () X, QG2962: () II; () X, QG3070: () V; () X, QG3072: () II; () X, QG3074: () II; () X, QG3076: () () X, SP419: () () II, TN64: () II, and WE5241: ajIR6 [X, CB4856 > N2] X. COP1834 () X was generated by Knudra Biosciences (now InVivo Biosystems), using the CRISPR/Cas9 homology-directed repair protocol of Paix . We refer to this synthetic ancestral allele in the main text as anc. In the N2 background, Cas9 was directed to sites on each side of the target insertion with sgRNA sequences CTGGATATAGTTGTTCCCGG and CAAGGAGAGATCGGACGCGA, and repair was directed by the DNA oligo AGGACGGATGCCGCGTCTGCCCACCAGGACCACCGGGTA- CTACCGGTTACCCAGGTCAGCCTGGCCCCCAGGGTCTCC- CAGGTAACCAAGGAGACCAAGGTGAAATTGGTCGCGAA- GGATCACCAGGAGCCCAGGGAGCTCCAGGAG. This oligo precisely excises the 8-bp insertion sequence from the N2 gene and simultaneously alters 16 additional sites, all silent codon positions, to prevent recutting by Cas9. We verified the presence of the () mutation in these strains by Sanger sequencing using primers RolF3: CAAATTCGACAAAGCGACAA and RolR3: GAGCATCGTAAGGCTGGAAA. We verified the presence of the () mutation by Sanger sequencing using primers X12.636F: TAGGCAAACTTGCTGCACAC and X12.636R: GAGACAGGCTGGAAATGAGC.

Observation of segregation distortion

As described in Kaur and Rockman (2014), we crossed wild isolate CB4856 and a strain carrying () () II in the N2 background. hermaphrodites were singled and Rol nonUnc adults were isolated and genotyped by Illumina GoldenGate Assay. Genotyping was performed by the DNA Sequencing and Genomics Core Facility of the University of Utah. Allele frequencies are in File S2.

Complementation crosses

We used visible markers to generate animals that are homozygous on chromosome II and heterozygous on the X chromosome, with one X chromosome from N2 and the second from the strain of interest (SOI). If the SOI carries the dominant suppressor of , then these animals are expected to show suppressed rolling behavior. Specifically, we crossed [GFP] II males to SOI hermaphrodites to generate / + + II; SOI X males. These we crossed to II hermaphrodites. For each SOI, we tested six GFP-positive nonUnc hermaphrodite progeny of this cross for rolling behavior in the second day of adulthood. These animals are mostly + / + II; SOI/N2 X. A small fraction of the phenotyped animals could be heterozygotes due to rare recombination events between and in the males. In addition, each tested animal is heterozygous (N2/SOI) for a random fraction (expectation 1/2) of the autosomes except for chromosome II. We used this approach to test 5 recombinant inbred advanced intercross lines (QX33, QX43, QX126, QX150, and QX202). We also tested 8 wild isolates by this approach (CB4856, PB306, EG4347, QX1211, JU319 [CeNDR isotype JU311], JU1088, PX179, and JU400 [CeNDR isotype JU394]), along with N2 and LJS1, which are laboratory-adapted strains.

Fine mapping with recombinants

To fine-map the suppressor, we crossed () II; CB4856 X and () II; X. The latter strain carries integrated single-copy transgenes on the X expressing fluorescent proteins, tdTomato::H2B from X:11.049 Mb and GFP::NLS from X:13.480 Mb [Wormbuilder; Frøkjær-Jensen ]. Both strains carry N2-derived autosomes. We homozygosed X chromosomes that were recombinant between the transgenes, scored their rolling behavior, and genotyped them at SNP markers in the mapping interval. Recombinant strain AM_GNR9 placed the suppressor to the right of SNP WBVar00083599 at X:12,579,121, and recombinant AM_GNR13 placed it to the left of SNP WBVar1602269 at X:12,662,633. Genotyping primers used for mapping were: WBVar01981458 indel, TGGGTAAACATCGGCTCCAT and TGTTCTGCACGGGAAAAGAT; WBVar00083599 SNP, CGACATCCAAAGTTTTTGAGACT and GAGAAAGTGTTATGGGCATGG; WBVar01602269 SNP, CGTGTGTTTCCGTTGTGAAT and TTCAGTGTTCATCGCAATCTG.

Association mapping

Variant data for 330 C. elegans isotypes were downloaded as the 20180527 CeNDR release soft-filtered vcf. We tested for a match between variants in the recombination-mapping interval (X:12,579,122 - X:12,662,632) and suppression phenotype in the panel of N2, CB4856, LSJ1, and the seven other tested wild isolates.

Gene model

Sequence and annotation data for the Caenorhabditis genus were downloaded from the Caenorhabditis Genomes Project. RNAseq data from young adults of C. elegans wild isolates were downloaded from the NCBI SRA: CB4856 PRJNA437313 (Zamanian ), AB1 and ED3040 PRJNA288824 (Vu ). Reads were mapped to a 100 Kb region of the N2 WS220 genome centered on with bwa mem version 0.7.17 (Li and Durbin 2010), assembled with Trinity version 2.3.2 in genome-guided mode (Grabherr ), and homologous transcripts were extracted by blast version 2.9 (Altschul ) against orthologs across the genus. Coding and protein sequences were aligned with mafft version 7.3.10 in L-INS-i mode (Katoh ), and homology and gene structures were plotted using R packages (Paradis and Schliep 2019), (Yu ), (Pagès ), and (Yin ). Species with only computationally predicted annotations (C. kamaaina, C. becei, C. panamensis) and two species (C. quiockensis and C. sulstoni) with extremely long predicted ortholog sequences potentially deriving from annotation errors were excluded. The CGP C. remanei protein sequence contains a large deletion that is not present in the WormBase version, and we opted to use the latter. Collagen triplet stability scores shown in Figure 2B were obtained from (Persikov ), and ignore higher-order interactions. Gene models and coding sequence alignments are in Files S10 and S11.

Epistasis analysis of visible mutants

For each tested mutation, we grew the mutant line and () double mutant in parallel at low population densities for several generations and performed synchronous egg lays to generate animals for phenotyping. For most strains we observed these synchronized animals three days later as young adults. For (), because the heterozygous phenotype (Rol) is different from the homozygous Squat phenotype (Sqt), we also scored /+ animals. For and ; animals, we followed 200 of each genotype through the third day of adulthood. Videos of worms (CB91, QG2979, QG2957, BE22, QG2953) were taken on day four of adulthood. Ten worms were picked to fresh seeded plates and imaged for 8 min at 12 frames per minute. Videos of worms (BE8, QG3070) were taken on day two of adulthood. Twenty worms were picked to fresh seeded plates and imaged for five minutes at 12 frames per minute. Video samples are in Files S3-S9.

Quantitative locomotion analysis

We generated quantitative phenotype data in three genetic backgrounds (wild-type, (), and ()), each with the ancestral and derived allele. Conditions and worm tracking have been described previously (Noble ; Mallard ). Lines (N2, COP1834, BE8, BE22, QG2957, QG3070) were each split to duplicate lineages, bleached, and grown under common conditions on HT115 bacteria in 90mm plates at 20° for two generations before assay, with each generation starting from around 500 L1 larvae that had been starvation synchronized in M9 buffer for 18 hr. Young adults were imaged in random order, on food, during day three post-L1, and again on two further generations (treated as above) for a total of six replicate plates per genotype. Worms were tracked for 8 min using the Multi-Worm Tracker (Swierczek ), the final 4 of which were analyzed after subsampling to 4 Hz, and 11-point skeletons and outlines from Choreography were parsed to generate summary track statistics based on size and movement. More precisely, we used Choreography-defined measurements of length, width, area, speed, acceleration, angular momentum (turning rate), mean body curvature, kink (the maximum ratio of angles between head/tail and body), and the length of continuous runs of Forward, Backward or Still motion (“bias”). Raw data are in File S12. For each worm track we took the median and variance of each metric, as a whole and split by bias state. Exploratory behavior was quantified as the area and circularity ( ) of the track convex hull (Pebesma and Bivand 2005), averaged first across 30 sec intervals for each of the longest 100 tracks from each plate, then across tracks. Traits were log transformed where strongly non-normal (an improvement in Shapiro-Wilk -log10 p-value > 6), and effects of assay block, defined by lineage and assay day, were removed by linear regression. Processed data are in File S13. We show univariate and multivariate (classical multidimensional scaling on the mean centered and scaled Euclidean distance matrix of plate means, base R ) analysis. In Figure 3A-B we used multivariate analysis of a subset of traits selected using sparse linear discriminant analysis (Clemmensen ). From 25 traits (repeatability < 0.5, thinned to reduce maximum colinearity to < 0.5), we selected the five metrics most associated with suppression of () or (), separately, using plate means. Traits retained for (BE8 vs. QG3070, COP1834, N2), ordered by absolute loading on the discriminant function, were log transformed curvature (S), width (F), kink, width (S) variance, and acceleration (F). Traits retained for (BE22 vs. QG2957, COP1834, N2) were log transformed circularity, velocity variance, width (F), kink (F), and run length (F) variance. R code for this analysis is in File S14.
Figure 3

Ancestral suppresses and alleles. A-B. Multidimensional scaling of locomotion and size traits selected by sparse discriminant analysis that maximize suppression of (A) or (B) by ancestral . Point shapes show genotype (ancestral or N2), bounded by ellipses giving the 95% confidence interval under a multivariate t-distribution (solid lines for ancestral ), colors show genetic interaction (none in the N2 background, or alleles of and ). Each point is the grand mean of tracks from around 500 young adult worms per replicate plate measured over three consecutive generations for each genotype, assayed for N2 (green triangles) and COP1834 (ancestral in the N2 background; green circles), BE22 (N2 ; (); orange triangles) and QG2957 (ancestral ; (); orange circles), and BE8 (N2 ; (); purple triangles) and QG3070 (ancestral ; (); purple circles). C-F. Univariate comparisons show variable effects across backgrounds. genotype is indicated with – (N2 allele) and + (ancestral) symbols. Complete suppression of track circularity is seen for () (C), and of worm width in the forward state for () (F), however partial (or no) suppression is the most common outcome. Raw and processed data in Files S12 and S13, and code in File S14.

Epistasis analysis in the CeMEE

To test for potential modifying effects of on natural variation segregating in the C. elegans multiparent experimental evolution (CeMEE) panel we genotyped the N2 indel from existing sequence data (Noble ) using bcftools (Li 2011) after indel realignment (DePristo ), obtaining calls for 365 recombinant inbred lines (RILs; from populations A6140, CA[1-3]50 and GA[1,2,4]50) for which locomotion has been measured on NGM (Mallard ). Two lines were excluded as multivariate outliers based on Mahalanobis distance. Genotypes were marker set 1 from Noble . The N2 allele is at a frequency of 16.5% in these lines, providing sufficient power to detect pairwise interactions conditional on joint allele frequency. In total, 167,187 diallelic SNPs where all four genotype classes were present at a minimum frequency of 10, excluding any uncertain imputations, were tested. To test for -by-genotype interactions we fit nested bivariate linear models for three pairs of partially correlated traits: length and width (forward state, log transformed), the and suppression discriminant functions that are linear combinations of five traits (see Quantitative locomotion analysis), and the single traits with the highest loading in each discriminant function, curvature and track circularity (log transformed). Trait values were best linear unbiased predictions (BLUPs) extracted from linear mixed effects models (R package ) fit to replicate observations, with fixed effects of population replicate. Significance testing followed the univariate approach in Noble . In brief, we first tested for genetic effects by likelihood ratio (Pillai’s trace statistic) for a full model with additive and interaction effects of genotype and focal marker genotype, against a null model (intercept only). Genome-wide significance was declared against a null distribution of >1000 test statistics generated by permuting lines within populations and retaining the minimum observed p-value. We used a relative permissive false discovery rate (FDR) of 0.2. Quantile-quantile plots showed statistics were well calibrated for length/width and the Rol/Sqt discriminant functions at , but strongly deflated for curvature/circularity. Inflation was evident for length/width at , independent of linkage disequilibrium, consistent with additional polygenic interactions. For loci with significant genetic effects, interaction significance was then assessed at a nominal threshold of by parametric bootstrap against the additive model (Bůžková ), resampling responses jointly among lines 5000 times. Genotype and phenotype data, and R code for this analysis are in Files S15, S16.

Gene expression analysis

We extracted data for 199 recombinant inbred advanced intercross lines (RIAILs) from Rockman , after excluding data from lines with annotation issues (Zych ). We performed structured nonparametric trait mapping as in Rockman and Kruglyak (2009) for abundance of 15,617 transcripts whose genes are more than a megabase from (to exclude local linkages for genes near that have their own -acting variants). We retained traits with genome- and experiment-wide significant linkage peaks (LOD > 4.3, 5% FDR) within 1 RIAIL-effective cM of (approximately 400 Kb). The nine significantly linked transcripts were tested for functional enrichment using the WormBase Enrichment Analysis Suite (Angeles-Albores ).

Data availability

All quantitative data and code to reproduce our analyses and main figures are available from FigShare and https://github.com/lukemn/cuticle. Strains are available upon request. File S1 details all supplemental files, File S2 contains genotyping data from Figure 1, Files S3-S9 contain short videos of mutant and suppressed adult hermaphrodites on plates, File S10 contains gene models from Figure 2A, File S11 contains coding sequence alignments for Figure 2B, Files S12 and S13 contain raw and processed Multi-Worm tracker data for Figure 3 with associated R code in File S14. File S15 contains genotypes and phenotypes used for detecting genetic interactions between and SNPs in the CeMEE (Figure 4), with associated R code in File S16. Supplemental material available at figshare: https://doi.org/10.25387/g3.12293138.
Figure 1

Allele frequencies along the X chromosome in Rol nonUnc s from a cross of CB4856 and N2-background () () II. Dashed lines show the Mendelian expectations for heterozygotes (1/2) and homozygotes (1/4 each).

Figure 2

A. N2 carries a derived insertion of 8 bp (red) relative to the ancestral allele. The insertion duplicates an 8-bp sequence present in the ancestor (underlined). B. gene tree and model for C. elegans CB4856 and other Caenorhabditis species. The x-axis shows distance from the start codon for each genome. C. Protein alignment, including predicted sequences for C. elegans wild isolates CB4856, AB1 and ED3040 assembled from young adult RNAseq data, and the frame-shifted N2 translation (marked with a blue dot). Gray-scale shading represents site conservation (% identity), and labels are colored by predicted collagen triplet stability (melting temperature) for runs of >1 G-X-Y repeats conserved across all sequences other than N2. The positions of conserved triplets are indicated above the alignment by gray boxes, and the positions of conserved cysteine residues potentially involved in inter-strand disulphide bridges are shown as red boxes.

Figure 4

modifies the effects of natural genetic variation on worm movement and shape. Genome-wide statistics are shown for two bivariate response models of indel SNP genotype, worm length/width (A) and Rol/Sqt sparse discriminant functions (B), using diallelic SNPs segregating in 363 recombinant inbred lines of the C. elegans multiparent experimental evolution (CeMEE) panel. Statistics are from a likelihood ratio test for a full additive and interaction linear model against a null model of no genetic effects. Permutation thresholds for genome-wide significance (FDR = 0.2) are shown in red, light shaded regions show 1.5 LOD drop QTL intervals (expanded to a minimum of 300 Kb for visibility), and the location of on the X chromosome is indicated by a dark gray line. Effect plots at right show genotype class means and standard errors for QTL with significant interactions (parametric bootstrap against additive models, ). Trait values shown are the first principal component for length/width, and the Sqt discriminant function, which explains most of the interaction in the Rol/Sqt bivariate model. Reference-based genotype is on the x-axis. Genotype and phenotype data in File S15, code in File S16.

Allele frequencies along the X chromosome in Rol nonUnc s from a cross of CB4856 and N2-background () () II. Dashed lines show the Mendelian expectations for heterozygotes (1/2) and homozygotes (1/4 each). A. N2 carries a derived insertion of 8 bp (red) relative to the ancestral allele. The insertion duplicates an 8-bp sequence present in the ancestor (underlined). B. gene tree and model for C. elegans CB4856 and other Caenorhabditis species. The x-axis shows distance from the start codon for each genome. C. Protein alignment, including predicted sequences for C. elegans wild isolates CB4856, AB1 and ED3040 assembled from young adult RNAseq data, and the frame-shifted N2 translation (marked with a blue dot). Gray-scale shading represents site conservation (% identity), and labels are colored by predicted collagen triplet stability (melting temperature) for runs of >1 G-X-Y repeats conserved across all sequences other than N2. The positions of conserved triplets are indicated above the alignment by gray boxes, and the positions of conserved cysteine residues potentially involved in inter-strand disulphide bridges are shown as red boxes. Ancestral suppresses and alleles. A-B. Multidimensional scaling of locomotion and size traits selected by sparse discriminant analysis that maximize suppression of (A) or (B) by ancestral . Point shapes show genotype (ancestral or N2), bounded by ellipses giving the 95% confidence interval under a multivariate t-distribution (solid lines for ancestral ), colors show genetic interaction (none in the N2 background, or alleles of and ). Each point is the grand mean of tracks from around 500 young adult worms per replicate plate measured over three consecutive generations for each genotype, assayed for N2 (green triangles) and COP1834 (ancestral in the N2 background; green circles), BE22 (N2 ; (); orange triangles) and QG2957 (ancestral ; (); orange circles), and BE8 (N2 ; (); purple triangles) and QG3070 (ancestral ; (); purple circles). C-F. Univariate comparisons show variable effects across backgrounds. genotype is indicated with – (N2 allele) and + (ancestral) symbols. Complete suppression of track circularity is seen for () (C), and of worm width in the forward state for () (F), however partial (or no) suppression is the most common outcome. Raw and processed data in Files S12 and S13, and code in File S14. modifies the effects of natural genetic variation on worm movement and shape. Genome-wide statistics are shown for two bivariate response models of indel SNP genotype, worm length/width (A) and Rol/Sqt sparse discriminant functions (B), using diallelic SNPs segregating in 363 recombinant inbred lines of the C. elegans multiparent experimental evolution (CeMEE) panel. Statistics are from a likelihood ratio test for a full additive and interaction linear model against a null model of no genetic effects. Permutation thresholds for genome-wide significance (FDR = 0.2) are shown in red, light shaded regions show 1.5 LOD drop QTL intervals (expanded to a minimum of 300 Kb for visibility), and the location of on the X chromosome is indicated by a dark gray line. Effect plots at right show genotype class means and standard errors for QTL with significant interactions (parametric bootstrap against additive models, ). Trait values shown are the first principal component for length/width, and the Sqt discriminant function, which explains most of the interaction in the Rol/Sqt bivariate model. Reference-based genotype is on the x-axis. Genotype and phenotype data in File S15, code in File S16.

Results

CB4856 carries a dominant suppressor of rol-1

During a study of recombination patterns on C. elegans chromosome II (Kaur and Rockman 2014), we crossed () () II worms (N2 background) with wild isolate strain CB4856, isolated in Hawaii in 1972 (Hodgkin and Doniach 1997). After allowing hermaphrodites to reproduce by selfing, we isolated Rol nonUnc s, which are homozygous for the N2 allele on chromosome II but should show Mendelian segregation of other chromosomes [ the incompatibility on chromosome I (Seidel )]. We noticed strong segregation distortion on the X chromosome among these Rol nonUnc s, which we had genotyped at 37 SNP markers (Figure 1, File S2). Distortion favored the N2 background, with a peak around 13 Mb where zero of 234 worms were homozygous for the CB4856 genotype (expectation 1/4 = 58.5). At the peak, 216 (92%) worms were N2 homozygotes and the remaining 18 (8%) were heterozygotes. We hypothesized that the CB4856 X chromosome carries a dominant suppressor of . We crossed the () allele into an X-chromosome substitution strain, which carries the CB4856 X chromosome but is otherwise N2, and confirmed that the resulting () II CB4856 X strain is strongly, but incompletely, suppressed for rolling. The worms retain a slight helical twist and can be distinguished from wildtype N2 and CB4856, but the dramatic rolling and circling behaviors of () are absent (Videos: CB91 (); N2 X vs. QG2797 (); CB4856 X in Files S3 and S4). Suppression of is also observed in X chromosome heterozygotes, consistent with a dominant suppressor on the CB4856 X and explaining the pattern of segregation distortion we observed in the crosses. Suppression often reveals interactions between genes in physical association or in specific developmental pathways, but can also reflect altered transcription, splicing, or translation of a mutant gene. These informational suppressors can be allele specific, masking only particular kinds of missense or nonsense or splice site variants, for example (Hodgkin 2005). There are two alleles with known effects on phenotype at present, and we found that the second, , is also suppressed by the CB4856 X chromosome. Although the molecular lesion is unknown, and mutants show distinct phenotypic profiles, including temperature sensitivity (Cox ), and we concluded that allele-specific interaction was thus unlikely.

The suppressor maps to an indel polymorphism in col-182

We performed complementation testing with a panel of N2-CB4856 recombinant inbred advanced intercross lines (Rockman and Kruglyak 2009) with breakpoints near the peak allele frequency distortion, taking advantage of the dominant mode of action of the CB4856 allele. Using this approach, we tested 5 RIAILs, scoring based on Rolling, with boundaries defined by RIAIL QX43, which carries the suppressor, and RIAIL QX126, which does not. These strains place the suppressor to the right of SNP WBVar00083496 at X:12,364,484 and to the left of indel WBVar01981458 at X:12,699,819 (WS272 coordinates). Next we used integrated single-copy fluorescent marker transgenes (Frøkjær-Jensen ) to select for N2/CB4856 recombinants in the interval, in N2 autosomal genetic backgrounds. These data localized the causative locus to the interval between 12.579 and 12.662 Mb. Complementation testing with seven additional wild isolates found that all exhibit suppression of . Laboratory strain LSJ1, which shares a laboratory ancestor with N2 but was cultured separately since 1963, does not exhibit suppression. Among all variants segregating in these strains in the mapped interval, only one exhibits perfect cosegregation with suppression: an 8-base pair (bp) deletion in the gene (WBVar01928355). Like , is one of the 181 collagen genes in the C. elegans genome (Teuscher ).

N2 carries a derived insertion mutation in col-182

Among 330 genome-sequenced C. elegans isotypes from around the world (CeNDR freeze 20180527), the 8-bp deletion is present in every strain except for the lab strains N2, LSJ1, and ECA252. These three strains are all derived from a single isolate of C. elegans sampled by L. N. Staniland in 1951 (McGrath ; Weber ; Sterken ; Cook ). The deletion state is also found in orthologs of all other examined Caenorhabditis species (Figure 2). This strongly suggests the reference allele is a derived insertion, one that arose either in the wild, and happened to be sampled by Staniland in 1951, or in the lab sometime before 1963 [likely before 1958 (Gimond )]. The eight basepair sequence, ATATCCAG, is also present in the ancestral allele, two bases downstream of the insertion site, so that it now occurs twice in N2 (Figure 2A). C. elegans collagens contain two blocks of Gly-X-Y repeats, flanked and separated by short cysteine-containing domains involved in interchain disulphide bonds (Page and Johnstone 2007). Until recently, was annotated as a three-exon, two-intron gene encoding a near-canonical collagen protein, missing only the middle cysteine domain (Teuscher ). However, new RNAseq data suggests that the earlier gene model was erroneous, and that the N2 transcript has only one intron (WormBase WS274). With this RNAseq-supported gene model, the 8-bp insertion causes a frameshift that eliminates the entire second Gly-X-Y domain and final cysteine domain (Figure 2B). As a consequence, has been reclassified as a pseudogene. The ancestral version of the gene, preserved in CB4856 and all other wild isolates, encodes a perfect canonical cuticular collagen. The C-terminal Gly-X-Y domain includes 43 consecutive Gly-X-Ys, tying it with four other genes for the longest such collagenous stretch among C. elegans collagens [with , , , and , none of which have known mutant phenotypes; (Teuscher )]. To test whether is the suppressor and whether the ancestral allele functions via the predicted isoform, we commissioned a strain that carries the ancestral splicing and reading frame in an otherwise N2 background (see strains). CRISPR-Cas9 conversion removed the 8-bp insertion and altered 16 additional base pairs, each a synonymous third codon position change in the predicted ancestral reading frame. We then confirmed that this ancestral allele anc suppressed , both the and alleles (videos: CB91 () vs. QG2953 (); anc, and BE22 () vs. QG2957 (); anc, Files S3 and S5-S7). We generated quantitative data for the allele (selected arbitrarily) by tracking young adult hermaphrodites and extracting an array of statistics describing locomotion, size, and shape (see Quantitative locomotion analysis). Reduction of this data by multidimensional scaling into two orthogonal axes provided quantitative confirmation that the ancestral anc allele suppresses some of these phenotypes in the background (Figure 3A). By univariate analysis, double mutants are indistinguishable from wild-type N2 for a metric that captures circling locomotion (Figure 3C), but weak to no suppression is the predominant outcome across the correlated set of single traits (Figure 3D-F).

col-182 interacts with other collagen mutants

is part of a complex network of collagens and collagen-modifying enzymes that interact genetically, often in quite complex ways (Cox ; Kusch and Edgar 1986). We therefore expected that the ancestral might modify ’s epistatic relationships with other genes, and might itself interact with other cuticle-specifying genes. Blistered mutants, described along with in Brenner’s original screen, develop fluid-filled blisters along the adult cuticle (Brenner 1974). In the N2 background, suppresses the blister phenotype of mutants (Cox ). We found that ; triple mutants show the expected suppression of rolling but are also unblistered, indicating that suppresses ’s Rol phenotype but not its suppression of . Moreover, modified ’s phenotype by itself; blisters were largely suppressed in ; double mutant hermaphrodites. The penetrance of the blister phenotype was reduced and the individual blisters were spatially restricted to the head region (Table 1). Conversely, enhances the blister phenotype of worms; young adults had small blisters, typically restricted to their heads, but ; blisters were larger, sometimes extending the whole length of the worm (Table 1). Like and , and encode collagens.
Table 1

Epistasis analysis for Blister phenotype. Each cell shows the counts of mutants/total worms. “Head-only” and “Full-body” count the number of worms with blisters that showed blisters either restricted to the head region or extending the entire length of the animal. Asterisks report p-values from Fisher’s Exact Test for a difference between the single and double mutant strains for each bli gene and phenotype. ns, not significant, * P < 0.01, ** P < 10-6

GenotypeBlistersHead-onlyFull-body
bli-133/363/330/33
bli-1; col-182anc9/40**9/9**0/9ns
bli-239/4024/390/39
bli-2; col-182anc30/30ns7/30*7/30*
Cox identified alleles in several genes that give rise to left-handed rollers, like (). Although we observed no gross phenotypic effect of on (), (), or (), we observed partial suppression of rolling in (), which we quantified by worm tracking (Figure 3, and see videos: BE8 () vs. QG3070 () anc, Files S8,S9). Suppression was qualitatively and quantitatively distinct to that of ; near complete for the single measure of worm width, but again highly variable and generally weak for other measures of locomotion and morphology (Figure 3B-F). () exhibits right-handed rolling as a heterozygote in the N2 background, and did so as well in the anc background. However, heterozygotes showed slowed development in the N2 background, while the ancestral allele suppressed the developmental delays. Finally, alleles of several additional genes involved in cuticle development – collagens (), () and (), and thioredoxin () – showed no gross phenotypic modification in the anc background.

col-182 modifies effects of natural variation on worm shape and locomotion

Collagens are known to influence body size (Brenner 1974; Fernando ; Madaan ), and our locomotion analysis identified specific axes of worm size, posture and locomotion modified by in two genetic backgrounds (Figure 3 B-F). We next sought to test more broadly for interactions between and natural genetic variation for these traits in the C. elegans Multiparent Experimental Evolution (CeMEE) panel, a collection of recombinant inbred lines derived from the pooled standing genetic diversity of 14 wild isolates and two N2-related strains (Teotónio ; Noble , 2019). We genotyped the N2 insertion in RILs sampled from an ancestral laboratory-adapted population, A6140, and from six populations derived from A6140 that evolved under varying mating system and environment (Noble ). Using Multi-Worm Tracker data for 363 lines, we fit bivariate linear models for three sets of correlated traits (length and width, body curvature and track circularity, and the Rol/Sqt discriminant functions from Figure 3B) to test for interaction effects. Univariate tests showed that genotype had no effect on the means of these population-centered traits ( by likelihood ratio test). We detected four loci with significant genetic effects at a per-model false discovery rate of 20% (Figure 4). Two QTL were detected for length/width with clear genetic interactions ( by bootstrap against the additive model): the first, on chromosome I, fell within the central recombination rate domain (1.5 LOD drop interval around 170 Kb); the second, on chromosome II, was contained by a single very large N2 protein coding gene, , with several missense variants, a splice-donor change, and heterozygous SNP calls suggestive of copy number variation segregating in the CeMEE founder haplotypes (Cook ). encodes a highly conserved predicted Rab family GTPase activator which, based on homology, may be involved in intracellular trafficking, a process critically important for collagen secretion from hypodermal cells (Roberts ; Ackema ). Two QTL were detected for the Rol/Sqt discriminant functions: one on chromosome II (interaction bootstrap ; 18 Kb interval) contained nine N2 annotated protein-coding genes of unknown function, mostly of the nematode-specific peptide group E family; the second, on chromosome V (interaction ; 25 Kb interval), was a specific interaction with MY16 haplotypes, spanning predicted ubiquitin protease partially, and fully, along with eight non-coding RNAs. is a non-essential, non-condensin subunit of the dosage compensation (DC) complex (Meyer and Casson 1986), with additional DC-independent roles in gene regulation (Webster ), and loss-of-function mutants show an enrichment in dysregulation of genes involved in the cuticle (Kramer ).

col-182 does not have systemic effects on gene expression

Several of the alleles that arose during C. elegans domestication have large and systemic effects on C. elegans biology. These include (de Bono and Bargmann 1998; McGrath ; Andersen ; Zhao ), (Duveau and Félix 2012), (Large ), and (Rockman ; Burga ). We therefore investigated whether is linked to systemic effects on gene expression in adult hermaphrodites, using a published dataset of gene expression in 199 N2/CB4856 recombinant inbred advanced intercross lines [RIAILs; (Rockman )]. The RIAILs provide much higher genotypic replication than a typical pairwise contrast of strains, as each allele is homozygous in approximately half the RIAILs, but effects that map to may be due to nearby variants in other genes. As expected, abundance shows strong linkage to its own location. Nine other genes show significant linkage to the region, including two glutathione S-transferases and two cytochrome P450 enzymes (Table S1). However, none are known to be involved in cuticle development and collectively they show no enrichment for any particular tissue. Thus the mutation in N2 appears to have limited effects on gene expression in young adult hermaphrodites, at least under ordinary laboratory conditions.

Discussion

The structural complexity of the nematode cuticle is reflected in its developmental and genetic regulation, and its environmental (temperature) and genetic sensitivity. Around 4% of the worm genome is dedicated to expressing, processing, and assembling the collagens, cuticulins, glycoproteins and other components of the multilayered extracellular matrix (Teuscher ). Yet, of 173 predicted cuticular collagen genes, phenotypes from extensive mutagenesis screens have been detected for just 21 (Page and Johnstone 2007). To this number we can now add , though we have also shown that even this select list might well have been shorter had Brenner not adopted N2 as the C. elegans reference genetic background. In the absence of molecular and structural data, the precise role of in the worm cuticle and its mode of interaction with other collagens remains obscure. The derived N2 insertion represents an evolved enhancer of rolling, and the ancestral modifies to a variable extent the phenotypes from other classical mutant alleles of , , and , but not obviously those of , , , , and , or . The expression of cuticular genes during worm development offers no clear insight. Of the tested genes, only , for which suppression by ancestral was strongest, shows strong stage specificity, being around 30-fold enriched in L4 (i.e., when the adult cuticle is manufactured; Figure S1). But , and show generally similar patterns and levels of transcriptional activity over the life-cycle, with very low expression in embryonic and early larval stages. The Blistered phenotype is thought to be due to defects in struts linking basal and cortical layers of the adult cuticle, and of six Blistered mutants, three are enzymes rather than structural components. , and are all highly expressed collagens that interact genetically, with similar stage specificity from L2 onward (Figure S1). is unique among collagens in its essentiality (Priess and Hirsh 1986; van der Keyl ; Novelli ), and is strongly expressed in the embryo as well. also interacts genetically with and , and 22 other genes (Cox ; Kusch and Edgar 1986; Kramer and Johnson 1993; Kramer 1994; Westlund ; Nyström ; Byrne ; Shephard ; Cai ), yet we saw no obvious modification of the left rolling phenotype of () mutants in the ancestral background. This may be explained by the extreme specificity of allelic interactions among collagen mutants, and sqt mutants in particular. The () allele tested is a recessive C-terminal C>Y substitution, altering cross-linking (Kramer and Johnson 1993; Yang and Kramer 1999), while () is an N-terminal R>C substitution of unknown structural effect. Alleles of vary markedly in their type, severity, temperature sensitivity, and degree of dominance of phenotypes, as well as inter- and intragenic interaction effects; from near wild-type for a null allele, to left or right rolling, abnormal hermaphrodite tail or male rays, or variation in body length. Enrichment in CeMEE interaction statistics for worm length/width over a small region spanning () provides some fuel for speculation that allele-specific interactions with may exist. Lastly, no grossly visible interactions were seen for collagens involved in annuli formation and shape , , , (McMahon ). In sum, we surmise that likely plays a role, apparently redundant under laboratory conditions, in one or both of the strut-anchored adult cuticular layers. The oft-touted genetic simplicity of C. elegans breaks down somewhat when considering the cuticle, and targeted biochemical and structural analysis, together with epistasis analysis encompassing natural genetic variation, will be required to clarify the precise role of and the majority of other collagens with no known function in the N2 background. Effects of genetic background are ubiquitous in complex genetic systems wherever they are carefully considered. Studies mixing natural with domesticated genetic variation have amply shown the importance of genetic interaction on the phenotypic outcome of allelic effects in C. elegans (Seidel ; McGrath ; Bendesky ; Duveau and Félix 2012; Gaertner ; Andersen ; Glater ; Greene ; Ben-David ; Bernstein ; Zhao ). This extends to classical mutations of the cuticle, some of the first mutants isolated in C. elegans and core components of the worm geneticist’s toolkit.
  71 in total

1.  Crossover heterogeneity in the absence of hotspots in Caenorhabditis elegans.

Authors:  Taniya Kaur; Matthew V Rockman
Journal:  Genetics       Date:  2013-10-30       Impact factor: 4.562

2.  BMP Signaling Determines Body Size via Transcriptional Regulation of Collagen Genes in Caenorhabditis elegans.

Authors:  Uday Madaan; Edlira Yzeiraj; Michael Meade; James F Clark; Christine A Rushlow; Cathy Savage-Dunn
Journal:  Genetics       Date:  2018-10-01       Impact factor: 4.562

Review 3.  Monogenic traits are not simple: lessons from phenylketonuria.

Authors:  C R Scriver; P J Waters
Journal:  Trends Genet       Date:  1999-07       Impact factor: 11.639

4.  Widespread genetic incompatibility in C. elegans maintained by balancing selection.

Authors:  Hannah S Seidel; Matthew V Rockman; Leonid Kruglyak
Journal:  Science       Date:  2008-01-10       Impact factor: 47.728

5.  ggbio: an R package for extending the grammar of graphics for genomic data.

Authors:  Tengfei Yin; Dianne Cook; Michael Lawrence
Journal:  Genome Biol       Date:  2012-08-31       Impact factor: 13.583

6.  Balancing selection shapes density-dependent foraging behaviour.

Authors:  Joshua S Greene; Maximillian Brown; May Dobosiewicz; Itzel G Ishida; Evan Z Macosko; Xinxing Zhang; Rebecca A Butcher; Devin J Cline; Patrick T McGrath; Cornelia I Bargmann
Journal:  Nature       Date:  2016-10-31       Impact factor: 49.962

7.  CeNDR, the Caenorhabditis elegans natural diversity resource.

Authors:  Daniel E Cook; Stefan Zdraljevic; Joshua P Roberts; Erik C Andersen
Journal:  Nucleic Acids Res       Date:  2016-10-03       Impact factor: 16.971

8.  Changes to social feeding behaviors are not sufficient for fitness gains of the Caenorhabditis elegans N2 reference strain.

Authors:  Yuehui Zhao; Lijiang Long; Wen Xu; Richard F Campbell; Edward E Large; Joshua S Greene; Patrick T McGrath
Journal:  Elife       Date:  2018-10-17       Impact factor: 8.140

9.  Full-length transcriptome assembly from RNA-Seq data without a reference genome.

Authors:  Manfred G Grabherr; Brian J Haas; Moran Yassour; Joshua Z Levin; Dawn A Thompson; Ido Amit; Xian Adiconis; Lin Fan; Raktima Raychowdhury; Qiandong Zeng; Zehua Chen; Evan Mauceli; Nir Hacohen; Andreas Gnirke; Nicholas Rhind; Federica di Palma; Bruce W Birren; Chad Nusbaum; Kerstin Lindblad-Toh; Nir Friedman; Aviv Regev
Journal:  Nat Biotechnol       Date:  2011-05-15       Impact factor: 54.908

10.  A global analysis of genetic interactions in Caenorhabditis elegans.

Authors:  Alexandra B Byrne; Matthew T Weirauch; Victoria Wong; Martina Koeva; Scott J Dixon; Joshua M Stuart; Peter J Roy
Journal:  J Biol       Date:  2007-09-26
View more
  4 in total

Review 1.  C. elegans Apical Extracellular Matrices Shape Epithelia.

Authors:  Jennifer D Cohen; Meera V Sundaram
Journal:  J Dev Biol       Date:  2020-10-06

Review 2.  Form and function of the apical extracellular matrix: new insights from Caenorhabditis elegans, Drosophila melanogaster, and the vertebrate inner ear.

Authors:  Sherry Li Zheng; Jennifer Gotenstein Adams; Andrew D Chisholm
Journal:  Fac Rev       Date:  2020-12-22

3.  Natural genetic variation as a tool for discovery in Caenorhabditis nematodes.

Authors:  Erik C Andersen; Matthew V Rockman
Journal:  Genetics       Date:  2022-01-04       Impact factor: 4.562

Review 4.  From QTL to gene: C. elegans facilitates discoveries of the genetic mechanisms underlying natural variation.

Authors:  Kathryn S Evans; Marijke H van Wijk; Patrick T McGrath; Erik C Andersen; Mark G Sterken
Journal:  Trends Genet       Date:  2021-07-03       Impact factor: 11.639

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.