| Literature DB >> 30886426 |
Xiaobo Li1,2,3, Weronika Patena1,2, Friedrich Fauser1,2, Robert E Jinkerson2,4, Shai Saroussi2, Moritz T Meyer1, Nina Ivanova2, Jacob M Robertson1,2, Rebecca Yue2, Ru Zhang2,5, Josep Vilarrasa-Blasi2, Tyler M Wittkopp2,6,7, Silvia Ramundo8, Sean R Blum2, Audrey Goh1, Matthew Laudon9, Tharan Srikumar1, Paul A Lefebvre9, Arthur R Grossman2, Martin C Jonikas10,11.
Abstract
Photosynthetic organisms provide food and energy for nearly all life on Earth, yet half of their protein-coding genes remain uncharacterized1,2. Characterization of these genes could be greatly accelerated by new genetic resources for unicellular organisms. Here we generated a genome-wide, indexed library of mapped insertion mutants for the unicellular alga Chlamydomonas reinhardtii. The 62,389 mutants in the library, covering 83% of nuclear protein-coding genes, are available to the community. Each mutant contains unique DNA barcodes, allowing the collection to be screened as a pool. We performed a genome-wide survey of genes required for photosynthesis, which identified 303 candidate genes. Characterization of one of these genes, the conserved predicted phosphatase-encoding gene CPL3, showed that it is important for accumulation of multiple photosynthetic protein complexes. Notably, 21 of the 43 higher-confidence genes are novel, opening new opportunities for advances in understanding of this biogeochemically fundamental process. This library will accelerate the characterization of thousands of genes in algae, plants, and animals.Entities:
Mesh:
Year: 2019 PMID: 30886426 PMCID: PMC6636631 DOI: 10.1038/s41588-019-0370-6
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Fig. 1 ∣A genome-wide library of Chlamydomonas mutants was generated by random insertion of barcoded cassettes and mapping of insertion sites.
a, Chlamydomonas reinhardtii is used for studies of various cellular processes and organism-environment interactions. b, Our library contains 62,389 insertional mutants maintained as 245 plates of 384-colony arrays. Each mutant contains at least one insertion cassette at a random site in its genome; each insertion cassette contains one unique barcode at each end (Supplementary Fig. 1a-c). c, The insertion density is largely random over the majority of the genome. This panel compares the observed insertion density over the genome (the left column above each chromosome number) to three simulations with insertions randomly distributed over all mappable positions in the genome (the three narrow columns to the right for each chromosome). Areas that are white throughout all columns represent regions where insertions cannot be mapped to a unique genomic position due to highly repetitive sequence. See also Supplementary Fig. 4.
Fig. 2 ∣The library covers 83% of Chlamydomonas genes.
a, 83% of all Chlamydomonas genes have one or more insertions in the library. b, In various functional groups, more than 75% of genes are represented by insertions in the library. c, The number of insertions per gene is roughly correlated with gene length. Box heights represent quartiles, whiskers represent 1st and 99th percentiles, and outliers are plotted as crosses. Box widths are proportional to the number of genes in each bin. d, Insertion density varies among different gene features, with the lowest density in exons. e, More than 1,800 mutants were distributed to approximately 200 laboratories around the world during the first 18 months of its availability. f, Distributed mutants are being used to study a variety of biological processes. Only genes with some functional annotation are shown.
Fig. 3 ∣A high-throughput screen using the library identifies many genes with known roles in photosynthesis and many novel components.
a, Unique barcodes allow screening mutants in a pool. Mutants deficient in photosynthesis can be identified because their barcodes will be less abundant after photosynthetic growth relative to after heterotrophic growth. b, Biological replicates were highly reproducible, with a Spearman’s correlation of 0.982. Each dot represents one barcode. See also Supplementary Fig. 5. c, The phenotype of each insertion was determined by comparing its read count under photosynthetic and heterotrophic conditions. Insertions that fell below the phenotype cutoff were considered to show a defect in photosynthesis. cpl3 alleles are highlighted. d, Exon and intron insertions are most likely to show strong phenotypes, while 3’UTR insertions rarely do. The plot is based on all insertions for the 43 higher-confidence genes. e, The photosynthetic/heterotrophic ratio of all the alleles are shown for hit and control genes. Each column is a gene; each horizontal bar is an allele. f, The 303 candidate genes were categorized based on statistical confidence in this screen and based on whether the genes had a previously known function in photosynthesis (see Supplementary Note). g, Known higher-confidence genes, novel higher-confidence genes, and lower-confidence genes are all enriched in predicted chloroplast-targeted proteins (P < 0.011). h, A schematic summary illustrates the numbers of candidate genes in each category (panel f) and the specific functions of the genes with a known role in processes related to photosynthesis.
Higher-confidence genes from the photosynthesis screen that had a previously known role in photosynthesis.
| Category | Gene | Defline/ | Pred | Alleles in two | At homolog[ | Reference and | ||
|---|---|---|---|---|---|---|---|---|
| +[ | −[ | FDR[ | ||||||
| Calvin-Benson-Bassham cycle | Cre03.g185550 ( | Sedoheptulose-1,7-bisphosphatase | C | 3 | 0 | 0.021 | AT3G55800.1 ( | Arabidopsis[ |
| 3 | 0 | 0.018 | ||||||
| Cre12.g524500 ( | Rubisco small subunit N-methyltransferase | O | 3 | 0 | 0.021 | AT3G07670.1 | Pisum[ | |
| 3 | 0 | 0.018 | ||||||
| Cre06.g298300 ( | Pentatricopeptide repeat protein, stabilizes rbcL mRNA | C | 1 | 1 | 1.000 | AT4G34830.1 ( | Chlamydomonas and Arabidopsis[ | |
| 2 | 0 | 0.239 | ||||||
| Carbon concentrating mechanism | Cre12.g497300 ( | Rhodanese-like Ca-sensing receptor | C | 2 | 0 | 0.260 | AT5G23060.1 ( | Chlamydomonas[ |
| 2 | 0 | 0.239 | ||||||
| Cre10.g452800 ( | Low-CO2-inducible protein | C | 2 | 0 | 0.260 | - | Chlamydomonas[ | |
| 1 | 1 | 1.000 | ||||||
| Chloroplast and thylakoid morphogenesis | Cre14.g616600 | - | M | 4 | 3 | 0.021 | AT1G03160.1 ( | Arabidopsis[ |
| 4 | 3 | 0.018 | ||||||
| Cofactor and signaling molecule metabolism | Cre13.g581850 | - | M | 5 | 5 | 0.010 | AT4G31390.1 | Arabidopsis[ |
| 2 | 8 | 1.000 | ||||||
| Cre10.g423500 ( | Heme oxygenase | C | 3 | 0 | 0.021 | AT1G69720.1 ( | Chlamydomonas[ | |
| 3 | 0 | 0.018 | ||||||
| Cre03.g188700 ( | Plastid lipid associated protein, Fibrillin | C | 3 | 1 | 0.070 | AT5G09820.2 | Arabidopsis[ | |
| 3 | 1 | 0.056 | ||||||
| Cre16.g659050 | - | C | 4 | 6 | 0.098 | AT1G68890.1 | Chlamydomonas[ | |
| 4 | 6 | 0.075 | ||||||
| PSI protein synthesis and assembly | Cre12.g524300 ( | Predicted protein | C | 2 | 0 | 0.260 | AT1G22700.1 | Synechocystis[ |
| 2 | 0 | 0.239 | ||||||
| Cre01.g045902 | - | C | 1 | 1 | 1.000 | AT3G24430.1 ( | Arabidopsis[ | |
| 2 | 0 | 0.239 | ||||||
| PSI RNA splicing and stabilization | Cre09.g389615 | - | M | 5 | 0 | 0.0002 | AT3G17040.1 ( | Chlamydomonas[ |
| 5 | 0 | 0.0002 | ||||||
| Cre01.g027150 ( | DEAD/DEAH-box helicase | M | 5 | 1 | 0.0004 | AT1G70070.1 ( | Arabidopsis[ | |
| 5 | 1 | 0.0003 | ||||||
| Cre09.g394150 ( | - | M | 5 | 1 | 0.0004 | - | Chlamydomonas[ | |
| 5 | 1 | 0.0003 | ||||||
| Cre12.g531050 ( | PsaA mRNA maturation factor 3 | C | 3 | 0 | 0.021 | - | Chlamydomonas[ | |
| 3 | 0 | 0.018 | ||||||
| Cre10.g440000 ( | - | C | 2 | 0 | 0.260 | - | Chlamydomonas[ | |
| 2 | 0 | 0.239 | ||||||
| PSII protein synthesis and assembly | Cre13.g578650 ( | Similar to complex I intermediate-associated protein 30 | C | 3 | 3 | 0.260 | AT1G16720.1 ( | Arabidopsis[ |
| 3 | 3 | 0.208 | ||||||
| Cre02.g073850 ( | Predicted protein | C | 2 | 0 | 0.260 | AT1G05385.1 ( | Arabidopsis[ | |
| 2 | 0 | 0.239 | ||||||
| Cre02.g105650 | - | C | 2 | 0 | 0.260 | AT5G51545.1 ( | Arabidopsis[ | |
| 2 | 0 | 0.239 | ||||||
| Cre06.g273700 ( | - | C | 2 | 0 | 0.260 | AT5G23120.1 ( | Arabidopsis[ | |
| 1 | 1 | 1.000 | ||||||
| Cre10.g430150 ( | - | C | 2 | 0 | 0.260 | AT1G02910.1 ( | Arabidopsis[ | |
| 1 | 1 | 1.000 | ||||||
Prediction of protein localization by PredAlgo[56]: C = chloroplast, M = mitochondrion, SP = secretory pathway, O = other.
The number of exon/intron/5’UTR mutant alleles for that gene that satisfy our requirement of minimum 50 reads and showed at least 10X fewer normalized reads in the TP-light sample compared to the TAP-dark sample.
The number exon/intron/5’UTR mutant alleles for that gene that satisfy our minimum read count requirement but did not satisfy the at least 10X depletion in TP-light criterion.
the FDR for that gene compared to all alleles for all genes (see Supplementary Note).
Arabidopsis homolog, obtained from the “best_arabidopsis_TAIR10_hit_name” field in Phytozome[12].
AT3G17040.1 is required for functional PSII in Arabidopsis whereas Cre09.g389615 was shown to be involved in PSI accumulation in Chlamydomonas.
Higher-confidence genes from the photosynthesis screen with no previously known role in photosynthesis.
| Gene | Defline/description in | PredAlgo | Alleles in two | At homolog | ||
|---|---|---|---|---|---|---|
| + | − | FDR | ||||
| Cre01.g008550 | Serine/threonine kinase-related | O | 2 | 0 | 0.260 | AT1G73450.1 |
| 1 | 1 | 1.000 | ||||
| Cre01.g014000 | - | C | 3 | 0 | 0.021 | - |
| 3 | 0 | 0.018 | ||||
| Cre01.g037800 ( | ATP binding protein; thioredoxin domain | O | 3 | 3 | 0.260 | AT2G18990.1 ( |
| 1 | 5 | 1.000 | ||||
| Cre02.g073900 | All-trans-10'-apo-beta-carotenal 13,14-cleaving dioxygenase | C | 3 | 1 | 0.070 | AT4G32810.1 ( |
| 3 | 1 | 0.056 | ||||
| Cre02.g111550 | Serine/threonine kinase-related | SP | 10 | 8 | < 10−6 | AT4G24480.1 |
| 6 | 12 | 0.015 | ||||
| Cre03.g185200 ( | Metallophosphoesterase/metallodependent phosphatase | C | 3 | 4 | 0.260 | AT1G07010.1 |
| 3 | 4 | 0.239 | ||||
| Cre06.g259100 | - | C | 1 | 4 | 1.000 | - |
| 3 | 2 | 0.117 | ||||
| Cre06.g281800 | Domain of unknown function (DUF1995) | C | 3 | 0 | 0.021 | - |
| 3 | 0 | 0.018 | ||||
| Cre07.g316050 ( | Chloroplast DnaJ-like protein | M | 2 | 0 | 0.260 | AT5G59610.1 |
| 1 | 1 | 1.000 | ||||
| Cre07.g341850 ( | Translation initiation factor IF-2, chloroplastic | C | 2 | 0 | 0.260 | AT1G17220.1 ( |
| 2 | 0 | 0.239 | ||||
| Cre08.g358350 ( | Fast leu-rich domain-containing[ | C | 3 | 2 | 0.152 | - |
| 3 | 2 | 0.117 | ||||
| Cre09.g396250 ( | Phosphatidate cytidylyltransferase | SP | 2 | 0 | 0.260 | AT5G04490.1 ( |
| 1 | 1 | 1.000 | ||||
| Cre10.g429650 | Alpha/beta hydrolase family (Abhydrolase_5) | O | 2 | 0 | 0.260 | - |
| 1 | 1 | 1.000 | ||||
| Cre10.g448950 | Nocturnin | C | 1 | 1 | 1.000 | AT3G58560.1 |
| 2 | 0 | 0.239 | ||||
| Cre11.g467712 | Structural maintenance of chromosomes smc family member; starch-binding domain | M | 7 | 7 | 0.0003 | AT5G05180.1 |
| 7 | 7 | 0.0003 | ||||
| Cre12.g542569 | Ionotropic glutamate receptor | O | 0 | 2 | 1.000 | AT1G05200.1 ( |
| 2 | 0 | 0.239 | ||||
| Cre13.g566400 ( | Fast leu-rich domain-containing[ | M | 4 | 2 | 0.018 | - |
| 4 | 2 | 0.015 | ||||
| Cre13.g574000 ( | Voltage-gated chloride channel | O | 1 | 11 | 1.000 | AT5G26240.1 ( |
| 4 | 8 | 0.144 | ||||
| Cre13.g586750 | Transportin 3 and importin | O | 3 | 4 | 0.260 | AT5G62600.1 |
| 2 | 5 | 1.000 | ||||
| Cre16.g658950 | - | C | 2 | 2 | 0.909 | - |
| 3 | 1 | 0.056 | ||||
| Cre50.g761497 | Magnesium transporter mrs2 homolog, mitochondrial | M | 2 | 0 | 0.260 | AT5G22830.1 ( |
| 2 | 0 | 0.239 | ||||
The annotation of “fast leu-rich domain-containing” cannot be confirmed by BLASTp analysis at NCBI[57].
Fig. 4 ∣CPL3 is required for photosynthetic growth and accumulation of photosynthetic protein complexes in the thylakoid membranes.
a, The cpl3 mutant contains cassettes inserted in the first exon of CPL3. The locations of conserved protein phosphatase motifs are indicated (see Supplementary Fig. 6e). b, cpl3 is deficient in growth under photosynthetic conditions and can be rescued upon complementation with the wild-type CPL3 gene (comp1–3 represent three independent complemented lines). c, cpl3 has a lower relative photosynthetic electron transport rate than the wild-type strain (WT) and comp1. Error bars indicate standard deviations (n = 3 for WT and comp1; n = 7 for cpl3). d, Whole-cell proteomics (Supplementary Table 14) indicate that cpl3 is deficient in accumulation of PSII, PSI, and the chloroplast ATP synthase. Each gray dot represents one Chlamydomonas protein. PSII, PSI and ATP synthase subunits are highlighted as black or red symbols. e, Western blots show that CPL3 is required for normal accumulation of the PSII subunit CP43, the PSI subunit PsaA, and the chloroplast ATP synthase subunit ATPC. α-tubulin was used as a loading control. See also Supplementary Fig. 7c. f, A heatmap of the protein abundance of subunits in the light reactions protein complexes or enzymes in the CBB cycle in cpl3 relative to the wild type based on proteomics data. Depicted subunits that were not detected by proteomics are filled with gray. Nuclear- and chloroplast-encoded proteins are labeled in black and red fonts respectively. A stack of horizontal ovals indicates different isoforms for the same enzyme, such as FBA1, FBA2, and FBA3.