Literature DB >> 30886426

A genome-wide algal mutant library and functional screen identifies genes required for eukaryotic photosynthesis.

Xiaobo Li^1,2,3, Weronika Patena^1,2, Friedrich Fauser^1,2, Robert E Jinkerson^2,4, Shai Saroussi², Moritz T Meyer¹, Nina Ivanova², Jacob M Robertson^1,2, Rebecca Yue², Ru Zhang^2,5, Josep Vilarrasa-Blasi², Tyler M Wittkopp^2,6,7, Silvia Ramundo⁸, Sean R Blum², Audrey Goh¹, Matthew Laudon⁹, Tharan Srikumar¹, Paul A Lefebvre⁹, Arthur R Grossman², Martin C Jonikas^10,11.

Abstract

Photosynthetic organisms provide food and energy for nearly all life on Earth, yet half of their protein-coding genes remain uncharacterized1,2. Characterization of these genes could be greatly accelerated by new genetic resources for unicellular organisms. Here we generated a genome-wide, indexed library of mapped insertion mutants for the unicellular alga Chlamydomonas reinhardtii. The 62,389 mutants in the library, covering 83% of nuclear protein-coding genes, are available to the community. Each mutant contains unique DNA barcodes, allowing the collection to be screened as a pool. We performed a genome-wide survey of genes required for photosynthesis, which identified 303 candidate genes. Characterization of one of these genes, the conserved predicted phosphatase-encoding gene CPL3, showed that it is important for accumulation of multiple photosynthetic protein complexes. Notably, 21 of the 43 higher-confidence genes are novel, opening new opportunities for advances in understanding of this biogeochemically fundamental process. This library will accelerate the characterization of thousands of genes in algae, plants, and animals.

Entities: CellLine Chemical Disease Gene Mutation Species

Mesh：

Year: 2019 PMID： 30886426 PMCID： PMC6636631 DOI： 10.1038/s41588-019-0370-6

Source DB: PubMed Journal: Nat Genet ISSN： 1061-4036 Impact factor: 38.330

The green alga Chlamydomonas has long been employed for genetic studies of eukaryotic photosynthesis because of its rare ability to grow in the absence of photosynthetic function[3]. In addition, it has made extensive contributions to our basic understanding of light signaling, stress acclimation, and metabolism of carbohydrates, lipids, and pigments (Fig. 1a)[4-6]. Moreover, Chlamydomonas retained many genes from the plant-animal common ancestor, which contributed to the understanding of fundamental aspects of the structure and function of cilia and basal bodies[7,8]. Like Saccharomyces cerevisiae, Chlamydomonas can grow as a haploid, facilitating genetic studies. However, until now, the value of Chlamydomonas has been limited by the lack of mutants in most of its nuclear genes.

Fig. 1 ∣

A genome-wide library of Chlamydomonas mutants was generated by random insertion of barcoded cassettes and mapping of insertion sites.

a, Chlamydomonas reinhardtii is used for studies of various cellular processes and organism-environment interactions. b, Our library contains 62,389 insertional mutants maintained as 245 plates of 384-colony arrays. Each mutant contains at least one insertion cassette at a random site in its genome; each insertion cassette contains one unique barcode at each end (Supplementary Fig. 1a-c). c, The insertion density is largely random over the majority of the genome. This panel compares the observed insertion density over the genome (the left column above each chromosome number) to three simulations with insertions randomly distributed over all mappable positions in the genome (the three narrow columns to the right for each chromosome). Areas that are white throughout all columns represent regions where insertions cannot be mapped to a unique genomic position due to highly repetitive sequence. See also Supplementary Fig. 4.

In the present study, we sought to generate a genome-wide collection of Chlamydomonas mutants with known gene disruptions to provide mutants in genes of interest for the scientific community, and then to leverage this collection to identify genes with roles in photosynthesis. To reach the necessary scale, we chose to use random insertional mutagenesis and built on advances in insertion mapping and mutant propagation from our pilot study[9]. To enable mapping of insertion sites and screening pools of mutants on a much larger scale, we developed new tools leveraging unique DNA barcodes in each transforming cassette. We generated mutants by transforming haploid cells with DNA cassettes that randomly insert into the genome and inactivate the genes they insert into. We maintained the mutants as indexed colony arrays on agar media containing acetate as a carbon and energy source to allow recovery of mutants with defects in photosynthesis. Each DNA cassette contained two unique barcodes, one on each side of the cassette (Supplementary Fig. 1a-d). For each mutant, the barcode and genomic flanking sequences on each side of the cassette were initially unknown (Supplementary Fig. 1e). We determined the sequence of the barcode(s) in each mutant colony by combinatorial pooling and deep sequencing (Supplementary Fig. 1f, Supplementary Fig. 2). We then mapped each insertion by pooling all mutants and amplifying all flanking sequences together with their corresponding barcodes followed by deep sequencing (Supplementary Fig. 1g). The combination of these datasets identified the insertion site(s) in each mutant. This procedure yielded 62,389 mutants on 245 plates, with a total of 74,923 insertions that were largely randomly distributed over the chromosomes (Fig. 1b,c; Supplementary Fig. 3, 4; Supplementary Table 5). This library provides mutants for ~83% of all nuclear genes (Fig. 2a-d). Approximately 69% of genes are represented by an insertion in a 5’ UTR, an exon or an intron – regions most likely to cause an altered phenotype when disrupted. Many gene sets of interest to the research community are well represented, including genes encoding proteins phylogenetically associated with the plant lineage (GreenCut2)[1], proteins that localize to the chloroplast[10], or those associated with the structure and function of flagella or basal bodies[11,12] (Fig. 2b). Mutants in this collection are available through the CLiP website (see URLs). Over 1,800 mutants have already been distributed to over 200 laboratories worldwide in the first 18 months of pre-publication distribution (Fig. 2e). These mutants are facilitating genetic investigation of a broad range of processes, ranging from photosynthesis and metabolism to cilia structure and function (Fig. 2f).

Fig. 2 ∣

The library covers 83% of Chlamydomonas genes.

a, 83% of all Chlamydomonas genes have one or more insertions in the library. b, In various functional groups, more than 75% of genes are represented by insertions in the library. c, The number of insertions per gene is roughly correlated with gene length. Box heights represent quartiles, whiskers represent 1st and 99th percentiles, and outliers are plotted as crosses. Box widths are proportional to the number of genes in each bin. d, Insertion density varies among different gene features, with the lowest density in exons. e, More than 1,800 mutants were distributed to approximately 200 laboratories around the world during the first 18 months of its availability. f, Distributed mutants are being used to study a variety of biological processes. Only genes with some functional annotation are shown.

To identify genes required for photosynthesis, we screened our library for mutants deficient in photosynthetic growth. Rather than phenotyping each strain individually, we pooled the entire library into one culture and leveraged the unique barcodes present in each strain to track its abundance after growth under different conditions. This feature enables genome-wide screening with speed and depth unprecedented in photosynthetic eukaryotes. We grew a pool of mutants photosynthetically in light in minimal Tris-Phosphate (TP) medium with CO2 as the sole source of carbon, and heterotrophically in the dark in Tris-Acetate-Phosphate (TAP) medium, where acetate provides fixed carbon and energy[3] (Fig. 3a). To quantify mutant growth under each condition, we amplified and deep sequenced the barcodes from the final cell populations. We then compared the ability of each mutant to grow under photosynthetic and heterotrophic conditions by comparing the read counts of each barcode from each condition (Supplementary Table 10; Supplementary Note). Mutant phenotypes were highly reproducible (Fig. 3b and Supplementary Fig. 5, a and b). We identified 3,109 mutants deficient in photosynthetic growth (Fig. 3c; Supplementary Note).

Fig. 3 ∣

A high-throughput screen using the library identifies many genes with known roles in photosynthesis and many novel components.

a, Unique barcodes allow screening mutants in a pool. Mutants deficient in photosynthesis can be identified because their barcodes will be less abundant after photosynthetic growth relative to after heterotrophic growth. b, Biological replicates were highly reproducible, with a Spearman’s correlation of 0.982. Each dot represents one barcode. See also Supplementary Fig. 5. c, The phenotype of each insertion was determined by comparing its read count under photosynthetic and heterotrophic conditions. Insertions that fell below the phenotype cutoff were considered to show a defect in photosynthesis. cpl3 alleles are highlighted. d, Exon and intron insertions are most likely to show strong phenotypes, while 3’UTR insertions rarely do. The plot is based on all insertions for the 43 higher-confidence genes. e, The photosynthetic/heterotrophic ratio of all the alleles are shown for hit and control genes. Each column is a gene; each horizontal bar is an allele. f, The 303 candidate genes were categorized based on statistical confidence in this screen and based on whether the genes had a previously known function in photosynthesis (see Supplementary Note). g, Known higher-confidence genes, novel higher-confidence genes, and lower-confidence genes are all enriched in predicted chloroplast-targeted proteins (P < 0.011). h, A schematic summary illustrates the numbers of candidate genes in each category (panel f) and the specific functions of the genes with a known role in processes related to photosynthesis.

To identify genes with roles in photosynthesis, we developed a statistical analysis framework that leverages the presence of multiple alleles for many genes. This framework allows us to overcome several sources of false positives that have been difficult to account for with previous methods, including cases where the phenotype is not caused by the mapped disruption. For each gene, we counted the number of mutant alleles with and without a phenotype, and evaluated the likelihood of obtaining these numbers by chance given the total number of mutants in the library that exhibit the phenotype (Supplementary Table 11; Supplementary Note). We identified 303 candidate photosynthesis genes based on our statistical analysis above. These genes are enriched for membership in a diurnally regulated photosynthesis-related transcriptional cluster[13] (P < 10−11), are enriched for upregulation upon dark-to-light transitions[14] (P < 0.003), and encode proteins enriched for predicted chloroplast localization (P < 10−8). As expected[15], the candidate genes also encode a disproportionate number of GreenCut2 proteins (P < 10−8), which are conserved among photosynthetic organisms but absent from non-photosynthetic organisms[1]: 32 GreenCut2 proteins are encoded by the 303 candidate genes (11%), compared to ~3% in the entire genome. Photosynthesis occurs in two stages: the light reactions and carbon fixation. The light reactions convert solar energy into chemical energy, and require coordinated action of Photosystem II (PSII), Cytochrome b6f, Photosystem I (PSI), ATP synthase complexes, a plastocyanin or cytochrome c6 metalloprotein, as well as small molecule cofactors[16]. PSII and PSI are each assisted by peripheral light-harvesting complexes (LHCs) known as LHCII and LHCI, respectively. Carbon fixation is performed by enzymes in the Calvin-Benson-Bassham cycle, including the CO2-fixing enzyme Rubisco. In addition, most eukaryotic algae have a mechanism to concentrate CO2 around Rubisco to enhance its activity[17]. Sixty-five of the genes we identified encode proteins that were previously shown to play a role in photosynthesis or chloroplast function in Chlamydomonas or vascular plants (Fig. 3f). These include three PSII-LHCII subunits (PSBP1, PSBP2, and PSB27) and seven PSII-LHCII biogenesis factors (CGL54, CPLD10, HCF136, LPA1, MBB1, TBC2, and Cre02.g105650), two cytochrome b6f complex subunits (PETC and PETM) and six cytochrome b6f biogenesis factors (CCB2, CCS5, CPLD43, CPLD49, MCD1, and MCG1), five PSI-LHCI subunits (LHCA3, LHCA7, PSAD, PSAE, and PSAL) and nine PSI-LHCI biogenesis factors (CGL71, CPLD46, OPR120, RAA1, RAA2, RAA3, RAT2, Cre01.g045902, and Cre09.g389615), one protein required for ATP synthase function (PHT3), plastocyanin (PCY1) and two plastocyanin biogenesis factors (CTP2 and PCC1), 12 proteins involved in the metabolism of photosynthesis cofactors or signaling molecules (CHLD, CTH1, CYP745A1, DVR1, HMOX1, HPD2, MTF1, PLAP6, UROD3, Cre08.g358538, Cre13.g581850, and Cre16.g659050), three Calvin-Benson-Bassham Cycle enzymes (FBP1, PRK1, and SEBP1), two Rubisco biogenesis factors (MRL1 and RMT2), three proteins involved in the algal carbon concentrating mechanism (CAH3, CAS1, and LCIB), as well as proteins that play a role in photorespiration (GSF1), CO2 regulation of photosynthesis (Cre02.g146851), chloroplast morphogenesis (Cre14.g616600), chloroplast protein import (SDR17), and chloroplast DNA, RNA, and protein metabolism (DEG9, MSH1, MSRA1, TSM2, and Cre01.g010864) (Fig. 3h, Supplementary Table 12). We caution that not all genes previously demonstrated to be required for photosynthetic growth are detectable by this approach, especially the ones with paralogous genes in the genome, such as RBCS1 and RBCS2 that encode the small subunit of Rubisco[18]. Nonetheless, the large number of known factors recovered in our screen is a testament to the power of this approach. In addition to recovering these 65 genes with known roles in photosynthesis, our analysis identified 238 candidate genes with no previously reported role in photosynthesis. These 238 genes represent a rich set of targets to better understand photosynthesis. Because our screen likely yielded some false positives, we divided all genes into “higher-confidence” (P < 0.0011; false discovery rate (FDR) < 0.27) and “lower-confidence” genes based on the number of alleles that supported each gene’s involvement in photosynthesis (Fig. 3d-f; Tables 1 and 2; Supplementary Note). The 21 higher-confidence genes with no previously reported role in photosynthesis are enriched in chloroplast localization (9/21, P < 0.011; Fig. 3g) and transcriptional upregulation during dark to light transition (5/21, P < 0.005), similar to the known photosynthesis genes. Thus, these 21 higher-confidence genes are particularly high-priority targets for the field to pursue.

Table 1 ∣

Higher-confidence genes from the photosynthesis screen that had a previously known role in photosynthesis.

Category	Gene	Defline/description in Phytozome[12]	PredAlgo[a]	Alleles in two replicates			At homolog[e]	Reference and the corresponding organism(s)
Category	Gene	Defline/description in Phytozome[12]	PredAlgo[a]	+[b]	−[c]	FDR[d]	At homolog[e]	Reference and the corresponding organism(s)
Calvin-Benson-Bassham cycle	Cre03.g185550 (SEBP1, SBP1)	Sedoheptulose-1,7-bisphosphatase	C	3	0	0.021	AT3G55800.1 (SBPASE)	Arabidopsis[29]
	Cre03.g185550 (SEBP1, SBP1)	Sedoheptulose-1,7-bisphosphatase	C	3	0	0.018	AT3G55800.1 (SBPASE)	Arabidopsis[29]
	Cre12.g524500 (RMT2)	Rubisco small subunit N-methyltransferase	O	3	0	0.021	AT3G07670.1	Pisum[30]
	Cre12.g524500 (RMT2)	Rubisco small subunit N-methyltransferase	O	3	0	0.018	AT3G07670.1	Pisum[30]
	Cre06.g298300 (MRL1, PPR2)	Pentatricopeptide repeat protein, stabilizes rbcL mRNA	C	1	1	1.000	AT4G34830.1 (MRL1)	Chlamydomonas and Arabidopsis[31]
	Cre06.g298300 (MRL1, PPR2)	Pentatricopeptide repeat protein, stabilizes rbcL mRNA	C	2	0	0.239	AT4G34830.1 (MRL1)	Chlamydomonas and Arabidopsis[31]
Carbon concentrating mechanism	Cre12.g497300 (CAS1, TEF2)	Rhodanese-like Ca-sensing receptor	C	2	0	0.260	AT5G23060.1 (CaS)	Chlamydomonas[32]
	Cre12.g497300 (CAS1, TEF2)	Rhodanese-like Ca-sensing receptor	C	2	0	0.239	AT5G23060.1 (CaS)	Chlamydomonas[32]
	Cre10.g452800 (LCIB)	Low-CO2-inducible protein	C	2	0	0.260	-	Chlamydomonas[33]
	Cre10.g452800 (LCIB)	Low-CO2-inducible protein	C	1	1	1.000	-	Chlamydomonas[33]
Chloroplast and thylakoid morphogenesis	Cre14.g616600	-	M	4	3	0.021	AT1G03160.1 (FZL)	Arabidopsis[34]
Chloroplast and thylakoid morphogenesis	Cre14.g616600	-	M	4	3	0.018	AT1G03160.1 (FZL)	Arabidopsis[34]
Cofactor and signaling molecule metabolism	Cre13.g581850	-	M	5	5	0.010	AT4G31390.1	Arabidopsis[35]
	Cre13.g581850	-	M	2	8	1.000	AT4G31390.1	Arabidopsis[35]
	Cre10.g423500 (HMOX1, HMO1)	Heme oxygenase	C	3	0	0.021	AT1G69720.1 (HO3)	Chlamydomonas[14]
	Cre10.g423500 (HMOX1, HMO1)	Heme oxygenase	C	3	0	0.018	AT1G69720.1 (HO3)	Chlamydomonas[14]
	Cre03.g188700 (PLAP6, PLP6)	Plastid lipid associated protein, Fibrillin	C	3	1	0.070	AT5G09820.2	Arabidopsis[36]
	Cre03.g188700 (PLAP6, PLP6)	Plastid lipid associated protein, Fibrillin	C	3	1	0.056	AT5G09820.2	Arabidopsis[36]
	Cre16.g659050	-	C	4	6	0.098	AT1G68890.1	Chlamydomonas[37]
	Cre16.g659050	-	C	4	6	0.075	AT1G68890.1	Chlamydomonas[37]
PSI protein synthesis and assembly	Cre12.g524300 (CGL71)	Predicted protein	C	2	0	0.260	AT1G22700.1	Synechocystis[38]; Arabidopsis[39]; Chlamydomonas[40]
	Cre12.g524300 (CGL71)	Predicted protein	C	2	0	0.239	AT1G22700.1	Synechocystis[38]; Arabidopsis[39]; Chlamydomonas[40]
	Cre01.g045902	-	C	1	1	1.000	AT3G24430.1 (HCF101)	Arabidopsis[41,42]
	Cre01.g045902	-	C	2	0	0.239	AT3G24430.1 (HCF101)	Arabidopsis[41,42]
PSI RNA splicing and stabilization	Cre09.g389615	-	M	5	0	0.0002	AT3G17040.1 (HCF107)	Chlamydomonas[43]; Arabidopsis[42,44,f]
	Cre09.g389615	-	M	5	0	0.0002	AT3G17040.1 (HCF107)	Chlamydomonas[43]; Arabidopsis[42,44,f]
	Cre01.g027150 (CPLD46, HEL5)	DEAD/DEAH-box helicase	M	5	1	0.0004	AT1G70070.1 (EMB25, ISE2, PDE317)	Arabidopsis[45]
	Cre01.g027150 (CPLD46, HEL5)	DEAD/DEAH-box helicase	M	5	1	0.0003	AT1G70070.1 (EMB25, ISE2, PDE317)	Arabidopsis[45]
	Cre09.g394150 (RAA1)	-	M	5	1	0.0004	-	Chlamydomonas[46]
	Cre09.g394150 (RAA1)	-	M	5	1	0.0003	-	Chlamydomonas[46]
	Cre12.g531050 (RAA3)	PsaA mRNA maturation factor 3	C	3	0	0.021	-	Chlamydomonas[47]
	Cre12.g531050 (RAA3)	PsaA mRNA maturation factor 3	C	3	0	0.018	-	Chlamydomonas[47]
	Cre10.g440000 (OPR120)	-	C	2	0	0.260	-	Chlamydomonas[48,49]
	Cre10.g440000 (OPR120)	-	C	2	0	0.239	-	Chlamydomonas[48,49]
PSII protein synthesis and assembly	Cre13.g578650 (CPLD10, NUOAF5)	Similar to complex I intermediate-associated protein 30	C	3	3	0.260	AT1G16720.1 (HCF173)	Arabidopsis[42,50,51]
	Cre13.g578650 (CPLD10, NUOAF5)	Similar to complex I intermediate-associated protein 30	C	3	3	0.208	AT1G16720.1 (HCF173)	Arabidopsis[42,50,51]
	Cre02.g073850 (CGL54)	Predicted protein	C	2	0	0.260	AT1G05385.1 (LPA19, Psb27-H1)	Arabidopsis[52]
	Cre02.g073850 (CGL54)	Predicted protein	C	2	0	0.239	AT1G05385.1 (LPA19, Psb27-H1)	Arabidopsis[52]
	Cre02.g105650	-	C	2	0	0.260	AT5G51545.1 (LPA2)	Arabidopsis[53]
	Cre02.g105650	-	C	2	0	0.239	AT5G51545.1 (LPA2)	Arabidopsis[53]
	Cre06.g273700 (HCF136)	-	C	2	0	0.260	AT5G23120.1 (HCF136)	Arabidopsis[42]; Synechocystis[54]
	Cre06.g273700 (HCF136)	-	C	1	1	1.000	AT5G23120.1 (HCF136)	Arabidopsis[42]; Synechocystis[54]
	Cre10.g430150 (LPA1, REP27)	-	C	2	0	0.260	AT1G02910.1 (LPA1)	Arabidopsis[55]
	Cre10.g430150 (LPA1, REP27)	-	C	1	1	1.000	AT1G02910.1 (LPA1)	Arabidopsis[55]

Prediction of protein localization by PredAlgo[56]: C = chloroplast, M = mitochondrion, SP = secretory pathway, O = other.

The number of exon/intron/5’UTR mutant alleles for that gene that satisfy our requirement of minimum 50 reads and showed at least 10X fewer normalized reads in the TP-light sample compared to the TAP-dark sample.

The number exon/intron/5’UTR mutant alleles for that gene that satisfy our minimum read count requirement but did not satisfy the at least 10X depletion in TP-light criterion.

the FDR for that gene compared to all alleles for all genes (see Supplementary Note).

Arabidopsis homolog, obtained from the “best_arabidopsis_TAIR10_hit_name” field in Phytozome[12].

AT3G17040.1 is required for functional PSII in Arabidopsis whereas Cre09.g389615 was shown to be involved in PSI accumulation in Chlamydomonas.

Table 2 ∣

Higher-confidence genes from the photosynthesis screen with no previously known role in photosynthesis.

Gene	Defline/description in Phytozome	PredAlgo	Alleles in two replicates			At homolog
Gene	Defline/description in Phytozome	PredAlgo	+	−	FDR	At homolog
Cre01.g008550	Serine/threonine kinase-related	O	2	0	0.260	AT1G73450.1
Cre01.g008550	Serine/threonine kinase-related	O	1	1	1.000	AT1G73450.1
Cre01.g014000	-	C	3	0	0.021	-
Cre01.g014000	-	C	3	0	0.018	-
Cre01.g037800 (TRX21)	ATP binding protein; thioredoxin domain	O	3	3	0.260	AT2G18990.1 (TXND9)
Cre01.g037800 (TRX21)	ATP binding protein; thioredoxin domain	O	1	5	1.000	AT2G18990.1 (TXND9)
Cre02.g073900	All-trans-10'-apo-beta-carotenal 13,14-cleaving dioxygenase	C	3	1	0.070	AT4G32810.1 (ATCCD8, CCD8, MAX4)
Cre02.g073900	All-trans-10'-apo-beta-carotenal 13,14-cleaving dioxygenase	C	3	1	0.056	AT4G32810.1 (ATCCD8, CCD8, MAX4)
Cre02.g111550	Serine/threonine kinase-related	SP	10	8	< 10⁻⁶	AT4G24480.1
Cre02.g111550	Serine/threonine kinase-related	SP	6	12	0.015	AT4G24480.1
Cre03.g185200 (CPL3, MPA6)	Metallophosphoesterase/metallodependent phosphatase	C	3	4	0.260	AT1G07010.1
Cre03.g185200 (CPL3, MPA6)	Metallophosphoesterase/metallodependent phosphatase	C	3	4	0.239	AT1G07010.1
Cre06.g259100	-	C	1	4	1.000	-
Cre06.g259100	-	C	3	2	0.117	-
Cre06.g281800	Domain of unknown function (DUF1995)	C	3	0	0.021	-
Cre06.g281800	Domain of unknown function (DUF1995)	C	3	0	0.018	-
Cre07.g316050 (CDJ2)	Chloroplast DnaJ-like protein	M	2	0	0.260	AT5G59610.1
Cre07.g316050 (CDJ2)	Chloroplast DnaJ-like protein	M	1	1	1.000	AT5G59610.1
Cre07.g341850 (EIF2, INFB)	Translation initiation factor IF-2, chloroplastic	C	2	0	0.260	AT1G17220.1 (FUG1)
Cre07.g341850 (EIF2, INFB)	Translation initiation factor IF-2, chloroplastic	C	2	0	0.239	AT1G17220.1 (FUG1)
Cre08.g358350 (TDA1, OPR34)	Fast leu-rich domain-containing[a]	C	3	2	0.152	-
Cre08.g358350 (TDA1, OPR34)	Fast leu-rich domain-containing[a]	C	3	2	0.117	-
Cre09.g396250 (VTE5)	Phosphatidate cytidylyltransferase	SP	2	0	0.260	AT5G04490.1 (VTE5)
Cre09.g396250 (VTE5)	Phosphatidate cytidylyltransferase	SP	1	1	1.000	AT5G04490.1 (VTE5)
Cre10.g429650	Alpha/beta hydrolase family (Abhydrolase_5)	O	2	0	0.260	-
Cre10.g429650	Alpha/beta hydrolase family (Abhydrolase_5)	O	1	1	1.000	-
Cre10.g448950	Nocturnin	C	1	1	1.000	AT3G58560.1
Cre10.g448950	Nocturnin	C	2	0	0.239	AT3G58560.1
Cre11.g467712	Structural maintenance of chromosomes smc family member; starch-binding domain	M	7	7	0.0003	AT5G05180.1
Cre11.g467712		M	7	7	0.0003	AT5G05180.1
Cre12.g542569	Ionotropic glutamate receptor	O	0	2	1.000	AT1G05200.1 (ATGLR3.4, GLR3.4, GLUR3)
Cre12.g542569	Ionotropic glutamate receptor	O	2	0	0.239	AT1G05200.1 (ATGLR3.4, GLR3.4, GLUR3)
Cre13.g566400 (OPR55)	Fast leu-rich domain-containing[a]	M	4	2	0.018	-
Cre13.g566400 (OPR55)	Fast leu-rich domain-containing[a]	M	4	2	0.015	-
Cre13.g574000 (GEF1, CLV1)	Voltage-gated chloride channel	O	1	11	1.000	AT5G26240.1 (ATCLC-D, CLC-D)
Cre13.g574000 (GEF1, CLV1)	Voltage-gated chloride channel	O	4	8	0.144	AT5G26240.1 (ATCLC-D, CLC-D)
Cre13.g586750	Transportin 3 and importin	O	3	4	0.260	AT5G62600.1
Cre13.g586750	Transportin 3 and importin	O	2	5	1.000	AT5G62600.1
Cre16.g658950	-	C	2	2	0.909	-
Cre16.g658950	-	C	3	1	0.056	-
Cre50.g761497	Magnesium transporter mrs2 homolog, mitochondrial	M	2	0	0.260	AT5G22830.1 (ATMGT10, GMN10, MGT10, MRS2-11)
Cre50.g761497	Magnesium transporter mrs2 homolog, mitochondrial	M	2	0	0.239	AT5G22830.1 (ATMGT10, GMN10, MGT10, MRS2-11)

The annotation of “fast leu-rich domain-containing” cannot be confirmed by BLASTp analysis at NCBI[57].

Functional annotations for 15 of the 21 higher-confidence genes suggest that these genes could play roles in regulation of photosynthesis, photosynthetic metabolism, and biosynthesis of the photosynthetic machinery. Seven of the genes likely play roles in regulation of photosynthesis: GEF1 encodes a voltage-gated channel, Cre01.g008550 and Cre02.g111550 encode putative protein kinases, CPL3 encodes a predicted protein phosphatase, TRX21 contains a thioredoxin domain, Cre12.g542569 encodes a putative glutamate receptor, and Cre13.g586750 contains a predicted nuclear importin domain. Six of the genes are likely involved in photosynthetic metabolism: the Arabidopsis homolog of Cre10.g448950 modulates sucrose and starch accumulation[19], Cre11.g467712 contains a starch-binding domain, Cre02.g073900 encodes a putative carotenoid dioxygenase, VTE5 encodes a putative phosphatidate cytidylyltransferase, Cre10.g429650 encodes a putative alpha/beta hydrolase, and Cre50.g761497 contains a magnesium transporter domain. Finally, two of the genes are likely to play roles in the biogenesis and function of photosynthesis machinery: EIF2 has a translation initiation factor domain, and CDJ2 has a chloroplast DnaJ domain. Future characterization of these genes by the community is likely to yield fundamental insights into our understanding of photosynthesis. As an illustration of the value of genes identified in this screen, we sought to explore the specific function of one of the higher-confidence candidate genes, CPL3 (Conserved in Plant Lineage 3, Cre03.g185200, also known as MPA6), which encodes a putative protein phosphatase (Fig. 4a, Supplementary Fig. 6). Many proteins in the photosynthetic apparatus are phosphorylated, but the role and regulation of these phosphorylations are poorly understood[20]. An insertion junction mapped to the 3’ UTR of CPL3 was previously found in a collection of acetate-requiring mutants, although it was not determined if this mutation caused the phenotype[15]. In our screen, three mutants with insertion junctions in CPL3 exons or introns exhibited a deficiency in photosynthetic growth (Fig. 3c, Supplementary Table 13). We chose to examine one allele (LMJ.RY0402.153647, referred to hereafter as cpl3; Fig. 4a, Supplementary Fig. 6a) for phenotypic confirmation, genetic complementation, and further studies.

Fig. 4 ∣

CPL3 is required for photosynthetic growth and accumulation of photosynthetic protein complexes in the thylakoid membranes.

a, The cpl3 mutant contains cassettes inserted in the first exon of CPL3. The locations of conserved protein phosphatase motifs are indicated (see Supplementary Fig. 6e). b, cpl3 is deficient in growth under photosynthetic conditions and can be rescued upon complementation with the wild-type CPL3 gene (comp1–3 represent three independent complemented lines). c, cpl3 has a lower relative photosynthetic electron transport rate than the wild-type strain (WT) and comp1. Error bars indicate standard deviations (n = 3 for WT and comp1; n = 7 for cpl3). d, Whole-cell proteomics (Supplementary Table 14) indicate that cpl3 is deficient in accumulation of PSII, PSI, and the chloroplast ATP synthase. Each gray dot represents one Chlamydomonas protein. PSII, PSI and ATP synthase subunits are highlighted as black or red symbols. e, Western blots show that CPL3 is required for normal accumulation of the PSII subunit CP43, the PSI subunit PsaA, and the chloroplast ATP synthase subunit ATPC. α-tubulin was used as a loading control. See also Supplementary Fig. 7c. f, A heatmap of the protein abundance of subunits in the light reactions protein complexes or enzymes in the CBB cycle in cpl3 relative to the wild type based on proteomics data. Depicted subunits that were not detected by proteomics are filled with gray. Nuclear- and chloroplast-encoded proteins are labeled in black and red fonts respectively. A stack of horizontal ovals indicates different isoforms for the same enzyme, such as FBA1, FBA2, and FBA3.

Consistent with the pooled growth data, cpl3 showed a severe defect in photosynthetic growth on agar, which was rescued under heterotrophic conditions (Fig. 4b). We confirmed that the CPL3 gene is disrupted in the cpl3 mutant and found that complementation with a wild-type copy of the CPL3 gene rescues the phenotype, demonstrating that the mutation in CPL3 is the cause of the growth defect of the mutant (Supplementary Note, Supplementary Fig. 6a-d). We then examined the photosynthetic performance, morphology of the chloroplast, and the composition of photosynthetic pigments and proteins in cpl3. Photosynthetic electron transport rate was decreased under all light intensities, suggesting a defect in the photosynthetic machinery (Fig. 4c). The chloroplast morphology of cpl3 appeared similar to the wild type based on chlorophyll fluorescence microscopy (Supplementary Fig. 7a). However, we observed a lower chlorophyll a/b ratio in cpl3 than in the wild type (Supplementary Fig. 7b), which suggests a defect in the accumulation or composition of the protein-pigment complexes involved in the light reactions[21]. Using whole-cell proteomics, we found that cpl3 was deficient in accumulation of all detectable subunits of the chloroplast ATP synthase (ATPC, ATPD, ATPG, AtpA, AtpB, AtpE, AtpF), some subunits of PSII (D1, D2, CP43, CP47, PsbE, PsbH), and some subunits of PSI (PsaA and PsaB) (FDR < 0.31 for each subunit, Fig. 4d,f; Supplementary Table 14). We confirmed these findings by western blots on CP43, PsaA, and ATPC (Fig. 4e; Supplementary Fig. 7c). Our results indicate that CPL3 is required for normal accumulation of thylakoid protein complexes (PSII, PSI, and ATP synthase) involved in the light reactions of photosynthesis. Our finding that 21/43 of the higher-confidence photosynthesis hit genes were uncharacterized suggests that nearly half of the genes required for photosynthesis remain to be characterized. This finding is notable, considering that genetic studies on photosynthesis extend back to the 1950s[22]. Our validation of CPL3’s role in photosynthesis illustrates the value of the uncharacterized hit genes identified in this study as a rich set of candidates for the community to pursue. More broadly, it is our hope that the mutant resource presented here will serve as a powerful complement to newly developed gene editing techniques[23-28], and that together these tools will help the research community generate fundamental insights in a wide range of fields, from organelle biogenesis and function to organism-environment interactions.

Methods

Generation of the indexed and barcoded mutant library.

A three-step pipeline was developed for the generation of an indexed, barcoded library of insertional mutants in Chlamydomonas (Fig. 1b, Supplementary Fig. 1). To generated mutants, CC-4533[58] (“wild type” in text and figures) cells were transformed with DNA cassettes that randomly insert into the genome, confer paromomycin resistance for selection, and inactivate the genes they insert into. Each cassette contained two unique 22 nucleotide barcodes, one at each end of the cassette (Supplementary Fig. 1a-d; Supplementary Note). Transformants were arrayed on agar plates and each insertion in a transformant would contain two barcodes. The barcode sequences as well as the insertion site were initially unknown (Supplementary Fig. 1e). To determine the sequences of the barcodes in each colony, combinatorial pools of the individual mutants were generated, with DNA extracted, and barcodes amplified and deep-sequenced. The combinatorial pooling patterns were designed so that each colony was included in a different combination of pools, allowing us to determine the barcode sequences associated with individual colonies based on which pools the sequences were found in (Supplementary Fig. 1f and Supplementary Fig. 2a-e; Supplementary Note). This procedure was similar in concept to the approach we used in our pilot study[9], but it consumed significantly less time because we used a simple PCR amplifying only the barcodes instead of a multi-step flanking sequence extraction protocol (ChlaMmeSeq[58]) on each combinatorial pool. To determine the insertion site associated with each barcode, the library was pooled into a single sample or six separate samples. The barcodes and their flanking genomic DNA were PCR amplified using LEAP-Seq[9] (Supplementary Fig. 1g and Supplementary Fig. 2f-j; Supplementary Note). The flanking sequences associated with each barcode were obtained by paired-end deep sequencing[59,60]. The final product is an indexed library in which each colony has known flanking sequences that identify the genomic insertion site, and barcode sequences that facilitate pooled screens in which individual mutants can be tracked by deep sequencing (Fig. 3a).

Insertion verification PCR.

The PCR reactions were performed in two steps to verify the insertion site[9] (Supplementary Table 6): (1) Genomic locus amplification: genomic primers that are ~1 kb away from the flanking genomic sequence reported by LEAP-Seq were used to amplify the genomic locus around the flanking sequence. If wild type produced the expected PCR band but the mutant did not produce it or produced a much larger product, this indicated that the genomic locus reported by LEAP-Seq may be disrupted by the insertional cassette and we proceeded to the second step; (2) Genome-cassette junction amplification: one primer binding to the cassette (oMJ913 for the 5’ side and oMJ944 for the 3’ side, Supplementary Table 6) and the other primer binding to flanking Chlamydomonas genomic DNA (one of the genomic primers from the first step) were used to amplify the genome-cassette junction. If the mutant produced a PCR band with expected size that was confirmed by sequencing but wild type did not produce the expected PCR band, we categorized this insertion as “confirmed.” In some mutants, genomic primers surrounding the site of insertion did not yield any PCR products in wild type or the mutant even after several trials, possibly due to incorrect reference genome sequence or local PCR amplification difficulties. These cases were grouped as “failed PCR” and were not further analyzed. 72 mutants (24 insertions each for confidence levels 1 and 2, confidence level 3 and confidence level 4) were chosen randomly from the library and tested. The genomic DNA template was prepared from a single colony of each mutant using the DNeasy Plant Mini Kit (69106, Qiagen). The PCRs were performed using the Taq PCR core kit (201225, Qiagen) as described before[58]. PCR products of the expected size were verified by Sanger sequencing.

Southern blotting.

Southern blotting was performed as previously described in detail[9]. Genomic DNA was digested with StuI enzyme (R0187L, New England Biolabs) and separated on a 0.7% Tris-borate-EDTA (TBE) agarose gel. The DNA in the gel was depurinated in 0.25 M HCl, denatured in a bath of 0.5 M NaOH, 1M NaCl, neutralized in a bath of 1.5 M Tris-HCl, pH 7.4, 1.5 M NaCl, and finally transferred onto a Zeta-probe membrane (1620159, Bio-Rad) overnight using the alkaline transfer protocol given in the manual accompanying the membrane. On the next day, the membrane was gently washed with saline-sodium citrate (2xSSC: 0.3 M NaCl, 0.03 M sodium citrate), dried with paper towel, and UV cross-linked twice using the Stratalinker1800 (Stratagene). For probe generation, the AphVIII gene on CIB1 was amplified using primers oMJ588 and oMJ589 (Supplementary Table 1). The PCR product was purified and labeled according to the protocol of Amersham Gene Images AlkPhos Direct Labeling and Detection System (RPN3690, GE Healthcare). The membrane was hybridized at 60°C overnight with10 ng probe/mL hybridization buffer. On the next day, the membrane was washed with primary and secondary wash buffers and then visualized using a CL-XPosure film (34093, Thermo Fisher).

Analyses of insertion distribution and identification of hot/cold spots.

A mappability metric was defined to quantify the fraction of all possible flanking sequences from any genomic region that can be uniquely mapped to that region[58]. Calculation of mappability, hot/cold spot analysis and simulations of random insertions were performed as described previously[58], except that a 30 bp flanking sequence lengths instead of a mix of 20 bp and 21 bp was used (because we now use 30 bp flanking sequence data derived from LEAP-Seq, rather than 20/21 bp ChlaMmeSeq sequences), and the v5.5 Chlamydomonas genome instead of the v5.3 genome was used[12]. This analysis was done on the original full set of mapped insertions, to avoid introducing bias from the choice of mutants into the consolidated set. The hot/cold spot analysis was performed on confidence level 1 insertions only, to avoid introducing bias caused by junk fragments and their imperfect correction. The full list of statistically significant hot/cold spots is provided in Supplementary Table 7.

Identification of underrepresented gene ontology (GO) terms.

For each GO category, we calculated the total number of insertions in all genes annotated with the GO term and the total mappable (mappability defined in a Supplementary Note) length of all such genes, and compared them to the total number of insertions in and total mappable length of the set of flagellar proteome genes[11]. We compared these numbers using Fisher’s exact test, and did correction for multiple comparisons[61] to obtain the false discovery rate (FDR). This analysis was done on the original full set of mapped insertions to avoid introducing bias from the choice of mutants into the consolidated set. We decided to use the flagellar proteome as the comparison set because flagellar genes are very unlikely to be essential; we did not use intergenic insertions or the entire genome because we know that the overall insertion density differs between genes and intergenic regions. The statistically significant results are listed in Supplementary Table 8.

Prediction of essential genes.

To predict essential genes in Chlamydomonas, we sought to generate a list of genes that have fewer insertions than would be expected randomly was generated. Among them, those with 0 insertion are considered candidate essential genes. To achieve these, for each gene, we calculated the total number of insertions in that gene and the total mappable length of that gene, and compared them to the total number of insertions in and total mappable length of the set of flagellar proteome genes[11], as what we have performed on each GO category. The resulting list of genes with statistically significantly fewer insertions than expected is discussed in a Supplementary Note and shown in Supplementary Table 9: this includes 203 genes with no insertions, and 558 genes with at least one insertion. However, only genes 5 kb or longer yield a false discovery rate (FDR) of 0.05 or less when they have no insertions - our overall density of insertions is not high enough to detect smaller essential genes.

Pooled Screens.

Library plates that were replicated once every four weeks onto fresh medium were switched to a 2-week replication interval to support uniform colony growth before pooling. Cells were pooled from 5-days-old library plates: first, for each set of eight agar plates, cells were scraped using the blunt side of a razor blade (55411–050, VWR) and resuspended in 40 mL liquid TAP medium in 50-mL conical tubes. Second, cells clumps were broken up by pipetting, using a P200 pipette tip attached to a 10-mL serological pipette. In addition, cells were pipetted through a 100 µm cell strainer (431752, Corning). Third, these sub-pools were combined as the master pool representing the full library. The master pool was washed with TP, and resuspended in TP. Multiple aliquots of 2 × 108 cells were pelleted by centrifugation (1,000g, 5 min, room temperature) and the supernatant was removed by decanting. Some aliquots were used for inoculation of pooled cultures, whereas other aliquots were frozen at −80 °C as initial pool samples for later barcode extraction to enable analysis of reproducibility between technical replicates. For pooled growth, 20 L TAP or TP in transparent Carboy containers (2251–0050, Nalgene) were inoculated with the initial pool to a final concentration of 2× 104 cells/mL. Cultures were grown under 22°C, mixed using a conventional magnetic stir bar and aerated with air filtered using a 1 µm bacterial air venting filter (4308, Pall Laboratory). The TAP culture was grown in dark. For the two replicate TP cultures, the light intensity measured at the surface of the growth container was initially 100 µmol photons m−2 s−1, and then increased to 500 µmol photons m−2 s−1 after the culture reached ~2× 105 cells/mL. When the culture reached the final cell density of 2× 106 cells/mL after 7 doublings, 2× 108 cells were pelleted by centrifugation (1,000g, 5 min, room temperature) for DNA extraction and barcode sequencing.

Molecular characterization of the cpl3 mutant.

Mutant genotyping PCRs were performed as previously described[9]. To complement the cpl3 mutant, the wild-type CPL3 gene was PCR amplified and cloned into the vector pRAM118 vector that contains the aph7’’ gene[62], which confers resistance to hygromycin B. In this construct, the expression of CPL3 is under the control of the PSAD promoter. The construct was linearized before being transformed into the cpl3 mutant. Transformants were robotically arrayed and assayed in colony sizes in the presence and absence of acetate respectively (Supplementary Fig. 6, c and d). Three representative lines that showed rescued photosynthetic growth were used in further phenotypic analyses (Fig. 4).

Analyses of growth, chlorophyll, and photosynthetic electron transport.

For all physiological and biochemical characterizations of cpl3 below, we grew cells heterotrophically in the dark to minimize secondary phenotypes due to defects in photosynthesis. For spot assays, cells were grown in TAP medium in dark to log phase to around 106 cells per mL. Cells were washed in TP and spotted onto solid TAP medium and TP medium respectively. The TAP plates were incubated in dark for 12 d before being imaged. The TP plates were incubated under 30 µmol photons m−2 s−1 light for 1 d, 100 µmol photons m−2 s1 light for 1 d, and then 500 µmol photons m−2 s−1 light for 4 d. Chlorophyll a and b concentrations were measured as previously described[63] using TAP-dark grown cells. We used TAP-dark-grown instead of TP-light-grown cells for chlorophyll analyses, photosynthetic performance analyses, microscopy, proteomics, and western blots (below) to avoid observing secondary effects due to the photosynthetic defects of the cpl3 mutants. To measure photosynthetic electron transport rate, TAP-dark grown cells were collected, re-suspended in fresh TAP medium, and dark acclimated for 20 min. Cells were then measured in chlorophyll fluorescence under a series of increasing light intensities using the “Light Curve” function on a DUAL-PAM-100 fluorometer (Walz). PSII quantum yield (ΦPSII) was quantified as previously described[64]. Relative electron transport rate (rETR) was calculated according to the following equation rETR = ΦPSII x I. I represents the emitted irradiance.

Proteomics.

TAP-dark-grown cells were collected by centrifugation and flash-frozen. Proteins were extracted from the frozen pellets by resuspension in lysis buffer (6M guandium Hydrochloride, 10mM tris(2-carboxyethyl)phosphine, 40mM chloroacetamide, 100mM Tris pH8.5, 1x MS-Safe protease inhibitor, 1x Phosphatase inhibitor cocktail II), grinding with liquid nitrogen, followed by sonication. Protein lysates were then digested with trypsin (Promega) into peptides. Three biological replicates were processed for each strain. The samples were labeled with tandem mass tags (TMTs), multiplexed and then fractionated before tandem mass spectrometry analyses. Briefly, each sample was labeled with the TMT labeling reagent (Thermo Fisher) according to the manufacturer’s instructions. The samples were then mixed in equimolar amounts and desalted using C18-stage tips[65]. The dried peptide mix was then separated using strong cation exchange (SCX) stage-tips[66] into four fractions. Each of the four fractions were then diluted with 1% trifluoroacetic acid (TFA) and separated into three fractions using SDB-RPS stage tips. This procedure initially resulted in a total of 12 fractions. Fractions 1–3 (the children of the first SCX fraction) were pooled together yielding 10 final fractions. Each final fraction was diluted and injected per run using an Easy-nLC 1200 UPLC system (Thermo Fisher). Samples were loaded onto a nano capillary column packed with 1.9 µm C18-AQ (Dr. Maisch) mated to metal emitter in-line with a Fusion Lumos (Thermo Fisher). Samples were eluted using a split gradient of 10–20% solution B (80% ACN with 0.1% FA) in 32 min and 20–40% solution B in 92 min followed column wash at 100% solution B for 10 min. The mass spectrometer was operated in a data-dependent mode with the 60,000 resolution MS1 scan (380–1500 m/z), AGC target of 4e5 and max injection time of 50ms. Peptides above threshold 5e3 and charges 2–7 were selected for fragmentation with dynamic exclusion after 1 time for 60 s and 10 ppm tolerance. MS1 isolation windows of 1.6m/z, MS2 isolation windows 2 and HCD NCE of 55% were selected. MS3 fragments were detected in the Orbitrap at 50,000 resolution in the mass range of 120–500 with AGC 5e4 and max injection time of 86 ms. The total duty cycle was set to 3.0 sec. Raw files were searched with MaxQuant[67], using default settings for MS3 reporter TMT 10-plex data. Files were searched against sequences of nuclear, mitochondrial, and chloroplast-encoded Chlamydomonas proteins supplemented with common contaminants[12,68,69]. Raw files were also analyzed within the Proteome Discoverer (Thermo Fisher) using the Byonic[70] search node (Protein Metrics). Data from Maxquant and Proteome Discoverer were combined in Scaffold Q+ (Proteome Software Inc.), which was used to validate MS/MS based peptide and protein identifications. Peptide identifications were accepted if they could be established at greater than 80.0% probability by the Scaffold Local FDR algorithm. Protein identifications were accepted if they could be established at greater than 96.0% probability and contained at least 2 identified peptides. Scaffold Q+ un-normalized data were exported in the format of the log2 value of the reporter ion intensities, which reflect the relative abundances of the same protein among different samples multiplexed. Each sample was then normalized to a median of 0 (by subtracting the original median from the raw values, since the values are log2). For each gene, for each pair of samples, the normalized log2 intensity values from the three replicates of one sample were compared against those for the other sample using a standard t-test. The resulting P values were adjusted for multiple testing[61], yielding a false discovery rate (FDR) for each gene in each pair of samples. We note that our calculation of FDR does not take into account the spectral count of each protein (provided in Supplementary Table 14), which is related to the absolute abundance of the protein and impacts the accuracy of proteomic measurements. Specifically, proteins with a low spectral count are likely of low abundance in cells and often exhibit a large variation in the intensity value between the biological replicates.

Western blotting.

TAP-dark grown cells were pelleted by centrifugation, resuspended in an extraction buffer containing 5 mM HEPES-KOH, pH 7.5, 100 mM dithiothreitol, 100 mM Na2CO3, 2% (w/v) SDS, and 12% (w/v) sucrose, and lysed by boiling for 1 min. Extracted proteins were separated on SDS-PAGE (12% precast polyacrylamide gels, Bio-Rad) using tubulin as a loading and normalization control. Polypeptides were transferred onto polyvinylidene difluoride membranes using a semidry blotting apparatus (Bio-Rad) at 15 volts for 30 minutes. For western blot analyses, membranes were blocked for 1 h at room temperature in Tris-buffered saline-0.1% (v/v) Tween containing 5% powdered milk followed by a 1 h incubation of the membranes at room temperature with the primary antibodies in Tris-buffered saline-0.1% (v/v) Tween containing powdered milk (3% [w/v]). Primary antibodies were diluted according to the manufacturer’s recommendations. All antibodies were from Agrisera and the catalog numbers for the antibodies against CP43, PsaA, ATPC, and α-tubulin were AS11–1787, AS06–172-100, AS08–312, and AS10–680, respectively. Proteins were detected by enhanced chemiluminescence (K-12045-D20, Advansta) and imaged on a medical film processor (Konica) as previously described[9].

Additional methods.

Additional method details are provided in a Supplementary Note.

Statistical analyses.

Statistical methods and tests used are provided throughout the manuscript. Fisher’s exact test with Benjamini-Hochberg correction[61] for multiple comparisons was used to identify underrepresented gene ontology terms, essential genes, hit genes in the photosynthesis screen, and for the analysis of candidate gene enrichment. The binomial test with Benjamini-Hochberg correction for multiple comparisons was used for the hot/cold spot analysis. A t-test with Benjamini-Hochberg correction for multiple comparisons was used for analysis of the proteomics data. Please see the corresponding Methods or Supplementary Note section for details on each analysis.

Life Sciences Reporting Summary.

Further information on experimental design is available in the Life Sciences Reporting Summary linked to this paper.

Code availability.

All programs written for this work are deposited at GitHub (see URLs).

Data availability.

Mutants’ insertion details and distribution information are available through the CLiP website: https://www.chlamylibrary.org/. The mass spectrometry proteomics data on cpl3 have been deposited to the ProteomeXchange Consortium via the PRIDE[71] partner repository with the dataset identifier PXD012560. Other data that support the findings of this study are available from the corresponding author upon request.

67 in total

Review 1. Chlamydomonas and Arabidopsis. A dynamic duo.

Authors: Benjamin L Gutman; Krishna K Niyogi
Journal: Plant Physiol Date: 2004-06 Impact factor: 8.340

2. The essential gene set of a photosynthetic organism.

Authors: Benjamin E Rubin; Kelly M Wetmore; Morgan N Price; Spencer Diamond; Ryan K Shultzaberger; Laura C Lowe; Genevieve Curtin; Adam P Arkin; Adam Deutschbauer; Susan S Golden
Journal: Proc Natl Acad Sci U S A Date: 2015-10-27 Impact factor: 11.205

3. GENETIC CONTROL OF PHOTOSYNTHESIS IN CHLAMYDOMONAS REINHARDI.

Authors: R P Levine
Journal: Proc Natl Acad Sci U S A Date: 1960-07 Impact factor: 11.205

4. The GreenCut2 resource, a phylogenomically derived inventory of proteins specific to the plant lineage.

Authors: Steven J Karpowicz; Simon E Prochnik; Arthur R Grossman; Sabeeha S Merchant
Journal: J Biol Chem Date: 2011-04-22 Impact factor: 5.157

5. AtCCR4a and AtCCR4b are Involved in Determining the Poly(A) Length of Granule-bound starch synthase 1 Transcript and Modulating Sucrose and Starch Metabolism in Arabidopsis thaliana.

Authors: Yuya Suzuki; Toshihiro Arae; Pamela J Green; Junji Yamaguchi; Yukako Chiba
Journal: Plant Cell Physiol Date: 2015-01-27 Impact factor: 4.927

6. LOW PSII ACCUMULATION1 is involved in efficient assembly of photosystem II in Arabidopsis thaliana.

Authors: Lianwei Peng; Jinfang Ma; Wei Chi; Jinkui Guo; Shuyong Zhu; Qingtao Lu; Congming Lu; Lixin Zhang
Journal: Plant Cell Date: 2006-03-10 Impact factor: 11.277

7. Retrograde bilin signaling enables Chlamydomonas greening and phototrophic survival.

Authors: Deqiang Duanmu; David Casero; Rachel M Dent; Sean Gallaher; Wenqiang Yang; Nathan C Rockwell; Shelley S Martin; Matteo Pellegrini; Krishna K Niyogi; Sabeeha S Merchant; Arthur R Grossman; J Clark Lagarias
Journal: Proc Natl Acad Sci U S A Date: 2013-01-23 Impact factor: 11.205

8. MRL1, a conserved Pentatricopeptide repeat protein, is required for stabilization of rbcL mRNA in Chlamydomonas and Arabidopsis.

Authors: Xenie Johnson; Katia Wostrikoff; Giovanni Finazzi; Richard Kuras; Christian Schwarz; Sandrine Bujaldon; Joerg Nickelsen; David B Stern; Francis-André Wollman; Olivier Vallon
Journal: Plant Cell Date: 2010-01-22 Impact factor: 11.277

9. Targeting of Photoreceptor Genes in Chlamydomonas reinhardtii via Zinc-Finger Nucleases and CRISPR/Cas9.

Authors: Andre Greiner; Simon Kelterborn; Heide Evers; Georg Kreimer; Irina Sizova; Peter Hegemann
Journal: Plant Cell Date: 2017-10-04 Impact factor: 11.277

10. DNA-free two-gene knockout in Chlamydomonas reinhardtii via CRISPR-Cas9 ribonucleoproteins.

Authors: Kwangryul Baek; Duk Hyoung Kim; Jooyeon Jeong; Sang Jun Sim; Anastasios Melis; Jin-Soo Kim; EonSeon Jin; Sangsu Bae
Journal: Sci Rep Date: 2016-07-28 Impact factor: 4.379

52 in total

1. The Mars1 kinase confers photoprotection through signaling in the chloroplast unfolded protein response.

Authors: Karina Perlaza; Hannah Toutkoushian; Morgane Boone; Mable Lam; Masakazu Iwai; Martin C Jonikas; Peter Walter; Silvia Ramundo
Journal: Elife Date: 2019-10-15 Impact factor: 8.140

2. Scaffold subunits support associated subunit assembly in the Chlamydomonas ciliary nexin-dynein regulatory complex.

Authors: Long Gui; Kangkang Song; Douglas Tritschler; Raqual Bower; Si Yan; Aguang Dai; Katherine Augspurger; Jason Sakizadeh; Magdalena Grzemska; Thomas Ni; Mary E Porter; Daniela Nicastro
Journal: Proc Natl Acad Sci U S A Date: 2019-10-28 Impact factor: 11.205

3. A life-history trade-off gene with antagonistic pleiotropic effects on reproduction and survival in limiting environments.

Authors: Rani M S Saggere; Christopher W J Lee; Irina C W Chan; Dion G Durnford; Aurora M Nedelcu
Journal: Proc Biol Sci Date: 2022-01-26 Impact factor: 5.349

4. Comparative genomics in Chlamydomonas: understanding the past, envisioning the future.

Authors: Michela Osnato
Journal: Plant Cell Date: 2021-05-31 Impact factor: 11.277

5. MAR1 links membrane adhesion to membrane merger during cell-cell fusion in Chlamydomonas.

Authors: Jennifer F Pinello; Yanjie Liu; William J Snell
Journal: Dev Cell Date: 2021-11-22 Impact factor: 12.270

6. Life cycle and functional genomics of the unicellular red alga Galdieria for elucidating algal and plant evolution and industrial use.

Authors: Shunsuke Hirooka; Takeshi Itabashi; Takako M Ichinose; Ryo Onuma; Takayuki Fujiwara; Shota Yamashita; Lin Wei Jong; Reiko Tomita; Atsuko H Iwane; Shin-Ya Miyagishima
Journal: Proc Natl Acad Sci U S A Date: 2022-10-04 Impact factor: 12.779

Review 7. Prospects for Engineering Biophysical CO₂ Concentrating Mechanisms into Land Plants to Enhance Yields.

Authors: Jessica H Hennacy; Martin C Jonikas
Journal: Annu Rev Plant Biol Date: 2020-03-09 Impact factor: 26.379

8. Single-cell RNA sequencing of batch Chlamydomonas cultures reveals heterogeneity in their diurnal cycle phase.

Authors: Feiyang Ma; Patrice A Salomé; Sabeeha S Merchant; Matteo Pellegrini
Journal: Plant Cell Date: 2021-05-31 Impact factor: 12.085

9. Co-expression networks in Chlamydomonas reveal significant rhythmicity in batch cultures and empower gene function discovery.

Authors: Patrice A Salomé; Sabeeha S Merchant
Journal: Plant Cell Date: 2021-05-31 Impact factor: 12.085

10. High Carotenoid Mutants of Chlorella vulgaris Show Enhanced Biomass Yield under High Irradiance.

Authors: Zeno Guardini; Luca Dall'Osto; Simone Barera; Mehrdad Jaberi; Stefano Cazzaniga; Nicola Vitulo; Roberto Bassi
Journal: Plants (Basel) Date: 2021-05-01