| Literature DB >> 25757610 |
Agustina Llanos1,2,3,4, Jean Marie François5,6,7, Jean-Luc Parrou8,9,10.
Abstract
BACKGROUND: A critical step in the RT-qPCR workflow for studying gene expression is data normalization, one of the strategies being the use of reference genes. This study aimed to identify and validate a selection of reference genes for relative quantification in Talaromyces versatilis, a relevant industrial filamentous fungus. Beyond T. versatilis, this study also aimed to propose reference genes that are applicable more widely for RT-qPCR data normalization in filamentous fungi.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25757610 PMCID: PMC4342825 DOI: 10.1186/s12864-015-1224-y
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
List of putative reference genes and genes of interest
|
|
|
|
|
|
|
|
|---|---|---|---|---|---|---|
| R1 | DUF221 domain protein ( | Vacuolar membrane (GO:0005774) | Transmembrane protein with unknown function | Fw: CGGAACGCCCCATTGACC | 95.1% | 126 bp |
| Rv: TTGGATGCTTATGTTTTGCTCTCG | ||||||
| R2 | Ubiquitin carrier protein ( | Ligase activity (GO:0016874) | Involved in the ubiquitin mediated proteolysis | Fw: TCGTTGAGTAGACTCTGAATGCTG | 99.2% | 125 bp |
| Cellular response to stress (GO:0033554) | ||||||
| Rv: AGCCAGATGTTCCACCCG | ||||||
| Cytoplasm (GO:0005737) | ||||||
| R3 | CECR1 family adenosine deaminase ( | Adenosine deaminase activity (GO:0004000) | Involved in the purin metabolism | Fw: CTGCGCAATGCAAAGTCATGTCTCTG | 100.7% | 97 bp |
| Rv: CCCAGGTCGAAGATCCCTTTATCCA | ||||||
| R4 | Mitochondrial membrane fission protein ( | Metal ion binding (GO:0046872) | Mitochondrial complex that promotes mitochondrial fission | Fw: GTTCAACTACGCCTGGGGACTC | 101.1% | 91 bp |
| Mitochondrial fission (GO:0000266) | ||||||
| Rv: AGCGGTGCGAAAAATCTGGG | ||||||
| Membrane (GO:0016020) | ||||||
| R5 | Copper-transporting ATPase ( | Nucleotide binding (GO:0000166) | Fw: TGGTGCCCTGTGCCAACTCTCCCAGTC | 103.6% | 78 bp | |
| Cellular metal ion homeostasis (GO:0006875) | ||||||
| Rv: TTGCTGCGGGTGCTTTTG | ||||||
| Membrane (GO:0016020) | ||||||
| R6 | Cohesin complex subunit ( | DNA secondary structure binding (GO:0000217) | Involved in chromosomes segregation during mitosis | Fw: GTATTTGCGGAGATCCAGAGTGAG | 93.1% | 102 bp |
| Mitotic sister chromatid segregation (GO:0000070) | Rv: TTGAAGACGGGTCTGTTCCA | |||||
| Nucleus (GO:0005634) | ||||||
| R7 | Spo7-like protein ( | Phosphatase activity (GO:0016791) | Involved in the spore formation process | Fw: GCCGATGGTGCTGATGTTGG | 102.5% | 110 bp |
| Sporulation (GO:0043934) | ||||||
| Rv: AGAACGCCAACGAGCCCG | ||||||
| Integral to membrane (GO:0016021) | ||||||
| R8 | SAGA-like transcriptional regulatory complex subunit Spt3 ( | Transferase activity (GO:0016740) | Component of the nuclosomal histone acetyltransferase (SAGA) complex | Fw: ACGACTTGTTGGCGGACG | 96.3% | 95 bp |
| Chromatin modification (GO:0016568) | ||||||
| Rv: GAGATTCAGCAGATGATGTTTGTC | ||||||
| Nucleus (GO:0005634) | ||||||
| R9 | DUF500 domain protein ( | Actin filament organization (GO:0007015) | Cytoskeleton organization | Fw: ACTTGGCCGGTTGTGCGTTC | 98.5% | 101 bp |
| Cytoplasm (GO:0005737) | Rv: TTGGTGTTCCGGCGGCTG | |||||
| R10 | Rho GTPase activator ( | Rho GTPase activator activity (GO:0005100) | Involved in signal transduction | Fw: AGGAGGATGAAAGTAAAGGACCCC | 100.5% | 159 bp |
| Small GTPase mediated signal transduction (GO:0007264) | ||||||
| Rv: AAACCCCACACTTGGCGAC | ||||||
| Intracellular (GO:0005622) | ||||||
| R11 | AP-2 adaptor complex subunit beta ( | Transporter activity (GO:0005215) | Involved in chlatrin-dependent endocytosis | Fw: TTTCGCACATAGGGGTCG | 98.4% | 148 bp |
| Rv: TTTTGGTCGATGATATGGACG | ||||||
| R12 | Protein translocation complex componenet ( | Protein transporter activity (GO:0008565) | Involved in the protein progression in endoplasmic reticulum | Fw: CGCTGGAACAAGAAAAATACG | 98.2% | 117 bp |
| Post-translational protein targeting to membrane (GO:0006620) | ||||||
| Rv: ACGAACGATATGCGCCAA | ||||||
| Endoplasmic reticulum (GO:0005783) | ||||||
|
| Beta-tubulin | Nucleotide binding (GO:0000166) | Cytoskeleton | Fw: GTTCTGGACGTTGCGCATCTG | 97.2% | 110 bp |
| Cytoskeleton (GO:0005856) | ||||||
| Rv: TGATGGCCGCTTCTGACTTCC | ||||||
|
| Arabinofuranosidase-B2 | Hydrolase activity, acting on glycosyl bonds (GO:0016798) | Sugar metabolism | Fw: CGGAGCTTGGGTGAGATGGTTC | 103.6% | 112 bp |
| Carbohydrate metabolic process (GO:0005975) | Rv: CGGCGGCGTTGCTAATGC | |||||
| Extracellular region (GO:0005576) | ||||||
|
| Xylanase C | Hydrolase activity, acting on glycosyl bonds (GO:0016798) | Sugar metabolism | Fw: CAAATGGCGACAATGGCG | 94.4% | 104 bp |
| Xylan metabolic process (GO:0045493) | Rv: TGAGTACGTGACAGTCTGTGCATTG | |||||
| Extracellular region (GO:0005576) |
The annotation and GO terms were taken from the Talaromyces versatilis genome (basionyme Penicillium funiculosum, ADISSEO proprietary sequence, unpublished). Forward (Fw) and reverse (Rv) primer sequences used for RT-qPCR. The two genes at the bottom of the table marked with an asterisk (*) correspond to the genes of interest.
Figure 1Distribution of the raw Cq values. For each gene, the box-plot gathered all the 66 raw Cq values obtained from the 33 duplicated culture conditions of T. versatilis. The lower and upper boundaries of the box (interquartile) represent the 25th and the 75th percentile, respectively. The line within the box corresponds to the median and the cross to the mean of the distribution, while the whiskers indicate the highest and the lowest Cq values, with the exception of the outliers that are represented by the squares outside the whiskers.
Figure 2geNorm–based ranking of the putative reference genes. (A) Genes were ranked from the least stable (on the left) to the most stable (on the right) according to their M value (Y axis). This classification was independently performed by using different sets of conditions: the ‘All conditions’ included the whole set of culture conditions studied by RT-qPCR in this work; the ‘C sources’ subset gathered the 18 samples obtained from growth on different sugars; the ‘Stress’ subset corresponded to 6 samples harvested during stress exposition; the ‘Germination’ subset included the 6 germination time points. For each set of conditions, the result of the classification was given below the X-axis (arbitrary colours attributed to each gene for the sake of clarity). (B) Result of the pairwise variation analysis between NFn and NFn+1 to determine the optimal number of genes for reliable normalization. Values below the 0.15 threshold mean that n genes might be sufficient.
Figure 3Hierarchical ANOVA of the putative reference genes. Three-level, nested ANOVA with ‘genes’ as the first level, ‘culture conditions’ as the second level and ‘biological replicates’ as the third level. As in Figure 2, this ANOVA was carried out using different sets of conditions: ‘C sources’ subset (A); ‘Stress’ subset (B); ‘Germination’ subset (C). Left graph: relative expression values (Log (base 2)) as a function of the different conditions for the different genes, taking as the control conditions the glucose sample (A & B subsets) and the T0 time-point for spore germination (C subset). Two values were used for each condition (i.e. duplicated experiment). Right panel: partitioning of the variance into the three levels (in %).
Figure 4Normalization bias analysis according to reference genes selection. (A) Example of normalized transcript levels of abf-B2 and xynC under different culture conditions (Thio-gentiobiose, Arbocel and C starvation samples) using 4 different Normalization Factors (NFs): NF(R2, R10, R6), NF(R5,R11,R3), NF(R3) and NF(. Log (base 2) of FC values on the Y axis (mean ± SD, n = 2 in this experiment) using glucose as the calibrator sample. (B) Comparison of NF(R2, R10, R6) to NF calculated from less stable genes, as well as from single genes such as R3 and β-tub. For each condition of interest (X axis, see Additional file 1), we calculated a normalization bias (i.e. under- or over-estimation of the normalised expression value of GOIs) as the ratio between the theoretically best NF (NF(R2, R10, R6) as determined from geNorm classification by using the entire set of conditions) and NF calculated from other combinations of reference genes. Log (base 2) of the normalization bias is represented on the Y axis. Yellow zone: less than 1.5 fold bias; Green zone: 1.5─2 fold bias; Blue zone: 2─3 fold bias; Red zone: 3─8 fold bias. (C) Quantile plot of the normalization bias values for each NF. The normalization bias (Log (base 2)) is represented on the X axis and the same colour code used in (B) was applied. The quantile fractions are represented on the Y axis. (♦) NF(R12,R7,R8), (☐) NF(R4,R9,R1), (◯) NF(R5,R11,R3), (+) NF(R3) and (×) NF(.
Figure 5Heat map of RNA-seq based expression data of putative reference genes, collected from 18 phylogenetically distant fungi. For each RNA-seq dataset (study), we calculated for each gene fold changes (FC) as the ratio between the expression in a condition of interest and the expression in the condition that was defined as the control in this study. Each line represents a condition of interest, each column a gene of interest (corresponding names of the genes are given in Figure 6). Genes have been distributed in three groups: the ‘R series’ that corresponds to 12 candidate reference genes pre-selected from T. versatilis data; the ‘C series’ that corresponds to more classic reference genes previously used in most of gene expression studies, including for filamentous fungi; and the ‘Sc series’ that corresponds to genes homologous to S. cerevisiae genes, which were previously validated as promising reference genes in this yeast species. Numbers reported in the heat map correspond to log (base 2) of FC values. Colour scale and correspondences between Log (base 2) and FC values are indicated in the legend (green colour set for a fold-change of 1 (log2 = 0); red colour arbitrary set for differential expression equal or higher than 12 (|log2| ≥ 3,5). Empty cells: data not available.
Figure 6Distribution of fold change values. For each gene, the box-plot gathered about a hundred log2(FC) values presented in Figure 5 (Legend as in Figure 1). The extreme values amongst outliers are marked with a red asterisk. The colours of the boxes relate the classes that were determined from the HAC (see Figure 7). From the top to the bottom, we listed the new candidates (‘R series’), the classical reference genes (‘C series’) and the putative references homologous to S. cerevisiae genes (Sc1─Sc4). The right panel resumes the type of distribution (normal or not), average, median and interquartile for each gene.
Figure 7Classification of the reference genes according to their median and interquartile. Scatter plot of the interquartile versus median. The clusters that were obtained by hierarchical classification (HAC, see Additional file 7) are circled with different colours. The black crosses indicate the centroid of each cluster.