| Literature DB >> 21649902 |
Rob W Ness1, Mathieu Siol, Spencer C H Barrett.
Abstract
BACKGROUND: The shift from cross-fertilization to predominant self-fertilization is among the most common evolutionary transitions in the reproductive biology of flowering plants. Increased inbreeding has important consequences for floral morphology, population genetic structure and genome evolution. The transition to selfing is usually characterized by a marked reduction in flower size and the loss of traits involved in pollinator attraction and the avoidance of self-fertilization. Here, we use short-read sequencing to assemble, de novo, the floral transcriptomes of three genotypes of Eichhornia paniculata, including an outcrosser and two genotypes from independently derived selfers, and a single genotype of the sister species E. paradoxa. By sequencing mRNA from tissues sampled at various stages of flower development, our goal was to sequence and assemble the floral transcriptome and identify differential patterns of gene expression.Entities:
Mesh:
Year: 2011 PMID: 21649902 PMCID: PMC3128866 DOI: 10.1186/1471-2164-12-298
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Sequencing statistics for this study.
| Sample | Number of | Total raw | Reads after | Bases after |
|---|---|---|---|---|
| 39,198,840 | 1,567,953,600 | 24,539,860 | 812,538,461 | |
| 38,374,454 | 1,534,978,160 | 24,796,504 | 827,426,981 | |
| 39,047,450 | 1,561,898,000 | 23,555,014 | 764,543,073 | |
| 38,980,224 | 1,559,208,960 | 22,323,972 | 740,370,472 |
The number of reads and total amount of sequence generated for each sample of Eichhornia. Values are shown for both raw sequence and the sequence after trimming from reads with low quality bases.
Figure 1Histogram of the frequency of different contigs sizes in transcriptome assemblies of . Blue bars represent the distribution of contig sizes for the initial de novo Oases assemblies and the red bars represent the final consensus transcriptome after being processed in our pipeline. The distribution has shifted to the right indicating more long contigs.
Summary statistics for reference transcriptome through progressive stages of assembly.
| Assembly stage | Mean | Total | Number | |
|---|---|---|---|---|
| Oases assembly | 611 | 374.1 | 21,311,238 | 56,971 |
| Reciprocal BLAST | 807 | 703.5 | 15,919,812 | 22,630 |
| Contig elongation | 812 | 709.9 | 16,065,194 | 22,630 |
| Reduced fragmentation | 819 | 717.8 | 15,995,086 | 22,282 |
| Final (including singletons)1 | 1,129 | 884.3 | 23,869,762 | 26,994 |
1 Singletons over 1000 bp with BLASTx hits where e < 1 × 10-15 are included in the final consensus
For the first stage all statistics are the average across the four independently sequenced genotypes.
Summary statisitcs for read mapping.
| Sample | Number | Mean coverage | Number of | Total sequence |
|---|---|---|---|---|
| 20,653 | 21.8 ± 33.7 | 3,994 | 17,308,982 | |
| 24,849 | 24.3 ± 34.1 | 4,979 | 21,362,493 | |
| 24,527 | 21.7 ± 32.1 | 1,659 | 20,943,070 | |
| 24,142 | 20.5 ± 27.9 | 895 | 20,603,010 |
Results of read mapping using the original trimmed short reads for each of the four Eichhornia samples against the reference transcriptome
Number of single nucleotide polymorphisms (SNPs) between pairs of Eichhornia genotypes.
| Sample | ||||
|---|---|---|---|---|
| Brazil | Jamaica | Nicaragua | ||
| - | 0.048 | 0.048 | 0.047 | |
| 202,687 | - | 0.014 | 0.009 | |
| 200,576 | 58,410 | - | 0.012 | |
| 195,967 | 36,998 | 51,409 | - | |
Values reflect the polymorphism statistics for 5011 loci that were present in all four samples and homozygous in the three selfing genotypes. Values below the diagonal show the total number of SNPs and values above the diagonal show the number SNPs per bp (4,207,280 bp total).
Figure 2Proportion of each of the assembled sequences that is aligned to its top BLASTx protein hit versus the proportion of the top hit which is covered by the assembled query sequence. This plot demonstrates the size of our assembled transcripts relative to their homologs in other plant species. Dense clusters of points along the top of the figure represent loci entirely aligned to their respective protein hits and points along the right are genes fully covering their best BLAST protein hit.
Figure 3Distribution of genes in the transcriptome assembly assigned to broad GO categories. Cellular components (blue) and biological processes (red). Percentages indicate the proportion of sequences assigned within each subcategory using the program BLAST2GO, which assigns putative function to each transcript based on similarity to proteins with GO annotations from other organisms.
Figure 4Pairwise correlations of gene expression between the four genotypes: . Above the diagonal are the correlation coefficients for data plotted below the diagonal. Each pair below the diagonal is expression level plotted on a log scale, measured in FPKM, which estimates gene expression from the number of reads that are derived from each transcript in our samples. All correlations are significant at P < 0.00001.
Pollen and flower development genes that were differentially expressed in both independently derived selfing genotypes of Eichhornia paniculata and were also identified from selfing Eichhornia paradoxa.
| Homolog name | Fold expression change1 |
|---|---|
| 2Gibberellin receptor ( | 0.51**** |
| 2Peptide transport protein ( | 0.62**** |
| 2 | 0.64* |
| 2 | 0.77** |
| 2Exocyst complex component ( | 1.29* |
| 2SHK1 binding protein 1 ( | 1.38** |
| 2 | 1.64** |
| 2Cellulose synthase ( | 0.55*** |
| Pollen-pistil incompatibility 2 ( | 0.62**** |
| 0.70*** | |
| Auxin signaling F-box 2 ( | 0.71**** |
| Transport inhibitor response 1 ( | 0.89* |
| Beta-amylase ( | 0.99**** |
| Auxin response factor 8 ( | 1.08**** |
| 1.15** | |
| Suppressor of | 1.18* |
| Phosphoglucose isomerase ( | 1.35** |
| Auxin response factor 6 ( | 1.50** |
| Regulatory particle non-atpase 10 ( | 1.50* |
| Phosphatidylglycerolphosphate synthase 1 ( | 1.60* |
| myb transcription factor ( | 1.79**** |
| Glutaminyl-tRNA synthetase ( | 1.93**** |
* p < 0.05, ** p < 0.01, ***p < 0.001, ****p < 0.0001
1 Measured as the mean ratio of expression (FPKM) in each selfer relative to the outcrossing genotype
2 These genes were differentially expressed in the selfing genotypes of E. paniculata only