| Literature DB >> 27835957 |
Bianca Baccili Zanotto Vigna1, Fernanda Ancelmo de Oliveira2, Guilherme de Toledo-Silva2,3, Carla Cristina da Silva2, Cacilda Borges do Valle4, Anete Pereira de Souza5,6.
Abstract
BACKGROUND: Urochloa humidicola (Koronivia grass) is a polyploid (6x to 9x) species that is used as forage in the tropics. Facultative apospory apomixis is present in most of the genotypes of this species, although one individual has been described as sexual. Molecular studies have been restricted to molecular marker approaches for genetic diversity estimations and linkage map construction. The objectives of the present study were to describe and compare the leaf transcriptome of two important genotypes that are highly divergent in terms of their phenotypes and reproduction modes: the sexual BH031 and the aposporous apomictic cultivar BRS Tupi.Entities:
Keywords: Brachiaria; RNA-seq; de novo transcriptome assembly; molecular markers; tropical grasses
Mesh:
Substances:
Year: 2016 PMID: 27835957 PMCID: PMC5106776 DOI: 10.1186/s12864-016-3270-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Urochloa humidicola genotypes sequenced. a) Sexual accession BH031 and b) apomictic cv. BRS Tupi; both are maintained at the germplasm bank at Embrapa Gado de Corte
Summary of Illumina sequencing output statistics
| BH031 | BRS Tupi | Total | ||
|---|---|---|---|---|
| Sequenced | Number of reads | 73779964 | 89795562 | 163575526 |
| Total bases (Gb) | 5.90 | 7.18 | 13.09 | |
| Filtered | Number of reads | 57322934 | 75878804 | 133201738 |
| Total bases (Gb) | 4.59 | 6.07 | 10.66 | |
| High quality data | 77.8 % | 84.5 % | 81.4 % |
Comparison of the de novo assemblies generated using the Trinity and CLC Genomics Workbench programs
| Trinity | CLC Genomics Workbench | ||
|---|---|---|---|
| K-mer (bp) | 25 | 25 | 45 |
| Total number of contigs | 76196 | 51291 | 44899 |
| Min contig length (bp) | 301 | 300 | 300 |
| Max contig length (bp) | 7392 | 6174 | 5506 |
| Mean contig length (bp) | 877 | 617 | 551 |
| Median contig length (bp) | 637 | 454 | 435 |
| N50 (bp) | 1152 | 665 | 562 |
| Mapped reads (%) | 82.94 | 79.53 | 80.23 |
Summary of the annotated U. humidicola transcriptome
| No. of annotated unigenes | % of annotated unigenes | |
|---|---|---|
| NCBI nr | 23,071 | 65.7 % |
| UniProtKB/Swiss-Prot | 14,082 | 40.1 % |
| Gene Ontology | 17,255 | 49.2 % |
| KEGG | 5,324 | 15.2 % |
| Pfam | 10,880 | 31.0 % |
Fig. 2Comparison of the size ranges of the unigenes and annotated (NCBI nr) unigenes
Fig. 3GO classification for the U. humidicola leaf transcriptome
Fig. 4Distribution of the top 10 Pfam domains identified in the U. humidicola unigenes
Ten most abundant transcripts identified in the U. humidicola leaf transcriptome
| Putative gene | E-value | FPKM | UniProtKB |
|---|---|---|---|
| photosystem II reaction center protein M | 7.00E-14 | 12514.83 | sp|A1E9R2|PSBM_SORBI |
| uncharacterized ycf68 protein | -- | 3330.18 | sp|P12173 |YCF68_ORYSJ |
| DNA-binding protein MNB1B | 8.00E-52 | 3004.90 | sp|P27347|MNB1B_MAIZE |
| putative uncharacterized protein ART2 | 3.00E-16 | 2583.30 | sp|Q8TGM7|ART2_YEAST |
| metallothionein-like protein 1A | 4.00E-19 | 2575.12 | sp|P0C5B3|MT1A_ORYSJ |
| thiamine thiazole synthase 2, chloroplastic | 0.0 | 2422.49 | sp|C5X2M4|THI42_SORBI |
| protein TIFY 10A | 2.00E-25 | 2167.52 | sp|Q9LMA8|TI10A_ARATH |
| cysteine proteinase 1 | 0.0 | 2144.41 | sp|Q10716|CYSP1_MAIZE |
| No-hit | -- | 2090.78 | -- |
| probable polyamine oxidase 2 | 0.0 | 2076.58 | sp|Q9SKX5|PAO2_ARATH |
Fig. 5Enriched gene ontology (GO) terms for the unique U. humidicola unigenes
Summary of the SSR search results
| Search Item | Numbers |
|---|---|
| Total number of sequences examined | 35,093 |
| Total size of examined sequences (bp) | 30,553,233 |
| Total number of identified SSRs | 4,489 |
| Number of SSR-containing sequences | 3,491 |
| Number of sequences containing more than 1 SSR | 566 |
| Number of SSRs present in compound formation | 191 |
| Dinucleotide | 363 |
| Trinucleotide | 3,868 |
| Tetranucleotide | 173 |
| Pentanucleotide | 54 |
| Hexanucleotide | 31 |
Length distribution of the SSRs based on the number of repeats
| N. of repeats | Di- | Tri- | Tetra- | Penta- | Hexa- | Total |
|---|---|---|---|---|---|---|
| 4 | - | 2911 | 135 | 44 | 30 | 3120 |
| 5 | - | 718 | 34 | 10 | 0 | 762 |
| 6 | 159 | 168 | 2 | 0 | 0 | 329 |
| 7 | 92 | 62 | 2 | 0 | 0 | 156 |
| 8 | 34 | 8 | 0 | 0 | 0 | 42 |
| 9 | 26 | 1 | 0 | 0 | 0 | 27 |
| 10 | 20 | 0 | 0 | 0 | 1 | 21 |
| 11 | 24 | 0 | 0 | 0 | 0 | 24 |
| ≥12 | 8 | 0 | 0 | 0 | 0 | 8 |
Fig. 6Frequency distribution of SSRs based on the motif sequence types
Summary of the putative SNPs identified using the CLC Genomics Workbench
| Number of unigenes | 35,093 |
| Total bases | 66,854,141 |
| Number of SNPs | 560,298 |
| SNP frequency | 1 per 119 bp |
|
| 358,916 |
| A ↔ G | 179,262 |
| C ↔ T | 179,654 |
|
| 192,409 |
| A ↔ C | 46,978 |
| A ↔ T | 45,773 |
| C ↔ G | 52,161 |
| G ↔ T | 47,497 |
Fig. 7qPCR expression analysis of differentially expressed and unique sequences of BRS Tupi and BH031. a: Triose phosphate/phosphate translocator (TPT)*; b: cysteine proteinase 2 (CYSP2); c: sodium/metabolite cotransporter BASS2; d: thiamine thiazole synthase 2 (THI42); e: premnaspirodiene oxygenase (C7D55)*; f: calmodulin-binding transcription activator (CMTA1)*; g: no-hit*. *: Statistically significant differences in gene expression between the genotypes analyzed using REST software