| Literature DB >> 28983310 |
Shenghui Zhou1, Baiqiang Yan1, Fei Li1, Jinpeng Zhang1, Jing Zhang1, Huihui Ma1, Weihua Liu1, Yuqing Lu1, Xinming Yang1, Xiuquan Li1, Xu Liu1, Lihui Li1.
Abstract
Agropyron cristatum, which is a wild grass of the tribe Triticeae, grows widely in harsh environments and provides many desirable genetic resources for wheat improvement. However, unclear interspecific phylogeny and genome-wide variation has limited the utilization of A. cristatum in the production of superior wheat varieties. In this study, by sequencing the transcriptome of the representative tetraploid A. cristatum Z559 and the common wheat variety Fukuhokomugi (Fukuho), which are often used as parents in a wide cross, their phylogenetic relationship and interspecific variation were dissected. First, 214,854 transcript sequences were assembled, and 3,457 orthologous genes related to traits of interest were identified in A. cristatum. Second, a total of 72 putative orthologous gene clusters were used to construct phylogenetic relationships among A. cristatum, Triticeae and other genomes. A clear division between A. cristatum and the other Triticeae species was revealed. Third, the sequence similarity of most genes related to traits of interest is greater than 95% between A. cristatum and wheat. Therefore, using the 5% mismatch parameter for A. cristatum, we mapped the transcriptome sequencing data to wheat reference sequences to discover the variations between A. cristatum and wheat and 862,340 high-quality variants were identified. Additionally, compared with the wheat A and B genomes, the P and D genomes displayed an obviously larger variant density and a longer evolutionary distance, suggesting that A. cristatum is more distantly related to the wheat D genome. Finally, by using Kompetitive Allele Specific PCR array (KASPar) technology, 37 of 53 (69.8%) SNPs were shown to be genuine in Z559, Fukuho, and additional lines with seven different P chromosomes, and function of the genes in which these SNPs are located were also determined. This study provides not only the first insights into the phylogenetic relationships between the P genome and Triticeae but also genetic resources for gene discovery and specific marker development in A. cristatum, and this information will be vital for future wheat-breeding efforts. The sequence data have been deposited in the Sequence Read Archive (SRA) database at the NCBI under accession number SRP090613.Entities:
Keywords: RNA-Seq; SNP; interspecific variation; phylogenetic relationship; wheat; wild relatives
Year: 2017 PMID: 28983310 PMCID: PMC5613732 DOI: 10.3389/fpls.2017.01644
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
Figure 1Morphological characterization of A. cristatum. (A) The A. cristatum growing in the natural environment. (B) Spike morphological characterization. (C) Spikelet morphological characterization.
Figure 2Pipeline used to dissect phylogenetic relationship and interspecific variation between A. cristatum and wheat in this study.
Summary of the RNA-Seq data.
| Z559 | 64,254,354 | 64,243,343 | 99.98 | 52.00 |
| Fukuho | 42,700,222 | 42,692,045 | 99.98 | 52.00 |
Statistics of the A. cristatum transcriptome assembly and annotation.
| Number of transcripts assembled | 214,854 |
| Maximum transcript length (bp) | 13,610 |
| Minimum transcript length (bp) | 201 |
| Average transcript length (bp) | 611 |
| N50 length (bp) | 792 |
| Number of transcripts with coding sequences | 78,912 |
| Number of transcripts with blast hits to NR | 57,183 |
| Number of transcripts with blast hits to Pfam | 38,208 |
| Number of transcripts with blast hits to SwissProt | 55,157 |
| Number of transcripts with blast hits to KOG | 43,487 |
| Number of transcripts with GO terms | 18,073 |
| Number of transcripts with at least one hit in these databases | 66,346 |
Figure 3Transcript length distribution in A. cristatum.
Figure 4Phylogenetic relationships between A. cristatum and other genomes. Numbers at each node are the bootstrap values, shown as percentages. The branch lengths are the same as the evolutionary distances.
Figure 5Density distribution of A. cristatum transcript sequences identity compared to the wheat.
Figure 6CIRCOS visualization of different data at the wheat genome-wide level. (A) Karyotype of the wheat genome. (B) Location of homologous genes, including SNPs related to traits. Agronomic traits (red), biotic stress (blue) abiotic stress (green). (C) Gene content density distribution; Gene density was calculated in a 3-Mb window. (D) Transcriptional density distribution in wheat. Transcript regions density was calculated in 3-Mb window intervals. (E) Transcriptional density distribution in A. cristatum. Transcript region density was calculated in 3-Mb window intervals. (F) Variant distribution by chromosome. SNP density was calculated in 3-Mb window intervals. (G) Variant density in transcript regions. Variant density was calculated in transcript regions at 3-Mb window intervals.
Figure 7Characterization of SNPs between A. cristatum and wheat. (A) Distribution of SNPs in the wheat genome. (B) Density distribution of SNPs in the wheat genome. Density was measured by averaging the number of SNPs per 1 Mb of the wheat genomic region. (C) Distribution of the number of SNPs per gene. (D) Frequency of different substitution types in the identified SNPs. (E) Distribution of SNPs in different genomic regions.
Functional information of genes in which SNPs are located.
| PSBD_ARATH | Photosystem II D2 protein | Photosynthesis (Armbruster et al., | |
| PSBD_ARATH | Photosystem II D2 protein | Photosynthesis (Armbruster et al., | |
| PSBD_ARATH | Photosystem II D2 protein | Photosynthesis (Armbruster et al., | |
| PSBD_ARATH | Photosystem II D2 protein | Photosynthesis (Armbruster et al., | |
| CBH32509 | Trehalose synthase | Plant architecture (Chary et al., | |
| CBH32509 | Trehalose synthase | Plant architecture (Chary et al., | |
| SPSA4_ORYSJ | Sucrose phosphate synthase 4F | Sucrose synthesis (Lutfiyya et al., | |
| GIGAN_ORYSJ | Protein GIGANTEA | Photoperiodic flowering (Fowler et al., | |
| COBL3_ORYSJ | Protein BRITTLE CULM1-like 4 | Mechanical properties of plants (Li Y. et al., | |
| AAT1_ARATH | Transaminase A | Nitrogen assimilation (Schultz et al., | |
| CSLD2_ORYSI | Cellulose synthase-like protein D2 | Cellulose synthase (Hazen et al., | |
| GUN7_ORYSJ | Endoglucanase 7 | Internode elongation (Zhou et al., | |
| BAI52979 | G1-like1 protein | Floral organs development (Yoshida et al., | |
| HDA19_ARATH | Histone deacetylase 19 | Pathogen response (Zhou et al., | |
| GH38_ORYSI | Auxin-responsive GH3-like protein 8 | Floral organs development (Prasad et al., | |
| GH38_ORYSI | Auxin-responsive GH3-like protein 8 | Floral organs development (Prasad et al., | |
| UGDH4_ORYSJ | UDP-glucose 6-dehydrogenase 4 | Disease resistance (Klinghammer and Tenhaken, | |
| SGT1_ORYSJ | Protein SGT1 homolog | Disease resistance (Wang et al., | |
| GDT13_ORYSJ | GDT1-like protein 3 | Disease resistance (Rice Chromosomes 11 and 12 Sequencing Consortia, | |
| AKP45150 | Zinc finger protein | Disease resistance (Hurni et al., | |
| VPS4_ARATH | Protein SUPPRESSOR OF K(+) TRANSPORT GROWTH DEFECT 1 | Plant defense (Wang et al., | |
| ALN98169 | RING finger ubiquitin E3 ligase | Heat tolerance (Liu et al., | |
| FENR2_ORYSJ | Ferredoxin–NADP reductase | Photosynthesis (Aoki et al., | |
| MCCA_ORYSJ | MCCase subunit alpha | Leaf senescence (Lee et al., | |
| AAO72583 | PHD-finger protein | Stress response (Cooper et al., | |
| RENT1_ARATH | ATP-dependent helicase UPF1 | Plant defense (Jeong et al., | |
| RENT1_ARATH | ATP-dependent helicase UPF1 | Plant defense (Jeong et al., | |
| C3H41_ORYSJ | E3 ubiquitin-protein ligase makorin | Germinating (Arumugam et al., | |
| MPK4_ARATH | Mitogen-activated protein kinase 4 | Abiotic stresses (Ichimura et al., | |
| BAH84761 | alpha/beta hydrolase-fold family protein | Leaf senescence (Morita et al., | |
| GIGAN_ORYSJ | Protein GIGANTEA | Photoperiodic flowering (Fowler et al., | |
| AAT1_ARATH | Transaminase A | Nitrogen assimilation (Schultz et al., |
Chromosome represents chromosome in which SNP is located; LOCUS represents protein ID in NCBI; Encoded protein represents encoded protein of gene in which SNP is located; Related trait/pathway represents trait/pathway associated with protein.