| Literature DB >> 27152648 |
Jieqin Li1, Lihua Wang1, Qiuwen Zhan1, Yanlong Liu1, Xiaocui Yang1.
Abstract
Sudangrass, Sorghum sudanense, is an important forage in warm regions. But little is known about its genome. In this study, the transcriptomes of sudangrass S722 and sorghum Tx623B were sequenced by Illumina sequencing. More than 4Gb bases were sequenced for each library. For Tx623B and S722, 88.79% and 83.88% reads, respectively were matched to the Sorghum bicolor genome. A total of 2,397 differentially expressed genes (DEGs) were detected by RNA-Seq between the two libraries, including 849 up-regulated genes and 1,548 down-regulated genes. These DEGs could be divided into three groups by annotation analysis. A total of 44,495 single nucleotide polymorphisms (SNPs) were discovered by aligning S722 reads to the sorghum reference genome. Of these SNPs, 61.37% were transition, and this value did not differ much between different chromosomes. In addition, 16,928 insertion and deletion (indel) loci were identified between the two genomes. A total of 5,344 indel markers were designed, 15 of which were selected to construct the genetic map derived from the cross of Tx623A and Sa. It was indicated that the indel markers were useful and versatile between sorghum and sudangrass. Comparison of synonymous base substitutions (Ks) and non-synonymous base substitutions (Ka) between the two libraries showed that 95% orthologous pairs exhibited Ka/Ks<1.0, indicating that these genes were influenced by purifying selection. The results from this study provide important information for molecular genetic research and a rich resource for marker development in sudangrass and other Sorghum species.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27152648 PMCID: PMC4859472 DOI: 10.1371/journal.pone.0154947
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
RNA-sequencing results for sorghum Tx623B and sudangrass S722.
| Tx623B | S722 | |
|---|---|---|
| Total reads | 24,257,222 | 23,898,115 |
| Total base | 4.85 Gb | 4.77 Gb |
| GC (%) | 58.26 | 58.15 |
| CycleQ20 (%) | 92.90 | 93.03 |
| Mapped reads | 43,077,834 (88.79%) | 40,089,957 (83.88%) |
| Perfectly mapped reads | 26,780,019 (62.17%) | 39,373,636 (53.32%) |
| Mismatched reads | 5,881,880 (13.66%) | 8,354,470 (20.84%) |
| Indel reads | 8,825,105 (20.49%) | 7,949,694 (19.83%) |
| Indel and mismatched reads | 1,590,830 (3.6%) | 2,408,455 (6.01%) |
| Identities | 98.60% | 98.21% |
Fig 1Functional annotation of sequences based on Gene Ontology (GO) categorization.
The 102 new genes annotated using different annotation databases.
| Anno_Database | Annotated_Number | 300≤Length<1000 | Length≥1000 |
|---|---|---|---|
| COG_Annotation | 102 | 23 | 78 |
| GO_Annotation | 425 | 125 | 299 |
| nr_Annotation | 876 | 287 | 584 |
| All_Annotated | 891 | 290 | 596 |
Fig 2Functional classification of the new genes using COG.
SNPs identified in Sorghum sudanense S722.
| Chromosome | SNP_number | SNP_number_gene | SNP_number_intergenic | Transition (%) | Transversion (%) |
|---|---|---|---|---|---|
| Chromosome_1 | 6,965 | 5,545 | 1,420 | 61.84 | 38.16 |
| Chromosome_2 | 5,528 | 4,335 | 1,193 | 61.03 | 38.97 |
| Chromosome_3 | 5,438 | 4,308 | 1,130 | 59.07 | 40.93 |
| Chromosome_4 | 5,401 | 4,203 | 1,198 | 61.78 | 38.22 |
| Chromosome_5 | 2,965 | 2,010 | 955 | 60.20 | 39.80 |
| Chromosome_6 | 4,132 | 3,140 | 992 | 61.79 | 38.21 |
| Chromosome_7 | 2,830 | 2,052 | 778 | 62.30 | 37.70 |
| Chromosome_8 | 3,287 | 2,350 | 937 | 61.15 | 38.85 |
| Chromosome_9 | 3,649 | 2,668 | 981 | 61.61 | 38.39 |
| Chromosome_10 | 3,957 | 2,958 | 999 | 63.58 | 36.42 |
| Super_10 | 343 | 182 | 161 | 60.79 | 39.21 |
| Total | 44,495 | 33,751 | 10,744 | 61.37 | 38.63 |
Fig 3The distribution of indel loci in sorghum (A: the distribution of base differences, X axis is base difference, and Y axis is number; B: the distribution of indel in different chromosomes, X axis is chromosome, and Y axis is number).
Fig 4The genetic map of the F2 population derived from the cross of sorghum Tx623A and sudangrass Sa.
Fig 5A histogram of Ka/Ks distribution.