| Literature DB >> 31741783 |
Na Ding1,2, Huihui Cui1, Ying Miao1, Jun Tang3,4, Qinghe Cao3,4, Yonghai Luo1,2.
Abstract
BACKGROUND: Sweet potato (Ipomoea batatas (L.) Lam.) is one of the most important crops in many developing countries and provides a candidate source of bioenergy. However, neither a complete reference genome nor large-scale full-length cDNA sequences for this outcrossing hexaploid crop are available, which in turn impedes progress in research studies in I. batatas functional genomics and molecular breeding.Entities:
Keywords: Alternative splicing; Comparative transcriptome analysis; Single-molecular real-time sequencing; Sweet potato
Year: 2019 PMID: 31741783 PMCID: PMC6859871 DOI: 10.7717/peerj.7933
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Figure 1Plant materials used in this study and summary of PacBio RS II single-molecule real-time (SMRT) sequencing.
(A–H) Photos showing the developmental stages and overall morphology of eight tissues in I. batatas used for SMRT sequencing in this study. (A) Young leaves; (B) mature leaves; (C) apical shoots; (D) mature stems; (E) fibrous roots; (F) initiating tuberous roots; (G) expanding tuberous roots; (H) mature tuberous roots. The photos were adopted from our previous report (Ding et al., 2017). Number and length distributions of 220,035 reads in I. batatas (I) and 195,188 reads in I. trifida (J) from different PacBio libraries (fractionated size: 1–2, 2–3, >3 kb); Proportion of different types of PacBio reads in I. batatas (K) and I. trifida (L).
Summary of PacBio sequencing in this study.
| Reads of insert of PacBio sequencing | 220,035 | 195,188 |
| Bases of insert of PacBio sequencing (bp) | 701,923,565 | 527,497,043 |
| Reads of Illumina sequencing for correction | 71,360,785 | 39,372,131 |
| Bases of Illumina sequencing for correction (bp) | 17,972,706,252 | 11,772,267,169 |
| Number of non-full-length PacBio reads | 102,510 | 85,680 |
| Number of full-length non-chimeric PacBio reads | 109,814 | 101,630 |
| Average length of full-length non-chimeric PacBio reads (bp) | 8,641 | 8,488 |
| Number of non-redundant transcripts after correction | 53,861 | 51,184 |
| N50 of non-redundant transcripts after correction (bp) | 2,933 | 2,642 |
| Mean of non-redundant transcripts after correction (bp) | 2,421 | 2,190 |
| Number of non-redundant full-length transcripts after correction | 34,963 | 33,637 |
Figure 2Analysis of putative long noncoding RNA (lncRNA).
Venn Diagrams of lncRNAs predicted from Ib53861 (A) and It51184 (B) by four programs (CPAT, CPC, CNCI, and Pfam). (C) Types and numbers of lncRNAs that could be clarified in our analysis. (D) Homolog relationship of predicted lncRNAs between I. batatas and I. trifida.
Summary of alternative splicing analysis.
| Number of isoforms of the datasets | 53,861 | 51,184 |
| Number of isoforms mapped to genome sequences | 32,660 | 46,249 |
| Number of isoforms (with multiple involvements) in AS events | 26,146 | 40,473 |
| Number of isoforms (with one involvement) in AS events | 10,562 | 17,826 |
| Number of detected AS events | 5,251 | 8,901 |
| Maximal number of isoforms in a single AS event | 35 | 46 |
| Mean number of isoforms per AS event | 4.98 | 4.55 |
| Number of loci occuring AS events | 2,047 | 3,640 |
| Maximal number of AS events in a single locus | 45 | 38 |
| Mean number of AS events per locus | 2.57 | 2.45 |
| Mean number of isoforms (with one involvement) per locus | 5.16 | 4.90 |
| Proportion of isofroms undergone AS | 32.34% | 38.54% |
| Number of estimated loci in the datasets | 10,439 | 10,452 |
Figure 3Analysis and validation of alternative splicing (AS).
(A) Diagrams showing five major AS types. (B) Proportions of major AS types predicted out of the dataset Ib53861 and It51184. (C) Homolog relationship of isoforms carrying putative AS events between I. batatas and I. trifida. (D) Proportion distribution of isoform number per AS event in I. batatas and I. trifida. (E) Proportion distribution of AS events per locus in I. batatas and I. trifida. (F) Diagram and (G) RT-PCR validation of AS events in ten I. batatas genes.