| Literature DB >> 26622062 |
Kazuhiro Sato1, Tsuyoshi Tanaka2, Shuji Shigenobu3, Yuka Motoi4, Jianzhong Wu2, Takeshi Itoh2.
Abstract
Full-length (FL) cDNA sequences provide the most reliable evidence for the presence of genes in genomes. In this report, detailed gene structures of barley, whole genome shotgun (WGS) and additional transcript data of the cultivar Haruna Nijo were quality controlled and compared with the published Morex genome information. Haruna Nijo scaffolds have longer total sequence length with much higher N50 and fewer sequences than those in Morex WGS contigs. The longer Haruna Nijo scaffolds provided efficient FLcDNA mapping, resulting in high coverage and detection of the transcription start sites. In combination with FLcDNAs and RNA-Seq data from four different tissue samples of Haruna Nijo, we identified 51,249 gene models on 30,606 loci. Overall sequence similarity between Haruna Nijo and Morex genome was 95.99%, while that of exon regions was higher (99.71%). These sequence and annotation data of Haruna Nijo are combined with Morex genome data and released from a genome browser. The genome sequence of Haruna Nijo may provide detailed gene structures in addition to the current Morex barley genome information.Entities:
Keywords: Hordeum; RNA-Seq; full-length cDNA; genome sequencing
Mesh:
Substances:
Year: 2015 PMID: 26622062 PMCID: PMC4755524 DOI: 10.1093/dnares/dsv033
Source DB: PubMed Journal: DNA Res ISSN: 1340-2838 Impact factor: 4.458
Sources of WGS sequencing data for Haruna Nijo
| Method | Read | No. of reads | No. of bases (bp) |
|---|---|---|---|
| Illumina | Paired end (PE) | 3,005,632,276 | 298,945,920,500 |
| 454 | PE 20 kb | 14,242,510 | 4,569,072,624 |
| 454 | PE 8 kb | 2,955,788 | 1,016,746,348 |
| 454 | Single end 500 bp[ | 70,936,197 | 25,410,875,001 |
Results of whole-genome assembly of Haruna Nijo and Morex[3]
| Data | Haruna Nijo | Morex |
|---|---|---|
| Total length (bp) | 2,005,970,672 | 1,868,648,155 |
| N50 bp | 3,539 | 1,425 |
| No. of contigs/scaffolds | 1,712,236 | 2,670,738 |
| No. of ‘N’ | 225,899,677 | 50,995,654 |
| No. of non-‘N’ | 1,780,070,995 | 1,817,652,501 |
| GC (%) | 45.0 | 44.4 |
Figure 1.Distribution of TREP repeat categories on the Haruna Nijo and Morex barley genome assemblies. Frequency of each repeat content was calculated by the result of repeat masking using the TREP library. Total bases of each repeat region on the genomes were divided by those of all repeat regions.
FLcDNA mapping on genome assemblies
| Coverage (%) | No. of hit FLcDNAs (ratio/total) | |
|---|---|---|
| Haruna Nijo | Morex[ | |
| 50 | 26,241 (0.917) | 24,621 (0.860) |
| 70 | 22,835 (0.798) | 19,926 (0.696) |
| 80 | 21,429 (0.749) | 17,706 (0.619) |
| 90 | 20,153 (0.704) | 15,361 (0.537) |
| 95 | 19,485 (0.681) | 14,044 (0.491) |
Figure 2.Gene structure of closed flowering locus cly1 (AK371953). In the Haruna Nijo genome, the complete gene structures of cly1 including upstream and downstream regions were identified. In the Morex genome,[3] the first exon and upstream regions were truncated because of insufficient length of the contig.
Sequencing and mapping results of Haruna Nijo RNA-Seq data
| Sample | No. of read pairs | Aligned pairs | Concordant pair alignment rate (%) |
|---|---|---|---|
| Leaf | 4,977,702 | 3,738,352 | 75.0 |
| Root | 3,981,366 | 2,921,944 | 73.2 |
| Seed | 7,024,371 | 5,691,954 | 81.0 |
| Spike | 4,292,946 | 3,399,764 | 79.1 |
Figure 3.Example screenshots of (A) Haruna Nijo contig_55003, (B) Morex contig_156845 and (C) Morex contig_68185 under the platform of Gbrowse. Separated genomic regions of Morex which correspond to the single FLcDNA region in Haruna Nijo. This figure is available in black and white in print and in colour at DNA Research online.
Figure 4.Example screenshots of bex-db (http://barleyflc.dna.affrc.go.jp/bexdb/) for a FLcDNA (AK370496). This figure is available in black and white in print and in colour at DNA Research online.