| Literature DB >> 25924014 |
Seung Bum Seo, Xiangpei Zeng, Jonathan L King, Bobby L Larue, Mourad Assidi, Mohamed H Al-Qahtani, Antti Sajantila, Bruce Budowle.
Abstract
BACKGROUND: Massively parallel sequencing (MPS) technologies have the capacity to sequence targeted regions or whole genomes of multiple nucleic acid samples with high coverage by sequencing millions of DNA fragments simultaneously. Compared with Sanger sequencing, MPS also can reduce labor and cost on a per nucleotide basis and indeed on a per sample basis. In this study, whole genomes of human mitochondria (mtGenome) were sequenced on the Personal Genome Machine (PGMTM) (Life Technologies, San Francisco, CA), the out data were assessed, and the results were compared with data previously generated on the MiSeqTM (Illumina, San Diego, CA). The objectives of this paper were to determine the feasibility, accuracy, and reliability of sequence data obtained from the PGM.Entities:
Mesh:
Substances:
Year: 2015 PMID: 25924014 PMCID: PMC4315160 DOI: 10.1186/1471-2164-16-S1-S4
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1A concentric Circos plot of the mtGenome. A concentric Circos plot of the mtGenome representing mean coverage (outer circle; n=24); homopolymers, n≥4 bases, per region nucleotide position (middle circle; n=24); and mean coverage differentiated by reverse (dark) or forward (light) strand (inner circle; n=24). The rose diagram in the center is included for nucleotide position orientation and scale bars are included to the left of the individual plots to approximate values. The control region is offset slightly for orientation.
Figure 2A concentric Circos plot of the mtGenome. A concentric Circos plot of the mtGenome representing mean coverage (outer circle; n=24); C homopolymers interrupted by another base (e.g., CnTCn), per region nucleotide position (middle circle; n=24); and mean coverage differentiated by reverse (dark) or forward (light) strand (inner circle; n=24). The rose diagram in the center is included for nucleotide position orientation and scale bars are included to the left of the individual plots to approximate values. The control region is offset slightly for orientation.
Comparison of homopolymers between high coverage and low coverage areas across 24 samples.
| High coverage areas1 | Low coverage areas2 | |
|---|---|---|
| Homopolymer type | Number of Homopolymers | Number of Homopolymers |
| AA | 276 | 320 |
| GG | 167 | 81 |
| TT | 238 | 225 |
| AAA | 104 | 95 |
| GGG | 31 | 12 |
| TTT | 82 | 42 |
| AAAA | 22 | 33 |
| GGGG | 4 | 2 |
| TTTT | 18 | 12 |
| AAAAA | 10 | 11 |
| GGGGG | 2 | 0 |
| TTTTT | 4 | 3 |
| AAAAAA | 4 | 6 |
| GGGGGG | 1 | 0 |
| TTTTTT | 2 | 1 |
| AAAAAAA | 2 | 2 |
| TTTTTTT | 1 | 0 |
| AAAAAAAA | 0 | 1 |
1 Total number of bases in high coverage areas is 6858, and GC content is 44%. 2 Total number of bases in low coverage areas is 6760, and GC content is 47%. 3 For quick reference, cytosine homopolymers are in bold.
Figure 3The overall strand bias display for all 24 samples. X axis is the ratio of coverage between the forward and reverse strands at each nucleotide position (lower coverage/higher coverage). Y axis is the number of positions with specific percentages of strand bias.
Figure 4Deletion pattern at np 11643 in two samples. Top panel is a PGM result (no. 23); middle panel is a PGM result (no. 22); and bottom panel is a MiSeq result (no. 23). There was no deletion pattern in the MiSeq results.
Figure 59-bp deletion pattern was identified in PGM data in one sample (no. 14). The top panel displays fastq files generated by the PGM aligned using TMAP. The bottom panel displays the same fastq files aligned using BWA.