| Literature DB >> 23437111 |
Chih-Ming Hung1, Rong-Chien Lin, Jui-Hua Chu, Chia-Fen Yeh, Chiou-Ju Yao, Shou-Hsien Li.
Abstract
The information from ancient DNA (aDNA) provides an unparalleled opportunity to infer phylogenetic relationships and population history of extinct species and to investigate genetic evolution directly. However, the degraded and fragmented nature of aDNA has posed technical challenges for studies based on conventional PCR amplification. In this study, we present an approach based on next generation sequencing to efficiently sequence the complete mitochondrial genome (mitogenome) of two extinct passenger pigeons (Ectopistes migratorius) using de novo assembly of massive short (90 bp), paired-end or single-end reads. Although varying levels of human contamination and low levels of postmortem nucleotide lesion were observed, they did not impact sequencing accuracy. Our results demonstrated that the de novo assembly of shotgun sequence reads could be a potent approach to sequence mitogenomes, and offered an efficient way to infer evolutionary history of extinct species.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23437111 PMCID: PMC3577829 DOI: 10.1371/journal.pone.0056301
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Sequence information for paired-end reads by illumina genomic sequencing of two passenger pigeons.
| Voucher | Read | % Q20 | Duplicate | Assembled Contig |
| BMNH1149 | 42,135,314 | 97.5 | 2,824,624 | 368,118 |
| BMNH1389 | 64,100,016 | 93.6 | 12,109,764 | 161,686 |
Read indicates the total numbers of 90 bp reads, % Q20 indicates the percentages of reads passing Q20 (the probability of a base call error ≤0.01), duplicate indicates the number of duplicate reads removed and assembled contig indicates the number of assembled contigs ≥300 bp.
Figure 1Average sequence coverage of contigs against their lengths (bp) for the de novo assemblies based on Illumina paired-end reads of (A) BMNH1149 and (B) BMNH1389.
Arrows indicate the contigs that were annotated as the mitogenomes of passenger pigeons.
Figure 2Sequence coverage (in red) and frequency of the major allele (in green) of each base pair along the mitogenomes of BMNH1149 (upper) and BMNH1389 (lower).
Both mitogenomes start from the first base of tRNA-Phe.
Figure 3Metagenomic composition of the Illumina sequence reads from the toe pads of two passenger pigeons, BMNH1149 and BMNH1389.
Percentage of non-duplicate reads blasted to chicken, human bacterial and fungal genome references downloaded from the NCBI genome database.
Figure 4Frequency for 12 categories of sequence mismatches for raw read sequences against the consensus sequence of mitogenomes of BMNH1149 (in green) and BMNH1389 (in red).
Pairwise divergence among the three passenger pigeon mitogenomes.
| tRNA-Phe | 12S rRNA | tRNA-Val | 16S rRNA | tRNA-Leu1 | ND1 | tRNA-Ile | tRNA-Gln | |||||||||||
| Alignment length | 69 | 972 | 73 | 1633 | 74 | 966 | 71 | 71 | ||||||||||
| BMNH1149/BMNH1389 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 3/NA/27/20 | 0/NA/0/0 | 1/1/0/0 | 0/NA/0/0 | 0/NA/0/0 | ||||||||||
| BMNH1149/JQ692598 | 2/NA/1/0 | 0/NA/0/0 | 0/NA/0/0 |
| 0/NA/0/0 | 1/1/0/0 | 0/NA/0/0 | 0/NA/0/0 | ||||||||||
| BMNH1389/JQ692598 | 2/NA/1/0 | 0/NA/0/0 | 0/NA/0/0 |
| 0/NA/0/0 | 0/0/0/0 | 0/NA/0/0 | 0/NA/0/0 | ||||||||||
|
|
|
|
|
|
|
|
| |||||||||||
| Alignment length | 69 | 1042 | 71 | 69 | 69 | 67 | 72 | 1551 | ||||||||||
| BMNH1149/BMNH1389 | 0/NA/0/0 | 1/0/1/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 2/1/0/0 | ||||||||||
| BMNH1149/JQ692598 | 0/NA/0/0 | 2/1/1/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/0/0/0 | ||||||||||
| BMNH1389/JQ692598 | 0/NA/0/0 | 1/1/1/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 2/1/0/0 | ||||||||||
|
|
|
|
|
|
|
|
| |||||||||||
| Alignment length | 74 | 69 | 684 | 71 | 168 | 684 | 784 | 69 | ||||||||||
| BMNH1149/BMNH1389 | 0/NA/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/0/0/0 | 1/0/0/0 | 0/NA/0/0 | ||||||||||
| BMNH1149/JQ692598 | 0/NA/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/0/0/0 | 0/0/0/0 | 0/NA/0/0 | ||||||||||
| BMNH1389/JQ692598 | 0/NA/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/0/0/0 | 1/0/0/0 | 0/NA/0/0 | ||||||||||
|
|
|
|
|
|
|
|
| |||||||||||
| Alignment length | 352 | 69 | 297 | 1378 | 69 | 66 | 71 | 1815 | ||||||||||
| BMNH1149/BMNH1389 | 0/0/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/0/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 5/1/0/0 | ||||||||||
| BMNH1149/JQ692598 | 0/0/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/0/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 5/1/0/0 | ||||||||||
| BMNH1389/JQ692598 | 0/0/0/0 | 0/NA/0/0 | 0/0/0/0 | 0/0/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/NA/0/0 | 0/0/0/0 | ||||||||||
|
|
|
|
|
|
| |||||||||||||
| Alignment length | 1143 | 68 | 70 | 522 | 71 | 1679 | ||||||||||||
| BMNH1149/BMNH1389 | 2/0/0/0 | 0/NA/0/0 | 0/NA/0/0 | 1/0/0/0 | 0/NA/0/0 | 4/NA/310/0 | ||||||||||||
| BMNH1149/JQ692598 | 2/0/0/0 | 0/NA/0/0 | 0/NA/0/0 |
| 0/NA/0/0 |
| ||||||||||||
| BMNH1389/JQ692598 | 2/0/0/0 | 0/NA/0/0 | 0/NA/0/0 |
| 0/NA/0/0 |
| ||||||||||||
We divided the mitogenome into regions encoding 22 transfer RNA genes (Phenylalanine [tRNA-Phe], Valine [tRNA-Val], Leucine [tRNA-Leu], Isoleucine [tRNA-Ile], Glutamine [tRNA-Gln], Methionine [tRNA-Met], Tryptophan [tRNA-Trp], Alanine [tRNA-Ala], Asparagine [tRNA-Asn], Cysteine [tRNA-Cys], Tyrosine [tRNA-Tyr], Serine [tRNA-Ser], Asparate [tRNA-Asp], Lysine [tRNA-Lys], Glycine [tRNA-Gly], Arginine [tRNA-Arg], Histidine [tRNA-His], Serine [tRNA-Ser], Leucine [tRNA-Leu], Threonine [tRNA-Thr], Proline [tRNA-Pro], Glutamate [tRNA-Glu]), 13 protein-coding genes (NADH dehydrogenase subunit 1 [ND1], NADH dehydrogenase subunit 2 [ND2], cytochrome c oxidase subunit I [COX1], cytochrome c oxidase subunit II [COX2], ATP synthase F0 subunit 8 [ATP8], ATP synthase F0 subunit 6 [ATP6], cytochrome c oxidase subunit III [COX3], NADH dehydrogenase subunit 3 [ND3], NADH dehydrogenase subunit 4L [ND4L], NADH dehydrogenase subunit 4 [ND4], NADH dehydrogenase subunit 5 [ND5], cytochrome b [CYTB], NADH dehydrogenase subunit 6 [ND6]), two ribosomal RNA genes (12 s rRNA and16 s rRNA) and a control region. Alignment length (bp), and number of segregating sites/number of nonsynonymous substitutions/number of gaps/number of ambiguous sites are shown for each of these regions. Numbers that are exceptionally high were bolded.