| Literature DB >> 28702491 |
Konstantin Gunbin1, Leonid Peshkin2, Konstantin Popadin3, Sofia Annis4, Rebecca R Ackermann5, Konstantin Khrapko4.
Abstract
The data and methods presented in this article are supplementing the research article "Integration of mtDNA pseudogenes into the nuclear genome coincides with speciation of the human genus. A hypothesis", DOI: 10.1016/j.mito.2016.12.001 (Gunbin et al., 2017) [1]. Mitochondrial DNA is known to get inserted into nuclear DNA to form NUMTs, i.e. nuclear DNA pseudogenes of the mtDNA. We present here the sequences of selected NUMTs, in which time of integration can be determined with sufficient precision. We report their chromosomal positions , their position within the great ape mtDNA phylogeny, and their times of integration into the nuclear genome. The methods used to generate the data and to control their quality are also presented. The dataset is made publicly available to enable critical or extended analyzes.Entities:
Keywords: Human evolution; Mitochondrial DNA; NUMT; Pseudogene; Punctuated evolution; Speciation
Year: 2017 PMID: 28702491 PMCID: PMC5491396 DOI: 10.1016/j.dib.2017.05.024
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. S118 Selected NUMTs are not products of post-insertion duplication. Joint Bayesian phylogenic tree of the entire set of 18 selected NUMTs and the 5 higher ape mtDNA sequences. The Gibbon used as outgroup is not shown. Note that none of the NUMTs have any stems in common other than those shared with at least one mtDNA sequence. This implies that among the selected 18 NUMTs there were no duplications after the insertion into the nuclear genome.
Fig. S2Example of a “hidden” substitution. “Anctr” (for Ancestor), the coalescence point of the NUMT branch and the human mitochondrial lineage (“Human”). “Luca” – coalescence point of the human and the Chimpanzee (Chimp) lineages. A “hidden” substitution arises, for example, when a position in the reconstructed interior node Anctr is determined in error (T instead of the real C). Then the true C>T substitution in the NUMT branch becomes “hidden”. Hidden substitution can be revealed by manual analysis of reconstructed ancestral sequences. The reconstruction of ancestral sequences was made by marginal algorithm based on maximum likelihood estimation of parameters.
Fig. S3Probability of equally or more extreme (more clustered) distribution of random data points as compared to the distribution of the actual NUMTs’ insertion times. (See Section 2.6).
| Subject area | |
| More specific subject area | |
| Type of data | |
| How data was acquired | |
| Data format | |
| Experimental factors | |
| Experimental features | |
| Data source location | |
| Data accessibility |