Literature DB >> 16381973

mtDB: Human Mitochondrial Genome Database, a resource for population genetics and medical sciences.

Max Ingman1, Ulf Gyllensten.   

Abstract

The mitochondrial genome, contained in the subcellular mitochondrial network, encodes a small number of peptides pivotal for cellular energy production. Mitochondrial genes are highly polymorphic and cataloguing existing variation is of interest for medical scientists involved in the identification of mutations causing mitochondrial dysfunction, as well as for population genetics studies. Human Mitochondrial Genome Database (mtDB) (http://www.genpat.uu.se/mtDB) has provided a comprehensive database of complete human mitochondrial genomes since early 2000. At this time, owing to an increase in the number of published complete human mitochondrial genome sequences, it became necessary to provide a web-based database of human whole genome and complete coding region sequences. As of August 2005 this database contains 2104 sequences (1544 complete genome and 560 coding region) available to download or search for specific polymorphisms. Of special interest to medical researchers and population geneticists evaluating specific positions is a complete list of (currently 3311) mitochondrial polymorphisms among these sequences. Recent expansions in the capabilities of mtDB include a haplotype search function and the ability to identify and download sequences carrying particular variants.

Entities:  

Mesh:

Year:  2006        PMID: 16381973      PMCID: PMC1347373          DOI: 10.1093/nar/gkj010

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

The mitochondrial genome supplies parts of the protein machinery that are necessary for oxidative phosphorylation (OXPHOS), by utilizing a series of five multiple-subunit enzymes located within the mitochondrial inner membrane. The complex constituents are encoded by both nuclear and mitochondrial genes. A genetic defect could therefore be due to mutations in genes of either system. Since new mutations are introduced more frequently to the mitochondrial genome, a higher proportion of mitochondrial dysfunction is due to mitochondrial DNA (mtDNA) mutations. A number of human diseases have been shown to be caused by mitochondrial mutations, such as Leber's hereditary optic neuropathy (LHON) (1) and neurogenetic muscle weakness, ataxia, and retinitis pigmentosa (NARP) (2). In the evaluation of a possible functional effect of a mitochondrial variant found in a group of patients, reliable population frequency data for the variant under study is needed. The Human Mitochondrial Genome Database (mtDB) provides such a compilation of available genome sequences information for this purpose. The mtDNA of most metazoan species (including humans) is predominantly maternally inherited (3). This clonal inheritance coupled with a substitution rate that in vertebrates is typically 5 to 10 times that of nuclear DNA (4) has made mitochondria an attractive source of DNA polymorphism data for population genetics studies in a wide range of species. The lack of recombination among maternal and paternal mitochondrial genomes allows the tracing of a direct genetic line where all polymorphism is due to mutation and the high substitution rate makes it possible to study variation between closely related individuals (i.e. within species). mtDNA sequences have been the main tool in a large number of studies of human evolution. The Human Mitochondrial Genome Database (mtDB) is a repository for these sequences and will provide scientists with access to a common resource for future studies in this field. Since 2000, with the publication of the first comprehensive study on complete human mitochondrial genome sequences (5), the amount of data available from mitochondrial genomes has been growing rapidly. However, polymorphism information from these data is becoming more time consuming to produce. The mtDB provides a unique resource to both medical and human population genetic researchers. Here, published mitochondrial genome sequences are collected from GenBank and other sources (not all sequences are submitted to GenBank) and made available for download. In addition, extensive polymorphism information from the complete dataset is easily accessible.

Database content

The mtDB database contains three principal types of content for researchers: Download of all mitochondrial sequences either as individuals or population sets. The sequences are grouped into 10 major geographic regions based on the population origin of the donor (Table 1). In cases where the geographic origin of the donor is different from their supposed historical background, the sequences are listed under the heading that best fits their donors' ancestry. For example, African American, European American and Asian American sequences are not listed under North America but under the headings Africa, Europe and Asia, respectively. Large sets from the same population are available as batches of individual files. All sequences are cross-referenced to their original publications and to GenBank accession numbers, where available. There are currently 2104 mitochondrial sequences at mtDB.
Table 1

Summary table of the number of sequences from each of the 10 geographic regions

PopulationCompleteCoding region
Africa
    American56
    Other58
America (North)6
America (South)3
Asia
    Japanese672
    Chinese48
    American69
    Other70
Australia
    Aborigine20
Europe535
    American241435
    Finnish192
    Italian62
    Other40
Melanesia
    New Guinea21
    Other2
Middle East7
Polynesia6
South Asia
    Indian75
    Other21
Total:1544560

Population groups for which a large number of sequences are available are indicated separately. Coding region sequences extend from np 577 to 16023, relative to CRS (6).

A list of all variable positions [numbered as in Cambridge Reference Sequence, CRS (6)] among complete, or near complete, mitochondrial sequences (Figure 1). Currently, 3311 polymorphic sites are identified and characterized in tabulated form. This table comprises a separate line for each variable site with a count of how many sequences contain each particular nucleotide variant at that site, the genic location of that site, the codon number and position and details of amino acid changes. An interested researcher can click on the number of a particular variant to obtain a list of all sequences that contain that particular variant. These sequences can then be downloaded from the list. All insertions relative to CRS have been removed.
Figure 1

Truncated table of polymorphic sites. Each row of the table shows nucleotide position [relative to Anderson (Cambridge Reference Sequence, CRS) (6)], CRS nucleotide state at that position, the number of database sequences with A, G, C, T or gap, and the functional region that the site is in. If the functional region is a protein coding gene, also listed is the codon number, the codon position, the amino acid state in CRS and for the variant, whether the change is synonymous or not. Clicking the number of sequences with a particular nucleotide state will retrieve a list of all sequences that carry that particular variant.

A search function for mitochondrial haplotypes. This goes a step beyond the list of variable positions in that sequences carrying specific haplotypes can be retrieved by entering the position and nucleotide for up to 10 loci. Only sequences that match all these criteria will be returned. Again, these sequences can then be downloaded from the database. Some population genetics researchers use predefined haplotypes (haplogroups) purported to designate specific mitochondrial lineages. As a compliment to our search function, this page has a link to a haplogroup tree where clicking the individual haplogroup letters will return a list of all sequences that belong to that particular group.

Database interface

To facilitate easy updating of mtDB, all data pages are produced dynamically by PHP scripts. PHP is an easy-to-use scripting language that integrates well with HTML. Data is parsed on the server machine and an HTML output is sent to the client. This is independent of the client's operating system, browser and installed options. The only exception to this, is the polymorphic sites, nucleotide variants and amino acid states list which is produced by a separate script and the HTML output saved to avoid long processing time for individual requests. The core database is a text file of aligned sequences. New sequences can be simply pasted to this list and are then included in searches.

CONCLUSIONS

mtDB is the only comprehensive online source for the data contained within it. This includes the sequences themselves as many have not been deposited in a publicly available database such as GenBank. The list of mitochondrial polymorphisms continually grows with the addition of new sequences and is an important resource for phylogenetic and medical studies. The ability to search for multiple-variant haplotypes adds further detail to the latent data. We are committed to the maintenance of this database and hope that it will be a useful resource for researchers for years to come.
  6 in total

1.  Rapid evolution of animal mitochondrial DNA.

Authors:  W M Brown; M George; A C Wilson
Journal:  Proc Natl Acad Sci U S A       Date:  1979-04       Impact factor: 11.205

2.  A new mitochondrial disease associated with mitochondrial DNA heteroplasmy.

Authors:  I J Holt; A E Harding; R K Petty; J A Morgan-Hughes
Journal:  Am J Hum Genet       Date:  1990-03       Impact factor: 11.025

3.  Mitochondrial DNA mutation associated with Leber's hereditary optic neuropathy.

Authors:  D C Wallace; G Singh; M T Lott; J A Hodge; T G Schurr; A M Lezza; L J Elsas; E K Nikoskelainen
Journal:  Science       Date:  1988-12-09       Impact factor: 47.728

4.  Sequence and organization of the human mitochondrial genome.

Authors:  S Anderson; A T Bankier; B G Barrell; M H de Bruijn; A R Coulson; J Drouin; I C Eperon; D P Nierlich; B A Roe; F Sanger; P H Schreier; A J Smith; R Staden; I G Young
Journal:  Nature       Date:  1981-04-09       Impact factor: 49.962

5.  Mitochondrial genome variation and the origin of modern humans.

Authors:  M Ingman; H Kaessmann; S Pääbo; U Gyllensten
Journal:  Nature       Date:  2000-12-07       Impact factor: 49.962

6.  Maternal inheritance of human mitochondrial DNA.

Authors:  R E Giles; H Blanc; H M Cann; D C Wallace
Journal:  Proc Natl Acad Sci U S A       Date:  1980-11       Impact factor: 11.205

  6 in total
  136 in total

1.  Molecular diagnosis of infantile mitochondrial disease with targeted next-generation sequencing.

Authors:  Sarah E Calvo; Alison G Compton; Steven G Hershman; Sze Chern Lim; Daniel S Lieber; Elena J Tucker; Adrienne Laskowski; Caterina Garone; Shangtao Liu; David B Jaffe; John Christodoulou; Janice M Fletcher; Damien L Bruno; Jack Goldblatt; Salvatore Dimauro; David R Thorburn; Vamsi K Mootha
Journal:  Sci Transl Med       Date:  2012-01-25       Impact factor: 17.956

2.  Inheritance of mitochondrial DNA recombinants in double-heteroplasmic families: potential implications for phylogenetic analysis.

Authors:  Gábor Zsurka; Kevin G Hampel; Tatiana Kudina; Cornelia Kornblum; Yevgenia Kraytsberg; Christian E Elger; Konstantin Khrapko; Wolfram S Kunz
Journal:  Am J Hum Genet       Date:  2006-12-27       Impact factor: 11.025

3.  What is a 'novel' mtDNA mutation--and does 'novelty' really matter?

Authors:  Hans-Jürgen Bandelt; Antonio Salas; Claudio M Bravi
Journal:  J Hum Genet       Date:  2006-10-05       Impact factor: 3.172

4.  Investigation of the mitochondrial genome in patients with atypical motor neuron disease.

Authors:  Catherine Phoenix; Geoffrey A Taylor; Judith Hartley; Hannah Nixon; Paul G Ince; Pamela J Shaw; Douglass M Turnbull; Robert W Taylor
Journal:  J Neurol       Date:  2007-03-31       Impact factor: 4.849

5.  Enrichment of longevity phenotype in mtDNA haplogroups D4b2b, D4a, and D5 in the Japanese population.

Authors:  Gabriela Alexe; Noriyuki Fuku; Erhan Bilal; Hitomi Ueno; Yutaka Nishigaki; Yasunori Fujita; Masafumi Ito; Yasumichi Arai; Nobuyoshi Hirose; Gyan Bhanot; Masashi Tanaka
Journal:  Hum Genet       Date:  2007-02-17       Impact factor: 4.132

6.  Mitochondrial haplogroup X is associated with successful aging in the Amish.

Authors:  Monique D Courtenay; John R Gilbert; Lan Jiang; Anna C Cummings; Paul J Gallins; Laura Caywood; Lori Reinhart-Mercer; Denise Fuzzell; Claire Knebusch; Renee Laux; Jacob L McCauley; Charles E Jackson; Margaret A Pericak-Vance; Jonathan L Haines; William K Scott
Journal:  Hum Genet       Date:  2011-07-13       Impact factor: 4.132

7.  Power and SNP tagging in whole mitochondrial genome association studies.

Authors:  Allan F McRae; Enda M Byrne; Zhen Zhen Zhao; Grant W Montgomery; Peter M Visscher
Journal:  Genome Res       Date:  2008-03-20       Impact factor: 9.043

8.  The diversity present in 5140 human mitochondrial genomes.

Authors:  Luísa Pereira; Fernando Freitas; Verónica Fernandes; Joana B Pereira; Marta D Costa; Stephanie Costa; Valdemar Máximo; Vincent Macaulay; Ricardo Rocha; David C Samuels
Journal:  Am J Hum Genet       Date:  2009-05-07       Impact factor: 11.025

9.  Drawing the history of the Hutterite population on a genetic landscape: inference from Y-chromosome and mtDNA genotypes.

Authors:  Irene Pichler; Christian Fuchsberger; Christa Platzer; Minal Calişkan; Fabio Marroni; Peter P Pramstaller; Carole Ober
Journal:  Eur J Hum Genet       Date:  2009-10-21       Impact factor: 4.246

10.  A novel NDUFA1 mutation leads to a progressive mitochondrial complex I-specific neurodegenerative disease.

Authors:  Prasanth Potluri; Antonio Davila; Eduardo Ruiz-Pesini; Dan Mishmar; Sean O'Hearn; Saege Hancock; Mariella Simon; Immo E Scheffler; Douglas C Wallace; Vincent Procaccio
Journal:  Mol Genet Metab       Date:  2009-01-29       Impact factor: 4.797

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.