| Literature DB >> 19412437 |
Yanhui Fan1, Wenjuan Wang, Guoji Ma, Lijing Liang, Qi Shi, Shiheng Tao.
Abstract
Nucleotide insertions and deletions (indels) are responsible for gaps in the sequence alignments. Indel is one of the major sources of evolutionary change at the molecular level. We have examined the patterns of insertions and deletions in the 19 mammalian genomes, and found that deletion events are more common than insertions in the mammalian genomes. Both the number of insertions and deletions decrease rapidly when the gap length increases and single nucleotide indel is the most frequent in all indel events. The frequencies of both insertions and deletions can be described well by power law.Entities:
Keywords: Insertion; deletion; gap; indel; mammalian genome.
Year: 2007 PMID: 19412437 PMCID: PMC2671719 DOI: 10.2174/138920207783406479
Source DB: PubMed Journal: Curr Genomics ISSN: 1389-2029 Impact factor: 2.236
Genome Assemblies Included in the 28-way Multiple Alignments
| Organism | Species | Release Date | UCSC Version |
|---|---|---|---|
| human | Homo sapiens | Mar 2006 | hg18 |
| chimpanzee | Pan troglodytes | Mar 2006 | panTro2 |
| rhesus | Macaca mulatta | Jan 2006 | rheMac2 |
| bushbaby | Otolemur garnetti | Dec 2006 | otoGar1 |
| tree shrew | Tupaia belangeri | Dec 2006 | tupBel1 |
| mouse | Mus musculus | Feb 2006 | mm8 |
| rat | Rattus norvegicus | Nov 2004 | rn4 |
| guinea pig | Cavia porcellus | Oct 2005 | cavPor2 |
| rabbit | Oryctolagus cuniculus | May 2005 | oryCun1 |
| shrew | Sorex araneus | June 2006 | sorAra1 |
| hedgehog | Erinaceus europaeus | June 2006 | eriEur1 |
| dog | Canis familiaris | May 2005 | canFam2 |
| cat | Felis catus | Mar 2006 | felCat3 |
| horse | Equus caballus | Feb 2007 | equCab1 |
| cow | Bos taurus | Aug 2006 | bosTau3 |
| armadillo | Dasypus novemcinctus | May 2005 | dasNov1 |
| elephant | Loxodonta africana | May 2005 | loxAfr1 |
| tenrec | Echinops telfairi | July 2005 | echTel1 |
| opossum | Monodelphis domestica | Jan 2006 | monDom4 |
| platypus | Ornithorhychus anatinus | Mar 2007 | ornAna1 |
| lizard | Anolis carolinensis | Feb 2007 | anoCar1 |
| chicken | Gallus gallus | May 2006 | galGal3 |
| frog | Xenopus tropicalis | Aug 2005 | xenTro2 |
| fugu | Takifugu rubripes | Oct 2004 | fr2 |
| tetraodon | Tetraodon nigroviridis | Feb 2004 | tetNig1 |
| stickleback | Gasterosteus aculeatus | Feb 2006 | gasAcu1 |
| medaka | Oryzias latipes | Apr 2006 | oryLat1 |
| zebrafish | Danio rerio | Mar 2006 | danRer4 |
The Details about the Data
| Name | Last Modified | Size | Name | Last Modified | Size |
|---|---|---|---|---|---|
| chr1.maf.gz | 2007-5-30 17:09 | 822M | chr13.maf.gz | 2007-5-30 17:27 | 338M |
| chr2.maf.gz | 2007-5-30 17:52 | 878M | chr14.maf.gz | 2007-5-30 17:31 | 321M |
| chr3.maf.gz | 2007-5-30 18:04 | 735M | chr15.maf.gz | 2007-5-30 17:34 | 289M |
| chr4.maf.gz | 2007-5-30 18:10 | 636M | chr16.maf.gz | 2007-5-30 17:37 | 278M |
| chr5.maf.gz | 2007-5-30 18:16 | 641M | chr17.maf.gz | 2007-5-30 17:40 | 284M |
| chr6.maf.gz | 2007-5-30 18:22 | 609M | chr18.maf.gz | 2007-5-30 17:42 | 271M |
| chr7.maf.gz | 2007-5-30 18:28 | 528M | chr19.maf.gz | 2007-5-30 17:44 | 140M |
| chr8.maf.gz | 2007-5-30 18:33 | 499M | chr20.maf.gz | 2007-5-30 17:54 | 219M |
| chr9.maf.gz | 2007-5-30 18:37 | 418M | chr21.maf.gz | 2007-5-30 17:56 | 110M |
| chr10.maf.gz | 2007-5-30 17:14 | 475M | chr22.maf.gz | 2007-5-30 17:57 | 107M |
| chr11.maf.gz | 2007-5-30 17:19 | 474M | chrX.maf.gz | 2007-5-30 18:41 | 405M |
| chr12.maf.gz | 2007-5-30 17:24 | 460M | chrY.maf.gz | 2007-5-30 18:41 | 23M |
Ratios of Deletions to Insertions and the Percentage of the Single Nucleotide Gap
| Organism | RN | RL | P | |
|---|---|---|---|---|
| Insertion | Deletion | |||
| chimpanzee | 1.26 : 1 | 1.32 : 1 | 28.63% | 26.54% |
| Rhesus | 1.34 : 1 | 1.20 : 1 | 32.52% | 32.50% |
| Bushbaby | 1.48 : 1 | 1.10 : 1 | 42.44% | 46.13% |
| tree shrew | 1.66 : 1 | 1.03 : 1 | 40.53% | 45.93% |
| Mouse | 1.78 : 1 | 1.24 : 1 | 35.31% | 40.87% |
| Rat | 1.87 : 1 | 1.32 : 1 | 36.14% | 41.17% |
| guinea pig | 1.69 : 1 | 1.26 : 1 | 40.50% | 42.21% |
| Rabbit | 12.82 : 1 | 17.26 : 1 | 71.00% | 46.74% |
| Shrew | 1.54 : 1 | 1.04 : 1 | 38.82% | 41.77% |
| Hedgehog | 1.34 : 1 | 0.96 : 1 | 39.44% | 40.20% |
| Dog | 1.72 : 1 | 1.26 : 1 | 39.80% | 43.72% |
| Cat | 1.34 : 1 | 1.11 : 1 | 43.33% | 43.19% |
| Horse | 1.54 : 1 | 1.18 : 1 | 38.89% | 45.64% |
| Cow | 1.49 : 1 | 1.03 : 1 | 38.38% | 42.67% |
| Armadillo | 1.43 : 1 | 1.20 : 1 | 42.88% | 43.35% |
| Elephant | 1.21 : 1 | 0.93 : 1 | 41.78% | 44.28% |
| Tenrec | 1.29 : 1 | 0.92 : 1 | 39.88% | 43.03% |
| Opossum | 0.85 : 1 | 0.60 : 1 | 29.27% | 32.74% |
RN denotes the ratio between the total number of deletions and insertions.
RL denotes the ratio between the total length of deletions and insertions.
P denotes the percentage of the single nucleotide insertion or deletion.
Estimates of the Parameters
| Organism | a | b | R2 | D | P-value | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Del | Ins | Del | Ins | Del | Ins | Del | Ins | Del | Ins | |
| chimpanzee | 0.369 | 0.381 | 1.059 | 1.096 | 0.975 | 0.976 | 0.100 | 0.100 | 1.0 | 1.0 |
| Rhesus | 0.442 | 0.414 | 1.259 | 1.193 | 0.974 | 0.984 | 0.100 | 0.100 | 1.0 | 1.0 |
| Bushbaby | 0.528 | 0.492 | 1.537 | 1.434 | 0.993 | 0.989 | 0.100 | 0.100 | 1.0 | 1.0 |
| tree shrew | 0.572 | 0.481 | 1.624 | 1.400 | 0.990 | 0.989 | 0.100 | 0.100 | 1.0 | 1.0 |
| Mouse | 0.528 | 0.464 | 1.492 | 1.324 | 0.986 | 0.974 | 0.100 | 0.100 | 1.0 | 1.0 |
| Rat | 0.532 | 0.470 | 1.502 | 1.342 | 0.986 | 0.975 | 0.100 | 0.100 | 1.0 | 1.0 |
| guinea pig | 0.546 | 0.500 | 1.538 | 1.433 | 0.985 | 0.986 | 0.100 | 0.100 | 1.0 | 1.0 |
| Rabbit | 0.547 | 0.510 | 1.573 | 1.883 | 0.989 | 0.977 | 0.100 | 0.100 | 1.0 | 1.0 |
| Shrew | 0.550 | 0.474 | 1.539 | 1.372 | 0.981 | 0.989 | 0.100 | 0.100 | 1.0 | 1.0 |
| Hedgehog | 0.529 | 0.474 | 1.487 | 1.376 | 0.981 | 0.988 | 0.100 | 0.100 | 1.0 | 1.0 |
| Dog | 0.557 | 0.476 | 1.569 | 1.384 | 0.983 | 0.989 | 0.100 | 0.100 | 1.0 | 1.0 |
| Cat | 0.534 | 0.487 | 1.526 | 1.436 | 0.987 | 0.992 | 0.100 | 0.100 | 1.0 | 1.0 |
| Horse | 0.557 | 0.486 | 1.591 | 1.393 | 0.992 | 0.984 | 0.100 | 0.100 | 1.0 | 1.0 |
| Cow | 0.542 | 0.475 | 1.530 | 1.369 | 0.985 | 0.986 | 0.100 | 0.100 | 1.0 | 1.0 |
| Armadillo | 0.539 | 0.502 | 1.536 | 1.457 | 0.989 | 0.990 | 0.100 | 0.100 | 1.0 | 1.0 |
| Elephant | 0.532 | 0.491 | 1.527 | 1.433 | 0.990 | 0.994 | 0.100 | 0.100 | 1.0 | 1.0 |
| Tenrec | 0.515 | 0.482 | 1.487 | 1.398 | 0.994 | 0.988 | 0.100 | 0.100 | 1.0 | 1.0 |
| Opossum | 0.473 | 0.391 | 1.325 | 1.123 | 0.962 | 0.979 | 0.100 | 0.100 | 1.0 | 1.0 |
The maximum difference between the cumulative distributions in the KS-test.
The P-value of the KS-test.
All data analyses were performed using SPSS version 15.0 (SPSS 2006).