| Literature DB >> 22807622 |
Dapeng Wang1, Yao Su, Xumin Wang, Hongxing Lei, Jun Yu.
Abstract
BACKGROUND: Repetitive sequences (RSs) are redundant, complex at times, and often lineage-specific, representing significant "building" materials for genes and genomes. According to their origins, sequence characteristics, and ways of propagation, repetitive sequences are divided into transposable elements (TEs) and satellite sequences (SSs) as well as related subfamilies and subgroups hierarchically. The combined changes attributable to the repetitive sequences alter gene and genome architectures, such as the expansion of exonic, intronic, and intergenic sequences, and most of them propagate in a seemingly random fashion and contribute very significantly to the entire mutation spectrum of mammalian genomes. PRINCIPALEntities:
Keywords: intron size; mammalian genomes; satellite sequences; transposable elements
Year: 2012 PMID: 22807622 PMCID: PMC3396637 DOI: 10.4137/EBO.S9758
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1Percentage of introns with retrotransposons, DNA transposons, and satellites.
Notes: The fractions of introns with repeats are displayed over intron length intervals. The ten intervals of intron lengths are defined as: 1, (50–150); 2, (151–300); 3, (301–600); 4, (601–1000); 5, (1001–1400); 6, (1401–2000); 7, (2001–3000); 8, (3001–5000); 9, (5001–10000); and 10, (10001+).
Percentage of introns with classified into repetitive families.
| Class/family | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| DNA:Chapaev-Chap3 | – | – | – | – | – | – | – | – | – | – | – | – | – | 13% | – | – |
| DNA:DNA | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 35% |
| DNA:En-Spm | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 14% |
| DNA:Harbinger | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 21% | 11% |
| DNA:MER1_type | – | 21% | 19% | 16% | – | 21% | – | 12% | 11% | 12% | – | – | – | – | – | – |
| DNA:T2 | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 16% | – |
| DNA:TcMar-Tc1 | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 15% |
| DNA:TcMar-Tigger | 10% | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – |
| DNA:hAT | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 23% |
| DNA:hAT-Ac | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 11% |
| DNA:hAT-Charlie | 21% | – | – | – | 19% | – | 20% | – | – | – | 17% | – | – | – | 12% | 20% |
| LINE:CR1 | – | – | – | – | – | – | – | – | – | – | 23% | – | 19% | – | – | – |
| LINE:L1 | 27% | 27% | 27% | 27% | 27% | 27% | 26% | 23% | 20% | 20% | 30% | – | – | – | – | – |
| LINE:L2 | 27% | 27% | 25% | 21% | 25% | 29% | 26% | – | – | 13% | 36% | 54% | – | – | – | – |
| LINE:Penelope | – | – | – | – | – | – | – | – | – | – | – | – | – | 12% | – | – |
| LINE:RTE | – | – | – | 13% | – | – | – | – | – | – | 13% | – | – | – | – | – |
| LINE:RTE-BovB | – | – | – | – | – | – | 18% | – | – | – | – | – | – | – | – | – |
| LTR:ERV1 | – | – | – | – | – | – | – | – | – | – | 18% | – | – | – | – | – |
| LTR:ERVL-MaLR | 14% | – | – | – | – | – | 15% | – | – | – | – | – | – | – | – | – |
| LTR:MaLR | – | 13% | 13% | – | – | 11% | – | 19% | 17% | 13% | – | – | – | – | – | – |
| SINE:Alu | 49% | 49% | 51% | – | – | – | – | – | – | – | – | – | – | – | – | – |
| SINE:B1 | – | – | – | – | – | – | – | 41% | 35% | 37% | – | – | – | – | – | – |
| SINE:B2 | – | – | – | – | – | – | – | 31% | 30% | – | – | – | – | – | – | – |
| SINE:B4 | – | – | – | – | – | – | – | 32% | 30% | 26% | – | – | – | – | – | – |
| SINE:BovA | – | – | – | 36% | – | – | – | – | – | – | – | – | – | – | – | – |
| SINE:ID | – | – | – | – | – | – | – | 12% | 22% | – | – | – | – | – | – | – |
| SINE:MIR | 35% | 36% | 36% | 29% | 33% | 37% | 35% | 15% | 13% | 21% | 62% | 57% | – | – | – | – |
| SINE:SINE | – | – | – | 20% | – | 28% | 45% | – | – | – | 25% | – | – | – | – | – |
| SINE:Sauria | – | – | – | – | – | – | – | – | – | – | – | – | – | 15% | – | – |
| SINE:V | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 12% |
| SINE:tRNA-Glu | – | – | – | 16% | – | – | – | – | – | – | – | – | – | – | – | – |
| SINE:tRNA-Lys | – | – | – | – | 34% | – | – | – | – | – | – | – | – | – | – | – |
| Satellite:Satellite | – | – | – | – | – | – | – | – | – | – | – | – | – | – | – | 16% |
| Simple repeat:Simple repeat | 27% | 26% | 27% | 20% | 24% | 17% | 18% | 44% | 40% | 26% | 31% | 20% | 13% | 30% | 14% | 34% |
Notes: The percentages are fractions of introns with the selected repeats over all introns in the listed species and only those greater than 10% are showed in the table. The species codes are: 1, human; 2, orangutan; 3, macaque; 4, cow; 5, panda; 6, horse; 7, elephant; 8, mouse; 9, rat; 10, guinea pig; 11, opossum; 12, platypus; 13, chicken; 14, anole; 15, frog; and 16, zebrafish.
Figure 2Percentage of introns with selected repeat families.
Note: The intron length intervals are defined in the same way as what in Figure 1.
Figure 3Percentage of the numbers of the four intron classes.
Note: TS, T, S, and N stand for introns with TE and SS, TE only, SS only, and without any of the two basic types, respectively.
Figure 4Length comparison of the four intron classes.
Note: The asterisks indicate significant differences between neighbouring data groups based on Wilcoxon rank sum test and cut-off <0.05.
Figure 5Position index comparisons for the four intron classes.
Note: The asterisks indicate significant differences between neighbouring data groups based on Wilcoxon rank sum test and cut-off <0.05.
Figure 6Length comparisons of the four TE-containing intron classes.
Note: The asterisks indicate significant differences between neighbouring data groups based on Wilcoxon rank sum test and cut-off <0.05.
Figure 7Percentage of genes in four classes.
Note: TS, T, S, and N denote genes with TE and SS, TE only, SS only, and with none of the two repeat types, respectively.
Mammal-specific GO term enrichment of the four gene classes.
| Class | GO code | GO name | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TS | GO:0016324 | Apical plasma membrane | * | – | – | – | – | – | * | * | * | – | – | – |
| TS | GO:0005516 | Calmodulin binding | * | – | – | * | – | – | – | * | * | – | – | – |
| TS | GO:0006812 | Cation transport | * | – | – | * | – | – | – | * | * | – | – | – |
| TS | GO:0005856 | Cytoskeleton | * | – | – | * | – | – | – | * | * | – | – | – |
| TS | GO:0005829 | Cytosol | * | – | – | – | – | – | * | * | * | – | – | – |
| TS | GO:0005783 | Endoplasmic reticulum | – | – | * | * | – | – | – | * | * | – | – | – |
| TS | GO:0005887 | Integral to plasma membrane | * | – | – | – | * | – | * | * | * | – | * | – |
| TS | GO:0023034 | Intracellular signaling pathway | * | – | – | * | – | – | – | * | * | – | – | – |
| TS | GO:0005216 | Ion channel activity | * | – | * | * | – | – | – | * | * | – | * | – |
| TS | GO:0008237 | Metallopeptidase activity | * | – | – | – | – | – | * | * | * | – | – | – |
| TS | GO:0042803 | Protein homodimerization activity | * | – | – | * | – | – | – | * | * | – | – | – |
| T | GO:0005576 | Extracellular region | – | * | * | * | * | * | * | – | * | – | – | – |
| S | GO:0030326 | Embryonic limb morphogenesis | * | – | – | * | – | – | – | * | * | – | * | – |
| S | GO:0009954 | Proximal/distal pattern formation | * | – | – | * | – | – | – | * | * | * | * | – |
| S | GO:0030528 | Transcription regulator activity | * | – | * | * | * | * | * | * | * | – | * | – |
| N | GO:0009952 | Anterior/posterior pattern formation | * | * | * | * | * | * | * | * | * | * | – | – |
| N | GO:0048706 | Embryonic skeletal system development | – | * | * | – | – | * | * | – | – | * | – | – |
| N | GO:0048704 | Embryonic skeletal system morphogenesis | – | – | * | * | – | * | * | – | * | – | * | – |
| N | GO:0005576 | Extracellular region | * | * | * | – | * | * | * | * | * | * | – | – |
| N | GO:0005179 | Hormone activity | * | * | * | – | – | * | * | – | – | * | – | – |
| N | GO:0030528 | Transcription regulator activity | * | – | * | * | * | * | * | * | * | – | * | – |
Notes: The species codes are the same as what listed in Table 1. The asterisks indicate enrichment of GO terms.
Primate-specific GO term enrichment of the four gene classes.
| Class | GO code | GO name | Human | Orangutan | Macaque |
|---|---|---|---|---|---|
| TS | GO:0005201 | Extracellular matrix structural constituent | * | – | – |
| TS | GO:0031965 | Nuclear membrane | * | – | – |
| TS | GO:0008360 | Regulation of cell shape | * | – | – |
| T | GO:0019882 | Antigen processing and presentation | * | – | – |
| T | GO:0019886 | Antigen processing and presentation of exogenous peptide antigen via MHC class II | * | – | – |
| T | GO:0002504 | Antigen processing and presentation of peptide or polysaccharide antigen via MHC class II | * | – | – |
| T | GO:0004004 | ATP-dependent RNA helicase activity | * | – | – |
| T | GO:0005125 | Cytokine activity | – | * | – |
| T | GO:0022625 | Cytosolic large ribosomal subunit | * | – | – |
| T | GO:0010008 | Endosome membrane | * | – | – |
| T | GO:0004308 | Exo-alpha-sialidase activity | * | – | – |
| T | GO:0031640 | Killing of cells of another organism | * | – | – |
| T | GO:0005765 | Lysosomal membrane | * | – | – |
| T | GO:0042613 | MHC class II protein complex | * | – | – |
| T | GO:0032395 | MHC class II receptor activity | * | – | – |
| T | GO:0005763 | Mitochondrial small ribosomal subunit | * | – | – |
| T | GO:0000398 | Nuclear mRNA splicing, via spliceosome | * | – | – |
| T | GO:0005730 | Nucleolus | * | – | – |
| T | GO:0019887 | Protein kinase regulator activity | * | – | – |
| T | GO:0003723 | RNA binding | * | – | – |
| T | GO:0008380 | RNA splicing | * | – | – |
| T | GO:0019843 | rRNA binding | * | – | – |
| T | GO:0005681 | Spliceosomal complex | * | – | – |
| T | GO:0006414 | Translational elongation | * | – | – |
| T | GO:0017070 | U6 snRNA binding | * | – | – |
| S | GO:0004869 | Cysteine-type endopeptidase inhibitor activity | – | * | – |
| S | GO:0044424 | Intracellular part | – | – | * |
| S | GO:0045665 | Negative regulation of neuron differentiation | * | – | – |
| S | GO:0009887 | Organ morphogenesis | * | – | – |
| S | GO:0002876 | Positive regulation of chronic inflammatory response to antigenic stimulus | * | – | – |
| S | GO:0002925 | Positive regulation of humoral immune response mediated by circulating immunoglobulin | * | – | – |
| S | GO:0010843 | Promoter binding | * | – | – |
| S | GO:0007519 | Skeletal muscle tissue development | – | * | – |
| S | GO:0005164 | Tumor necrosis factor receptor binding | * | – | – |
| N | GO:0002474 | Antigen processing and presentation of peptide antigen via MHC class I | * | – | – |
| N | GO:0007267 | Cell-cell signaling | * | – | – |
| N | GO:0009987 | Cellular process | * | – | – |
| N | GO:0010467 | Gene expression | * | – | – |
| N | GO:0008201 | Heparin binding | * | – | – |
| N | GO:0042309 | Homoiothermy | – | – | * |
| N | GO:0050825 | Ice binding | – | – | * |
| N | GO:0048535 | Lymph node development | * | – | – |
| N | GO:0032393 | MHC class I receptor activity | * | – | – |
| N | GO:0000122 | Negative regulation of transcription from RNA polymerase II promoter | – | – | * |
| N | GO:0048663 | Neuron fate commitment | * | – | * |
| N | GO:0005184 | Neuropeptide hormone activity | * | – | – |
| N | GO:0004522 | Pancreatic ribonuclease activity | * | – | – |
| N | GO:0010552 | Positive regulation of gene-specific transcription from RNA polymerase II promoter | – | – | * |
| N | GO:0045084 | Positive regulation of interleukin-12 biosynthetic process | * | – | – |
| N | GO:0045944 | Positive regulation of transcription from RNA polymerase II promoter | – | – | * |
| N | GO:0050826 | Response to freezing | – | – | * |
| N | GO:0016471 | Vacuolar proton-transporting V-type ATPase complex | * | – | – |
Note: The asterisks indicate significant enrichment of GO terms.
Large-mammal-specific GO term enrichment of the four gene classes.
| Class | GO code | GO name | Cow | Panda | Horse | Elephant |
|---|---|---|---|---|---|---|
| TS | GO:0006754 | ATP biosynthetic process | – | – | – | * |
| TS | GO:0015662 | ATPase activity, coupled to transmembrane movement of ions, phosphorylative mechanism | * | – | – | – |
| TS | GO:0006821 | Chloride transport | – | – | – | * |
| TS | GO:0007214 | Gamma-aminobutyric acid signaling pathway | – | – | – | * |
| TS | GO:0051536 | Iron-sulfur cluster binding | – | – | * | – |
| TS | GO:0016459 | Myosin complex | – | – | – | * |
| TS | GO:0004725 | Protein tyrosine phosphatase activity | – | – | – | * |
| TS | GO:0005097 | Rab GTPase activator activity | * | – | – | * |
| TS | GO:0032313 | Regulation of Rab GTPase activity | – | – | – | * |
| TS | GO:0048010 | Vascular endothelial growth factor receptor signaling pathway | – | – | – | * |
| T | GO:0022900 | Electron transport chain | * | – | – | – |
| T | GO:0007186 | G-protein coupled receptor protein signaling pathway | * | – | – | – |
| T | GO:0016021 | Integral to membrane | * | – | – | – |
| T | GO:0005743 | Mitochondrial inner membrane | * | – | – | * |
| T | GO:0005747 | Mitochondrial respiratory chain complex I | – | * | – | – |
| T | GO:0005515 | Protein binding | – | – | – | * |
| T | GO:0070469 | Respiratory chain | * | – | – | – |
| S | GO:0019882 | Antigen processing and presentation | – | * | – | – |
| S | GO:0042742 | Defense response to bacterium | * | – | – | – |
| S | GO:0030901 | Midbrain development | * | – | – | – |
| S | GO:0048663 | Neuron fate commitment | * | – | – | – |
| S | GO:0045449 | Regulation of transcription | – | * | – | – |
| N | GO:0022627 | Cytosolic small ribosomal subunit | – | – | * | – |
| N | GO:0016021 | Integral to membrane | * | – | – | – |
| N | GO:0045449 | Regulation of transcription | – | * | – | – |
Note: The asterisks indicate significant enrichment of GO terms.
Rodent-specific GO term enrichment of the four gene classes.
| Class | GO code | GO name | Mouse | Rat | Guinea pig |
|---|---|---|---|---|---|
| TS | GO:0015629 | Actin cytoskeleton | * | – | – |
| TS | GO:0008415 | Acyltransferase activity | – | * | – |
| TS | GO:0045177 | Apical part of cell | * | – | – |
| TS | GO:0006915 | Apoptosis | * | – | – |
| TS | GO:0030424 | Axon | * | * | – |
| TS | GO:0008013 | Beta-catenin binding | – | * | – |
| TS | GO:0005975 | Carbohydrate metabolic process | * | – | – |
| TS | GO:0007049 | Cell cycle | * | * | – |
| TS | GO:0051301 | Cell division | * | – | – |
| TS | GO:0042995 | Cell projection | * | – | – |
| TS | GO:0009986 | Cell surface | * | * | – |
| TS | GO:0016568 | Chromatin modification | * | – | – |
| TS | GO:0000777 | Condensed chromosome kinetochore | * | – | – |
| TS | GO:0016023 | Cytoplasmic membrane-bounded vesicle | – | * | – |
| TS | GO:0031410 | Cytoplasmic vesicle | * | * | – |
| TS | GO:0030425 | Dendrite | – | * | – |
| TS | GO:0006281 | DNA repair | * | * | – |
| TS | GO:0009055 | Electron carrier activity | * | – | – |
| TS | GO:0005768 | Endosome | * | * | – |
| TS | GO:0009897 | External side of plasma membrane | – | * | – |
| TS | GO:0031012 | Extracellular matrix | – | * | – |
| TS | GO:0005925 | Focal adhesion | – | * | – |
| TS | GO:0005525 | GTP binding | * | – | – |
| TS | GO:0005096 | GTPase activator activity | * | * | – |
| TS | GO:0004386 | Helicase activity | * | – | – |
| TS | GO:0042802 | Identical protein binding | * | * | – |
| TS | GO:0030027 | Lamellipodium | * | – | – |
| TS | GO:0016042 | Lipid catabolic process | * | – | – |
| TS | GO:0042470 | Melanosome | – | * | – |
| TS | GO:0008168 | Methyltransferase activity | * | – | – |
| TS | GO:0005874 | Microtubule | * | – | – |
| TS | GO:0008017 | Microtubule binding | * | – | – |
| TS | GO:0005739 | Mitochondrion | * | * | – |
| TS | GO:0007067 | Mitosis | * | – | – |
| TS | GO:0006397 | mRNA processing | – | * | – |
| TS | GO:0043066 | Negative regulation of apoptosis | – | * | – |
| TS | GO:0043025 | Neuronal cell body | – | * | – |
| TS | GO:0005634 | Nucleus | * | * | – |
| TS | GO:0030165 | PDZ domain binding | – | * | – |
| TS | GO:0048471 | Perinuclear region of cytoplasm | * | * | – |
| TS | GO:0005777 | Peroxisome | * | * | – |
| TS | GO:0035091 | Phosphoinositide binding | * | – | – |
| TS | GO:0043065 | Positive regulation of apoptosis | – | * | – |
| TS | GO:0043123 | Positive regulation of I-kappaB kinase/NF-kappaB cascade | * | – | – |
| TS | GO:0014069 | Postsynaptic density | – | * | – |
| TS | GO:0006813 | Potassium ion transport | * | – | – |
| TS | GO:0042734 | Presynaptic membrane | – | * | – |
| TS | GO:0043234 | Protein complex | * | * | – |
| TS | GO:0032403 | Protein complex binding | * | * | – |
| TS | GO:0019904 | Protein domain specific binding | – | * | – |
| TS | GO:0046982 | Protein heterodimerization activity | – | * | – |
| TS | GO:0019901 | Protein kinase binding | – | * | – |
| TS | GO:0008104 | Protein localization | – | * | – |
| TS | GO:0008565 | Protein transporter activity | * | – | – |
| TS | GO:0016567 | Protein ubiquitination | * | * | – |
| TS | GO:0045449 | Regulation of transcription | * | – | – |
| TS | GO:0006974 | Response to DNA damage stimulus | – | * | – |
| TS | GO:0042493 | Response to drug | – | * | – |
| TS | GO:0001666 | Response to hypoxia | – | * | – |
| TS | GO:0007584 | Response to nutrient | – | * | – |
| TS | GO:0014070 | Response to organic cyclic substance | – | * | – |
| TS | GO:0004871 | Signal transducer activity | * | * | – |
| TS | GO:0005625 | Soluble fraction | * | * | – |
| TS | GO:0015293 | Symporter activity | – | * | – |
| TS | GO:0019717 | Synaptosome | * | * | – |
| TS | GO:0005802 | Trans-Golgi network | * | – | – |
| TS | GO:0006511 | Ubiquitin-dependent protein catabolic process | * | – | – |
| TS | GO:0004842 | Ubiquitin-protein ligase activity | – | * | – |
| S | GO:0009653 | Anatomical structure morphogenesis | – | * | – |
| S | GO:0001658 | Branching involved in ureteric bud morphogenesis | * | – | – |
| S | GO:0045165 | Cell fate commitment | * | * | – |
| S | GO:0042733 | Embryonic digit morphogenesis | * | – | – |
| S | GO:0060441 | Epithelial tube branching involved in lung morphogenesis | * | – | – |
| S | GO:0042472 | Inner ear morphogenesis | * | – | – |
| S | GO:0003676 | Nucleic acid binding | – | * | – |
| S | GO:0048709 | Oligodendrocyte differentiation | * | – | – |
| S | GO:0001569 | Patterning of blood vessels | * | – | – |
| S | GO:0005550 | Pheromone binding | * | – | – |
| S | GO:0008284 | Positive regulation of cell proliferation | * | – | – |
| S | GO:0010552 | Positive regulation of gene-specific transcription from RNA polymerase II promoter | * | – | – |
| S | GO:0045666 | Positive regulation of neuron differentiation | * | * | – |
| S | GO:0010468 | Regulation of gene expression | * | * | – |
| S | GO:0048536 | Spleen development | – | * | – |
| S | GO:0030878 | Thyroid gland development | * | – | – |
| S | GO:0016564 | Transcription repressor activity | * | – | – |
| N | GO:0006935 | Chemotaxis | * | – | – |
| N | GO:0001533 | Cornified envelope | * | – | – |
| N | GO:0006952 | Defense response | * | – | – |
| N | GO:0042742 | Defense response to bacterium | * | – | – |
| N | GO:0005615 | Extracellular space | * | * | – |
| N | GO:0006954 | Inflammatory response | * | * | – |
| N | GO:0007389 | Pattern specification process | – | * | – |
| N | GO:0004252 | Serine-type endopeptidase activity | – | * | – |
Note: The asterisks stand for significant enrichment of GO terms.
Comparisons of incremental ratio of TEs and SSs.
| Species | TS | T | S | T + S |
|---|---|---|---|---|
| Human | 0.833 | 0.612 | 0.080 | 0.693 |
| Orangutan | 0.736 | 0.574 | 0.067 | 0.641 |
| Macaque | 0.700 | 0.569 | 0.063 | 0.632 |
| Cow | 0.661 | 0.435 | 0.065 | 0.500 |
| Panda | 0.512 | 0.384 | 0.050 | 0.434 |
| Horse | 0.545 | 0.426 | 0.059 | 0.486 |
| Elephant | 0.660 | 0.484 | 0.094 | 0.578 |
| Mouse | 0.500 | 0.346 | 0.071 | 0.418 |
| Rat | 0.458 | 0.329 | 0.078 | 0.406 |
| Guinea pig | 0.302 | 0.244 | 0.056 | 0.301 |
| Opossum | 0.867 | 0.556 | 0.103 | 0.660 |
| Platypus | 0.749 | 0.623 | 0.091 | 0.714 |
| Chicken | 0.090 | 0.183 | 0.023 | 0.205 |
| Anole | 0.116 | 0.126 | 0.035 | 0.161 |
| Frog | 0.330 | 0.339 | 0.081 | 0.420 |
| Zebrafish | 1.202 | 1.207 | 0.263 | 1.471 |
Note: Incremental ratio is defined as X/(1 − X), where X equals to the median length percentage of repeats in introns.
Figure 8Topological trees constructed based on TE (A) and SS (B).
Note: A detailed procedure is described in Methods.
Species names and the numbers of introns used in this study.
| Short name | Full name | Version | Number of introns |
|---|---|---|---|
| Human | hg19 | 191,918 | |
| Orangutan | ponAbe2 | 108,083 | |
| Macaque | rheMac2 | 135,376 | |
| Cow | bosTau4 | 155,350 | |
| Panda | ailMel1 | 136,011 | |
| Horse | equCab2 | 128,897 | |
| Elephant | loxAfr3 | 127,667 | |
| Mouse | mm9 | 183,175 | |
| Rat | rn4 | 154,905 | |
| Guinea pig | cavPor3 | 119,495 | |
| Opossum | monDom5 | 137,533 | |
| Platypus | ornAna1 | 101,406 | |
| Chicken | galGal3 | 128,491 | |
| Anole | anoCar2 | 122,041 | |
| Frog | xenTro3 | 136,091 | |
| Zebrafish | danRer7 | 207,279 |