| Literature DB >> 16899120 |
Alexandra T Marques1, Agostinho Antunes, Pedro A Fernandes, Maria J Ramos.
Abstract
BACKGROUND: The Abeta-binding alcohol dehydrogenase/17beta-hydroxysteroid dehydrogenase type 10 (ABAD/HSD10) is an enzyme involved in pivotal metabolic processes and in the mitochondrial dysfunction seen in the Alzheimer's disease. Here we use comparative genomic analyses to study the evolution of the HADH2 gene encoding ABAD/HSD10 across several eukaryotic species.Entities:
Mesh:
Substances:
Year: 2006 PMID: 16899120 PMCID: PMC1559703 DOI: 10.1186/1471-2164-7-202
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Genomic organization of HADH2 genes. Exons are represented by boxes and introns by lines; all are shown to scale (bar = 200 bp) except for intronic regions larger than 500 bp that are represented as slashed lines. Known 5' and 3' untranslated sequences are represented by boxes with black streaks. Exon sizes in base pairs are indicated above the boxes and intron sizes below the horizontal lines. All the genes are represented in the 5' → 3' orientation, with the exons being numbered in the human gene. Unknown regions of genes are represented by unscaled hatched lines and boxes with a question mark.
Accsession numbers, chromosome location and GC content of HADH2 genes
| human ( | ENSG00000072506 | chromosome X (p11.2) | 3.11 | 52 | 66 | 46 | 63 |
| chimpanzee ( | ENSPTRG00000021929 | chromosome X | 3.12 | 52 | 66 | 46 | 63 |
| orangutan ( | a | - | 3.11 | 52 | 66 | 45 | 63 |
| rhesus monkey ( | ENSMMUG00000009296 | geneScaffold_500 | 3.17 | 51 | 66 | 45 | 65 |
| rat ( | ENSRNOG00000003049 | chromosome X | 2.45 | 48 | 64 | 44 | 52 |
| mouse ( | ENSMUSG00000025260 | chromosome X | 2.56 | 49 | 64 | 44 | 50 |
| dog ( | ENSCAFG00000016277 | chromosome X | 2.29 | 52 | 66 | 44 | 61 |
| cat ( | a | - | 2.30 | 51 | 66 | 44 | 60 |
| cow ( | ENSBTAG00000017779 | chromosome X | 2.32 | 52 | 64 | 46 | 62 |
| pig ( | TC220713 | - | - | - | 66 | 45 | 66 |
| opossum ( | ENSMODG00000020889 | scaffold_255 | 1.80 | 55 | 66 | 46 | 64 |
| western clawed frog ( | ENSXETG00000007721/TC2334 | scaffold_154 | 3.17 | 44 | 63 | 43 | 48 |
| african clawed frog ( | TC274805 | - | - | - | 62 | 44 | 48 |
| zebrafish ( | ENSDARG00000017781 | chromosome 18 | 9.38 | 33 | 62 | 47 | 49 |
| Fugu ( | SINFRUG00000123807 | scaffold_2159 | 1.81 | 51 | 66 | 47 | 69 |
| Tetraodon ( | GSTENG00023338001 | chromosome 9 | 1.73 | 57 | 67 | 48 | 81 |
| fruitfly ( | CG7113 | chromosome X | 1.61 | 51 | 62 | 44 | 83 |
| mosquito ( | ENSANGG00000012647 | chromosome 2R | 0.77 | 60 | 67 | 42 | 73 |
| honeybee ( | ENSAPMG00000015843 | group11 | 1.49 | 18 | 54 | 38 | 12 |
| F01G4.2 | chromosome IV | 2.50 | 34 | 63 | 43 | 35 | |
| CBG06017 | assembly cb25.fpc0143 | 1.13 | 43 | 61 | 43 | 40 | |
a These gene sequences were manually inferred based on species specific WGS sequences retrieved from a tblastn search to NCBI trace archive: orangutan – ti812046525, ti812049734, ti848517349, ti718725185; cat – ti651599423, ti653340876, ti825822144, ti646417680, ti818302482, ti826254688
b The gene sequence does not include the 5' UTR sequence
c The gene sequences does not include the UTR sequences
d No information available about intronic sequences
e The gene sequence does not include the first exon, first intron and UTRs sequences
f The gene sequence does not include the first intron and UTR sequences
Figure 2Alternative splicing events identified in the human and dog HADH2 genes. A) Schematic illustration of the splicing pathways of human and dog HADH2 genes. The typical exon-intron organization of human and dog HADH2 genes is shown. Exons (E), which are of same lengths in human and dog genes, are represented by coloured boxes and shown to scale (bar = 200 bp). Introns and UTR regions are schematized by unscaled lines and boxes with black streaks, respectively. Intron phases are indicated by numbers above the lines. Alternative splicing (AS) pathway A, identified in human and dog, involves utilization of an alternative donor splice site (ss) in the exon 5, resulting in a nine amino acid deletion. The sequences of human and dog intron 5 splice sites (ss) and of the alternative donor splice site are shown. The conserved nucleotides that identify the U2-type consensus sequences for the GT splice donor site (MAGGTRAGT), GC splice donor site (MAGGCAAGT) and acceptor splice site (NCAGGT) are in red, violet, and green. Alternative splicing (AS) pathway B, identified in dog, involves the skipping of exon 5 which, due to the different phase of introns 4 and 5, leads to the appearance of an earlier stop codon (TGA) in the exon 6, resulting in the replacement of the last 92 amino acids by seven new ones. B) Amino acid sequences of the dog ABAD/HSD10 and variants produced by AS. The nine amino acids deleted in the variant 2 (252 amino acids) are in red (The nine amino acids deleted in the human variant 2 homologue are identical). The seven new amino acids in the variant 3 are in blue.
Figure 4Localization of the strand β. The strand βF is highlighted in red both in the human ABAD/HSD10 tetramer (A) and in the one of its monomers (B) (PDB code 1U7T). This strand together with other six (coloured yellow) comprise the core β-sheet of the Rossmann fold dinucleotide-binding motif. Helices are in blue and the other secondary structure elements are in white.
Figure 3Conservation of the human and dog alternative donor splice site in the exon 5 of the other eutherian mammals. The last 30 nucleotides (in uppercase) of exon 5 and the first 10 nucleotides (in lowercase) of intron 5 are given. The canonical GT dinucleotides in the common donor splice site of intron 5 are in blue while the canonical GT and non-canonical GC dinucleotides in the alternative donor splice site are in red.
Figure 5Percent identity plots (PIPs) comparing the HADH2 gene and 5'-flanking sequences between human and the other vertebrates (A), Tetraodon and the other fishes (B), and . The reference sequences are scaled in kilobases. Percent sequence identity (50% – 100%) is shown on the y-axis. RepeatMasker was used to locate repeat elements. The gene orientation (5' → 3') is shown by the arrow. Exons are illustrated by vertical black boxes and highlighted blue across the plots; UTRs are highlighted yellow across the plots. For cat and orangutan, only the 1.2 kb and 200 bp 5'-flanking sequences were available. LTR, long terminal repeat elements; MIR, mammalian interspersed repeats; SINE, short interspersed nuclear elements.
Figure 6Saturation plots of third codon position transitions and transversions versus corrected pairwise sequence divergence. All comparisons between vertebrates and invertebrates are shown. The dots coloured blue in the first plot are for vertebrate pairwise comparisons, showing that third codon position transversions are not saturated for vertebrates, starting to saturate only for pairwise comparisons involving the invertebrate sequences. The dots coloured red in the second plot are for pairwise comparisons among eutherian mammals (the small saturation observed is due to some pairwise comparisons involving the mouse and rat sequences).
Figure 7Phylogenetic relationships of the 21 HADH2 genes analysed. The Bayesian phylogenetic tree is shown (the ML tree produced an identical topology). Numbers above the branches are BPP and BS (based on 100 replicates) values (BPP/BS). The branch marked with (*) received a BS lower than 50%.
Figure 8Sliding window plots of the percentage of amino acid identity for comparisons between the human ABAD/HSD10 and the orthologue sequences within each group of species analysed. The mammals plot does not include the opossum sequence as information about the amino acids encoded by the first exon was missing. The coloured bars at the top ilustrate the localizations of important functional regions of ABAD/HSD10. Window size was 10 bp and windows were moved in 1 bp steps.
Results of the gene level approach (PAML) applied in eutherian mammal HADH2 genes
| Neutral (M1a) | -2436.668 | ||||
| ω0 = 0.035 (ω1 = 1) | M1a vs M2a | 0 | 1 | ||
| Selection (M2a) | -2436.668 | ||||
| ω0 = 0.035 (ω1 = 1) ws = 1 | |||||
| Beta (M7) | -2444.085 | ||||
| Beta&w (M8) | -2444.074 | M7 vs M8 | 0.022 | 0.989 | |
| One ratio (M0) | -2473.927 | ω = 0.176 | |||
| Free-ratio | -2462.975 | M0 vs free-ratio | 22.420 | 0.146 | |
| Two-ratio | -2470.333 | ω0 = 1.302 (foreg.) | M0 vs two-ratio | 7.188 | 0.007* |
| ω1 = 0.167 (back.) |
NOTE. – In M1 and M2 models, p0, p1, and ps are the proportions of codons estimated to each ω class. In M7 and M8 models, p and q are the parameters of the beta distribution and ps is the proportion of codon sites assigned to the positive selection class. M0 assumes a constant ω value while the free-ratio model assumes an independent ω ratio for each branch. Two-ratio model assumes a different ω ratio for the foreground branch (branch separating human and chimpanzee from orangutan) and background branches. 2Δl is the log likelihood ratio statistic and l is the log likelihood value estimated for each model. No sites were identified under positive selection within the 95% confidence level by models M2a and M8; sites 5, 14, 49, 56, 57, 106, 59, 98, 103, 123, 239 were identified under model M8 with a P < 0.75.
Results of the protein-level approach (TreeSAAP) showing the amino acid sites and physicochemical amino acid properties influenced by positive-destabilizing selection among eutherian mammal ABAD/HSD10 proteins.
| Substrate binding-cleft sites | 56 | ||
| 95 | 57 | ||
| 59 | |||
| 202 | 62 | ||
| 64 | |||
| Sites in insertion 1 region of loop D | 70 | ||
| 102 | 73 | ||
| 103 | 76 | ||
| Other sites | 123 | ||
| 5 | 135 | ||
| 7 | 177 | ||
| 15 | 189 | ||
| 25 | 194 | ||
| 28 | 237 | ||
| 44 | 239 | ||
| 46 | 240 | ||
| 49 | 244 | ||
| 50 | |||
Note. – Alpha-helical tendencies (Pα), Average number of surrounding residues (Ns), Beta-structure tendencies (P), Bulkiness (B), Buriedness (B), Chromatographic index (R), Coil tendencies (P), Composition (C), Compressibility (K), Equilibrium constant (ionization of COOH) (pK'), Power to be at the middle of alpha-helix (a), Helical contact area (C), Hydropathy (H), Isoelectric point (pHi), Long-range non-bonded energy (E), Mean r.m.s. fluctuation displacement (F), Molecular volume (M), Molecular weight (M), Normalized consensus hydrophobicity (H), Partial specific volume (V), Polar requirement (P), Polarity (P), Power to be at the C-terminal (α), Power to be at the N-terminal (α), Refractive index (μ), Short and medium range non-bonded energy (E), Solvent accessible reduction ratio (R), Surrounding hydrophobicity (H), Thermodynamic transfer hydrohphobicity (H), Total non-bonded energy (E), Turn tendencies (P). Residues in bold are those selected in the branch separating human and chimpanzee from orang-utan.
Figure 9Amino acid sites in important functional regions predicted to be under molecular adaptation by the protein-level approach. The ribbon representation of a human ABAD/HSD10 monomer (PDB code 1U7T) is shown. The sites under molecular adaptation identified in the insertion 1 region and in regions comprising the substrate binding cleft are represented in the CPK form, coloured red and yellow, respectively. Sites in bold are the sites identified to be under positive selection in the branch separating human and chimpanzee from orangutan. The loop D region which is important for interaction of ABAD/HSD10 with Aβ is in green.
Figure 10Selective pressures across the different eutherian mammal lineages. A) ML phylogeny of placental mammal HADH2 genes. The numbers above the branches are the ω ratio (left) estimated under the free-ratio model using PAML and the number of sites (right) that showed destabilizing selection among the physicochemical properties tested using TreeSAAP. A ω >1 suggests that positive selection has acted along that lineage; a ω = 0 reflects the absence of nonsynonymous substitutions. Below the branches, grey boxes list the sites showing destabilizing selection belonging to important functional regions of the protein (see table 3). The analysis was conducted using an unrooted topology (the topology is rooted for convenience). B) Nucleotide substitutions that occurred among great apes and between orangutan and rhesus monkey. The numbers refer to codon positions; for the nonsynonymous substitutions is listed the corresponding amino acid changes.