| Literature DB >> 18647405 |
Riccardo Papa1, Clayton M Morrison, James R Walters, Brian A Counterman, Rui Chen, Georg Halder, Laura Ferguson, Nicola Chamberlain, Richard Ffrench-Constant, Durrell D Kapan, Chris D Jiggins, Robert D Reed, William O McMillan.
Abstract
BACKGROUND: With over 20 parapatric races differing in their warningly colored wing patterns, the butterfly Heliconius erato provides a fascinating example of an adaptive radiation. Together with matching races of its co-mimic Heliconius melpomene, H. erato also represents a textbook case of Müllerian mimicry, a phenomenon where common warning signals are shared amongst noxious organisms. It is of great interest to identify the specific genes that control the mimetic wing patterns of H. erato and H. melpomene. To this end we have undertaken comparative mapping and targeted genomic sequencing in both species. This paper reports on a comparative analysis of genomic sequences linked to color pattern mimicry genes in Heliconius.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18647405 PMCID: PMC2515155 DOI: 10.1186/1471-2164-9-345
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Cross design and wing phenotypes. Color pattern phenotypes observed in crosses between 'grand-parental' H. himera (middle) and H. erato cyrbia (left) and H. erato notabilis (right) resulting in two pairs of F1 parents with females on left, males on right. Each pair of F1 parents produced F2s with H. himera × H. erato cyrbia offspring (A) and the H. himera × H. erato notabilis offspring (B). In both F2 families, the observed phenotypic differences among individuals are consistent with the interaction of two co-dominant loci, as described in the Methods.
Figure 2. Annotation of the BAC sequence (clone BBAM-25K4, accession number AC216670) tightly linked to the D color pattern gene. Starting from the right: A) 13.5 cM interval Linkage analysis of LG18, with the gene that control the red pigment (D locus); B) fingerprinting of the positive clones obtained by probing the AFLP CC-CAC-491 (dotted bar); C) sequence analysis of the BAC clone 25_K04, where black circles represent hypothetical ORFs greater than 60 amino acids, with the larger circles representing putative ORFs greater than 150 amino acids. Within each bar, the grey areas indicate repetitive sequence with the black regions indicating exon/intron structure of 2 predicted proteins (with arrow indicating direction) showing a high similarity to known proteins in other arthropods. For gene annotations see Table 2.
Figure 3Fine-scale synteny and sequence conservation between . Top bar represents approximately 180 kb of sequence covering two BACs (clone AEHM-41C10 accession number CR974474; and clone AEHM-7G12 accession number CT955980) for H. melpomene and the bottom bar represents 180 kb of sequence in two large contigs for H. erato (clone BBAM-38A20, accession numbers AC193804). Black circles below bars represent hypothetical open reading frames (ORFs) greater than 60 amino acids, with the larger circles representing putative ORFs greater than 150 amino acids. Within each bar, the grey areas indicate repetitive sequence with the black regions indicating exon/intron structure of 13 predicted proteins (with arrow above bar indicating direction) showing a high similarity to known proteins in other arthropods. A visual representation of the global alignment between the two genomic sequences and the level of synteny is show at the bottom of the figure. The lines between the two sequences unite regions with high sequence identity (>85% of similarity). For gene annotations see Table 2.
Figure 4Gene conservation between . Comparison of homologous genomic regions linked to the Cr color pattern gene in H. erato and Yb color pattern gene in H. melpomene. The top bar represents approximately 280 kb of sequence covering three BACs (clone AEHM-41C10 accession number CR974474, clone AEHM-7G12 accession number CT955980, and clone AEHM-11J7 accession number CU367882) for H. melpomene. The bottom bar represents approximately 350 kb of sequence covering three BACs (BBAM-12K4 accession number AC220074, BBAM-38A20 accession number AC193804, and BBAM-27D18 accession number AC199750). Genes showing a high similarity to know proteins in other arthropods are represented with black squares and arrows indicating orientation. The lines between the two sequences unite the homologous genes giving a visual representation of the overall synteny between the two species.
(A, B) – Heliconius repetitive elements.
| A. | ||||||||||||||||||
| MARINER_HC | MARI_BM | GYPSY70-I_AG | Sake_BM | DOC5_DM | G4_DM | ZENON_BM | ||||||||||||
| Mariner | Mariner | Gypsy | Daphne | Jockey | Jockey | CR1 | ||||||||||||
| DNA transposon | DNA transposon | LTR retroposon | LTR retroposon | Non-LTR retroposon | Non-LTR retroposon | Non-LTR retroposon | ||||||||||||
| 1255 | 1310 | 4858 | 5140 | 2791 | 3856 | 2599 | ||||||||||||
| era | mel | era | mel | era | mel | era | mel | era | mel | era | mel | era | mel | |||||
| # regions masked | 3 | NA | 3 | NA | NA | 1 | 2 | 1 | 5 | NA | 1 | NA | 5 | 1 | ||||
| Avg length of masked region (+/- Std.Dev.) | 85.33 (51.03) | NA | 159.33 (28.11) | NA | NA | 2062 | 253 (173.95) | 73 | 440 (275.77) | NA | 1243 | NA | 520.8 (618.74) | 44 | ||||
| Min length of masked region | 46 | NA | 127 | NA | NA | NA | 46 | NA | 245 | NA | NA | NA | 94 | NA | ||||
| Max length of masked region | 143 | NA | 178 | NA | NA | NA | 143 | NA | 635 | NA | NA | NA | 1574 | NA | ||||
| Total nucleotide masked | 256 | NA | 478 | NA | NA | 2062 | 256 | 73 | 880 | NA | 1243 | NA | 2604 | 44 | ||||
| Proportion of BACs masked | 0.07% | NA | 0.14% | NA | NA | 1.05% | 0.07% | 0.04% | 0.25% | NA | 0.35% | NA | 0.74% | 0.02% | ||||
| B. | ||||||||||||||||||
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 8 | ||||||||||
| era | mel | era | mel | era | mel | era | mel | era | mel | era | mel | era | mel | era | mel | era | mel | |
| Length of core motif | 469 | NA | 460 | 598 | 428 | NA | 384 | NA | 314 | NA | 259 | NA | 266 | NA* | 147 | NA | NA | 306 |
| # regions masked | 16 | 2 | 7 | 11 | 29 | 1 | 5 | NA | 11 | NA | 17 | 4 | 310 | 129 | 40 | 9 | NA | 10 |
| Mean length of masked region (Std.Dev.) | 225. (121.71) | 70.5 (16.26) | 415 (118.3) | 406.27 (302.12) | 166.69 (127.26) | 119 (NA) | 341.8 (100.7) | NA | 189.36 (90.64) | NA | 125.59 (92.66) | 76.75 (38.87) | 155.25 (70.85) | 129.95 (64.26) | 89.1 (27.99) | 80.11 (21.98) | NA | 244.1 (96.74) |
| Minimum length of masked region | 69 | 59 | 147 | 29 | 37 | NA | 162 | NA | 38 | NA | 28 | 55 | 38 | 40 | 38 | 55 | NA | 46 |
| Maximum length of masked region | 439 | 82 | 470 | 740 | 449 | NA | 394 | NA | 314 | NA | 272 | 135 | 287 | 280 | 148 | 116 | NA | 314 |
| Total nucleotide masked | 3603 | 141 | 2905 | 4469 | 4834 | 119 | 1709 | NA | 2083 | NA | 2135 | 307 | 48126 | 16763 | 3564 | 721 | NA | 2441 |
| Amount of BACs masked | 1.0% | 0.1% | 0.8% | 2.3% | 1.4% | 0.1% | 0.5% | NA | 0.6% | NA | 0.6% | 0.2% | 13.6% | 8.5% | 1.0% | 0.4% | NA | 1.2% |
Summary of identification and repeat masking of repetitive elements in BAC sequences from H. erato (accession numbers: AC193804, AC216670) and H. melpomene (accession numbers: CR974474, CT955980). A) Repetitive elements identified via similarity to previously described sequences. B) Novel repetitive elements unique to Heliconius identified de novo using the RepeatMasker software (additional file 1: Novel repetitive elements in Heliconius (row sequences)). Each core motif is a summary sequence representing a different and unique set of related repeat sequences. Motifs four and five were not observed in H. melpomene; Motif nine was not oberseved in H. erato. Core Motifs 1,2,3,4,5,6,7 and 8 were identified by RepeatFinder only in H. erato but masked by RepeatMasker in H. melpomene. (* RepeatFinder identified fragments of this motif in H. melpomene, but the groups did not overlap to give a core motif comparable to that in H. erato).
BAC annotation summary.
| Predicted gene | BAC Accession number | Organism | Identities | E value | ||
| H. erato | H. melpomene | B. mori | ||||
| Putative reverse transcriptase (RVT) | NA | CR974474 | 2529(818322–834985;7e-13)* | Medicago truncatula | 48% | e-140 |
| HEC01402 (DT665615) | NA | CR974474 | 2795(2232547–2233374;9e-26)* | Heliconius erato | 92% | 0.0 |
| Putative reverse transcriptase (RVT) | NA | CR974474 | 2136(919260–923481;1e-10)* | Aedes aegypti | 29% | 5e-76 |
| Galactokinase | AC193804/AC220074 | NA | 2829(2954764–2958794;5e-43)* | Xenopus laevis | 42% | 1e-61 |
| Rab geranylgeranyl transferase b subunit (βggt-II) | AC193804/AC220074 | CR974474 | 2829(2933018–2938173;e-129)* | Danio rerio | 68% | 1e-98 |
| Glucose dehydrogenase (GDeh) | AC193804/AC220074 | CR974474 | 2829(2929161–2931568;e-111)* | Aedes aegypti | 36% | 3e-91 |
| Forkhead box J1 (F-head) | AC193804/AC220074 | CR974474 | 2829(2921212–2923967;e-123)* | Tribolium castaneum | 43% | 3e-57 |
| Acylamino-acid-releasing enzyme (AARE) | AC193804/AC220074 | CR974474 | 2829(2895678–2909890;e-125)* | Tribolium castaneum | 39% | 2e-84 |
| Putative reverse transcriptase (RVT) | AC193804 | NA | 3058(6735155–6740613;e-100)* | Drosophila simulans | 42% | 0.0 |
| Acylamino-acid-releasing enzyme (AARE) | AC193804 | CR974474/CT955980 | 2829(2910836–2912522;3e-65)* | Tribolium castaneum | 50% | 2e-48 |
| Trehalase (Treh1) | AC193804 | CT955980 | 2829(2866386–2868122;0.0)* | Bombyx mori | 50% | e-158 |
| Trehalase (Treh2) | AC193804 | CT955980 | 2829(2861182–2862921;0.0)* | Bombyx mori | 60% | 0.0 |
| HEC03006 (DT668569) | AC193804 | NA | 3058(4660174–4667549;2e-08) | Heliconius erato | 96% | 0.0 |
| Methionine-rich storage protein 2 (MRSP) | AC216670 | NA | 3026(2396448–2401216;0.0)* | Manduca sexta | 66% | 0.0 |
| HEC00815 (DT661873) | AC216670 | NA | NA | Heliconius erato | 96% | 0.0 |
| B9 | AC199750 | CT955980 | 2829(2830788–2833491;3e-79)* | Danio rerio | 39% | 3e-33 |
| Unkempt (Unk) | AC199750 | CU367882 | 2829(2748736–2755694;e-110)* | Aedes aegypti | 76% | e-102 |
| Putative DNA helicase recQ (Helic) | AC199750 | CU367882 | 2829(2870771–2872660;0.0)* | Apis mellifera | 51% | e-173 |
| Beta fructosidase FruA (βfruct) | AC199750 | CU367882 | 2829(2704806–2706326;e-176)* | Apis mellifera | 36% | 7e-74 |
Annotation summary for four H. erato BACs (accession numbers: AC193804, AC216670, 220074, 199750). Predicted gene function and BLASTp data (species hit, % identity, and e-value) are reported, as are accession numbers of the H. erato, H. melpomene, and B. mori genomic sequences containing the predicted genes. (* scaffold ID, position and e value as seen in the silkworm genome database: SilkDB)
Figure 5Conservation of gene order and distances between . VISTA analysis shows sequence conservation between coding regions in the Cr-linked H. erato BAC clones (BBAM-38A20, accession number AC193804; BBAM-27D18, accession number AC199750 and BBAM-12K4, accession number AC220074) and the B. mori scaffold sequence 2829 (position: 2706326–2993165). Genes showing a high similarity to know proteins in other arthropods are represented with black squares and arrows indicating orientation (when orientation is different arrows are displayed for both species). The lines between the two sequences unite the homologous genes giving a visual representation of the overall synteny between the two species.