| Literature DB >> 27659211 |
Alexie Papanicolaou1, Marc F Schetelig2, Peter Arensburger3, Peter W Atkinson4,5, Joshua B Benoit6, Kostas Bourtzis7,8, Pedro Castañera9, John P Cavanaugh6, Hsu Chao10, Christopher Childers11, Ingrid Curril12, Huyen Dinh10, HarshaVardhan Doddapaneni10, Amanda Dolan13, Shannon Dugan10, Markus Friedrich14, Giuliano Gasperi15, Scott Geib16, Georgios Georgakilas17, Richard A Gibbs10, Sarah D Giers18, Ludvik M Gomulski15, Miguel González-Guzmán9, Ana Guillem-Amat9, Yi Han10, Artemis G Hatzigeorgiou17, Pedro Hernández-Crespo9, Daniel S T Hughes10, Jeffery W Jones19, Dimitra Karagkouni17, Panagiota Koskinioti20, Sandra L Lee10, Anna R Malacrida15, Mosè Manni15, Kostas Mathiopoulos20, Angela Meccariello21, Shwetha C Murali10, Terence D Murphy22, Donna M Muzny10, Georg Oberhofer12, Félix Ortego9, Maria D Paraskevopoulou17, Monica Poelchau11, Jiaxin Qu10, Martin Reczko23, Hugh M Robertson18, Andrew J Rosendale6, Andrew E Rosselot6, Giuseppe Saccone21, Marco Salvemini21, Grazia Savini15, Patrick Schreiner5, Francesca Scolari15, Paolo Siciliano15, Sheina B Sim16, George Tsiamis8, Enric Ureña9, Ioannis S Vlachos17, John H Werren13, Ernst A Wimmer12, Kim C Worley10, Antigone Zacharopoulou24, Stephen Richards10, Alfred M Handler25.
Abstract
BACKGROUND: The Mediterranean fruit fly (medfly), Ceratitis capitata, is a major destructive insect pest due to its broad host range, which includes hundreds of fruits and vegetables. It exhibits a unique ability to invade and adapt to ecological niches throughout tropical and subtropical regions of the world, though medfly infestations have been prevented and controlled by the sterile insect technique (SIT) as part of integrated pest management programs (IPMs). The genetic analysis and manipulation of medfly has been subject to intensive study in an effort to improve SIT efficacy and other aspects of IPM control.Entities:
Keywords: Chromosomal synteny; Gene family evolution; Insect adaptation; Insect invasiveness; Insect orthology; Medfly genome; Medfly integrated pest management (IPM); Tephritid genomics
Mesh:
Year: 2016 PMID: 27659211 PMCID: PMC5034548 DOI: 10.1186/s13059-016-1049-2
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Medfly genome assembly metrics for NCBI Genome assembly accession GCA_000347755.2 that replaces assembly GCA_000347755.1 after removal of bacterial contaminant sequences
| Genome assembly | Contigs (n) | 25,233 |
| Contig N50 | 45,879 bp | |
| Scaffolds (n) | 1806 | |
| Scaffold N50 | 4,118,346 bp | |
| Size of final assembly | 479,047,742 | |
| Size of final assembly - without gaps | 440,703,716 bp | |
| NCBI Genome Assembly Accession | GCA_000347755.2 |
Medfly genome NCBI annotation features for the assembly Ccap_1.0 (see http://www.ncbi.nlm.nih.gov/genome/annotation_euk/Ceratitis_capitata/101/ for details and additional features)
| Feature | Count | Mean length (bp) |
|---|---|---|
| Genes and pseudogenes | 14,652 | _ |
| Protein-coding | 14,162 | _ |
| Non-coding | 385 | _ |
| Pseudogenes | 105 | _ |
| Genes with variants | 3527 | _ |
| Genes | 14,547 | 16,014 |
| All transcripts | 24,125 | 2903 |
| Messenger RNA | 23,075 | 2979 |
| Miscellaneous RNA | 238 | 3506 |
| Transfer RNA | 416 | 74 |
| Long non-coding RNA | 396 | 1074 |
| Single-exon transcripts | 2833 | 1193 |
| CDSs | 23,075 | 2198 |
| Exons | 77,742 | 465 |
| Introns | 62,132 | 4117 |
Fig. 1Genome-wide phylogenomics and orthology. The phylogenetic relationship of C. capitata and 13 species in Arthropoda was estimated using a maximum likelihood analysis of a concatenation of 2591 single-copy orthologous protein sequences, 1000 bootstrap replicates, and rooted with D. pulex. The scale bar represents 0.1 amino acid substitution per site and the asterisks represent nodes with a bootstrap value of 100. Horizontal bars for each species show the absolute number of proteins that are: single-copy orthologs in all species, present in all species (not necessarily in single-copy), present in the majority of species in the analysis, present in a minority of the species (patchy distribution) in the analysis, and unique to the species. Species/strain designations are: Acyrthosiphon pisum (AcP), Aedes aegypti (strain Liverpool) (AeA), Anopheles gambiae (strain PEST) (AnG), Apis mellifera (ApM), Bombyx mori (BoM), Ceratitis capitata (CeC), Cimex lectularius (CiL), Culex quinquefasciatus (strain Johannesburg) (CuQ), Daphnia pulex (DaP), Drosophila melanogaster (DrM), Manduca sexta (MaS), Musca domestica (MuD), Pediculus humanus (PeH), Solenopsis invicta (SoI), and Tribolium castaneum (TrC)
Fig. 2C. capitata genome scaffold map based on scaffold linkage of annotated genes and microsatellite (Medflymic) sequences previously localized to map banding positions by in situ hybridization to autosomal polytene chromosomes (chromosomes 2 to 6). The larval salivary gland polytene chromosome map [193] presented includes left (L) and right (R) autosomal chromosome arms linked at a centromeric region (K). Arrows with adjacent scaffold numbers point to mapped loci positions of designated genes/microsatellites, with bracketed positions used for less precise mapping. See Additional file 2: Table S6 for sequence and scaffold accession numbers and sizes, in addition to map positions for sex-linked (chromosome 1; X and Y) genes/sequences mapped to undefined loci on mitotic non-polytenized chromosome spreads [20, 21]
Fig. 3In situ hybridization mapping of piggyBac transformation vector insertions on chromosome 5 having the D53 inversion used in the VIENNA-8 tsl genetic sexing strain. Top: a schematic of chromosome 5 showing the piggyBac vector insertion sites along with other mapped genes and the D53 inversion breakpoints. Bottom: images of the yellow fluorescent-tagged hybridization site loci (arrows) on third larval instar salivary gland polytene chromosome spreads
Fig. 4Comparison of gene numbers for odorant-binding proteins (OBPs), odorant receptors (ORs), gustatory receptors (GRs), and ionotrophic receptors (IRs) in C. capitata, D. melanogaster, and M. domestica. Gene numbers provided above each bar
Fig. 5Phylogenetic relationships of OR proteins from C. capitata, D. melanogaster, and M. domestica. The unrooted maximum likelihood (log likelihood = –140908) tree was inferred using the Le and Gascuel model [208] with a discrete Gamma distribution and some invariable sites. Bootstrap values greater than 50 % (1000 replications) are shown. Suffixes after the gene/protein names are: -CTE, C-terminus missing; -PSE, pseudogene
Fig. 6Phylogenetic tree of C. capitata GR proteins with those from D. melanogaster and M. domestica. The maximum likelihood tree was rooted by assigning the carbon dioxide and sugar receptor subfamilies as the outgroup. Clades discussed in the text are indicated on the outer edge
Fig. 7Comparison of predicted AQP amino acid sequences from C. capitata and other indicated Diptera. The neighbor-joining tree was produced using MEGA6 using Dayhoff Model and pairwise matching; branch values indicate support following 3000 bootstraps with values below 20 % omitted. Classification is based upon Finn et al., Benoit et al., and Fabrick et al. [87, 88, 226]. Drip Drosophila integral protein, Prip Pyrocoelia rufa integral protein, Eglp entomoglyceroporin, AQP aquaporin. Unorthodox AQPs are not included in this analysis
Fig. 8Bootstrap PhyML tree (http://phylogeny.lirmm.fr/) performed with protein sequences of the CYP3 and mitochondrial clans of cytochrome P450 genes found in the genome of C. capitata (red) and D. melanogaster (blue). Expanded CYP6 and CYP12 subfamilies are highlighted. Branch length scale indicates average residue substitutions per site
Fig. 9Bootstrap PhyML tree of C. capitata (red) and D. melanogaster (blue) esterase protein sequences (http://phylogeny.lirmm.fr/). Clades are indicated by letters, A–N. Branch length scale indicates average residue substitutions per site
Fig. 10Phylogenetic tree demonstrating relationships of Tweedle proteins from Ceratitis capitata (Cc), Drosophila melanogaster (Dm), Musca domestica (Md), Anopheles gambiae (Ag), Aedes aegypti (Aa), Glossina morsitans (Gm), Culex quinquefasciatus (Cq), Cimex lectularius (Cl), Rhodnius prolixus (Rp), Tribolium castaneum (Tc), Pediculus humanus (Ph), and Acyrthosiphon pisum (Ap). The tree was constructed using the neighbor-joining method in MEGA6; Poisson correction and bootstrap replicates (2000 replicates) were used
Fig. 11Pro-apoptotic RHG gene group syntenic relationships and relative distances in D. melanogaster (top) and C. capitata (bottom). A comparison between the RHG regions, including the hid, grim, rpr (reaper), and skl (sickle) genes, located on chromosome 3L (75C) in D. melanogaster and on chromosome 6R (scaffold 2; NW_004523691) in C. capitata, reveals a similar organization of genes in the two species. The RHG region in C. capitata is 2.9-fold larger relative to D. melanogaster, which correlates approximately to the relative total genome size of the two species