| Literature DB >> 27821634 |
Gleb N Artemov1, Ashley N Peery2, Xiaofang Jiang3,4, Zhijian Tu3,4, Vladimir N Stegniy1, Maria V Sharakhova5,2, Igor V Sharakhov5,2,3.
Abstract
The genome of the Neotropical malaria vector Anopheles albimanus was sequenced as part of the 16 Anopheles Genomes Project published in 2015. The draft assembly of this species consisted of 204 scaffolds with an N50 scaffold size of 18.1 Mb and a total assembly size of 170.5 Mb. It was among the smallest genomes with the longest scaffolds in the 16 Anopheles species cluster, making An. albimanus the logical choice for anchoring the genome assembly to chromosomes. In this study, we developed a high-resolution cytogenetic photomap with completely straightened polytene chromosomes from the salivary glands of the mosquito larvae. Based on this photomap, we constructed a chromosome-based genome assembly using fluorescent in situ hybridization of PCR-amplified DNA probes. Our physical mapping, assisted by an ortholog-based bioinformatics approach, identified and corrected nine misassemblies in five large genomic scaffolds. Misassemblies mostly occurred in junctions between contigs. Our comparative analysis of scaffolds with the An. gambiae genome detected multiple genetic exchanges between pericentromeric regions of chromosomal arms caused by partial-arm translocations. The final map consists of 40 ordered genomic scaffolds and corrected fragments of misassembled scaffolds. The An. albimanus physical map comprises 98.2% of the total genome assembly and represents the most complete genome map among mosquito species. This study demonstrates that physical mapping is a powerful tool for correcting errors in draft genome assemblies and for creating chromosome-anchored reference genomes.Entities:
Keywords: Anopheles albimanus; fluorescence in situ hybridization; mosquito genome; physical mapping; polytene chromosomes
Mesh:
Year: 2017 PMID: 27821634 PMCID: PMC5217105 DOI: 10.1534/g3.116.034959
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1A phase-contrast image of well-polytenized chromosomes from salivary glands of An. albimanus larva. Chromosome arm names are indicated as X, 2R, 2L, 3R, and 3L; the chromocenter is shown as CC.
Figure 2A high-resolution cytogenetic map and physical genome map for An. albimanus. Numbered divisions and lettered subdivisions are shown below the chromosome images. Horizontal lines and arrows indicate the order and orientation of genomic scaffolds. The names of genomic scaffolds are shown above horizontal lines. Fragments of misassembled scaffolds are marked by a scaffold name followed by a capital letter. The start and end positions of the genomic scaffolds are shown by vertical arrows corresponding to mapped FISH probes. The dotted horizontal line in 3R:34B indicates a predicted adjacency of scaffolds KB672364 and KB672405.
Misassemblies within the An. albimanus genome
| Fragments of Misassembled Scaffold | Size (bp) | Coordinates Within Original Scaffolds | Chromosomal Location | Orientation | Predicted by HMM |
|---|---|---|---|---|---|
| KB672287A | 2,754,385 | 1–2,754,385 | 3R:26AB | + | No |
| KB672287B | 360,489 | 2,754,680–3,115,168 | 3R:34A | + | No |
| KB672298A | 426,774 | 1–426,774 | 2R:12A | + | Yes |
| KB672298B | 2,468,322 | 427,598–2,895,919 | 3R:32AC | − | Yes |
| KB672397A | 11,932,447 | 1–11,932,447 | 2L:23B-25B | − | Yes |
| KB672397B | 4,937,263 | 11,985,224–16,922,486 | 2R:10C-11C | − | No |
| KB672397C | 6,174,178 | 16,942,420–23,116,597 | 2R:9A-10B | − | No |
| KB672397D | 910,984 | 23,155,150–24,066,133 | 2R:14BC | − | No |
| KB672435A | 3,408,218 | 1–3,408,218 | 2L:16A-17B | + | Yes |
| KB672435B | 7,248,733 | 3,505,816–10,754,548 | 2R:12A-13A | − | Yes |
| KB672435С | 883,903 | 10,761,531–11,645,433 | 3R:33A | − | Yes |
| KB672435D | 3,727,725 | 11,655,887–15,383,611 | 2R:14C-15A | − | Yes |
| KB672468A | 5,915,397 | 1–5,915,397 | 2R:13C-14B | − | Yes |
| KB672468B | 136,759 | 6,032,887–6,169,645 | 3R:35B | ? | Yes |
Figure 3Physical mapping of misassembled fragments in scaffold KB672435. Fragments KB672435A, KB672435B, KB672435C, and KB672435D are found in 2L (A), in 2R (B), in 3R (C), and in 2R (D), respectively.
Figure 4Results of the HMM analysis and FISH mapping of scaffolds KB672353 and KB672375. Flags mark the start and the end positions of bioinformatically predicted “misassembled” fragments of genomic scaffolds KB672353 (A) and KB672375 (B). Colored boxes in (A and B) indicate positions of the genes used in FISH. FISH results demonstrate the lack of misassemblies in scaffolds KB672353 (C) and KB672375 (D).
Figure 5Localization of orthologous genes in pericentromeric regions of An. gambiae and An. albimanus. (A) The yellow and red shades demonstrate positions of orthologous genes in the e2 and e4 pericentromeric regions. (B) The blue and the green shades indicate gene positions in the e3 and e5 pericentromeric regions. Red arrows show predicted positions of rearrangement breakpoints in scaffolds KB672353 (A) and KB672375 (B). Red circles represent centromeres. The asterisk indicates scaffold KB672468B. Black arrows show locations of scaffolds and chromosome regions.
Proportions of An. albimanus polytene chromosomes and mapped genome
| Chromosome X | Arm 2R | Arm 2L | Arm 3R | Arm 3L | Total | |
|---|---|---|---|---|---|---|
| Average length, µm | 58.6 | 244.8 | 167.4 | 161 | 161 | 792.8 |
| Relative length, % | 7.4 | 30.9 | 21.1 | 20.3 | 20.3 | 100.0 |
| Mapped genome, Mb | 11.8 | 51.3 | 38.0 | 32.7 | 33.6 | 167.4 |
| Proportion of mapped genome, % | 7.1 | 30.6 | 22.7 | 19.5 | 20.1 | 100 |
Figure 6A comparison between the physical (left) and genetic (right) maps of the An. albimanus chromosome 2. Lines connect positions of the same markers on the physical and genetic maps.