| Literature DB >> 32883756 |
Austin Compton1,2, Jiangtao Liang2,3, Chujia Chen2,4, Varvara Lukyanchikova2,3,5, Yumin Qi6,2, Mark Potters6,2, Robert Settlage7, Dustin Miller8, Stéphane Deschamps9, Chunhong Mao10, Victor Llaca9, Igor V Sharakhov2,3,4,11, Zhijian Tu1,2,4.
Abstract
Chromosome level assemblies are accumulating in various taxonomic groups including mosquitoes. However, even in the few reference-quality mosquito assemblies, a significant portion of the heterochromatic regions including telomeres remain unresolved. Here we produce a de novo assembly of the New World malaria mosquito, Anopheles albimanus by integrating Oxford Nanopore sequencing, Illumina, Hi-C and optical mapping. This 172.6 Mbps female assembly, which we call AalbS3, is obtained by scaffolding polished large contigs (contig N50 = 13.7 Mbps) into three chromosomes. All chromosome arms end with telomeric repeats, which is the first in mosquito assemblies and represents a significant step toward the completion of a genome assembly. These telomeres consist of tandem repeats of a novel 30-32 bp Telomeric Repeat Unit (TRU) and are confirmed by analyzing the termini of long reads and through both chromosomal in situ hybridization and a Bal31 sensitivity assay. The AalbS3 assembly included previously uncharacterized centromeric and rDNA clusters and more than doubled the content of transposable elements and other repetitive sequences. This telomere-to-telomere assembly, although still containing gaps, represents a significant step toward resolving biologically important but previously hidden genomic components. The comparison of different scaffolding methods will also inform future efforts to obtain reference-quality genomes for other mosquito species.Entities:
Keywords: Anopheles albimanus; Bionano; Genome assembly; Hi-C; Oxford Nanopore
Mesh:
Year: 2020 PMID: 32883756 PMCID: PMC7534423 DOI: 10.1534/g3.120.401654
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.542
Figure 1Sample and data collection scheme. The figure shows the cross of a single male and female mosquito to produce female F1 offspring. The F0 father and mother were sequenced individually using Illumina HiSeq to obtain short reads. Genomic DNA from a pool of the female F1 sibling was sequenced using three Oxford Nanopore MinION flow cells to produce long reads. Genomic DNA from a pool of the paternal half-sibling F1 males was used to generate Bionano DLS optical mapping. Oxford Nanopore reads were used to generate the contigs using Canu (Koren ) and the Canu contigs were polished using the parental Illumina reads. The polished Canu contigs were scaffolded using the Bionano DLS optical map to generate the AalbS3 assembly.
Figure 2A) Chromosome level hybrid scaffolds produced by aligning polished Nanopore contigs to Bionano optical maps. The orientation of the hybrid scaffolds as shown in the figure from left to right: Chr2 R to L, Chr3 L to R, and X centromere to telomere. B) Hi-C contact matrix for the three chromosomes of the hybrid assembly. The order of the three An. albimanus chromosomes, shown in green boxes, are 2, X, and 3. Note that the final AalbS3 assembly includes the rDNA clusters and centromeric repeats which are not shown in the figure.
Comparison overview of AalbS2 and AalbS3
| AalbS2 | AalbS3 | |
|---|---|---|
| Total Length | 173.3 Mb | 172.6 Mb |
| Total Length Excluding Ns | 163.5 Mb | 170.9 Mb |
| Contig N50 | 0.2 Mb | 13.7 Mb |
| Number of scaffolds | 5 chromosome arms plus 196 scaffolds | 3 chromosomes plus 4 other scaffolds |
| Scaffold N50 | 37.9 Mb | 89 Mb |
| Telomere | Not detected | Found at the end of all five chromosome arms |
| rDNA cluster | Not detected | An 800 kb scaffold containing the rDNA cluster |
| BUSCO | 98.9% | 99.0% |
| Total Repeat Content | 2.20% | 5.06% |
Figure 3Alignments of variants of the telomeric repeat unit (TRU, panel A) and the principle underlying the strategy to verify telomere sequences using long reads (panels B-E). T: telomere, which is a tandem repeat of TRUs, or (TRU)n; S: single- or low-copy DNA sequences; R1-R5: five copies of an interspersed repeat.
ONT Telomeric long reads
| Chromosome Arm | Number of Reads beginning with TRU | Number of Reads terminating with TRU | Average Length of Reads with TRU |
|---|---|---|---|
| X | 6 | 4 | 60.56 |
| 2R | 6 | 9 | 59.86 |
| 2L | 4 | 19 | 53.72 |
| 3R | 17 | 7 | 54.53 |
| 3L | 4 | 7 | 55.75 |
Only reads longer than 40 kb were analyzed.
Figure 4Fluorescence in situ hybridization and mapping of the An. albimanus telomeric oligonucleotide probe on polytene chromosomes from the 4th instar larva. (A) Left panel: Chromosomes hybridized with Cy3-labeled oligonucleotide probe (red) and counterstained with the fluorophore YOYO-1 (green). Right panel: An inverted grayscale image of the FISH results of the oligonucleotide probe. Chromosome arms are labeled as X, 2R, 2L, 3R, and 3L; the chromocenter is labeled by CC; the nucleolus is labeled by N. Scale bar – 10μm.(B). An enlarged view of representative chromosome arms hybridized with Cy3-labeled oligonucleotide probe (red).
Figure 5Bal31 PFGE and Southern Blot. HMW genomic DNA from An. albimanus pupae was digested with Bal31 exonuclease at time point increments followed by restriction enzyme digestion with XbaI then separated by Pulsed-Field Gel Electrophoresis (left). The separated DNA was transferred to charged Nylon membrane via downward capillary action, hybridized with a DIG- labeled telomeric probe, and detected using chemiluminescence (right).