| Literature DB >> 33312190 |
Afiya Razia Chida1, Samathmika Ravi1, Suvratha Jayaprasad1, Kiran Paul1, Jaysmita Saha1, Chinjusha Suresh1, Saurabh Whadgar1, Naveen Kumar2, Raksha Rao K1, Chaitali Ghosh2, Bibha Choudhary1, Suresh Subramani2, Subhashini Srinivasan1,2.
Abstract
Malaria remains a major healthcare risk to growing economies like India, and a chromosome-level reference genome of Anopheles stephensi is critical for successful vector management and understanding of vector evolution using comparative genomics. We report chromosome-level assemblies of an Indian strain, STE2, and a Pakistani strain SDA-500 by combining draft genomes of the two strains using a homology-based iterative approach. The resulting assembly IndV3/PakV3 with L50 of 9/12 and N50 6.3/6.9 Mb had scaffolds long enough for building 90% of the euchromatic regions of the three chromosomes, IndV3s/PakV3s, using low-resolution physical markers and enabled the generation of the next version of genome assemblies, IndV4/PakV4, using HiC data. We have validated these assemblies using contact maps against publicly available HiC raw data from two strains including STE2 and another lab strain of An. stephensi from UCI and compare the quality of the assemblies with other assemblies made available as preprints since the submission of the manuscript. We show that the IndV3s and IndV4 assemblies are sensitive in identifying a homozygous 2Rb inversion in the UCI strain and a 2Rb polymorphism in the STE2 strain. Multiple tandem copies of CYP6a14, 4c1, and 4c21 genes, implicated in insecticide resistance, lie within this inversion locus. Comparison of assembled genomes suggests a variation of 1 in 81 positions between the UCI and STE2 lab strains, 1 in 82 between SDA-500 and UCI strain, and 1 in 113 between SDA-500 and STE2 strains of An. stephensi, which are closer than 1 in 68 variations among individuals from two other lab strains sequenced and reported here. Based on the developmental transcriptome and orthology of all the 54 olfactory receptors (ORs) to those of other Anopheles species, we identify an OR with the potential for host recognition in the genus Anopheles. A comparative analysis of An. stephensi genomes with the completed genomes of a few other Anopheles species suggests limited inter-chromosomal gene flow and loss of synteny within chromosomal arms even among the closely related species.Entities:
Keywords: comparative genomics; cytochromeP450; developmental transcriptome; gene expression profile; genome browser; homology-based assembly; olfactory receptors; simulated mate-pair
Year: 2020 PMID: 33312190 PMCID: PMC7703621 DOI: 10.3389/fgene.2020.565626
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Schematic diagram showing the iterative approach followed to improve the assemblies of both strains of An. stephensi.
Homology-based iterative assembly.
| Longest scaffold (Mb) | 5.9/3.4 | 21.3/9.8 | 24.3/17.1 | 47.5/48.9 | 10.2/7 | 58/62.5 |
| L50 | 37/85 | 11/22 | 9/12 | 3/4 | 16/38 | 2/2 |
| N50* (Mb) | 1.7/0.8 | 5.09/3.18 | 6.5/6.9 | 39.3/18.9 | 4.19/1.7 | 43/51.4 |
| Number of scaffolds | 7806/1110 | 7661/770 | 7657/742 | 7423/638 | 7677/873 | 5/5 |
| Assembly size (Mb) | 210/225 | 213/230 | 213/244 | 220/244.4 | 210/225 | 180/193.5 |
| Percent Ns | 5.62/12.95 | 6.9/14.88 | 6.91/19.71 | 7.14/19.73 | 5.6/12.9 | 6.44/16.93 |
FIGURE 2Generation of pseudomolecules. (A,B) Karyogram showing placement of scaffolds from IndV3 (top-left) and PakV3 (top-right) assemblies used to stitch chromosome level assemblies, IndV3s and PakV3s, with DNA marker location on scaffolds. (C) Dot plot of IndV3s against PakV3s.
FIGURE 3Homology-based scaffolding. (A) Scaffolds after homology-based scaffolding covering the range of physical markers for only 2R. (B) Karyogram of IndV4. (C) Dot plot of UCI assembly against all assemblies of the STE2 strain reported here including IndV3s (top), IndV4uci/IndV4ste2 (middle, only 3R is different), and IndV4qm (bottom). Inset shows contact maps of the 2R arm with HiC reads from the STE2 strain against the respective STE2 assemblies. The blue curved arrow shows the manually moved block. The blue strip, green, and peach circles highlight potential inversions against UCI assembly in respective STE2 assemblies.
FIGURE 4Validation of assemblies. (A) Synteny and dot plot of IndV3s against AsteI2_V4. (B) Synteny and dot plot of IndV4qm against AsteI2_V4 assembly. (C) Contact maps for 2R of IndV4qma (left) and AsteI2_V4 (right) using HiC data from the STE2 strain. (D) Contact maps of chromosome 2 of the UCI assembly against the two HiC datasets for the UCI and STE2 strains.
FIGURE 5(A) Density of cytochrome genes across the genome of An. stephensi UCI assembly, the cyan arc representing the 2Rb inversion locus and the red dots representing locations of cytochrome P450 genes across the genome. (B) SNP density at a window of 10,000 bases within STE2 (blue) and SDA-500 (red) strains compared to UCI strain (right) with purple arc showing 2Rb inversion locus. (C) Developmental expression profiles of the four copies of the CYP6a14 gene.
List of cytochromes P450 genes within the 2Rb inversion locus in both the UCI and the IndV3 assemblies.
| 4c1 | 3760, 4551, 4615, 4614, 4624 | 5865, 6996, 7096, 7095, 7108 |
| 4d2 | 3996, 3994, 3998 | 6192, 6192, 6183 |
| 4d14 | 3995 | 6192 |
| 6a13 | 4446 | 6841 |
| 6a2 | 4444, 4451, 4449 | 6839, 6845, 6843 |
| 6a14 | 4445, 4448, 4450, 4447 | 6840, 6843, 6844, 6842 |
| CYP-1 | 4512 | 6919 |
| 313a4 | 4616 | 7097 |
| 4c21 | 4626, 4622, 4625 | 7110, 7105, 7109 |
FIGURE 6Phylogenetic relationships of olfactory receptors. The 54 ORs of An. stephensi (black) predicted from IndV3s with the 42 ORs predicted for An. funestus (red). Embedded numbers in green are the accession IDs of An. gambiae ORs. The green dot represents ORs missing in An. funestus and expanded into five ORs in An. gambiae including accessions 13, 15, 16, 17, 55. The cyan dot represents ORs missing in An. stephensi, duplicated in tandem in An. funestus and duplicated in An. gambiae with accessions 43–44. The orange dot is missing in An. funestus and expanded into six ORs in An. gambiae with accession IDs of 69–74. The blue arrows and orange triangle show ORs of D. melanogaster based on their orthology to An. gambiae reported elsewhere (Hill et al., 2002).