Literature DB >> 30261042

Complete mitogenome of Anopheles sinensis and mitochondrial insertion segments in the nuclear genomes of 19 mosquito species.

Yi-Ran Ding1,2, Bo Li1,2, Yu-Juan Zhang1,2, Qi-Meng Mao1,2, Bin Chen1,2.   

Abstract

Anopheles sinensis is a major malarial vector in China and Southeast Asia. The mitochondria is involved in many important biological functions. Nuclear mitochondrial DNA segments (NUMTs) are common in eukaryotic organisms, but their characteristics are poorly understood. We sequenced and analyzed the complete mitochondrial (mt) genome of An. sinensis. The mt genome is 15,418 bp long and contains 13 protein-coding genes (PCGs), two rRNAs, 22 tRNAs and a large non-coding region. Its gene arrangement is similar to previously published mosquito mt genomes. We identified and analyzed the NUMTs of 19 mosquito species with both nuclear genomes and mt genome sequences. The number, total length and density of NUMTs are significantly correlated with genome size. About half of NUMTs are short (< 200 bp), but larger genomes can house longer NUMTs. NUMTs may help explain genome size expansion in mosquitoes. The expansion due to mitochondrial insertion segments is variable in different insect groups. PCGs are transferred to nuclear genomes at a higher frequency in mosquitoes, but NUMT origination is more different than in mammals. Larger-sized nuclear genomes have longer mt genome sequences in both mosquitoes and mammals. The study provides a foundation for the functional research of mitochondrial genes in An. sinensis and helps us understand the characteristics and origin of NUMTs and the potential contribution to genome expansion.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 30261042      PMCID: PMC6160108          DOI: 10.1371/journal.pone.0204667

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Mitochondria are eukaryotic cell organelles that are mainly involved in oxidative phosphorylation [1-2]. The conservation, easy alignment, maternal inheritance, and straightforward gene orthology of mitochondrial (mt) genomes have made the mt genome important for studies of phylogeny and evolution [3-6]. Mt genomes are sometimes associated with insecticide resistance. Several transcripts encoding enzymes such as NADH dehydrogenase and ATP synthase, which are involved in the production of energy within the respiratory chain, were over-expressed in Aedes aegypti larvae exposed to insecticides [7]. Anopheles sinensis is a major malarial vector in Asia [8], and its mt genome was partially sequenced (15,076 bp) and annotated using collections from Shandong, China [9]. An. sinensis is being used as a model species to study the molecular mechanism of insecticide resistance [10-11]. To elucidate the function of the mt genome in insecticide resistance, the mt genome need to be completely sequenced and annotated with a laboratory strain originally collected from Jiangsu, China. Nuclear mitochondrial DNA segments (NUMT) (mitochondrial DNA in the nuclear genome) exist widely in eukaryotes [12]. NUMTs probably arise from nonhomologous recombination of nuclear DNA with mt genome segments from damaged mitochondria [13,14]. The NUMTs found in the nuclear genome are highly similar to mtDNA sequences, so they are easily amplified using universal primers of mtDNA target sequence [15-17]. The existence of NUMTs does not only increases the difficulty of obtaining mtDNA target sequences, but also leads to incorrect conclusions in molecular identification [16-19]. An possible example is the molecular identification of Dendrocygna arcuate, which are completely different based on COI-NUMTs and COI-mtDNA, respectively [20]. At the same time, the transferred mtDNA sequences are thought to be molecular fossils in the nuclear genome and thus identification of NUMTs can help in phylogeny and evolution research [21]. Therefore, NUMTs can be useful molecular markers for the phylogenetic studies of Homo sapiens [22] and Arabidopsis thaliana [23]. Therefore, the genome-wide identification of NUMTs is very important for the molecular identification and phylogenetic study. NUMTs have been identified in several insect species, including Atta cephalotes [24], Podisma pedestris [25] and Sitobion miscanthi [26]. There is no NUMT in the An. gambiae genome [21], whereas NUMTs exist in Culex quinquefasciatus and Aedes aegypti genomes with the NUMT number and total length varying between the two species [27]. The genomes of An. gambiae [28], Cx. quinquefasciatus [29], Aedes aegypti [30] and Ae. albopictus [31] have been sequenced and annotated. Recently 16 Anopheles genomes have been reported [32]. The nuclear genome of An. sinensis has been sequenced at the Chongqing Normal University, China. We asked if the number, total length and density of NUMTs are correlated with genome size. Does the mt genome contribute to nuclear genome expansion? Which kinds of genes are most likely to be transferred to the nuclear genome? Sixteen mosquito species, with known phylogeny and known nuclear genome and mt genome sequences, provided us with an opportunity to answer these questions. We sequenced the complete mt genome of An. sinensis and analyzed its characteristics, including gene organization, base composition, codon usage, and tRNA secondary structure. We also studied the NUMTs of 19 mosquito species for which both mt genome and nuclear genome sequences are available. The NUMT characteristics, including NUMT number, position, and density in each nuclear genome were determined. This study is significant for the annotation of the complete mt genome of An. sinensis and for understanding NUMT status and characteristics in mosquitoes.

Materials and methods

Sequencing and analysis of An. sinensis mt genome

The An. sinensis sample for mt genome sequencing was from an An. sinensis pyrethroid-susceptible laboratory colony. This colony was established from individuals collected in Wuxi, Jiangsu Province, China and cultured at Chongqing Normal University, China. Mitochondrial genomic DNA was isolated from a single adult female using the sodium dodecyl sulfate (SDS)/proteinase K digestion method [33]. Mitochondrial DNA fragments, ranging in size from 822 bp to 1371 bp, were amplified by PCR using a primer set specific for mosquitoes [34]. A 25-μl PCR reaction was prepared with Takara Ex Taq polymerase (Takara, Japan) and 1.5 μl of 25 mM MgCl2, 2.5 μl of 10×PCR Buffer (Mg2+ free), 3 μl of a dNTP mixture (2.5 mM each), 1 μl of each 10 mM primer [34], 0.2 μl of 5 U/μl Taq polymerase and 1 μl of the mt genome DNA template. PCR thermal cycling included a 5 min initial denaturation at 94°C for 5 min, followed by 35 cycles of denaturation at 94°C for 1 min, annealing at 48°C–55°C for 45 s, and elongation at 68°C for 1 min, followed by a final elongation for 10 min at 72°C. The PCR products were electrophoresed on a 1.0% agarose gel and then purified using a QIAquick Gel Extraction Kit (QIAGEN, Hilden, Germany). The purified PCR products were directly sequenced using the primer sets except for the control region. The purified products were loaded into pMD-19T vectors, cloned, and then sequenced. All fragments were sequenced in both directions. The obtained sequences were edited using DANMAN (http://www.lynnon.com/) and were identified in reference to annotated mosquito mt genome sequences through alignment using Clustal X [35]. The sequences of PCGs were translated into amino acids using MEGA version 6.0 [36]. Almost all tRNAs were also recognized by the online tRNAscan-SE Search Server v.1.21 [37], and the tRNAs (tRNA, tRNA) that could not be found by tRNAscan-SE were confirmed by sequence homology comparison. The CR was examined for repeats and special structures with the aid of the Tandem Repeats Finder (http://tandem.bu.edu/trf/trf.html) [38]. The nucleotide composition was calculated using DNA Star (http://www.dnastar.com/). Codon usage bias was calculated using MEGA version 6.0 [36]. Strand asymmetry was evaluated by AT Skew and GC Skew using the formulae AT skew = [A%−T%]/ [A%+T%] and GC skew = [G%−C%]/ [G%+C%], respectively [39].

Nuclear mitochondrial DNA insertion analysis

Eighteen mosquito species’ nuclear genome sequences in FASTA format were downloaded from VectorBase (https://www.vectorbase.org/). The genome sequence of An. sinensis was obtained from Chongqing Normal University (unpublished data) (S1 Table). The An. sinensis genome has an assembly scaffold size of 194.49 Mb, with gene area coverage 98.90%. Of the 19 mosquito species with genomes analyzed for NUMTs, three species belonged to the Culicinae with two species in the genus Aedes and one in the genus Culex. The remaining 16 species were all in the genus Anopheles and belonged to the Anophelinae. The 18 previously determined mt genome sequences were downloaded from NCBI, and the mt genome sequence of An. sinensis was obtained in the present study (S1 Table). The NUMTs were identified using mt genome sequences to search against the nuclear genome sequence using BLASTN for each species. The significance threshold for the BLASTN search was set to E<10–4, with the window slide of 30 bp [40]. The number, total length and density (NUMTs per Mb of nuclear genome sequence) were calculated with Excel 2010 and used to measure the basic characteristics of NUMTs’ occurrence in the genome. Excel 2010 was also used to count the number and different lengths of the NUMTs. The statistical analyses were carried out using TIBCO Statistica (https://www.tibco.com/products/tibco-statistica). An R package was written to calculate the location and frequency of different areas of the mt genome sequences that had been transferred to the nuclear genome.

Results and discussion

An. sinensis mt genome and its organization

We determined the complete mt genome sequence of An. sinensis. It is a typical circular and double-stranded molecule of 15,418 bp (GenBank accession MF322628). The obtained mt genome sequence was 342 bp longer than an earlier report of the An. sinensis mt genome [9], and the difference was mainly due to earlier incomplete sequencing of the mt genome control region (CR). Other genes lengths in our assembly of An. sinensis mtgenome are same as previous assembly sequence (Genbank accession NC028016) expect for control region. The two mt genome sequences have a 98.6% similarity, and a total of 151 single nucleotide polymorphisms (SNPs) were identified between them, except for CR. The mt genome sequence of An. sinensis contains a conserved set of 37 genes, including 13 protein-coding genes (PCGs), two rRNA genes (lrRNA and srRNA), 22 tRNA genes (tRNAs) and a large non-coding region (CR, also known as the AT-rich region) (Fig 1). Twenty-three genes are located on the majority strand (J-strand), while 14 genes reside on the minority strand (N-strand) (Fig 1 and Table 1). The gene arrangement is the same as in previously published mosquito mt genomes, and the unique difference of the arrangement from other dipteran species mt genomes is that the latter have the order trnA-trnR [41]. This arrangement difference might be associated with different adaptations and evolutionary histories of mosquitoes [42].
Fig 1

Structure of the Anopheles sinensis mt genome.

The color-filled blocks indicate tRNAs, and the unfilled white blocks denote protein-coding genes (PCGs), rRNAs and control region (CR). The tRNAs, PCGs, rRNAs and CR located on the majority strand are marked in black, and those on the minority strand are marked in red.

Table 1

Organization of the Anopheles sinensis mt genome.

GenePositionStrandSize (bp)AnticodonStart codonStop codonNo. of intergenic nucleotides
tRNAIle1–68+68GAT
tRNAGln66–134-69TTG-3
tRNAMet134–202+69CAT-1
ND2203–1228+1026ATTTAA0
tRNATrp1227–1295+69TCA-2
tRNACys1295–1358-64GCA-1
tRNATyr1360–1425-66GTA1
COI1424–2965+1542TCGTAA-2
tRNALeu (UUR)2961–3026+66TAA-5
COII3028–3712+685ATGT1
tRNALys3713–3784+72CTT0
tRNAAsp3793–3861+69GTC8
ATP83862–4023+162ATTTAA0
ATP64017–4697+681ATGTAA7
COIII4697–5483+787ATGT-1
tRNAGly5484–5550+67TCC0
ND35551–5904+354ATATAA0
tRNAArg5903–5966+64TCG-2
tRNAAla5967–6032+66TGC0
tRNAAsn6033–6099+67GTT0
tRNASer (AGN)6102–6168-67GCT2
tRNAGlu6170–6235+66TTC1
tRNAPhe6234–6300-67GAA0
ND56301–8043-1743GTGTAA0
tRNAHis8044–8108-65GTG0
ND48106–9450-1345ATGT-3
ND4L9444–9743-300ATGTAA-7
tRNAThr9750–9814+65TGT6
tRNAPro9815–9880-66TGG0
ND69883–10407+527ATTTAA2
CYTB10407–11543+1137ATGTAA-1
tRNASer (UCN)11542–11607+66TGA-2
ND111628–12572-945ATTTAA20
tRNALeu (CUN)12579–12644-66TAG6
lrRNA12645–13972-13280
tRNAVal13973–14044-72TAC0
srRNA14045–14841-7970
CR14842–154185770

Structure of the Anopheles sinensis mt genome.

The color-filled blocks indicate tRNAs, and the unfilled white blocks denote protein-coding genes (PCGs), rRNAs and control region (CR). The tRNAs, PCGs, rRNAs and CR located on the majority strand are marked in black, and those on the minority strand are marked in red.

Characteristics of the An. sinensis mt genome

The A+T content of An. sinensis mt genome is 78.34%, and the A+T contents of PCGs, tRNAs, rRNAs and CR are 76.85%, 78.59%, 81.46% and 93.58%, respectively (Table 2). This result is similar to the universal feature presumed from earlier reported mosquito mt genomes [43-51] in that the CR has the highest A+T content, followed by rRNAs. For the 13 PCGs in the An. sinensis mt genome, the third codon position has a higher A+T content (94.24%), followed by the first codon position (69.46%) and the second codon position (66.84%). The results supports the data from other known mosquito mt genomes [43-51] in that the 3rd codon position has the highest A+T content, followed by the 1st codon position and 2nd codon position.
Table 2

Nucleotide composition of the An. sinensis mt genome.

Mitogenome regionT content%TC content%CA content%AG content%GA+T content%A+TAT SkewGC Skew
Whole genome38.1412.5040.209.1578.340.026-0.155
Protein-coding genes43.9411.0432.9012.1276.85-0.1440.047
First codon position37.7210.9331.7419.6169.46-0.0860.284
Second codon position46.3219.2120.5213.9666.84-0.386-0.158
Third codon position47.792.9746.452.7994.24-0.014-0.033
Protein-coding genes-J41.5213.0534.0811.3675.60-0.098-0.069
First codon position34.2113.1832.8519.7667.06-0.0200.200
Second codon position44.6821.5520.9912.7865.66-0.361-0.255
Third codon position45.684.4148.391.5394.070.029-0.485
Protein-coding genes-N47.797.8431.0413.3278.83-0.2120.259
First codon position43.307.3629.9819.3673.28-0.1820.449
Second codon position48.9215.4819.7815.8268.70-0.4240.011
Third codon position51.150.6943.374.7994.52-0.0820.747
tRNA genes39.029.2139.5712.2078.590.0070.140
tRNA genes-J37.9910.5340.7310.7678.720.0350.011
tRNA genes-N40.537.3137.8714.2978.40-0.0340.323
rRNA43.156.5438.3112.0081.46-0.0590.294
Control region49.913.9943.672.4393.58-0.067-0.243
AT-skew and GC-skew have also been widely used to measure the nucleotide compositional behaviors of mt genomes [42]. The AT skew and GC skew of the An. sinensis mt genome are 0.026 and -0.155, respectively (Table 2). The AT-skew values are positive, and the GC-skew values are negative for all other mosquito mt genomes [43-51], which indicated overall mt genome preference for the use of A and C. The PCGs of the An. sinensis mt genome show an overall negative AT-skew (-0.144) and positive GC-skew (0.047). It is a common phenomenon that the PCGs of mosquito mt genomes prefer to use T and G [43-51]. All 13 PCGs in the An. sinensis mt genome use ATN as the start codon, except for COI, which uses the special start codon TCG, and ND5, which uses GTG as the start codon. Use of GTG as a start codon has been documented for mtDNA-encoded proteins in various organisms, including Anopheles species [52]. All 13 PCGs use the complete stop codon TAA, except for COII, COIII and ND4, which use the incomplete T as a stop codon. There is no other mosquito species with a mt genome that uses TAG as a stop codon [43-51]. The usage bias of amino acids for the 13 PCGs was identified in the An. sinensis mt genome. Leu has the highest percentage (16.05%), followed by Phe (9.67%), Ser (9.30%) and Ile (9.24%), and Cys has the lowest percentage (1.10%) (S1 Fig). This order is similar to other mosquito mt genomes [43-51]. Leucine has an inferred high usage frequency, and as a hydrophobic amino acid, it can be a component of many transmembrane proteins in the mitochondria. There are a total of 3733 codons in the An. sinensis mt genome, excluding termination codons, which is within the codon number range of other insect mt genomes (3585–3746) [53]. For the relative synonymous codon usage (RSCU), UUA is the most used codon (RSCU value 5.28) in the An. sinensis mt genome, followed by CGA (3.24), UCA (2.60), GGA (2.50), UCU (2.50), CCU (2.24), GCU (2.14), GUU (2.10) and ACA (2.10), and ACG is the least used codon (0.02) (Fig 2). The third codon position has a higher usage frequency of A (46.45%) or U (47.79%) in the An. sinensis mt genome. This phenomenon is consistent with previously reported mosquito mt genomes [43-51].
Fig 2

Relative synonymous codon usage (RSCU) in the An. sinensis mt genome.

The RSCU values are represented on the y-axis, while the codon families for each amino acid are denoted on the x-axis.

Relative synonymous codon usage (RSCU) in the An. sinensis mt genome.

The RSCU values are represented on the y-axis, while the codon families for each amino acid are denoted on the x-axis. All tRNAs of the An. sinensis mt genome can folded into the typical clover-leaf secondary structure, except for tRNA, which has lost the DHU stem (S2 Fig). This is a common feature in metazoan mt genomes [42]. Consistent with other known mosquito mt genomes, all tRNA lengths range from 64 to 72 bp. There are 18 mismatches, all as GU base pairs, found in the 12 tRNAs of the An. sinensis mt genome (S2 Fig). The large subunit rRNA (16SrRNA), 1328 bp long, is located between ND1 and tRNA, and the small subunit rRNA (12SrRNA), 797 bp long, is between tRNA and control region, both on the minority strand. The total A+T content of rRNA genes is 81.46%. The CR plays an important role in the regulation of replication and the transcription of the mt genome [54-55]. The CR region of the An. sinensis mt genome is 577 bp long and is located between 12SrRNA and tRNA. The A+T content of this region (93.58%) is higher than other regions of the An. sinensis mt genome. There is a poly-T stretch of 18 bp to be identified, which may be a recognition site for the initiation of replication in the mt genome [56]. In addition, there are two 46 bp long tandem repeats found in the CR. The tandem repeat structures in CRs are common, but the length and tandem repeat time vary in the other known mosquito mt genomes [6].

NUMTs and their comparison in mosquitoes

We identified NUMTs of 19 mosquito species with both known mt genomes and nuclear genome sequences. In the subfamily Anophelinae, all 16 species investigated belong to the genus Anopheles. Out of the 16, 10 species with genome sizes 141.20–268.44 Mb have no NUMTs, and the remaining 6 species with genome sizes 132.94–217.57 Mb each have 1–3 NUMTs (Table 3). In the subfamily Culicinae, Cx. quinquefasciatus (genome size 574.57 Mb), Ae. aegypti (1,342.21 Mb) and Ae. albopictus (1,868.07 Mb) have 13, 122 and 196 NUMTs, respectively. This suggests that the NUMT number is significantly correlated with genome size (R = 0.9871, p = 5.71E-15).
Table 3

Number, total length and density of nuclear mitochondrial segments (NUMTs) in 19 mosquito species of nuclear genomes.

Genus/Subgenus/SeriesSpeciesSizes of mitogenome (kb)/nuclear genome (Mb)Number/total length (bp) of NUMTsNUMT density (bp/Mb nuclear genome)
Anophelinae
Anopheles/Cellia/NeomyzomyiaAn. farautia15.412/175.522/2081.19
An. dirus15.404/209.790/00.00
Anopheles/Cellia/NeocelliaAn. stephensi15.387/216.260/00.00
An. maculatusb14.850/141.200/00.00
Anopheles/Cellia/MyzomyiaAn. minimus15.395/195.701/430.22
An. culicifacies15.364/198.030/00.00
Anopheles/Cellia/PyretophorusAn. christyib14.967/169.041/3091.83
An. epiroticus15.379/216.830/00.00
An. melas15.366/222.010/00.00
An. merus15.365/244.340/00.00
An. coluzzii15.441/218.220/00.00
An. arabiensis15.369/239.130/00.00
An. gambiae15.363/268.440/00.00
Anopheles/Anopheles/MyzorhynchusAn. sinensis15.418/194.491/2161.11
Anopheles/Anopheles/AnophelesAn. atroparvus15.458/217.573/1700.78
Anopheles/Nyssorhynchus/ArgyritarsisAn. darlingi15.386/132.941/1220.92
Culicinae
Culex/Culex/Cx. quinquefasciatus15.587/574.5713/28,43151.92
Aedes/Stegomyia/Ae. aegypti16.655/1,342.21122/92,93469.24
Ae. albopictus16.665/1,868.07196/60,32232.29

a Lack of partial control region.

b Lack of control region.

a Lack of partial control region. b Lack of control region. The NUMT lengths in the six Anophelinae species range from 43 bp to 309 bp, while lengths in the three Culicinae species range from 37 bp to 15,580 bp. If lengths are divided into three classes (large-size (>2 kb), medium-size (200 bp to 2 kb) and small-size (< 200 bp), it is seen that the large-sized NUMTs only exist in the three Culicinae species, with the largest NUMT (15,580 bp) existing in Cx. quinquefasciatus (S2 Table). This suggests that larger genomes can house longer NUMTs. If we assemble the NUMTs, the numbers of large-sized, medium-sized and small-sized NUMTs are 12, 157 and 171, respectively. This suggests that half of the NUMTs are small-sized (< 200 bp). Most of the longer NUMTs would be disruptive to shorter NUMTs through nucleotide deletions or insertions in the evolutionary process, which is the probable reason why half of NUMTs are small [26,27]. In the 16 species of Anophelinae, the total length of NUMTs in each species ranges from 0 bp to 309 bp, and the density ranges from 0 bp/Mb to 1.83 bp/Mb of the nuclear genome. The average total length and density are 66.75 bp and 0.38 bp/Mb, respectively (Table 3 and Fig 3). In the three species of Culicinae, the total length of NUMTs in each species range from 28,431 bp to 92,934 bp, and the density range from 32.29 bp/Mb to 69.24 bp/Mb, with the average total length and density being 60,562.33 bp and 51.15 bp/Mb, respectively (Table 3 and Fig 3). The Culicinae species have greater length and density of NUMTs than the Anophelinae species. Statistical analyses showed that the total length of NUMTs is significantly correlated with genome size (R = 0.9104, p = 6.31E-08) in all 16 species investigated, which is consistent with the NUMT investigation of 85 species [57]. In addition, the density of NUMTs is significantly correlated with genome size in these mosquitoes (R = 0.7667, p = 1.29E-04). The total length and density of NUMTs are also significantly correlated (R = 0.9219, p = 2.04E-08). These results suggest that NUMTs could contribute to the expansion of genome size.
Fig 3

Length (bp) and density (bp/Mb) of nuclear mitochondrial DNA segments (NUMTs) in nine mosquito species.

ASI: An. sinensis, ACH: An. christyi, ADA: An. darlingi, AFA: Anopheles farauti, AMI: An. minimus, AAT: An. atroparvus, CQU: Cx. quinquefasciatus, AAE: Ae. aegypti, AAL: Ae. albopictus.

Length (bp) and density (bp/Mb) of nuclear mitochondrial DNA segments (NUMTs) in nine mosquito species.

ASI: An. sinensis, ACH: An. christyi, ADA: An. darlingi, AFA: Anopheles farauti, AMI: An. minimus, AAT: An. atroparvus, CQU: Cx. quinquefasciatus, AAE: Ae. aegypti, AAL: Ae. albopictus. In the genetics of other insects, the total length and density of NUMTs in the Drosophila melanogaster (genome ~170 Mb), Tribolium castaneum (~150 Mb) and Apis mellifera (~230 Mb) are 777 bp and 4.57 bp/Mb [58], 8,821 bp and 58.81 bp/Mb [40], and 275,022 bp and 1195.75 bp/Mb [40], respectively. The genome sizes in these three species are comparable with the Anophelinae species; however, the total length and density of NUMTs of the three species are larger than those in the Anophelinae species, especially for T. castaneum and A. mellifera. The genome size expansion due to mitochondrial insertion segments is variable in different insect groups. The A. mellifera nuclear genome recombination rate is much greater than that of D. melanogaster [59], suggesting that the NUMT total length and density in the nuclear genome is related to the genome recombination rate. The largest number of NUMTs in the genomes of the nine mosquito species originated from the COI (39 NUMTs) gene, followed by the 16SrRNA (31), CytB (27) and ND5 (22) genes. The least number of NUMTs are from the CR (6) (Fig 4). In the six Anophelinae species, NUMTs were derived from only seven mitochondrial genes, including three COII in An. atroparvus, two COIII in An. farauti, one ND4 in An. sinensis, one ND5 in An. christyi, one 12SrRNA in An. minimus and one tRNA-tRNA in An. darlingi. In the three Culicinae species, NUMTs in Ae. aegypti and Cx. quinquefasciatus cover the whole mt genome. Similarly, NUMTs in Ae. albopictus also cover the whole mt genome, with the exception of ND4, 12SrRNA and CR. These data suggest that the PCGs are transferred to the nuclear genome at a higher frequency, and the larger-sized nuclear genomes in mosquitoes have larger-sized mt genome sequences.
Fig 4

NUMT distribution position in nine mosquito species of mt genomes.

The outermost cycle denotes the structure of the reference mt genome, and the second cycle from outside marks the starting positions of the NUMTs for Ae. albopictus (orange), Ae. aegypti (green), Cx. quinquefasciatus (blue) and six Anopheles species as a group (red). The four cycles entad show the NUMT regions of the three and six Anopheles species as a group, respectively.

NUMT distribution position in nine mosquito species of mt genomes.

The outermost cycle denotes the structure of the reference mt genome, and the second cycle from outside marks the starting positions of the NUMTs for Ae. albopictus (orange), Ae. aegypti (green), Cx. quinquefasciatus (blue) and six Anopheles species as a group (red). The four cycles entad show the NUMT regions of the three and six Anopheles species as a group, respectively. The largest numbers of NUMTs were derived from the 16SrRNA gene in three mammals, Sus scrofa (6), Pan troglodytes (65) and Homo sapiens (53). In Mus musculus, the largest number of NUMTs originated from CR (11), followed by ND2 (7) and ND4 (6), but no NUMT was found from tRNA [60]. This suggests that the NUMT origination may be different in mosquitoes and mammals. The genome sizes of M. musculus, S. scrofa, P. troglodytes and H. sapiens are 2.7 G, 2.3 G, 2.9 G and 2.9 G, and the mt genome coverage rates of NUMT are 84.7%, 32.6%, 100% and 100%, respectively [60]. The larger-sized nuclear genomes in mammals appear to have a wider range of mt genome sequence coverage rates.

Conclusion

We studied mitochondrial genes in An. sinensis through analysis of the complete mt genome sequence. NUMT analysis of nineteen mosquito species led to the conclusion that the number, total length and density of NUMTs are significantly correlated with genome size. NUMTs are an important cause of nuclear genome size expansion in mosquitoes. The genome size expansion due to mitochondrial insertion segments is variable in different insect groups. PCGs are transferred to the nuclear genome at a higher frequency in mosquitoes, but the NUMT origination is quite different from mammals. Larger-sized nuclear genomes, in both mosquitoes and mammals, have a wider range of transferred mt genome sequences.

The percentage of each amino acid in the An. sinensis mt genome.

N-strand: majority strand; J-strand: minority strand. (TIF) Click here for additional data file.

Inferred secondary structures of tRNAs in the An. sinensis mt genome.

The tRNAs are labeled with their corresponding amino acids. (TIF) Click here for additional data file.

Mt genome and nuclear genome sequence information of 19 mosquito species.

(DOC) Click here for additional data file.

Positions and lengths of NUMTs in the nuclear genomes of nine species.

(DOC) Click here for additional data file.
  57 in total

1.  Mitochondrial pseudogenes: evolution's misplaced witnesses.

Authors:  D Bensasson; D -X. Zhang; D L. Hartl; G M. Hewitt
Journal:  Trends Ecol Evol       Date:  2001-06-01       Impact factor: 17.712

2.  Many species in one: DNA barcoding overestimates the number of species when nuclear mitochondrial pseudogenes are coamplified.

Authors:  Hojun Song; Jennifer E Buhay; Michael F Whiting; Keith A Crandall
Journal:  Proc Natl Acad Sci U S A       Date:  2008-08-29       Impact factor: 11.205

3.  Nuclear integrations: challenges for mitochondrial DNA markers.

Authors:  D X Zhang; G M Hewitt
Journal:  Trends Ecol Evol       Date:  1996-06       Impact factor: 17.712

4.  Numerous transposed sequences of mitochondrial cytochrome oxidase I-II in aphids of the genus Sitobion (Hemiptera: Aphididae).

Authors:  P Sunnucks; D F Hales
Journal:  Mol Biol Evol       Date:  1996-03       Impact factor: 16.240

5.  The complete mitochondrial DNA genome of Aedes vigilax (Diptera: Culicidae).

Authors:  C M Hardy; L N Court; M J Morgan
Journal:  Mitochondrial DNA A DNA Mapp Seq Anal       Date:  2015-06-23       Impact factor: 1.514

6.  Complete sequences of mitochondria genomes of Aedes aegypti and Culex quinquefasciatus and comparative analysis of mitochondrial DNA fragments inserted in the nuclear genomes.

Authors:  Susanta K Behura; Neil F Lobo; Brian Haas; Becky deBruyn; Diane D Lovin; Martin F Shumway; Daniela Puiu; Jeanne Romero-Severson; Vishvanath Nene; David W Severson
Journal:  Insect Biochem Mol Biol       Date:  2011-05-27       Impact factor: 4.714

7.  Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes.

Authors:  N T Perna; T D Kocher
Journal:  J Mol Evol       Date:  1995-09       Impact factor: 2.395

8.  Sequencing and analysis of the complete mitochondrial genome in Anopheles culicifacies species B (Diptera: Culicidae).

Authors:  Ya-Qiong Hua; Zhen-Tian Yan; Wen-Bo Fu; Qi-Yi He; Yong Zhou; Bin Chen
Journal:  Mitochondrial DNA A DNA Mapp Seq Anal       Date:  2015-06-26       Impact factor: 1.514

9.  First Mitochondrial Genome from Nemouridae (Plecoptera) Reveals Novel Features of the Elongated Control Region and Phylogenetic Implications.

Authors:  Zhi-Teng Chen; Yu-Zhou Du
Journal:  Int J Mol Sci       Date:  2017-05-05       Impact factor: 5.923

10.  De novo transcriptome sequencing and sequence analysis of the malaria vector Anopheles sinensis (Diptera: Culicidae).

Authors:  Bin Chen; Yu-Juan Zhang; Zhengbo He; Wanshun Li; Fengling Si; Yao Tang; Qiyi He; Liang Qiao; Zhentian Yan; Wenbo Fu; Yanfei Che
Journal:  Parasit Vectors       Date:  2014-07-07       Impact factor: 3.876

View more
  2 in total

1.  Novel Mitochondrial DNA Lineage Found among Ochlerotatus communis (De Geer, 1776) of the Nordic-Baltic Region.

Authors:  Heli Kirik; Lea Tummeleht; Tobias Lilja; Olavi Kurina
Journal:  Insects       Date:  2020-06-26       Impact factor: 2.769

2.  Mitochondrial genome sequencing and phylogeny of Haemagogus albomaculatus, Haemagogus leucocelaenus, Haemagogus spegazzinii, and Haemagogus tropicalis (Diptera: Culicidae).

Authors:  Fábio Silva da Silva; Ana Cecília Ribeiro Cruz; Daniele Barbosa de Almeida Medeiros; Sandro Patroca da Silva; Márcio Roberto Teixeira Nunes; Lívia Carício Martins; Jannifer Oliveira Chiang; Poliana da Silva Lemos; Gabriel Muricy Cunha; Renato Freitas de Araujo; Hamilton Antônio de Oliveira Monteiro; Joaquim Pinto Nunes Neto
Journal:  Sci Rep       Date:  2020-10-12       Impact factor: 4.379

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.