| Literature DB >> 29718503 |
Yao Tan1, Bing Jia1, Yuan-Ming Chi1, Hai-Bin Han2, Xiao-Rong Zhou1, Bao-Ping Pang1.
Abstract
Lygus pratensis is a phytophagous pest responsible for yield losses in Bt alfalfa and other economic crops in Northwestern China. To better characterize Miridae at the genomic level, the complete mitochondrial (mt) genome of L. pratensis was sequenced and analyzed in this study. The mt genome was amplified via the polymerase chain reaction to generate overlapping fragments. These fragments were then sequenced, spliced, and analyzed to include the examination of nucleotide composition, codon usage, compositional biases, protein-coding genes (PCGs), and RNA secondary structures. Phylogenetic relationships between L. pratensis and other species in different Heteroptera families were also examined. The mt genome was found to be a typical circular genome with a length of 16,591 bp and a total AT content of 75.1%, encoded for 13 PCGs, 22 transfer RNAs (tRNAs), 2 ribosomal RNAs (lrRNA and srRNA), and a noncoding control region. The nucleotide composition of the entire mt genome was heavily biased toward A and T. All of the tRNAs were predicted to have classic clover leaf structures, but three of the tRNAs (tRNAAsn, tRNAHis, tRNAHis) were missing the TΨC loop. The control region (2,017 bp), which was found to be located between 12S and tRNAIle, contained three tandem repeat elements. Phylogenetic analyses showed that L. pratensis is closely related to the other three examined Lygus bugs, and that it is a sister group to Apolygus and Adelphocoris. This study confirms the usability of the mt genome in phylogenesis studies pertaining to the Lygus genus, within Miridae.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29718503 PMCID: PMC5893962 DOI: 10.1093/jisesa/iey035
Source DB: PubMed Journal: J Insect Sci ISSN: 1536-2442 Impact factor: 1.857
Primers used for amplifying the complete mt genome
| Fragment | Primer name | Nucleotide sequences (5′→3′) | Sequence size (bp) | Starting site–stop site |
|---|---|---|---|---|
| 1 | LLF1 | ACTAAATCCGTAGAGATC | 973 | 82–1055 |
| LLR1 | CGAAAGTGTCACTATTGATG | |||
| 2 | LLF2 | GAATAACAGAAAAGTTATC | 1,069 | 891–1960 |
| LLR2 | CGGTGATTCCCACCGATC | |||
| 3 | LLF3 | CATAATGGAGCATCAGTAG | 1,108 | 1796–2904 |
| LLR3 | CAGTATATGAATGTTCAGC | |||
| 4 | LLF4 | GATTACCCTGATAGATTC | 1,300 | 2708–4008 |
| LLR4 | ATGCTTGTTGCTGGGTCG | |||
| 5 | LLF5 | CCCAATTTGATGAACAAC | 1,171 | 3823–4994 |
| LLR5 | TCCTTTAGGTGGTCATAG | |||
| 6 | LLF6 | GATGACGAGATATCGTACG | 1,051 | 4803–5854 |
| LLR6 | GTTATTGTTAACTACATCC | |||
| 7 | LLF7 | TTCTAATCTTCGATATCG | 1,242 | 5670–6911 |
| LLR7 | TACCTTATACATCTTCTTG | |||
| 8 | LLF8 | CTCACAGTGAATATGACAG | 970 | 6746–7716 |
| LLR8 | TGCTGATTTCTTCAATGG | |||
| 9 | LLF9 | TTCGGTTAGTCATAACAG | 1,494 | 7470–8964 |
| LLR9 | GAATGTAGATTAATTCCTAC | |||
| 10 | LLF10 | TGCCATCAACATTGAAAG | 1,261 | 8754–10015 |
| LLR10 | GTAAGTCTTATCAATATCC | |||
| 11 | LLF11 | CAAACCCTACTTACATCAG | 1,297 | 9833–11130 |
| LLR11 | AATTGATGCAATCATGGC | |||
| 12 | LLF12 | GAATCAAGGATTATATAGG | 1,322 | 10909–12231 |
| LLR12 | TGCAACCATTTTCTGATGG | |||
| 13 | LLR13 | AGTCTGAGCAACAGCACG | 1,091 | 11950–13041 |
| LLF13 | GATCTAACCTGCCCTATG | |||
| 14 | LLR14 | GCTCTATAGGGTCTTCTCG | 980 | 12842–13822 |
| LLF14 | GTCGCTCCCAACATATGTTG | |||
| 15 | LLR15 | CTGATACAAAAGGTAAGC | 994 | 13515–14509 |
| LLF15 | GTTTTATTCATGCATATGG | |||
| 16 | LLR16 | GTATAACCGCAACTGCTGG | 2,530 | 14328–1659181 |
| LLR16 | CTAATAACAATAATGGTTC |
Fig. 1.Structure of the Lygus pratensis mt genome. The color blocks contain tRNAs that are labeled according to IUPAC-IUB single-letter amino acid codes. Genes that are underlined undergo transcription from left to right, while those without underlines proceed from right to left.
Organization of the Lygus pratensis mt genome
| Gene | Direction | Strand | Position (bp) | Anticodon | Size (bp) | Start codon | Stop codon | Intergenic nucleotide (bp) |
|---|---|---|---|---|---|---|---|---|
| tRNA-Ile (I) | F | J | 1–65 | GAT | 65 | – | – | 0 |
| tRNA-Gln (Q) | R | N | 63–131 | TTG | 69 | – | – | −3 |
| tRNA-Met (M) | F | J | 132–199 | CAT | 68 | – | – | 0 |
| ND2 | F | J | 200–1,207 | – | 1,008 | ATT | TAA | 0 |
| tRNA-Trp (W) | F | J | 1,206–1,270 | TCA | 65 | – | – | −2 |
| tRNA-Cys (C) | R | N | 1,262–1,325 | GCA | 63 | – | – | −8 |
| tRNA-Tyr (Y) | R | N | 1,325–1,391 | GTA | 66 | – | – | 0 |
| COI | F | J | 1,392–2,925 | – | 1,534 | ATG | T- | 0 |
| tRNA-Leu (L) | F | J | 2,926–2,991 | TAA | 66 | – | – | 0 |
| COII | F | J | 2,992–3,672 | – | 681 | ATC | TAA | 0 |
| tRNA-Lys (K) | F | J | 3,673–3,743 | CTT | 70 | – | – | 0 |
| tRNA-Asp (D) | F | J | 3,745–3,810 | GTC | 65 | – | – | 2 |
| ATPase8 | F | J | 3,809–3,971 | – | 162 | ATT | TAG | 0 |
| ATPase6 | F | J | 3,965–4,648 | – | 684 | ATG | TAA | −7 |
| COIII | F | J | 4,634–5,418 | – | 784 | ATG | T- | −14 |
| tRNA-Gly (G) | F | J | 5,419–5,481 | TCC | 62 | – | – | 0 |
| ND3 | F | J | 5,482–5,832 | – | 352 | ATA | T- | 0 |
| tRNA-Ala (A) | F | J | 5,834–5,895 | TGC | 62 | – | – | 1 |
| tRNA-Arg (R) | F | J | 5,899–5,965 | TCG | 67 | – | – | 3 |
| tRNA-Asn (N) | F | J | 5,964–6,028 | GTT | 65 | – | – | −2 |
| tRNA-Ser (S) | F | J | 6,028–6,096 | GCT | 69 | – | – | −1 |
| tRNA-Glu (E) | F | J | 6,096–6,159 | TTC | 64 | – | – | −1 |
| tRNA-Phe (F) | R | N | 6,158–6,219 | GAA | 62 | – | – | −2 |
| ND5 | R | N | 6,220–7,918 | – | 1699 | ATT | TAA | 0 |
| tRNA-His (H) | R | N | 7,919–7,983 | GTG | 64 | – | – | 0 |
| ND4 | R | N | 7,984–9,311 | – | 1,327 | ATG | TAA | 0 |
| ND4L | R | N | 9,303–9,608 | – | 306 | ATT | TAA | −7 |
| tRNA-Thr (T) | F | J | 9,611–9,678 | TGT | 68 | – | – | 2 |
| tRNA-Pro (P) | R | N | 9,679–9,742 | TGG | 64 | – | – | 0 |
| ND6 | F | J | 9,744–10,232 | – | 489 | ATT | TAA | 1 |
| CytB | F | J | 10,234–11,367 | – | 1134 | ATG | TAA | 1 |
| tRNA-Ser (S) | F | J | 11,368–11,433 | TGA | 67 | – | – | −1 |
| ND1 | R | N | 11,441–12,367 | – | 927 | ATT | TAA | 7 |
| tRNA-Leu (L) | R | N | 12,368–12,431 | TAG | 64 | – | – | 0 |
| 16S-rRNA (lrRNA) | R | N | 12,432–13,681 | – | 1250 | – | – | 0 |
| tRNA-Val (V) | R | N | 13,682–13,750 | TAC | 69 | – | – | 0 |
| 12S-rRNA (srRNA) | R | N | 13,751–14,575 | – | 824 | – | – | 0 |
| Control region | – | – | 14,576–16,591 | – | 2,017 | – | – | 0 |
Lygus pratensis mt genome nucleotide composition and skew
| Feature | %A | %C | T% | G% | A+T% | AT | GC | No. of nucleotide |
|---|---|---|---|---|---|---|---|---|
| Whole genome | 42.7 | 14.6 | 32.4 | 10.3 | 75.1 | 0.14 | −0.17 | 16591 |
| PCGs | 43.8 | 10.4 | 30.9 | 14.9 | 74.7 | 0.17 | 0.18 | 11087 |
| PCGs (J) | 39.2 | 15.2 | 33.8 | 11.8 | 73.0 | 0.07 | −0.13 | 6828 |
| PCGs (N) | 43.6 | 7.0 | 22.4 | 12.5 | 66.0 | 0.32 | 0.28 | 4979 |
| tRNA genes | 41.3 | 10.1 | 36.2 | 12.4 | 77.5 | 0.07 | 0.10 | 1427 |
| tRNA genes (J) | 41.1 | 10.8 | 36.5 | 11.6 | 77.6 | 0.06 | 0.10 | 919 |
| tRNA genes (N) | 41.7 | 15.2 | 35.9 | 7.3 | 77.6 | 0.07 | −0.35 | 521 |
| rRNA genes | 44.2 | 12.8 | 34.9 | 8.1 | 79.1 | 0.12 | −0.22 | 2074 |
| Control region | 36.8 | 16.2 | 35.4 | 11.7 | 72.2 | 0.02 | −0.16 | 2017 |
AT skew = (A−T)/(A+T).
GC skew = (G−C)/(G+C).
Fig. 2.Relative synonymous codon usage (RSCU) within the Lygus pratensis mt genome.
Fig. 3.Predicted tRNA secondary structures. Lines (-) indicate Watson–Crick base pairing, with unmatched base pairs indicated by red dots (● ).
Fig. 4.Lygus pratensis control region organization. The locations and copy numbers of tandem repeats are indicated with colored ovals, while the remaining sequences are depicted with gray rectangles.
Fig. 5.Phylogenetic tree construction inferred from 14 PCGs from 11 Miridae species and 3 out-group Miridae. Numbers close to the nodes are bootstrap support values from the NJ analysis, ML analysis, and Bayesian posterior probability analyses. Miridae species ●; Anthocoridae species ●; Reduviidae species ● ; Nabidae species ● ; Lygus pratensis ●.