| Literature DB >> 27408534 |
Jun-Hong Lin1, Wei-Chuan Zhou2, Hong-Li Ding1, Pei Wang2, Hong-Mu Ai1.
Abstract
The land snail Cernuella virgata (da Costa, 1778) is widely considered as a pest to be quarantined in most countries. In this study, the complete mitochondrial genome of Cernuella virgata is published. The mitochondrial genome has a length of 14,147 bp a DNA base composition of 29.07% A, 36.88% T, 15.59% C and 18.46% G, encoding 13 protein-coding genes (PCGs), 22 transfer RNA (tRNA) genes and two ribosomal RNA (rRNA) genes. The complete nucleotide composition was biased toward adenine and thymine, A+T accounting for 69.80%. Nine PCGs and 14 tRNA genes are encoded on the J strand, and the other four PCGs and eight tRNA genes are encoded on the N strand. The genome also includes 16 intergenic spacers. All PCGs start strictly with ATN, and have conventional stop codons (TAA and TAG). All tRNAs fold into the classic cloverleaf structure, except tRNA(Arg) , tRNA(Ser(UCN)) , tRNA(Ser(AGN)) and tRNA(Pro) . The first three lack the dihydrouridine arm while the last lacks the TψC arm. There are 502 bp long noncoding regions and 418bp long gene overlaps in the whole mitochondrial genome, accounting for 3.54% and 2.95% of the total length respectively. Phylogenetic analyses based on the sequences of the protein coding genes revealed a sister group relationship between the Hygromiidae and the Helicidae.Entities:
Keywords: DNA sequencing; phylogeny; plant quarantine; secondary structure; white snail
Year: 2016 PMID: 27408534 PMCID: PMC4926662 DOI: 10.3897/zookeys.589.7637
Source DB: PubMed Journal: Zookeys ISSN: 1313-2970 Impact factor: 1.546
Primer pairs used for PCR amplification.
| No. of fragment | Primer name | Nucleotide sequence (5’ – 3’) and location | Size (bp) | Reference |
|---|---|---|---|---|
| 1 | LCO-1490 | GGTCAACAAATCATAAAGATATTGG |
| |
| HCO-2198 | TAAACTTCAGGGTGACCAAAAAATCA |
| ||
| 2 | F1231 | GAACGGGTTAGTTTGTTTGTCT(490–511) | 1763 | Present study |
| R1231 | TAGGGTCTTCTCGTCTATTATGGT(2229–2252) | Present study | ||
| 3 | 16Sar-L | CGCCTGTTTATCAAAAACAT |
| |
| 16Sbr-H | CCGGTCTGAACTCAGATCACGT |
| ||
| 4 | 123F116 | TGTAACCATAATAGACGAGAAGACC(2225–2249) | 4545 | Present study |
| 123R1b | TAGGAGCAAAAAATACTACCAGAAA(6745–6769) | Present study | ||
| 5 | 144F | TGAGSNCARATGTCNTWYTG |
| |
| 272R | GCRAANAGRAARTACCAYTC |
| ||
| 6 | 123Fb | CTTTTCACCCCTACTTTAC(6683–6701) | 1044 | Present study |
| 123RII | ACTCCCTTTCAGGTGTTAT(7708–7726) | Present study | ||
| 7 | FCOII | AAATAATGCTATTTCATGAYCAYG |
| |
| RCOII | GCTCCGCAAATCTCTGARCAYTG |
| ||
| 8 | F1233 | AGTTACATTGGCCCTCCCTAGTCTTCGC(7560–7587) | 6930 | Present study |
| R1233 | GTAAACGGTTCAACCTGTACCAGCTCCC(315–342) | Present study |
Summary of samples used in this study.
| Subclass /order | Family | Species | Accession number | Reference |
|---|---|---|---|---|
|
|
|
|
| Present study |
|
|
|
|
| |
|
|
| Ding et al. 2015 (submitted) | ||
|
|
|
|
| |
|
|
| Deng et al. 2014 | ||
|
|
| Huang et al. 2015 | ||
|
|
| Huang et al. 2015 | ||
|
|
|
|
| |
|
|
|
| ||
|
|
|
| ||
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| |
|
|
|
|
| Liu et al. 2012 |
|
|
|
|
| Knudsen et al. 2006 |
Figure 1.The mt genome of . The tRNA genes are labeled based on the IUPACIUB single letter amino acid codes. Genes with underline illustrate the direction of transcription from 3’ to 5’, and without underline revealing from 5’ to 3’. Numbers and overlapping lines within the circle indicate PCR fragments amplified for sequencing (see Table 1).
Organization of the mt genome.
| Gene | Direction | Location | Size (bp) | Anticodon | Start codon | Stop codon | Intergenic nucleotides |
|---|---|---|---|---|---|---|---|
|
| F | 1–1497 | 1497 | ATT | TAA | 26 | |
|
| F | 1494–1554 | 61 | 1524–1526 TAC | –4 | ||
|
| F | 1555–2567 | 1013 | 0 | |||
|
| F | 2568–2628 | 61 | 2597–2599 TAG | 0 | ||
|
| F | 2629–2685 | 57 | 2655–2657 TGG | 0 | ||
|
| F | 2687–2748 | 62 | 2718–2720 TGC | 1 | ||
|
| F | 2767–3222 | 456 | ATA | TAA | 18 | |
|
| F | 3227–4888 | 1662 | ATT | TAA | 4 | |
|
| F | 4804–5769 | 966 | ATG | TAG | –85 | |
|
| F | 5847–6215 | 369 | ATT | TAA | 77 | |
|
| F | 6151–7167 | 1017 | ATA | TAG | –65 | |
|
| F | 7157–7214 | 58 | 7188–7190 GTC | –11 | ||
|
| F | 7215–7276 | 62 | 7245–7247 GCA | 0 | ||
|
| F | 7283–7341 | 59 | 7313–7315 GAA | 6 | ||
|
| F | 7387–8031 | 645 | ATT | TAA | 45 | |
|
| F | 8015–8083 | 69 | 8046–8048 GTA | –17 | ||
|
| F | 8071–8132 | 62 | 8102–8104 TCA | –13 | ||
|
| F | 8282–8341 | 60 | 8311–8313 TCC | 149 | ||
|
| F | 8338–8398 | 61 | 8369–8371 GTG | –4 | ||
|
| R | 8400–8457 | 58 | 8427–8429 TTG | 1 | ||
|
| R | 8457–8513 | 57 | 8485–8487 TAA | –1 | ||
|
| R | 8485–8754 | 270 | ATG | TAA | –29 | |
|
| R | 8743–8804 | 62 | 8771–8773 GTT | –12 | ||
|
| R | 8807–9472 | 666 | ATG | TAA | 2 | |
|
| R | 9458–9517 | 60 | 9489–9491 TCG | –15 | ||
|
| R | 9518–9578 | 61 | 9547–9549 TTC | 0 | ||
|
| R | 9579–10277 | 699 | 0 | |||
|
| R | 10278–10343 | 66 | 10306–10308 CAT | 0 | ||
|
| R | 10304–10735 | 432 | ATA | TAA | –40 | |
|
| R | 10691–10743 | 53 | 10723–10725 TGA | –45 | ||
|
| F | 10820–10880 | 61 | 10844–10846 GCT | 76 | ||
|
| F | 10904–12178 | 1275 | ATT | TAG | 23 | |
|
| R | 12182–12246 | 65 | 12210–12212 TGT | 3 | ||
|
| R | 12170–13051 | 882 | ATG | TAA | –77 | |
|
| F | 13068–13127 | 60 | 13096–13098 GAT | 16 | ||
|
| F | 13182–14060 | 879 | ATA | TAG | 54 | |
|
| F | 14062–14121 | 60 | 14090–14092 TTT | 1 |
Note: Negative numbers indicate adjacent gene overlap.
Nucleotide composition and skewness of the mt genome.
| Proportion of nucleotides | ||||||||
|---|---|---|---|---|---|---|---|---|
| Feature | %A | %T | %G | %C | %A+T | AT Skew | GC Skew | No. of nucleotides |
| Whole genome | 29.07 | 36.88 | 18.46 | 15.59 | 69.80 | –0.12 | 0.08 | 14147 |
| Protein coding genes | 26.39 | 39.31 | 18.43 | 15.87 | 69.26 | –0.20 | 0.07 | 10977 |
| Protein coding genes (J) | 26.08 | 39.96 | 18.70 | 15.26 | 69.17 | –0.21 | 0.10 | 8739 |
| Protein coding genes (N) | 27.61 | 36.77 | 17.38 | 18.23 | 69.67 | –0.14 | –0.02 | 2034 |
|
| 31.46 | 34.23 | 18.73 | 15.58 | 71.41 | –0.04 | 0.09 | 1335 |
|
| 29.82 | 34.77 | 20.30 | 15.10 | 70.77 | –0.08 | 0.15 | 788 |
|
| 33.82 | 33.46 | 16.45 | 16.27 | 72.54 | 0.01 | 0.01 | 547 |
|
| 32.83 | 35.63 | 17.00 | 14.54 | 72.42 | –0.04 | 0.08 | 1712 |
Figure 2.in the mt genome. Codon families are provided on the x axis.
Relative synonymous codon usage
Figure 3.Inferred secondary structures of 22 tRNA genes in . Dashes (-) indicate Watson-Crick base pairing and bullets (•) indicate G-U base pairing.
Figure 4.Phylogenetic tree inferred by method based on 13 protein genes. The tree is rooted with and . Numbers on the nodes represent bootstrap values.
maximum likelihood