| Literature DB >> 25493046 |
Pei Wang1, Hai-Fan Yang2, Wei-Chuan Zhou1, Chung-Chi Hwang3, Wei-Hong Zhang4, Zhou-Xing Qian5.
Abstract
The complete mitochondrial (mt) genome of the snail Camaenacicatricosa (Müller, 1774) has been sequenced and annotated in this study. The entire circular genome is 13,843 bp in size and represents the first camaenid mt genome, with content of 31.9%A, 37.9%T, 13.5%C and 16.7%G. Gene content, codon usage and base organization show similarity to a great extent to the sequenced mt genome from Stylommatophora, whereas, gene order is different from them, especially the positions of tRNA(Cys) , tRNA(Phe) , COII, tRNA(Asp) , tRNA(Gly) , tRNA(His) and tRNA(Trp) . All protein coding genes use standard initiation codons ATN except for COII with GTG as start signal. Conventional stop codons TAA and TAG have been assigned to all protein coding genes. All tRNA genes possess the typical clover leaf structure, but the TψC arm of tRNA(Asp) and dihydrouridine arm of tRNA(Ser(AGN)) only form a simple loop. Shorter intergenic spacers have been found in this mt genome. Phylogenetic study based on protein coding genes shows close relationship of Camaenidae and Bradybaenidae. The presented phylogeny is consistent with the monophyly of Stylommatophora.Entities:
Keywords: Camaenacicatricosa; Camaenidae; Stylommatophora; mitochondrial genome; secondary structure
Year: 2014 PMID: 25493046 PMCID: PMC4258619 DOI: 10.3897/zookeys.451.8537
Source DB: PubMed Journal: Zookeys ISSN: 1313-2970 Impact factor: 1.546
Primer pairs used for PCR amplification.
| No. of fragment | Primer name | Nucleotide sequence (5’–3’) and location | Size (bp) | Reference |
|---|---|---|---|---|
| 1 | LCO-1490 | GGTCAACAAATCATAAAGATATTGG | ||
| HCO-2198 | TAAACTTCAGGGTGACCAAAAAATCA | |||
| 2 | Fcoi | TGAACTGTTTATCCTCCAC (364–382) | 1908 | Present study |
| RL | TAGGGTCTTCTCGTCTTT (2254–2271) | Present study | ||
| 3 | 16Sar-L | CGCCTGTTTATCAAAAACAT | ||
| 16Sbr-H | CCGGTCTGAACTCAGATCACGT | |||
| 4 | FL2 | CGATGTTGGATTAGGAAGTTGA (2415–2436) | 4267 | Present study |
| Rcb2 | TAAAGGATTTGTTGACCCACG (6661–6681) | Present study | ||
| 5 | 144F | TGAGSNCARATGTCNTWYTG | ||
| 272R | GCRAANAGRAARTACCAYTC | |||
| 6 | Fcb | GTGGGTCAACAAATCCTT (6662–6679) | 816 | Present study |
| Rcoii | ATGAACACCTCGGGTAGT (7460–7477) | Present study | ||
| 7 | FCOII | AAATAATGCTATTTCATGAYCAYG | ||
| RCOII | GCTCCGCAAATCTCTGARCAYTG | |||
| 8 | SF1F | AAATTCCATTAGAGGGGCTTATACGCCGCC (6984–7013) | 6957 | Present study |
| SF1R | CAAGAGATAGTCCCGTACCAACTATGCCGC (68–79) | Present study |
Summary of samples information used in this study.
| Subclass /order | Family | Species | Accession number | Reference |
|---|---|---|---|---|
| Present study | ||||
Figure 1.The mt genome of . The tRNA genes are labeled based on the IUPACIUB single letter amino acid codes. Genes with underline illuminate the direction of transcription from 3’ to 5’, and without underline illuminating from 5’ to 3’. Numbers and overlapping lines within the circle indicate PCR fragments amplified for sequencing (see Table 1).
Figure 2.Gene arrangement of nine mt genomes in the order .
Organization of the mt genome.
| Gene | Direction | Location | Size (bp) | Anticodon | Start codon | Stop codon | Intergenic nucleotides |
|---|---|---|---|---|---|---|---|
| F | 1–1527 | 1527 | ATG | TAG | |||
| F | 1527–1585 | 59 | 1557–1559 TAC | –1 | |||
| F | 1586–2582 | 997 | 0 | ||||
| F | 2583–2642 | 60 | 2611–2613 TAG | 0 | |||
| F | 2640–2702 | 63 | 2669–2671 TGG | –3 | |||
| F | 2705–2764 | 60 | 2735–2737 TGC | 2 | |||
| F | 2784–3251 | 468 | ATA | TAA | 19 | ||
| F | 3202–4893 | 1692 | ATT | TAA | –50 | ||
| F | 4914–5786 | 873 | ATA | TAA | 20 | ||
| F | 5797–6072 | 276 | ATA | TAA | 10 | ||
| F | 6076–7188 | 1113 | ATG | TAA | 3 | ||
| F | 7185–7246 | 62 | 7215–7217 GCA | –4 | |||
| F | 7249–7309 | 61 | 7279–7281 GAA | 2 | |||
| F | 7310–7984 | 675 | GTG | TAG | 0 | ||
| F | 7989–8048 | 60 | 8019–8021 GTC | 4 | |||
| F | 8075–8136 | 62 | 8105–8107 GTA | 26 | |||
| F | 8132–8191 | 60 | 8162–8164 TCC | –5 | |||
| F | 8188–8246 | 59 | 8218–8220 GTG | –4 | |||
| F | 8255–8314 | 60 | 8282–8284 TCA | 8 | |||
| R | 8311–8369 | 59 | 8338–8340 TTG | –4 | |||
| R | 8366–8429 | 64 | 8398–8400 TAA | –4 | |||
| R | 8431–8595 | 165 | ATG | TAA | 1 | ||
| R | 8597–8652 | 56 | 8620–8622 GTT | 1 | |||
| R | 8652–9332 | 681 | ATT | TAA | –1 | ||
| R | 9309–9366 | 58 | 9339–9341 TCG | –24 | |||
| R | 9366–9430 | 65 | 9393–9395 TTC | –1 | |||
| R | 9431–10112 | 682 | 0 | ||||
| R | 10113–10174 | 62 | 10140–10142 CAT | 0 | |||
| R | 10165–10524 | 360 | ATA | TAA | –10 | ||
| R | 10517–10569 | 53 | 10548–10550 TGA | –8 | |||
| F | 10570–10629 | 61 | 10594–10596 GCT | 0 | |||
| F | 10648–11988 | 1341 | ATA | TAA | 18 | ||
| R | 11940–11999 | 60 | 11967–11969 TGT | –49 | |||
| R | 11965–12792 | 828 | ATT | TAA | –35 | ||
| F | 12822–12885 | 64 | 12852–12854 GAT | 29 | |||
| F | 12887–13828 | 942 | ATG | TAA | 1 | ||
| F | 13790–13843 | 54 | 13819–13821 TTT | –39 |
Note: Negative numbers indicate adjacent gene overlap.
Nucleotide composition and skew of the mt genome.
| Proportion of nucleotides | ||||||||
|---|---|---|---|---|---|---|---|---|
| Feature | %A | %T | %G | %C | %A+T | AT Skew | GC Skew | No. of nucleotides |
| Whole genome | 31.90 | 37.90 | 16.70 | 13.50 | 69.80 | –0.09 | 0.11 | 13843 |
| Protein coding genes | 31.18 | 38.14 | 17.05 | 13.64 | 69.32 | –0.10 | 0.11 | 10941 |
| Protein coding genes (J) | 28.83 | 40.41 | 17.54 | 13.23 | 69.24 | –0.17 | 0.14 | 8907 |
| Protein coding genes (N) | 28.22 | 41.45 | 15.44 | 14.90 | 69.67 | –0.19 | 0.02 | 2034 |
| tRNA genes | 34.95 | 36.46 | 15.81 | 12.78 | 71.41 | –0.02 | 0.11 | 1322 |
| tRNA genes (J) | 33.96 | 36.80 | 17.51 | 11.72 | 70.77 | –0.04 | 0.20 | 845 |
| tRNA genes (N) | 35.85 | 36.69 | 14.68 | 12.79 | 72.54 | –0.01 | 0.07 | 477 |
| rRNA genes | 35.14 | 37.28 | 14.83 | 12.75 | 72.42 | –0.03 | 0.08 | 1679 |
Figure 3.Inferred secondary structures of 22 tRNA genes in . Dashed (-) indicates Watson-Crick base pairing and (•) indicates G-U base pairing.
Figure 4.Relative synonymous codon usage (RSCU) in the mt genome. Codon families are provided on the x axis.
Figure 5.Phylogenetic tree inferred by maximum likelihood (ML) and maximum parsimony (MP) methods based on 13 protein genes. The tree is rooted with . Numbers on or under the nodes represent bootstrap values of MP and ML respectively.