| Literature DB >> 32680454 |
Ze Chen1, Yibo Xuan1,2, Guangcai Liang3, Xiaolong Yang1, Zhijun Yu1, Stephen C Barker4, Samuel Kelava4, Wenjun Bu2, Jingze Liu5, Shan Gao6,7.
Abstract
BACKGROUND: In the present study, we used long-PCR amplification coupled with Next-Generation Sequencing (NGS) to obtain complete mitochondrial (mt) genomes of individual ticks and unprecedently performed precise annotation of these mt genomes. We aimed to: (1) develop a simple, cost-effective and accurate method for the study of extremely high AT-content mt genomes within an individual animal (e.g. Dermacentor silvarum) containing miniscule DNA; (2) provide a high-quality reference genome for D. silvarum with precise annotation and also for future studies of other tick mt genomes; and (3) detect and analyze mt DNA variation within an individual tick.Entities:
Keywords: Mitochondrial DNA; Precise annotation; Short tandem repeat; Tick; Transposon
Mesh:
Year: 2020 PMID: 32680454 PMCID: PMC7367389 DOI: 10.1186/s12864-020-06906-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Long-PCR amplification of each entire mt genome. All the primers and PCR reaction conditions are listed in Table 1. The tRNA genes are represented by their single letter codes. CR1 and CR2 represents the control region 1 and the control region 2, respectively. Translocated genes are reported in the same colour. All the reference genomes of tick mitochondria read in the 5′ → 3′ direction as the major coding strand (J-strand). Genes on the J-strand and the N-strand are shown high and low, respectively. a. The tick mt genomes were classified into three types, which are type I, II and III (Results). The type III mt genomes were amplified into two large segments (L1&L2 or L3&L4) by long-PCR using total DNA from individual ticks. Using the complete D. silvarum mt genome (GenBank: MN347015), L1, L2, L3 and L4 were estimated as ~ 9.6, ~ 8.2, ~ 7.2 and ~ 9.2 Kbp in size (Table 1), respectively. b. The type III mt genomes of ticks read clockwise in the 5′ → 3′ direction
PCR primers for the Dermacentor mt genomes
| Forward primer | Reverse primer | Segment | Size(bp) |
|---|---|---|---|
| TCAGTCATTTTACCGCGATGA | GCTCAAATTCCATTCTCTGC | L1 | 9580 |
| AGCTGTTACTAACGTTGAGG | AGGATGTTGATGGATCGAAA | L2 | 8156 |
| GCTAKTGGGTTCATACCCCAA | CGACCTCGATGTTGGATTAGGA | L3 | 7155 |
| CCAACCTGATTCWCATCGGTCT | TCATCGCGGTAAAATGACTGA | L4 | 9187 |
| TGCTGCTGGCACAAATTTAGC | CAAGATGACCCTAAATTCAGGCA | CR1 | 483 |
| GGAGCTATACCAATTGAATATCCC | TTGGGGTATGAACCCAATAGC | CR2 | 645 |
| TGCATTCAGTTTCGGCCTGA | CCGGCTGTCTCATCTATTGAC | R2 | 3616 |
| CTATTCCGGCATAGTAAAATGCCTG | CAAGCTTATGCACCCTTTTCAATAC | R1 | 570 |
These primers were designed to amplify large segments (L1, L2, L3 and L4) and short segments (CR1, CR2, R1 and R2) in the mt genomes of the genus Dermacentor. Their PCR reaction conditions can be seen in the Methods. Based on the results using 100 individual ticks from four species, the primers for L3 and L4 were optimized to amplify more species of the genus Dermacentor than those of L1 and L2. The R2 segment spanned tRNAArg, tRNAAsn, tRNASer, tRNAGlu, ND1, tRNALeu, 16S rRNA, tRNAVal, 12S rRNA and CR1. The R1 segment spanned tRNAIle, tRNAGln, R1, tRNAPhe and ND5. The segment sizes were estimated using the D. silvarum mt genome (GenBank: MN347015)
Fig. 2Precise annotation of mt tRNAs and control regions. a. CR1 and CR2 were determined in the D. silvarum mt genome (GenBank: MN347015). b. In MN347015, small RNA A[U]7 was produced from between tRNACys and tRNAMet. One of tRNASer and tRNACys had no D-arms, whereas tRNAAla, tRNAGlu, tRNATyr and tRNAPhe had unstable T-arms (indicated in black box)
Fig. 3The transposon-like element in the Dermacentor silvarum mt genome. All the mt genomes read in the 5′ → 3′ direction as the J-strand. The genes from the J-strand and the N-strand are indicated in red and blue colours, respectively. a. The genes from the J-strand and the N-strand are deployed upward and downward, respectively. b. R1 and R2 were composed of several repeat units, respectively. And the repeat units in R1 are reverse complimentary to those in R2. In total, three types of repeat units (type 1, 2 and 3) of R1 were identified. c. R1 and R2 were determined to have 5 repeat units in the D. silvarum mt genome (GenBank: MN347015)
Precise annotations of the Dermacentor silvarum mt genome
| Gene | Strand | Start | End | Length |
|---|---|---|---|---|
| tRNA-Met | (+) | 1 | 67 | 67 |
| ND2 | (+) | 68 | 1028 | 961 |
| tRNA-Trp | (+) | 1029 | 1091 | 63 |
| tRNA-TyrAS | (+) | 1092 | 1149 | 58 |
| COI | (+) | 1150 | 2686 | 1537 |
| COII | (+) | 2687 | 3359 | 673 |
| tRNA-Lys | (+) | 3360 | 3429 | 70 |
| tRNA-Asp | (+) | 3430 | 3491 | 62 |
| ATP8/6 | (+) | 3492 | 4326 | 835 |
| COIII | (+) | 4327 | 5104 | 778 |
| tRNA-Gly | (+) | 5105 | 5171 | 67 |
| ND3 | (+) | 5172 | 5511 | 340 |
| tRNA-Ala | (+) | 5512 | 5576 | 65 |
| Intergenic | (+) | 5577 | 5578 | 2 |
| tRNA-Arg | (+) | 5579 | 5640 | 62 |
| tRNA-Asn | (+) | 5641 | 5708 | 68 |
| Intergenic | (+) | 5709 | 5712 | 4 |
| tRNA-Ser | (+) | 5713 | 5769 | 57 |
| tRNA-Glu | (+) | 5770 | 5834 | 65 |
| R2 | * | 5825 | 5987 | 163 |
| HAS1 | (+) | 5835 | 9258 | 3424 |
| tRNA-Ile | (+) | 9259 | 9322 | 64 |
| HAS2 | (+) | 9323 | 12,930 | 3608 |
| tRNA-Thr | (+) | 12,931 | 12,991 | 61 |
| tRNA-ProAS/ND6 | (+) | 12,992 | 13,503 | 512 |
| Cytb | (+) | 13,504 | 14,585 | 1082 |
| tRNA-Ser | (+) | 14,586 | 14,652 | 67 |
| tRNA-LeuAS/CR2 | (+) | 14,653 | 15,023 | 371 |
| CR2 | * | 14,717 | 15,023 | 307 |
| tRNA-Cys | (+) | 15,024 | 15,086 | 63 |
| Intergenic | (+) | 15,087 | 15,094 | 8 |
| tRNA-Tyr | (−) | 1090 | 1151 | 62 |
| LAS1 | (−) | 1152 | 5984 | 4833 |
| ND1 | (−) | 5985 | 6903 | 919 |
| tRNA-Leu | (−) | 6904 | 6964 | 61 |
| 16S rRNA | (−) | 6965 | 8185 | 1221 |
| tRNA-Val | (−) | 8186 | 8247 | 62 |
| 12S rRNA | (−) | 8248 | 8949 | 702 |
| CR1 | (−) | 8950 | 9258 | 309 |
| tRNA-IleAS | (−) | 9259 | 9323 | 65 |
| tRNA-Gln | (−) | 9324 | 9390 | 67 |
| R1 | (−) | 9391 | 9559 | 169 |
| tRNA-Phe | (−) | 9560 | 9620 | 61 |
| ND5 | (−) | 9621 | 11,275 | 1655 |
| tRNA-His | (−) | 11,276 | 11,341 | 66 |
| ND4/4 L | (−) | 11,342 | 12,928 | 1587 |
| tRNA-ThrAS | (−) | 12,929 | 12,991 | 63 |
| tRNA-Pro | (−) | 12,992 | 13,055 | 64 |
| LAS2 | (−) | 13,056 | 14,654 | 1599 |
| tRNA-Leu | (−) | 14,655 | 14,716 | 62 |
| LAS3 | (−) | 14,717 | 1089 | 1467 |
This reference sequence is available at the NCBI GenBank database under the accession number MN347015. J(+) and N(−) represent the major and minor coding strands of the mt genome, respectively. Control Region 1 (CR1) and tandem Repeat 1 (R1) were annotated as full-length RNAs cleaved from the minor coding strand (N-strand) primary transcript, whereas CR2 and R2 were annotated as DNA regions (*). The “AS” suffix represents antisense. H-strand Antisense Segment 1 (HAS 1) represents R2/ND1AS/tRNALeuAS/(16S rRNA)AS/tRNAValAS/(12S rRNA)AS/CR1. HAS2 represents tRNAGlnAS/R1/tRNAPheAS/ND5AS/tRNAHisAS/(ND4/4 L)AS. L-strand Antisense Segment 1 (LAS1) represents COIAS/COIIAS/tRNALysAS/tRNA-AspAS/(ATP8/6)AS/COIIIAS/tRNAGlyAS/ND3AS/tRNAAlaAS/tRNAArgAS/tRNAAsnAS/tRNASerAS/tRNAGluAS/R2. LAS2 represents ND6AS/CytbAS/tRNASerAS. LAS3 represents CR2/tRNACysAS/tRNAMetAS/ND2AS/tRNATrpAS
Mitochondrial CNV of STRs within a D. silvarum individual
| Position | Gene | Ref | Allele | Depth |
|---|---|---|---|---|
| 1810 | COI | [G]8 | Ref/+ 1G/−1G/+2GG/T/G/C/+ 1 T/*/−2GG | 352,526/6177/3116/150/32/25/19/19/10/6 |
| 2573 | COI | [A]9 | Ref/+ 1A/− 1A/T/+2AA/A/+3AAA/−2AA | 277,857/7636/1788/223/170/13/9/8 |
| 3441 | tRNA-Asp | [A]8 | Ref/+ 1A/+2AA/−1A/+ 1C/+3AAA/+ 1G/G/C/A | 438,644/195894/3402/1011/69/58/17/10/7/5 |
| 3619 | ATP8/6 | [A]10 | Ref/+ 1A/−1A/+2AA/−2AA/T/+3AAA/A/C | 400,525/23182/9717/997/185/137/49/14/11 |
| 4592 | COIII | [T]11 | Ref/+ 1 T/−1 T/+2TT/−2TT/+3TTT/T/+ 1A | 107,531/10859/6363/1027/140/110/20/10 |
| 5524 | tRNA-Ala | [A]10 | Ref/+1A/−1A/+2AA/−2AA/A/+3AAA | 97,659/3329/2622/131/27/19/9 |
| 6361 | ND1 | [A]10 | Ref/+1A/−1A/+2AA/−2AA/+3AAA/C | 94,957/4984/3410/313/18/17/8 |
| 7324 | 16S rRNA | [TA]9 | Ref/−2TA/+2TA | 93,454/4152/3088 |
| 7737 | 16S rRNA | [A]9 | Ref/+1A/−1A/+2AA/C/A | 89,380/1277/890/32/6/5 |
| 8042 | 16S rRNA | [T]9 | Ref/+1 T/−1 T/+2TT/−2TT/G | 96,225/1795/1552/40/8/5 |
| 8243 | 12S/tRNA-Val | [T]10 | Ref/+1 T/−1 T/+2TT/−2TT/+3TTT/G | 99,234/3993/2326/151/31/8/7 |
| 8885 | 12S rRNA | [T]10 | Ref/+1 T/−1 T/+2TT/−2TT/T/+3TTT | 86,875/4767/3630/287/38/36/18 |
| 10,026 | ND5 | [A]10 | Ref/−1A/+1A/+2AA/−2AA/+3AAA/C | 99,166/5465/3630/134/82/6/5 |
| 10,298 | ND5 | [A]13 | Ref/−1A/+1A/−2AA/+2AA/−3AAA/+3AAA/C | 91,464/17434/7221/2402/721/241/61/5 |
| 11,216 | ND5 | [A]17 | Ref/−1A/+1A/−2AA/+2AA/−3AAA/+3AAA/A/*/+2TA/+ 1C | 138,074/26622/18595/4443/2859/542/413/96/24/5/5 |
| 11,483 | ND4/4 L | [TA]6 | Ref/−2TA/+2TA/A/T | 200,829/3596/2031/19/7 |
| 11,962 | ND4/4 L | [A]11 | Ref/−1A/+1A/−2AA/+2AA/C/+3AAA/A/−3AAA | 203,481/18678/10550/646/496/28/20/11/10 |
| 12,082 | ND4/4 L | [A]8 | Ref/−1A/+1A/T/+2AA/A | 215,342/1501/821/48/12/7 |
| 12,505 | ND4/4 L | [A]9 | Ref/−1A/+1A/+2AA/C/−2AA | 227,869/3067/2631/47/20/9 |
| 12,697 | ND4/4 L | [A]8 | Ref/+1A/−1A/+2AA/C/A/G | 230,254/1922/497/23/22/6/6 |
This reference sequence (GenBank: MN347015) uses the consensus DNA sequence of one Dermacentor silvarum individual collected from Qiangyang, Gansu province of China. All the STR variants were selected using a very strict parameter (Methods). Ref was designated as the STR reference, as it occured the most frequently in mtDNAs within one individual tick. In the allele column, insertions and deletions were noted as “+” and “-” following the specifications in pileup format [20]. The numbers in the depth column are one-to-one corresponding to the variants in the allele column. For example, the STR position 1810 had a reference sequence [G]8 and a variant [G]9 noted by “+G”, which had a depth 6177