Literature DB >> 35002369

Complete mitochondrial genome of Manispentadactylapentadactyla (Mammalia: Pholidota), an endemic subspecies of Chinese pangolin: mitogenome characterisation and phylogenetic implications.

Nick Ching-Min Sun1,2, Chi-Chun Huang3, Yu-Wei Tseng4, Tulshi Laxmi Suwal5,6, Meng-Jou Chi7, Nian-Hong Jang-Liaw8, Kuo-Hsiang Hung4,9.   

Abstract

The Chinese pangolin Manispentadactyla is critically endangered because of over-exploitation and illegal trafficking and includes three subspecies. However, the taxonomic status of the three subspecies of the Chinese pangolin has not been well resolved, which impedes regional conservation and illegal trade traces. In this study, the complete mitogenome sequence of M.p.pentadactyla, an endemic subspecies of the Chinese pangolin in Taiwan, was determined. The complete mitogenome of M.p.pentadactyla is 16,570 base pairs (bp) in length with 13 protein-coding genes (PCG), 23 transfer RNAs (tRNAs), two ribosomal RNAs and a 1164 bp control region. The overall base composition of the genome showed a slight A + T bias (59.9%), positive AT skew (0.1515) and negative GC skew (-0.3406), which is similar to that of other pangolins. All PCGs started with a typical ATN codon and all tRNAs were typical cloverleaf-shaped secondary structures, except for tRNA-Ser(GCU). Phylogenetic analysis indicated a monophyletic relationship for M.p.pentadactyla and M.p.aurita and was monophyletic for M.p.pentadactyla, but paraphyletic for M.p.aurita. The paraphyly of M.p.aurita resulted from an incomplete lineage sorting. This study enriched the mitogenome database of the Chinese pangolin and the molecular information obtained should be very useful for future research on mitogenome evolution and genetic diversification in M.pentadactyla. Nick Ching-Min Sun, Chi-Chun Huang, Yu-Wei Tseng, Tulshi Laxmi Suwal, Meng-Jou Chi, Nian-Hong Jang-Liaw, Kuo-Hsiang Hung.

Entities:  

Keywords:  Manispentadactyla; critically endangered; mitogenome; phylogeny

Year:  2021        PMID: 35002369      PMCID: PMC8732882          DOI: 10.3897/BDJ.9.e77961

Source DB:  PubMed          Journal:  Biodivers Data J        ISSN: 1314-2828


Introduction

Pangolins (: ) are scaly-bodied mammals that inhabit a wide range of ecosystems, including secondary subtropical rainforests, bamboo forests, broadleaf forests, savanna woodlands, grasslands and agricultural landscapes in Africa and Asia (Chao et al. 2020). Pangolins are almost exclusively termite- and ant‐eating species, consuming all life stages of their prey, including eggs, larvae, pupae and adults. An individual pangolin consumes several million prey items each year (Harrison 1961, Pietersen et al. 2016, Lee et al. 2017, Sun et al. 2020). Thus, pangolins provide an essential ecological function in maintaining the balance of ecosystems and biodiversity by regulating the number of ants and termites (Del Toro et al. 2012, Chao et al. 2020). The Chinese pangolin Linnaeus, 1758 (: ), is widely found in East Asia, northern Southeast Asia and parts of South Asia, including Nepal, North India, Bhutan, Bangladesh, Myanmar and the northern Indochinese Peninsula, throughout most regions to the south of the Yangtze River in China and the Islands of Hainan and Taiwan (Wu et al. 2020). Over the past several decades, the Chinese pangolins have been over-exploited because of the massive demand for traditional medicines, health supplements, leather and ornaments (Challender et al. 2014). The Chinese pangolin is categorised as ‘Critically Endangered’ on the IUCN Red List of Threatened Species (Challender et al. 2019) and is also listed in Appendix I of the Convention on International Trade in Endangered Species of Wild Fauna and Flora (CITES 2021). Three subspecies of the Chinese pangolin have been identified in some reports based on morphological traits (Allen 1906, Allen 1938). The type specimen of was collected from Taiwan and the nominal subspecies, Linnaeus, 1758, refers to the Taiwanese population (Sun et al. 2019). The other two subspecies are Hodgson, 1836, found in mainland Asia and Allen, 1906 on the Hainan Island in China (Wu et al. 2020). As pangolins may display a greater intraspecific molecular variation than other mammals, assessment within a total evidence framework is required (Pietersen and Challender 2020). Thus, the subspecies status of the Chinese pangolin remains to be clarified with genetic evidence, including the nominal Formosan pangolin in Taiwan. However, the taxonomic status of all three subspecies has not been well resolved by genetic evidence (Wu et al. 2007, Hua et al. 2020, Wu et al. 2020). This impedes regional conservation and potential reintroduction, as well as illegal trade traces and law enforcement. An increase in pangolin numbers has been documented in many areas throughout Taiwan (Wu et al. 2007, Sun et al. 2019). In addition, the genetic structure of in a local population in eastern Taiwan has been assessed (Sun et al. 2020). In this study, we further determined the complete mitogenome of and constructed a phylogenetic tree of all eight pangolin species in . Furthermore, comparative mitochondrial genome analysis was carried out to identify similarities and differences between the subspecies of the Chinese pangolin.

Material and methods

Sample collection and genomic DNA extraction

The sample of was obtained from eastern Taiwan (23°7'N, 121°12'E), with permission from the Taiwan Forestry Bureau (permit number 1101607633 as required by the Wildlife Conservation Act 2013). The tissue was preserved in 70% ethanol for subsequent DNA extraction. Genomic DNA was extracted using a phenol-chloroform protocol (Sambrook and Russell 2001). The spectrophotometer NanoDrop 2000 (Thermo Fisher Scientific, MA, USA) was used to measure genomic DNA concentration.

Genome sequence assembly and analyses

Before genome sequencing, the quality/quantity of DNA samples were assessed using the Agilent Genomic DNA ScreenTape assay in conjunction with the 4200 TapeStation system (Agilent Technologies). The 10 ug of total DNA was sonicated using a Covaris M220 Focused-ultrasonicator to a size ranging from 400 to 500 bp. Subsequently, genomic DNA was used for library preparation using the Illumina Truseq DNA Sample Preparation Kit (Illumina, San Diego, USA), following the manufacturers’ preparation protocol. Genome sequencing was performed using the Illumina HiSeq platform for PE 2 × 150 bp sequencing. The raw sequences were filtered to obtain qualified reads using FASTP v.0.20 (Chen et al. 2018) and FLASH v.1.2 was used to merge paired-end reads (Magoč and Salzberg 2011). The complete circular mitochondrial genome of was assembled de novo with MitoFinder v.1.3 (Allio et al. 2020). Approximately 32.34 GB of clean data were obtained, yielding about 2192-fold depth of coverage of the mitogenome.

Mitogenome analyses

The assembled mitochondrial genome was annotated using the MITOS2 web server to predict the location of protein-coding genes (PCGs), transfer RNAs (tRNAs), ribosomal RNAs (rRNAs) and putative secondary structures of tRNAs (http://mitos2.bioinf.uni-leipzig.de/index.py) (Donath et al. 2019). The sequence with annotated features has been deposited in GenBank (Accession Number MZ868226) and a circular map of the mitochondria was generated using OGDRAW v.1.3.1 (Lohse et al. 2007). All PCGs codon usage and nucleotide frequencies were obtained using the Molecular Evolutionary Genetics Analysis software MEGA version X (Kumar et al. 2018). We also calculated AT skew and GC skew for full mitochondria and genes using the formula: AT skew = (A–T) / (A+T) and GC skew = (G–C) / (G+C) (Perna and Koeher 1995).

Phylogenetic analyses

Phylogenetic relationships were constructed based on the full mitogenome and 13 PCG sequences of , and other seven closely related pangolins (Suppl. material 1). Sequence alignment was conducted using the MAFFT online server (Katoh et al. 2019). The phylogeny with Maximum Likelihood (ML) method was performed in IQ-TREE v.2.1.3 (Nguyen et al. 2015). The GTR + F + I + G4 model and an ultrafast bootstrap approximation algorithm were selected (Minh et al. 2013).

Results and Discussion

Characteristics and composition of mitogenome

The mitogenome sequence of was 16,570 bp in length and contained 22 tRNAs, two rRNAs and 13 PCGs (Fig. 1, Table 1). The mitogenome size was well within the range found in , from 16540 (, GenBank No. MF536684) to 16,577 bp (, KT445978) (Suppl. material 1). The gene order, gene identity and gene number were consistent with those of pangolins mitogenomes (Du Toit et al. 2017). The light and heavy strands each contain their own arrangement of genes, proteins or loci. The 14 tRNA, two rRNA and 12 PCGs were located in the heavy strand and other genes in the light strand (Fig. 1, Table 2). The frequencies of adenine (A), cytosine (C), guanine (G) and thymine (T) were 34.5%, 26.9%, 13.2% and 25.4%, respectively. In addition, the nucleotide composition of the mitogenome was slightly A + T biased, which is similar to that of the other pangolins (Suppl. material 1) (Du Toit et al. 2017, Hua et al. 2020). These results also confirmed the A + T bias, which has been reported in several other mammalian species (Dou et al. 2016). The skew statistics of the whole genome showed that the whole mitochondrial genome of is AT skewed (0.1515) and GC skewed (-0.3406). Positive AT skew and negative GC skew were also found in the other pangolins (Suppl. material 1).
Figure 1.

Complete mitochondrial genome map of . The grey small circle represents GC content graph.

Table 1.

Characteristics of the mitochondrial genome of .

GenePositionLength (bp)AnticodonCodonIntergenic nucleotides*Strand
FromToStartStop
tRNA-Phe16868GAA0H
12S ribosomal RNA6910289600H
tRNA-Val1028109366UAC-1H
16S ribosomal RNA1094265915660H
tRNA-Leu (UAA)2660273374UAA0H
ND127373693957ATGTAA3H
tRNA-Ile3693376169GAU-1H
tRNA-Gln3759383173UUG-3L
tRNA-Met3833390169CAU1H
ND2387549131039ATGT---27H
tRNA-Trp4941500767UCA27H
tRNA-Ala5011507969UGC3L
tRNA-Asn5081515373GUU1L
tRNA-Cys5186525065GCA32L
tRNA-Tyr5251531767GUA0L
COX1531968691551ATGAGA1H
tRNA-Ser(UGA)6865693369UGA-5L
tRNA-Asp6941700767GUC7H
COX270087691684ATGTAA0H
tRNA-Lys7694775764UUU2H
ATP877597959201ATGTAA1H
ATP679208600681ATGTAA-40H
COX386009383784ATGT---1H
tRNA-Gly9384945269UCC0H
ND394539799347ATATA-0H
tRNA-Arg9800986667UCG0H
ND4L986710163297ATGTAA0H
ND410157115341378ATGT---7H
tRNA-His115351160268GUG0H
tRNA-Ser (GCU)116031166159GCU0H
tRNA-Leu (UAG)116631173371UAG1H
ND511743135541812ATATAA9H
ND61353814062525ATGAGA-17L
tRNA-Glu140631413169UUC0L
CYTB14135152741140ATGAGA3H
tRNA-Thr152751534167UGU0H
tRNA-Pro153411540767UGG-1L
D_loop15521165701049113H

Notes: * The numbers of nucleotides between the given and its previous gene, negative values indicate an overlap; H indicated that the genes are transcribed on the heavy strand.

Table 2.

Nucleotide composition in two rRNA, 13 protein-coding genes and 22 tRNA of mitochondrial genome of .

Adenine(%)Cytosine(%)Guanine(%)Thymine(%)A+T(%)AT skewGC skewLength (bp)
ribosomal RNA 37.9221.5417.7422.8060.720.2490-0.09672,526
12S ribosomal RNA36.8722.9217.9222.2959.160.2465-0.1224960
16S ribosomal RNA38.5720.6917.6223.1261.690.2504-0.08011,566
protein-coding genes 31.8429.3513.2825.5357.370.1100-0.377011,396
ND133.0229.9912.2324.7657.780.1430-0.4207957
ND237.6330.809.1422.4360.060.2531-0.54231,039
COX128.4327.2115.5127.8556.280.0103-0.27391,551
COX232.1628.9513.7425.1557.310.1223-0.3563684
ATP838.8129.857.4623.8862.690.2382-0.6001201
ATP633.3332.8910.7223.0656.390.1821-0.5084681
COX326.7930.9916.3325.8952.680.0171-0.3098784
ND334.2931.7010.9523.0657.350.1958-0.4865347
ND4L31.9829.9711.4526.6058.580.0918-0.4471297
ND433.6731.5711.2523.5157.180.1777-0.47451,378
ND534.9330.419.8824.7859.710.1700-0.50961,812
ND616.388.5732.5742.4858.86-0.44340.5834525
CYTB30.5331.6713.8623.9454.470.1210-0.39121,140
transfer DNA 36.4620.9415.3827.2263.680.1451-0.15311495

Protein-coding genes and codon usages

The sequence length of 13 PCGs was 11,396 bp with base compositions of 31.84%, 29.35%, 13.28% and 25.53% for A, C, G and T, respectively (Table 2). The shortest gene was ATP8 (201 bp) and the longest gene was ND5 (1,812 bp). The A + T content of 13 PCGs ranged from 52.68% (COX3) to 60.06% (ND2), with an average of 57.37%, showing a slight A + T bias. According to the AT skew and GC skew analyses, all PCGs exhibited a stronger nucleotide asymmetry with a positive AT skew, but negative GC skew, except for ND6 (Table 2). Analysis of codon usage of most PCGs provided evidence of bias in terms of the use of codons, with A and C occurring most frequently. The asymmetrical base composition observed in the PCGs of the mitogenomes was probably due to the process of mutation and/or adaptive selection (Mooers and Holmes 2000, Castellana et al. 2011). All PCGs started with a typical ATN codon: four (ND1, ND2, COX1, COX2, ATP8, ATP6, COX3, ND4L, ND4, ND6 and CYTB) with ATG and two (ND3 and ND5) with ATA. The complete stop codons, TAA and AGA, were found in six genes (ND1, COX2, ATP8, ATP6, ND4L and ND5) and three genes (COX1, ND6 and CYTB), while the remaining five genes terminated with a single base T (ND2, COX3, ND4) or one base TA (ND3) (Table 1).

Transfer and ribosomal RNA

The sequence lengths of 12S and 16S rRNA were 960 bp and 1,566 bp, respectively. The base composition of both rRNAs was 37.92% A, 21.54% C, 17.74% G and 22.80% T and the A + T content was 60.72%. The 22 tRNAs were interspersed in the whole mitochondrial genome, varying from 59 (tRNA-Ser(GCU)) to 73 nucleotides (tRNA-Gln and tRNA-Asn). The A + T content of these tRNAs was 63.68% and positive AT skew and negative GC skew were found (Table 2). The predicted secondary structures of the 22 tRNAs are shown in Fig. 2. All tRNAs were folded into typical cloverleaf-shaped secondary structures which included an amino acid accepting arm, DHU loop, anticodon loop and TψC loop, except for tRNA-Ser(GCU). The tRNA-Ser(GCU) lacks the DHU loop arm stem and DHU loop. Missing the DHU loop arm stem and DHU loop for tRNA-Ser(GCU) is common in metazoan mitochondria (Jühling et al. 2009, Watanabe et al. 2014).
Figure 2.

Putative secondary structure of the 22 tRNAs of . The tRNAs are labelled with the abbreviations of their corresponding amino acids.

Phylogenetic analysis

The topology of phylogenetic trees, based on full mitogenome sequences, illustrated that Asian and African pangolin species were separated into two distinct monophyletic clades, consistent with previous studies (Gaubert et al. 2018). and all were clustered in the same clade, but was a paraphyletic group (Fig. 3). We found that was closely related to KT445978, with 98.92% nucleotide similarity within the same clade. The sequence (KT445978) was only labelled as mtDNA sequence from NCBI and geographic location of this sample was Taiwan. The results of phylogenetic analysis indicated the individual (KT445978) was identified as and was a monophyletic group (Fig. 3). For the combined 13 PCG sequences, the phylogeny was also consistent with full mitogenome sequences. Funk and Omland (2003) reviewed the phylogenetic studies of mitochondrial DNA in animals, indicating that 23% of species were reconstructed as paraphyletic or polyphyletic relationships. Paraphyletic or polyphyletic groups are common in nature, even seen in plants (Hörandl 2006, Hörandl and Stuessy 2010). The mitogenome phylogeny of Eurasian () indicated that three subspecies (, and ) are paraphyletic groups (Mengüllüoğlu et al. 2021). The phylogeny of the paraphyletic group resulted from the stochasticity of the coalescent process, which is more likely to result in larger population sizes or shorter divergence times, leading to incomplete lineage sorting (Pamilo and Nei 1988, Díaz et al. 2018). However, introgression or hybridisation is another biological factor that causes this phenomenon (Harrison and Larson 2014). The current hybridisation between both subspecies is less possible due to the geographical barrier (Taiwan Strait) and the paraphyly of is likely a result of incomplete lineage sorting. Geographic isolation is known to contribute to divergent evolution, resulting in the monophyly of restricted to Taiwan. The results provide important clues for understanding the phylogeny of the three subspecies of . Furthermore, additional sampling of and will help us to clarify the phylogeny.
Figure 3.

Maximum Likelihood (ML) phylogenetic tree of and other pangolins. The numbers on branch lengths are bootstrap values.

Conclusions

The complete mitochondrial genome of was sequenced and the mitogenome sequence was 16,570 bp in size. The AT content was higher than the GC content, which is consistent with the findings of other pangolins. Maximum Likelihood phylogenetic analysis of the complete mitogenome and 13 PCGs showed that and all were clustered in the same clade and was a monophyletic group, but was a paraphyletic group. The complete mitogenome of , reported in this study, enriches the mitogenome database of the Chinese pangolin and provides useful information for the phylogeny and taxonomy of . Based on our study, to identify subspecies of the Chinese pangolins using mtDNA markers in the wildlife trade of the Chinese pangolin can still be difficult. The Chinese pangolins have a wide range of distribution, within which intraspecific divergences may occur in some regions, especially in west of Myanmar, north and northeast India, Bangladesh, Bhutan and Nepal. To clearly confirm the status of the subspecies of the Chinese pangolin and address the conservation efforts of wildlife trade, further studies are needed, especially for the subspecies () on Hainan Island, along with greater geographic sampling of and .

Data resources

The genome sequence data are available in GenBank (https://www.ncbi.nlm.nih.gov/) under accession no. MZ868226. Mitochondrial genome characteristics Table Characteristics of the mitochondrial genome of pangolins used in this study. File: oo_625289.pdf
  22 in total

1.  FLASH: fast length adjustment of short reads to improve genome assemblies.

Authors:  Tanja Magoč; Steven L Salzberg
Journal:  Bioinformatics       Date:  2011-09-07       Impact factor: 6.937

2.  Hybridization, introgression, and the nature of species boundaries.

Authors:  Richard G Harrison; Erica L Larson
Journal:  J Hered       Date:  2014       Impact factor: 2.645

3.  Relationships between gene trees and species trees.

Authors:  P Pamilo; M Nei
Journal:  Mol Biol Evol       Date:  1988-09       Impact factor: 16.240

4.  MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms.

Authors:  Sudhir Kumar; Glen Stecher; Michael Li; Christina Knyaz; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2018-06-01       Impact factor: 16.240

5.  Patterns of nucleotide composition at fourfold degenerate sites of animal mitochondrial genomes.

Authors:  N T Perna; T D Kocher
Journal:  J Mol Evol       Date:  1995-09       Impact factor: 2.395

6.  Improved annotation of protein-coding genes boundaries in metazoan mitochondrial genomes.

Authors:  Alexander Donath; Frank Jühling; Marwa Al-Arab; Stephan H Bernhart; Franziska Reinhardt; Peter F Stadler; Martin Middendorf; Matthias Bernt
Journal:  Nucleic Acids Res       Date:  2019-11-18       Impact factor: 16.971

7.  Ultrafast approximation for phylogenetic bootstrap.

Authors:  Bui Quang Minh; Minh Anh Thi Nguyen; Arndt von Haeseler
Journal:  Mol Biol Evol       Date:  2013-02-15       Impact factor: 16.240

8.  IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.

Authors:  Lam-Tung Nguyen; Heiko A Schmidt; Arndt von Haeseler; Bui Quang Minh
Journal:  Mol Biol Evol       Date:  2014-11-03       Impact factor: 16.240

Review 9.  Losing the stem-loop structure from metazoan mitochondrial tRNAs and co-evolution of interacting factors.

Authors:  Yoh-Ichi Watanabe; Takuma Suematsu; Takashi Ohtsuki
Journal:  Front Genet       Date:  2014-05-01       Impact factor: 4.599

10.  MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization.

Authors:  Kazutaka Katoh; John Rozewicki; Kazunori D Yamada
Journal:  Brief Bioinform       Date:  2019-07-19       Impact factor: 11.622

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.