Literature DB >> 28369358

Comparative optical genome analysis of two pangolin species: Manis pentadactyla and Manis javanica.

Huang Zhihai1, Xu Jiang2, Xiao Shuiming2, Liao Baosheng1,2, Gao Yuan3, Zhai Chaochao4, Qiu Xiaohui1, Xu Wen1, Chen Shilin2.   

Abstract

Background: The pangolin is a Pholidota mammal with large keratin scales protecting its skin. Two pangolin species ( Manis pentadactyla and Manis javanica ) have been recorded as critically endangered on the International Union for Conservation of Nature Red List of Threatened Species. Optical mapping constructs high-resolution restriction maps from single DNA molecules for genome analysis at the megabase scale and to assist genome assembly. Here, we constructed restriction maps of M. pentadactyla and M. javanica using optical mapping to assist with genome assembly and analysis of these species. Findings: Genomic DNA was nicked with Nt.BspQI and then labeled using fluorescently labeled bases that were detected by the Irys optical mapping system. In total, 3,313,734 DNA molecules (517.847 Gb) for M. pentadactyla and 3,439,885 DNA molecules (504.743 Gb) for M. javanica were obtained, which corresponded to approximately 178X and 177X genome coverage, respectively. Qualified molecules (≥150 kb with a label density of >6 sites per 100 kb) were analyzed using the de novo assembly program embedded in the IrysView pipeline. We obtained two maps that were 2.91 Gb and 2.85 Gb in size with N50s of 1.88 Mb and 1.97 Mb, respectively. Conclusions: Optical mapping reveals large-scale structural information that is especially important for non-model genomes that lack a good reference. The approach has the potential to guide de novo assembly of genomes sequenced using next-generation sequencing. Our data provide a resource for Manidae genome analysis and references for de novo assembly. This note also provides new insights into Manidae evolutionary analysis at the genome structure level.
© The Authors 2016. Published by Oxford University Press.

Entities:  

Keywords:  Manidae; Optical mapping; Pangolin; Restriction maps

Mesh:

Year:  2016        PMID: 28369358      PMCID: PMC6082621          DOI: 10.1093/gigascience/giw001

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


Data description

Background

Pangolins are the sole representatives of order Pholidota. They are a group of nocturnal mammals that are well-known for their full armor of scales. Only eight pangolin species exist worldwide, which have been distributed into three genera (Manis, Phataginus and Smutsia) according to Gaudin et al. [1]. Manis pentadactyla and Manis javanica belong to the Asian Pangolin subfamily and have long been used as an ingredient in traditional medicine in China and southeast Asia, and their colonies and habitats have thus largely been destroyed by poaching and deforestation. These two species are on the verge of extinction; they are currently recorded as critically endangered on the International Union for Conservation of Nature Red List of Threatened Species. Optical mapping is a molecular tool for chromosome-wide restriction map production [2]. During the optical mapping process, stretched linear DNA is labeled at specific sequence motifs and then exposed under a fluorescence microscope to generate an image signal, which translates to motif–distance information for further analysis [3]. Optical mapping has several advantages over traditional sequencing approaches, such as single molecule analysis and long DNA molecules, and can be used for de novo map assembly or sequencing contig anchoring. To date, optical mapping has facilitated or improved assembly arrays of large genomes, including humans [4, 5], Oropetium thomaeum [6], and Ganoderm lucidum [7]. In addition to genome assembly guidance, optical mapping offers a complementary approach for sequence variation analysis in large-region comparisons in addition to nucleotide matches and provides unique traits for evolutionary or functional analyses. At present, direct comparisons between optical maps are usually conducted in microbes but are lacking in large genomes, such as animals or plants [8]. Here, we present two optical Manis maps, and we reveal and compare their genetic structures using pairwise sequence alignments to identify their interspecific variations.

DNA extraction, labeling and data collection

All animal studies were reviewed and approved by the animal ethics review committee of Guangdong Provincial Hospital of Chinese Medicine. High molecular weight DNA was isolated from M. pentadactyla and M. javanica blood samples. A total of 3 ml of blood from an orbital sample was anticoagulated with EDTA and then shipped on ice. A total of 9 ml of red blood cell lysis solution was mixed with each 3 ml sample and rocked gently at room temperature for 10 min. The mixture was spun at 2000 × g at 4 °C for 2 min, and the supernatant was discarded. The pellet was suspended in 3 ml of phosphate-buffered saline. After removing the insoluble particulates, the mixture was spun again. The supernatant was discarded and the pellet was resuspended in 563 μl of refrigerated cell suspension buffer at a density of ∼0.5 × 107 cells/ml. For gel casting, 75 μl of resuspension buffer was mixed with 25 μl of preheated 2% agarose, and the gels were solidified at 4 °C for 45 min. The gel casts were immersed in 5 ml of lysis buffer (0.5 M EDTA, pH 9.5, 1% lauroyl sarcosine, sodium salt, and 2 mg/ml proteinase K) at 50 °C for 2 days for cell lysis. The cell-lysed gels were washed with 1× Tris-EDTA; then, the immobilized DNA was recovered by melting the gels at 70 °C for 10 min, followed by incubation with GELase (Epicentre, USA) at 43 °C for 45 min. The recovered DNA was drop dialyzed against Tris-EDTA for 4 h using a 0.1-μm membrane. The dialyzed DNA was quantified and stored for nicking. Prior to molecular nicking, the DNA was equilibrated at room temperature for 30 min and gently mixed with wide bore tips. In a 10-μl reaction system, 300 ng of equilibrated DNA was added to 7 units of Nt.BspQI (NEB, USA) nickase and 1 μl of nicking buffer and mixed. The nicking process was conducted in a thermal cycler at 37 °C for 2 h. The nicked DNA was incubated with 5 μl of labeling master mix containing 1.5 μl of labeling buffer (BioNano Genomics, USA), 1.5 μl of labeling mix (BioNano Genomics, USA), and 1 μl of Taq polymerase (NEB, USA) to flag specific motifs. The labeling process was conducted at 72 °C for 1 h. Each labeled DNA solution was mixed with 15 μl of Repair Master Mix containing 0.5 μl of 10 Thermo polymerase buffer (NEB, USA), 0.4 μl of 50× repair mix (BioNano Genomics, USA), 0.4 μl of 50 mM NAD+ (NEB, USA), 1.0 μl of Taq DNA polymerase (NEB, USA), and 2.7 μl of ultrapure water for nick repair. The repair reaction was conducted at 37 °C for 30 min, followed by the addition of 1 μl of stop solution (BioNano Genomics, USA) to stop the reaction. After ligating the nicks, the backbone of the labeled DNA was stained with IrysPrep DNA stain solution (BioNano Genomics, USA) at 4 °C overnight. The prepared samples were loaded onto Irys chips (BioNano Genomics, USA) and then applied to the chip nano-channels. The concentration time was 400 s to avoid over-staining or over-loading. The fluorescently labeled DNA was illuminated by the corresponding laser, and the signal was captured by an onboard electron microscope charge-coupled device camera (Fig. 1). The acquired images were converted to digital data with the AutoDetect software (BioNano Genomics, USA). In total, 517.874 Gb and 504.743 Gb of data were generated for M. pentadactyla and M. javanica, representing 178X and 177X coverage of their predicted genomes, respectively. Further methodological details available from protocols.io [9].
Figure 1.

Irys raw images. Labeled high molecular weight DNA was linearized in the chip nano-channel. The restriction sites were digested with Nt.BspQI and labeled with fluorescence dNTP. The DNA backbone (a,c) and labels (b,d) were detected by EM CCD with blue (473 nm) and green (532 nm) lasers, respectively. Raw images in a and b are from M. pentadactyla; and in c and d from M. javanica. The raw molecule data were detected with Irys AutoDetect 2.1.4 from the raw images

Irys raw images. Labeled high molecular weight DNA was linearized in the chip nano-channel. The restriction sites were digested with Nt.BspQI and labeled with fluorescence dNTP. The DNA backbone (a,c) and labels (b,d) were detected by EM CCD with blue (473 nm) and green (532 nm) lasers, respectively. Raw images in a and b are from M. pentadactyla; and in c and d from M. javanica. The raw molecule data were detected with Irys AutoDetect 2.1.4 from the raw images

Genome assembly

All data were filtered by IrysView 2.4 using the following criteria: molecule lengths ≥150 kb and a label signal/noise ratio ≥ 3. The number of filtered molecules was approximately 1,360,730 for M. pentadactyla with a N50 length of 275.5 kb and 1,254,380 for M. javanica with a N50 length of 281.1 kb. The label density was 10.193/100 kb for M. pentadactyla and 10.151/100 kb for M. javanica. The distance between adjacent labels ranged from 0 kb to 833.609 kb for M. pentadactyla and 0 kb to 955.352 kb for M. javanica. During the detection process, two label sites that are near each other will be detected because they cannot be separated; therefore, the distance between these sites will be 0 bp. Simple tandem repeat areas with repeat units with only one restriction site were detected by the molecules. The statistical analysis revealed that the most common simple tandem repeat unit size was 4.3 kb, followed by 5.2 kb in M. pentadactyla and 4.6 kb and 3.4 kb in M. javanica. The total length of the repeat region accounted for 0.55% and 0.45% of the raw data. The RefAligner and Assembler packages in IrysView were used for de novo assembly. IrysView 2.4 can be obtained from BioNano Genomics [10]. The software requirements are as follows: Windows Python Runtime v2.7.5, Microsoft .Net 4.5.2, and Irys tools (RefAligner and Assembler [11]). The assembly process comprised a molecule pairwise comparison, graph building and map refinement. In this work, the P-value thresholds were 1 × 10−9 for the pairwise assembly, 1 × 10−10 for the extension and refinement steps, and 1 × 10−11 for the final merging. The false positive and false negative parameters were set to 1.5/100 kb and 0.15/100 kb. Finally, 2202 maps spanning 2.91 Gb of the genome were assembled for M. pentadactyla and 2096 maps spanning 2.85 Gb of the genome were assembled for M. javanica, with N50 lengths of 1.884 Mb and 1.972 Mb, respectively (Table 1). The largest M. pentadactyla fragment was approximately 14.21 Mb in size with 1354 label sites, and the largest M. javanica fragment was approximately 10.39 Mb in size with 1004 label sites (Fig. 2).
Table 1.

Statistical analysis of the M. pentadactyla and M. javanica physical map data

M. pentadactyla M. javanica
Raw data
 Quantity (Gb)517.874 (178X)504.743 (177X)
 Number of molecules3,313,7343,439,885
 Molecule N50 (kb)216.3212.4
 Label density (per 100 kb)11.711.4
 Label SNR11.710.8
Filtered data
 Molecule length threshold (kb)150150
 Label SNR threshold33
 Quantity (Gb)360.483 (123X)339.814 (119X)
 Number of molecules1,360,7301,254,380
 Molecule N50275.5281.1
 Label density10.19310.151
 Label SNR12.711.4
Assembly statistics
 Total length (Gb)2.912.85
 Number of maps22022094
 Map N50 (Mb)1.8841.972
 Average map length (Mb)1.321.36
 Maximum map length (Mb)14.20510.385
Pairwise alignment
 Number of aligned maps21962088
 Total aligned length (Mb)4904.524716.849
 Unique aligned length (Mb)2861.222783.567

SNR signal-to-noise ratio

Figure 2.

Assembled physical map. The physical maps were assembled and extended based on the similarity and overlap of molecules. The blue bars indicate the physical map and the green bars indicate the molecules. In the absence of amplification, the molecule coverage of each part of the physical map is very uniform. a and b are physical map examples of M. pentadactyla and M. javanica, respectively

Assembled physical map. The physical maps were assembled and extended based on the similarity and overlap of molecules. The blue bars indicate the physical map and the green bars indicate the molecules. In the absence of amplification, the molecule coverage of each part of the physical map is very uniform. a and b are physical map examples of M. pentadactyla and M. javanica, respectively Statistical analysis of the M. pentadactyla and M. javanica physical map data SNR signal-to-noise ratio

Genomics comparison

A whole-genome comparison between these two species was performed with RefAligner using a P-value of 1 × 10−9. The results showed that 2196 maps covering 2.86 Gb from M. pentadactyla and 2088 maps covering 2.78 Gb from M. javanica could be mapped to one another with map rates of 97.544% and 98.282%, respectively. In total, 23,631 alignment blocks were generated. However, several reverse alignments were found in the blocks, suggesting that a series of large genome rearrangement events occurred during the divergence and evolution of these two species (Fig. 3).
Figure 3.

Physical map comparison of M. pentadactyla and M. javanica. The physical maps of M. pentadactyla and M. javanica were compared based on sequence similarity. The physical map of M. pentadactyla is shown at the top and M. javanica is shown at the bottom. The line in the middle shows the links between similar regions. The comparison shows that the two species have high similarity at the physical map level (greater than 97% aligned to each other). However, some areas contained insertions/deletions or inversions (below)

Physical map comparison of M. pentadactyla and M. javanica. The physical maps of M. pentadactyla and M. javanica were compared based on sequence similarity. The physical map of M. pentadactyla is shown at the top and M. javanica is shown at the bottom. The line in the middle shows the links between similar regions. The comparison shows that the two species have high similarity at the physical map level (greater than 97% aligned to each other). However, some areas contained insertions/deletions or inversions (below)

Conclusion

Several phylogenetic investigations have been conducted in Manidae sequences using the SRY gene, COX I gene and whole mitochondrial sequence [12]. A series of genome projects for pangolins is ongoing. However, no genome-wide comparisons have been reported to date. Here, we present two optical maps of M. pentadactyla and M. javanica generated using the Irys system. These maps can serve as reliable references for further genome assembly. The comparison of the two maps revealed similarities and differences between M. pentadactyla and M. javanica and showed that potential genome rearrangement events occurred during Manidae evolution. Our work implies that optical mapping provides reliable long-range linkage information for genome assembly and can be a suitable choice for convenient and low-coast genome-wide comparisons of highly related species. A new Pangolin genome has also recently been published [13] and this map can potentially aid in improving this reference.

Availability of supporting data and materials

Datasets supporting this Data Note are deposited at the GigaScience repository, GigaDB [14]. Megabase DNA extraction and Irys NLRS DNA labeling and data collection protocols are also available via the protocols.io repository [9].

Competing interests

Zhai Chaochao is an employee of Ultravision-tech.

Funding

This work is supported by the National Nature Science Foundation of China (81403053 and 81503469), Guangdong Provincial Hospital of Chinese Medicine Special Fund (2015KT1817), and China Academy of Chinese Medical Sciences Special Fund for Health Service Development of Chinese Medicine (ZZ0908067).

Authors’ contributions

CSL, HZH and XJ designed the project. HZH, XSM, and GY prepared the samples. XJ, LBS, and ZCC performed the experiments. LBS, XJ, XSM, and HZH analyzed the data. CSL, XJ, HZH, LBS, and QXH wrote the manuscript. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  9 in total

1.  Complete mitochondrial genome of the Manis pentadactyla (Pholidota, Manidae): comparison of M. pentadactyla and M. tetradactyla.

Authors:  Xin-Min Qin; Shi-Rong Dou; Qing-Xin Guan; Ping-Sheng Qin; Ying She
Journal:  Mitochondrial DNA       Date:  2012-02

2.  Assembly and diploid architecture of an individual human genome via single-molecule technologies.

Authors:  Matthew Pendleton; Robert Sebra; Andy Wing Chun Pang; Ajay Ummat; Oscar Franzen; Tobias Rausch; Adrian M Stütz; William Stedman; Thomas Anantharaman; Alex Hastie; Heng Dai; Markus Hsi-Yang Fritz; Han Cao; Ariella Cohain; Gintaras Deikus; Russell E Durrett; Scott C Blanchard; Roger Altman; Chen-Shan Chin; Yan Guo; Ellen E Paxinos; Jan O Korbel; Robert B Darnell; W Richard McCombie; Pui-Yan Kwok; Christopher E Mason; Eric E Schadt; Ali Bashir
Journal:  Nat Methods       Date:  2015-06-29       Impact factor: 28.547

3.  Single-molecule sequencing of the desiccation-tolerant grass Oropetium thomaeum.

Authors:  Robert VanBuren; Doug Bryant; Patrick P Edger; Haibao Tang; Diane Burgess; Dinakar Challabathula; Kristi Spittle; Richard Hall; Jenny Gu; Eric Lyons; Michael Freeling; Dorothea Bartels; Boudewijn Ten Hallers; Alex Hastie; Todd P Michael; Todd C Mockler
Journal:  Nature       Date:  2015-11-11       Impact factor: 49.962

4.  Genome mapping on nanochannel arrays for structural variation analysis and sequence assembly.

Authors:  Ernest T Lam; Alex Hastie; Chin Lin; Dean Ehrlich; Somes K Das; Michael D Austin; Paru Deshpande; Han Cao; Niranjan Nagarajan; Ming Xiao; Pui-Yan Kwok
Journal:  Nat Biotechnol       Date:  2012-08       Impact factor: 54.908

5.  Bacteriophage strain typing by rapid single molecule analysis.

Authors:  Assaf Grunwald; Moran Dahan; Anna Giesbertz; Adam Nilsson; Lena K Nyberg; Elmar Weinhold; Tobias Ambjörnsson; Fredrik Westerlund; Yuval Ebenstein
Journal:  Nucleic Acids Res       Date:  2015-05-27       Impact factor: 16.971

6.  Single-molecule optical genome mapping of a human HapMap and a colorectal cancer cell line.

Authors:  Audrey S M Teo; Davide Verzotto; Fei Yao; Niranjan Nagarajan; Axel M Hillmer
Journal:  Gigascience       Date:  2015-12-29       Impact factor: 6.524

7.  A hybrid approach for de novo human genome sequence assembly and phasing.

Authors:  Yulia Mostovoy; Michal Levy-Sakin; Jessica Lam; Ernest T Lam; Alex R Hastie; Patrick Marks; Joyce Lee; Catherine Chu; Chin Lin; Željko Džakula; Han Cao; Stephen A Schlebusch; Kristina Giorda; Michael Schnall-Levin; Jeffrey D Wall; Pui-Yan Kwok
Journal:  Nat Methods       Date:  2016-05-09       Impact factor: 28.547

8.  Comparative optical genome analysis of two pangolin species: Manis pentadactyla and Manis javanica.

Authors:  Huang Zhihai; Xu Jiang; Xiao Shuiming; Liao Baosheng; Gao Yuan; Zhai Chaochao; Qiu Xiaohui; Xu Wen; Chen Shilin
Journal:  Gigascience       Date:  2016-12-01       Impact factor: 6.524

9.  Pangolin genomes and the evolution of mammalian scales and immunity.

Authors:  Siew Woh Choo; Mike Rayko; Tze King Tan; Ranjeev Hari; Aleksey Komissarov; Wei Yee Wee; Andrey A Yurchenko; Sergey Kliver; Gaik Tamazian; Agostinho Antunes; Richard K Wilson; Wesley C Warren; Klaus-Peter Koepfli; Patrick Minx; Ksenia Krasheninnikova; Antoinette Kotze; Desire L Dalton; Elaine Vermaak; Ian C Paterson; Pavel Dobrynin; Frankie Thomas Sitam; Jeffrine J Rovie-Ryan; Warren E Johnson; Aini Mohamed Yusoff; Shu-Jin Luo; Kayal Vizi Karuppannan; Gang Fang; Deyou Zheng; Mark B Gerstein; Leonard Lipovich; Stephen J O'Brien; Guat Jah Wong
Journal:  Genome Res       Date:  2016-08-10       Impact factor: 9.043

  9 in total
  13 in total

1.  Expanding the conservation genomics toolbox: Incorporating structural variants to enhance genomic studies for species of conservation concern.

Authors:  Jana Wold; Klaus-Peter Koepfli; Stephanie J Galla; David Eccles; Carolyn J Hogg; Marissa F Le Lec; Joseph Guhlin; Anna W Santure; Tammy E Steeves
Journal:  Mol Ecol       Date:  2021-09-12       Impact factor: 6.622

2.  Transcriptomic analysis identifies genes and pathways related to myrmecophagy in the Malayan pangolin (Manis javanica).

Authors:  Jing-E Ma; Lin-Miao Li; Hai-Ying Jiang; Xiu-Juan Zhang; Juan Li; Guan-Yu Li; Li-Hong Yuan; Jun Wu; Jin-Ping Chen
Journal:  PeerJ       Date:  2017-12-22       Impact factor: 2.984

3.  Panax ginseng genome examination for ginsenoside biosynthesis.

Authors:  Jiang Xu; Yang Chu; Baosheng Liao; Shuiming Xiao; Qinggang Yin; Rui Bai; He Su; Linlin Dong; Xiwen Li; Jun Qian; Jingjing Zhang; Yujun Zhang; Xiaoyan Zhang; Mingli Wu; Jie Zhang; Guozheng Li; Lei Zhang; Zhenzhan Chang; Yuebin Zhang; Zhengwei Jia; Zhixiang Liu; Daniel Afreh; Ruth Nahurira; Lianjuan Zhang; Ruiyang Cheng; Yingjie Zhu; Guangwei Zhu; Wei Rao; Chao Zhou; Lirui Qiao; Zhihai Huang; Yung-Chi Cheng; Shilin Chen
Journal:  Gigascience       Date:  2017-11-01       Impact factor: 6.524

4.  Comparative optical genome analysis of two pangolin species: Manis pentadactyla and Manis javanica.

Authors:  Huang Zhihai; Xu Jiang; Xiao Shuiming; Liao Baosheng; Gao Yuan; Zhai Chaochao; Qiu Xiaohui; Xu Wen; Chen Shilin
Journal:  Gigascience       Date:  2016-12-01       Impact factor: 6.524

5.  Complete Chloroplast Genome Sequence and Phylogenetic Analysis of the Medicinal Plant Artemisia annua.

Authors:  Xiaofeng Shen; Mingli Wu; Baosheng Liao; Zhixiang Liu; Rui Bai; Shuiming Xiao; Xiwen Li; Boli Zhang; Jiang Xu; Shilin Chen
Journal:  Molecules       Date:  2017-08-11       Impact factor: 4.411

6.  Genome-Wide Characterization and Expression Analyses of Pleurotus ostreatus MYB Transcription Factors during Developmental Stages and under Heat Stress Based on de novo Sequenced Genome.

Authors:  Lining Wang; Wei Gao; Xiangli Wu; Mengran Zhao; Jibin Qu; Chenyang Huang; Jinxia Zhang
Journal:  Int J Mol Sci       Date:  2018-07-14       Impact factor: 5.923

7.  Investigation on Species Authenticity for Herbal Products of Celastrus Orbiculatus and Tripterygum Wilfordii from Markets Using ITS2 Barcoding.

Authors:  Jingjing Zhang; Xin Hu; Ping Wang; Bisheng Huang; Wei Sun; Chao Xiong; Zhigang Hu; Shilin Chen
Journal:  Molecules       Date:  2018-04-21       Impact factor: 4.411

8.  Complete Chloroplast Genome Sequence and Phylogenetic Analysis of Paeonia ostii.

Authors:  Shuai Guo; Lili Guo; Wei Zhao; Jiang Xu; Yuying Li; Xiaoyan Zhang; Xiaofeng Shen; Mingli Wu; Xiaogai Hou
Journal:  Molecules       Date:  2018-01-26       Impact factor: 4.411

9.  Complete Chloroplast Genome Sequence and Phylogenetic Analysis of Aster tataricus.

Authors:  Xiaofeng Shen; Shuai Guo; Yu Yin; Jingjing Zhang; Xianmei Yin; Conglian Liang; Zhangwei Wang; Bingfeng Huang; Yanhong Liu; Shuiming Xiao; Guangwei Zhu
Journal:  Molecules       Date:  2018-09-21       Impact factor: 4.411

10.  Comprehensive Characterization for Ginsenosides Biosynthesis in Ginseng Root by Integration Analysis of Chemical and Transcriptome.

Authors:  Jing-Jing Zhang; He Su; Lei Zhang; Bao-Sheng Liao; Shui-Ming Xiao; Lin-Lin Dong; Zhi-Gang Hu; Ping Wang; Xi-Wen Li; Zhi-Hai Huang; Zhi-Ming Gao; Lian-Juan Zhang; Liang Shen; Rui-Yang Cheng; Jiang Xu; Shi-Lin Chen
Journal:  Molecules       Date:  2017-05-31       Impact factor: 4.411

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.