| Literature DB >> 29238535 |
Jiang-Ni Li1, Chong He1, Peng Guo2, Peng Zhang1, Dan Liang1.
Abstract
Relative to the commonly used mitochondrial and nuclear protein-coding genes, the noncoding intron sequences are a promising source of informative markers that have the potential to resolve difficult phylogenetic nodes such as rapid radiations and recent divergences. Yet many issues exist in the use of intron markers, which prevent their extensive application as conventional markers. We used the diverse group of snakes as an example to try paving the way for massive identification and application of intron markers. We performed a series of bioinformatics screenings which identified appropriate introns between single-copy and conserved exons from two snake genomes, adding particular constraints on sequence length variability and sequence variability. A total of 1,273 candidate intron loci were retrieved. Primers for nested polymerase chain reaction (PCR) were designed for over a hundred candidates and tested in 16 snake representatives. 96 intron markers were developed that could be amplified across a broad range of snake taxa with high PCR successful rates. The markers were then applied to 49 snake samples. The large number of amplicons was subjected to next-generation sequencing (NGS). An analytic strategy was developed to accurately recover the amplicon sequences, and approximately, 76% of the marker sequences were recovered. The average p-distances of the intron markers at interfamily, intergenus, interspecies, and intraspecies levels were .168, .052, .015, and .004, respectively, suggesting that they were useful to study snake relationships of different evolutionary depths. A snake phylogeny was constructed with the intron markers, which produced concordant results with robust support at both interfamily and intragenus levels. The intron markers provide a convenient way to explore the signals in the noncoding regions to address the controversies on the snake tree. Our improved strategy of genome screening is effective and can be applied to other animal groups. NGS coupled with appropriate sequence processing can greatly facilitate the extensive application of molecular markers.Entities:
Keywords: Gloydius; Serpentes; intron; noncoding; phylogeny
Year: 2017 PMID: 29238535 PMCID: PMC5723593 DOI: 10.1002/ece3.3525
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
List of all species used in this study
| Family | Genus | Species | Collection locality or source | Voucher |
|---|---|---|---|---|
| Typhlopidae |
|
| Hongkong, China | RE28 |
| Boidae |
|
| Private breeding | RE37 |
| Pythonidae |
|
| — | RE26 |
| Xenopeltidae |
|
| Mengla, Yunnan, China | RE22 |
| Xenoderrmatidae |
|
| Private breeding | RE51 |
| Pareatidae |
|
| Bawanglin, Hainan, China | RE47 |
| Homalopsidae |
|
| Shaoguan, Guangdong, China | RE12 |
| Elapidae |
|
| Shaoguan, Guangdong, China | RE04‐2 |
| Elapidae |
|
| Mengla, Yunnan, China | RE17 |
| Colubridae |
|
| Shaoguan. Guangdong, China | RE35 |
| Colubridae |
|
| — | RE41 |
| Colubridae |
|
| Shaoguan, Guangdong, China | RE55 |
| Colubridae |
|
| Shaoguan, Guangdong, China | RE59 |
| Viperidae |
|
| Guangzhou, Guangdong, China | RE49 |
| Viperidae |
|
| Guangzhou, Guangdong, China | RE60 |
| Viperidae |
|
| Shaoguan, Guangdong, China | RE05 |
| Viperidae |
|
| Yongzhou, Hunan, China | RE46 |
| Viperidae |
|
| Mengla, Yunnan, China | RE21 |
| Viperidae |
|
| — | RE44 |
| Viperidae |
|
| Shaoguan, Guangdong, China | RE56 |
| Viperidae |
|
| — | RE45 |
| Viperidae |
|
| Huangshan, Anhui, China | GP01 |
| Viperidae |
|
| Kuangdian. Liaoning, China | GP02 |
| Viperidae |
|
| Hebei, China | GP03 |
| Viperidae |
|
| Hunan, China | GP05 |
| Viperidae |
|
| Huangshan, Anhui, China | GP06 |
| Viperidae |
|
| Anhui, China | GP07 |
| Viperidae |
|
| Liaoning, China | GP08 |
| Viperidae |
|
| Ji'an, Jilin, China | GP09 |
| Viperidae |
|
| — | GP10 |
| Viperidae |
|
| Tonghua, Jilin, China | GP11 |
| Viperidae |
|
| Tonghua, Jilin, China | GP12 |
| Viperidae |
|
| Yan'an, Shaanxi, China | GP15 |
| Viperidae |
|
| Yan'an, Shaanxi, China | GP16 |
| Viperidae |
|
| Benxi, Liaoning, China | GPI7 |
| Viperidae |
|
| Shandong, China | GP18 |
| Viperidae |
|
| Yantai, Shandong, China | GP19 |
| Viperidae |
|
| Ji'an, Jilin, China | GP20 |
| Viperidae |
|
| Jilin, China | GP2I |
| Viperidae |
|
| Shedao island, Dalian, China | GP22 |
| Viperidae |
|
| Shedao island, Dalian, China | GP23 |
| Viperidae |
|
| Liupanshan, Ningxia, China | GP24 |
| Viperidae |
|
| Shiqu, Sichuan, China | GP25 |
| Viperidae |
|
| Zhouzhi, Shaanxi, China | GP27 |
| Viperidae |
|
| Shaanxi, China | GP28 |
| Viperidae |
|
| Teuriisland, Hokkaido, Japan | GP29 |
| Viperidae |
|
| Shangri‐La, Yunnan, China | GP30 |
| Viperidae |
|
| Shangri‐La, Yunnan, China | GP31 |
| Viperidae |
|
| Shiqu, Sichuan, China | GP33 |
Species were chosen to verify the effectiveness of the newly‐designed primers.
Figure 1Scheme of the bioinformatic pipeline
Information for the 96 novel intron markers
| Loci | Length (bp) | %GC mean | Mean | Taxa amplified (%) | Loci | Length (bp) | %GC mean | Mean | Taxa amplified (%) |
|---|---|---|---|---|---|---|---|---|---|
| ARHGAP4_8‐9 | 926 | 38.4 | .070 | 49 (100) | AGBL5_6‐7 | 627 | 40.0 | .057 | 46 (93) |
| ASXL3_9‐10 | 1,035 | 32.1 | .053 | 49 (100) | C6orf62_4‐5 | 690 | 35.3 | .083 | 46 (93) |
| CACHD1_12‐13 | 1,146 | 35.2 | .071 | 49 (100) | CPOX_1‐2 | 568 | 35.2 | .084 | 46 (93) |
| CASC3_5‐6 | 613 | 36.7 | .067 | 49 (100) | CSTF1_2‐3 | 803 | 32.3 | .060 | 46 (93) |
| CELSR3_21‐23 | 1,166 | 44.4 | .061 | 49 (100) | GOT1_4‐5 | 648 | 37.8 | .082 | 46 (93) |
| DARS_8‐9 | 1,206 | 39.9 | .060 | 49 (100) | LZTR1_14‐15 | 775 | 34.9 | .064 | 46 (93) |
| DICER1_3‐4 | 629 | 35.7 | .063 | 49 (100) | NUTM1_1‐2 | 1,580 | 41.9 | .062 | 46 (93) |
| E1F4G2_14‐15 | 626 | 36.1 | .057 | 49 (100) | SLU7_14‐15 | 769 | 40.3 | .070 | 46 (93) |
| HUWE1_56‐57 | 760 | 40.1 | .066 | 49 (100) | U2AF2_11‐12 | 911 | 39.5 | .103 | 46(93) |
| INO80_9‐11 | 766 | 32.0 | .073 | 49 (100) | UGCG_4‐5 | 1,235 | 36.9 | .061 | 46 (93) |
| IPO9_14‐15 | 632 | 33.1 | .066 | 49 (100) | VPS13B_36‐37 | 885 | 36.7 | .101 | 46 (93) |
| NIPBL_6‐7 | 802 | 34.5 | .067 | 49 (100) | ZNF608_3‐4 | 1,388 | 37.2 | .068 | 46 (93) |
| RALGAPB_6‐7 | 987 | 31.6 | .069 | 49 (100) | ANAPC1_2‐3 | 1,154 | 31.8 | .068 | 45 (91) |
| SAP18_2‐3 | 1,116 | 33.5 | .057 | 49 (100) | C6orf52_3‐4 | 1,140 | 31.0 | .071 | 45 (91) |
| SRP72_2‐4 | 915 | 34.1 | .084 | 49 (100) | LYST_45‐46 | 796 | 36.9 | .079 | 45 (91) |
| SULF1_7‐8 | 646 | 36.0 | .060 | 49 (100) | MYOF_36‐37 | 958 | 36.3 | .072 | 45 (91) |
| TBC1D16_9‐10 | 675 | 44.3 | .079 | 49 (100) | PDXK_3‐4 | 1,339 | 35.1 | .057 | 45 (91) |
| UBXN7_4‐6 | 763 | 32.0 | .065 | 49 (100) | POLR1B_4‐5 | 1,451 | 37.6 | .054 | 45 (91) |
| AQR_17‐18 | 963 | 35.3 | .096 | 48 (97) | RBM22_5‐6 | 881 | 33.5 | .078 | 45 (91) |
| CLINT1_2‐3 | 1,136 | 37.9 | .070 | 48 (97) | SEC13_6‐7 | 961 | 35.7 | .063 | 45 (91) |
| CNOT10_11‐12 | 877 | 32.3 | .074 | 48 (97) | SMC2_22‐23 | 773 | 37.0 | .082 | 45(91) |
| DDB1_16‐I8 | 781 | 36.4 | .076 | 48 (97) | YIPF3_2‐3 | 710 | 35.7 | .072 | 45 (91) |
| DDX23_9‐10 | 553 | 38.1 | .082 | 48 (97) | HIPK1_11‐12 | 836 | 37.5 | .078 | 44 (89) |
| DLST_1‐2 | 769 | 35.5 | .072 | 48 (97) | SF3B1_18‐19 | 624 | 34.3 | .180 | 44 (89) |
| DNMT1_16‐17 | 1,034 | 42.7 | .091 | 48 (97) | WDFY3_62‐63 | 1,284 | 41.1 | .077 | 44 (89) |
| DYNC1H1_69‐70 | 857 | 33.7 | .066 | 48 (97) | WDR36_7‐8 | 651 | 35.7 | .071 | 44 (89) |
| JMJD6_2‐3 | 1,563 | 36.1 | .059 | 48 (97) | PSMD12_3‐4 | 511 | 30.1 | .050 | 43 (87) |
| PRR7_1‐2 | 871 | 50.9 | .051 | 48 (97) | UBE3A_4‐5 | 1,328 | 34.0 | .065 | 43 (87) |
| RIOK3_8‐9 | 943 | 32.9 | .092 | 48 (97) | USP34_75‐77 | 1,258 | 33.1 | .075 | 43 (87) |
| RUVBL2_7‐9 | 939 | 43.8 | .096 | 48 (97) | CNOT2_13‐14 | 703 | 30.7 | .061 | 42 (85) |
| SKIV2L2_22‐23 | 978 | 30.0 | .014 | 48 (97) | MFSD4_6‐7 | 770 | 35.2 | .080 | 42 (85) |
| SSRP1_5‐6 | 542 | 35.8 | .084 | 48 (97) | MCM2_3‐4 | 1,224 | 43.2 | .069 | 41 (83) |
| TMCO3_2‐3 | 870 | 30.7 | .074 | 48 (97) | TENM3_5‐6 | 687 | 29.8 | .077 | 41 (83) |
| TOMM70A_4‐5 | 1 133 | 35.9 | .073 | 48 (97) | PRPF38A_3‐5 | 601 | 30.3 | .110 | 40 (81) |
| TRAPPC11_28‐29 | 685 | 33.7 | .069 | 48 (97) | POLR2B_4‐5 | 686 | 30.4 | .098 | 39 (79) |
| WDR47_13‐14 | 691 | 34.4 | .075 | 48 (97) | TAF2_10‐11 | 951 | 34.7 | .087 | 39 (79) |
| ZNF142_4‐5 | 1,280 | 44.8 | .057 | 48 (97) | COPB2_14‐15 | 617 | 34.3 | .093 | 37 (75) |
| CXXC1_1‐2 | 1,292 | 33.9 | .084 | 47 (95) | KIF3A_12‐13 | 1,454 | 38.3 | .120 | 36 (73) |
| DCAF7_2‐3 | 811 | 34.4 | .061 | 47 (95) | KLHL24_2‐3 | 800 | 30.9 | .054 | 36 (73) |
| DDX18_7‐8 | 598 | 32.9 | .086 | 47 (95) | MYOID_7‐8 | 1,003 | 30.4 | .145 | 32 (65) |
| DYNC2H1_43‐44 | 1,608 | 36.3 | .070 | 47 (95) | ACTR8_1‐2 | 644 | 37.3 | .095 | 30 (61) |
| FTSJ3_2‐3 | 631 | 42.7 | .077 | 47 (95) | PTPRQ_4‐5 | 883 | 32.0 | .095 | 21 (42) |
| MPDZ_4‐5 | 1,288 | 39.4 | .094 | 47 (95) | PXK_9‐10 | 684 | 34.4 | .125 | 21 (42) |
| UBA1_19‐22 | 935 | 41.3 | .062 | 47 (95) | B1RC6_17‐18 | 1,044 | 34.1 | .143 | 18 (36) |
| UBR5_13‐14 | 758 | 31.3 | .063 | 47 (95) | HBS1L_7‐8 | 1,136 | 32.3 | .112 | 18 (36) |
| USP14_2‐4 | 1,036 | 31.8 | .070 | 47 (95) | ERP44_5‐6 | 954 | 31.2 | .185 | 17 (34) |
| VPS54_6‐7 | 689 | 33.4 | .071 | 47 (95) | TTN_89‐90 | 1,359 | 32.6 | .161 | 16 (32) |
| ZSWIM8_26‐27 | 569 | 39.4 | .135 | 47 (95) | FOXK2_5‐6 | 1,244 | 31.7 | .021 | 12 (24) |
Markers are ranked by the successful rate of amplification among all 49 samples.
Figure 2Comparison of the mean pairwise distance between our intron markers and the Nuclear protein‐coding markers among different snake taxa at four different taxonomic levels (a. interfamily, b. intergenus, c. interspecies, and d. intraspecies)
Figure 3Visualization of tree space using multidimensional scaling (MDS) for the gene trees generated from the intron markers and the concatenated Maximum‐likelihood (ML) tree. Blue points represent the ML gene tree from each of the 96 markers, while the red point indicates the tree from concatenation analysis using RAxML with 96 partitions
Figure 4Phylogeny of 49 snakes inferred from the 96 intron markers. The tree was inferred by concatenation analysis using RAxML. For better display, it is shown in two parts with different scale bar. Part (a) displays the family‐level phylogeny, while part (b) exhibits the relationships within the genus Gloydius. Bootstrap supports for each branch are shown close to node and an asterisk indicates support =100