| Literature DB >> 28050581 |
Xue-Juan Li1, Li-Liang Lin1, Ai-Ming Cui1, Jie Bai1, Xiao-Yang Wang2, Chao Xin1, Zhen Zhang1, Chao Yang3, Rui-Rui Gao1, Yuan Huang1, Fu-Min Lei4.
Abstract
The data in this paper are related to the research article entitled "Taxonomic status and phylogenetic relationship of tits based on mitogenomes and nuclear segments" (X.J. Li et al., 2016) [1]. The mitochondrial genomes and nuclear segments of tits were sequenced to analyze mitochondrial characteristics and phylogeny. In the data, the analyzed results are presented. The data holds the resulting files of mitochondrial characteristics, heterogeneity, best schemes, and trees.Entities:
Year: 2016 PMID: 28050581 PMCID: PMC5192249 DOI: 10.1016/j.dib.2016.11.079
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Nucleotide compositions of different mitochondrial partitions in 10 tits species. Note: AT-skew ([A−T]/[A+T]), GC-skew ([G−C]/[G+C]), PCG-1st (the first codon positions of protein-coding genes), PCG-2nd (the second codon positions of protein-coding genes), PCG-3rd (the third codon positions of protein-coding genes), tRNA-H (the tRNA genes on H-strand), tRNA-L (the tRNA genes on L-strand).
Fig. 2Conserved site percentages of mitochondrial genes among 10 tits species.
Fig. 3The heterogeneity analyzed by AliGROOVE. Note: The heterogeneity continuously decreased from −1 (red coloring) to +1 (blue coloring). A: the first and second codon positions of protein-coding genes, B: protein-coding genes with the third codon positions not using RY-coding method, C: mitochondrial genome with the third codon positions not using RY-coding method, D: nuclear segments dataset. (For interpretation of the references to color in this figure legend, the reader is referred to the web version of this article).
Fig. 4The gene trees and species tree analyzed by using ASTRAL. Note: The gene trees (A–F) were constructed based on maximum likelihood method. A: MOS; B: FGB; C: ALDOB; D: PCBD1; E: CALB1; F: mitochondrial genome; G: species tree.
Taxonomic samples in the study.
| Family | Genus | Species and subspecies | Sample locality/source | GenBank accession Nos. | |||||
|---|---|---|---|---|---|---|---|---|---|
| Mitogenome | MOS | FGB | ALDOB | PCBD1 | CALB1 | ||||
| Paridae | Beach forestry centre, Zhouqu County, Gansu Province | ||||||||
| Baihualing, Gaoligongshan, Yunnan Province | |||||||||
| Beach forestry centre, Zhouqu County, Gansu Province | |||||||||
| Dahaoping, Gaoligongshan, Yunnan Province | |||||||||
| Liancheng, Yongdeng County, Gansu Province | |||||||||
| Maoershan, Shangzhi City, Heilongjiang Province | |||||||||
| Beach forestry centre, Zhouqu County, Gansu Province | |||||||||
| Kizil, Baicheng County, Xinjiang | |||||||||
| Longqishan Nature Reserve, Fujian Province | |||||||||
| Sanguanmiao, Shaanxi Province | |||||||||
| Wen County, Gansu Province | |||||||||
| Yangxin County, Huangshi City, Hubei Province | |||||||||
| Bird Island, Qinghai Lake, Qinghai Province | |||||||||
| Luding County, Sichuan Province | |||||||||
| Remizidae | Xinxing Town, Panjin City, Liaoning Province | ||||||||
The primers used in this study.
| Name | Sequence(5′–3′) | Name | Sequence(5′–3′) |
|---|---|---|---|
| L1263b | AAAGCATRRCACTGAA | H10343b | TGGGCTCATGTGACKGTRACKCC |
| H1859b | TCGATTACAGAACAGGCTCCTCTA | L10236b | TTCTGAGCMTTCTTCCAYTCMAG |
| L1754b | TGGGATTAGATACCCCACTATG | H10884b | GGGTCRAAWCCRCATTCGTATGG |
| H2294b | TTTCAGGTGTAAGCTGAATGCTT | L10635b | CACCACTTYGGCTTYGARGCAGC |
| L2260b | CAAGGTAAGTGTACCGGAAGGTG | H11837b | ARGGTKGCTTCRAATGCRATRTARAA |
| H2891b | TGATGGCTGCTTRARGGCCCAC | L11458b | TCYACCCGAACYCACGGCTCMGA |
| L2725b | CGAGCCGGGTGATAGCTGG | H12344b | CTATGTGGCTKACKGAKGAGTAKGC |
| H3292b | TGATTGCGCTACCTTTGCACGG | L12156b | CCHAAAGCMCACGTAGAAGCMCC |
| L3218b | CGACTGTTTACCAAAAACATAGCC | H13047b | CTTTTACTTGGATTTGCACCAA |
| H3784b | CGGTCTGAACTCAGATCACG | L13040b | ATCCAATGGTCTTAGGARCCA |
| L3722b | GGTTTACGACCTCGATGTTGG | H13563b | TGRAGGGCDGCRGTGTTRGC |
| H4170b | CCYACRATRTTTGGGCCTTTKCG | L13525b | GMTGAGAAGGRGTAGGAATCATATC |
| L3803b | CTACGTGATCTGAGTTCAGACCG | H14127b | CCTATTTTTCGRATGTCYTGTTC |
| H4644b | TCRAATGGGGCRCGRTTTGTYTC | L14080b | TCAACYCACGCATTCTTYAARGC |
| L4500b | GTAGCCCAAACAATCTCMTAYGARG | H15049b | GTGTCTGCTGTGTAGTGYATDGC |
| H5201b | CCATCATTTTCGGGGTATGG | L14770b | TMGGMCCAGAAGGAYTVGC |
| L5143b | GAACCTRCACWARAGRGATCAAAAC | H15295b | CCTCAGAATGATATTTGKCCTCAKGG |
| H5766b | GGAKGAGAAGGCTAKGATTTTTCG | L14996b | AACATCTCADCHTGATGAAACTTYGG |
| L5758b | GGRGGMTGAATAGGMCTAAACCARAC | H15646b | GGYGTGAARTTTTCTGGRTCTCC |
| H6681b | GGTATAGGGTDCCRATGTCTTTRTG | L15413b | GGWGGATTYTCAGTAGACAACCC |
| L6615b | CCTCTGTAAAAAGGACTACAGCC | H16064b | CTTCAATCTTTGGYTTACAAGACC |
| H7122b | GCTGTTGTRATGAAGTTGATDGCYCC | L15725b | AAACCHGAATGATACTTCCTMTTYGC |
| L7036b | GGAACAGGATGAACYGTNTACCC | H1530b | GGTGGCTGGCACARGATTTACC |
| H7548b | GTRGCGGATGTRAAGTATGCTCG | CMOSF | GCCTGGTGCTCCATCGACTGG |
| L7525b | GTNTGAGCMCACCACATRTTYAC | CMOSR | GCAAATGAGTAGATGTCTGCT |
| H8121b | GGGCAGCCGTGRATTCATTC | FIB4F | CTGTAATATCCCGGTGGTTTCAGG |
| L7987b | TCAGACTACCCAGAYGCCTAYAC | FIB4R | ATTTCAGATGTTTCACCTCCCTTTC |
| H8628b | TCGTAGGWTCAGTATCATTGRTGNCC | AldB6F | GAGCCAGAAGTCTTACCTGAYGG |
| L8386b | GCYTCATCMCCYATCATAGAAGA | AldB7R | CAGCTGTCACCATGTTNGG |
| H9235b | TCGAAGAAGCTTAGGTTCATGGTCA | DCOH3F | AGGCCTGGCTTCATGAC |
| L8929b | GGMCAATGCTCAGAAATYTGYGG | DCOH4R | GATAAACCYGTGCARTCYTGGGTGCT |
| H9726b | AGRTGKCCTGCTGTNAGRTTNGC | Cal9F | AGGGTGTCAARATGTGTGSGAAAGA |
| L9700b | GAAACAACAAGCCTACTHATYCGHCC | Cal11R | GTANAGCTTCCCTCCATCNGACAA |
Means the primers used in LA-PCR.
The P-distance based on mitogenome dataset.
| Species | Genus | |||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.085 | 0.085 | |||||||||||||||||||
| 0.084 | 0.051 | 0.092 | 0.088 | |||||||||||||||||
| 0.092 | 0.087 | 0.088 | 0.090 | 0.083 | 0.094 | |||||||||||||||
| 0.090 | 0.084 | 0.083 | 0.094 | 0.096 | 0.089 | 0.080 | 0.096 | |||||||||||||
| 0.096 | 0.089 | 0.089 | 0.080 | 0.096 | 0.096 | 0.091 | 0.099 | 0.094 | 0.101 | |||||||||||
| 0.091 | 0.088 | 0.088 | 0.038 | 0.093 | 0.080 | 0.093 | 0.089 | 0.079 | 0.094 | 0.082 | 0.102 | |||||||||
| 0.092 | 0.089 | 0.088 | 0.038 | 0.094 | 0.080 | 0.021 | 0.092 | 0.087 | 0.077 | 0.092 | 0.080 | 0.099 | 0.071 | |||||||
| 0.086 | 0.021 | 0.052 | 0.087 | 0.084 | 0.090 | 0.088 | 0.088 | |||||||||||||
| 0.085 | 0.052 | 0.010 | 0.088 | 0.083 | 0.089 | 0.088 | 0.088 | 0.053 | ||||||||||||
| 0.096 | 0.091 | 0.090 | 0.097 | 0.094 | 0.101 | 0.100 | 0.099 | 0.091 | 0.091 | |||||||||||
| 0.093 | 0.090 | 0.089 | 0.080 | 0.094 | 0.082 | 0.078 | 0.079 | 0.090 | 0.089 | 0.102 | ||||||||||
| 0.092 | 0.089 | 0.086 | 0.077 | 0.092 | 0.080 | 0.077 | 0.076 | 0.088 | 0.086 | 0.099 | 0.071 | |||||||||
Best schemes analyzed by PartitionFinder.
| Dataset | Subset | Subset partitions | Optimal model |
|---|---|---|---|
| Protein-coding genes | P1 | atp6_pos1, nad1_pos1, nad2_pos1, nad3_pos1, nad4L_pos1, nad4_pos1, nad5_pos1 | GTR+I+G |
| P2 | atp6_pos2, atp8_pos2, cox3_pos2, cox2_pos2, cox1_pos2, cytb_pos2, nad1_pos2, nad2_pos2, nad3_pos2, nad4L_pos2, nad4_pos2, nad5_pos2 | GTR+I+G | |
| P3 | atp6_pos3, atp8_pos1, atp8_pos3, cox3_pos3, cox2_pos3, cox1_pos3, cytb_pos3, nad1_pos3, nad2_pos3, nad3_pos3, nad4L_pos3, nad4_pos3 | GTR+G | |
| P4 | cox3_pos1, cox2_pos1, cox1_pos1, cytb_pos1 | GTR+I+G | |
| P5 | nad5_pos3, nad6_pos3 | GTR+G | |
| P6 | nad6_pos1, nad6_pos2 | GTR+G | |
| Mitogenomes | P1 | rrnS, rrnL, atp6_pos1, nad1_pos1, nad2_pos1, nad3_pos1, nad4L_pos1, nad4_pos1, nad5_pos1, trnR, trnD, trnG, trnH, trnI, trnK, trnM, trnF, trnS(agy), trnW, trnV | GTR+I+G |
| P2 | atp6_pos2, atp8_pos2, cox3_pos2, cox2_pos2, cox1_pos2, cytb_pos2, nad1_pos2, nad2_pos2, nad3_pos2, nad4L_pos2, nad4_pos2, nad5_pos2 | GTR+I+G | |
| P3 | atp6_pos3, atp8_pos3, cox3_pos3, cox2_pos3, cox1_pos3, cytb_pos3, D_loop, nad1_pos3, nad2_pos3, nad3_pos3, nad4L_pos3, nad4_pos3 | GTR+I+G | |
| P4 | cox3_pos1, cox2_pos1, cox1_pos1, cytb_pos1, trnN, trnL(uur), trnL(cun), trnS(ucn), trnT, trnY | GTR+I+G | |
| P5 | atp8_pos1, nad5_pos3, nad6_pos3 | GTR+I+G | |
| P6 | nad6_pos1, nad6_pos2, trnA, trnC, trnQ, trnE, trnP | GTR+I+G | |
| Nuclear segments | P1 | ALDOB_exon, CALB1_exon, MOS_exon, PCBD1_exon, PCBD1_intron, FGB_exon | GTR+I+G |
| P2 | ALDOB_intron, CALB1_intron, FGB_intron | GTR+G |
Note: Pos1, pos2, and pos3 indicate the first, second and third codon positions of protein-coding genes in mitogenomes, respectively.
| Subject area | Biology, Genetics and Genomics |
| More specific subject area | Phylogenetics and Phylogenomics |
| Type of data | Figures, Tables, Trees |
| How data was acquired | The analyses of A+T contents, conserved site percentages and P-distances, were obtained in MEGA 4.1 |
| Data format | Analyzed |
| Experimental factors | The RY-coding method was employed for the third sites of protein-coding genes, while nuclear dataset was divided into different parts (exons and introns). |
| Experimental features | The phylogeny employed the best schemes inferred by PartionFinder v1.1.1 |
| Data source location | Shaanxi Normal University |
| Data accessibility | Data is with this article |