| Literature DB >> 29722814 |
Chien-Yueh Lee1, Ping-Han Hsieh1, Li-Mei Chiang1, Amrita Chattopadhyay2, Kuan-Yi Li3,4, Yi-Fang Lee1, Tzu-Pin Lu5, Liang-Chuan Lai6, En-Chung Lin7, Hsinyu Lee1,8,9, Shih-Torng Ding7,9, Mong-Hsun Tsai2,9,10,11, Chien-Yu Chen3,9,12, Eric Y Chuang1,2,13.
Abstract
Background: The Mikado pheasant (Syrmaticus mikado) is a nearly endangered species indigenous to high-altitude regions of Taiwan. This pheasant provides an opportunity to investigate evolutionary processes following geographic isolation. Currently, the genetic background and adaptive evolution of the Mikado pheasant remain unclear.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29722814 PMCID: PMC5941149 DOI: 10.1093/gigascience/giy044
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
DNA contigs and scaffolds from the genomic data of the Mikado pheasant
| Contigs | Scaffolds | |
|---|---|---|
| Total length | 1,054,607,905 | 1,035,947,982 |
| Maximum length | 195,342 | 50,275,205 |
| Number of Ns | 0 | 19,577,473 |
| Average length | 5,050 | 110,714 |
| N50 | 13,461 | 11,461,115 |
| N75 | 6,528 | 5,708,287 |
| L50 | 22,195 | 28 |
| L75 | 50,081 | 59 |
| Counts ≥300 bp | 208,810 | - |
| Counts ≥1 kb | 123,006 | 9,357 |
| Counts ≥5 kb | 61,237 | 1,489 |
| Counts ≥10 kb | 32,868 | 928 |
The N50/N75 length is defined as the shortest sequence length at 50%/75% of the genome.
The L50/L75 count is defined as the smallest number of contigs (or scaffolds) that those length sum produces N50/N75.
Values of the genome assembly were calculated using the contigs ≥300 bp and scaffolds ≥1000 bp.
Figure 1:A chromosome-level comparison of the Mikado pheasant and the chicken. (A) A syntenic map of the Mikado pheasant and chicken genomes. The x-axis specifies the chromosome position in the chicken, whereas the y-axis specifies the scaffold position in the Mikado pheasant. The red dots (or lines) indicate that the sequences were aligned in the same orientation, and the blue dots indicate an alignment with a reverse complement. (B) A chord diagram of scaffolds with a total length greater than 500 kb and an alignment length greater than 10 kb. The orange perimeters specify the chromosomes (chr) of the chicken, whereas the purple perimeters specify the scaffolds (sc) of the Mikado pheasant. The red links represent the sequences aligned in the same orientation, and the blue links represent an alignment with a reverse complement. Yellow arrows indicate the scaffolds that were fully aligned, and gray arrows indicate the multiple alignment.
Figure 2:Evolution of gene families among various animal species. A phylogenetic tree was reconstructed based on 5,287 single-copy orthologs of 10 species. The most recent common ancestor (MRCA) contains 18,220 gene families that were used to examine gene families with expansions or contractions. The numbers of gene families with significant expansions and contractions are shown in red and blue, respectively, at each branch. The divergence times and associated 95% confidence intervals (in parentheses) are indicated at the nodes of the tree in Mya. All nodes had 100% support in 500 bootstrap replicates.
Figure 3:An identity plot of the MHC regions of the Mikado pheasant and the chicken. The chicken MHC sequence was downloaded from GenBank (AB268588). Its nucleotide sequence from 17,978 to 241,251 was aligned against the Mikado pheasant MHC sequence from 2,615 to 229,500 in scaffold208. The gene structure boxes on the horizontal and vertical axes, respectively, represent the gene loci in the Mikado pheasant and the chicken. Boxes with different sizes exhibit different gene locus sizes, and red/blue coloring indicates genes in forward/reverse orientation. The red dots (or lines) on the diagonal indicate that the sequences were aligned in the same orientation, whereas the blue dots indicate alignments with reverse complements. The green dotted lines highlight the sequence of the inverted TAPBP locus and TAP1-TAP2 block. The orange peaks show the read counts on a natural log scale of the gene expression based on our RNA-seq data. The box plot colored in purple indicates d ratios of genes.
Coding sequences of MHC-B genes in the Mikado pheasant and comparisons with the chicken
| Mikado pheasant | Chicken | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Position | Strand | Gene length | Amino acid length | Exon | Aligned base | Nucleotide identity (%) | Aligned amino acid | Amino acid identity (%) | Amino acid substitutions | dN/dS |
| KIFC1 | 2615–5304 | + | 1,140 | 380 | 7 | 1,131 | 91.76 | 377 | 90.53 | 33 | 0.8669 |
| Blec3 | 8997–11 221 | - | 552 | 183 | 5 | 507 | 82.43 | 168 | 78.14 | 25 | 0.3821 |
| Bzfp3 | 12 126–18 213 | + | 1,449 | 482 | 13 | 1,569 | 85.8 | 522 | 83.62 | 40 | 0.1884 |
| TRIM7.2 | 19 507–24 562 | - | 1,518 | 505 | 7 | 1,518 | 95.98 | 505 | 98.61 | 7 | 0.0391 |
| Bzfp2 | 27 027–29 946 | + | 1,368 | 455 | 4 | 1,396 | 70.41 | N/A | N/A | N/A | 0.2438 |
| Bzfp1 | 31 049–33 298 | - | 1,425 | 474 | 2 | 1,426 | 88.23 | 471 | 86.79 | 54 | 0.1900 |
| 44G24.1 | 37 266–37 673 | - | 408 | 136 | 1 | 408 | 85.78 | 136 | 80.15 | 27 | 0.2762 |
| IL4I1 | 42 730–46 759 | + | 1,578 | 525 | 6 | 1,572 | 92.25 | 523 | 93.75 | 25 | 0.1011 |
| TRIM7.1 | 51 325–62 131 | - | 1,758 | 585 | 8 | 1,767 | 92.49 | 588 | 92.69 | 40 | 0.1545 |
| HEP21 | 63 362–64 247 | - | 324 | 107 | 3 | 324 | 93.52 | 107 | 91.59 | 9 | 0.2148 |
| TRIM39.2 | 70 980–74 640 | - | 1,392 | 464 | 6 | 1,389 | 93.68 | 463 | 94.61 | 24 | 0.1167 |
| TRIM27.2 | 76 988–80 522 | + | 1,431 | 476 | 7 | 1,431 | 94.13 | 476 | 92.23 | 37 | 0.2415 |
| TRIM39.1 | 81 560–85 449 | - | 798 | 266 | 5 | 798 | 93.23 | 266 | 91.35 | 23 | 0.2753 |
| TRIM27.1 | 86 518–90 228 | - | 1,485 | 495 | 7 | 1,485 | 94.48 | 495 | 94.34 | 28 | 0.1715 |
| TRIM41 | 91 918–96 605 | + | 1,656 | 551 | 7 | 1,770 | 89.58 | 589 | 91.71 | 7 | 0.0375 |
| GNB2L1 | 98 038–101 512 | - | 954 | 317 | 8 | 954 | 96.86 | 317 | 100 | 0 | N/A |
| BTN1 | 103 411–114 264 | + | 930 | 309 | 8 | 939 | 74.64 | 339 | 57.26 | 96 | 0.8357 |
| BTN2 | 117 466–120 157 | + | 1,461 | 487 | 7 | 1,481 | 90.41 | 469 | 83.37 | 52 | 0.3996 |
| BG1 | 124 105–125 436 | - | 549 | 183 | 3 | 546 | 91.99 | 182 | 87.98 | 21 | 0.5591 |
| Blec2 | 131 358–133 021 | - | 579 | 192 | 5 | 579 | 86.32 | 190 | 71.88 | 52 | 0.9375 |
| Blec1 | 135 818–137 846 | + | 567 | 188 | 5 | 567 | 92.59 | 188 | 88.3 | 22 | 0.4683 |
| BLB1 | 138 411–139 729 | - | 339 | 112 | 3 | 345 | 83.38 | 113 | 42.98 | 62 | 0.7904 |
| TAPBP | 140 657–144 216 | + | 1,293 | 430 | 8 | 1,293 | 92.19 | 430 | 89.77 | 44 | 0.3179 |
| BLB2 | N/A | N/A | 792 | 263 | N/A | 792 | 92.93 | 263 | 85.93 | 37 | 1.4489 |
| BRD2 | 150 146–156 295 | - | 2,976 | 991 | 13 | 3,078 | 86.85 | 776 | 75.28 | 30 | 0.0306 |
| DMA | 160 545–162 778 | + | 789 | 263 | 4 | 789 | 92.65 | 263 | 89.73 | 27 | 0.4528 |
| DMB1 | 163 010–165 184 | + | 930 | 310 | 6 | 930 | 91.29 | 310 | 86.45 | 42 | 0.4978 |
| DMB2 | 165 617–168 363 | + | 768 | 256 | 5 | 768 | 92.71 | 256 | 92.58 | 19 | 0.1622 |
| BF1 | 169 254–170 740 | + | 996 | 331 | 5 | 1,001 | 83.66 | 345 | 64.12 | 95 | 0.7116 |
| TAP2 | 172 793–176 021 | - | 2,100 | 700 | 9 | 2,100 | 92.48 | 700 | 93.14 | 48 | 0.1675 |
| TAP1 | 176 574–180 981 | + | 1,752 | 584 | 11 | 1,739 | 93.21 | 580 | 92.81 | 38 | 0.2191 |
| BF2 | 181 900–184 038 | - | 1,530 | 509 | 6 | 1,213 | 62.28 | 326 | 57.39 | 119 | 0.8157 |
| C4 | 185 102–199 258 | + | 5,031 | 1,676 | 40 | 4,998 | 93.33 | 1,665 | 93.2 | 101 | 0.1974 |
| CenpA | 199 593–200 795 | + | 396 | 131 | 4 | 396 | 96.72 | 131 | 99.24 | 1 | 0.0324 |
| CYP21 | 201 291–205 141 | + | 1,431 | 477 | 11 | 1,431 | 92.67 | 477 | 94.13 | 28 | 0.7109 |
| TNXB | 209 524–215 604 | - | 2,472 | 824 | 10 | 2,496 | 92.14 | 832 | 92.34 | 50 | 0.2002 |
| LTB4R1 | 221 450–222 538 | + | 1,089 | 363 | 1 | 1,089 | 94.12 | 363 | 94.49 | 20 | 0.1954 |
| CD1A2 | 223 740–225 788 | - | 1,044 | 348 | 6 | 1,044 | 92.24 | 348 | 87.93 | 42 | 0.3796 |
| CD1A1 | 227 030–229 500 | - | 1,122 | 374 | 6 | 1,122 | 93.4 | 374 | 90.64 | 35 | 0.3294 |
KIFC1, kinesin family member C1; Blec, C-type lectin-like receptor; Bzfp, B-locus zinc finger-like protein; TRIM, tripartite motif containing protein; 44G24.1, histone H2B-like protein; IL4I1, interleukin 4 induced 1; HEP21, hen egg protein 21 kDa; GNB2L1, guanine nucleotide binding-like protein; BTN, B-butyrophilin protein; BG1, BG-like antigen; CD1A1/A2, CD1-like proteins.
dN/dS = ratio of nonsynonymous (dN) to synonymous (dS) substitutions.
Defined as a pseudogene in chicken.
No predicted result was identified from the DNA assembly. The transcript sequence was alternatively derived from the transcriptome assembly by RNA-seq.
Figure 4:A phylogenetic tree of Syrmaticus pheasants. The divergence times and associated 95% confidence intervals shown in parentheses are given at the branch nodes of the tree in million years ago.