| Literature DB >> 36002819 |
Hao Chen1, Min Huang2, Daoqiang Liu3, Hongbo Tang1, Sumei Zheng1,2, Jing Ouyang1, Hui Zhang2, Luping Wang1, Keyi Luo1, Yuren Gao1, Yongfei Wu1, Yan Wu1, Yanpeng Xiong1, Tao Luo1, Yuxuan Huang1, Rui Xiong1, Jun Ren2, Jianhua Huang4, Xueming Yan5.
Abstract
BACKGROUND: The blue-crowned laughingthrush (Garrulax courtoisi) is a critically endangered songbird endemic to Wuyuan, China, with population of ~323 individuals. It has attracted widespread attention, but the lack of a published genome has limited research and species protection.Entities:
Keywords: Blue-crowned laughingthrush; Conservation strategies; Cooperative breeding; Demographic inference; Genetic diversity
Mesh:
Year: 2022 PMID: 36002819 PMCID: PMC9400264 DOI: 10.1186/s12915-022-01390-4
Source DB: PubMed Journal: BMC Biol ISSN: 1741-7007 Impact factor: 7.364
Fig. 1Comparative genomic analysis of the genome assemblies. a Image of GCO (left) and GSA (right). b Distribution of GCO in China. The magenta point represents current habitat of GCO in the Wuyuan area. The grayish blue indicates extinct GCO in Simiao area. c Genomic landscape of the GCO and GSA. The dark red numbers indicate the following: (1) Synteny blocks between the two species. (2) GC content (%). (3)–(8) Represent density of tandem repeats, TEprotein, Transposon, gene, INDELs, and SNPs within a 10-kb sliding window. (9) Assembled scaffolds, green for GCO and light yellow for GSA. The numbers outside the circle indicate the scaffold number of the assembled genome. The unit of (2) is the percentage, the scale of (3)–(8) is the frequency, and the unit (9) is Mb. The density distribution of Ks (d) and Ka/Ks (e) values. f A phylogentic tree was built on the 2391 shared single-copy gene families. Acanthisitta chloris (ACS) was the outgroup. The abbreviations of species correspond to Additional file 1: Table S13. The numerical value beside each node shows the estimated divergence time. The red and green numbers represent significantly expanded and contracted gene families, respectively (left). The classified homologous genes (right). Cre., Cretaceous; Pal., Paleocene; Eoc., Eocene; Oli., Oligocene; Mio., Miocene; Pli., Pliocene; Ple., Pleistocene
Summary of assembled genomes
| Statistic | Blue-crowned laughingthrush ( | White-browed laughingthrush ( |
|---|---|---|
| Genome (Mb) | 1120.72 | 1185.73 |
| Total depth (×) | 320.11 | 296.52 |
| Total data (Gb) | 359.43 | 351.6 |
| Contig number | 3080 | 1584 |
| Contig N50 (bp) | 5,499,984 | 8,312,805 |
| Scaffold number | 1708 | 786 |
| Scaffold N50 (bp) | 11,258,674 | 22,710,109 |
| GC content (%) | 43.25 | 42.59 |
| Map rate (%) | 98.24 | 97.35 |
| Homology SNP (%) | 0.0008 | 0.0003 |
| CEGMA (%) | 84.68 | 86.29 |
| BUSCO (%) | 96.50 | 97.50 |
| Repeat elements (Mb) | 228.68 | 187.39 |
| Gene number | 20,819 | 18,367 |
Fig. 2Phylogenetic relationship population inference. a Neighbor-joining (NJ) phylogenetic tree of the nine species with 58 individuals. b Principal component analysis of 58 individuals with the first and second principal components being visualized. Color codes for different species are as in a. c Ancestry of Garrulax species. K represents assumptive ancestral lineage. Each vertical line represents one species, and each color represents one ancestral lineage. The length of each colored segment in each vertical indicates proportion of individual genome inferred from ancestral populations. Species are separated by black dotted lines. Abbreviations of species are given in Additional file 1: Table S15. d Estimation of the divergence time among eight Garrulax species. Values in square brackets indicate the divergence time ranges. Dark red nodes represent time calibration points. e Demographic inference based on the Pairwise Sequentially Markovian Coalescent (PSMC) model. The five-colored lines depict the estimated effective population size of the five species dating to roughly 10,000 years ago. The gray background colors indicate glacial periods. f, g showed effective population size in the last 10,000 years for GCO and GSA inferred via PopSizeABC. A 90% confidence interval is indicated by dotted lines. The gray shaded area represents the Song Dynasty period from 960 to 1279 AD
Fig. 3Genetic diversity analysis. Nucleotide diversity estimated based on the respective reference genomes of GCO (a) and GSA (b). c The LD decay of GCO and GSA. The black dotted line indicates the mean length of genome segments when r2=0.3. d The ROH of GCO and GSA. The x-axis denotes four lengths of ROH segments: <100 kb, 100–500 kb, 500–1000 kb, and >1000 kb. The y-axis denotes the number of different ROH segments in their categories. e The proportion of deleterious mutations in GCO and GSA. f Histogram distribution of published genome-wide estimates of 186 species (167 animals, 15 plants, 3 fungi, and 1 protist taxa), with the position of GCO and GSA heterozygosity rates indicated by asterisks
Fig. 4Selective sweeps of GCO. a The Manhattan plots of Z-transformed average pooled heterozygosity in GCO (zHp), and average fixation index (FST) between GCO and seven other Garrulax species. The shadows identified by FST and zHp represent the regions with the extremum of the two statistics. Nucleotide diversity and Tajima’s D values with non-overlapping 10-kb sliding windows on Scaffold 65: 8.7–9.74 Mb (b) and Scaffold 131: 19.58–20.62 Mb (c)