| Literature DB >> 35126472 |
Tianming Lan1, Dongming Fang1, Haimeng Li1,2, Sunil Kumar Sahu1, Qing Wang1,2, Hao Yuan3,4, Yixin Zhu1,2, Zipeng Yang3,4, Le Zhang5, Shangchen Yang6, Haorong Lu7,8, Lei Han5, Shaofang Zhang7,8, Jieyao Yu7,8, Yasser S Mahmmod9,10, Yanchun Xu5, Yan Hua11, Fengping He12, Ziguo Yuan3,4, Huan Liu1,8.
Abstract
The masked palm civet (Paguma larvata) is a small carnivore with distinct biological characteristics, that likes an omnivorous diet and also serves as a vector of pathogens. Although this species is not an endangered animal, its population is reportedly declining. Since the severe acute respiratory syndrome (SARS) epidemic in 2003, the public has been particularly concerned about this species. Here, we present the first genome of the P. larvata, comprising 22 chromosomes assembled using single-tube long fragment read (stLFR) and Hi-C technologies. The genome length is 2.41 Gb with a scaffold N50 of 105.6 Mb. We identified the 107.13 Mb X chromosome and one 1.34 Mb Y-linked scaffold and validated them by resequencing 45 P. larvata individuals. We predicted 18,340 protein-coding genes, among which 18,333 genes were functionally annotated. Interestingly, several biological pathways related to immune defenses were found to be significantly expanded. Also, more than 40% of the enriched pathways on the positively selected genes (PSGs) were identified to be closely related to immunity and survival. These enriched gene families were inferred to be essential for the P. larvata for defense against the pathogens. However, we did not find a direct genomic basis for its adaptation to omnivorous diet despite multiple attempts of comparative genomic analysis. In addition, we evaluated the susceptibility of the P. larvata to the SARS-CoV-2 by screening the RNA expression of the ACE2 and TMPRSS2/TMPRSS4 genes in 16 organs. Finally, we explored the genome-wide heterozygosity and compared it with other animals to evaluate the population status of this species. Taken together, this chromosome-scale genome of the P. larvata provides a necessary resource and insights for understanding the genetic basis of its biological characteristics, evolution, and disease transmission control.Entities:
Keywords: SARS-CoV-2; genetic diversity; genome assembly; immune system; masked palm civet; omnivorous diet
Year: 2022 PMID: 35126472 PMCID: PMC8815822 DOI: 10.3389/fgene.2021.819493
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Global statistics of sequencing data, genome assembly, and annotation of P. larvata.
| Item | Category | Number |
|---|---|---|
| Sequencing data | stLFR (Gb) | 483.5 |
| Genome survey (Gb) | 82.08 | |
| Hi-C (Gb) | 105.94 | |
| Re-sequencing (45 individuals) (Gb) | 2,593.62 | |
| RNA-seq (16 organs) (Gb) | 1,022.6 | |
| Assembly (Hi-C) | Estimated genome size (Gb) | 2.46 |
| Assembled genome size (Gb) | 2.42 | |
| Karyotype | 2n = 44 | |
| Assembly chromosome number | 22 | |
| Contig N50 (kb) | 77.18 | |
| Scaffold N50 (Mb) | 105.6 | |
| Longest scaffold (Mb) | 190.64 | |
| Annotation | GC content (%) | 42.1 |
| Repeat sequences (%) | 32.64 | |
| Number of protein-coding genes | 18,340 | |
| Number of functionally annotated genes | 18,333 | |
| Average gene length (kp) | 40.67 | |
| Average exon length (bp) | 182.84 | |
| Average intron length (kb) | 5.10 | |
| Average exon per gene | 8.67 |
FIGURE 1Overview of P. larvata genome assembly and chromosome-level synteny analysis in this study. (A) Genomic features and variation landscape of our assembled genome (500 kb window). a, population-scale π-values across 22 chromosomes; b, SNP counts per 500 kb window; c, repeat density; d, GC content; e, gene density; f, read depth mapped to the genome. (B) Comparisons of mRNA length, coding sequence (CDS) length, exon length, and intron length among the five species. The x-axis represents the length, and the y-axis represents the density. (C) Chromosome-level synteny analysis between P. larvata and F. catus, which was visualized using RectChr v1.27 (https://github.com/BGI-shenzhen/RectChr). (D) Schematic diagram of a comparative chromosome map of P. larvata (P chromosomes) and F. catus (A–F chromosomes) from previously published karyotypic study (Perelman et al., 2005). Lower bands represent the cat's chromosomes, and the upper bands represent the masked palm civet's chromosomes. All fission and fusion events found in (C) can be correspondently found in (D). We assume that the chromosome relationships between the cat and masked palm civet was accurate in the karyotypic analysis, then the highly consistent result between karyotypic and the syntenic analysis indicates the high quality of our assembled chromosome-scale genome.
FIGURE 2The identification of sex-linked regions in this study. (A) Anchoring genes on the X chromosome of the F. catus to the P. larvata and C. lupus familiaris. (B) The read depth mapped to autosomes, X-linked region, and Y-linked region of the 45 re-sequenced P. larvata individuals.
FIGURE 3Comparative genomic analysis and enrichment analysis of expanded gene families. (A) Divergence time estimated among 20 species. This tree topology was generated by MCMCtree with CDS sequences. The red and blue numbers indicate the expanded and contracted gene families for each node. Illustrations were created by adapting SMART (https://smart.servier.com) and Vecteezy (vecteezy.com) templates. (B) The significantly enriched KEGG pathways of 314 expanded gene families in the P. larvata genome compared with 19 other species.
FIGURE 4Enriched KEGG pathways related to the immune system and the RNA expression profile of ACE2 and TMPRSS2/TMPRSS4 genes. (A) Enriched KEGG pathways of the 622 PSGs of the P. larvata compared with 19 other species. Red, pathways related to immunity; blue, pathways related to reproduction; orange, pathways related to disease. (B) The number of PSGs in the eight significantly enriched KEGG pathways related to the immunity. (C) The number of genes in the expanded gene families in the six enriched KEGG pathways related to the immunity. (D) The gene number of PSGs in significantly enriched KEGG pathways related to immunity, reproduction, and disease. Some genes were shared in different pathways. (E) The expression heatmap of ACE2, TMPRSS2, and TMPRSS4 genes in 16 organs of the masked palm civet. Z-scores were calculated from TPM values for each of the three genes. Gene expression is colored from low (blue) to high (red).
FIGURE 5Genomic heterozygosity and population history of the P. larvata. (A) Demographic history of P. larvata estimated by PSMC. (B) Demographic history of P. larvata estimated by SMC++. (C) The genome-wide heterozygosity in P. larvata and 36 other published species. The generation interval and mutation rate we used here were 2 years and 2.4 × 10–9.