| Literature DB >> 35583674 |
Ping Liu1, Hai-Ying Jiang1, Lin-Miao Li1, Jia-Bin Zhou1, Wen-Zhong Huang1, Jin-Ping Chen1.
Abstract
BACKGROUND: The masked palm civet (Paguma larvata) acts as an intermediate host of severe acute respiratory syndrome coronavirus (SARS-CoV), which caused SARS, and transfered this virus from bats to humans. Additionally, P. larvata has the potential to carry a variety of zoonotic viruses that may threaten human health. However, genome resources for P. larvata have not been reported to date.Entities:
Keywords: Hi-C proximity mapping; gene family evolution; genome assembly; masked palm civet; phylogeny; positive selection
Mesh:
Year: 2022 PMID: 35583674 PMCID: PMC9116208 DOI: 10.1093/gigascience/giac041
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 7.658
Figure 1:(a) Synteny of corresponding chromosomes and gene density across the Paguma larvata genome. (b) Comparison of copy numbers in gene clusters that reside in the genomes of P. larvata and 18 other animals. Single-copy orthologs denote that the family can have only one gene for each species, and multicopy orthologs denote that the family clustered more than one gene for each species. Other orthologs denote the family can have any number of genes for each species except the single-copy and multicopy orthologs. Unique paralogs denote species-specific gene families, and unclustered genes denote species-specific genes that cannot cluster with any other genes. (c) Phylogenetic relationship of 19 animal genomes based on the maximum likelihood method. The black numbers indicate estimated divergent times as the number of years (millions) ago, while green and red numbers denote gene families subject to expansion and contraction for each species, respectively.
Paguma larvata genome assembly and annotation summary. CDS indicates coding region sequences; LINE indicates long interspersed nuclear elements; SINE indicates short interspersed nuclear elements; while LTR indicates long terminal repeat.
| Genome assembly statistics | |
| Total length (Gb) | 2.44 |
| Scaffold N50 length (Mb) | 111.81 |
| Scaffold N90 length (Mb) | 58.03 |
| Longest scaffold length (Mb) | 195.95 |
| Number of scaffold | 1,500 |
| Number of scaffold larger than N50 | 9 |
| Number of scaffold larger than N90 | 20 |
| Genome characteristics | |
| GC content | 42.41% |
| Number of protein-coding genes | 21,582 |
| Average transcript length (Kb) | 34.57 |
| Average CDS length (Kb) | 1.48 |
| Average exon length (bp) | 170.53 |
| Average number of exons per gene | 8.68 |
| Repetitive sequences (% of genome) | |
| DNA (Mb) | 8.23 (0.34%) |
| LINE (Mb) | 299.13 (12.27%) |
| SINE (Kb) | 613.25 (0.03%) |
| LTR (Mb) | 542.79 (22.27%) |
| Unknown (Mb) | 12.28 (0.5%) |
| Total (Mb) | 775.00 (31.8%) |
| Gene annotations (% of all genes) | |
| GO | 14,148 (65.6%) |
| InterProScan | 19,770 (91.6%) |
| KEGG | 18,200 (84.3%) |
| Nr | 20,480 (94.9%) |
| PFAM | 17,505 (81.1%) |
| SwissProt | 20,033 (92.8%) |
| Total | 20,546 |
CDS: coding region sequences; LINE: long interspersed nuclear elements; SINE: short interspersed nuclear elements; LTR: long terminal repeat.
Figure 2:Phylogenetic relationships of known human virus receptor of PDGFRA under positive selection and ACE2 receptor in P. larvata and other species.
Sequence identity of receptor under positive selection and ACE2 between masked palm civet and other animal species. The bolded values indicate the highest sequence identity.
| Species | Sequence identity (%) | |
|---|---|---|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 96.55 | 88.93 |
|
| 96.55 | 88.81 |
|
| 95.99 | 89.94 |
|
| 95.24 | 89.79 |
|
| 95.05 | 83.35 |
|
| 94.86 | 82.48 |
|
| 93.54 | 85.47 |
|
| 93.18 | 81.84 |
|
| 93.09 | 81.84 |
|
| 92.16 | 79.35 |
|
| 90.01 | 81.74 |
|
| 88.14 | 83.11 |
|
| 83.49 | 71.14 |
The bolded values indicate highest sequence identity.