| Literature DB >> 36132175 |
Jianxiang Qiu1, Zumei Liu1, Zhixin Fang2, Wenjiao Wu1.
Abstract
Coiled coils (CCs) are protein structural motifs universally found in proteins and mediate a plethora of biological interactions, and thus their reliable annotation is crucial for studies of protein structure and function. Autographa californica multiple nucleopolyhedrovirus (AcMNPV) is a large double-stranded DNA (dsDNA) virus and encodes 154 proteins. In this study, genome-wide scans of previously uncharacterized CC motifs throughout AcMNPV was conducted using CC prediction software. In total, 24 CC motifs in 19 CC proteins with high confidence were identified. The characteristic of viral CC motifs were analyzed. The CC proteins could be divided into 12 viral structural proteins and 7 non-structural proteins, including viral membrane fusion proteins, enzymes, and transcription factors. Moreover, CC motifs are conserved in the baculoviral orthologs of 14 of the 19 proteins. It is noted that five CC proteins, including Ac51, Ac66, Exon0, Ac13, and GP16, were previously identified to function in the nuclear egress of nucleocapsids, and Ac66 contains multiple CC motifs, the longest of which comprises 252 amino acids, suggesting a role of CC motifs in this process. Taken together, the CC motifs identified in this study are valuable resource for studying protein function and protein interaction networks during virus replication.Entities:
Keywords: AcMNPV; Baculovirus; Coiled-coil motif
Year: 2022 PMID: 36132175 PMCID: PMC9483598 DOI: 10.1016/j.heliyon.2022.e10588
Source DB: PubMed Journal: Heliyon ISSN: 2405-8440
Figure 1Characterization of predicted CC motifs in AcMNPV. (A) Percentage of CC proteins in AcMNPV predicted by Paircoil2 and Deepcoil. 12 (7.8%) and 21 (13.6%) proteins were predicted containing at least one CC motif by Paircoil2 (with a p-score lower than 0.03) and by Deepcoil (with a probability score higher than 0.7), respectively. After adjusting the cutoff score of Paircoil2 to 0.06, 19 proteins (12.3%) were predicted, which were all included in the 21 proteins predicted by Deepcoil. (B) Number of CC motifs in each of predicted CC proteins. The number of proteins containing 1, 2, 3, or 4 CC motifs are indicated. (C) Relative length (number of amino acids) of a CC motif within its parental proteins (where a CC motif is derived from). The number of CC motifs with different relative length to its parental proteins are indicated. (D) Distribution of the length of predicted CC motifs. The number of CC motifs with different length are indicated.
Summary of predicted coiled-coil motifs in AcMNPV
| No. | Protein ID | Gene name | ORF No. | CC region (p-score by Paircoil2) | CC prob. by DeepCoil | Oligomerization |
|---|---|---|---|---|---|---|
| 1 | NP_054031.1 | BRO | Orf2 | 133–173 (0.0468); | 0.8 | – |
| 2 | NP_054042.1 | Orf13ac | 142–178 (0.0184); | 0.8 | – | |
| 3 | NP_054045.1 | BV/ODV-E26 | Orf16 | 70–114 (0.0083) | 0.9 | – |
| 4 | NP_054052.1 | F protein | Orf23 | 75–110 (0.0459); | 0.8 | – |
| 5 | NP_054058.1 | Orf29 | 1–47 (0.0127) | 0.8 | Dimer | |
| 6 | NP_054076.1 | Orf47 | 13–50 (0.0289) | 0.8 | – | |
| 7 | NP_054080.1 | Orf51 | 289–318 (0.0505) | 0.8 | – | |
| 8 | NP_054096.1 | Orf66 | 14–46 (0.0546); | 0.9 | Dimer | |
| 9 | NP_054118.1 | CG30 | Orf88 | 116–248 (0.0043) | 0.9 | Dimer |
| 10 | NP_054134.1 | HE65 | Orf105 | 167–199 (0.0494) | 0.8 | – |
| 11 | NP_054156.1 | Chitinase | Orf126 | 315–359 (0.0320) | 0.9 | – |
| 12 | NP_054158.1 | GP64 | Orf128 | 307–348 (0.0320) | 0.9 | – |
| 13 | NP_054159.1 | P24 | Orf129 | 157–186 (0.0520) | 0.7 | – |
| 14 | NP_054160.1 | GP16 | Orf130 | 24–53 (0.0520) | 0.7 | – |
| 15 | NP_054161.1 | Pp34 | Orf131 | 176–209 (0.0298) | 0.9 | – |
| 16 | NP_054167.1 | P10 | Orf137 | 5–41 (0.0167) | 0.8 | – |
| 17 | NP_054172.1 | Exon0 | Orf141 | 169–198 (0.0589) | 0.7 | – |
| 18 | NP_054182.1 | IE2 | Orf151ac | 295–377 (0.0037); | 0.9 | Dimer |
| 19 | NP_054184.1 | PE38 | Orf153 | 18–48 (0.016); | 0.9 | Dimer |
CC, coiled-coil motifs. prob., probability.
two coiled-coil motifs with small gaps (1 aa) are considered as one coiled-coil motif by ignoring the gap in later analysis.
the regions of 301–552 and 638–785 in Ac66 consist of several short coiled-coil motifs with gaps of 1 aa and the range of p-score is indicated.
these proteins are reported to contain coiled-coil motifs previously.
means that no dimer or trimer is predicted by Multicoil.
Figure 2Clustering of viral CC proteins. (A) Viral CC proteins were divided into structural and non-structural proteins. These proteins were further classifies as crucial and non-crucial proteins. Crucial proteins (indicated in orange) are essential for efficient virus replication and high virus titer in host cells whereas non-crucial proteins are not. (B) Proteins indicated in (A) are displayed. Structural proteins are in dark and light blue and non-structural proteins are in dark and light orange. (C) CC proteins were classified into five clusters according to their function during virus replication.
Figure 3Distribution of long CC motifs in six viral proteins. Pictures of BRO, Ac13, CG30, PE38, and IE2 were the prediction results from Deepcoil. As the maximum allowed sequence length of Deepcoil is 500 aa, picture of Ac66 was the prediction result from Deepcoil2. The location of long CCs and other conserved domains in each protein was indicated.
Figure 4Conservation of CC motifs in the orthologs of each protein. The existence of CC motifs in the orthologs of the 19 proteins were predicted by ProBLAST/PSI-BLAST tool in MPI Bioinformatics Toolkit. CC motifs are conserved in the orthologs of 74% of the predicted proteins, but not in those of 26% of the proteins.