| Literature DB >> 10438601 |
Abstract
Crystallographic studies have shown that the coiled-coil motif occurs in several viral membrane-fusion proteins, including HIV-1 gp41 and influenza virus hemagglutinin. Here, the LearnCoil-VMF program was designed as a specialized program for identifying coiled-coil-like regions in viral membrane-fusion proteins. Based upon the use of LearnCoil-VMF, as well as other computational tools, we report detailed sequence analyses of coiled-coil-like regions in retrovirus, paramyxovirus and filovirus membrane-fusion proteins. Additionally, sequence analyses of these proteins outside their putative coiled-coil domains illustrate some structural differences between them. Complementing previous crystallographic studies, the coiled-coil-like regions detected by LearnCoil-VMF provide further evidence that the three-stranded coiled coil is a common motif found in many diverse viral membrane-fusion proteins. The abundance and structural conservation of this motif, even in the absence of sequence homology, suggests that it is critical for viral-cellular membrane fusion. The LearnCoil-VMF program is available at http://web.wi.mit.edu/kim Copyright 1999 Academic Press.Entities:
Mesh:
Substances:
Year: 1999 PMID: 10438601 PMCID: PMC7125536 DOI: 10.1006/jmbi.1999.2796
Source DB: PubMed Journal: J Mol Biol ISSN: 0022-2836 Impact factor: 5.469
Figure 1Common structural elements between MoMLV TM, HIV gp41 and influenza HA2. The top panel shows schematic maps of the MoMLV, HIV and influenza sequences. For each, the position of the coiled coil is shown in red, and the position of the supporting structures is shown in blue. The bottom panel shows the structures of MoMLV TM residues 45–98 (Fass , HIV gp41 residues 546–581 (Chan , and influenza HA2 residues 40–129 (Bullough . The interior coiled coils are shown in red, and the exterior supporting structures are shown in blue.
Lentivirus N-terminal coiled coils
| Virus | Add dist | Sequence | Likelihoods | |||
|---|---|---|---|---|---|---|
| L | P | C | M | |||
| bcdefgabcdefgabcdefgabcdefgabcdefgab | ||||||
| HIV-1 | 36 | SGIVQQQNNLLRAIEAQQHLLQLTVWGIKQLQARIL | 9 | 0 | 0 | 1 |
| SIV | 34 | AGIVQQQQQLLDVVKRQQELLRLTVWGTKNLQTRVS | 9 | 0 | 0 | 0 |
| Visna | 37 | QSLANATAAQQNVLEATYAMVQHVAKGVRILEARVA | 9 | 0 | 0 | 0 |
| CAEV | 37 | QTLANATAAQQDALEATYAMVQHVAKGVRILEARVA | 9 | 0 | 0 | 0 |
| FIV | 36 | ATHQETIEKVTEALKINNLRLVTLEHQVLVIGLKVE | 9 | 3 | 1 | 3 |
| EIAV | 39 | NHTFEVENSTLNGMDLIERQIKILYAMILQTHADVQ | 9 | 0 | 1 | 2 |
| BIV | 39 | ERVVQNVSYIAQTQDQFTHLFRNINNRLNVLHRRVS | 5 | 0 | 0 | 0 |
For the lentiviruses, two helical regions are found; the lentivirus N-terminal helix is shown here. In all Tables, L, P, C and M indicate the highest likelihood for the shown sequence fragment for (respectively) LearnCoil-VMF, PairCoil, COILS and Multicoil. (0 represents likelihoods less than 0.1; 1 represents likelihoods at least 0.1 and less than 0.2, etc.) For Multicoil, the likelihoods given are the total coiled-coil likelihoods (i.e. the sum of the dimer and trimer likelihoods). In all Tables lower-case letters represent positions in the heptad repeat. Add dist is the number of amino acid residues from the proteolytic maturation cleavage site to the beginning of the shown subsequence. The N36 fragment of HIV-1 gp41 (Chan is shown, and the other lentivirus sequences are aligned to it. Abbreviations and GenBank accession numbers (Benson for both lentivirus Tables are: HIV-1, human immunodeficiency virus 1 (119452); SIV, simian immunodeficiency virus (119496); Visna, visna virus (543528); CAEV, caprine arthritis encephalitis virus (399432); FIV, feline immunodeficiency virus (544245); EIAV, equine infectious anemia virus (119407); BIV, bovine immunodeficiency virus (119399).
The lentivirus C-terminal helices
| Virus | Add dist2 | Sequence | Likelihoods | |||
|---|---|---|---|---|---|---|
| L | P | C | M | |||
| abcdefgabcdefgabcdefgabcdefgabcdef | ||||||
| HIV-1 | 47 | WMEWDREINNYTSLIHSLIEESQNQQEKNEQELL | 9 | 3 | 8 | 1 |
| SIV | 43 | WQEWERKVDFLEANITALLEEAQIQQEKNMYELQ | 9 | 1 | 9 | 5 |
| Visna | 47 | WQQWEEEIEQHEANLSQLLREAALQVHIAQRDAQ | 9 | 0 | 8 | 5 |
| CAEV | 47 | WQQWERELQGYDGNLTMLLRESARQTQLAEEQVR | 9 | 0 | 7 | 0 |
| FIV | 50 | LGEWYNQTKELQQKFYEIIMNIEQNNVQVKKGLQ | 9 | 0 | 4 | 0 |
| EIAV | 44 | WDDWVSKMEDLNQEILTTLHGARNNLAQSMITFN | 0 | 0 | 0 | 0 |
| BIV | 44 | WSDLQDEYDKIEEKILKIRVDWLNSSLSDTQDTF | 0 | 0 | 0 | 0 |
Add dist2 column is the distance from the end of the region in Table 1 to the start of this region. The C34 fragment of HIV-1 gp41 (Chan is shown, and the other lentivirus sequences are aligned to it.
Representative envelope proteins for retrovirus genera mammalian C, avian C, D and BLV-HTLV
| Virus | Add dist | Sequence | Likelihoods | |||
|---|---|---|---|---|---|---|
| L | P | C | M | |||
| gabcdefgabcdefgabcdefgabcdefgabcd | ||||||
| MoMLV | 46 | DLREVEKSISNLEKSLTSLSEVVLQNRRGLDLL | 9 | 7 | 9 | 9 |
| FeLV | 46 | DIQALEESISALEKSLTSLSEVVLQNRRGLDIL | 9 | 6 | 5 | 9 |
| GALV | 47 | DLRALQDSVSKLEDSLTSLSEVVLQNRRGLDLL | 9 | 6 | 3 | 9 |
| ARV | 41 | DVQALSGTINDLQDQIDSLAEVVLQNRRGLDLL | 9 | 6 | 9 | 8 |
| RSV | 50 | QANLTTSLLGDLLDDVTSIRHAVLQNRAAIDFL | 9 | 0 | 0 | 0 |
| MPMV | 41 | DVQAISSTIQDLQDQVDSLAEVVLQNRRGLDLL | 9 | 6 | 6 | 9 |
| BLV | 41 | DQQRLITAINQTHYNLLNVASVVAQNRRGLDWL | 9 | 0 | 0 | 0 |
| HTLV-1 | 41 | DISQLTQAIVKNHKNLLKIAQYAAQNRRGLDLL | 9 | 0 | 0 | 0 |
| HTLV-2 | 41 | DISHLTQAIVKNHQNILRVAQYAAQNRRGLDLL | 9 | 0 | 0 | 0 |
| PTLV | 41 | DIDHLTRAIVKNHDNILRVAQYAAQNRRGLDLL | 9 | 0 | 0 | 0 |
For these sequences, there is only one predicted coiled coil. Add dist is the distance from the cleavage site to start of the subsequence shown. The subsequences shown correspond to the coiled coil in the Moloney murine leukemia virus crystal structure (Fass and are aligned using subsequence QNRRGLDLL (Delwart . Abbreviations and GenBank accession numbers (Benson are: MoMLV, Moloney murine leukemia virus (119478); FeLV, feline leukemia virus (119418); GALV, gibbon ape leukemia virus (119426); ARV, avian reticuloendotheliosis virus (119396); RSV, Rous sarcoma virus (119487); MPMV, Mason-Pfizer monkey virus (119482); BLV, bovine leukemia virus (119401); HTLV, human T-lymphotropic virus (119464 and 119467); PTLV, primate T-lymphtropic virus (632274).
Paramyxovirus N-terminal helix
| Virus | Add dist | Sequence | Likelihoods | |||
|---|---|---|---|---|---|---|
| L | P | C | M | |||
| gabcdefgabcdefgabcdefgabcdefgabcdefgabcdefgabc | ||||||
| BPIV-3 | 27 | EAKQAKSDIEKLKEAIRDTNKAVQSIQSSVGNLIVAVKSVQDYVNN | 9 | 8 | 9 | 8 |
| HPIV-3 | 27 | EAKQARSDIEKLKEAIRDTNKAVQSVQSSIGNLIVAIKSVQDYVNK | 9 | 8 | 9 | 8 |
| HPIV-1 | 30 | EAREARKDIALIKDSIIKTHNSVELIQRGIGEQIIALKTLQDFVNN | 7 | 0 | 1 | 0 |
| Sendai | 30 | EAREAKRDIALIKESMTKTHKSIELLQNAVGEQILALKTLQDFVND | 9 | 4 | 9 | 6 |
| CDV | 27 | QSNLNAQAIQSLRTSLEQSNKAIEEIREATQETVIAVQGVQDYVNN | 9 | 6 | 7 | 7 |
| Measles | 27 | QSMLNSQAIDNLRASLETTNQAIEAIRQAGQEMILAVQGVQDYINN | 9 | 0 | 4 | 3 |
| PPRV | 27 | QSLMNSQAIESLKTSLEKSNQAIEEIRLANKETILAVQGVQDYINN | 9 | 6 | 8 | 5 |
| PDV | 27 | QSNLNAQAIQSLRASLEQSNKAIDEVRQASQNIIIAVQGVQDYVNN | 9 | 5 | 7 | 8 |
| RPV | 27 | QSMMNSQAIESLKASLETTNQAIEEIRQAGQEMVLAVQGVQDYINN | 9 | 0 | 5 | 7 |
| HPIV-2 | 27 | KANANAAAINNLASSIQSTNKAVSDVIDASRTIATAVQAIQDHING | 9 | 0 | 1 | 0 |
| HPIV-4a | 27 | KAQENAKLILTLKKAATETNEAVRDLANSNKIVVKMISAIQNQINT | 9 | 2 | 6 | 2 |
| HPIV-4b | 27 | KAQENAQLILTLKKAAKETNDAVRDLTKSNKIVARMISAIQNQINT | 9 | 5 | 3 | 2 |
| SV5 | 27 | KANENAAAILNLKNAIQKTNAAVADVVQATQSLGTAVQAVQDHINS | 9 | 4 | 2 | 1 |
| PRV | 27 | RANKNAEKVEQLSQALGETNAAISDLIDATKNLGFAVQAIQNQINT | 9 | 5 | 4 | 5 |
| Mumps | 27 | QAQTNARAIAAMKNSIQATNRAVFEVKEGTQQLAIAVQAIQDHINT | 9 | 0 | 0 | 0 |
| NDV | 27 | QANQNAANILRLKESITATIEAVHEVTDGLSQLAVAVGKMQQFVND | 9 | 2 | 0 | 4 |
| BRSV | 20 | KVLHLEGEVNKIKNALLSTNKAVVSLSNGVSVLTSKVLDLKNYIDK | 9 | 2 | 1 | 1 |
| HRSV | 20 | KVLHLEGEVNKIKNALLSTNKAVVSLSNGVSVLTSKVLDLKNYINN | 9 | 2 | 1 | 1 |
| PVM | 20 | KTVQLESEIALIRDAVRNTNEAVVSLTNGMSVLAKVVDDLKNFISK | 9 | 0 | 4 | 0 |
| TRTV | 24 | KTIRLEGEVKAIKNALRNTNEAVSTLGNGVRVLATAVNDLKEFISK | 9 | 4 | 5 | 2 |
Add dist is the distance between the cleavage site and beginning of the region shown. Abbreviations and GenBank accession numbers (Benson for both Table 4, Table 5: paramyx, genus paramyxovirus; morbilli, genus morbillivirus; rubula, genus rubulavirus; pneumo, genus pneumovirus; BPIV, bovine parainfluenza (1353202); HPIV, human parainfluenza (138273, 138268, 138269, 1255649, and 1255651); Sendai (138276); CDV, canine distemper (138249); measles (138254); PPRV, peste-des-petits-ruminants (1085797); PDV, phocine distemper (138267); RPV, rinderpest (138275); SV5, simian parainfluenza 5 (335117); PRV, porcine rubulavirus (1808667); mumps (138259); NDV, Newcastle disease (465403); BRSV, bovine respiratory syncytial (138248); HRSV, human respiratory syncytial (138250); PVM, pneumonia virus of mice (549309); TRTV, turkey rhinotracheitis (138283).
Paramyxovirus C-terminal helix
| Virus | Add dist2 | Sequence | Likelihoods | |||
|---|---|---|---|---|---|---|
| L | P | C | M | |||
| defgabcdefgabcdefgabcdefgabcd | ||||||
| BPIV-3 | 275 | ISMELNKAKLELEESKEWIKKSNQKLDSV | 9 | 3 | 0 | 1 |
| HPIV-3 | 275 | ISIELNKAKSDLEESKEWIRRSNQKLDSI | 9 | 3 | 0 | 2 |
| HPIV-1 | 275 | ISLNLASATNFLEESKIELMKAKAIISAV | 0 | 0 | 0 | 0 |
| Sendai | 275 | ISLNLADATNFLQDSKAELEKARKILSEV | 3 | 5 | 7 | 4 |
| CDV | 268 | ISLDRLDVGTNLGNALKKLDDAKVLIDSS | 9 | 2 | 7 | 0 |
| Measles | 268 | ISLERLDVGTNLGNAIAKLEDAKELLESS | 9 | 3 | 6 | 3 |
| PPRV | 268 | ISLEKLDVGTNLGNAVTRLENAKELLDAS | 9 | 0 | 2 | 0 |
| PDV | 268 | ISLERLDVGTNLGSALKKLDDAKVLIESS | 9 | 1 | 6 | 1 |
| RPV | 268 | ISLEKLDVGTNLWNAVTKLEKAKDLLDSS | 5 | 0 | 0 | 0 |
| HPIV-2 | 275 | LSNQINSINKSLKSAEDWIADSNFFANQA | 9 | 0 | 0 | 0 |
| HPIV-4a | 275 | LSTDLNQYNQLLKSAEDHIQRSTDYLNSI | 9 | 0 | 6 | 5 |
| HPIV-4b | 275 | LSTDLNQYNQLLKSAENHIQRSNDYLNSI | 9 | 0 | 8 | 3 |
| SV5 | 275 | ISQNLAAVNKSLSDALQHLAQSDTYLSAI | 9 | 0 | 0 | 0 |
| PRV | 275 | ISGNLIAVNNSLSSALNHLATSEILRNEQ | 2 | 0 | 0 | 0 |
| Mumps | 275 | ISTELSKVNASLQNAVKYIKESNHQLQSV | 9 | 6 | 3 | 0 |
| NDV | 275 | ISTELGNVNNSISNALDKLEESNSKLDKV | 9 | 6 | 9 | 4 |
| BRSV | 287 | FDASIAQVNAKINQSLAFIRRSDELLHSV | 9 | 0 | 0 | 0 |
| HRSV | 287 | FDASISQVNEKINQSLAFIRRSDELLHNV | 9 | 0 | 0 | 0 |
| PVM | 285 | FDVAIRDVEHSINQTRTFFKASDQLLDLS | 4 | 0 | 0 | 0 |
| TRTV | 285 | FNVALDQVFESIDRSQDLIDKSNDLLGAD | 3 | 0 | 0 | 0 |
Add dist2 is the distance from the end of the region shown in Table 4 to the beginning of this region.
The filoviruses and Rous sarcoma virus
| Virus | Sequence | Likelihoods | |||
|---|---|---|---|---|---|
| L | P | C | M | ||
| gabcdefgabcdefgabcdefgabcdefgabcd | |||||
| Ebola | LANETTQALQLFLRATTELRTFSILNRKAIDFL | 8 | 0 | 0 | 0 |
| Marburg | LANQTAKSLELLLRVTTEERTFSLINRHAIDFL | 0 | 0 | 0 | 0 |
| Rous | QANLTTSLLGDLLDDVTSIRHAVLQNRAAIDFL | 9 | 0 | 0 | 0 |
| ∗∗ ∗ ∗ ∗ ∗ ∗ ∗∗ ∗∗∗∗∗ | |||||
Shown is an alignment between the putative coiled-coil region of the viral membrane-fusion proteins of Ebola, Marburg and the non-filovirus Rous sarcoma virus. Residues that are identical in all three viruses are indicated with asterisks. The regions shown correspond to the coiled-coil in MoMLV TM (Table 3). GenBank accession numbers (Benson are: Ebola virus, 290392; Marburg virus, 2459880; and Rous sarcoma virus, 119487.