Literature DB >> 23675493

The fusion protein signal-peptide-coding region of canine distemper virus: a useful tool for phylogenetic reconstruction and lineage identification.

Nicolás Sarute1, Marina Gallo Calderón, Ruben Pérez, José La Torre, Martín Hernández, Lourdes Francia, Yanina Panzera.   

Abstract

Canine distemper virus (CDV; Paramyxoviridae, Morbillivirus) is the etiologic agent of a multisystemic infectious disease affecting all terrestrial carnivore families with high incidence and mortality in domestic dogs. Sequence analysis of the hemagglutinin (H) gene has been widely employed to characterize field strains, permitting the identification of nine CDV lineages worldwide. Recently, it has been established that the sequences of the fusion protein signal-peptide (Fsp) coding region are extremely variable, suggesting that analysis of its sequence might be useful for strain characterization studies. However, the divergence of Fsp sequences among worldwide strains and its phylogenetic resolution has not yet been evaluated. We constructed datasets containing the Fsp-coding region and H gene sequences of the same strains belonging to eight CDV lineages. Both datasets were used to evaluate their phylogenetic resolution. The phylogenetic analysis revealed that both datasets clustered the same strains into eight different branches, corresponding to CDV lineages. The inter-lineage amino acid divergence was fourfold greater for the Fsp peptide than for the H protein. The likelihood mapping revealed that both datasets display strong phylogenetic signals in the region of well-resolved topologies. These features indicate that Fsp-coding region sequence analysis is suitable for evolutionary studies as it allows for straightforward identification of CDV lineages.

Entities:  

Mesh:

Substances:

Year:  2013        PMID: 23675493      PMCID: PMC3652846          DOI: 10.1371/journal.pone.0063595

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Canine distemper virus (CDV) is a member of the Morbillivirus genus within the Paramyxoviridae family. The viral genome consists of a nonsegmented single-stranded negative RNA of 15.7 kb and encodes six structural proteins. The nucleocapsid protein, viral polymerase, and phosphoprotein are associated with the genomic RNA forming the ribonucleoprotein complex, which performs viral transcription and replication. The hemagglutinin (H) and fusion (F) proteins are the antigenic determinants of the virus and are located at the viral envelope, whereas the matrix protein is membrane associated [1]. CDV is the etiologic agent of canine distemper (CD), a severe multisystemic infectious disease, which is distributed worldwide and affects all terrestrial carnivores [2]–[4]. CD has been controlled by attenuated vaccines; however, in recent decades, several outbreaks in properly vaccinated dogs and an expansion in the host range have been reported [5]–[11]. These outbreaks might be a consequence of the emergence of new field strains able to avoid the immune response generated by the “old strains” currently used in the vaccines and/or because of the capacity of new field strains to infect other carnivore hosts [12]–[20]. The H gene shows the greatest genetic variation within the Morbillivirus genus, and it has been widely employed to characterize field strains [21]. In CDV, the amino acid divergence of the H protein reaches 10% among field strains [12], [22]. This variability in conjunction with a phylogenetic analysis provide the necessary information for the classification of CDV strains into genetic lineages: two strains belong to the same lineage if they cluster together and share an amino acid divergence of <3.5%; the strains belong to different lineages if they appear in separate clades and show values of divergence >4% [12], [22]. Currently, nine lineages have been characterized worldwide according mainly to their geographic origin [9], [22]–[25]. In South America there are two co-circulating lineages, the South America 2 (SA2) lineage exclusive to South America and the Europe 1/South America 1 (EU1/SA1) lineage, which is spread in Europe and South America [25]. The H gene can be more difficult to amplify directly from field samples because of its larger size (1824 bp) [17], [23]–[25], and its transcription level is proportionately lower in comparison to the genes located in the 3′ terminal region of the genome [1]. It has been recently established that a short region of the F gene that encodes the signal peptide of the F protein (Fsp; residues 1–135) is extremely variable. Comparative analysis of the Fsp peptide from Asian strains shows that it has substantial genetic variability, suggesting that this region could be a marker for classifying CDV strains [26], [27]. The aim of this work was to evaluate the phylogenetic resolution of the Fsp-coding region in comparison with the H gene. The analyses revealed that the Fsp-coding region is phylogenetically informative and quite useful for the identification of genetic CDV lineages, which will allow for rapid characterization of circulating strains.

Materials and Methods

Datasets

To analyze the phylogenetic resolution of the Fsp-coding region and compare it with that of the H gene, we constructed datasets for both genomic regions from the same 37 strains (Table 1). The H dataset included strains available at GenBank for eight of the nine CDV lineages; sequences from the African lineage strains were not included because there are no records of the Fsp-coding region sequences in these strains. The Fsp dataset included 29 sequences available at GenBank and eight sequences obtained here from the EU1/SA1 and SA2 lineages (Table 1).
Table 1

Strains employed for the construction of the Fsp and H datasets.

StrainFsp regionH geneLineage/Clade
UY102KC331150JN215473EU1/SA1
UY111KC331151JN215475EU1/SA1
UY128KC331152JN215476EU1/SA1
UY141KC331153JN215477EU1/SA1
Arg23KC257465FJ392652EU1/SA1
5804AY386315AY386315EU1/SA1
5804PAY386316AY386316EU1/SA1
Arg24KC257466FJ392651SA2
Arg25KC257467KC257463SA2
Arg26KC257468KC257464SA2
MS01EF445055DQ922630AS1
JL(07)1EU327875EU325728AS1
HeB(07)1EU327874EU325720AS1
BS0610EU934234EU934233AS1
GS0812-4HQ850148HQ850147AS1
GNEF596900EF445054AS1
SC01EF596902EF042818AS1
ZD01EF596904EF445051AS1
CDV GZ2JN381189JN381191AS1
NMEF596903EF445053AS1
W812BAB607905AB605890AS1
W729BAB607904AB605891AS1
007LmAB474397AB474397AS2
007Lm-1vpAB462810AB462810AS2
19876AY964110AY964110EU2
18133AY964108AY964108EU3
HLEF596901EF445052EU3
21261AY964112AY964112EU3
25259AY964114AY964114EU3
A75/17AF164967AF164967NA1
164071EU716337EU716337NA1
98-2654AY466011AY466011NA2
98-2645AY445077AY445077NA2
98-2646AY542312AY542312NA2
Onderstepoort (OS)AF378705AF378705NA2
Snyder HillGU138403GU138403NA2
Onderstepoort (97)AF014953AF014953NA2

Denomination, GenBank accession number for the Fsp-coding region and H gene, and lineage for each strain are detailed. AS1, Asia 1; AS2, Asia 2; EU1/SA1, Europe 1/South America 1; EU2, Europe 2; EU3, Europe 3; NA1, North America 1; NA2, North America 2; SA2, South America 2.

Denomination, GenBank accession number for the Fsp-coding region and H gene, and lineage for each strain are detailed. AS1, Asia 1; AS2, Asia 2; EU1/SA1, Europe 1/South America 1; EU2, Europe 2; EU3, Europe 3; NA1, North America 1; NA2, North America 2; SA2, South America 2.

Amplification of the Fsp-coding Region

Urine, ocular discharge, and clotted blood samples from dogs were subjected to Trizol RNA isolation (Invitrogen) to isolate CDV RNA (Table 1). The RT-PCR products of 681 bp encompassing the Fsp-coding region (405 bp) were amplified using the SuperScript One-Step RT-PCR kit reagents (Invitrogen) (the cycling conditions are available from the authors on request) through a set of newly designed primers, F4854∶5′-TCCAGGACATAGCAAGCCAACA-3′and R5535∶5′-GGTTGATTGGTTCGAGGACTGAA-3′ (strain N° AF378705 position numbering). The PCR products were sequenced bidirectionally on an ABI3130 automated sequencer (Applied Biosystems), and nucleotide sequences were submitted to the GenBank database (http://www.ncbi.nlm.nih.gov).

Comparative Analyses

To evaluate the phylogenetic resolution of the Fsp dataset and compare it with the H dataset, two bioinformatic approaches were employed: phylogenetic and likelihood mapping (LM) analyses.

Phylogenetic Analysis

The phylogenetic relationships among the strains were established for both datasets by maximum-likelihood trees using MEGA 5 software [28]. The substitution model used was Hasegawa-Kishino-Yano with gamma distribution (G) for both datasets. Internal-node uncertainties were assessed using 500 bootstrap replications. Amino acid p-distance analysis was implemented for both datasets with MEGA 5 software [28].

LM Analysis

The LM method was implemented for both datasets using TREE-PUZZLE software [29]. LM assesses if a dataset is suitable for phylogenetic reconstruction by the analysis of groups of four randomly chosen sequences (quartets). For each quartet, three unrooted tree topologies are possible. The phylogenetic signals are computed as probabilities that are represented in a triangle surface [30].

Results

The 681-bp amplicon that encompasses the 405-bp Fsp-coding region was obtained directly from samples of different origin. The sequences of the eight strains from South American lineages (EU1/SA1 and SA2) were added to the Fsp dataset. The Fsp and H datasets each comprised 37 sequences of strains from eight of the nine CDV lineages characterized (Table 1) and were used to perform the subsequent analyses.

Comparative Analyses: Phylogenetic and LM Analysis

Phylogenetic analysis of the Fsp and H datasets was performed to compare the relationships among the CDV strains. The Fsp and H trees clustered the same strains corresponding to eight CDV lineages with similar bootstrap values. The branch lengths were measured according to the number of substitutions per site for the substitution model (0.01 for both datasets) (Figure 1). In both trees, similar relationships were observed among most lineages. The positions of the North American lineages (NA1 and NA2) were quite different between the trees: the NA1 and NA2 lineages clustered in the same clade in the Fsp tree, whereas these lineages were not related in the H tree (Figure 1). Inter-lineage amino acid divergence (p-distance) for the Fsp dataset ranged from 19.6% to 36.9%, whereas for the H dataset it ranged from 3.4% to 9.9% (Table 2). The relationships among strains within each CDV lineage were slightly different for the two trees. Intra-lineage amino acid divergence was from 1.8% to 20.7% for the Fsp dataset and from 1.3% to 6.3% for the H dataset (Table 2). The EU2 lineage was not considered because it was represented by a single strain.
Figure 1

Phylogenetic analysis of CDV isolates.

Thirty-seven nucleotide sequences of the Fsp (left) and H (right) datasets were used. Maximum likelihood trees were constructed using the Hasegawa-Kishino-Yano (G) substitution model for both datasets and were inferred through 500 replicates. Branch lengths are measured in the number of substitutions per site, as shown by the scale bars. Unrooted trees were depicted facing each other for comparison. AS1, Asia 1; AS2, Asia 2; EU1/SA1, Europe 1/South America 1; EU2, Europe 2; EU3, Europe 3; NA1, North America 1; NA2, North America 2; SA2, South America 2; Onder, Onderstepoort strain; Snyder, Snyder-Hill strain.

Table 2

Inter- and intra-lineage amino acid divergences (p-distances).

EU1/SA111.9
3.3
SA223.51.8
4.72.2
EU22525.8
4.85.7
EU329.531.130.518.5
5.96.86.24.6
NA119.622.623.227.3
3.44.64.251.3
NA233.633.831.131.923.620.7
8.69.89.29.68.46.3
AS126.627.332.536.925.627.818.5
5.56.95.96.84.78.73.1
AS224.628.43126.523.222.233.6
6.98.26.97.36.19.96.9
EU1/SA1SA2EU2EU3NA1NA2AS1AS2

The upper values represent the divergence for Fsp; the lower values represent the divergence for the H protein. AS1, Asia 1; AS2, Asia 2; EU1/SA1, Europe 1/South America 1; EU2, Europe 2; EU3, Europe 3; NA1, North America 1; NA2, North America 2; SA2, South America 2.

Phylogenetic analysis of CDV isolates.

Thirty-seven nucleotide sequences of the Fsp (left) and H (right) datasets were used. Maximum likelihood trees were constructed using the Hasegawa-Kishino-Yano (G) substitution model for both datasets and were inferred through 500 replicates. Branch lengths are measured in the number of substitutions per site, as shown by the scale bars. Unrooted trees were depicted facing each other for comparison. AS1, Asia 1; AS2, Asia 2; EU1/SA1, Europe 1/South America 1; EU2, Europe 2; EU3, Europe 3; NA1, North America 1; NA2, North America 2; SA2, South America 2; Onder, Onderstepoort strain; Snyder, Snyder-Hill strain. The upper values represent the divergence for Fsp; the lower values represent the divergence for the H protein. AS1, Asia 1; AS2, Asia 2; EU1/SA1, Europe 1/South America 1; EU2, Europe 2; EU3, Europe 3; NA1, North America 1; NA2, North America 2; SA2, South America 2. The LM analysis showed that the Fsp and H datasets presented probabilities of 92.5% and 97.1%, respectively, in the three corners of the triangle, which represented the tree-like topologies (well-resolved phylogenies). The probabilities in the center region, representing the star-like topologies (unresolved phylogenies), were 4.9% for the Fsp dataset and 0.7% for the H dataset. The probabilities at the sides, representing the net-like region (partially unresolved phylogenies), were 1.8% and 1.5% for the Fsp and H datasets, respectively (Figure 2).
Figure 2

Likelihood mapping of the Fsp (A) and H (B) datasets.

The probabilities close to the triangle corners represent tree-like topologies (well-resolved). Those in the center and on the sides represent star-like (unresolved) and network-like signals (partially unresolved), respectively.

Likelihood mapping of the Fsp (A) and H (B) datasets.

The probabilities close to the triangle corners represent tree-like topologies (well-resolved). Those in the center and on the sides represent star-like (unresolved) and network-like signals (partially unresolved), respectively.

Discussion

The worldwide spread of canine distemper in all terrestrial carnivore families and some marine carnivores has lead to several studies of this disease in recent decades [4], [5], [6], [11], [18]. Most of the characterization studies of CDV strains have been based on H gene analysis. The direct amplification of the H gene from field samples is difficult without prior propagation of the virus in cell culture [9], [12], [13], [15], [31], [32]. Moreover, its extension (1824 bp) requires the use of several primers sets for amplification and sequencing of the full-length gene, which limits the rapid characterization of CDV strains. The genetic diversity of the Fsp-coding region in Asian strains [26], [27] encouraged us to evaluate the robustness of this region for phylogenetic analysis of the CDV lineages distributed worldwide. For this work, we constructed Fsp and H datasets and analyzed their phylogenetic resolution. To include the South American strains, we designed a new set of primers, obtaining for the first time, the Fsp-coding region sequences of eight samples belonging to the EU1/SA1 and the SA2 lineages (Table 1). The Fsp-coding region consists of only 405 bp, making it easy to amplify directly from field samples of different origin using conventional methods for RNA isolation. Phylogenetic analysis performed for the Fsp dataset clustered the same strains into eight well-defined clades according to the CDV lineages based on the H gene (Figure 1). The Fsp amino acid inter-lineage divergence was about fourfold greater with respect to the H protein. This result confirms previous reports regarding Asian strains [26], [27] and also allows us to extend the analysis to the rest of the CDV lineages worldwide, except for the African lineage. The relationships among the lineages revealed slight differences in the Fsp and the H trees. The positions of the North American lineages (NA1 and NA2) were notably different between the trees: the Fsp tree showed that the NA1 and NA2 lineages were related, whereas the H tree did not show any relationship between them. The Fsp tree might reflect the actual relationships among the North American strains according to the geographical pattern of CDV lineage distribution [9], [12], [22]–[25] (Figure 1). Moreover, the NA1 lineage includes only field strains, whereas the NA2 lineage clusters field and vaccine strains into a main clade in both trees. The field and vaccine strains of the NA2 lineage presented amino acid divergence of up to 6.3% for the H protein, being this value higher than the 4% currently accepted to define lineages [22]. The NA2 main clade would then correspond to two different lineages: the vaccine cluster formed by “old” CDV strains isolated in 1930–1950 and the current field strain clade. For Fsp, the NA2 intra-lineage divergence is 20.7%, which represents the greatest value among all lineages analyzed (Table 2). This might be attributed to the clustering of the field and vaccine strains into this clade, which includes two different lineages (Table 2). The intra-lineage divergence of Fsp was greater than that of the H protein for most of the lineages, except for the SA2 and NA1 lineages (Table 2). The SA2 lineage is composed of three strains isolated from the same geographical area (Buenos Aires, Argentina), which may have influenced their genomic homogeneity (Table 2). The NA1 lineage includes only two strains A75/17 and 164071, which showed two nucleotide synonymous changes for the Fsp-coding region, and 15 synonymous and two non-synonymous changes for the H gene. Thus, it is necessary to analyze more strains to establish if the variability of the Fsp-coding region is greater than that detected for the H gene as was observed for most of the lineages. Despite the widespread use of the H gene for phylogenetic analysis of CDV and other Paramyxovirus [33]–[35], its phylogenetic signal has not been weighted before. Herein, we applied LM analysis, which revealed a strong phylogenetic signal for the H dataset in the region of tree-like topologies (Figure 2). Thus, this genomic region is phylogenetically informative. The LM analysis for the Fsp dataset showed values >90% in the region of tree-like topologies, indicating that this genomic region is also phylogenetically informative (Figure 2). We propose that the Fsp-coding region could be used as an alternative genetic marker for evolutionary studies and the rapid identification of CDV lineages because of its easy amplification, higher variability, and phylogenetic robustness. The current criteria take into account the H protein amino acid divergence and the clustering of the strains [12], [22]. Based on the Fsp analysis, we established that two strains belong to the same lineage if they cluster in the same clade with values of amino acid divergence <19%, whereas they belong to different lineages if they cluster in different clades with divergence >19%. In conclusion, we have shown the suitability of the Fsp-coding region for phylogenetic analysis and CDV molecular characterization, which leads us to suggest this genomic region as an alternative marker to identify CDV lineages. The wide employment of the Fsp-coding region will improve the rapid and unequivocal classification of the circulating CDV strains and provide complementary information to understand CDV epidemiology.
  32 in total

1.  Phylogenetic analysis of the L and HN gene of ophidian paramyxoviruses. Brief report.

Authors:  J Kindermann; A Kübber-Heiss; P Kerschbaumer; N Nowotny
Journal:  Arch Virol       Date:  2001       Impact factor: 2.574

2.  Epizootiological investigations of canine distemper virus in free-ranging carnivores from Germany.

Authors:  K Frölich; O Czupalla; L Haas; J Hentschke; J Dedek; J Fickel
Journal:  Vet Microbiol       Date:  2000-06-12       Impact factor: 3.293

3.  Antigenic differences in the H proteins of canine distemper viruses.

Authors:  K Iwatsuki; S Tokiyoshi; N Hirayama; K Nakamura; K Ohashi; C Wakasa; T Mikami; C Kai
Journal:  Vet Microbiol       Date:  2000-02       Impact factor: 3.293

4.  Genetic variability of the glycoprotein genes of current wild-type measles isolates.

Authors:  J S Rota; K B Hummel; P A Rota; W J Bellini
Journal:  Virology       Date:  1992-05       Impact factor: 3.616

5.  Likelihood-mapping: a simple method to visualize phylogenetic content of a sequence alignment.

Authors:  K Strimmer; A von Haeseler
Journal:  Proc Natl Acad Sci U S A       Date:  1997-06-24       Impact factor: 11.205

6.  Genetic diversity of the attachment (H) protein gene of current field isolates of canine distemper virus.

Authors:  G Bolt; T D Jensen; E Gottschalck; P Arctander; M J Appel; R Buckland; M Blixenkrone-Møller
Journal:  J Gen Virol       Date:  1997-02       Impact factor: 3.891

7.  Molecular and phylogenetic analyses of the haemagglutinin (H) proteins of field isolates of canine distemper virus from naturally infected dogs.

Authors:  K Iwatsuki; N Miyashita; E Yoshida; T Gemma; Y S Shin; T Mori; N Hirayama; C Kai; T Mikami
Journal:  J Gen Virol       Date:  1997-02       Impact factor: 3.891

8.  Phylogenetic analysis of the hemagglutinin (H) gene of canine distemper viruses isolated from wild masked palm civets (Paguma larvata).

Authors:  Kyoko Hirama; Yoko Goto; Masashi Uema; Yasuyuki Endo; Ryuichi Miura; Chieko Kai
Journal:  J Vet Med Sci       Date:  2004-12       Impact factor: 1.267

9.  Studies on manifestations of canine distemper virus infection in an urban dog population.

Authors:  M Blixenkrone-Møller; V Svansson; P Have; C Orvell; M Appel; I R Pedersen; H H Dietz; P Henriksen
Journal:  Vet Microbiol       Date:  1993-10       Impact factor: 3.293

10.  Epidemiological observations on recent outbreaks of canine distemper in Tokyo area.

Authors:  T Gemma; T Watari; K Akiyama; N Miyashita; Y S Shin; K Iwatsuki; C Kai; T Mikami
Journal:  J Vet Med Sci       Date:  1996-06       Impact factor: 1.267

View more
  7 in total

1.  Outbreak of canine distemper and coinfections in a maned wolf (Chrysocyon brachyurus) and in three giant anteaters (Myrmecophaga tridactyla).

Authors:  Lucas R Souza; Marcelo P N Carvalho; Carlos E B Lopes; Marcelo C Lopes; Bruna H Campos; Érika P T Teixeira; Ellen J Mendes; Leidilene P Santos; Eduardo A Caixeta; Erica A Costa; João L R Cunha; Ana L S Fraiha; Rodrigo O S Silva; Carolina P Ramos; Mary S Varaschin; Roselene Ecco
Journal:  Braz J Microbiol       Date:  2022-07-21       Impact factor: 2.214

2.  Molecular typing of canine distemper virus strains reveals the presence of a new genetic variant in South America.

Authors:  Nicolás Sarute; Ruben Pérez; Jaime Aldaz; Amauri A Alfieri; Alice F Alfieri; Daniela Name; Jessika Llanes; Martín Hernández; Lourdes Francia; Yanina Panzera
Journal:  Virus Genes       Date:  2014-03-20       Impact factor: 2.332

3.  Central nervous system lesions caused by canine distemper virus in 4 vaccinated dogs.

Authors:  Gimena Feijóo; Kanji Yamasaki; Luis Delucchi; José Manuel Verdes
Journal:  J Vet Diagn Invest       Date:  2021-04-17       Impact factor: 1.279

4.  First genome sequence of a canine distemper virus strain from South america.

Authors:  Nicolás Sarute; María V Delgado; Lucía Carrau; Alejandro Benech; Lourdes Francia; Ruben Pérez; Yanina Panzera
Journal:  Genome Announc       Date:  2014-10-23

Review 5.  Evolution and Interspecies Transmission of Canine Distemper Virus-An Outlook of the Diverse Evolutionary Landscapes of a Multi-Host Virus.

Authors:  July Duque-Valencia; Nicolás Sarute; Ximena A Olarte-Castillo; Julián Ruíz-Sáenz
Journal:  Viruses       Date:  2019-06-26       Impact factor: 5.048

6.  The Canine Morbillivirus Strain Associated with An Epizootic in Caspian Seals Provides New Insights into the Evolutionary History of this Virus.

Authors:  Wendy K Jo; Martin Peters; Aidyn Kydyrmanov; Marco W G van de Bildt; Thijs Kuiken; Albert Osterhaus; Martin Ludlow
Journal:  Viruses       Date:  2019-09-25       Impact factor: 5.048

7.  Comparison of Growth Characteristics and Genomics of Two Canine Distemper Virus Strains Isolated From Minks in China.

Authors:  Rongshan Tao; Jie Chen; Tianyu Zhao; Chengyan Gong; Hongjun Pan; Rana Waseem Akhtar; Xue Li; Syed Aftab Hussain Shah; Qingjie Li; Jianjun Zhao
Journal:  Front Vet Sci       Date:  2020-10-29
  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.