Literature DB >> 25479071

Species based synonymous codon usage in fusion protein gene of Newcastle disease virus.

Chandra Shekhar Kumar1, Sachin Kumar1.   

Abstract

Newcastle disease is highly pathogenic to poultry and many other avian species. However, the Newcastle disease virus (NDV) has also been reported from many non-avian species. The NDV fusion protein (F) is a major determinant of its pathogenicity and virulence. The functionalities of F gene have been explored for the development of vaccine and diagnostics against NDV. Although the F protein is well studied but the codon usage and its nucleotide composition from NDV isolated from different species have not yet been explored. In present study, we have analyzed the factors responsible for the determination of codon usage in NDV isolated from four major avian host species. The F gene of NDV is analyzed for its base composition and its correlation with the bias in codon usage. Our result showed that random mutational pressure is responsible for codon usage bias in F protein of NDV isolates. Aromaticity, GC3s, and aliphatic index were not found responsible for species based synonymous codon usage bias in F gene of NDV. Moreover, the low amount of codon usage bias and expression level was further confirmed by a low CAI value. The phylogenetic analysis of isolates was found in corroboration with the relatedness of species based on codon usage bias. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. The present study provides a basic understanding of the mechanism involved in codon usage among species.

Entities:  

Mesh:

Substances:

Year:  2014        PMID: 25479071      PMCID: PMC4257736          DOI: 10.1371/journal.pone.0114754

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Newcastle disease virus (NDV) has been isolated from the various avian species around the world. Newcastle disease can result in severe economic losses to the poultry industry worldwide. NDV belongs to the genus Avulavirus under the family Paramyxoviridae [1]. However, NDV has been isolated from different non avian species [2]. Complete and partial genome sequences of NDV isolated from different species are being regularly reported from different parts of the world. NDV genome encodes six different proteins in order of a nucleoprotein (N), a phosphoprotein (P), a matrix protein (M), a fusion protein (F), an attachment protein called the hemagglutinin–neuraminidase (HN), and a large polymeraseprotein (L) from 3′-N-P-M-F-HN-L-5′ direction. The envelope of NDV contains two surface glycoproteins, the HN and the F protein. Various studies have shown that the amino acid sequence at the F protein cleavage site is a key determinant of NDV virulence [3], [4], [5], [6]. However, the cleavage of F protein in a wide range of host tissues is responsible for the systemic spread of NDV and also for its virulence [5]. The F protein being a surface glycoprotein is present on the NDV envelope and mediates its fusion with the host cell membrane. Furthermore, the F protein is assisted in its function by the HN protein and the productive infection of NDV requires cleavage of F protein precursor F0 (553 amino acid) into two subunits F1 and F2 [7]. The cleavage site amino acid sequence determines cleavage specificity and varies with the type of strain [8], [9]. The F protein cleavage site of less virulent strains of NDV consists of monobasic or dibasic amino acid residues [9]. The F protein cleavability by intracellular proteases do not take place due to the presence of one or two basic amino acids thus, extracellular proteases are required to cleave F protein limiting the tropism of NDV to respiratory and enteric tracts. In most cells, the polybasic amino acids of F protein in velogenic strains act as the cleavage recognition site for furin like proteases [8], [9]. It has been observed that F proteins of virulent NDV strains contain lysine (K) and arginine (R) at their cleavage site (112R-R-Q-R/K-R116), and a phenylalanine at position 117 of F1. This site is recognized by intracellular proteases, furin that cleave the polybasic cleavage site forming F1 subunit which is suggested to be the contributor of neurological effects [9], [10], [11]. It was postulated that the proper binding of furin protease, assisted by the presence of basic amino acids at the F protein cleavage site, leads to cleavage thus altering host-cell enzyme activity [11]. The variation in intracellular cleavage of virulent NDV F protein is observed to be dictated by the presence of arginine at position 113, 115 and 116 [12]. The substitution of the neutral amino acid, glutamine present at 114 position with an acidic or basic amino acid would attenuate NDV [4]. Furthermore, the attenuation of NDV by substituting valine to isoleucine at position 118 around the fusion cleavage site was also reported [4]. In another study, it has been shown that mutation at glycosylation site of F protein may enhance the virulence and pathogenicity of NDV [13]. It is also evident that mutation in the cytoplasmic domain of F protein can lead to the production of a hyperfusogenic virus that could ensure increased viral replication and pathogenesis in chickens [14]. Subtilisin like mammalian proteases, e.g., PC6 and PACE4 are reported as candidates for the cleavage of the F protein [15]. The F protein mediates virus penetration by inducing fusion between the viral envelope and host cell plasma membrane [3], [5], [12], [16], [17]. Various other factors are also accountable for the virulence of NDV [12]. In chicken and infected macrophages, F protein is a determinant of NDV virulence [6], [18]. We have shown that gradients of NDV virulence are multigenic, F protein is a major player of NDV virulence and pathogenicity and, the superiority of F as an antigen over HN for better and sterile immunity against NDV infection [19]. Based on our current understanding F glycoprotein is the most suitable protein for investigating the infectious capability of NDV strains. Based on pathogenic studies NDV is categorized into three major pathotypes: lentogenic (low virulence), mesogenic (moderate virulence) and velogenic (highly virulent) [20]. The synonymous codon usage is the non-random selection of frequently used codons, the selection of which is limited by codon bias for different genes [21], [22], [23], [24]. Synonymous codons are not used randomly as some codons are used more frequently than others [25]. Factors that may dictate synonymous codon usage bias include natural selection, mutational pressure, translational efficiency and compositional constraints of the mammalian genome [26], [27], [28]. Many studies have shown the contribution of codon usage bias patterns in order to understand the virus evolution [27], [29]. Although the factors responsible for the pathogenicity of the NDV due to F glycoprotein have been studied but the non-random synonymous codon usage variation in NDV isolates from different species has not been reported. A comprehensive analysis of the codon usage bias patterns of NDV isolates from different species may be necessary to understand the codon usage patterns in the virus evolution while crossing the species barrier. This analysis may pave way for future understanding of selection pressure due to host metabolome interaction and enable deciphering of the virus evolutionary trend among different species.

Materials and Methods

Gene sequences

Two hundred and one complete F gene sequences for NDV isolates from four major host species were obtained from the GenBank (Table 1). The four major avian species (chicken, duck, pigeon, and goose) were selected based on the availability of more than ten complete open reading frames (ORF) of the NDV F gene sequences from each in GenBank. The strains of NDV collected for the analysis were from all three major pathotypes, namely lentogenic, mesogenic, and velogenic. The codon usage pattern analysis was performed for the coding sequence of the F gene.
Table 1

List of Newcastle disease virus F gene used for analysis of synonymous codon usage in the study.

S.noAccession NumberCleavage Site amino acids 111 to 117PathotypeSpeciesGC3s (%)GC (%)Nc Mononucleotide frequencies (%)
ACGT
1JX524203-G-G-K-Q-G-R*L-LentogenicChicken44.445.357.829.3022.9822.3225.39
2HM117720-G-R-R-Q-K-R*F-VelogenicChicken45.545.859.129.2423.9521.8424.97
3JX390609-G-R-R-R-K-R*F-LentogenicChicken44.445.158.129.8423.4121.6625.09
4FJ939313-G-R-R-Q-K-R*F-VelogenicChicken43.644.559.630.6923.5321.0024.79
5GU187941-G-R-R-Q-R-R*F-MesogenicChicken42.045.255.029.7223.4721.7225.09
6FJ436305-G-R-R-Q-R-R*F-MesogenicChicken42.144.355.229.3622.2022.0826.35
7FJ436304-G-R-R-Q-R-R*F-MesogenicChicken41.944.455.229.2422.2022.2026.35
8FJ436303-G-R-R-Q-R-R*F-MesogenicChicken42.144.455.229.3022.2622.1426.29
9FJ436302-G-R-R-Q-R-R*F-MesogenicChicken41.944.455.129.3022.2022.1426.35
10HM357251-G-G-R-Q-G-R*L-LentogenicChicken43.044.559.430.2623.2321.3025.21
11KJ123642-G-R-R-Q-K-R*F-VelogenicChicken43.444.959.529.8223.1121.7525.32
12FJ754271-G-R-R-Q-K-R*F-VelogenicChicken40.744.054.930.0222.8021.1825.99
13JX110635-G-G-K-Q-R-R*L-LentogenicChicken44.245.157.929.3622.8622.2625.51
14KC461214-G-R-R-Q-K-R*F-VelogenicChicken40.344.455.129.6622.6221.7225.99
15JX974435-G-R-R-Q-K-R*F-VelogenicChicken45.345.758.029.1823.7721.9025.15
16FJ751919-G-R-R-Q-K-R*F-VelogenicChicken44.944.857.029.4823.1621.6625.69
17FJ751918-G-R-R-Q-K-R*F-VelogenicChicken44.944.856.929.3623.1621.6625.81
18JX316216-G-R-R-Q-K-R*F-MesogenicChicken42.945.358.529.4823.1622.0825.27
19JX867334-G-R-R-Q-K-R*F-VelogenicChicken40.043.955.730.3222.6821.1825.81
20JX519467-G-R-R-Q-K-R*F-VelogenicChicken39.944.255.329.9022.6221.5425.93
21JX119193-R-R-R-Q-K-R*F-VelogenicChicken43.145.155.530.1423.5921.4224.85
22JN618349-G-R-R-Q-K-R*F-VelogenicChicken41.544.454.629.9623.1021.2425.69
23JN618348-G-R-R-Q-K-R*F-VelogenicChicken43.045.155.729.4823.2921.7825.45
24HQ697255-G-R-R-R-K-R*F-LentogenicChicken43.345.055.529.3023.1621.7825.75
25HQ697254-G-R-R-Q-K-R*F-VelogenicChicken44.145.157.529.4823.5321.6025.39
26JN682211-G-R-R-Q-K-R*F-VelogenicChicken39.543.953.730.2022.5621.3025.93
27JN688863-G-E-Q-Q-E-R*L-LentogenicChicken47.645.959.730.9922.9822.8623.16
28JN688862-G-E-Q-Q-E-R*L-LentogenicChicken47.345.756.431.0522.7422.8623.35
29JN800306-G-R-R-Q-K-R*F-VelogenicChicken44.545.055.930.0823.8921.0624.97
30GU585905-G-R-R-Q-R-R*F-MesogenicChicken39.243.953.830.2622.8621.0625.81
31GU978777-G-R-R-Q-K-R*F-VelogenicChicken43.845.058.330.3223.5321.4224.73
32AB853928-G-R-R-Q-K-R*F-VelogenicChicken41.444.355.630.2023.0421.1825.57
33AB853927-G-R-R-Q-K-R*F-VelogenicChicken41.944.754.429.5423.1621.5425.75
34AB853926-G-R-R-Q-K-R*F-VelogenicChicken43.044.956.029.9623.4721.4225.15
35JX012096-G-R-R-Q-K-R*F-VelogenicChicken44.345.357.229.9023.7721.5424.79
36JX193076-G-R-R-Q-K-R*F-VelogenicChicken40.544.355.229.8422.8621.4225.87
37JX193075-R-R-R-Q-K-R*F-VelogenicChicken42.044.654.429.5422.8621.7225.87
38JQ015297-G-R-R-Q-K-R*F-VelogenicChicken41.544.755.530.0223.4121.3025.27
39JQ015296-G-R-R-Q-K-R*F-VelogenicChicken42.044.955.030.0223.5921.3025.09
40JQ015295-G-R-R-Q-K-R*F-VelogenicChicken42.045.055.130.0223.6521.3025.03
41HQ839733-G-R-R-Q-K-R*F-VelogenicChicken42.344.757.230.3223.8920.7025.09
42JF950509-G-R-R-Q-R-R*F-MesogenicChicken41.944.755.129.0022.4422.2026.35
43FJ766529-G-R-R-Q-K-R*F-VelogenicChicken42.044.555.930.2023.3521.1225.33
44FJ430159-G-R-R-Q-R-R*F-MesogenicChicken41.944.554.829.0622.3822.1426.41
45DQ485231-R-R-R-Q-K-R*F-VelogenicChicken42.244.455.230.0823.1021.3025.51
46DQ485230-G-R-R-Q-K-R*F-VelogenicChicken42.244.755.729.8423.1021.5425.51
47DQ485229-G-R-R-Q-K-R*F-VelogenicChicken41.744.855.329.4222.8621.9625.75
48AY562991-G-G-K-Q-G-R*L-LentogenicChicken46.846.659.028.4023.8922.6825.03
49AY562988-G-R-R-Q-K-R*F-VelogenicChicken43.845.353.929.4823.4721.7825.27
50AY562987-G-R-R-Q-K-R*F-VelogenicChicken43.945.357.129.4823.4721.7825.27
51EF201805-G-R-R-Q-R-R*F-MesogenicChicken41.944.654.929.0022.3822.2026.41
52KJ019841-G-G-R-Q-G-R*L-LentogenicChicken43.044.458.930.4523.1621.2425.15
53KF935230-G-R-R-Q-K-R*F-VelogenicChicken42.445.155.229.9023.7121.4224.97
54KF727980-G-R-R-Q-K-R*F-VelogenicChicken41.245.056.129.6623.3521.6625.33
55JX443519-G-G-K-Q-G-R*L-LentogenicChicken44.445.357.729.3022.9822.3225.39
56JF966387-G-R-R-Q-K-R*F-VelogenicChicken45.244.954.030.0823.5321.3625.03
57JF966386-G-R-R-R-K-R*F-LentogenicChicken43.145.553.030.1423.9521.4824.43
58JF966385-G-R-R-Q-K-R*F-VelogenicChicken41.744.556.630.0222.8021.7225.45
59KC844235-G-G-R-Q-G-R*L-LentogenicChicken42.944.459.030.3923.1621.2425.21
60KC542914-G-R-R-Q-K-R*F-VelogenicChicken40.944.553.829.9623.1621.3025.57
61KC542913-G-R-R-Q-K-R*F-VelogenicChicken42.044.855.030.0223.5321.3025.15
62KC542912-G-R-R-Q-K-R*F-VelogenicChicken42.044.855.030.0223.5321.3025.15
63KC542911-G-R-R-Q-K-R*F-VelogenicChicken42.444.555.230.0223.2321.3025.45
64KC542910-G-R-R-Q-K-R*F-VelogenicChicken42.844.858.029.9623.4721.3025.27
65KC542909-G-R-R-Q-K-R*F-VelogenicChicken41.044.454.230.0823.1021.2425.57
66KC542908-G-R-R-Q-K-R*F-VelogenicChicken40.944.154.630.0822.9221.1225.87
67KC542907-G-R-R-Q-K-R*F-VelogenicChicken42.444.754.930.0223.4721.2425.27
68KC542906-G-R-R-Q-K-R*F-VelogenicChicken42.144.755.030.0223.4721.2425.27
69KC542905-G-R-R-Q-K-R*F-VelogenicChicken42.244.954.830.0223.5921.3025.09
70KC542904-G-R-R-Q-K-R*F-VelogenicChicken41.744.754.429.9023.2921.3625.45
71KC542903-G-R-R-Q-K-R*F-VelogenicChicken42.044.854.429.9023.4121.3625.33
72KC542902-G-R-R-Q-K-R*F-VelogenicChicken42.544.755.029.7223.0421.6025.63
73KC542901-G-R-R-Q-K-R*F-VelogenicChicken40.043.855.430.3922.6821.1225.81
74KC542900-G-R-R-Q-K-R*F-VelogenicChicken40.043.855.430.3922.6821.1225.81
75KC542899-G-R-R-Q-K-R*F-VelogenicChicken41.944.754.529.9623.4121.2425.39
76KC542898-G-R-R-Q-K-R*F-VelogenicChicken41.344.455.929.9022.9221.4825.69
77KC542897-G-R-R-Q-K-R*F-VelogenicChicken41.544.556.329.6022.8021.7225.87
78KC542896-G-R-R-Q-K-R*F-VelogenicChicken41.544.554.330.0823.2921.1825.45
79KC542895-G-R-R-Q-K-R*F-VelogenicChicken41.744.756.529.7823.1021.6025.51
80KC542894-G-R-R-Q-K-R*F-VelogenicChicken40.944.553.829.9623.1621.3025.57
81KC542893-G-R-R-Q-K-R*F-VelogenicChicken41.144.554.429.9623.1621.3025.57
82KC542892-G-R-R-Q-K-R*F-VelogenicChicken41.544.654.829.9623.2921.3025.45
83JN682210-G-R-R-Q-K-R*F-VelogenicChicken39.543.953.730.1422.6221.3025.93
84 JN986837 -G-R-R-Q-K-R*F- Velogenic Chicken 44.145.2 52.3 29.9023.8321.3624.91
85JN986838-G-R-R-Q-K-R*F-VelogenicChicken42.444.755.830.0223.2321.4825.27
86JQ247691-G-R-R-Q-K-R*F-VelogenicChicken44.845.659.629.4223.7121.8425.04
87JN400897-G-R-R-Q-K-R*F-VelogenicChicken41.944.654.529.9623.2921.3025.45
88JN400896-G-R-R-Q-K-R*F-VelogenicChicken42.044.855.929.6023.1021.7225.57
89JF343539-G-R-R-Q-K-R*F-VelogenicChicken42.244.755.729.8423.1021.5425.51
90JF950510-G-G-R-Q-G-R*L-LentogenicChicken43.044.559.130.3923.2321.2425.15
91GU564399-R-R-R-Q-K-R*F-VelogenicChicken42.244.655.829.8423.1021.4825.57
92HQ266603-G-R-R-R-R-R*F-LentogenicChicken47.246.558.928.4623.5322.9825.03
93HQ266602-G-R-R-R-R-R*F-LentogenicChicken46.446.557.828.4623.3523.1025.09
94GQ994434-G-G-K-Q-G-R*L-LentogenicChicken41.444.656.629.7222.9821.5425.69
95GQ994433-G-G-R-Q-G-R*L-LentogenicChicken41.944.954.829.4222.8021.9625.75
96FJ386396-G-R-R-Q-K-R*F-VelogenicChicken44.144.958.629.9623.5921.3025.15
97FJ386395-G-R-R-Q-K-R*F-VelogenicChicken44.344.959.630.3923.4721.4224.73
98FJ386394-G-R-R-Q-K-R*F-VelogenicChicken44.344.958.930.3923.4121.4824.73
99FJ386393-G-R-R-Q-K-R*F-VelogenicChicken44.344.958.630.3923.7721.1224.73
100FJ386392-G-R-R-Q-K-R*F-VelogenicChicken44.344.959.630.4523.2921.6024.67
101AB605247-G-R-R-Q-K-R*F-VelogenicChicken44.545.355.028.7623.3521.9625.93
102FJ217666-G-R-R-Q-K-R*F-VelogenicChicken43.344.956.329.3622.9821.9625.69
103FJ217665-G-R-R-Q-K-R*F-VelogenicChicken43.344.958.729.4223.0421.8425.69
104JF966389-G-R-R-Q-K-R*F-VelogenicChicken46.245.056.830.2023.5921.4224.79
105JF966388-G-R-R-Q-K-R*F-VelogenicChicken45.844.754.030.3223.3521.3624.97
106KC205479-G-R-R-H-K-R*F-NAChicken44.445.258.530.4524.0721.1224.37
107KC205478-G-R-R-Q-K-R*F-VelogenicChicken45.145.757.330.0824.1921.4824.25
108KC205475-G-R-R-Q-K-R*F-VelogenicChicken45.345.558.930.0224.1321.4224.43
109GU166154-R-R-R-Q-K-R*F-VelogenicChicken42.444.255.229.6622.5021.7226.11
110KC205477-G-R-R-Q-K-R*F-VelogenicChicken44.045.258.230.3924.0721.1824.37
111 KC205476 -G-R-R-R-K-R*F- Lentogenic Chicken 45.545.5 60.0 30.1423.8921.6624.31
112EF520718-G-R-R-Q-K-R*F-VelogenicChicken45.345.357.929.9624.1321.1824.73
113FJ436306-G-R-R-Q-R-R*F-MesogenicDuck41.844.455.029.3022.2622.1426.29
114HM125898-G-G-K-Q-G-R*L-LentogenicDuck47.946.858.928.5223.8922.9224.67
115KF771883-G-R-R-Q-K-R*F-VelogenicDuck42.244.555.730.0223.3521.1825.45
116KF361507-G-E-R-Q-E-R*L-LentogenicDuck50.247.056.130.2023.7123.1622.92
117FJ754272-G-R-R-Q-K-R*F-VelogenicDuck41.744.755.229.4822.7421.9625.81
118JX401405-G-G-K-Q-G-R*L-LentogenicDuck48.146.957.728.5824.0722.8024.55
119JX401404-G-G-K-Q-G-R*L-LentogenicDuck46.946.558.928.8824.0122.5024.61
120JX401403-G-G-K-Q-G-R*L-LentogenicDuck46.446.358.728.7623.6522.6224.97
121JN688864-G-E-Q-Q-E-R*L-LentogenicDuck47.545.657.331.1122.6222.8623.41
122FJ794269-G-E-Q-Q-E-R*L-LentogenicDuck48.146.156.230.8722.8623.1023.16
123GQ849007-G-R-R-Q-K-R*F-VelogenicDuck42.045.154.729.4823.2921.8425.39
124KC894391-G-G-K-Q-G-R*L-LentogenicDuck48.147.158.628.3423.9523.1024.61
125JX193083-G-G-K-Q-G-R*L-LentogenicDuck46.146.258.428.7623.6522.5025.09
126JX193082-G-G-R-Q-G-R*L-LentogenicDuck42.844.458.630.3223.1621.2425.27
127JX193081-G-G-K-Q-G-R*L-LentogenicDuck46.546.758.328.5823.9522.7424.73
128JX193080-G-G-R-Q-G-R*L-LentogenicDuck42.944.558.530.2623.1621.3025.27
129JX193079-G-G-K-Q-G-R*L-LentogenicDuck44.345.358.029.1822.9822.2625.57
130JX193078-G-G-K-Q-G-R*L-LentogenicDuck45.946.258.928.8823.7122.4424.97
131 JX193077 -G-G-K-Q-G-R*L- Lentogenic Duck 46.946.3 59.3 28.8823.5922.6824.85
132HQ008337-G-E-R-Q-E-R*L-LentogenicDuck49.147.054.930.2623.7723.1622.80
133GQ288392-G-E-K-Q-G-R*L-LentogenicDuck42.745.058.229.3623.1021.8425.69
134GQ288391-G-G-K-Q-G-R*L-LentogenicDuck43.045.057.029.4223.0421.9625.57
135GQ288390-G-E-K-Q-G-R*L-LentogenicDuck43.445.157.629.1823.1022.0225.69
136GQ288389-G-E-K-Q-G-R*L-LentogenicDuck43.045.158.129.3023.2321.8425.63
137GQ288380-G-E-K-Q-G-R*L-LentogenicDuck43.044.958.829.4223.1621.7225.69
138GQ288379-G-E-K-Q-G-R*L-LentogenicDuck43.045.158.229.2423.1621.9025.69
139GQ288377-G-E-K-Q-G-R*L-LentogenicDuck43.445.357.529.0623.2322.0825.63
140DQ097393-G-G-R-Q-G-R*L-LentogenicDuck50.447.655.829.9023.8923.5922.62
141KC920893-G-R-R-Q-R-R*F-VelogenicDuck41.944.455.129.2422.2022.2026.35
142JN400895-G-R-R-Q-K-R*F-VelogenicDuck41.845.054.829.8423.5921.4225.15
143HQ717357-G-R-R-Q-K-R*F-VelogenicDuck40.344.355.630.0223.1021.1825.69
144HQ317395-G-R-R-Q-K-R*F-VelogenicDuck41.945.054.829.7823.5921.4225.21
145HQ317394-G-R-R-Q-R-R*F-MesogenicDuck41.944.455.129.2422.2022.1426.41
146HM063422-G-E-R-Q-G-R*L-LentogenicDuck46.146.258.828.9423.8922.3224.85
147JF893453-G-E-R-Q-E-R*L-LentogenicDuck49.547.057.130.0823.5923.3522.98
148HQ412767-G-E-R-Q-G-R*L-LentogenicDuck48.946.855.830.4523.7123.0422.8
149 HM188399 -G-R-R-Q-K-R*F- Velogenic Duck 41.744.8 54.4 29.8423.4121.3625.39
150EF521889-G-R-R-Q-K-R*F-VelogenicDuck44.444.855.429.4823.2321.6025.69
151FJ410147-G-R-R-Q-K-R*F-VelogenicPigeon43.845.457.830.3924.3121.1224.19
152FJ410145-G-R-R-Q-K-R*F-VelogenicPigeon43.645.357.930.5124.2521.0624.19
153FJ766531-G-R-R-R-K-R*F-LentogenicPigeon45.645.856.930.2024.3721.4224.01
154FJ766530-G-R-R-R-K-R*F-LentogenicPigeon45.545.856.730.2024.3721.4224.01
155FJ766528-E-K-R-Q-K-R*F-VelogenicPigeon44.345.256.430.4524.1321.0624.37
156FJ766527-G-R-R-R-K-R*F-LentogenicPigeon45.245.756.130.2624.3121.3624.07
157FJ766526-G-R-R-Q-K-R*F-VelogenicPigeon47.046.157.330.2024.8521.2423.71
158KC013040-E-R-R-Q-K-R*F-VelogenicPigeon42.444.654.630.8723.8320.7624.55
159KC013039-V-R-R-K-K-R*F-VelogenicPigeon45.145.358.330.1423.8321.4824.55
160KC013038-V-R-R-K-K-R*F-VelogenicPigeon44.145.158.030.2023.7721.3024.73
161KC013037-V-R-R-K-K-R*F-VelogenicPigeon44.045.157.429.9023.7121.3625.03
162KC013036-V-R-R-K-K-R*F-VelogenicPigeon44.244.958.130.3223.7721.1224.79
163KC013035-V-R-R-K-K-R*F-VelogenicPigeon43.744.757.630.3923.5921.0624.97
164KC013034-V-R-R-K-K-R*F-VelogenicPigeon43.845.057.930.2023.8321.1824.79
165KC013033-V-R-R-K-K-R*F-VelogenicPigeon44.245.157.930.2023.8321.3024.67
166KC013032-V-R-R-K-K-R*F-VelogenicPigeon44.645.157.930.1423.7721.3024.79
167KC013031-E-R-R-Q-K-R*F-VelogenicPigeon44.444.856.830.8124.0120.7624.43
168JX486557-R-R-R-Q-K-R*F-VelogenicPigeon46.645.956.930.2624.8521.0023.89
169JX486556-G-R-R-Q-K-R*F-VelogenicPigeon46.246.257.830.0824.7921.3623.77
170JX486555-G-R-R-Q-K-R*F-VelogenicPigeon46.246.257.830.0824.7921.3623.77
171JX486554-G-R-R-Q-K-R*F-VelogenicPigeon46.446.156.830.3224.9721.0623.65
172JX486553-G-R-R-Q-K-R*F-VelogenicPigeon46.646.257.130.224.9721.1823.65
173JX486552-G-R-R-Q-K-R*F-VelogenicPigeon44.145.157.430.7524.3120.8224.13
174JX486551-G-R-R-Q-K-R*F-VelogenicPigeon44.745.954.229.9624.2521.6024.19
175JX486550-G-R-R-Q-K-R*F-VelogenicPigeon45.745.656.230.5724.6720.8823.89
176JF827026-G-K-R-Q-K-R*F-VelogenicPigeon43.845.058.230.3223.9521.0624.67
177JX901110-G-R-R-Q-K-R*F-VelogenicPigeon45.845.854.430.2624.4921.2424.01
178JX901109-G-R-R-Q-K-R*F-VelogenicPigeon46.045.854.430.2624.5521.2423.95
179GQ429292-V-R-R-K-K-R*F-VelogenicPigeon46.045.858.630.0824.4321.3024.19
180JQ993431-G-R-R-Q-K-R*F-VelogenicPigeon44.845.357.730.7524.4920.8223.95
181JQ979176-G-R-R-Q-K-R*F-VelogenicPigeon45.145.557.430.6924.6120.8823.83
182 JF827027 -G-K-R-Q-K-R*F- Velogenic Pigeon 43.645.1 60.5 30.0823.7721.3024.85
183JN986839-G-R-R-Q-K-R*F-VelogenicPigeon46.246.056.230.2024.6721.3023.83
184 FJ986192 -G-R-R-Q-R-R*L- Lentogenic Pigeon 42.045.3 53.8 29.6623.5321.7225.09
185HM063425-G-R-R-Q-K-R*F-VelogenicPigeon46.245.956.530.1424.4321.4823.95
186EF026583-G-R-R-Q-K-R*F-VelogenicPigeon45.845.854.430.2624.4921.2424.01
187EF026579-G-R-R-Q-K-R*F-VelogenicPigeon45.845.854.530.2624.5521.2423.95
188AJ880277-G-G-R-Q-K-R*F-VelogenicPigeon44.245.658.030.3924.4321.1224.07
189DQ417113-E-K-R-Q-K-R*F-MesogenicPigeon45.845.156.530.3223.9521.1824.55
190JF713701-G-R-R-Q-K-R*F-VelogenicPigeon48.046.354.829.9624.9121.4223.71
191FJ754273-G-R-R-Q-K-R*F-VelogenicGoose41.944.856.129.4822.9221.8425.75
192KC551967-G-R-R-Q-K-R*F-VelogenicGoose44.345.353.530.4524.2921.0624.31
193KC152049-G-R-R-Q-K-R*F-VelogenicGoose44.345.353.430.3924.1921.1224.31
194KC152048-G-R-R-Q-K-R*F-VelogenicGoose44.145.353.630.3224.0121.3024.37
195 JN688865 -G-E-Q-Q-G-R*L- Lentogenic Goose 48.646.1 53.1 30.3222.3823.6523.65
196JN631747-G-R-R-Q-K-R*F-VelogenicGoose41.744.654.930.0223.2921.3025.39
197GU143550-G-R-R-Q-K-R*F-VelogenicGoose41.144.555.329.5422.7421.7225.99
198DQ659677-G-R-R-Q-K-R*F-VelogenicGoose41.744.754.629.6622.9821.6025.75
199FJ430160-G-R-R-Q-R-R*F-MesogenicGoose42.144.655.129.0022.3822.2026.41
200 AB524405 -G-E-R-Q-E-R*L- Lentogenic Goose 49.647.1 56.8 29.9623.4723.5323.04
201JF340367-G-R-R-Q-K-R*F-VelogenicGoose41.944.855.729.7223.2321.6025.45

*Represents cleavage point, NA represents an unknown pathotype. The pathotype has been identified based on the cleavage site. Bold letters indicates the lower and higher value of Nc.

*Represents cleavage point, NA represents an unknown pathotype. The pathotype has been identified based on the cleavage site. Bold letters indicates the lower and higher value of Nc.

Codon usage analysis

The patterns of codon usage were analyzed for the two hundred and one F gene sequences for NDV isolated from four major host species. The relative synonymous codon usage (RSCU) values of each codon for the F gene were calculated using Codon W 1.4.4 software. The calculation of RSCU index enables the characterization of synonymous usage of codons and is expressed as the ratio of the observed usage of codons to the expected value if all codons were used frequently. The RSCU value of 1 indicates that the codon is chosen randomly and evenly, RSCU >1 indicates that the codon usage is more frequent than the expected, and RSCU <1 indicates that the codon chosen is less frequent [30]. RSCU calculation formulae: RSCU  = Where gij  =  observed number of codons for ith codon for jth amino acid that has nj kinds of synonymous codon.

Effective number of codons

Quantification of the codon usage bias of the ORF in a gene is calculated by the effective number of codons (ENC or Nc). The Nc best estimates the absolute synonymous codon usage bias in a gene. ENC calculation formulae: ENC  = where s represents the value of G+C at the third codon position (GC3) [31]. Nc has been calculated using the Codon W 1.4.4 program. The Nc value was correlated to the percentage of GC3s. In case of biased codon usage only one codon for each amino acid is used and the Nc equals to 20. In case the Nc value is 61 then there is no bias in codon usage and all synonymous codons are equally used.

Codon adaptation index (CAI)

The CAI requires a reference set of highly expressed known gene and enables the estimation of the amount of bias (Codon W 1.4.4 program). The high value of CAI refers higher codon usage bias and expression level [32], [33], [34]. The CAI index is defined as the geometric mean of relative adaptiveness values. Non-synonymous codons and termination codons (dependent on genetic code) are excluded from the calculation. CAI values range from 0 to 1, with higher values indicating a higher proportion of the most abundant codons [35].

Chemical property analysis of amino acids using various indices

Aliphatic Index

The aliphatic index (AI) refers to the relative volume of a protein that is occupied by aliphatic side chains (alanine, isoleucine, leucine and valine) and contributes to the increased thermo-stability observed for globular proteins. The AI of a protein is calculated according to the following formula [36]. Aliphaatic index (AI)  = Here: X(A), X(V), X(I), and X(L) are mole percent (100 X mole fraction) of alanine, valine, isoleucine, and leucine, respectively. The coefficients a, b are the relative volumes of valine side chains (a  =  2.9) and of Leu/Ile side chains (b  =  3.9) relative to that of alanine side chains.

Grand average of hydropathy (GRAVY)

GRAVY is calculated as the arithmetic mean of the sum of the hydrophobic indices of each amino acid [37].

Correspondence analysis

Principal component analysis (PCA) was performed using the software XLSTAT version 2013.5.02. The PCA provides information regarding the major trend involved in the codon usage patterns measured from RSCU values and are calculated from 59 codons excluding methionine, tryptophan and all termination codons [38]. Correlation analysis was performed for the first two axes of PCA (PC1 and PC2). Pearson rank correlation analysis was performed to infer the relationships between the two axes of PCA and different variables like GRAVY, aromaticity index, aliphaticity index and GC3s.

Phylogenetic analysis

Major avian-host species of NDV

The phylogenetic relationship of the four major avian host species of NDV was studied using the MEGA6 software. The mitochondrial DNA is an important data source in building the phylogenetic [39]. Advantages of mitochondrial genome over nuclear gene is that they are unlikely to have undergone many intra specific recombination events [40]. In order to confer the phylogenetic relationship of the major avian host species the mitochondrial genome reference sequence for the four major avian host species Anas platyrhynchos (Duck, ref seq. NC009684, 16604bp), Gallus gallus (Chicken, ref seq. NC001323, 16775bp), Anser anser (Goose, ref seq. NC011196, 16738bp) and Columba livia (Pigeon, ref seq. NC013978, 17229bp) were downloaded from GenBank. The neighbor-joining method was used with parameters including pairwise deletion, 1000 replicates for bootstrap analysis and Dayhoff substitution model.

NDV strains

The phylogenetic relationship between the NDV strains was studied using MEGA6 software. The 201 complete F gene sequences were obtained from GenBank (Table 1). For the ease of understanding the labeling was done as accession no/virulence/host species. In the label L, M, V, CH, DK, GE and PN stand for lentogenic, mesogenic, velogenic, chicken, duck, goose and pigeon respectively. The neighbor-joining method was used with parameters including pairwise deletion, 1000 replicates for bootstrap analysis and Jukes-Cantor substitution model.

Results

Codon usage bias of fusion (F) gene in four major host species

The Nc value is the determinant of the degree of bias in codon usage. The four major host species showed a range of Nc values. For chicken, the maximum and minimum Nc values were 60 and 52.3, respectively. For duck, the maximum and minimum Nc values were 59.3 and 54.4, respectively. For pigeon, the maximum and minimum Nc values were 60.5 and 53.8, respectively. For goose, the maximum and minimum Nc values were 56.8 and 53.1, respectively (Table 1). The mean Nc was found to be maximum for duck and minimum for the goose. The GC3 is the amount of G + C at the third position whereas the GC is the total amount of G +C. The pigeons showed the maximum value of mean GC3s while the minimum value of mean GC3s was calculated for chicken. Similarly, maximum value of mean GC was calculated for a duck while chicken showed the minimum value (Figure 1). Slight variation in the values of mean GC3s, GC and Nc for duck and pigeon was observed. The mean GC was found to be greater than mean GC3s for all four species.
Figure 1

Improved effective number of codons index (Nc) as a measure of overall average codon usage bias (CUB) in four different species.

The actual mean Nc, mean GC3s and mean GC calculated at 95% confidence.

Improved effective number of codons index (Nc) as a measure of overall average codon usage bias (CUB) in four different species.

The actual mean Nc, mean GC3s and mean GC calculated at 95% confidence.

Species specific identification of optimal codons

Analysis of codon based on RSCU values showed 21 optimal codons for 19 different amino acids, preferentially used for F gene of NDV (Table 2). A preferential usage of the optimal codons was compared against the species' specific codon usage and the frequencies of optimal codon usage (FOP) in all species were calculated [41]. Species specific codon usage was compared with the optimal codon usage. However, analysis within species showed maximum similarity with the optimal codon usage for chicken followed by goose, duck and minimum for pigeon isolate (Table 3).
Table 2

List of species specific synonymous codon usage in F gene of Newcastle disease virus. N is the number of codons; RSCU is cumulative relative synonymous codon usage.

Amino acidCodonChickenDuckPigeonGooseOverall
NRSCUNRSCUNRSCUNRSCUNRSCU
PheUUU 497 1.21 140 1.06 192 1.21 46 1.10 875 1.18
UUC3270.791230.941250.79380.906130.82
LeuUUA 1580 1.20 4360.98 599 1.25 137 1.06 2752 1.16
UUG12670.96 467 1.05 2940.621280.9921560.91
TyrUAU 1094 1.09 335 1.01 3000.88 130 1.29 1859 1.04
UAC9210.913260.99 381 1.12 710.7116990.96
StopUAA60.1670.5520.1530.71180.27
StopUAG00.0000.0000.0000.0000.00
LeuCUU 1408 1.07 4511.015221.091240.9625051.06
CUC11870.904160.934781.001230.9522040.93
CUA11230.853720.843720.781210.9419880.84
CUG13531.03 529 1.19 602 1.26 141 1.09 2625 1.11
HisCAU 203 1.57 49 1.51 99 1.47 22 1.69 373 1.54
CAC550.43160.49360.5340.311110.46
GlnCAA 1569 1.04 490 0.89 657 1.27 1450.96 2861 1.05
CAG14540.966111.113810.73 156 1.04 26020.95
IleAUU11690.723970.733160.521100.6819920.68
AUC15700.975521.016321.031701.0629241.00
AUA 2101 1.30 683 1.26 886 1.45 203 1.26 3873 1.32
MetAUG13611.004921.004431.001321.0024281.00
AsnAAU 2433 1.31 779 1.25 861 1.26 229 1.27 4302 1.28
AAC12930.694710.755110.741320.7324070.72
LysAAA11410.884080.87 509 1.09 1010.8021590.91
AAG 1461 1.12 527 1.13 4270.91 153 1.20 2568 1.09
ValGUU8070.762730.752040.57710.6713550.71
GUC 1427 1.34 429 1.18 526 1.46 138 1.30 2520 1.33
GUA10971.033851.063530.981211.1419561.03
GUG9370.883641.003611.00940.8917560.93
AspGAU 1194 1.07 399 1.01 442 1.05 1090.99 2144 1.05
GAC10310.933890.993960.09 112 1.01 19280.95
GluGAA 1211 1.20 408 1.13 376 1.09 118 1.17 2113 1.17
GAG8010.803130.873130.91840.8315110.83
SerUCU11051.194081.344631.391151.2820911.26
UCC8800.952620.861920.57770.8614110.85
UCA 1651 1.78 436 1.43 651 1.95 155 1.73 2893 1.75
UCG3790.412080.681240.37360.407470.45
CysUGU 937 1.35 299 1.23 2330.92 93 1.38 1562 1.24
UGC4550.651890.77 276 1.08 420.629620.76
StopUGA 106 2.84 31 2.45 38 2.85 8 2.18 183 2.73
TrpUGG1191.00391.00421.00111.002111.00
ProCCU 744 1.46 228 1.35 1740.96 68 1.39 1214 1.34
CCC4540.891931.14 208 1.15 511.049061.00
CCA5311.041480.871981.09470.969241.02
CCG3080.601080.641460.80300.615920.65
ArgCGU 207 0.57 94 0.88 430.34160.45 360 0.57
CGC1830.50520.49 85 0.67 22 0.62 3420.54
CGA1200.33180.17800.63100.282280.36
CGG1930.53620.58450.35190.543190.50
ThrACU19361.306411.236151.161651.1233571.25
ACC16681.126051.166211.171671.1430611.14
ACA 1984 1.33 743 1.43 787 1.49 214 1.46 3728 1.39
ACG3870.26910.18960.18420.2866160.23
SerAGU4920.531910.631810.54560.639200.56
AGC 1057 1.14 308 1.02 393 1.18 98 1.09 1856 1.12
ArgAGA 871 2.39 1881.771921.51 79 2.23 1330 0.70
AGG6121.68 224 2.11 319 2.51 671.8912220.64
AlaGCU12060.944260.983440.771130.9020890.91
GCC12931.004571.065321.181230.9824051.05
GCA 2101 1.63 714 1.65 675 1.51 220 1.75 3710 1.62
GCG5530.431330.312380.53470.379710.42
.GlyGGU 1211 1.34 427 1.12 3050.791070.9820501.03
GGC11781.303220.84 528 1.36 1141.0421421.08
GGA8910.993280.862620.68970.8915780.80
GGG11331.254521.184551.17 119 1.09 2159 1.09

The N and RSCU value of codons more preferentially used for each amino acid are displayed in bold. The overall values depict the codon usage for the F gene for all isolates.

Table 3

Species specific codon usage of F gene in Newcastle disease virus.

Amino acidCodonCodons generally usedCodons predominantly used in F gene
ChickenDuckPigeonGoose
PheUUU UUU UUU UUU UUU UUU
UUC
LeuUUA UUA UUA UUA UUA
UUGUUG
TyrUAU UAU UAU UAU UAU
UACUAC
terUAA
terUAG
LeuCUUCUU CUG
CUC
CUA
CUG CUG CUG CUG
HisCAU CAU CAU CAU CAU CAU
CAC
GlnCAA CAA CAA CAA CAA
CAGCAG
IleAUU AUA
AUC
AUA AUA AUA AUA AUA
MetAUG AUG AUG AUG AUG AUG
AsnAAU AAU AAU AAU AAU AAU
AAC
LysAAAAAA AAG
AAG AAG AAG AAG
ValGUU GUC
GUC GUC GUC GUC GUC
GUA
GUG
AspGAU GAU GAU GAU GAU
GACGAC
GluGAA GAA GAA GAA GAA GAA
GAG
SerUCU UCA
UCC
UCA UCA UCA UCA UCA
UCG
CysUGU UGU UGU UGU UGU
UGCUGC
terUGA UGA UGA UGA UGA UGA
TrpUGG UGG UGG UGG UGG UGG
ProCCU CCU CCU CCU CCU
CCCCCC
CCA
CCG
ArgCGU CGU CGU CGU
CGCCGCCGC
CGA
CGG
ThrACU ACA
ACC
ACA ACA ACA ACA ACA
ACG
SerAGU AGC
AGC AGC AGC AGC AGC
ArgAGA AGA AGA AGA
AGGAGGAGG
AlaGCU GCA
GCC
GCA GCA GCA GCA GCA
GCG
GlyGGUGGUGGU GGG
GGC
GGA
GGG GGG GGG
The N and RSCU value of codons more preferentially used for each amino acid are displayed in bold. The overall values depict the codon usage for the F gene for all isolates.

Nc plot

The Nc value is plotted against the corresponding GC3s. The genes having their codon selection constrained to GC composition are supposed to lie on the continuous curve which represents a random codon usage. If the gene lies below the curve it represents mutational bias and translational selection. All the points were found to lie just below the curve (Figure 2). Furthermore, a significant positive correlation between the values of GC3s and Nc was observed. There was almost no difference between the CAI values for all the isolates. The average CAI was found to be 0.174 (p <0.05), which is apparently low as CAI ranges from 0 to 1. Principal component analysis using Pearson correlation was performed to evaluate the relationship between the first two axes of PCA, GRAVY, aromaticity, GC3s and aliphatic index (Table 4). The GRAVY, aromaticity, GC3s were found to have no correlation with both axes. To represent the variation in position of each codon a scatter plot for the optimal codon usage is plotted between the PC1 and PC2 (Figure 3). The PC1 accounts for 89.10% and PC2 accounts for 7.45% of the total variation. Thus the first axis accounts for the major impact on total variation in synonymous codon usage as compared to an appreciable impact by the second axis. The correlation between the isolates from four species is represented in the plot of isolates against the PC1 and PC2 (Figure 4).
Figure 2

Nc of codons used plotted against the GC3s.

The continuous curve as seen in the plot represents the relationship between GC3s and the Nc in the absence of selection. All the values lie below the expected curve. The curve represents the expected codon usage if GC compositional constraints alone account for codon usage bias.

Table 4

Summary of correlation analysis between the first two axes in COA, GC3s, GRAVY, aromaticity and aliphatic Index (AI) in the selected complete fusion (F) gene sequences for Newcastle disease virus isolates from four major avian species.

GRAVYAromaticityGC3sAliphatic Index
Axis 1r−0.5310.379−0.576−0.780
p0.4690.6210.4240.220
Axis 2r0.594−0.3320.6170.815
p0.4060.6680.3830.185

No correlation was observed between the two axes, GRAVY, aromaticity, GC3s and AI.

P-value ≤ 0.05; P-value ≤ 0.01 were used for the correlation analysis.

Figure 3

A Scatter plot of the optimal codon distribution on the first and second principal axes are derived from PCA analysis.

The first axis (PC1) explained 89.90% of total variance, while the second axis (PC2) accounted for 7.45% of total variance.

Figure 4

Relatedness among the species based on the codon usage.

Chicken, Goose and Duck are more related in terms of codon bias than Pigeon which is an outlier.

Nc of codons used plotted against the GC3s.

The continuous curve as seen in the plot represents the relationship between GC3s and the Nc in the absence of selection. All the values lie below the expected curve. The curve represents the expected codon usage if GC compositional constraints alone account for codon usage bias.

A Scatter plot of the optimal codon distribution on the first and second principal axes are derived from PCA analysis.

The first axis (PC1) explained 89.90% of total variance, while the second axis (PC2) accounted for 7.45% of total variance.

Relatedness among the species based on the codon usage.

Chicken, Goose and Duck are more related in terms of codon bias than Pigeon which is an outlier. No correlation was observed between the two axes, GRAVY, aromaticity, GC3s and AI. P-value ≤ 0.05; P-value ≤ 0.01 were used for the correlation analysis. The phylogenetic tree of the major avian host species of NDV represents the relationship between them. It was observed that Anas platyrhynchos (Duck) and Gallus gallus (Chicken) are ancestrally more correlated whereas; Anser anser (Goose) and Columba livia (Duck) bear a closer ancestral relationship (Figure 5).
Figure 5

Phylogenetic tree illustrating relationship among the four host species following the neighbour-joining method using MEGA6 software.

Parameters include: pairwise deletion, 1000 replicates for bootstrap analysis and Dayhoff substitution model. The reference sequence of the mitochondrial DNA was taken to study the relationship among the host species.

Phylogenetic tree illustrating relationship among the four host species following the neighbour-joining method using MEGA6 software.

Parameters include: pairwise deletion, 1000 replicates for bootstrap analysis and Dayhoff substitution model. The reference sequence of the mitochondrial DNA was taken to study the relationship among the host species. The phylogenetic tree of the 201 NDV isolates represents the relationship which can be categorized on two bases (Figure 6). The first basis is being species from which the strain was isolated and the second being virulence shown in that species based on the F protein cleavage site. It was observed that on the basis of species except one pigeon isolate (FJ986192) lying in region 3 all the pigeon isolates were seen to lie in region 2 (Figure 6). The region 1 and 5 consisted of isolates from chicken, duck and goose whereas; the region 4 consisted of isolates of only duck. It is evident from the phylogenetic tree that isolates from pigeon clearly out lies and is not found mixed as is seen in case of chicken, duck and goose. A group of isolates from duck is also seen in region 4 to out lie from the rest. On the basis of virulence the phylogenetic tree can be seen to represent a similar trend, all the isolates from duck species lying in region 4 are lysogenic. Most of the mesogenic isolates from chicken are seen to lie in region 3. Very few reported strains from pigeon are lentogenic and mesogenic whereas most are velogenic and seen to group together in region 2. The region 1 comprises of most of the velogenic isolates of chicken, goose and duck.
Figure 6

Phylogenetic tree illustrating relationship among the 201 Newcastle disease virus (NDV) strain (labelled as accession no/pathogenic/species) following the neighbour-joining method using MEGA6 software.

Parameters include: pairwise deletion, 1000 replicates for bootstrap analysis and Jukes-Cantor substitution model, the rate variation among sites was modelled with a gamma distribution (shape parameter  =  5). L, M and V stands for lentogenic, mesogenic and velogenic strains. CH, PN, DK and GE stands for the chicken, pigeon, duck and goose, respectively.

Phylogenetic tree illustrating relationship among the 201 Newcastle disease virus (NDV) strain (labelled as accession no/pathogenic/species) following the neighbour-joining method using MEGA6 software.

Parameters include: pairwise deletion, 1000 replicates for bootstrap analysis and Jukes-Cantor substitution model, the rate variation among sites was modelled with a gamma distribution (shape parameter  =  5). L, M and V stands for lentogenic, mesogenic and velogenic strains. CH, PN, DK and GE stands for the chicken, pigeon, duck and goose, respectively.

Discussion

NDV is one of the most important diseases of poultry and is endemic in many parts of the world. Occasionally the virus has also been reported from many different animal species. The F protein determines the extent of infectivity of NDV [3], [5]. Although the major players of NDV virulence and pathogenicity are F and HN, but its gradients are largely multigenic. The F being the major antigenic determinant in NDV changed our views of considering HN as a major protective antigen [19]. The interaction between the host cell membrane and the fusion protein may depend on the type of species that gets infected with NDV. It has been shown that mutational pressure plays an important role in codon usage bias in NDV [42]. The codon bias in the F protein may vary within the species that are infected with NDV thus it is significant to address codon bias in F protein of NDV. Although NDV has been isolated from many other avian as well as non-avian hosts but, their meager number in the GenBank makes the data insufficient to consider for the present study. Four major avian species, namely chicken, duck, pigeon and goose were chosen for the present study considering the fact that most of the NDV strains are isolated from these species. Although only few NDV sequences are reported from goose, a total of 201 GenBank entries were included in the study covers the major isolates from four avian species. There is an obvious difference between these values for the isolates with lower Nc values suggesting that a lower Nc represents greater bias in codon usage. Amongst the four major host species maximum bias is for isolate from goose and least for isolate from duck. Although, there is a clear variation in the codon usage by the isolates from the four species still similarities within isolates can be observed. A plot of Nc values against GC3s effectively demonstrates heterogeneity [31]. The results suggest that the GC mutational bias is maximum in case of isolates from goose and minimum in case of isolates from duck. Low value of CAI obtained after analyzing all the isolates from the four species suggest lower codon usage bias and its low expression level. The frequencies of aromatic and aliphatic amino acids were found to have no association with the variation in codon usage in the F gene. Our results showed that the relatedness between the isolates from four species can be grouped according to the minimum positional variation. The isolates from chicken, goose and duck have least variation and can be considered as closely related isolates in terms of codon usage. In contrast, marked variation among isolates from pigeon suggests their distance in terms of codon usage. It corroborates with the fact that greater the distance between the isolates, greater is the variation in codon usage. It is also evident from the phylogenetic tree of NDV that isolates of pigeon clearly lies separately. Moreover, some of the isolates from duck also lie separately. Thus, the phylogenetic tree (Figure 6) and the relatedness between the species based on codon usage bias (Figure 4) clearly complement each other. The relationship between the host species and the NDV isolates from the host does not represent a significant correlation in our study. To the best of our understanding the present work is the most comprehensive codon bias analysis of a viral protein from species' point of view. It would be interesting to statistically investigate the NPL complex of NDV in terms of its codon usage and its role in virulence if any.
  39 in total

Review 1.  A case for evolutionary genomics and the comprehensive examination of sequence biodiversity.

Authors:  D D Pollock; J A Eisen; N A Doggett; M P Cummings
Journal:  Mol Biol Evol       Date:  2000-12       Impact factor: 16.240

2.  Variation in G + C-content and codon choice: differences among synonymous codon groups in vertebrate genes.

Authors:  A Marín; J Bertranpetit; J L Oliver; J R Medina
Journal:  Nucleic Acids Res       Date:  1989-08-11       Impact factor: 16.971

3.  The codon Adaptation Index--a measure of directional synonymous codon usage bias, and its potential applications.

Authors:  P M Sharp; W H Li
Journal:  Nucleic Acids Res       Date:  1987-02-11       Impact factor: 16.971

4.  Thermostability and aliphatic index of globular proteins.

Authors:  A Ikai
Journal:  J Biochem       Date:  1980-12       Impact factor: 3.387

5.  A simple method for displaying the hydropathic character of a protein.

Authors:  J Kyte; R F Doolittle
Journal:  J Mol Biol       Date:  1982-05-05       Impact factor: 5.469

6.  Synonymous codon usage in Escherichia coli: selection for translational accuracy.

Authors:  Nina Stoletzki; Adam Eyre-Walker
Journal:  Mol Biol Evol       Date:  2006-11-13       Impact factor: 16.240

7.  Analysis of synonymous codon usage in hepatitis A virus.

Authors:  Yiqiang Zhang; Yongsheng Liu; Wenqian Liu; Jianhua Zhou; Haotai Chen; Yin Wang; Lina Ma; Yaozhong Ding; Jie Zhang
Journal:  Virol J       Date:  2011-04-16       Impact factor: 4.099

8.  Mutation bias is the driving force of codon usage in the Gallus gallus genome.

Authors:  Yousheng Rao; Guozuo Wu; Zhangfeng Wang; Xuewen Chai; Qinghua Nie; Xiquan Zhang
Journal:  DNA Res       Date:  2011-10-27       Impact factor: 4.458

9.  Newcastle disease virus fusion and haemagglutinin-neuraminidase proteins contribute to its macrophage host range.

Authors:  Ingrid Cornax; Diego G Diel; Cary A Rue; Carlos Estevez; Qingzhong Yu; Patti J Miller; Claudio L Afonso
Journal:  J Gen Virol       Date:  2013-02-20       Impact factor: 3.891

10.  Role of fusion protein cleavage site in the virulence of Newcastle disease virus.

Authors:  Aruna Panda; Zhuhui Huang; Subbiah Elankumaran; Daniel D Rockemann; Siba K Samal
Journal:  Microb Pathog       Date:  2004-01       Impact factor: 3.738

View more
  4 in total

1.  Insight towards the effect of the multi basic cleavage site of SARS-CoV-2 spike protein on cellular proteases.

Authors:  Kamal Shokeen; Shambhavi Pandey; Manisha Shah; Sachin Kumar
Journal:  Virus Res       Date:  2022-06-06       Impact factor: 6.286

2.  Circulation, genomic characteristics, and evolutionary dynamics of class I Newcastle disease virus in China.

Authors:  Lijia Jia; Bilin Liang; Ke Wu; Runkun Wang; Haizhou Liu; Quanjiao Chen
Journal:  Virulence       Date:  2022-12       Impact factor: 5.882

3.  Evolutionary dynamics of codon usages for peste des petits ruminants virus.

Authors:  Xin Wang; Jing Sun; Lei Lu; Fei-Yang Pu; De-Rong Zhang; Fu-Qiang Xie
Journal:  Front Vet Sci       Date:  2022-08-12

Review 4.  Mechanism of Cross-Species Transmission, Adaptive Evolution and Pathogenesis of Hepatitis E Virus.

Authors:  Putu Prathiwi Primadharsini; Shigeo Nagashima; Hiroaki Okamoto
Journal:  Viruses       Date:  2021-05-14       Impact factor: 5.048

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.