Literature DB >> 28386202

Detecting early-warning signals for influenza A pandemic based on protein dynamical network biomarkers.

Jie Gao1, Kang Wang2, Tao Ding2.   

Abstract

The outbreak of influenza A comes from a relatively stable state is a critical phenomenon on epidemic. In this paper, influenza A varying from different states is studied in the method of dynamical network biomarkers (DNB). Through studying DNB of influenza A virus protein, we can detect the warning signals of outbreak for influenza A and obtain a composite index. The composite index varies along with the state of pandemic influenza, which gives a clue showing the turn point of outbreak. The low value (<1) steady state of the composite index means influenza A is normally in the relatively steady stage. Meanwhile, if the composite index of a certain year increases by more than 0.8 relative to the previous year and it is less than 1 and it increases sharply and reaches a peak being larger than 1 in next year, it means the year is normal in the critical state before outbreak and the next year is normally in the outbreak state. Therefore, we can predict the outbreak of influenza A and identify the critical state before influenza A outbreak or outbreak state by observing the variation of index value.

Entities:  

Keywords:  Dynamical network biomarker (DNB); Early-warning signal; Influenza A virus; The critical state; The outbreak state

Year:  2017        PMID: 28386202      PMCID: PMC5372459          DOI: 10.1016/j.sjbs.2017.01.048

Source DB:  PubMed          Journal:  Saudi J Biol Sci        ISSN: 2213-7106            Impact factor:   4.219


Introduction

It is proved that there exists a kind of common critical phenomenon in many complex biological processes, i.e. a relative stable state enters into another state quickly after a soon critical point (Chen et al., 2012, Liu et al., 2012) There is the kind of critical phenomenon for influenza A, because it needs only a very short period of time quickly from a relative stable state to outbreak state after a critical point. Thus in order to prevent and control the outbreak of influenza A pandemic timely and effectively, the key solution lies in predicting the critical point before the outbreak. At present, influenza A is studied from many aspects. Ya-Nan Pan et al. found that the spatio-temporal network that connects the cities with human cases along the order of outbreak timing emerges two-section-power-law edge-length distribution, using the empirical analysis and modeling studies (Pan et al., 2014, Zhang, 2016). Chang et al. (2009) studied the vaccine for influenza, so as to achieve the effect of prevention of influenza. Banerjee et al. (2015) made full comparisons for the structural features of all H1N1 HA gene sequences and the composition of global amino acid to make it possible to depict the developing trend of influenza A. He et al. (2014) also made indepth studies to identify HA protein epitopes of avian influenza virus. This paper studies the different states of influenza A using dynamical network biomarkers (DNB). Through studying DNB of influenza A virus protein, we can detect the warning signals of outbreak for influenza A and obtain a composite index. The composite index varies along with the state of pandemic influenza, which gives a clue showing the turn point of outbreak. Therefore, we can predict the outbreak of influenza A and identify the critical state before influenza A outbreak or outbreak state by observing the variation of index value. This indicates the composite index can provide reliable and significant warning information to detect the stage of influenza A, which will be significantly meaningful for the warning and prevention of influenza A pandemic.

Method

The concept of network biomarkers is set up with the development of high-throughput genomic technologies and the systematic and multidimensional study of molecular expression profiling (Liu et al., 2014, Wu et al., 2012). This concept refers to a series of markers as well as their mutual relations and has been proposed as a new marker type (Jin et al., 2008, Yao et al., 2015). Compared with traditional biomarkers, these markers can accurately distinguish disease states for taking the links between the molecules into consideration (Simon, 2005, Ludwig and Weinstein, 2005). However, it is used to diagnose the states of diseases, not for the detecting the critical point before the outbreak of diseases. The method of dynamic network biomarkers focuses on the detection and assessment of different stages of the disease in the development of disease. This is a time-dependent method (Sun et al., 2014). It studies the location changes of the markers over time and the relationship among network markers over time changing. Meanwhile, this method can construct three-dimensional images showing the interaction relationship between the markers. Therefore the study of Network markers focuses on the molecular interactions and distinguishes normal and disease states. The study of dynamic network markers focusing on dynamic changes, is helpful to discover the marker accurately, comprehensively, and further to distinguish the state of disease before outbreak. It does not only depend on the method of small sample excavation mode markers, but also make it easier for clinical application. At the same time it can be used in wide studies to find early warning signals in any biological process, such as differentiation, senescence and cell cycle of each phase as well as key change.

Results

Data

Here are ten of proteins for influenza A virus hemagglutinin (HA), matrix protein, matrix protein 2, neuraminidase, non-structural protein 1, non-structural protein 2, nucleocapsid protein, PA RNA polymerase, PB1 RNA polymerase and PB2 RNA polymerase. They are composed of 20 different amino acids link to form polymers. This paper selects influenza A virus protein sequences from 1933 to 2015 from the NCBI website (www.ncbi.nlm.nih.gov/), whereas some data in 1937, 1938, 1939, 1940, 1941, 1942, 1944, 1951, 1952, 1953, 1954 and 1955 years are absent.

Model

Defining dynamic network biomarker

Taking HA protein as an example firstly, we suppose that a HA protein marked y is linked sequentially by t numbers of amino acids. Its amino acid sequence is represented by y = x1x2⋯x, in which x ∊ {A, V, L, I, P, F, W, M, D, E, G, S, T, C, Y, N, Q, K, R, H}; i = 1,2,…,t. We suppose s-1-th year have m numbers of influenza virus HA proteins all over the world and its amino acid sequence is represented by y−1,1, y−1,2,…,y−1,. Meanwhile, We suppose s-th year have n numbers of influenza virus HA proteins all over the world and its amino acid sequence is represented by y,1, y,2,…,y,. The amino acid number of the y, is marked c,, where i = s−1,s; j = 1,2,…,q; q = max{m,n}. Sequentially selecting the i-th amino acid for y−1,1,y−1,2,…,y−1, to form a new amino acid sequence is defined Z−1,, and then take out the largest one of amino acids number. If the maximum number of amino acids has two or more than two, we take the first amino acid without loss of generality. At the same time, these amino acids are marked x, where i = 1,2,…,k; k = max{c−1,1,c−1,2,…,c−1,}. We individually connect them in order to form a new amino acid sequence (U−1 = x1x2…x) and then separately compare with corresponding amino acids of y,1,y,2,…,y, one by one. If they are different, the assignment is one, on the contrary the assignment is zero. Therefore, n new sequences are represented by E,1,E,2,…,E, are obtained in s-th year. Then we calculate their mean (M), standard deviation (SD) and coefficient of variation (CV). Their computation formulas are as follows:where f(s,i) represents the frequency of occurrence of one in sequence E,. Similarly, we calculate M, SD and CV of the other nine proteins. The protein that values of CV are the top three are defined as core protein (CP), and the others are no-core protein (NP). CP is a set of high confidence interactions of proteins, which forms a sub-network called influenza A virus proteins of the protein dynamical network biomarkers.

The early warning model for influenza A

We calculate the frequencies of the 20 kinds of amino acids, and the computation formulas are as follows:where represents the frequency of occurrence of amino acid x in amino acid sequence y. Now, we can get a 23 dimensional characteristic value vector of HA protein. By the same way, the of the other nine proteins can be calculated in turn, so we can get a characteristic value matrix (X = [V1(s),V2(s),…,V10(s)]), where V(s) represents the characteristic value vector of the t-th influenza A protein, t = 1,2,…,10. Defining the characteristic distance between proteins:where v and w represents the v-th and the w-th protein respectively. The core proteins are not only the universal indicators to detect the complex outbreak signal of influenza A, but also the dominant or driving network of the whole protein system in the development, mutation and outbreak of the critical stages. In fact, the dominant network breaks through the limits of variation in the first time, first enters to the state of variation, and then affects other proteins and lead to the transfer of the entire system. Therefore, the determination of the dominant network can not only detect system in the critical state before break out, also help to reveal the underlying mechanism of influenza A virus proteins from the dimension of dynamic network. By combining the above properties of the core proteins, we can define a composite index:where represents the average value of the core proteins’ CV, is the average value of the characteristic distance between the core proteins, and is the average value of the characteristic distance between the core and non-core proteins. When , and Is−1 < 1, Is < 1, , it can be concluded that s-th year is in the critical state before the outbreak, the s + 1 year is in the outbreak state. Although the amino acid sequence of each protein will fluctuate randomly, the composite index can provide a credible and significant early warning when the influenza A virus is close to the critical state before the outbreak or the outbreak state.

Data processing

As shown in Table 1, we can calculate the composite index of the 1934–2015 using the above methods. However, we can’t figure out the composite index of some years, because some data in 1937, 1938, 1939, 1940, 1941, 1942, 1944, 1951, 1952, 1953, 1954 and 1955 years are absent.
Table 1

Composite index values from 1934 to 2015.

YearIYearIYearI
19340.72322719700.62742819931.187957
19350.96208319710.72851619941.225976
19360.4340319722.37932219950.164168
19430.54305919730.7988819960.611397
19460.44186619740.52783519970.711091
19470.85160419750.80129419980.629408
19480.44809219762.27551919990.781102
19490.76029319772.18215720000.710281
19500.73434919780.43816920010.295353
19511.02734119790.3269720020.660193
19570.85972819800.74608220030.405805
19580.94928119810.45563220040.45421
19590.47486619820.65045420050.772595
19600.55081119830.44978920061.595902
19610.70877219840.35463220070.476057
19620.0807819850.93970220080.798138
19630.98001219861.16694720091.344778
19640.65085419870.65597120100.962241
19650.52720119881.03368120110.440067
19660.58045219890.91252720120.635735
19670.66678319901.89865620130.606009
19682.3127119911.24881820140.806321
19691.08125719921.03240120152.516147

Discussion

The dynamic network markers of Pandemic influenza virus vary in the whole process from a relatively stable state to the critical state before outbreak as well as the outbreak state, which results in the status transfer of the entire network and finally results in fluctuations in the composite index. Therefore, by observing the transformation of the composite index, we can predict the critical state before the outbreak of pandemic influenza and the outbreak state. The flu broke out in Hong Kong in 1968 and continued until 1969, 7.5 million people died during that period. In 1972, influenza broke out in Henan Province and quickly spread to the entire province. As shown in Fig. 1, in 1965, the composite index value is 0.527201; 1966 is 0.580452; 1967 is 0.666783; 1968 is 2.31271; 1969 is 1.081257; 1970 is 0.405805; 1971 is 0.728516 and 1972 is 2.379322. Because I1967 − I1966 = 0.086331 > 0.08, I1966 < 1, I1967 < 1, , , the state in 1967 is the critical state before the outbreak, the state in 1968 is the outbreak state and the state continues until 1969. Similarly, I1971 − I1972 = 0.101088 > 0.08, I1970 < 1, I1971 < 1, , so the state in 1971 is the critical state before the outbreak, the state in 1972 is the outbreak state.
Figure 1

Trend Chart of composite index values from 1965 to 1972.

The influenza A broke out in The United States, Russia and Japan in 1976 and 1977. The incidence rate was very high in young people. As shown in Fig. 2, in 1973, the composite index value is 0.79888; 1974 is 0.527835; 1975 is 0.801294; 1976 is 2.275519; 1977 is 2.182157. I1975 − I1974 = 0.273459 > 0.08, I1974 < 1, I1975 < 1, , , so the state in 1975 is the critical state before the outbreak, the state in 1976 is the outbreak state.
Figure 2

Trend Chart of composite index values from 1973 to 1977.

The influenza A broke out in The United States and Japan in 1986. Meanwhile, many countries in Asia and Europe had the outbreak of influenza A. As shown in Fig. 3, in 1983, the composite index value is 0.449789; 1984 is 0.354632; 1985 is 0.939702; 1986 is 1.166947. I1985 − I1984 = 0.58507 > 0.08, I1984 < 1, I1985 < 1, , so the state in 1985 is the critical state before the outbreak, the state in 1986 is the outbreak state.
Figure 3

Trend Chart of composite index values from 1983 to 1986.

The influenza A broke out in China in 2006. Global influenza pandemic caused by the new influenza A virus in 2009, of which 0.3 million people died (Ren and Gao, 2011, Girard et al., 2010). As shown in Fig. 4, we can observe the composite index degree reached the peak in 2006 and 2009, the composite index degree were slightly higher than the previous year in 2005 and 2008. I2005 − I2004 = 0.318385 > 0.08, I2004 < 1, I2005 < 1, I2006 < 1; I2008 − I2007 = 0.322081 > 0.08, I2007 < 1, I2008 < 1, , so the states in 2005 and 2008 are the critical states before the outbreak, the states in 2006 and 2009 are the outbreak states.
Figure 4

Trend Chart of composite index values from 2003 to 2009.

The influenza A broke out in India in 2015, of which 1.5 thousand people died (Parida et al., 2016). As shown in Fig. 5, in 2012, the composite index value is 0.63573; 2013 is 0.6060092; 2014 is 0.806321; 2015 is 2.516147. The composite index value was slightly higher than the previous year in 2005 and 2008. So we can presume the influenza a virus reach the critical state before break out in 2005 and 2008, and run up to the outbreak state in 2006 and 2009. I2014 − I2013 = 0.273459 > 0.08, I2013 < 1, I2014 < 1, , so the state in 2014 is the critical state before the outbreak, the state in 2015 is the outbreak state.
Figure 5

Trend Chart of composite index values from 2012 to 2015.

In general, the composite index varies along with the state of pandemic influenza, which gives a clue showing the turn point of outbreak. The low value (<1) steady state of the composite index means influenza A is normally in the relatively steady stage. Meanwhile, if the composite index of a certain year increases by more than 0.8 relative to the previous year, and it is less than 1 and it increases sharply and reaches a peak being larger than 1 in next year, it means the year is normally in the critical state before outbreak and the next year is normally in the outbreak state. Therefore, we can predict the outbreak of influenza A and identify the critical state before influenza A outbreak or outbreak state by observing the variation of index value.

Conclusion

We select the data of protein amino acid sequence of pandemic influenza virus between 1933 and 2015 in which only some data in a very few years are absent, and obtain a composite index by using the nature of dynamic network biomarkers. The network markers and other traditional markers cannot provide an early warning signal of the critical state before pandemic outbreak in comparison with dynamic network biomarker. Although the amino acid sequence of each protein will randomly fluctuate, the composite index can still provide reliable, significant early warning information when influenza pandemic is close to the critical state or outbreak state. This fully shows the dynamic network biomarker is more stable and accurate to determine the state in which the pandemic influenza virus, particularly the critical state of pandemic influenza. This will achieve the aim of early warning and then strengthen preventive measures in advance. This is of great significance for the research and warning of pandemic influenza virus.
  12 in total

1.  Emergence of influenza A (H1N1)pdm09 genogroup 6B and drug resistant virus, India, January to May 2015.

Authors:  Manmohan Parida; Paban Kumar Dash; Jyoti S Kumar; Gaurav Joshi; Kundan Tandel; Shashi Sharma; Ambuj Srivastava; Ankita Agarwal; Amrita Saha; Shweta Saraswat; Divyanshi Karothia; Vatsala Malviya
Journal:  Euro Surveill       Date:  2016

2.  Development and validation of therapeutically relevant multi-gene biomarker classifiers.

Authors:  Richard Simon
Journal:  J Natl Cancer Inst       Date:  2005-06-15       Impact factor: 13.506

Review 3.  Biomarkers in cancer staging, prognosis and treatment selection.

Authors:  Joseph A Ludwig; John N Weinstein
Journal:  Nat Rev Cancer       Date:  2005-11       Impact factor: 60.716

4.  Similarity of currently circulating H1N1 virus with the 2009 pandemic clone: viability of an imminent pandemic.

Authors:  Rachana Banerjee; Ayan Roy; Santasabuj Das; Surajit Basak
Journal:  Infect Genet Evol       Date:  2015-02-28       Impact factor: 3.342

Review 5.  The 2009 A (H1N1) influenza virus pandemic: A review.

Authors:  Marc P Girard; John S Tam; Olga M Assossou; Marie Paule Kieny
Journal:  Vaccine       Date:  2010-05-27       Impact factor: 3.641

6.  A monoclonal antibody recognizes a highly conserved neutralizing epitope on hemagglutinin of H6N1 avian influenza virus.

Authors:  Jie-Long He; Ming-Shou Hsieh; Rong-Huay Juang; Ching-Ho Wang
Journal:  Vet Microbiol       Date:  2014-10-25       Impact factor: 3.293

7.  The knowledge-integrated network biomarkers discovery for major adverse cardiac events.

Authors:  Guangxu Jin; Xiaobo Zhou; Honghui Wang; Hong Zhao; Kemi Cui; Xiang-Sun Zhang; Luonan Chen; Stanley L Hazen; King Li; Stephen T C Wong
Journal:  J Proteome Res       Date:  2008-07-30       Impact factor: 4.466

Review 8.  Novel swine-origin influenza virus A (H1N1): the first pandemic of the 21st century.

Authors:  Luan-Yin Chang; Shin-Ru Shih; Pei-Lan Shao; Daniel Tsung-Ning Huang; Li-Min Huang
Journal:  J Formos Med Assoc       Date:  2009-07       Impact factor: 3.282

9.  Cancer bioinformatics: a new approach to systems clinical medicine.

Authors:  Duojiao Wu; Catherine M Rice; Xiangdong Wang
Journal:  BMC Bioinformatics       Date:  2012-05-01       Impact factor: 3.169

10.  Identifying critical transitions and their leading biomolecular networks in complex diseases.

Authors:  Rui Liu; Meiyi Li; Zhi-Ping Liu; Jiarui Wu; Luonan Chen; Kazuyuki Aihara
Journal:  Sci Rep       Date:  2012-12-10       Impact factor: 4.379

View more
  1 in total

1.  Prediction of Seizure Recurrence. A Note of Caution.

Authors:  William J Bosl; Alan Leviton; Tobias Loddenkemper
Journal:  Front Neurol       Date:  2021-05-13       Impact factor: 4.003

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.