Literature DB >> 28950872

Forecasting influenza A pandemic outbreak using protein dynamical network biomarkers.

Jie Gao1,2, Kang Wang3, Tao Ding3, Shanshan Zhu3.   

Abstract

BACKGROUND: Influenza A virus is prone to mutation and susceptible to human beings and spread in the crowds when affected by the external environment or other factors. It is very necessary to forecast influenza A pandemic outbreak.
METHODS: This paper studies the different states of influenza A in the method of dynamical network biomarkers. Through establishing protein dynamical network biomarkers of influenza A virus protein, a composite index is ultimately obtained to forecast influenza A pandemic outbreak.
RESULTS: The composite index varies along with the state of pandemic influenza virus from a relatively steady state to critical state before outbreak and then to the outbreak state. When the composite index continuous decreases for 2 years and increases of more than o.1 suddenly, it means the next year is normally in the outbreak state. Therefore, we can predict and identify whether a certain year is in the critical state before influenza A outbreak or outbreak state by observing the variation of index value. Meanwhile, through data analysis for different countries influenza A pandemic outbreak in different countries can also be forecasted.
CONCLUSIONS: This indicates the composite index can provide significant warning information to detect the stage of influenza A, which will be significantly meaningful for the warning and prevention of influenza A pandemic.

Entities:  

Keywords:  Influenza A pandemic; Protein dynamical network biomarker (PDNB); The critical state; The outbreak state

Mesh:

Substances:

Year:  2017        PMID: 28950872      PMCID: PMC5615242          DOI: 10.1186/s12918-017-0460-y

Source DB:  PubMed          Journal:  BMC Syst Biol        ISSN: 1752-0509


Background

It is proved that there is a kind of common critical phenomenon in lots of complex biological process, i.e. a relative stable state quickly enters into another state after a critical point in a very short period of time [1, 2]. There is the kind of critical phenomenon for influenza A, because it needs only a very short period of time quickly from a relative stable state to outbreak state after a critical point. Thus in order to timely and effectively prevent and control the outbreak of influenza A pandemic, the key lies in predicting the critical point before the outbreak. At present, influenza A is studied from all aspects. Pan et al. found that the spatio-temporal network that connects the cities with human cases along the order of outbreak timing emerges two-section-power-law edge-length distribution, using the empirical analysis and modeling studies [3]. Chang et al. studied the vaccine for influenza, so as to achieve the effect of prevention of influenza [4]. Banerjee et al. made full comparisons for the structural features of all H1N1 HA gene sequences and the composition of global amino acid to make it possible to depict the developing trend of influenza A [5]. He et al. also made in-depth studies to identify HA protein epitopes of avian influenza virus [6]. This paper studies the different states of influenza A using DNB. Through establishing PDNB of influenza A virus protein and using the nature of DNB, a composite index is ultimately obtained to forecast influenza A pandemic outbreak. The composite index varies along with the state of pandemic influenza virus from a relatively steady state to critical state before outbreak and then to the outbreak state. Therefore, we can predict and identify whether a certain year is in the critical state before influenza A outbreak or outbreak state by observing the variation of index value. This indicates the composite index can provide significant warning information to detect the stage of influenza A, which will be significantly meaningful for the warning and prevention of influenza A pandemic. Meanwhile, through data analysis for different countries influenza A pandemic outbreak in different countries can also be forecasted.

Methods

DNB analysis

The concept of network biomarkers is set up with the development of high-throughput genomic technologies and the systematic and multidimensional study of molecular expression profiling [7, 8]. This concept refers to a series of markers as well as their mutual relations and has been proposed as a new marker type [9]. Compared with traditional biomarkers, these markers can accurately distinguish disease states for taking the links between the molecules into consideration [10, 11]. However, it is used to diagnose the states of diseases, not for the detecting the critical point before the outbreak of diseases. The method of dynamic network biomarkers focuses on the detection and assessment of different stages of the disease in the development of disease and shows it is a time-dependent method [12]. It studies the location changes of the markers over time and the relationship among network markers over time changing and then constructs three-dimensional images showing the interaction relationship between the markers. Therefore the study of Network markers focuses on the molecular interactions and distinguishes normal and disease states, and the study of dynamic network markers focuses on dynamic changes, which is helpful to discover the marker accurately and comprehensively and further to distinguish the state of disease before outbreak. It not only does not depend on the method of small sample excavation mode markers, but also make it easier for clinical application. At the same time it can be used in future studies to find early warning signals in any biological process, such as differentiation, senescence and cell cycle of each phase as well as key change.

Defining PDNB

Firstly, taking hemagglutinin (HA) protein as an example, we suppose that a HA protein marked y is linked sequentially by t numbers of amino acids. Its amino acid sequence is represented as y = x 1 x 2 ⋯ x , in which x ∈ {A, V, L, I, P, F, W, M, D, E, G, S, T, C, Y, N, Q, K, R, H}; i = 1 , 2 ,  ⋯  , t. We suppose s-1-th year have m numbers of influenza virus HA proteins all over the world and its amino acid sequence is represented as y  , y  ,  ⋯  , y . Meanwhile, We suppose s-th year have n numbers of influenza virus HA proteins all over the world and its amino acid sequence is represented as y  , y  ,  ⋯  , y . The amino acid number of the y is marked c ,where i=s-1,s; j = 1 , 2 ,  ⋯  , q;q = max {m, n}. Sequentially selecting the i-th amino acid for y  , y  ,  ⋯  , y to form a new amino acid sequence is defined as Z , and then take out the one of the largest number of amino acids. If the maximum number of amino acids has two or more than two, we take the first amino acid without loss of generality. At the same time, it is marked x , where i = 1 , 2 ,  ⋯  , k;k = max {c , c ,  ⋯ , c }.We individually connect them in order to form a new amino acid sequences (U  = x 1 x 2 ⋯ x ) and then separately compare with corresponding amino acids of y  , y  ,  ⋯  , y one by one. If they are different, the assignment is 1, on the contrary the assignment is 0. Therefore, n new sequences are represented by E  , E  ,  ⋯  , E are obtained in s-th year. Then we calculate their mean (M), standard deviation (SD) and coefficient of variation (CV). Their computation formulas are as follows: where f(s, i) represents the frequency of occurrence of one in sequence E . Similarly, we calculate M, SD and CV of the other nine proteins. The protein that the top three values of CV are defined as core protein (CP), and the others are no-core protein (NP). CP is a set of high confident interactions of proteins, which forms a sub-network called influenza A virus proteins of protein dynamical network biomarkers (see Fig. 1).
Fig. 1

Protein dynamical network biomarkers. The protein that values of CV are the top three are defined as core protein (CP), and the others are no-core protein (NP). CP is a set of high confidence interactions of proteins, which forms a sub-network called the protein dynamical network biomarkers

Protein dynamical network biomarkers. The protein that values of CV are the top three are defined as core protein (CP), and the others are no-core protein (NP). CP is a set of high confidence interactions of proteins, which forms a sub-network called the protein dynamical network biomarkers

Defining forecasting index

The frequencies of the 20 kinds of amino acids can be calculated through the computation formulas as follows:where represents the frequency of occurrence of amino acid x in amino acid sequence y . Therefore, we can get a 23 dimensional characteristic value vector of HA protein. By the same way, the of the other nine proteins can be calculated in turn, so we can get a characteristic value matrix (X=[V 1(s), V 2(s),  ⋯ , V 10(s)]), where V (s) represents the characteristic value vector of the t-th influenza A protein, t = 1 , 2 ,  ⋯  , 10. Defining the characteristic distance between proteins:where v and w respectively represents the v-th and the w-th protein. The core proteins are not only the universal indicators to detect the complex outbreak signal of influenza A, but also the dominant or driving network of the whole protein system in the development, mutation and outbreak of the critical stages. In fact, the dominant network breaks through the limits of variation in the first time, first enters to the state of variation, and then affects other proteins and lead to the transfer of the entire system. Therefore, the determination of the dominant network can not only detect system in the critical state before break out, also help to reveal the underlying mechanism of influenza A virus proteins from the dimension of dynamic network. By combining the above properties of the core proteins, we can get a composite index: Where represents the average value of the core proteins’ CV , is the average value of the characteristic distance between the core proteins, is the average value of the characteristic distance between the core and non-core proteins. When Is-3 > Is-2 > Is-1 and Is-Is-1 > 0.1, it can be concluded that s + 1 year is in the outbreak state. Although the amino acid sequence of each protein will fluctuate randomly, the composite index can provide significant early warning information when the influenza A virus is close to the critical state before the outbreak or the outbreak state.

Results

Forecasting influenza A pandemic outbreak

Ten of proteins for influenza A virus are hemagglutinin (HA), matrix protein, matrix protein 2, neuraminidase, non-structural protein 1, non-structural protein 2, nucleocapsid protein, PA RNA polymerase, PB1 RNA polymerase and PB2 RNA polymerase. They are composed of 20 different amino acids link to form polymers. This paper selects influenza A virus protein sequences from 1934 to September 2016 from the NCBI website (http://www.ncbi.nlm.nih.gov/genomes/FLU/Database/nph-select.cgi?go=database), lots of data before 1934 are absent. As shown in Table 1, by using the above methods to calculate the composite index of the 1934 to September 2016. However, we can’t figure out the composite index of some years, because some data in 1937–1942, 1944–1945, 1952–1956 years are absent.
Table 1

Composite index values from 1934 ~ 2016

YearIYearIYearI
19340.72322719710.72851619950.164168
19350.96208319722.37932219960.611397
19360.4340319730.7988819970.711091
19430.54305919740.52783519980.629408
19460.44186619750.80129419990.781102
19470.85160419762.27551920000.710281
19480.44809219772.18215720010.295353
19490.76029319780.43816920020.660193
19500.73434919790.3269720030.465805
19511.02734119800.74608220040.45421
19570.85972819810.45563220050.772595
19580.94928119820.65045420061.595902
19590.47486619830.44978920070.476057
19600.55081119840.35463220080.798138
19610.70877219850.93970220091.344778
19620.0807819861.16694720100.962241
19630.98001219870.65597120110.740067
19640.65085419881.03368120120.635735
19650.52720119890.91252720130.606009
19660.50045219901.89865620140.806321
19670.66678319911.24881820152.516147
19682.3127119921.03240120160.843627
19691.08125719931.187957
19700.40580519941.225976
Composite index values from 1934 ~ 2016

Forecasting influenza A pandemic outbreak in pandemic occurrence place

Through influenza A virus protein data analysis for different countries influenza A pandemic outbreak in different countries can also be forecasted. Take China as an example, this paper selects influenza A virus protein sequences occurred in China from the NCBI website to forecast influenza A pandemic outbreak in China. Whereas lots of data in 1954–1956, 1958–1963, 1965, 1967 years are absent, all data before 1954 and in 2016 year are absent. As shown in Table 2, by using the above methods to calculate the composite index of the 1954 to 2015. However, we can’t figure out the composite index of some years, because lots of data in 1954–1956, 1958–1963, 1965, 1967 years are absent, all data before 1954 and in 2016 year are absent.
Table 2

Composite index values in China from 1957 ~ 2015

YearIYearIYearI
19570.83593819820.56694719990.702834
19640.46471219830.48963220000.718456
19660.57142519840.46734620010.645986
19682.19821719850.89653720020.673956
19691.21756319861.02547120030.653958
19700.64564319870.63748520040.543824
19710.75645119880.65387320050.854835
19722.18953419890.71285420061.632657
19730.81574619900.76236820070.549367
19740.77635719910.77254320080.924375
19750.78034619920.80573420092.184658
19760.85562819930.78475620100.843569
19770.90276519940.81236720110.483967
19780.5368471995076534720120.513975
19790.42257619960.71435820130.563954
19800.46825719970.72685620140.606837
19810.49662919980.68354620150.663854
Composite index values in China from 1957 ~ 2015

Discussion

The dynamic network markers of Pandemic influenza virus vary in the whole process from a relatively stable state to the critical state before outbreak as well as the outbreak state, which results in the status transfer of the entire network and finally results in fluctuations in the composite index. Therefore, by observing the transformation of the composite index, we can predict the critical state before the outbreak of pandemic influenza and the outbreak state. The flu broke out in Hong Kong in 1968 and continued until 1969, of which 7.5 million people died. In 1972, influenza broke out in Henan Province and quickly spread to the entire province. As shown in Fig. 2, in 1964, the composite index value is 0.650854, 1965 is 0.527201; 1966 is 0.500452; 1967 is 0.666783; 1968 is 2.31271; 1969 is 1.081257; 1970 is 0.405805; 1971 is 0.728516. Because I1964 > I1965 > I1966 and I1967-I1966 > 0.1, 1968 is in the outbreak state. Similarly, I1968 > I1969 > I1970 and I1971-I1970 > 0.1, so 1972 is in the outbreak state.
Fig. 2

Trend Chart of composite index values from 1964 ~ 1972. Horizontal axis represents the year from 1964 ~ 1972, vertical axis represents the composite index value

Trend Chart of composite index values from 1964 ~ 1972. Horizontal axis represents the year from 1964 ~ 1972, vertical axis represents the composite index value The influenza A broke out in The United States, Russia and Japan in 1976 and 1977. Although the prevalence of this flu was typical of the outbreak, adults were slightly infected, and the incidence rate was very high in young people. As shown in Fig. 3, in 1972, the composite index value is 2.379322; 1973 is 0.79888; 1974 is 0.527835; 1975 is 0.801294. I1972 > I1973 > I1974 and I1975-I1974 > 0.1, so 1976 is in the outbreak state.
Fig. 3

Trend Chart of composite index values from 1972 ~ 1976. Horizontal axis represents the year from 1972 ~ 1976, vertical axis represents the composite index value

Trend Chart of composite index values from 1972 ~ 1976. Horizontal axis represents the year from 1972 ~ 1976, vertical axis represents the composite index value The influenza A broke out in The United States and Japan in 1986. Meanwhile, many countries in Asia and Europe had the outbreak of influenza A. As shown in Fig. 4, in 1982, the composite index value is 0.650454; 1983 is 0.449789; 1984 is 0.354632; 1985 is 0.939702. I1982 > I1983 > I1984 and I1985-I1984 > 0.1, so 1986 is in the outbreak state.
Fig. 4

Trend Chart of composite index values from 1982 ~ 1986. Horizontal axis represents the year from 1982 ~ 1986, vertical axis represents the composite index value

Trend Chart of composite index values from 1982 ~ 1986. Horizontal axis represents the year from 1982 ~ 1986, vertical axis represents the composite index value The influenza A broke out in China in 2006. Global influenza pandemic caused by the new influenza A virus in 2009, of which 0.3 million people died [13, 14]. As shown in Fig. 5, in 2002, the composite index value is 0.660193; 2003 is 0.465805; 2004 is 0.45421; 2005 is 0.772595; 2006 is 1.595902; 2007 is 0.476057; 2008 is 0.798138. I2002 > I2003 > I2004 and I2005-I2004 > 0.1, I2006 > I2007 and I2008-I2007 > 0.1, so 2006 is in the outbreak state. Although I2005 is not larger than I2006, 2006 is outbreak year and other conditions are in line, so there is still the outbreak state in 2009.
Fig. 5

Trend Chart of composite index values from 2002 ~ 2009. Horizontal axis represents the year from 2002 ~ 2009, vertical axis represents the composite index value

Trend Chart of composite index values from 2002 ~ 2009. Horizontal axis represents the year from 2002 ~ 2009, vertical axis represents the composite index value The influenza A broke out in India in 2015, of which 1.5 thousand people died [15]. As shown in Fig. 6, in 2011, the composite index value is 0.740067; 2012 is 0.63573; 2013 is 0.6060092; 2014 is 0.806321. I2011 > I2012 > I2013 and I2014-I2013 > 0.1, so 2015 is in the outbreak state.
Fig. 6

Trend Chart of composite index values from 2011 ~ 2016. Horizontal axis represents the year from 2011 ~ 2016, vertical axis represents the composite index value

Trend Chart of composite index values from 2011 ~ 2016. Horizontal axis represents the year from 2011 ~ 2016, vertical axis represents the composite index value In general, the composite index varies along with the state of pandemic influenza virus from a relatively steady state to critical state before outbreak and then to the outbreak state. When the composite index continuous decreases for 2 years and increases of more than o.1 suddenly, it means the next year is normally in the outbreak state. Therefore, we can predict and identify whether a certain year is in the critical state before influenza A outbreak or outbreak state by observing the variation of index value. Take China as an example. The flu broke out in Hong Kong in 1968 and continued until 1969, of which 7.5 million people died. In 1972, influenza broke out in Henan Province and quickly spread to the entire province. As shown in Table 2, the data in 1965 and 1967 are absent, so we cannot forecast. 1968 is 2.198217; 1969 is 1.217563; 1970 is 0.645643; 1971 is 0.756451. I1968 > I1969 > I1970 and I1971-I1970 > 0.1, so 1972 is in the outbreak state. Many countries in Asia including China had the outbreak of influenza A in 1986. As shown in Table 2, 1982 is 0.566947; 1983 is 0.489632; 1984 is 0.467346; 1985 is 0.896537. I1982 > I1983 > I1984 and I1985-I1984 > 0.1, so 1986 is in the outbreak state. The influenza A broke out in China in 2006 and 2009. As shown in Table 2, 2002 is 0.673956; 2003 is 0.653958; 2004 is 0.543824; 2005 is 0.854835; 2006 is 1.632657; 2007 is 0.549367; 2008 is 0.924375. I2002 > I2003 > I2004 and I2005-I2004 > 0.1, I2006 > I2007 and I2008-I2007 > 0.1, so 2006 and 2009 are in the outbreak state.

Conclusions

We select the data of protein amino acid sequence of pandemic influenza virus between 1934 and September 2016, and the different countries’ data such as China’s data between 1957 and 2015 in which only some data in a very few years are absent, and obtain a composite index by using PDNB. Although the amino acid sequence of each protein will randomly fluctuate, the composite index can still provide reliable, significant early warning information when influenza pandemic is close to the critical state or outbreak state. The network markers and other traditional markers cannot provide an early warning signal of the critical state before pandemic outbreak in comparison with dynamic network biomarker. This fully shows the dynamic network biomarker is more stable and accurate to determine the state in which the pandemic influenza virus, particularly the critical state of pandemic influenza. This will achieve the aim of early warning and then strengthen preventive measures in advance. This is of great significance for the research and warning of pandemic influenza virus.
  12 in total

1.  Emergence of influenza A (H1N1)pdm09 genogroup 6B and drug resistant virus, India, January to May 2015.

Authors:  Manmohan Parida; Paban Kumar Dash; Jyoti S Kumar; Gaurav Joshi; Kundan Tandel; Shashi Sharma; Ambuj Srivastava; Ankita Agarwal; Amrita Saha; Shweta Saraswat; Divyanshi Karothia; Vatsala Malviya
Journal:  Euro Surveill       Date:  2016

2.  Development and validation of therapeutically relevant multi-gene biomarker classifiers.

Authors:  Richard Simon
Journal:  J Natl Cancer Inst       Date:  2005-06-15       Impact factor: 13.506

Review 3.  Biomarkers in cancer staging, prognosis and treatment selection.

Authors:  Joseph A Ludwig; John N Weinstein
Journal:  Nat Rev Cancer       Date:  2005-11       Impact factor: 60.716

4.  Similarity of currently circulating H1N1 virus with the 2009 pandemic clone: viability of an imminent pandemic.

Authors:  Rachana Banerjee; Ayan Roy; Santasabuj Das; Surajit Basak
Journal:  Infect Genet Evol       Date:  2015-02-28       Impact factor: 3.342

Review 5.  The 2009 A (H1N1) influenza virus pandemic: A review.

Authors:  Marc P Girard; John S Tam; Olga M Assossou; Marie Paule Kieny
Journal:  Vaccine       Date:  2010-05-27       Impact factor: 3.641

6.  A monoclonal antibody recognizes a highly conserved neutralizing epitope on hemagglutinin of H6N1 avian influenza virus.

Authors:  Jie-Long He; Ming-Shou Hsieh; Rong-Huay Juang; Ching-Ho Wang
Journal:  Vet Microbiol       Date:  2014-10-25       Impact factor: 3.293

7.  The knowledge-integrated network biomarkers discovery for major adverse cardiac events.

Authors:  Guangxu Jin; Xiaobo Zhou; Honghui Wang; Hong Zhao; Kemi Cui; Xiang-Sun Zhang; Luonan Chen; Stanley L Hazen; King Li; Stephen T C Wong
Journal:  J Proteome Res       Date:  2008-07-30       Impact factor: 4.466

Review 8.  Novel swine-origin influenza virus A (H1N1): the first pandemic of the 21st century.

Authors:  Luan-Yin Chang; Shin-Ru Shih; Pei-Lan Shao; Daniel Tsung-Ning Huang; Li-Min Huang
Journal:  J Formos Med Assoc       Date:  2009-07       Impact factor: 3.282

9.  Cancer bioinformatics: a new approach to systems clinical medicine.

Authors:  Duojiao Wu; Catherine M Rice; Xiangdong Wang
Journal:  BMC Bioinformatics       Date:  2012-05-01       Impact factor: 3.169

10.  Identifying critical transitions and their leading biomolecular networks in complex diseases.

Authors:  Rui Liu; Meiyi Li; Zhi-Ping Liu; Jiarui Wu; Luonan Chen; Kazuyuki Aihara
Journal:  Sci Rep       Date:  2012-12-10       Impact factor: 4.379

View more
  1 in total

1.  Construction of Influenza Early Warning Model Based on Combinatorial Judgment Classifier: A Case Study of Seasonal Influenza in Hong Kong.

Authors:  Zi-Xiao Wang; James Ntambara; Yan Lu; Wei Dai; Rui-Jun Meng; Dan-Min Qian
Journal:  Curr Med Sci       Date:  2022-01-04
  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.