Literature DB >> 16289934

A novel fingerprint map for detecting SARS-CoV.

Lei Gao1, Yong-Sheng Ding, Hua Dai, Shi-Huang Shao, Zhen-De Huang, Kuo-Chen Chou.   

Abstract

Spike (S) protein is the most important membrane protein on the surface of severe acute respiratory syndrome coronavirus (SARS-CoV). It associates with cellular receptors to mediate infection of their target cells. Inspired by such a mechanism, an in-depth investigation into the genome sequences of S protein of SARS-CoV and its receptor are conducted thru a mathematical transformation and graphic approach. As an outcome, a novel method for visualizing the characteristic of SARS-CoV is suggested. An extensive comparison among a large number of genome sequences has proved that the characteristic thus revealed is unique for SARS-CoV. As such, the characteristic can be regarded as the fingerprint map of SARS-CoV for diagnostic usage. Moreover, the conclusion has been further supported in a real case in Guangdong province of China. The fingerprint map proposed here has the merits of clear visibility and reliability that can serve as a complementary clinical tool for detecting SARS-CoV, particularly for the cases where the results obtained by the conventional methods are uncertain or conflicted with each other.

Entities:  

Mesh:

Year:  2005        PMID: 16289934      PMCID: PMC7127393          DOI: 10.1016/j.jpba.2005.09.031

Source DB:  PubMed          Journal:  J Pharm Biomed Anal        ISSN: 0731-7085            Impact factor:   3.935


Introduction

Severe acute respiratory syndrome (SARS), which was first recognized in Guangdong province of China in November 2002, emerged as a life-threatening disease associated with pneumonia. A novel coronavirus, called SARS-coronavirus or SARS-CoV, has been proved to be the cause of SARS [1], [2], [3]. Since then, progresses in finding inhibitors against SARA from different angles have been reported (see, e.g. [4], [5], [6], [7], [8], [9]). It is also important to timely and accurately diagnose SARS-CoV, which can help understand the mechanism of causing the disease and optimize the healing treatment. Unfortunately, it is quite slow to use the routine clinical procedures to diagnose SARS-CoV, as illustrated below. SARS-CoV-specific RNA can be detected in various clinical specimens such as blood, stool, respiratory secretions, and body tissues by the polymerase chain reaction (PCR) [3]. Although the existing PCR tests were made available during the outbreak, their sensitivity and specificity were unknown because there were no “gold standard” laboratory or clinical definitions for the diagnosis of SARS [10], [11], [12]. For instance, the SARS-CoV can be detected by inoculating suitable cell cultures (e.g. Vero cells) with patient specimens (such as respiratory secretions) and propagating the virus in vitro, but the problem is: although the positive cell culture results indicate the existence of SARS-CoV in the sample tested; the negative results do not exclude SARS. Again, although the enzyme-linked immunosorbent assay (ELISA), immunofluorescence assay (IFA), and neutralization test (NT) have the strongpoint of convenience, reliability and repeatability, they are unsuitable for the acute illness [12]. Therefore, the existing diagnostic tests lack sufficient sensitivity for clinical usage in timely ruling out SARS. From the emergence of SARS-CoV, many studies have been done on its biological medicine aspects, such as pathogeny characteristics, mechanisms of causing disease, clinic diagnosis and treatment, bacterin development and the spreading rules. At the level of molecule biology, studies have been conducted on its genome sequences’ characteristic, structures and functions of the translated proteins, and the evolution relationships in sequences. SARS-CoV genomes are distinguished by the presence of a single stranded plus-sense RNA genome about 30 kb in length. Eleven open reading frames (ORF) of the viral genome are translated into 23 proteins, including four main structural proteins: Spike (S) protein, Membrane (M) protein, Envelope (E) protein, and Nucleocapsid (N) protein [13], [14], [15]. Research on other known coronaviruses has proved that among the structural proteins of coronaviruses, S protein plays a very important role in virus entry, virus–receptor interactions and their relationship to tropism [16], [17]. The S proteins of coronaviruses are large type-I transmembrane glycoproteins that are responsible for receptor building and membrane fusion. On 26 November 2003, angiotensin-converting enzyme 2 (ACE2) was identified as a functional receptor for the SARS-CoV, and S protein associate with cellular receptors to mediate infection of their target cells. Recently, in virtue of giant human antibody libraries, a human monoclonal antibody, which potently neutralizes SARS-CoV and inhibits syncytia formation between cells expressing the S protein and those expressing the SARS-CoV receptor ACE2, was identified [18]. Research on molecular evolution of the SARS-CoV reveals that the highest rate of mutation is seen in the part of the virus genome coding for the S protein. Research data suggests that through the change of S protein, the early SARS-CoV was under significant pressure to mutate in order to become an efficient human virus [19]. In addition, the research on S protein of SARS-CoV will do great favor to clinic diagnosis and bacterin development [20], [21], [22]. In view of the important roles of S protein in SARS-CoV's infection and evolution, in the current study we will regard S protein as the primary research object, investigating it at a deeper level. The previous studies on virus gene sequences only concern about the structure or property of the virus gene sequence by itself, seldom relating the investigation to the other objects such as the receptor for the virus. Here we shall take into account the interaction of SARS-CoV and its receptor. It has been observed that that after a certain kind of mathematic transformation, there is an obvious symmetric characteristic shown in gene sequences between SARS-CoV S protein segment and human ACE2; however, no such a characteristic shown in gene sequences between non-SARS-CoV S protein segment and human ACE2. The characteristic can be regarded as a fingerprint map and be applied to SARS virus detection, thus providing a simple and intuitive lab detection method to complement the traditional clinical and epidemiological methods.

Methods

Graphical approaches have been successfully used to deal with many biologically interesting problems, such as enzyme kinetics [23], [24], [25], [26], protein folding kinetics [27], analysis of codon usage in E. coli protein coding sequences [28] and HIV protein sequences [29], among many others. The advantages of graphical approach are in providing an intuitive picture, helping investigators visualize a very complicated or abstract problem and catch its essence or signature (see, e.g. [30], [31], [32], [33], [34], [35], [36], [37], [38]). Here let us use graphical approach to study the interaction of SARS-CoV and its receptor ACE2, and see if there is any special feature in the embodiment of their sequences. The mathematic transformation method presented in this paper is based on the graphic approach developed by the previous investigators [28], [29], [39], [40]. Using the graphic approach, or Z-curve method [41], we can obtain the transformed graph of the sequences and find the visualization characteristic between SARS-CoV and its receptor ACE2. Also, we do the same analysis on the S protein of non-SARS-CoV. The Z-curve is a novel method for mapping the DNA or RNA sequence into a folding curve in a three-dimensional space. Originally, the DNA (or RNA) sequences are expressed in terms of a series of four letters A, C, G, and T(U) that may be called the letter sequence representation (LSR) of the DNA (or RNA) sequences. Z-curve representation is one-to-one correspondence; it is a geometrical approach to express the LSR of DNA (or RNA) sequences. Such a geometric form of DNA (or RNA) sequence displays the new characters of sequence like symmetry, periodicity and local motif, thus providing a brand-new method for decryption of DNA or RNA sequence. The method is presented briefly as follows. Consider one strand DNA or RNA sequence with N bases. Suppose that the cumulative numbers of bases A, C, G, and T(U) occurring in this subsequence from the 1st base to the nth base in the sequence are denoted by A , C , G , and T , respectively. The Z-curve is composed of a series of nodes P 0, P 1, P 2, …,  P whose coordinates x , y and z (n  = 0, 1, 2, …,  N) are uniquely determined by the so-called Z-transform of DNA (RNA) sequence:Let us define A 0  =  C 0  =  G 0  =  T 0  = 0. The Z curve is defined as the connection of the nodes P 0, P 1, P 2, …,  P one by one sequentially with straight lines. Define x 0  =  y 0  =  z 0  = 0 so that the Z-curve always starts from the origin of the three-dimensional coordinate system.

Results and discussion

The gene sequences in our experiment with the annotation information were downloaded from the website of NCBI RefSeq project (http://www.ncbi.nih.gov/RefSeq). We have conducted experiments on the following three kinds of sequences: (1) the S protein segments of more than 80 SARS complete gene sequences; (2) human ACE2 sequences (with the accession numbers of AB046569, NM_021804, AF291820, and AY358714); (3) non-SARS coronavirus sequences including house mouse, cotton aphid, Norway rat (with the accession numbers of BC026801, AB053181, AB053182, YSCACE2, XM_228924, XM_136130, AF502082, and AB122152). We analyzed the visualization characteristics of the above sequences. The result, derived from the comparisons among a large number of gene sequences, shows a regular quadrilateral between SARS-CoV S protein segment and human ACE2 (cf. Fig. 1 ). However, there is no such feature between non-SARS coronavirus S protein segment and human ACE2 (cf. Fig. 2 ). Such a unique feature represents a fingerprint map that can be used to identify and distinguish SARS from other infectious diseases, helping the diagnosis of some ambiguous SARS cases.
Fig. 1

The fingerprint map of the SARS-CoV S protein segment formed by the mathematic transformation of Eq. (1). In each sub-graph, the black part in the left is formed by human ACE2 gene sequence while the purplish part in the right formed by S protein segment.

Fig. 2

The representation generated by the same procedure as that of Fig. 1 for the non-SARS coronavirus S protein segment. See the legend of Fig. 1 for further explanation.

The fingerprint map of the SARS-CoV S protein segment formed by the mathematic transformation of Eq. (1). In each sub-graph, the black part in the left is formed by human ACE2 gene sequence while the purplish part in the right formed by S protein segment. The representation generated by the same procedure as that of Fig. 1 for the non-SARS coronavirus S protein segment. See the legend of Fig. 1 for further explanation. Using the fingerprint map, we have analyzed a suspected case in Guangdong province of China. It was quickly found by our method that the S protein of suspected virus did display a regular quadrilateral between the S protein of suspected case and the four kinds of human ACE2 sequences, as shown in Fig. 3 . Such character can help us draw the conclusion that this is a SARS-CoV infected case, which has been confirmed by a series of conventional methods performed later.
Fig. 3

The representation generated for the corresponding segment from a diagnose SARS case in Guangdong province of China. See the legend of Fig. 1 for further explanation.

The representation generated for the corresponding segment from a diagnose SARS case in Guangdong province of China. See the legend of Fig. 1 for further explanation.

Conclusions

Gene sequence carries all the information of an organism. Since the sequence is a one-dimensional, it is not easy to get the genetic information directly from the sequence itself. The transformation from gene sequence to the visualized graph with a unique feature is an effective way to observe and study the characteristics of gene sequences. Taking the interaction of SARS-CoV and its receptor into consideration, we have obtained a unique characteristic of SARS-CoV using the graphic approach. A visualization interactive relationship has been revealed between SARS-CoV and its receptor human ACE2, which does not exist between the non-SARS-CoV and human ACE2. Such a unique feature can be deemed as the fingerprint map of SARS-CoV that can be used to diagnose suspected cases against SARS. The outcome diagnosed by the fingerprint method has been confirmed by a real SARS case in Guangdong province of China. It has not escaped our notice that the proposed fingerprint map method, along with the method based on cellular automata images reported earlier [42], can also be used to the in-depth research for revealing the molecular mechanism of the disease.
  41 in total

1.  Characterization of a novel coronavirus associated with severe acute respiratory syndrome.

Authors:  Paul A Rota; M Steven Oberste; Stephan S Monroe; W Allan Nix; Ray Campagnoli; Joseph P Icenogle; Silvia Peñaranda; Bettina Bankamp; Kaija Maher; Min-Hsin Chen; Suxiong Tong; Azaibi Tamin; Luis Lowe; Michael Frace; Joseph L DeRisi; Qi Chen; David Wang; Dean D Erdman; Teresa C T Peret; Cara Burns; Thomas G Ksiazek; Pierre E Rollin; Anthony Sanchez; Stephanie Liffick; Brian Holloway; Josef Limor; Karen McCaustland; Melissa Olsen-Rasmussen; Ron Fouchier; Stephan Günther; Albert D M E Osterhaus; Christian Drosten; Mark A Pallansch; Larry J Anderson; William J Bellini
Journal:  Science       Date:  2003-05-01       Impact factor: 47.728

2.  Identification of a novel coronavirus in patients with severe acute respiratory syndrome.

Authors:  Christian Drosten; Stephan Günther; Wolfgang Preiser; Sylvie van der Werf; Hans-Reinhard Brodt; Stephan Becker; Holger Rabenau; Marcus Panning; Larissa Kolesnikova; Ron A M Fouchier; Annemarie Berger; Ana-Maria Burguière; Jindrich Cinatl; Markus Eickmann; Nicolas Escriou; Klaus Grywna; Stefanie Kramme; Jean-Claude Manuguerra; Stefanie Müller; Volker Rickerts; Martin Stürmer; Simon Vieth; Hans-Dieter Klenk; Albert D M E Osterhaus; Herbert Schmitz; Hans Wilhelm Doerr
Journal:  N Engl J Med       Date:  2003-04-10       Impact factor: 91.245

3.  Mixtures of tight-binding enzyme inhibitors. Kinetic analysis by a recursive rate equation.

Authors:  P Kuzmic; K Y Ng; T D Heath
Journal:  Anal Biochem       Date:  1992-01       Impact factor: 3.365

4.  An analysis of base frequencies in the anti-sense strands corresponding to the 180 human protein coding sequences.

Authors:  C T Zhang; K C Chou
Journal:  Amino Acids       Date:  1996-09       Impact factor: 3.520

5.  Graphic analysis of codon usage strategy in 1490 human proteins.

Authors:  C T Zhang; K C Chou
Journal:  J Protein Chem       Date:  1993-06

6.  An extension of Chou's graphic rules for deriving enzyme kinetic equations to systems involving parallel reaction pathways.

Authors:  G P Zhou; M H Deng
Journal:  Biochem J       Date:  1984-08-15       Impact factor: 3.857

7.  Graphical rules for enzyme-catalysed rate laws.

Authors:  K C Chou; S Forsén
Journal:  Biochem J       Date:  1980-06-01       Impact factor: 3.857

8.  An ELISA for antibodies against infectious bronchitis virus using an S1 spike polypeptide.

Authors:  C-H Wang; C-C Hong; J C H Seak
Journal:  Vet Microbiol       Date:  2002-04-02       Impact factor: 3.293

9.  Steady-state kinetic studies with the polysulfonate U-9843, an HIV reverse transcriptase inhibitor.

Authors:  I W Althaus; J J Chou; A J Gonzales; R J LeMay; M R Deibel; K C Chou; F J Kezdy; D L Romero; R C Thomas; P A Aristoff
Journal:  Experientia       Date:  1994-01-15

10.  Clinical progression and viral load in a community outbreak of coronavirus-associated SARS pneumonia: a prospective study.

Authors:  J S M Peiris; C M Chu; V C C Cheng; K S Chan; I F N Hung; L L M Poon; K I Law; B S F Tang; T Y W Hon; C S Chan; K H Chan; J S C Ng; B J Zheng; W L Ng; R W M Lai; Y Guan; K Y Yuen
Journal:  Lancet       Date:  2003-05-24       Impact factor: 79.321

View more
  14 in total

1.  Study of peptide fingerprints of parasite proteins and drug-DNA interactions with Markov-Mean-Energy invariants of biopolymer molecular-dynamic lattice networks.

Authors:  Lázaro Guillermo Pérez-Montoto; María Auxiliadora Dea-Ayuela; Francisco J Prado-Prado; Francisco Bolas-Fernández; Florencio M Ubeira; Humberto González-Díaz
Journal:  Polymer (Guildf)       Date:  2009-06-03       Impact factor: 4.430

2.  Remote data retrieval for bioinformatics applications: an agent migration approach.

Authors:  Lei Gao; Hua Dai; Tong-Liang Zhang; Kuo-Chen Chou
Journal:  PLoS One       Date:  2011-06-20       Impact factor: 3.240

3.  New method for global alignment of 2 DNA sequences by the tree data structure.

Authors:  Zhao-Hui Qi; Xiao-Qin Qi; Chen-Chen Liu
Journal:  J Theor Biol       Date:  2009-12-16       Impact factor: 2.691

4.  TN curve: a novel 3D graphical representation of DNA sequence based on trinucleotides and its applications.

Authors:  Jia-Feng Yu; Xiao Sun; Ji-Hua Wang
Journal:  J Theor Biol       Date:  2009-08-11       Impact factor: 2.691

5.  Scoring function for DNA-drug docking of anticancer and antiparasitic compounds based on spectral moments of 2D lattice graphs for molecular dynamics trajectories.

Authors:  Lázaro G Pérez-Montoto; Lourdes Santana; Humberto González-Díaz
Journal:  Eur J Med Chem       Date:  2009-06-17       Impact factor: 6.514

6.  Alignment-free prediction of mycobacterial DNA promoters based on pseudo-folding lattice network or star-graph topological indices.

Authors:  Alcides Perez-Bello; Cristian Robert Munteanu; Florencio M Ubeira; Alexandre Lopes De Magalhães; Eugenio Uriarte; Humberto González-Díaz
Journal:  J Theor Biol       Date:  2008-10-17       Impact factor: 2.691

7.  Classification analysis of dual nucleotides using dimension reduction.

Authors:  Zhao-Hui Qi; Jian-Min Wang; Xiao-Qin Qi
Journal:  J Theor Biol       Date:  2009-05-27       Impact factor: 2.691

8.  Dynamic modeling of cellular response to DNA damage based on p53 stress response networks.

Authors:  Jinpeng Qi; Yongsheng Ding; Shihuang Shao
Journal:        Date:  2009-08-05

9.  Multi-target QPDR classification model for human breast and colon cancer-related proteins using star graph topological indices.

Authors:  Cristian Robert Munteanu; Alexandre L Magalhães; Eugenio Uriarte; Humberto González-Díaz
Journal:  J Theor Biol       Date:  2008-12-06       Impact factor: 2.691

10.  Generalized lattice graphs for 2D-visualization of biological information.

Authors:  H González-Díaz; L G Pérez-Montoto; A Duardo-Sanchez; E Paniagua; S Vázquez-Prieto; R Vilas; M A Dea-Ayuela; F Bolas-Fernández; C R Munteanu; J Dorado; J Costas; F M Ubeira
Journal:  J Theor Biol       Date:  2009-07-29       Impact factor: 2.691

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.