Literature DB >> 15629038

Analyzing the biology on the system level.

Wei Tong1.   

Abstract

Although various genome projects have provided us enormous static sequence information, understanding of the sophisticated biology continues to require integrating the computational modeling, system analysis, technology development for experiments, and quantitative experiments all together to analyze the biology architecture on various levels, which is just the origin of systems biology subject. This review discusses the object, its characteristics, and research attentions in systems biology, and summarizes the analysis methods, experimental technologies, research developments, and so on in the four key fields of systems biology--systemic structures, dynamics, control methods, and design principles.

Entities:  

Mesh:

Year:  2004        PMID: 15629038      PMCID: PMC5172474          DOI: 10.1016/s1672-0229(04)02002-9

Source DB:  PubMed          Journal:  Genomics Proteomics Bioinformatics        ISSN: 1672-0229            Impact factor:   7.691


Introduction

The notion of system-level understanding in biology has a long history 1., 2., but has not received renewed main attention until recently. The cause to accelerate such a change is the expansion of genome sequencing. Since the first genome sequence of Haemophilus influenzae was published in 1995 (, we have received many complete genome sequences, including Escherichia coli, Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster, Arabidopsis thaliana, Oryza Sativa, and Homo Sapiens, which help us to obtain the whole static information of genes, regulatory regions, proteins, tRNAs, repeats and so on underlying DNA sequences—the ultimate depository. The development of other technologies also provides us a comprehensive dynamic data: DNA microarrays can practically measure mRNA expression responses of genes (; technologies for globally and quantitatively measuring protein expression are also becoming feasible (; and the two-hybrid system is enabling the construction of protein interaction maps (. All of these developments make the biological research transfer from data-poor state to data-rich state, which provide us a golden opportunity to analyze the biology on the system level. Biologists have found that many biological functions and diseases can’t be explained by the function of individual gene or protein. Instead, they should be the exhibition of an interactional network of protein-protein or proteins with other molecules 7., 8.. They also found that some particular characteristics of biological system could display perfect adaptation and homeostatic regulation despite great changes in the environment or alterations in the internal parameters of the system 9., 10.. Such robustness undoubtedly is the result of the organism’s evolution in a long time to adapt the environmental changes. To understand the magic biological functions and robustness of the system network, researchers are required to integrate the information of genome, mRNA expression, proteome, protein with protein or other molecule interactions, and so on from the system-level view, and to analyze the composition of biological system on various levels, interactions among modules, system dynamics, underlying control methods and design principles. Systems biology is an integrated process of computational modeling, system analysis, laboratory technologies, and quantitative experiments, which analyzes systemic components, basic modules, interactions among components and modules, as well as control methods, design principles, and so on. The development in this subject will greatly accelerate our in-depth understanding to the complex biological phenomena, and significantly increase the efficiency of drug discovery 11., 12.. This review describes the object, its characteristics, and research attentions in systems biology, and summarizes the analysis methods, experimental technologies, research developments, and so on in the four key fields of systems biology—systemic structure, dynamics, control methods, and design principles, which are suggested by Hiroaki Kitano (.

The Object of Systems Biology

The research object of systems biology is the whole biological system, including genes, RNAs, proteins, metabolics and cells in a narrow sense, and organs, organism and ecological system in a broad sense. A simple complexity pyramid (Figure 1; ref. ) can summarize the components of biological system on different levels in a narrow sense.
Fig. 1

The complexity pyramid of biological system by Oltvai and Barabasi (. At the lowest level, the various molecular components of genes, RNAs, proteins, and metabolites form genetic-regulatory motifs or metabolic pathways (Level 2), which in turn are the building blocks of functional modules (Level 3). These modules are nested, generating a scale-free hierarchical architecture (Level 4).

Characteristics of Biological System

Complexity

Although Vicsek thinks that the concept of complexity is currently abused and “it’s almost an empty statement to say that a system is complex because almost every real system is inherently complicated” (, the biological system is really complex, no matter from the diversity of the system’s components, the hierarchy of the network’s interactions, or the various biological functions displayed by the system, if we don’t emphasize the accuracy of the concept firstly. Carlson et al. compared the complexity between biological systems and their modern technological counterparts, such as central processing unit (CPU) and jet aircraft (.

Robustness

Robustness refers to the preservation of particular characteristics despite uncertainty in components or the environment (. Among many questions in systems biology, one issue receiving considerable attention is how robustness is achieved and how it evolves within various aspects of biological systems. For example, the precision of adaptation in the chemotaxis of E. coli is robust (; Eldar et al. found that the activation gradient of bone morphogenic protein (BMP) of Drosophila was robust to changes in gene dosage (, and the segment polarity network of Drosophila was also a robust development module (. The biological system uses at least the following four methods to attain the robustness aspect: a form of system control, redundancy, structure stability, and modularity, which are also applied widely in the engineering system (.

Evolution

Evolution is just the main difference between the biological system and the engineering one: the targets’ values or statuses are received initially on the engineering system design, whereas in biological system such targets are formed by evolution (.

Key Fields in Systems Biology

For a biological system, the system-level understanding can be derived from insight into the following four key fields—system structure, dynamics, control methods and design principles. Process in any of the four fields requires breakthroughs in our understanding of computational sciences, genomics, measurement technologies, and integration of such discoveries with existing knowledge (. The following will describe the analysis methods and developments in the four key fields of systems biology.

System structure

The research field of system structure determines the components, their relationships, and modules of the biological system, as well as builds the whole static network of system. It is the foundation of the whole systems biology research. Only when we have got an understanding of the system structure, can we then investigate the system dynamics, underlying control methods and design principles. In fact, the current research on system dynamics is mainly focusing on the structure-known systems. 1. Classifying the genome-scale genes and distributing them to various modules based on the similarity of gene expression profile, which can accelerate the determination of the biological network’s structure. The foundation of such a method is that co-expressed genes are likely to encode proteins that participate in a common structural complex, metabolic pathway, or biological process 20., 21.. There have been many clustering algorithms to classify the large-scale expression profile data produced by DNA microarray 22., 23., 24., 25., 26., 27.. For the issues about the low accuracy and high error rate of repeat experiments in the data produced by DNA microarray 28., 29., 30., Kalir et al. measured promoter activity in the flagellar system of E. coli at high accuracy and temporal resolution by means of green fluorescent protein (GFP) reporter plasmid, and ordered the genes in a flagella pathway by analyzing expression kinetics (. Extended research is to identify the cis-regulatory elements associated with the genes in the same group (, and search the genome-wide distribution of such cis-regulatory elements to complement the genes in one module given by the DNA microarray profile data (. Recent studies lead to the view that, in contrast to a simple linear assembly line, a complex and extensively coupled network has evolved to coordinate the activities of the gene expression machines (Figure 2; ref. ).
Fig. 2

A complex network of coupled interactions in eukaryotic gene expression by Maniatis and Reed (.

The genes classifying method by expression similarity still has some shortages: all the biological processes are interlocked, and proteins may play multiple cellular roles, which may produce errors in the association between genes’ expression profiles and their functions. So Ker-Chau Li presented a co-expression dynamics theory to try to solve this issue (. 2. Determining the proteins’ relationship to build a protein-protein interaction network. Protein interactions on a proteome-wide scale have already been analyzed in the methods of yeast two-hybrid assay 36., 37., protein chips (, and so on. Yeast two-hybrid assay is a high-throughput method of mapping pair-wise protein interactions on a large scale, and has been applied in the construction of large-scale proteins interaction networks of S. cerevisiae 39., 40., C. elegans (, D. melanogaster (, etc. Gavin et al. ( and Ho et al. ( took a more effective approach to identify protein complexes that contain three or more proteins. They attached tags to hundreds of different proteins as bait proteins, and allowed such baits to be expressed in the cell and form physiological complexes with other proteins, then fished out the entire complex by extracting the tag on the bait protein. At the end, the proteins extracted with the tagged bait were identified using the standard mass-spectrometry method. Figure 3 represents the protein complex network built by Gavin et al. in such a way.
Fig. 3

The yeast protein complex network, and grouping of connected complexes by Gavin et al. (.

3. Distributing certain genes to some modules that govern the biological functions or phenotypes by measuring the phenotypic aspect of haploid deletion strains. Jorgensen et al. obtained the five most potent genes and the corresponding complex network that govern the critical cell size in S. cerevisiae ( by determining the cell size distributions for the complete set of ~6000 yeast gene deletion strains (. This method is more directly than other ways to determine the system functional modules, for it directly measures the phenotypic aspects of certain gene deletion strains. Its shortages are that it can only be used to analyze the modules whose phenotypic characteristics are easy to measure, and it can’t consider the genes whose absence would lead to the cell’s death. 4. Analyzing the biological interaction network to distribute the proteins into different modules with graph theory, topological structure analysis, etc. Depending on the more reliable 11,855 interactions among 2,617 yeast proteins evaluated by von Mering et al. (, Bu et al. identified topological structures of such proteins’ interaction network (quasi-cliques and quasi-bipartites) to classify the proteins to different groups by a spectral analysis method. They found that the proteins within the same group share similar biological functions (. Because interacting proteins usually localize in the same subcellular compartments (, Rives et al. integrated the high-throughput yeast protein-protein interaction data 39., 40. and the corresponding data of protein localization in cellular compartments ( to identify modules of yeast cellular networks by a network-clustering method, which was validated by clustering known signaling-protein modules (.

System dynamics

The research field of system dynamics is to measure the components’ variety (for example, the mRNAs’ expression profiles) in a time scale or different conditions for a system with structure mainly known, choose certain mathematical model, list corresponding equations to describe the model, determine the dynamic parameters based on the kinetic data, and at last compare the prediction data with the experimental data, and if needed, revise the model. In fact, the dynamic analysis can also give useful predictions and complements to the system structure. For example, Ideker et al. integrated the methods of DNA microarrays, proteome, database searching and so on to analyze the pathway of yeast galactose utilization, and complemented some interactions to the metabolic network (. 1. Choosing a model. The choice of certain mathematical model is a key step for system dynamic analysis, which depends on the availability of the corresponding biology knowledge to compare with the scope and abstraction level of the model. With unknown rate constants for particular reactions, the constraints-based models can be adopted utilizing only the network structure information 52., 53., 54.. Because of the large number of unknown kinetic parameters and computational complexity, genome-scale models of metabolism or other biological systems have only been analyzed with the constrains-based modeling philosophy (. With some known steady-state rate constants, many modeling approaches can be used to model cellular processes and to analyze how system behavior changes and investigate dynamic aspects. These approaches include kinetic 56., 57., stochastic 58., 59., and cybernetic ones 60., 61.. Bifurcation analysis can provide a detailed illustration of dynamic behavior 62., 63.. 2. Determining the kinetic parameters. After building the model and listing the corresponding equations, the next step is to specify the kinetic parameters for various regulations. We need to measure the system elements’ varying profiles, and give the prediction profiles by model simulations, then determine the kinetic parameters by minimizing the difference between the measured profiles and the predicted ones. Ronen et al. developed a system for real-time monitoring of the transcriptional activity of operons in the SOS DNA repair system of E. coli by means of low-copy reporter plasmid, in which a promoter controls the GFP, described the subsystem by the Michaelis-Menten model, and determined the corresponding dynamic parameters (. The model for circadian oscillations in Drosophila was described by a set of ten kinetic equations 65., 66., 67., and Leloup et al. extended the rhythm model to mammals, which was governed by sixteen kinetic equations (.

Control methods

1. Determining the central components, modules or aspects of the system on different levels. After the system network is constructed, the next important analysis is to address the key components in the system, which can be the candidates for the drug targets, and the control methods, such as feedback and feedforward controls, to extend our understanding to the system. The quantitative analysis in the perturbation to protein interactions of S. cerevisiae has demonstrated that the most highly connected proteins in the cell are the most important for its survival (. Stelling et al. utilized the elementary-mode analysis to simultaneously predict the key aspects of network functionality, robustness and gene regulation of E. coli central metabolism (. 2. Exploring the relationship between the changes of system parameters or components and the variety of system’s phenotypes and functions. This research is an important application field for systems biology, which explores the variety of system’s components or parameters and the organism’s phenotypes, analyses the corresponding control methods, and gives effective predictions. On the sub-cell level: Lev Bar-Or et al. modeled the oscillations of p53 and Mdm2 proteins by a mathematical model to describe the p53-Mdm2 feedback loop (; Ideker et al. integrated genomic and proteomic methods to systematically perturb and analyze the yeast galactose utilization pathway (; methods in computational biology contribute to clarify the molecular and dynamical bases of cellular rhythms (; Davidson et al. gained some properties of the Strongylocentrotus purpuratus’s endomesoderm development process by analyzing the gene regulatory network, and tried to explain the developing phenotype (. On the cell level: Jamshidi et al. developed a mathematical application package to perform dynamic simulations of the metabolic network of human red blood cell (; the E-Cell project is developed to visualize and dynamically simulate the cell (; and the Alliance for Cellular Signaling (AfCS) is systematically exploring the signaling networks in two types of mouse cells—B lymphocytes and cardiac muscle cells 76., 77., 78.. At the organ level, the whole organ of heart can be modeled to explore its functionality (.

Design principles

The research field of design principles is to explore the principles of creating organisms by “God”, and is the highest level of systems biology. However, we haven’t uncovered the system-scale dynamics, let alone comprehensive control methods, so it is a long way to understand the system design principles underlying them. Presently there are the following ways applied in this field: 1. Exploring the similarity of system-level design between the engineering system and the biological one by the knowledge in our system design of modern engineering. Csete et al. described insights from engineering theory and practice that can shed some light on biological complexity (. Strogatz explored the topology of complex networks, such as food web, electric power grid, and regulatory network in cell (. 2. Analyzing the system network to determine some system aspects and provide clues to clarify the system design principles in the end. The interaction network of yeast proteins has been shown to have a nonrandom power-law distribution of node connectivity (number of interactions of each protein) and a low frequency of direct connections between high-connectivity nodes (. Shen-Orr et al. found that much of the transcriptional regulation network of E. coli is composed of repeated appearances of three highly significant motifs, the feedforward loop, single input module and densely overlapping regulons, each of which has a specific function in determining the gene expression (. Network motif was analyzed in complex networks, such as that of biochemistry, neurobiology, ecology, and engineering, to uncover their structural design principles (.

Attentions to the Systems Biology Research

1. The comprehensiveness in measurements. The high throughput, systemic and accurate measurements needed by systems biology research require the comprehensiveness in three aspects, that is, factor, time-line, and item. The factor comprehensiveness reflects almost all the mRNAs, proteins, and other components in the biological system. The time-line comprehensiveness represents the time frame within which measurements are made. And the item comprehensiveness refers to the simultaneous measurement of multiple items, such as mRNA and protein concentrations, phosphorylation, localization, and so forth (. 2. The breakthroughs in experimental technology. Completing the system-level analysis of biological regulation requires high throughput and exact measurements for goals that are perhaps beyond the scope of current experimental practices. Technical innovations in experimental devices, such as single-molecule measurements, femto-lasers that permit visualization of molecular interactions, and nano-technologies, are critical aspects of systems biology research (. 3. The building of mathematical model. When choosing certain model, it is important to firstly consider the purpose of model building: whether it is to obtain an in-depth understanding of system behavior or to predict complex behavior in response to complex stimuli, and we must first define the scope and abstraction level of the model (. 4. The development of standard programming language to describe the biological models and the corresponding software. It is very important to represent various biological models in a universal programming language for the information’s integration and the effective utilization of resource, which can benefit the researchers to exchange models between different model-building software, and enable them to reuse components from one model to another, thus accelerating model building. The Systems Biology Mark-up Language (SBML; ref. ) and CellML ( are trying to define a standard for an XML-based machine-readable model definition. The Systems Biology Workbench (SBW; ref. ) built on SBML provides a framework of modular open-source software for systems biology research. 5. The development of corresponding databases. There have built some databases concerned with biological pathways, interactions and so on, such as the Kyoto Encyclopedia of Genes and Genomes (KEGG; ref. ), Alliance for Cellular Signaling (AfCS; ref. ), Munich Information Center for Protein Sequences (MIPS; ref. ), and Signal Transduction Knowledge Environment (STKE; ref. ).

Conclusion

In the past tens of years, biologists have always tried to locate, clone, and express various genes, and to associate individual gene’s role with some biological functions or phenotypes, and have explained some system behavior by the characteristics of components, which in fact belong to the reductionism. These methods have produced many achievements and are still important, whereas during the progress of our research and the accumulation of genomic, proteomic and dynamic measuring data nowadays, we should exceed reductionism ( and focus on the system-level research of biology, which includes the above fields of system structure, dynamics, control methods and design principles. Because the biological system is not just a simple assembly of genes, proteins and other molecules, many of its properties cannot be fully understood by merely listing the whole components of the system and drawing maps of their interactions. If molecular biology, genomics, and proteomics have provided us some knowledge of individual components’ roles and system behavior, and listed for us most components of the system, now it is time to put them back to the system, and analyze the systemic hierarchy, dynamics, control methods and design principles.
  85 in total

1.  Analysis of p53-regulated gene expression patterns using oligonucleotide arrays.

Authors:  R Zhao; K Gish; M Murphy; Y Yin; D Notterman; W H Hoffman; E Tom; D H Mack; A J Levine
Journal:  Genes Dev       Date:  2000-04-15       Impact factor: 11.361

2.  The ERATO Systems Biology Workbench: enabling interaction and exchange between software tools for computational biology.

Authors:  M Hucka; A Finney; H M Sauro; H Bolouri; J Doyle; H Kitano
Journal:  Pac Symp Biocomput       Date:  2002

3.  Overview of the Alliance for Cellular Signaling.

Authors:  Alfred G Gilman; Melvin I Simon; Henry R Bourne; Bruce A Harris; Rochelle Long; Elliott M Ross; James T Stull; Ronald Taussig; Henry R Bourne; Adam P Arkin; Melanie H Cobb; Jason G Cyster; Peter N Devreotes; James E Ferrell; David Fruman; Michael Gold; Arthur Weiss; James T Stull; Michael J Berridge; Lewis C Cantley; William A Catterall; Shaun R Coughlin; Eric N Olson; Temple F Smith; Joan S Brugge; David Botstein; Jack E Dixon; Tony Hunter; Robert J Lefkowitz; Anthony J Pawson; Paul W Sternberg; Harold Varmus; Shankar Subramaniam; Robert S Sinkovits; Joshua Li; Dennis Mock; Yuhong Ning; Brian Saunders; Paul C Sternweis; Donald Hilgemann; Richard H Scheuermann; Dianne DeCamp; Robert Hsueh; Keng-Mean Lin; Yan Ni; William E Seaman; Paul C Simpson; Timothy D O'Connell; Tamara Roach; Melvin I Simon; Sangdun Choi; Pamela Eversole-Cire; Iain Fraser; Marc C Mumby; Yingming Zhao; Deirdre Brekken; Hongjun Shu; Tobias Meyer; Grischa Chandy; Won Do Heo; Jen Liou; Nancy O'Rourke; Mary Verghese; Susanne M Mumby; Heping Han; H Alex Brown; Jeffrey S Forrester; Pavlina Ivanova; Stephen B Milne; Patrick J Casey; T Kendall Harden; Adam P Arkin; John Doyle; Martha L Gray; Tobias Meyer; Stephen Michnick; Martin A Schmidt; Mehmet Toner; Roger Y Tsien; Madhusudan Natarajan; Rama Ranganathan; Gilberto R Sambrano
Journal:  Nature       Date:  2002-12-12       Impact factor: 49.962

4.  Metabolic network structure determines key aspects of functionality and regulation.

Authors:  Jörg Stelling; Steffen Klamt; Katja Bettenbrock; Stefan Schuster; Ernst Dieter Gilles
Journal:  Nature       Date:  2002-11-14       Impact factor: 49.962

5.  Systems biology. Life's complexity pyramid.

Authors:  Zoltán N Oltvai; Albert-László Barabási
Journal:  Science       Date:  2002-10-25       Impact factor: 47.728

6.  Network motifs: simple building blocks of complex networks.

Authors:  R Milo; S Shen-Orr; S Itzkovitz; N Kashtan; D Chklovskii; U Alon
Journal:  Science       Date:  2002-10-25       Impact factor: 47.728

7.  Functional organization of the yeast proteome by systematic analysis of protein complexes.

Authors:  Anne-Claude Gavin; Markus Bösche; Roland Krause; Paola Grandi; Martina Marzioch; Andreas Bauer; Jörg Schultz; Jens M Rick; Anne-Marie Michon; Cristina-Maria Cruciat; Marita Remor; Christian Höfert; Malgorzata Schelder; Miro Brajenovic; Heinz Ruffner; Alejandro Merino; Karin Klein; Manuela Hudak; David Dickson; Tatjana Rudi; Volker Gnau; Angela Bauch; Sonja Bastuck; Bettina Huhse; Christina Leutwein; Marie-Anne Heurtier; Richard R Copley; Angela Edelmann; Erich Querfurth; Vladimir Rybin; Gerard Drewes; Manfred Raida; Tewis Bouwmeester; Peer Bork; Bertrand Seraphin; Bernhard Kuster; Gitte Neubauer; Giulio Superti-Furga
Journal:  Nature       Date:  2002-01-10       Impact factor: 49.962

8.  Cluster analysis and display of genome-wide expression patterns.

Authors:  M B Eisen; P T Spellman; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  1998-12-08       Impact factor: 11.205

9.  Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization.

Authors:  P T Spellman; G Sherlock; M Q Zhang; V R Iyer; K Anders; M B Eisen; P O Brown; D Botstein; B Futcher
Journal:  Mol Biol Cell       Date:  1998-12       Impact factor: 4.138

10.  Genome-wide binding map of the histone deacetylase Rpd3 in yeast.

Authors:  Siavash K Kurdistani; Daniel Robyr; Saeed Tavazoie; Michael Grunstein
Journal:  Nat Genet       Date:  2002-06-24       Impact factor: 38.330

View more
  2 in total

1.  MAO: a Multiple Alignment Ontology for nucleic acid and protein sequences.

Authors:  Julie D Thompson; Stephen R Holbrook; Kazutaka Katoh; Patrice Koehl; Dino Moras; Eric Westhof; Olivier Poch
Journal:  Nucleic Acids Res       Date:  2005-07-25       Impact factor: 16.971

2.  Underlying principles of natural selection in network evolution: systems biology approach.

Authors:  Bor-Sen Chen; Wei-Sheng Wu
Journal:  Evol Bioinform Online       Date:  2007-09-26       Impact factor: 1.625

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.