Literature DB >> 28119814

Structure of the 40S ribosomal subunit of Plasmodium falciparum by homology and de novo modeling.

Harrison Ndung'u Mwangi1, Peter Wagacha2, Peterson Mathenge2, Fredrick Sijenyi3, Francis Mulaa4.   

Abstract

Generation of three dimensional structures of macromolecules using in silico structural modeling technologies such as homology and de novo modeling has improved dramatically and increased the speed by which tertiary structures of organisms can be generated. This is especially the case if a homologous crystal structure is already available. High-resolution structures can be rapidly created using only their sequence information as input, a process that has the potential to increase the speed of scientific discovery. In this study, homology modeling and structure prediction tools such as RNA123 and SWISS-MODEL were used to generate the 40S ribosomal subunit from Plasmodium falciparum. This structure was modeled using the published crystal structure from Tetrahymena thermophila, a homologous eukaryote. In the absence of the Plasmodium falciparum 40S ribosomal crystal structure, the model accurately depicts a global topology, secondary and tertiary connections, and gives an overall root mean square deviation (RMSD) value of 3.9 Å relative to the template׳s crystal structure. Deviations are somewhat larger in areas with no homology between the templates. These results demonstrate that this approach has the power to identify motifs of interest in RNA and identify potential drug targets for macromolecules whose crystal structures are unknown. The results also show the utility of RNA homology modeling software for structure determination and lay the groundwork for applying this approach to larger and more complex eukaryotic ribosomes and other RNA-protein complexes. Structures generated from this study can be used in in silico screening experiments and lead to the determination of structures for targets/hit complexes.

Entities:  

Keywords:  3D modeling; 40S subunit; De novo; Homology; Plasmodium falciparum; RNA structure; Ribosome

Year:  2016        PMID: 28119814      PMCID: PMC5237758          DOI: 10.1016/j.apsb.2016.10.003

Source DB:  PubMed          Journal:  Acta Pharm Sin B        ISSN: 2211-3835            Impact factor:   11.413


Introduction

Knowledge of the structure of pathogen ribosomes1, 2 is important for the development of new narrow-spectrum (species-selective) and broad-spectrum antibiotics. According to the Centers for Disease Control and Prevention (CDC), the majority of hospital-acquired infections involves drug-resistant pathogens. Of particular concern are drug-resistant Plasmodium falciparum, Mycobacterium tuberculosis, Pseudomonas aeruginosa, Enterococcus, Escherichia coli and Staphylococcus aureus. Development of new drugs against P. falciparum and M. tuberculosis is particularly important in view of the high risk of drug resistance they pose3, 4. A major step towards better drug design is understanding the biological structure of the target which, for many antibiotics is the ribosome. Thus, drug development can be expedited if the ribosomal structures for these different organisms are known. In the last decade, a number of ribosome crystal structures have been determined5, 6, 7, 8. However, despite great effort, the only organisms for which ribosome structures have been determined with atomic resolution are the thermophilic bacterium Thermus thermophilus (high resolution of the 30S, and low resolution of the 70S), the archeaon Haloarcula marismortui (high resolution of 50S only), and the eubacterium Deinococcus radiodurans (high resolution of 50S only)9, 10, 11. Thus, there is a need to predict the ribosome structures of other pathogenic prokaryotes and eukaryotes. Recently, software has been developed that can use known ribosome structures to model the structures of ribosomes from homologous organisms. Such software is capable of accounting for substitutions, deletions and insertions as well as chemical modifications in the ribosomes of other organisms12, 13. The methods described in this paper provide tools for modeling complex nucleic acid molecules using known structures of homologous nucleic acids from other organisms. These methods will also show that de novo techniques can be used to model complex nucleic acids by referencing databases of known structural motifs. By using advanced structural techniques, such as electron microscopy and nuclear magnetic resonance, new information can be generated to help understand the eukaryotic ribosome. However, generation of high resolution X-ray structures of eukaryotic 80S ribosomes lags behind that of bacteria due to their greater complexity14, 15. Cyro-electron microscopy and single particle reconstruction have been used to determine structures of a transplanting plant 80S ribosome at 5.5 Å resolution which maps together with the 6.1 Å resolution of the Saccharomyces cerevisiae 80S ribosome. Furthermore, recent advances in crystallographic methods has led to the determination of the first X-ray structure of the yeast ribosome at 3.0 Å resolution. Importantly, this has facilitated ribosomal comparisons of different species and kingdoms of life and should assist in understanding the biochemical interaction between the eukaryotic ribosome and molecules in basic life forms. Eukaryotic ribosomes are more complex than their prokaryotic counterparts and synthesis of their proteins differs at the level of initiation although their core functions are conserved. Determining the structure of the P. falciparum 40S ribosome will lead to a better understanding of the structural basis for protein synthesis in the cell. It will also enable researchers to run in silico ligand screening experiments of potential anti-malarial compounds. The eukaryotic ribosome contains two subunits, the small 40S subunit comprised of 18S rRNA and 33 proteins and the large 60S subunit composed of the 28S, 5S, 5.8S and 49 proteins6, 7, 18. The eukaryotic ribosome is highly complex with a molecular mass of ~4 million Da compared to 2.8 million Da for the prokaryote ribosome19, 20, 21. The small 40S subunit is larger by almost 500 kDa than the prokaryotic small 30S subunit5, 22, 23. In the 40S subunit, the 18S rRNA has three notable sequence insertions known as expansion segments (ES). ES3 is located in the 5′ major domain, ES6 and ES7 in the central domain, ES9 in the 3′ major domain and ES12 in the 3′ minor domain. The eukaryotic 40S subunit contains ~30 proteins, 18 of which do not have homologs in bacteria and its 18S rRNA comprises about 45% of its mass20, 25. Recent publications of eukaryotic 40S ribosome structures, such as that of Tetrahymena thermophila show more defined interactions between the ribosomal RNA and proteins. Homology modeling for evolutionally-related proteins involves two proteins that have a close common ancestor. An alignment score can be obtained to show similarities between the protein sequences, a lower score suggesting different structural and functional moieties. Inaccuracies in homology models are often traced to lower sequence identity between the two proteins. RNA modeling is more challenging than protein modeling due to structural complexities associated with RNA. For instance, the RNA backbone has 6 dihedral angles while proteins have only two. In addition, RNA residues are substantially larger in size than protein residues. Various research groups have attempted to increase the size limit and accuracy of predicted RNA structures by developing appropriate software. A good example is RNA123 which is a suite of tools for homology modeling, de novo structure prediction and analysis of RNA structures. The modeling processes a series of algorithms to account for substitutions, insertions, deletion and gap closing, and performs a final energy minimization to obtain a low energy structure.

Methods

Most homology modeling involves four steps, viz template selection, target template alignment, model building and evaluation of the model. Modeling of a structure is achieved by interactively repeating these steps until the desired model is produced. A number of different techniques for model building have been developed to meet the growing demand for accurate structures28, 29, 30. The approach adopted by the SWISS-MODEL server can best be described as rigid fragment assembly which was first implemented in RNA Composer. In this study, template-based modeling was used in conjunction with de novo prediction of motifs that were lacking in the template but present in the sequence.

Proof of concept

To provide proof of concept, we used homology modeling to generate structures of the 40S subunit of S. cerevisiae (RSCB ID 4V7R) from the crystal structure of the homologous eukaryotic 40S ribosomal subunit of T. thermophila (RSCB ID 4V5O). Sequence information homology modeling methodology was followed to obtain the model structure. The models generated were then superimposed on the actual crystal structure and the root mean square deviation (RMSD) calculated. Based on the results, we concluded the homology modeling technique was useful to rapidly generate reliable structures but it depended on the similarity of the two organisms. We also concluded that it could expedite generation of structures of organisms for which no structures were available (Table 1).
Table 1

Root mean square deviation results for the modeled Tetrahymena thermophila domains superimposed on the X-ray crystallographic structure of T. thermophila and on the template of Saccharomyces cerevisiae.

Modeled domainRMSD (Å)
T. thermophila (4V5O)S. cerevisiae (4V7R)
3′ Minor1.943.98
3′ Major2.684.24
Central3.943.67
5′ Major1.283.62
In this study, we employed the software for RNA homology modeling developed by Dr. John Santa Lucia of Wayne University that allows prediction of mutant structures given the X-ray structure of a reference sequence. The accuracy of the predictions was validated for several rRNA domains with sequence identity as low as 50%, and predicted structures with all-atom RMSD close to 2 Å were typically obtained. For the whole P. falciparum structure, much higher levels of sequence identity were obtained and thus the homology predictions reported in this study are highly accurate.

Template selection

For the 18S rRNA modeling of P. falciparum, we selected a homologous crystal structure (PDB ID 4V5O) to be used as the template. For the 40S ribosomal proteins, the template was selected based on a homology score after sequence alignment.

Alignment

18S rRNA sequence alignment was achieved by first dividing the entire 18S rRNA into its constituent domains (Fig. 1, 3′ minor, 3′ major, central and 5′ major domains). Since domains fold independently, this approach was chosen as it simplifies the examination and correction of the resultant alignments. For the 40S proteins, generation of the structure alignment was done in SWISS MODEL where a local pairwise alignment of the target sequence to the main template structures was calculated followed by a heuristic step to improve the alignment for modeling purposes. The placement of insertions and deletions was optimized considering the template structure context.
Figure 1

Tertiary structure of Plasmodium falciparum 40S front (A) and back view (B). Domains are colored (5′ major: red; central: yellow; 3′ major: blue and 3′ minor: green). Also shown are the 34 40S ribosomal proteins of Plasmodium falciparum interacting with the 18S rRNA to make the total subunit.

Model building

Generation of the model core was initiated by averaging the backbone atom positions of the T. thermophila template structure. This was followed by assessing sequence similarity by target (P. falciparum)-template (T. thermophila) alignment. To do this, an assembly of fragments compatible with the neighboring stems was constructed using constraint space programming (CSP) for proteins. In RNA 123, a flow chart of the model building algorithm based on using a fragment library of possible motifs collected from previous crystal structurers is shown in Fig. 2. The algorithm follows a scoring scheme where the best loop/motif is selected based on favorable force field energy, steric hindrance and favorable interactions like hydrogen bond formation. If a suitable loop cannot be identified, the de novo algorithm is activated to build the fragments using sequence information alone.
Figure 2

The overall process for performing homology modeling of the P. falciparum 40S ribosome as per RNA123.

Energy minimization

Deviations in the geometry of the protein or RNA structure introduced by the modeling algorithm when joining rigid fragments are regularized in the last modeling step by steepest descent energy minimization using the GROMOS96 force field in SWISS--MODEL. For RNA123, empirical force fields were used to detect parts of the model with conformational errors. Energy minimization was then performed to correct these errors.

Structure validation

Validation of models was carried out using several validation tools such as MOL-PROBITY, PRO-CHECK and MATCH CHECK. The validation tools help by adding hydrogen atoms to the crystal structure (Supplementary Information).

Results and discussion

The first step is to input the coordinates of the known template structure (i.e., the T. thermophila 40S structure) and the sequence of the pathogen or other organism (i.e., P. falciparum). The second step is to perform a sequence alignment that includes secondary structure constraints. The third step is to use the sequence alignment to guide the substitutions, deletions and insertions to be automatically made in the 3D software. The final step is to optimize the geometry of the structure using molecular dynamics and energy minimization. The ribosomal proteins were also modeled by similar methods using public and commercial servers, such as the Expasy server. This approach was found to be reasonably reliable since P. falciparum proteins within the P. falciparum domain often have sequence identity >50%. Due to the complexity of 18S rRNA, its structure was modeled by segmenting it into different domains (5′ major (Fig. 3A), central (Fig. 3B), 3′ major (Fig. 3C) and 3′ minor (Fig. 3D)) and subsequently combining them to form the complete structure (Fig. 3E). Since these domains were generated separately, residue clashes were expected upon combining them. This then requires the structure to be subjected to an additional series of energy minimizations (Table 2) in RNA123 to obtain the three dimensional (3D) structure. This was further used to anchor the 34 ribosomal proteins to give the 3D structure of P. falciparum (Fig. 4).
Figure 3

Due to the complexity of 18S rRNA, its structure was modeled by segmenting it into different domains (A) 5′ major domain, (B) central domain, (C) 3′ major domain, and (D) 3′ minor domain, and subsequently combining them to form (E) the complete structure.

Table 2

Plasmodium falciparum 18S rRNA energy optimization obtained using RNA123 which helps minimize the energy from a large positive figure to a more acceptable negative figure that is biologically functional.

Name18S rRNA.std.egy18S rRNA.opt.egy
Total inter energy2,410,217.54112−34,562.06042
Total intra energy (--Gamma En)−16,765.12946−16,733.20508
Total gamma terms energy1,433.942361,473.80569
Total gap geometry penalty2,893.753323,454.89615
Total restraint energy0.000006,083.08080
Total structure energy2,397,780.10735−46,366.56367
Figure 4

3D structures of all ribosomal proteins of the 40S subunit of P. falciparum. The backbone of all proteins is shown in color.

18S rRNA is a large biological molecule that performs multiple roles in the coding, unraveling, regulation, and expression of genes and in their translation in eukaryotes. The rRNA has a catalytic role in the ribosome that is orchestrated in the nucleolus. The longest expansion fragment (ES) in the 18S rRNA of P. falciparum, ES6 (Fig. 5), comprises 215 nucleotides framing one helix that replaces the five helices of T. thermophila and h21 in bacteria. The single helix that replaces helices A, B, C and D of T. thermophila occupies a large part of the back of the 40S body and varies significantly in structure from past models. For example, the helices extend over the back of 40S and are covered by helix A. In addition, the loop region of helix ES6E in T. thermophila forms a base pair with ES3B, yielding a broad helix that ventures from the focal point of the back towards the left foot of 40S20, 34, 35. The quaternary interaction of ES3 and ES6, together with several proteins, frames another area of the more extensive back and more conspicuous left foot elements of 40S as in T. thermophila.
Figure 5

The two-dimensional structure of P. falciparum showing the various expansion segments and the various domains.

The homologous structure of the 40S subunit of P. falciparum provides insights into the evolution of the parasite ribosome and its specific functions. It also reveals the interactions of the ribosomal RNA and proteins providing a step towards understanding eukaryotic parasite translation. We anticipate it may provide a new direction in targeting the small differences in host-parasite translation to bring about a major decline in parasitic illnesses. During secondary structure alignment by RNA123, specific deletions or insertions that account for genus specific differences appear as gaps or insertions. They occur mostly in central regions at h16–h17, ES6, h26–ES7S, ES9S, and ES10S, rather than in the highly conserved 5′ and 3′ ends (Fig. 5). It is encouraging to note that the secondary structure of two different organisms can appear very similar despite the fact that their molecular length is totally different. After secondary structure alignment, a consensus sequence can be added to aid the model to conform to its final biological function. During translation in the ribosome, steric hindrance by elements of mRNA secondary structure is dependent on the activity of the intrinsic helicase that unwinds the mRNA. P. falciparum homologs to bacterial rpS3p interact with the mRNA channel, whereas homologs to bacterial rpS4p, i.e., rpS9e, are shorter and do not extend into the mRNA channel. However, the P. falciparum–specific rpS30e has three essential residues oriented toward the mRNA channel as in T. thermophila and appears to serve as a partial positional and practical analog of bacterial rpS4p20, 36. The interpretation of bacterial structure begins with the interconnection of two segments of the small ribosomal subunit with the 5′ untranslated section of mRNA. The purpose of the association is the anti–Shine-Dalgarno grouping close to the 3′ end of the 16S rRNA and the ribosomal protein rpS1p in most bacteria7, 13, 37. A homolog of rpS1p or a Shine-Dalgarno grouping is not required to initiate interpretation of either T. thermophila or P. falciparum8, 38. The area corresponding to the bacterial Shine-Dalgarno chamber in the structure of the 40S subunit (where rpS1p has been pictured in electron microscopy investigations of the 30S subunit) is partly occupied by the proteins rpS28e and rpS26e of P. falciparum as in T. thermophile (Fig. 6). The structure of the RNA binding domain of rpS1p correlates to the b-barrel structure of rpS28e of P. falciparum39, 40, 41. The 3′ end of P. falciparum 18S rRNA, in contrast to the flexible 3′ end of bacterial 16S rRNA, appears to be kept in place by rigid connections with rpS26e. Regulation of signaling pathways by rpS4e, rpS6e, and RACK1 is comparable with those in T. thermophila13, 16, 20.
Figure 6

Side by side view of the front and back orientations of Tetrahymena thermophila and Plasmodium falciparum mRNA tunnels: the front (A) and back (C) views of T. thermophila, the front (B) and back (D) views of P. falciparum. Ribosomal proteins residue which extends into the mRNA channel site at the ribosomal head and shoulder can be easily recognized.

The crystal structure of the 40S subunit allows an insight into its signaling process. Focus is particularly centered on rpS6e, a target of the mTOR pathway; rpS4e, that is distinctly phosphorylated in both T. thermophile and P. falciparum and RACK1, a signaling focal point in the ribosome13, 20. The P. falciparum ribosomal protein, rpS6e, extends from the right to the left foot of the 40S subunit (Fig. 1). Location of the ribosomal protein, rpS4e, is found at the back of the ribosome and is firmly anchored within ES3 and ES6, of 18S rRNA. Both the P. falciparum and T. thermophila 40S subunits have structures that contain bound RACK120, 42. RACK1 makes extensive contacts not only with the phosphate backbone and bases of 18S rRNA, but also with the ribosomal proteins, rpS16e, rpS17e and rpS3e, all of which modify the signaling properties of RACK1. Undeniably, two eukaryotic signaling protein binding sites in RACK1, the site of phosphorylation, and the binding site of focal adhesion kinase (FAK),, are at least partially occluded when RACK1 is bound to the ribosome. This may partially explain the result of a recent study in yeast which suggested a correlation between the location of RACK1 at the ribosome and its signaling function20, 43, 44.

Conclusions

Modeling the structure of the P. falciparum 40S ribosomal subunit will lead to a better understanding of the structural basis for its protein-synthesizing role in the cell. The modeling platform allows for generation of a 3D atomic-level structure of pathogenic ribosomes through a combination of homology and de novo modeling. The developed RNA platform allows for a systematic study of RNA-ligand interactions and will facilitate the design of drug-like compounds that target biologically functional motifs. Furthermore, it will enable researchers in drug development to run in silico ligand screening experiments of a library of potentially active antimalarial compounds against the solved P. falciparum 40S structure as a target. Drug leads identified through this method could then lead to further biochemical and in vitro binding studies with the ultimate goal of developing new classes of antimalarial drugs. It is anticipated that the structure prediction and modeling technologies used here will dramatically reduce the time it takes from target identification to drug lead determination.
  35 in total

1.  Structure of a bacterial 30S ribosomal subunit at 5.5 A resolution.

Authors:  W M Clemons; J L May; B T Wimberly; J P McCutcheon; M S Capel; V Ramakrishnan
Journal:  Nature       Date:  1999-08-26       Impact factor: 49.962

2.  SCOP database in 2002: refinements accommodate structural genomics.

Authors:  Loredana Lo Conte; Steven E Brenner; Tim J P Hubbard; Cyrus Chothia; Alexey G Murzin
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  Functional insights from the structure of the 30S ribosomal subunit and its interactions with antibiotics.

Authors:  A P Carter; W M Clemons; D E Brodersen; R J Morgan-Warren; B T Wimberly; V Ramakrishnan
Journal:  Nature       Date:  2000-09-21       Impact factor: 49.962

4.  Secondary structure of two regions in expansion segments ES3 and ES6 with the potential of forming a tertiary interaction in eukaryotic 40S ribosomal subunits.

Authors:  Gunnar Alkemar; Odd Nygård
Journal:  RNA       Date:  2004-03       Impact factor: 4.942

Review 5.  The bacterial ribosome as a target for antibiotics.

Authors:  Jacob Poehlsgaard; Stephen Douthwaite
Journal:  Nat Rev Microbiol       Date:  2005-11       Impact factor: 60.633

6.  Comparative modeling in CASP6 using consensus approach to template selection, sequence-structure alignment, and structure assessment.

Authors:  Ceslovas Venclovas; Mindaugas Margelevicius
Journal:  Proteins       Date:  2005

7.  An improved method for an efficient and easily accessible eukaryotic ribosome display technology.

Authors:  J A Douthwaite; M A Groves; P Dufner; L Jermutus
Journal:  Protein Eng Des Sel       Date:  2005-12-20       Impact factor: 1.650

Review 8.  Eukaryotic ribosomal proteins lacking a eubacterial counterpart: important players in ribosomal function.

Authors:  John Dresios; Panagiotis Panopoulos; Dennis Synetos
Journal:  Mol Microbiol       Date:  2006-03       Impact factor: 3.501

9.  RACK-1, a receptor for activated C kinase, interacts with the transcription factor NFAT and represses its transactivation.

Authors:  Juhyun Han; Minjung Kim; Ji-Hyun Oum; Heejoon Myung; Seong-Wook Lee; Sunjoo Jeong; Surendra Sharma; Jungchan Park
Journal:  Mol Cells       Date:  2002-12-31       Impact factor: 5.034

10.  Data growth and its impact on the SCOP database: new developments.

Authors:  Antonina Andreeva; Dave Howorth; John-Marc Chandonia; Steven E Brenner; Tim J P Hubbard; Cyrus Chothia; Alexey G Murzin
Journal:  Nucleic Acids Res       Date:  2007-11-13       Impact factor: 16.971

View more
  1 in total

1.  Methods for Identifying Microbial Natural Product Compounds that Target Kinetoplastid RNA Structural Motifs by Homology and De Novo Modeled 18S rRNA.

Authors:  Harrison Ndung'u Mwangi; Edward Kirwa Muge; Peter Waiganjo Wagacha; Albert Ndakala; Francis Jackim Mulaa
Journal:  Int J Mol Sci       Date:  2021-04-26       Impact factor: 5.923

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.