Literature DB >> 20016748

Identification of Anoectochilus based on rDNA ITS sequences alignment and SELDI-TOF-MS.

Chuan Gao1, Fusheng Zhang, Jun Zhang, Shunxing Guo, Hongbo Shao.   

Abstract

The internal transcribed spacer (ITS) sequences alignment and proteomic difference of Anoectochilus interspecies have been studied by means of ITS molecular identification and surface enhanced laser desorption ionization time of flight mass spectrography. Results showed that variety certification on Anoectochilus by ITS sequences can not determine species, and there is proteomic difference among Anoectochilus interspecies. Moreover, proteomic finger printings of five Anoectochilus species have been established for identifying species, and genetic relationships of five species within Anoectochilus have been deduced according to proteomic differences among five species.

Entities:  

Keywords:  Anoectochilus; ITS; SELDI; proteomic finger printing

Mesh:

Substances:

Year:  2009        PMID: 20016748      PMCID: PMC2793310          DOI: 10.7150/ijbs.5.727

Source DB:  PubMed          Journal:  Int J Biol Sci        ISSN: 1449-2288            Impact factor:   6.580


1. Introduction

Anoectochilus is one of the 27 genera of subteib. Goodyerinae of Trib. Neottieae (Orchidaceae). There are about 40 species of this genus in tropical zone areas of Asia and Australia. 20 species among them and two varieties have been found from southwest to south of China 1. Major medicinal plants named "King of medicine" or "Golden grass" of this genus are A. formosanus, A. koshunensis and A. roxburghii, which are generally used in treatment of hypertension, diabetes, heart disease, pneumonia, acute and chronic hepatitis, nephritis and other diseases 1, 2. According to the identification standards 1, we collected wild species of Anoectochilus plants in southwest areas of China and cultured axillary bud in sterile condition. For a long time, the morphological characteristics have been the main standards of phylogenetic studies. However, due to the narrow of the distribution of wild Anoectochilus and the scarce of natural reserves and the missing of flower period, traditional identification methods were usually invalid. With the development of plant molecular systematics, a growing number of nucleic acid analysis techniques are widely used in plant phylogenetic studies 3, such as DNA hybridization, DNA restriction enzyme spectrum analysis, DNA fingerprinting techniques, RFLP, RAPD, AFLP and DNA sequence analysis. At present, ITS sequence of ribosomal DNA (rDNA) has become an important molecular markers in phylogenetic development of angiosperms, and been widely used in the intragenera, intergenera and even interspecies phylogenetic studies 4-5. A series of progress has been achieved 6-7. Meanwhile, proteomics researches based on two-dimensional gel electrophoresis are also applied in the genetic evolution and classification of species. For instance, to identify the origin of cultivated species through the analysis of protein differences of cultivated and wild wheat 8, and to study the genetic diversity by protein differences of wheat varieties 9 or wheat groups planted in different areas 10. In addition to wheat, other species used 2-D separated characteristic proteins to identify species (or subspecies) , such as rice 11-12, sugarcane 13 and chilli 14, have been reported. As a new way on proteome research, SELDI-TOF-MS (Surface Enhanced Laser Desorption Ionization-Time of Flight-Mass Spectrography) technology shows a lot of advantages such as simple, fast, acute and high-flux. It can detect trace protein as fmol(10-15mol) 15-16 and is regarded as supplement of two-dimensional electrophoresis. Today, SELDI has been mostly used in clinical diagnosis through identifying several special proteins peaks of special disease 17 and also was used in the studies on plant proteome recently 18-20. In this experiment, based on the NCBI Blast comparison of ITS sequences, several named (TW, FJ) and unnamed (WS, YN1, YN2, XZ) orchid plants were only classified to genera. So the protein difference of these plants using the SELDI-TOF-MS was investigated to identify species at the first time, and the proteome fingerprint of these plants were established respectively, and the genetic relationship of five species within Anoectochilus was preliminarily explored.

2 Materials and methods

2.1 Materials

Experimental material: six tissue culture seedlings comprised TW (Taiwan province), FJ (Fujian province), WS (Yunnan province), YN1(Yunnan province), YN2 (Yunnan province) and XZ (Tibet province). The main reagents: polyvinylpyrrolidone (PVP), trichloroacetic acid (TCA), dithiothreitol (DTT), urea, 3-[(3-cholamidopropyl)dimethylamonio]-1-propane sulfonate (CHAPS) reagents for Gibco products; 2-D Quant Kit protein quantitative kit products for the Amersham; Plant Genomic DNA Kit, TIANGEN company's products. The main instruments: Master Cycleer gradient AG 22331 PCR Instrument, Eppendorf companies; protein flight mass spectrometer (PBS IIc), that is, surface-enhanced laser desorption ionization time-of-flight mass spectrometry (SELDI-TOF), chemically modified weak-cation CM10 chip, Ciphergen U.S. companies; Centrifuge 5810r desktop high-speed refrigerated centrifuge, Eppendorf company; Milli-Q ultrapure water instrument, Millipore products.

2.2 Methods

2.2.1 Plant DNA extraction and amplification of ITS sequences

Plant Genomic DNA Kit prospectus was a reference to extract plant DNA. The primers of PCR amplification of ITS sequences 21, 22: P1: CGT AAC AAG GTT TCC GTA GGT GAAC; P2: TTA TTG ATA TGC TTA AAC TCA GCG GG, conventional PCR amplification conditions.

2.2.2 Alignment of ITS sequences by Blast

Blast engine is in NCBI web site http://blast.ncbi.nlm.nih.gov/Blast.cgi.

2.2.3 Sample preparation for proteomics

When the sterile cultured plant seedlings grow to 8-10cm high, they are collected from the tender shoots of plants and sample preparation is based on the procedure described by Damerval 23, and modified as follow: samples approximately 500 mg and 50 mg polyvinylpyrrolidone(PVP) are ground in liquid nitrogen. The powder is suspended in 8-10 folds sample volume (weight/vol) of acetone (-20 ºC) + 10% vol/vol trichloroacetic acid (TCA) + 5% vol/vol dithiothreitol (DTT) and incubated overnight at -20ºC. After centrifugation, (14, 000 × g, 4 ºC, 60 min), the supernatant is removed, 5 mL of ice-cold acetone +1% vol/vol DTT+1 mM phenylmethyl sulfonylfluoride (PMSF) is added, and the samples are stored for 1 h at -20 ºC. Then centrifuge at 4 ºC, 14, 000×g for 30 min, repeat this step one time. Volatilize remains acetone of precipitation at -20 ºC. 20 mg protein powder is dissolved in 500 mL protein lysate (9.5 mol/L urea, 2% 3-[(3-cholamidopropyl) dimethylammonio] -1-propanesulfonate (CHAPS), 1% DTT), insoluble particles are removed by centrifugation (14, 000 ×g, 4 ºC, 30 min) and protein level is determined using 2-D Quant Kit(Amersham) with accurate concentration using bovine serum albumin (BSA) as standard.

2.2.4 SELDI-MS-TOF detection

Put the protein chip (Chemical modify chip CM10, Ciphergen company of USA) in processor. Add 200 µL binding buffer (50 mmol/L NaAc, pH 4.0) in each pore. Vibrate 2 times to wash the processor at room temperature; 5 min every time, then shake away the buffer. Add 100 µL sample mixed liquor in each pore (about 5μg protein per pore). After Vibrating 1 h, shake away sample. Vibrate 2 times to wash processor with 200 μL elution buffer (50 mmol/L NaAc, pH 4.0) at room temperature, 5 min every time, then shake away the buffer. Wash again with deionized water, shake away immediately. Unpack processor and take out chip, dry it in air, then put 0.5 μL SPA [Sinapinic acid, 5 mg SPA in 80 μL TFA and CAN solution (trifluoroacetic acid : acetonitrile=1:1) ] in each pore, after dry, repeat this step again then put chip in SELDI-TOF-MS instrument and detect (PBS IIc, Ciphergen company of USA).

2.2.5 Data processing and Proteins identification

Parameter set as: the detected highest molecular weight is 50 kD, the optimizing range are 2-20 kilo daltons (KD), the intensity of laser is 205, the detection sensitivity is 8. The peaks under 2 kD are deleted to remove interference of base peak. Data are gathered automatically by the analysis software Ciphergen Proteinchip 3.1 and proteins difference is analyzed by Biomarker Wizard software. Standardized graph parameter set as: intensity extent is 5-75, mass charge ratio (e/m) extent is 2000-50000, and calibrate is 6825 Da protein peak. The SPSS 11.5 software is used to analyze the data on statistics.

3 Result and Discussion

3.1 Results of alignment of ITS sequence by Blast

The ITS sequences of samples were aligned by Blast. The four highest score DNA accession number and their coverage and identity were listed in table 1. TW, FJ, YN1, YN2, WS are classified in Anoectochilus, TW could be judged as A. formosanus. Referring to its flower character, FJ can be identified as A. roxburghii. Although owning the same ITS sequences, YN1 and YN2 may be two variants or subspecies because of its differences in appearance. XZ belongs to Goodyera rather than Anoectochilus, can be identified as G. macrantha.
Table 1

Results of alignment of ITS sequence by Blast

Name (NCBI accession no.)AccessionDescriptionScoreCoverageIdent
TW GQ328777AY052780.1Anoectochilus formosanus1404100%100%
EU700340.1Anoectochilus koshunensis1399100%99%
EU817408.1Anoectochilus roxburghii1393100%99%
AJ539483.1Ludisia discolor94889%92%
FJ GQ328774AY052780.1Anoectochilus formosanus1387100%99%
EU700340.1Anoectochilus koshunensis1382100%99%
EU817408.1Anoectochilus roxburghii1376100%99%
AJ539483.1Ludisia discolor94289%91%
YN1 GQ328780 /YN2 GQ328781EU817408.1Anoectochilus roxburghii1387100%99%
AY052780.1Anoectochilus formosanus1387100%99%
EU700340.1Anoectochilus koshunensis1382100%99%
AJ539483.1Ludisia discolor95389%92%
AJ539521.1Dossinia marmorata93582%93%
WS GQ328778AY052780.1Anoectochilus formosanus1399100%99%
EU700340.1Anoectochilus koshunensis1393100%99%
EU817408.1Anoectochilus roxburghii1387100%99%
AJ539483.1Ludisia discolor94289%91%
XZ GQ328779AF366894.1Goodyera macrantha1293100%100%
AF366895.1Goodyera maximowicziana1155100%96%
AF366898.1Goodyera velutina1151100%96%
AF366897.1Goodyera schlechtendaliana1083100%94%
In table 1, Taxonomic status of the various genera and species are Eukaryota, Viridiplantae, Streptophyta, Embryophyta, Tracheophyta, Spermatophyta, Magnoliophyta, Liliopsida, Asparagales, Orchidaceae, Orchidoideae, Cranichideae, Goodyerinae, under Goodyerinae are Anoectochilus, Ludisia, Dossinia, and Goodyera respectively. The results also reveal that the classification relationships of Goodyerinae are very close. The conservative ratio of ITS sequences is above 91%. The conservative ratio of ITS sequences within Anoectochilus is even above 99% and 95% within Goodyera. Ludisia and Dossinia could not be compared as one species alone in database respectively, there is 93% conservative ratio between them. 92% conservative ratio between them and Anoectochilus are observed in the table. The conservative ratio of ITS sequences within Anoectochilus is above 99%, while dropped conservative ratio 91-92% are observed between Anoectochilus and other genera. Therefore, because of the high conservation of ITS sequences within Anoectochilus plants, they couldn't be identified under this genus through the ITS sequence alignment in most cases. Accordingly, XZ which was originally considered as Anoectochilus has been identified as G. macrantha of Goodyera by ITS alignment. So it was not brought into the next proteomics research.

3.2 Differential proteome

The former reports showed there were some differences in proteins in total proteome between similar species, which embodied the genetic variability of species. Bahrman N, etc calculated the similarity index between each pair of genotypes, and found that wild wheat T. speltoides is most similar to cultivated B genome by analyzing the total number of proteins of several wild and cultivated wheat species 8. Analysis on total proteins of the relative species, Quercus petraea and Quercus robur confirmed that there was low genetic diversity between them 24. With 2-D analysis on protein expression profiles of 18 strains of wheat allopolyploid, Zivy M, etc found 14 wheat proteins can reflect the nuclear genetic variation of wheat varieties 9. With continuous improvement of the protein extraction method, as well as the 2-D technology especially the use of big strip and IPGs of appropriate pH value, the efficiency of plant species identification and detection of genetic variation have been greatly improved. Görg A, etc analyzed gliadin maps of 14 barley varieties with different malts using the first-dimensional isoelectric focusing IPG-DALT gel, and found that 10 species showed unique gliadin spectrum, while the remaining species were divided into two groups and each group consisted of two varieties 25. Skylas DJ, etc found that it can be distinguished that four different varieties of wheat through three special areas (two Glutenin zones and a non-Glutenin area (small Heat Shock Proteins)) of total endosperm proteome by 2-D electrophoresis with acidic (pH 4-7) and alkaline (pH 6-11) strips 26. Their study also found that the difference of wheat heat shock proteins is an interesting possible target protein area in species identification。 The researches above indicate that the protein differences between species can be used for the identification of plant species.

3.2.1 Standard curve

Protein level is determined using 2-D Quant Kit(Amersham) with accurate concentration using bovine serum albumin (BSA) as standard. The standard curve is showed as figure 1.
Fig 1

Standard curve

3.2.2 Comparison of protein differential peak

Proteins peak form map (Fig.2) given by SELDI shows that there are many common protein peaks in each groups. The corresponding proteins should be the common expressive proteins in Anoectochilus and they are necessary for plant growth. But the strength disparity of some peaks indicates that the amounts of these proteins appear different, and some peaks exist in one species alone. All of the protein differences may cause the interspecies difference in Anoectochilus.
Fig 2

Protein peak shape (a and b).

Statistics analysis shows that when compared with the FJ control group, 102 different proteins/peptides are found among five groups with p<0.05(n=3). Molecular weights and corresponding peak areas of part proteins are list in table 2.
Table 2

Molecular weight and peak area of part interspecific differential proteins of Anoectochilus

M/ZpMean-FJMean-TWMean-WSMean-YN1Mean-YN2
5849.010.0132.14601.215515.21333.91674.5491
6130.280.01564.85774.43634.79231.63515.385
5133.940.01624.06643.93015.27741.23623.116
10451.800.01923.65838.09618.27730.93419.361
12081.320.01918.15164.98250.04516.68911.188
6053.500.0225.97439.39810.9905.6934.963
10283.810.02226.68037.52616.59868.42630.815
6115.950.02519.77116.57762.45249.8598.709
7278.390.02638.8538.8483.4295.2719.062
7566.280.0312.9212.5392.88210.55230.942
6077.380.0406.5038.72758.03518.1994.946
12177.270.04349.34249.20580.97275.46224.793
6089.220.0457.8227.53668.32117.3324.940
12133.440.04626.63228.94080.76549.28219.069
10525.790.04739.89140.01110.45832.26423.021
Combined with graphs and charts, the largest peaks can be chosen as characteristic peaks of identification of the Anoectochilus, such as M/Z peaks of 5133, 6130, 10451, 10283, 10526, 12133 and 12177 Daltons (Da).

3.2.3 Establishment of proteomic fingerprint

According to the interspecies differential proteins and peaks, obvious differential peaks between species were treated as target protein peaks to identify these species, with FJ (A. roxburghii) and TW (A. formosanus) as standards. In a narrow range between 4800 and 4820 Da, there are double protein peaks in FJ and TW. YN1 and YN2 exists single-peak in the 4820 Da respectively, and there was no significant difference in peak height and peak area between YN1 and YN2. No peak exists in this range in WS. In 5115 to 5130 Da, there are double-peaks in protein peak chart of FJ, WS and YN1 respectively. Almost equal height and area of the double-peaks are observed in FJ and WS, while those in YN1 add one time. TW and YN2 has a single peak alone in 5130 Da, and the peak height and peak area of TW are significantly higher than those of YN2 (Figure 2, a), the values are 48.6, 1051.5 and 29.2, 571.6 respectively. In 6000 to 8000 Da, four peaks of 6052, 6350, 6750 and 7538 Da can be considered to be characteristic peaks to distinguish Anoectochilus species. 6052 exists only in TW or YN1. 6350 peak is higher in WS or YN1, while 6750 peak exist in WS and YN1 alone. 7538 peak is much higher in YN2 than that in other samples (Figure 2, b). Within range between 8000 to 11, 000 Da, five peaks can be regarded as the characteristic peaks (8131, 9032, 9058, 10, 430 and 10, 512). The 8131 peak is higher in YN1 and WS. The double-peaks of 9032 and 9058 exist in FJ and WS, but in TW, YN1 and YN2, only one of them exists. Peak of 10, 430 does not exist, while 10512 peak is higher in WS (table 3).
Table 3

Protein fingerprint spectra of the five species within Anoectochilus

Some peaks are not considered as characteristic peaks because of the bad peak shape and lower isolating rate, although they have higher peak height and higher differences. Based on the above analysis, the molecular weight and character of the target proteins of the five species of Anoectochilus are listed in Table 3.

3.2.4 The relationships of the five species within Anoectochilus

Neither the number nor the multiples of differential proteins among the proteomes of five species of Anoectochilus is equal (Table 4).
Table 4

Interspecific differences in protein of Anoectochilus

Compared groupTotal number of differential proteins>20 folds5-20 foldsThe highest multiple
FJ/TW1102548
FJ/WS138/818
FJ/YN155/1014
FJ/YN213821529
TW/WS17031538
TW/YN1802969
TW/YN2130/1212
WS/YN19811334
WS/YN218362786
YN1/YN210011055
Total proteins number that can be identified of each species is 307
Table 4 shows that FJ might be the nearest relationship to YN1 since there are a few differential proteins and lower differential multiples between them. It can be concluded that YN1 closes to WS, followed by TW and YN2, as well as FJ and WS. And comparing with the others, the genetic relationship of WS and YN2 could be considered the remote, because of not only the maximum differential proteins but the biggest differential multiple. The affinity diagram of five species can be drawn from the above data (Figure 3).
Fig 3

Affiliation of five plants in Anoectochilus. The number 1-10 in circles shows the relationship of interspecies from near to far.

In the former study on genetic variability within species, the relationship of seven natural geographical groups of the coastal pine was investigated by using 2-D, and genetic variation of different geographical sites was compared. The research showed that these trees can be divided into three major groups, which were the Atlantic Ocean group, the Mediterranean group and North African group 27. The genetic diversity of group evolution of 11 wheat groups which planted in isolated areas was studied by David JL etc. 10. It was found that 15 proteins showed significant differences among 48 protein spots that related to genetic differences. And the similar genetic differences were shown in one isolated group even with different cultivation. The above results indicate that the natural selection of groups played a major role in evolution, and the selection effects was mainly decided by environmental conditions. In this experiment, these five plants distribute in the southwest of China, the far relationships of YN1 and YN2, as well as YN2 and TW may due to the differences of plateau climate and marine climate. But the difference between YN1 and YN2 can not be explained by climate reason, and may be caused by some kind of natural selection. Therefore, it is believed that the elevation and climate may lead to the species differentiation. However, besides these key factors, there are many other factors causing the differentiation and variation of species.

3.2.5 Concluding remark

We can determine the genus of plants through the ITS sequences blast, and the conservative ratio is more than 95% within the genus. And the conservative ratio is up to 99% within Anoectochilus, and that between the genera is 91%. Therefore, it can not be identified to the species based on the ITS sequence alignment in most cases. Up to now, it was the first time to use the SELDI-TOF-MS technology for the species identification of the orchid plants. According to the differential proteome data and peak shape diagram given by SELDI-TOF-MS, peaks with molecular weight 4800/4820, 5115/5130, 6052, 6350, 6750, 8130, 9032/9058 and 10, 512 were chosen as characteristic peaks, and protein fingerprints of five plants within Anoectochilus were established respectively. Meanwhile, the genetic variability of the five species was analyzed. It was found that a certain effect to species variation will be caused by geographical and climatic variation, but the final differentiation and variation of one species is caused by multiple factors.
  17 in total

Review 1.  Bioinformatics in the post-genome era.

Authors:  Ungsik Yu; Sung Hoon Lee; Young Joo Kim; Sangsoo Kim
Journal:  J Biochem Mol Biol       Date:  2004-01-31

2.  On-chip detection of low-molecular-weight recombinant proteins in plant crude extracts by SELDI-TOF MS.

Authors:  Amine M Badri; Karine Coenen; Louis-Philippe Vaillancourt; Charles Goulet; Dominique Michaud
Journal:  Methods Mol Biol       Date:  2009

3.  Identification of specific proteins from seed embryos by two-dimensional gel electrophoresis for the discrimination between indica and japonica rice.

Authors:  H Saruyama; N Shinbashi
Journal:  Theor Appl Genet       Date:  1992-09       Impact factor: 5.699

4.  The wheat-grain proteome as a basis for more efficient cultivar identification.

Authors:  D J Skylas; L Copeland; W G Rathmell; C W Wrigley
Journal:  Proteomics       Date:  2001-12       Impact factor: 3.984

5.  Polymorphism of pepper (Capsicum annuum L.) seed proteins studied by two-dimensional electrophoresis with immobilized pH gradients: methodical and genetic aspects.

Authors:  A Posch; B M van den Berg; C Duranton; A Görg
Journal:  Electrophoresis       Date:  1994-02       Impact factor: 3.535

6.  Phylogenetic and genomic relationships in the genus Glycine Willd. based on sequences from the ITS region of nuclear rDNA.

Authors:  K P Kollipara; R J Singh; T Hymowitz
Journal:  Genome       Date:  1997-02       Impact factor: 2.166

7.  Variations in glutelin and high molecular weight endosperm proteins among subspecies of rice (Oryza sativa L.) detected by two-dimensional gel electrophoresis.

Authors:  T Abe; R S Gusti; M Ono; T Sasahara
Journal:  Genes Genet Syst       Date:  1996-04       Impact factor: 1.517

8.  [Study on sequence difference and SNP pheomenon of rDNA ITS region in F type and H type population of Dendrobium officinale].

Authors:  Xiao-yu Ding; Zheng-tao Wang; Luo-shan Xu; Hong Xu; Kai-ya Zhou; Guo-xin Shi
Journal:  Zhongguo Zhong Yao Za Zhi       Date:  2002-02

9.  Protein profiling of single epidermal cell types from Arabidopsis thaliana using surface-enhanced laser desorption and ionization technology.

Authors:  Berit Ebert; Christian Melle; Elke Lieckfeldt; Daniela Zöller; Ferdinand von Eggeling; Joachim Fisahn
Journal:  J Plant Physiol       Date:  2008-04-18       Impact factor: 3.549

10.  Study on nuclear and cytoplasmic genome expression in wheat by two-dimensional gel electrophoresis : 1. First results on 18 alloplasmic lines.

Authors:  M Zivy; H Thiellement; D de Vienne; J P Hofmann
Journal:  Theor Appl Genet       Date:  1983-07       Impact factor: 5.699

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.