Comparative genomics of Streptomyces avermitilis, Streptomyces cattleya, Streptomyces maritimus and Kitasatospora aureofaciens using a Streptomyces coelicolor microarray system.

Nai-Hua Hsiao, Ralph Kirby.

Abstract

DNA/DNA microarray hybridization was used to compare the genome content of Streptomyces avermitilis, Streptomyces cattleya, Streptomyces maritimus and Kitasatospora aureofaciens with that of Streptomyces coelicolor A3(2). The array data showed about 93% agreement with the genome sequence data available for S. avermitilis and also revealed a number of trends in genome structure for Streptomyces and the closely related Kitasatospora. A central core region was well conserved, as might be predicted from previous research, and this was linked to a low degree of gene conservation in the terminal regions of the linear chromosome across all four species. Between these regions there are two areas of intermediate gene conservation by microarray analysis where gene synteny is still detectable in S. avermitilis. Nonetheless, a range of conserved genes could be identified within the terminal regions. Variation in the genes involved in differentiation, transcription, DNA replication, etc. provides interesting insights into which genes in these categories are generally conserved and which are not. The results also provide target priorities for possible gene knockouts in a group of bacteria with a very large number of genes of unknown function compared to most bacterial species.

Year:  2007        PMID: 17588127      PMCID: PMC2140096          DOI: 10.1007/s10482-007-9175-1

Source DB:  PubMed          Journal:  Antonie Van Leeuwenhoek        ISSN: 0003-6072            Impact factor:   2.271


Introduction

Streptomyces are a group of aerobic, high %G+C, Gram-positive bacteria that undergo complex differentiation to form filamentous mycelium, aerial hyphae and spores. In addition, they produce a broad range of secondary metabolites including antibiotics, antiparasitic agents, herbicides, anti-cancer drugs and various enzymes of industrial importance. Two Streptomyces species have had their complete genome sequences published, namely the model organism Streptomyces coelicolor (%G+C = 72.1) and the avermectin producer Streptomyces avermitilis (%G+C = 70.7) (Bentley et al. 2002; Ikeda et al. 2003). Two important aspects of the genome structure of Streptomyces were supported by the sequence data. Firstly, the genome size of Streptomyces is large compared to other bacteria: 8,667,507 bp (7,825 protein coding genes) for S. coelicolor and 9,025,608 bp (7,577 protein coding genes) for S. avermitilis. Secondly, the genomes of these two species are linear and both ends contain unique terminal inverted repeats that probably covalently bind a terminal protein. Terminal inverted repeats and covalently bound terminal proteins are not found in the limited number of other bacteria that have linear chromosomes, such as Borrelia burgdorferi and Agrobacterium tumefaciens, and, up to the present, seem to be unique to the Streptomyces and perhaps other Actinobacteria (Lin et al. 1993; Chen et al. 2002; Goodner et al. 1999; Huang et al. 2004). Over 2,500 Streptomyces strains are present in the Ribosomal Database Project (http://www.rdp.cme.msu.edu), over 1,500 are available at the American Type Culture Collection (http://www.atcc.org/) and many more are held in both public and private culture collections throughout the world. Analysis of the small subunit ribosomal RNA gene sequences of Streptomyces confirms that they form a monophyletic clade, but one with considerable diversity.
In addition, there is significant gene diversity at the interspecies level across the genomes of both completely sequenced Streptomyces, with 2,291 genes unique to S. avermitilis and 2,307 genes unique to S. coelicolor. This makes them particularly interesting targets for comparative genomic studies. In this study we chose four species to begin an analysis of the genomic diversity of the Streptomyces. S. avermitilis was chosen because of the availability of the complete genome sequence of this species, while Streptomyces maritimus was chosen because of its intermediate position in terms of phylogeny within the Streptomyces. Streptomyces cattleya, a β-lactam producing species, was chosen because, based on small subunit ribosomal RNA sequence, it is phylogenetically quite divergent from S. coelicolor and branches near the root of the Streptomyces clade. Finally, Kitasatospora aureofaciens was chosen as this genus is very closely related to the Streptomyces. The availability of two microarrays for S. coelicolor (Lum et al. 2004; Huang et al. 2001; Vinciotti et al. 2005; http://www.surrey.ac.uk/SBMS/Fgenomics/Microarrays/index.html) makes possible a comparative genomic analysis of Streptomyces species. The genes that make up the genome of S. coelicolor have been classified based on the scheme of Riley and colleagues for E. coli, as modified for S. coelicolor (http://www.sanger.ac.uk/Projects/S_coelicolor/scheme.shtml). A microarray analysis of the genomes of these Streptomyces using the S. coelicolor microarray can provide a wide-ranging comparative analysis of their conserved genome content. This type of approach, where a heterologous microarray is used to analyze the genome content of a range of strains or species, has been successfully used in a wide range of organisms (Akman and Aksoy 2001; Akman et al. 2001; Behr et al. 1999; Chan et al. 2003; Cho and Tiedje 2001; Dorrell et al. 2001; Dziejman et al. 2002; Fitzgerald et al. 2001; Gill et al. 2002; Leonard et al. 2003; Murray et al. 2001; Porwollik et al. 2002; Salama et al. 2000; Israel et al. 2001; Rajashekara et al. 2004). The strains analyzed using this approach range from intraspecies comparisons such as Campylobacter jejuni, Vibrio cholerae and Staphylococcus aureus (Dorrell et al. 2001; Dziejman et al. 2002; Fitzgerald et al. 2001) to interspecies comparisons such as Sodalis glossinidius versus an Escherichia coli array, Salmonella bongori versus a Salmonella enterica array, Shewanella species versus Shewanella oneidensis and E. coli arrays and Brucella species versus a Brucella melitensis array (Akman et al. 2001; Chan et al. 2003; Murray et al. 2001; Rajashekara et al. 2004).
In this study, we used both versions of the S. coelicolor genome microarrays to compare the gene complements of the three Streptomyces species and one Kitasatospora species. The genus Kitasatospora is closely related to the genus Streptomyces in terms of morphology, chemical taxonomy and small subunit ribosomal RNA sequence analysis. Thus, the choice of a species from this genus acts as a potential outgroup in terms of overall genome structure. In terms of genes that are conserved, the types of genes of particular interest include genes involved in secondary metabolism, genes involved in chromosome replication, genes in the terminal regions of the chromosome, sigma factors, genes involved in differentiation and hypothetical genes. In terms of gene absence, the distribution of such genes along the chromosome and the apparent absence of any major housekeeping genes in a specific species are of interest. This information provides insights into genes that make up the core complement for a member of the Streptomyces and into which genes are central to defining a Streptomyces species.

Materials and methods

16S phylogeny

The phylogenetic analysis was carried out on selected small subunit 16S ribosomal RNA gene sequences obtained from the Ribosomal Database Project-II Release 9 (http://www.rdp.cme.msu.edu/index.jsp) and aligned using CLUSTALX (Thompson et al. 1997). The tree was constructed using the Neighbor-Joining algorithm from the same program. In the case of S. maritimus, the taxonomy of the strain was confirmed by DNA sequencing of the 16S ribosomal RNA gene.

Arrays

Two series of arrays that cover about 97% of the complete genome of Streptomyces coelicolor A3(2) (Lum et al. 2004; http://www.surrey.ac.uk/SBMS/Fgenomics/Microarrays/index.html) were used in this study. Both arrays are PCR arrays, but from different sources, namely Stanford University, USA and the University of Surrey, UK and made up of different PCR products. The Stanford array as used in this study contained sequences covering 7603 open reading frames. The Surrey microarray is made up of 7,758 unique PCR amplified sequences, 7,563 from the chromosome and 195 from SCP1. There are an additional 376 non-unique, alternative and cross-hybridizing sequences that are also spotted on to the array together with no probe spots and control spots. The two types of arrays were used to improve validation with a system using heterologous hybridization; however, only the University of Surrey array was hybridized and analyzed in duplicate. The major difference between the two arrays was that the Surrey array did not include a number of transposition element related genes, although there were other overlap differences. The sequences of the PCR products are not available for either array due to intellectual property protection requirements.

Strains and growth conditions

S. coelicolor A3(2) (SCP1+) 104, S. avermitilis ATCC 31267, S. cattleya ATCC 35852, S. maritimus Yang-Ming and K. aureofaciens ATCC 10762 were used in these studies. Fresh spores were collected and mycelium cultured in TSB liquid medium with 0.5% glycine at 30°C overnight.

Preparation of labeled DNA

Genomic DNA from a stationary phase culture was purified by the salting out procedure (Pospiech and Neumann 1995) and sonicated to < 2 kb. Four to six micrograms of sonicated genomic DNA were used as template and this was denatured in the presence of 12 μg of 72%-GC-content random hexamers in a total volume of 25 μl at 100°C for 10 min. The mixture was then snap-cooled on ice before adding the remaining reaction components: 1.5 μl of Cy3-dCTP or Cy5-dCTP (Amersham Pharmacia Biotech), 4 μl Klenow fragment (NEB #212), 5 μl Klenow buffer, 0.5 μl dNTP (4 mM dATP, 4 mM dTTP, 10 mM dGTP, and 0.2 mM dCTP), and 14 μl ddH2O. The random primed labeling reaction was carried out for 2–3 h at 37°C. Buffer exchange, purification and concentration of the DNA products were accomplished by three cycles of diluting the reaction mixture in 0.5 ml TE buffer (10 mM Tris and 1 mM EDTA pH 8.0) and filtering through a Microcon-30 microconcentrator (Millipore).

Microarray hybridization and data analysis

The two DNA pools to be compared were mixed and applied to an array in a hybridization mixture that contained 3.68 × SSC, 0.18% SDS, and 1 μg yeast tRNA (total 16.3 μl), which had been heated at 100°C for 5 min before being applied to the array. Hybridization took place under a glass coverslip sealed by glue in a humidified Omnislide (Thermo Hybaid) at 60°C for 12–14 h. The slides were washed, dried and scanned for fluorescence using a GenePix 4000B scanner (Axon Instruments). Average signal intensity and local background measurements were obtained for each spot on each array using GenePix Pro software. The dataset was screened for aberrant spots and these were eliminated from the analysis after manual checking. Most genes are present in duplicate on the two arrays and the signal from each pair of spots was input into the ScanAlyze program (Eisen et al. 1998; Gollub et al. 2003). The data were then processed into a mean log2 Cy3/Cy5 ratio format. The dataset was normalized for each array separately and exported to Excel where, after checking the alignment of the datasets from each array, a mean signal for each common gene was calculated. Genes that were absent from either array, mostly transposon related genes in the University of Surrey array, were not included in the analysis. Based on Bentley et al. (2002), the mean signal and standard deviation for the core region of genes from SCO2050 to SCO5800 were calculated. The standard deviation was used to set a cut-off for gene absence at 2SD below the core mean. The microarray data are presented relative to the S. coelicolor standard in two ways: either as a color plot of the genes, produced with the program Treeview (Eisen et al. 1998), where green represents a negative hybridization signal, black represents an equal hybridization signal and red indicates a positive hybridization signal, or as numeric values for the signal from each gene. The microarray data for the four species described here and additional unpublished species can be requested via rkirby@ym.edu.tw.
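The cut-off described above can be illustrated with a minimal Python sketch. It is not the authors' pipeline (which used ScanAlyze and Excel); the function names, the toy signal values, the use of the sample standard deviation and the sign convention (lower values mean weaker hybridization of the test species relative to S. coelicolor) are all assumptions made for illustration.

```python
import math
import re
from statistics import mean, stdev

# Core region boundaries taken from the text (SCO2050 to SCO5800).
CORE_FIRST, CORE_LAST = 2050, 5800

def gene_number(gene_id):
    """Extract the numeric part of an identifier such as 'SCO2050'."""
    match = re.match(r"SCO(\d+)", gene_id)
    if match is None:
        raise ValueError(f"unexpected gene identifier: {gene_id}")
    return int(match.group(1))

def log2_ratio(test_intensity, reference_intensity):
    """log2 ratio of two background-corrected spot intensities (assumed orientation)."""
    return math.log2(test_intensity / reference_intensity)

def absence_cutoff(signals, n_sd=2.0):
    """Core mean minus n_sd standard deviations (sample SD is an assumption)."""
    core = [s for g, s in signals.items()
            if CORE_FIRST <= gene_number(g) <= CORE_LAST]
    return mean(core) - n_sd * stdev(core)

def call_absent(signals, n_sd=2.0):
    """Flag each gene whose mean signal falls below the core-based cut-off."""
    cutoff = absence_cutoff(signals, n_sd)
    return {gene: signal < cutoff for gene, signal in signals.items()}

# Hypothetical mean log2 signals for a handful of genes (not real data).
toy_signals = {
    "SCO2050": 0.0, "SCO2051": 0.2, "SCO2052": -0.2,
    "SCO2053": 0.1, "SCO2054": -0.1, "SCO0001": -3.0,
}
absent_calls = call_absent(toy_signals)
```

In this toy example only SCO0001 falls below the cut-off; as the text notes, such a call cannot distinguish a missing gene from a highly divergent homologue.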

Comparison of the microarray dataset for S. avermitilis with the complete genome sequence

The nucleotide sequences for all the identified open reading frames from the S. avermitilis genome sequence (Ikeda et al. 2003) were compared with the genome sequence of S. coelicolor using blastn, limiting the output to the best match. This E value dataset for the genes was then aligned with the S. avermitilis microarray dataset and the comparison plotted as a scatterplot. Genes showing disagreement between the two datasets were identified based on a 2 standard deviation (SD) cutoff for the microarray dataset and an E-10 cutoff for the blastn E value.
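The comparison just described amounts to a two-by-two classification of each gene. The sketch below shows one way to tabulate it in Python; the function and key names are hypothetical, the data are invented, and treating a gene with no blastn hit as absent is an assumption rather than something stated in the text.

```python
def compare_calls(array_signal, blast_evalue, array_cutoff, evalue_cutoff=1e-10):
    """Count agreement and disagreement between microarray and blastn calls.

    array_signal: {gene: mean log2 signal}
    blast_evalue: {gene: best-hit E value}; genes with no hit may be omitted
    """
    counts = {
        "both_present": 0,
        "both_absent": 0,
        "array_absent_blast_present": 0,
        "array_present_blast_absent": 0,
    }
    for gene, signal in array_signal.items():
        array_present = signal >= array_cutoff
        # Assumption: no blastn hit is treated as an infinitely poor E value.
        blast_present = blast_evalue.get(gene, float("inf")) <= evalue_cutoff
        if array_present and blast_present:
            counts["both_present"] += 1
        elif not array_present and not blast_present:
            counts["both_absent"] += 1
        elif blast_present:
            counts["array_absent_blast_present"] += 1
        else:
            counts["array_present_blast_absent"] += 1
    return counts

def agreement(counts):
    """Fraction of genes on which the two methods agree."""
    total = sum(counts.values())
    return (counts["both_present"] + counts["both_absent"]) / total

# Hypothetical values for four genes (not real data).
toy_array = {"g1": 0.1, "g2": -2.0, "g3": -1.5, "g4": 0.3}
toy_blast = {"g1": 1e-50, "g2": 1e-40, "g3": 0.5}
toy_counts = compare_calls(toy_array, toy_blast, array_cutoff=-0.5)
```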

Analysis of gene presence across the chromosome

A graphical display was created by counting the number of genes detected as present, based on the 2SD cutoff applied to each normalized microarray dataset, using a moving window of 10 genes in steps of one.
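A moving-window count of this kind can be sketched as follows. This is an illustrative Python version with a hypothetical function name and made-up presence calls; it assumes the genes are already ordered by their position on the S. coelicolor chromosome.

```python
def sliding_presence_counts(present, window=10):
    """Count genes called present in each window of `window` consecutive genes.

    `present` is a list of booleans in chromosomal order; the window moves
    one gene at a time, so the result has len(present) - window + 1 entries.
    """
    if len(present) < window:
        return []
    count = sum(present[:window])
    counts = [count]
    for i in range(window, len(present)):
        # Add the gene entering the window, drop the gene leaving it.
        count += present[i] - present[i - window]
        counts.append(count)
    return counts

# Hypothetical calls: ten conserved genes followed by five absent ones.
toy_present = [True] * 10 + [False] * 5
toy_counts = sliding_presence_counts(toy_present)
```

Each value ranges from 0 to 10 and can be plotted against gene position to give a profile of the kind shown in Fig. 3.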

Results and discussion

Comparison of S. avermitilis, S. cattleya, S. maritimus and K. aureofaciens with the S. coelicolor genome

After spot and data validation, a total of 7,083 open reading frames were included in this analysis, these being present on both types of array and giving an analyzable signal on all three arrays. Validity in this study was initially obtained by using microarrays from two sources that presumably use different PCR products to create the arrays. In addition, the University of Surrey array was hybridized and analyzed in duplicate. In terms of gene absence based on two standard deviations as described in the “Materials and methods” section, the agreement between the Stanford array and the duplicated University of Surrey array was about 95%, while the agreement between the two University of Surrey arrays was about 98%. In order to minimize the effect of divergent individual array spots, the signal mean for each gene from the three arrays was used throughout this study. In this study, the genomic content of three Streptomyces species and one Kitasatospora species with divergent taxonomy, antibiotic production and SSU rRNA sequence are compared using two different S. coelicolor microarrays. It is clear that there are inherent limitations to this approach. Firstly, only gene absence or divergence rather than the presence of new genes can be identified. Secondly, it is not possible to clearly separate the absence of a gene from the presence of a divergent homologue of the same gene. Finally, although the order of the genes in S. coelicolor and S. avermitilis is known from their complete genome sequences and is well conserved, this does not mean that the synteny of most of them is conserved in other Streptomyces species. However, the detection of synteny across Actinobacteria including Mycobacterium tuberculosis, Corynebacterium glutamicum and other species (Bentley et al. 2002 and unpublished data) supports a conserved central core structure to the genomes of the Actinomycetes and, a priori, most Streptomyces.
Thus, although major chromosomal reorganizations in the central core region cannot be detected by microarray data, a basic chromosomal structure can be assumed as a first approximation; namely, a linear chromosome with variable terminal regions and a relatively well conserved core region. When the pooled data from the two arrays for the four species were analyzed using Cy-3 labeled S. coelicolor A3(2) chromosomal DNA compared to heterologous Cy-5 labeled chromosomal DNA, a wide range of signal variation could be noted and this is shown in Supplementary Fig. 1. The SSU rRNA tree places the divergence of these four strains from S. coelicolor as S. cattleya > K. aureofaciens > S. maritimus > S. avermitilis (Fig. 1). Gene differences were present in the order S. cattleya > K. aureofaciens > S. avermitilis > S. maritimus based on a −2SD cutoff below the mean signal for the core region genes. The microarray data thus show general agreement with the tree, with S. cattleya and K. aureofaciens being more divergent and the other two species being relatively closer. It is interesting to note that the Kitasatospora species used in this study, K. aureofaciens, shows the same general structure as the Streptomyces species. This is not unexpected and confirms the close relationship between Kitasatospora and Streptomyces and agrees with the SSU rRNA tree data.
Fig. 1

SSU rRNA phylogenetic tree of selected Streptomyces species and other Actinomycetes that have known complete genome sequences. The species analyzed by microarray are indicated in bold

Further support for the reliability of the data comes from a comparison of the blastn E values for all genes and the microarray data as shown in the Fig. 2 scatterplot. This indicated that 232 out of 6,832 genes show gene absence by microarray when they seem to be present by blastn and 268 out of 6,832 genes show gene presence by microarray when they seem to be absent by blastn; these results are both based on cutoffs of −2SD for the microarray data and −10 for the E value. This gives an overall reliability for S. coelicolor compared to S. avermitilis of 93%. Potential sources of error include, for the former type, poor spotting of the array at that point and the choice of the PCR product sequence (the comparison is with the whole gene, as the PCR product sequences are not available) and, for the latter type, cross-hybridization between multiple gene copies or an unreliable hybridization signal due to poor washing in that area. However, the results for S. avermitilis clearly support the reliability of the genome comparisons produced by this study.
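The 93% figure follows directly from the two disagreement counts quoted above. The short Python check below simply reproduces that arithmetic; the variable names are illustrative.

```python
# Counts quoted in the text for the S. avermitilis comparison.
total_genes = 6832
array_absent_blast_present = 232
array_present_blast_absent = 268

# Genes on which the microarray and blastn calls disagree.
disagreements = array_absent_blast_present + array_present_blast_absent

# Fraction of genes on which the two methods agree.
agreement_fraction = (total_genes - disagreements) / total_genes
agreement_percent = round(agreement_fraction * 100, 1)
```

The result is 92.7%, which the text rounds to 93%.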
Fig. 2

Scatterplot comparing gene presence/absence based on the microarray data with gene presence/absence based on blastn between Streptomyces coelicolor and Streptomyces avermitilis. See “Materials and methods” for details. Box A and Box C include genes identified as absent in S. avermitilis by the microarray dataset but present using blastn and genes present in S. avermitilis using blastn, but identified as absent by the microarray dataset. Box B includes genes that are correctly identified as absent by the microarray dataset

Distribution of gene differences across the complete chromosome of S. coelicolor for all four other Streptomyces species

The whole chromosome microarray dataset supports the following structure for the Streptomyces chromosome. Based on Fig. 3 and Supplementary Fig. 1, there is a central core of conserved, probably syntenous, genes that can be found across many Actinomycetes and in the S. coelicolor genome this reaches from about SCO2050 to SCO5800 (Bentley et al. 2002). The regions between SCO1100 and SCO2050 and between SCO5800 and SCO7600 are also quite well conserved between the Streptomyces studied here as well as being syntenous between the S. coelicolor and S. avermitilis genome sequences. However, they are not present when the genomes of these two species are compared bioinformatically to other divergent Actinomycetes. These two regions seem to be two genus specific areas. Figure 3 also clearly shows that gene conservation drops off dramatically in the terminal regions. The regions from the left terminus to SCO1100 and from SCO7600 to the right terminus show much higher gene divergence than the rest of the chromosome. This agrees with the results of the S. ambofaciens sequencing studies of that species’ terminal regions (Choulet et al. 2006a, b). The gene conservation levels averaged across the four species are as follows: left terminal region (SCO0001–SCO1100) 40.9%; left genus specific region (SCO1101–SCO2050) 84.8%; core region (SCO2050–SCO5800) 79.4%; right genus specific region (SCO5801–SCO7600) 69.6% and right terminal region (SCO7601–SCO7845) 50.3%. It is noticeable that neither the size nor the distribution of conserved genes is symmetrical between the two terminal regions or the two genus specific regions. Notably, the left genus specific region actually has a higher frequency of gene conservation than the core region as a whole, and the left terminal region is much larger than the right terminal region. This possibly represents horizontal exchange of terminal regions by recombination between strains/species that involves only one terminal region. Such an event would give rise to asymmetric gene conservation similar to that detected here.
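Region-level percentages of this kind can be derived from per-gene presence calls. The Python sketch below is illustrative only: the function names and toy calls are hypothetical, SCO2050 (which the text lists in both the left genus specific region and the core) is assigned to the left genus specific region as an assumption, and averaging the per-species percentages is one plausible reading of "averaged across the four species".

```python
import re

# Region boundaries from the text; the core is started at SCO2051 so that
# regions do not overlap (an assumption, see the note above).
REGIONS = [
    ("left terminal", 1, 1100),
    ("left genus specific", 1101, 2050),
    ("core", 2051, 5800),
    ("right genus specific", 5801, 7600),
    ("right terminal", 7601, 7845),
]

def gene_number(gene_id):
    """Extract the numeric part of an identifier such as 'SCO2050'."""
    match = re.match(r"SCO(\d+)", gene_id)
    if match is None:
        raise ValueError(f"unexpected gene identifier: {gene_id}")
    return int(match.group(1))

def conservation_by_region(presence_by_species):
    """Mean percentage of genes called present per region, averaged over species.

    presence_by_species: {species: {gene_id: True/False}}
    """
    result = {}
    for name, first, last in REGIONS:
        per_species = []
        for calls in presence_by_species.values():
            in_region = [p for g, p in calls.items()
                         if first <= gene_number(g) <= last]
            if in_region:
                per_species.append(100.0 * sum(in_region) / len(in_region))
        if per_species:
            result[name] = sum(per_species) / len(per_species)
    return result

# Hypothetical presence calls for two species (not real data).
toy_calls = {
    "species_1": {"SCO0001": True, "SCO0002": False, "SCO3000": True},
    "species_2": {"SCO0001": False, "SCO0002": False, "SCO3000": True},
}
toy_conservation = conservation_by_region(toy_calls)
```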
Fig. 3

Analysis of “gene presence” across the four species. Created using a moving window of 10 genes and counting the number of genes with a microarray signal >2SD below the mean for the core region genes. The Y axis is the count for “gene presence”

In the Karoonuthaisiri et al. (2005) study of regional gene expression in S. coelicolor, the boundaries for higher transcript levels during vegetative growth were placed at 1.5 Mb for the left arm and 2.3 Mb for the right arm. The former is midway across the left genus specific region and the latter approximately agrees with the boundary between the core and the right genus specific region. As the core region boundaries are also defined in terms of synteny with the Mycobacterium and Corynebacterium genomes as well as the data presented here, this supports the idea that the S. coelicolor chromosome structure is asymmetrical with respect to both gene conservation and gene function. It should be noted that, because we are using only S. coelicolor as the source of the array data, the results do not imply that the genomes of S. cattleya, S. maritimus and K. aureofaciens are asymmetric. However, the S. avermitilis genome is also known to be asymmetric (Ikeda et al. 2003). Notably, there are 22 identifiable regions where all four species show a significant degree of concurrent gene absence outside of the terminal regions (Table 1). The regions of high gene divergence are shown in detail in Supplementary Fig. 2. Previously, Bentley et al. identified 14 regions in the S. coelicolor chromosome that were potentially laterally acquired. This analysis pinpoints all of these regions quite accurately, usually to within one or two open reading frames. This suggests that the designation of the other eight regions as potential lateral transfer regions is probably quite robust. It also supports the usefulness of the microarray approach.
All 22 regions were analyzed using Frame Plot (Artemis v7.1) and, except for region B, they show abnormalities for at least some of the open reading frames compared to the G+C bias expected for the 1st, 2nd and 3rd codon positions of Streptomyces genes. Eight regions, A, B, F, I, M, O, Q and T, contain transposon related genes near to or within the region. Four regions, H, N, P and R, are flanked by highly conserved genes such as ribosomal protein or sigma factor genes, which could encourage interspecific recombination. Finally, five regions consist largely of hypothetical proteins with no similarity to any known protein as yet; these regions are G, J, L, S and W. Region L is particularly interesting as there is a central core of conserved genes flanked by two subregions that are poorly conserved. One of these genes is a putative spore septum determining protein, while the rest have unknown functions. Taken as a whole, the results suggest that S. coelicolor may have recently acquired all these regions either by transposition or by interspecific/intraspecific recombination (Wolf et al. 2002; Zhang et al. 2002). It is also unlikely that they were acquired from any of the four species studied here. There are other regions that could potentially be identified as lateral transfer positions using less stringent criteria, and a wider screening of genomes might help to support these additional regions as being involved in horizontal transfer. In addition, such a wider screen might allow the identification of possible origins of these regions in other species.
Table 1

Areas of the Streptomyces coelicolor genome identified as potentially horizontally transferred regions based on microarray parallel gene absence in all four species

Region	Area of chromosome	Genes missing [a]	% [a]	Significant features
Region A	SCO0996–SCO1010	17/29	59	Integrase, insertion sequence
Region B	SCO2860–SCO2879	53/76	69	Rifampin ribosyl transferase
Region C	SCO3249–SCO3288	94/156	60	Integrase, excisionase
Region D	SCO3471–SCO3538	198/268	73	Agarase
Region E	SCO3584–SCO3599	30/60	50	
Region F	SCO3929–SCO3937	22/32	68	Integrase/recombinase, fstK-like
Region G	SCO3980–SCO4001	56/64	88	Hypothetical proteins
Region H	SCO4052–SCO4066	132/144	92	Boundary dnaZ gene
Region I	SCO4210–SCO4223	37/54	69	
Region J	SCO4247–SCO4257	21/36	58	Hypothetical proteins
Region K	SCO4340–SCO4354	34/40	85	Integrase, DNA invertase
Region L	SCO4509–SCO4547	106/144	74	Hypothetical proteins
Region M	SCO4613–SCO4631	40/68	59	Integrase, excisionase
Region N	SCO4686–SCO4700	24/44	55	Boundary ribosomal proteins operon
Region O	SCO5323–SCO5351	57/80	71	Integrase, excisionase
Region P	SCO5605–SCO5620	46/64	72	Boundary sigma factor whiG
Region Q	SCO5632–SCO5644	40/44	91	Integrase, korSA
Region R	SCO5715–SCO5735	57/72	79	Boundary ribosomal protein, bldB
Region S	SCO5906–SCO5924	28/56	50	Hypothetical proteins, xylanase
Region T	SCO6372–SCO6406	82/100	82	Recombinase
Region V	SCO6607–SCO6648	62/120	52	Helicase
Region W	SCO6806–SCO6953	73/133	55	Hypothetical proteins

[a] This is calculated from the available normalized gene dataset from the two microarrays

Gene conservation in the terminal regions of the four Streptomyces species

As has been mentioned earlier, the two regions at either terminus are much less well conserved than the central core region; these extend from SCO0001 to about SCO1100 on the left arm of the chromosome and from about SCO7600 to SCO7845 on the right arm. The boundaries of these regions are not absolutely clear-cut, but what is clear is that as one moves towards the centre of the genome, gene conservation increases beyond these points. This can be clearly seen in Fig. 3 where the gene conservation is plotted using a moving window for the four species, but it is also clear that the lack of conservation is not uniform across the terminal regions and that areas of higher gene conservation can be identified. The significant interest in the terminal regions arises because the genomes of all Streptomyces that have been examined are linear and the problem of how the termini of such a molecule replicate is of particular importance. Recent studies have indicated that two genes in particular, tpgA (SCO7734) and tapA (SCO7733), are involved in this process (Yang et al. 2002; Bao and Cohen 2001). tpgA, encoding the terminal protein that covalently binds to the termini of many linear Streptomyces replicons, is conserved across all four species. In S. avermitilis this is also true based on sequence data and, furthermore, there are multiple copies of tpgA, unlike in S. coelicolor. The signal level of the S. avermitilis gene at +1.2 supports the presence of these multiple copies. The signal levels for the other three species are between about −0.3 and −0.1, which supports a single slightly diverging copy of this gene in these species. However, if two copies are present then the sequence divergence may be higher. Furthermore, tapA is also conserved except in S. maritimus, where it seems to be more divergent at −0.8.
It should be noted that the presence of these two genes is not a criterion for defining a genome with a linear topology, but the presence of one or both is certainly suggestive (Dary et al. 2000; Wang et al. 1999; Huang et al. 1998; Lin and Chen 1997). Finally, ttrA is known to be involved in chromosomal transfer and is found very close to the telomere of S. coelicolor and S. avermitilis. This is also conserved in all four species, suggesting that genetic exchange is highly important in Streptomyces and related species. The two terminal regions encompass the major areas that are prone to deletion in many Streptomyces species and are therefore not essential except for linear terminal replication and genetic exchange. Given the relatively low conservation of genes in these regions, genes that are present in all four species represent an interesting class. A full list of all genes conserved in all four species in the terminal regions is provided in Tables 2a and 2b. There are 36 hypothetical genes that show high similarity in the two terminal regions. Analysis of these groups of conserved genes using Artemis v7 (The Sanger Institute) identifies a total of five groups of genes that may make up possible single transcriptional units. These are SCO0551–SCO0552, SCO0705–SCO0710, SCO1021–SCO1024, SCO7677–SCO7680 and SCO7682–SCO7688. In addition to TpgA and TapA, it is possible that there are other genes involved in terminal replication and these may be among the conserved genes present in the terminal regions. Although possible candidates can be deduced from a direct comparison of the two known Streptomyces genome sequences, they are many in number. Using the microarray analysis of the Actinomycetes in this study, the candidates can be reduced significantly.
From candidates in Tables 2a and 2b, two possible transcriptional units seem to be potential candidates for involvement in terminal replication; these are SCO1021–SCO1024 (hypothetical proteins), and SCO7677–SCO7689 (including hypothetical proteins, an AMP-binding ligase and membrane proteins). Gene knockout studies may be able to identify possible functions for these and other gene candidates, especially the other hypothetical proteins that are conserved in these four species.
Table 2

Genes from the (a) left terminal, (b) right terminal region of Streptomyces coelicolor showing microarray conservation in all four species

(a)
SCO0002 ttrA
SCO0142 hypothetical protein
SCO0150 hypothetical protein
SCO0201 putative integral membrane protein
SCO0232 hypothetical protein
SCO0415 hypothetical protein
SCO0443 hypothetical protein
SCO0452 putative SIR2-like regulatory protein
SCO0466 araC family transcriptional regulator
SCO0471 putative araC family transcriptional regulator
SCO0496 putative iron-siderophore permease transmembrane protein
SCO0536 hypothetical protein
SCO0538 probable sugar transporter sugar binding lipoprotein
SCO0544 hypothetical secreted protein
SCO0546 pyruvate carboxylase
SCO0551 putative histidine kinase protein
SCO0552 putative response regulator
SCO0565 putative polyprenyl synthetase
SCO0584 putative cytochrome
SCO0591 putative lysozyme precursor
SCO0592 hypothetical protein
SCO0614 hypothetical protein
SCO0619 putative membrane protein
SCO0637 hypothetical protein
SCO0690 possible oxidoreductase
SCO0695 hypothetical protein
SCO0701 hypothetical protein
SCO0707 putative branched-chain amino acid ABC transport permease
SCO0708 putative branched-chain amino acid ABC transport protein
SCO0709 putative branched-chain amino acid transport ATP-binding protein
SCO0710 putative branched-chain amino acid transport ATP-binding protein
SCO0765 secreted endoglucanase
SCO0779 conserved hypothetical protein
SCO0788 hypothetical protein
SCO0790 putative hydrolase
SCO0800 putative TetR-family transcriptional regulatory protein
SCO0802 hypothetical protein
SCO0810 putative ABC transporter permease
SCO0830 putative penicillin-binding protein
SCO0839 putative transmembrane transport protein
SCO0840 putative marR-family transcriptional regulator
SCO0854 hypothetical protein
SCO0883 polypeptide deformylase
SCO0887 putative TetR-family transcriptional regulator
SCO0894 putative membrane protein
SCO0895 RNA polymerase principal sigma factor HrdC
SCO0900 putative transmembrane efflux protein
SCO0905 putative membrane protein
SCO0907 putative dehydrogenase
SCO0925 putative lysR-family transcriptional regulator
SCO0926 hypothetical protein
SCO0931 putative secreted proline-rich protein
SCO0942 putative RNA polymerase sigma factor
SCO0943 hypothetical protein
SCO0947 putative integral membrane protein
SCO0949 hypothetical protein
SCO1011 conserved hypothetical protein
SCO1015 hypothetical protein
SCO1018 putative isomerase
SCO1021 hypothetical protein
SCO1022 hypothetical protein
SCO1024 hypothetical protein
SCO1034 putative tetR-family regulatory protein
SCO1036 putative phosphotriesterase-family protein
SCO1040 putative DNA repair protein
SCO1041 hypothetical protein
SCO1043 putative transcriptional regulatory protein
SCO1044 putative secreted protein
SCO1046 putative metal transporter ATPase
(b)
SCO7649 putative two-component system sensor kinase
SCO7677 putative secreted solute-binding protein
SCO7678 putative metal transport integral membrane protein
SCO7679 putative transport system integral membrane protein
SCO7680 putative ABC transporter ATP-binding protein
SCO7681 putative AMP-binding ligase
SCO7682 putative non-ribosomal peptide synthase
SCO7684 conserved hypothetical protein
SCO7685 conserved hypothetical protein
SCO7687 putative thioesterase
SCO7688 hypothetical protein
SCO7689 putative ABC transporter ATP-binding protein
SCO7718 hypothetical protein
SCO7720 hypothetical protein
SCO7724 hypothetical protein
SCO7734 Tpg protein

Bold indicates groups of consecutive genes that may form a single transcriptional unit


Conservation of functional groups of genes across the four Streptomyces species

One approach to analyzing genetic variation across these four Streptomyces species is to look at the functional groupings of genes. Such an approach should allow the identification of strain-specific versus genus-specific genes, especially when there are large numbers of genes with related functions, such as sigma factors, or where there are two copies of a gene, such as ftsK. However, because microarray data paint a broad picture across a whole genome, it is essential that, once a gene or genes have been targeted based on microarray data, experimental verification by other means is carried out. Nevertheless, it is hoped that this dataset will help researchers prioritize their gene targets better. The genes of the S. coelicolor chromosome have been grouped based on the scheme of M. Riley and colleagues for E. coli (ecocyc.org) modified for S. coelicolor (http://www.sanger.ac.uk/Projects/S_coelicolor/scheme.shtml) and we used this classification. The genes involved in ribosomal protein synthesis and modification should be highly conserved and the results indicate that almost all of them are present in all four species (Table 3; Supplementary Fig. 4). The only exceptions are SCO0436, SCO0509, SCO3430 and SCO3909 in S. avermitilis and SCO4716 and SCO5514 in K. aureofaciens. Of these genes, SCO0436, SCO0509 and SCO5514 represent duplicate genes in the S. coelicolor genome and therefore the choice of the microarray sequence will have had a significant effect on the heterologous hybridization. There is no obvious explanation for the failure of the other two genes to hybridize but, as a whole, this dataset supports the integrity of the array system for analysis of genome content, as these genes are scattered across the whole Streptomyces genome.
Table 3

Microarray data for ribosomal proteins from the four species

Gene | S. avermitilis | S. cattleya | S. maritimus | K. aureofaciens
SCO0436 probable 50S ribosomal protein | 0.35 | −0.13 | 0.44 | −0.29
SCO0569 putative 50S ribosomal protein fragment | 0.75 | 0.63 | −0.42 | 0.27
SCO1150 50S ribosomal protein L31 | 0.56 | −0.47 | −0.24 | 0.14
SCO1505 30S ribosomal protein S4 | 0.36 | −0.35 | 0.76 | −0.31
SCO1598 50S ribosomal protein L20 | 1.50 | 0.63 | 0.89 | 0.34
SCO1599 50S ribosomal protein L35 | 0.23 | 0.41 | 0.49 | −0.51
SCO1998 30S ribosomal protein S1 | 1.39 | 0.77 | 0.99 | 0.91
SCO2563 30S ribosomal protein S20 | 0.34 | −0.34 | 0.77 | 0.33
SCO2596 50S ribosomal protein L27 | 1.01 | 0.59 | 0.08 | 0.82
SCO2597 ribosomal protein L21 | 0.27 | 0.08 | 0.64 | −0.18
SCO3124 ribosomal L25p family protein | 0.39 | 0.31 | −0.23 | −0.89
SCO3427 putative 50S ribosomal protein L31 | 0.24 | 0.37 | 0.22 | 0.60
SCO3428 putative 50S ribosomal protein L33 | 0.15 | 0.28 | 0.54 | 0.09
SCO3429 putative 50S ribosomal protein L28 | 0.68 | 0.16 | 0.55 | 0.45
SCO3430 putative 30S ribosomal protein S14 | 0.80 | 0.10 | −0.17 | −0.19
SCO3880 putative 50S ribosomal protein L34 | 1.02 | 0.13 | 0.24 | 0.71
SCO3906 putative 30S ribosomal protein S6 | 0.70 | 0.94 | 1.11 | −0.18
SCO3909 putative 50S ribosomal protein L9 | 1.30 | 0.01 | 0.87 | 1.27
SCO4648 50S ribosomal protein L11 | 1.67 | 0.43 | 0.90 | 0.47
SCO4649 50S ribosomal protein L1 | 0.62 | 0.53 | −0.25 | 0.92
SCO4652 50S ribosomal protein L10 | 0.42 | −0.43 | 0.64 | −0.35
SCO4653 50S ribosomal protein L7/L12 | 1.22 | 1.03 | 0.79 | 0.45
SCO4659 30S ribosomal protein S12 | 0.74 | 0.65 | 0.70 | −0.53
SCO4660 30S ribosomal protein S7 | 0.57 | −0.23 | 0.68 | 0.12
SCO4701 30S ribosomal protein S10 | 1.19 | 1.17 | 1.16 | −0.21
SCO4702 50S ribosomal protein L3 | 0.92 | 0.02 | 0.84 | 0.49
SCO4703 50S ribosomal protein L4 | 1.16 | 0.91 | 0.59 | 0.23
SCO4704 50S ribosomal protein L23 | 0.85 | 1.44 | 1.24 | 0.36
SCO4705 50S ribosomal protein L2 | 0.85 | −0.12 | 0.84 | 0.22
SCO4706 30S ribosomal protein S19 | 0.06 | 0.24 | 0.32 | −0.26
SCO4707 50S ribosomal protein L22 | 0.96 | 0.69 | 0.64 | 0.15
SCO4708 30S ribosomal protein S3 | 1.15 | 0.38 | 0.78 | 1.07
SCO4709 50S ribosomal protein L16 | 0.52 | 0.67 | 1.09 | 1.26
SCO4710 50S ribosomal protein L29 | 0.33 | −0.06 | −0.19 | 0.41
SCO4711 30S ribosomal protein S17 | 0.59 | 0.92 | 0.51 | −0.13
SCO4712 50S ribosomal protein L14 | 1.05 | 0.35 | 0.48 | 0.82
SCO4713 50S ribosomal protein L24 | 1.09 | 0.88 | 0.63 | 0.64
SCO4714 50S ribosomal protein L5 | 1.24 | 0.78 | 0.77 | 1.03
SCO4715 30S ribosomal protein S14 | 0.39 | 0.03 | 0.19 | 0.12
SCO4716 30S ribosomal protein S8 | 1.18 | −0.11 | 0.64 | 0.76
SCO4717 50S ribosomal protein L6 | 0.96 | 0.82 | 0.79 | −0.02
SCO4718 50S ribosomal protein L18 | 0.09 | 0.30 | 0.57 | 0.74
SCO4719 30S ribosomal protein S5 | 1.56 | 0.66 | 1.02 | −0.09
SCO4720 50S ribosomal protein L30 | 0.13 | 0.39 | 0.64 | 0.26
SCO4721 50S ribosomal protein L15 | 1.79 | 0.42 | 0.80 | 0.84
SCO4726 50S ribosomal protein L36 | 0.46 | −0.13 | 0.35 | −0.10
SCO4727 30S ribosomal protein S13 | 0.63 | −0.23 | 0.62 | −0.17
SCO4728 30S ribosomal protein S11 | 1.12 | 0.55 | 0.87 | 0.27
SCO4730 50S ribosomal protein L17 | 0.69 | 0.27 | 0.86 | 0.51
SCO4734 50S ribosomal protein L13 | −0.27 | 0.45 | 0.50 | 0.40
SCO4735 30S ribosomal protein S9 | 0.36 | −0.02 | 0.00 | −0.01
SCO5359 50S ribosomal protein L31 | 0.86 | 0.22 | 1.40 | 0.46
SCO5564 putative 50S ribosomal protein L28 | 0.60 | 0.28 | 0.21 | 0.51
SCO5591 30S ribosomal protein S16 | 0.44 | 0.03 | 0.57 | 0.60
SCO5595 50S ribosomal protein L19 | 0.77 | 0.78 | 1.54 | −0.03
SCO5624 30S ribosomal protein S2 | 1.16 | 0.48 | 1.52 | 0.30
SCO5736 30S ribosomal protein S15 | 0.70 | 0.41 | 0.79 | −0.46
Mean hybridization score for ribosomal protein genes | 0.67 | 0.35 | 0.61 | 0.18

Bold values indicate that the signal for that gene is more than 2SD below the mean core signal for that species and such a value is suggestive of either gene absence or very low similarity
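The flagging rule in this footnote (a signal more than 2SD below the mean core signal) can be written out as a short Python sketch. Everything concrete here is an assumption for illustration: the function name `flag_low_signals`, the use of the sample standard deviation, and the numbers, which are made-up values rather than data from this study. The paper does not list the core-region signals themselves, so they are represented by a hypothetical list.

```python
from statistics import mean, stdev


def flag_low_signals(core_signals, gene_signals, n_sd=2.0):
    """Return the cutoff and the genes whose signal falls below it.

    cutoff = mean(core) - n_sd * SD(core); the sample standard deviation
    is an assumption, since the text does not say which estimator was used.
    """
    cutoff = mean(core_signals) - n_sd * stdev(core_signals)
    flagged = {gene: s for gene, s in gene_signals.items() if s < cutoff}
    return cutoff, flagged


# Hypothetical core-region signals for one species (not data from the paper).
core = [0.9, 0.7, 0.8, 0.6, 1.0, 0.5, 0.75, 0.85]

# Hypothetical signals for three genes of interest in the same species.
genes = {"geneA": 0.90, "geneB": 0.10, "geneC": 0.50}

cutoff, flagged = flag_low_signals(core, genes)
print(f"cutoff = {cutoff:.3f}")
print("flagged as possibly absent or highly divergent:", sorted(flagged))
```

Because the cutoff is derived separately from each species' own core signal, the same raw value can be flagged in one species and not in another, which is why the rule is applied per column.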

Table 4 shows genes identified as possible sigma factors, anti-sigma factors and anti-sigma factor antagonists. The genes found in the central core region are more conserved. As would be expected, the major sigma factors such as hrdA, hrdB, hrdC and hrdD are conserved, as are many of the other studied sigma factors of S. coelicolor such as sigA, sigE, sigF, sigG, sigR, sigT and whiG. Overall, fewer regulatory genes from this group (anti-sigma factors and anti-anti-sigma factors) are conserved than sigma factors themselves. This analysis allows the identification of new candidate sigma factors for further study outside of the well studied ones, both within S. coelicolor and in other species. Overall, the results support the hypothesis that there is a core of sigma factors essential to keeping protein synthesis in Streptomyces running smoothly. The functionality of the rest may vary and include complete silence of some gene fragments, duplication of function, involvement in specific secondary metabolic activities and species/genus specific functions.
Table 4

Conservation across the four species of genes annotated as sigma factors or related proteins in Streptomyces coelicolor

Gene | S. avermitilis | S. cattleya | S. maritimus | K. aureofaciens | Conservation
SCO0037 putative sigma factor | 1.04 | 0.90 | 0.88 | 1.49 |
SCO0159 putative ECF sigma factor | 1.26 | 0.65 | 2.05 | 0.60 |
SCO0194 putative sigma factor | 0.87 | 0.35 | 0.61 | 0.52 |
SCO0255 putative transcriptional regulator | 0.64 | 0.37 | 0.99 | 0.46 |
SCO0414 putative RNA polymerase sigma factor | −0.05 | −0.28 | −0.15 | −0.22 | Conserved
SCO0598 putative anti anti sigma factor | 0.11 | 0.50 | −0.08 | 0.57 | Conserved
SCO0599 putative regulator of sig8 | 1.41 | 1.07 | 0.84 | 1.08 |
SCO0632 putative RNA polymerase sigma factor | −0.14 | 0.19 | 0.82 | 0.11 |
SCO0672 putative anti-sigma factor antagonist | −0.10 | 0.40 | −0.19 | 0.12 |
SCO0781 putative anti sigma factor antagonist | 0.79 | 0.86 | 1.06 | 0.83 |
SCO0803 putative RNA polymerase sigma factor | −0.25 | −0.09 | 0.51 | −0.01 |
SCO0864 probable ECF-family sigma factor | 0.74 | 0.86 | 1.04 | 0.50 |
SCO0866 probable ECF-family sigma factor | −0.13 | 0.19 | −0.28 | 0.23 | Conserved
SCO0869 putative anti-sigma factor antagonist | 0.59 | 0.90 | 1.23 | 0.90 |
SCO0895 RNA polymerase principal sigma factor HrdC | 0.52 | 0.43 | 1.12 | 0.34 | Conserved
SCO0942 putative RNA polymerase sigma factor | 0.34 | 0.81 | 0.61 | 0.45 | Conserved
SCO1263 putative ECF-sigma factor | −0.17 | −0.26 | 0.08 | 0.35 | Conserved
SCO1276 RNA polymerase ECF sigma factor | 1.22 | 0.55 | 0.60 | 0.88 |
SCO1564 putative RNA polymerase sigma factor | 1.04 | −0.33 | 1.34 | 0.47 |
SCO1723 putative RNA polymerase sigma factor | 0.19 | −0.39 | −0.15 | 0.52 | Conserved
SCO1876 putative RNA polymerase sigma factor | 1.01 | 0.93 | 0.77 | 0.50 |
SCO2465 RNA polymerase principal sigma factor | 0.76 | 0.74 | 0.97 | 0.64 | Conserved
SCO2639 putative RNA polymerase sigma factor | 0.74 | 0.22 | 0.01 | 0.27 | Conserved
SCO2954 putative RNA polymerase sigma factor | 1.12 | 0.51 | 0.94 | 0.46 |
SCO3066 putative regulator of Sig15 | 0.53 | 0.29 | 0.88 | 0.31 | Conserved
SCO3067 putative anti anti sigma factor | 0.74 | 1.25 | 0.89 | 0.42 |
SCO3068 putative RNA polymerase sigma factor | 0.33 | −0.06 | 1.07 | 0.41 | Conserved
SCO3202 RNA polymerase principal sigma factor | 0.98 | 0.29 | 1.40 | 0.19 | Conserved
SCO3323 putative RNA polymerase sigma factor | 0.76 | 0.49 | 1.08 | 0.27 | Conserved
SCO3356 ECF sigma factor 37 | −0.05 | 0.11 | 0.73 | 0.46 | Conserved
SCO3450 putative RNA polymerase sigma factor (ECF subfamily) | 0.16 | −0.09 | 0.79 | −0.23 |
SCO3548 putative anti-sigma factor | 0.57 | 0.50 | 0.49 | −0.06 |
SCO3549 bldG putative anti-sigma factor antagonist | −0.03 | −0.16 | 0.21 | −0.20 | Conserved
SCO3613 putative RNA polymerase sigma factor | 0.57 | 0.15 | 0.08 | 0.46 | Conserved
SCO3692 putative anti-sigma factor antagonist | 0.14 | 0.64 | −0.27 | 0.13 | Conserved
SCO3709 putative ECF sigma factor | 0.02 | 0.06 | −0.21 | 0.61 | Conserved
SCO3715 putative ECF sigma factor | 0.45 | 0.92 | −0.28 | −0.27 | Conserved
SCO3736 putative RNA polymerase ECF sigma factor | −0.06 | −0.29 | 0.28 | −0.09 | Conserved
SCO3892 putative RNA polymerase sigma factor | 0.68 | 0.21 | 0.80 | −0.27 | Conserved
SCO4027 putative anti sigma factor antagonist | −0.04 | 1.11 | 0.59 | 0.58 |
SCO4034 putative RNA polymerase sigma factor | 0.98 | 1.27 | 1.26 | 0.22 | Conserved
SCO4035 RNA polymerase sigma factor (fragment) | 1.04 | 1.27 | 0.88 | 0.45 | Conserved
SCO4146 putative ECF subfamily sigma factor | −0.23 | 0.53 | 0.58 | 0.16 | Conserved
SCO4409 putative RNA polymerase sigma factor | −0.10 | 0.12 | 0.10 | 0.64 | Conserved
SCO4410 putative anti anti sigma factor | 0.89 | 0.07 | 0.16 | 0.81 |
SCO4452 putative sigma factor | −0.17 | −0.21 | −0.08 | 0.27 | Conserved
SCO4769 ECF sigma factor | 0.09 | 0.38 | 0.68 | 0.61 |
SCO4864 putative ECF sigma factor | 0.02 | −0.34 | −0.12 | 0.73 |
SCO4866 putative ECF sigma factor | 0.12 | 0.19 | 0.09 | 0.38 | Conserved
SCO4895 putative ECF sigma factor | 0.32 | 1.15 | −0.11 | 0.56 |
SCO4938 putative ECF-sigma factor | 0.17 | 0.43 | 0.24 | 0.64 | Conserved
SCO4960 possible sigma factor | −0.04 | 0.05 | 0.66 | 0.60 |
SCO4996 putative RNA polymerase ECF sigma factor | 0.54 | −0.04 | 0.58 | 0.48 |
SCO5147 putative ECF-subfamily sigma factor | 0.39 | 0.35 | 0.79 | 0.59 |
SCO5217 anti-sigma factor | 0.47 | −0.05 | 0.28 | 0.80 |
SCO5244 anti-sigma factor | 0.32 | 0.54 | −0.37 | −0.29 |
SCO5386 putative anti-sigma factor antagonist | 0.15 | 0.37 | 0.00 | −0.07 | Conserved
SCO5621 RNA polymerase sigma factor WhiG | 0.79 | 0.92 | 0.64 | −0.27 | Conserved
SCO5820 hrdB, major vegetative sigma factor | 1.36 | 1.06 | 1.53 | 1.09 | Conserved
SCO5934 putative sigma factor | 0.07 | 0.25 | −0.58 | 0.17 |
SCO6239 putative sigma factor | 0.92 | 1.27 | 1.84 | 0.74 |
SCO6996 putative RNA polymerase sigma factor | −0.34 | 0.29 | 0.00 | −0.04 | Conserved
SCO7099 putative RNA polymerase sigma factor | −0.20 | 0.38 | 0.38 | 0.29 |
SCO7104 putative RNA polymerase sigma factor | 0.70 | −0.02 | 0.79 | 0.56 |
SCO7112 putative ECF-family RNA polymerase sigma factor | 0.35 | −0.25 | 1.65 | −0.38 |
SCO7144 putative ECF sigma factor | 0.62 | 0.13 | 0.91 | 0.34 |
SCO7314 probable RNA polymerase sigma factor | −0.25 | −0.11 | 0.23 | 0.46 | Conserved
SCO7323 anti-sigma factor antagonist | 0.30 | −0.01 | 0.23 | 0.31 | Conserved
SCO7325 anti-sigma factor antagonist | −0.37 | −0.68 | −0.23 | 1.15 |
SCO7341 putative RNA polymerase secondary sigma factor | −0.09 | 0.54 | 0.29 | 0.44 | Conserved
SCO7573 putative anti-sigma factor antagonist | −0.18 | 0.03 | 1.69 | 0.21 |
SCO7619 putative anti sigma factor antagonist | −0.28 | 0.37 | 0.97 | 0.64 |
SCO7754 putative anti-sigma factor antagonist | 1.20 | 0.51 | 1.89 | 0.02 |
Mean hybridization score for ribosomal protein genes | 0.01 | 0.06 | 0.10 | 0.05 | NA

Bold values indicate that the signal for that gene is more than 2SD below the mean core signal for that species and such a value is suggestive of either gene absence or very low similarity. A conserved gene is one that seems to be present in all four species. NA, Not applicable

All four species studied here undergo differentiation and spore formation and as such would be expected to retain most genes involved in cell division, sporulation and differentiation. This is supported by Table 5. K. aureofaciens shows greater gene divergence for certain genes when compared to the three Streptomyces species; these are specifically ftsI (SCO2090) and a putative cell division protein (SCO2968). However, in general, the same genes show higher divergence in all four species, for example sapA, which encodes a protein associated with spore surface hydrophobicity. As spore morphology varies considerably in Streptomyces, high variability or gene loss in such a gene is not unexpected. Other genes that show higher divergence are those involved in partitioning and cell division. This suggests that the genes, and thus the proteins, involved in these functions may differ from species to species in order to create the variation seen in aerial mycelium and spore structure across Streptomyces species. Specifically, SCO3934, an ftsK family protein gene, is less well conserved than its homologue. This suggests that SCO5750 may produce the major FtsK protein. Other Fts proteins show a similar pattern with at least one homologue being well conserved. This may help in understanding the relationships between the genes involved in cell division and will allow better identification of specific targets for further study. One anomaly that stands out is bldB. This gene consistently shows a low level of hybridization. A comparison of the bldB gene sequence between S. coelicolor and S. avermitilis shows a nucleotide identity of about 87%, which ought to give a signal in the region of 0.0 or better. As two different arrays are used in this study, mechanical problems with this spot can probably be eliminated as the source of the anomaly. We suggest that because this is a relatively small gene, the PCR product chosen for both arrays may be the reason for this result. This emphasizes that array data should be used with a degree of caution and need to be backed up by other experimental evidence when specific genes are being investigated.
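A nucleotide identity figure such as the 87% quoted for bldB can be computed from a pairwise alignment along the following lines. This is a minimal sketch, not the procedure used in the study: the function name `percent_identity` is hypothetical, the convention of skipping gapped columns is an assumption (other conventions count gaps as mismatches), and the sequences are toy strings rather than real bldB sequences.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over the ungapped columns of a pairwise alignment.

    Both inputs must already be aligned (equal length, '-' for gaps).
    Columns with a gap in either sequence are skipped, which is one of
    several common conventions and an assumption of this sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip gap columns
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


# Toy aligned sequences (illustrative only): one mismatch in ten columns.
seq_1 = "ATGGCCGATC"
seq_2 = "ATGGCGGATC"
print(f"{percent_identity(seq_1, seq_2):.1f}% identity")
```

For a short gene, a handful of mismatches or a poorly placed PCR product covers a large fraction of the probe, which is consistent with the caution expressed above about interpreting a single low signal.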
Table 5

Conservation across the four species of genes in Streptomyces coelicolor annotated as involved in cell division, sporulation and differentiation

Gene | S. avermitilis | S. cattleya | S. maritimus | K. aureofaciens
SCO0409 sapA spore-associated protein precursor | 1.99 | 1.39 | 0.59 | 0.67
SCO1454 putative amino oxidase | 1.00 | 0.45 | 1.18 | −0.17
SCO1489 bldD putative DNA binding protein | 0.99 | 0.76 | 1.02 | 0.58
SCO1772 putative partitioning or sporulation protein | 0.69 | 0.35 | 0.39 | 0.54
SCO2082 ftsZ cell division protein | 1.44 | 0.89 | 0.97 | 0.95
SCO2083 ftsQ sporulation protein | 0.32 | 0.74 | 0.06 | 0.26
SCO2084 murG | 0.86 | −0.01 | 0.49 | 0.32
SCO2085 ftsW putative cell division protein | 0.82 | 0.81 | 0.59 | 0.50
SCO2086 murD | 0.58 | 0.23 | 0.35 | 0.45
SCO2087 murX | 0.41 | 0.05 | 0.51 | −0.30
SCO2088 murF | 1.18 | 0.73 | 0.67 | 0.01
SCO2089 murE | 0.73 | 0.47 | 0.36 | 0.31
SCO2090 ftsI cell division protein | 0.80 | 0.01 | 0.45 | −0.50
SCO2607 Sfr protein | 0.73 | 0.73 | 0.91 | −0.01
SCO2608 penicillin binding protein | −0.04 | −0.25 | 0.45 | −0.92
SCO2609 mreD rod shape-determining protein | 0.09 | 0.07 | 0.66 | 0.43
SCO2610 mreC rod shape-determining protein | 0.42 | −0.16 | 0.17 | 0.52
SCO2611 mreB rod shape-determining protein | 0.98 | 0.80 | 0.68 | 0.19
SCO2620 putative cell division trigger factor | 0.81 | −0.12 | 0.50 | 0.25
SCO2968 putative cell division protein | 0.35 | −0.17 | 0.34 | −0.39
SCO2969 ftsE cell division ATP-binding protein | 0.37 | 0.43 | 0.00 | 0.51
SCO3034 whiB sporulation regulatory protein | 0.23 | 0.50 | 0.81 | 0.26
SCO3323 bldB putative RNA polymerase sigma factor | 0.76 | 0.49 | 1.08 | 0.27
SCO3404 ftsH2 cell division protein ftsH homolog | 1.11 | 0.51 | 1.07 | 0.15
SCO3549 bldG putative anti-sigma factor antagonist | −0.03 | −0.16 | 0.21 | −0.20
SCO3557 putative septum site determining protein | 0.31 | 0.45 | −0.19 | 0.92
SCO3558 putative morphological differentiation-associated protein | 0.69 | −0.26 | 1.62 | −0.20
SCO3846 putative FtsW/RodA/SpoVE family cell cycle protein | 1.11 | 0.31 | 1.07 | 0.61
SCO3886 putative partitioning or sporulation protein | 0.00 | 0.83 | 0.43 | 0.98
SCO3887 putative partitioning or sporulation protein | −0.19 | −0.19 | 0.24 | 1.04
SCO3934 ftsK/spoIIIE family protein | 0.56 | 0.39 | 1.18 | 0.53
SCO4014 sporulation associated protein | 0.87 | 0.93 | 0.81 | 1.17
SCO4184 mfC aerial mycelium formation | 0.12 | 0.17 | 0.01 | −0.05
SCO4508 putative cell division-related protein | 0.62 | −0.27 | 0.07 | 0.59
SCO4531 putative septum determining protein | 0.56 | 0.88 | 0.12 | −0.09
SCO4620 traB1 putative sporulation-related protein | 0.49 | 0.68 | 0.19 | −0.34
SCO4621 traA1 putative sporulation-related protein | −0.03 | 0.51 | 1.33 | −0.38
SCO4767 putative regulatory protein | 0.14 | 0.00 | 1.67 | −0.01
SCO4768 bldM putative two-component regulator | 1.06 | 1.00 | 0.92 | 0.61
SCO5006 minD1 putative septum site-determining protein | −0.31 | −0.04 | 0.58 | −0.28
SCO5008 minD3 putative septum site-determining protein | 0.04 | −0.21 | −0.11 | −0.07
SCO5112 BldKA | 0.42 | 0.92 | 0.75 | 0.51
SCO5114 BldKC | 0.39 | −0.08 | 1.23 | −0.23
SCO5115 BldKD | 0.01 | −0.18 | 0.89 | −0.15
SCO5116 bldKE putative peptide transport system ATP-binding protein | −0.04 | −0.22 | 0.70 | −0.03
SCO5314 whiE protein VII | 1.24 | −0.23 | 0.03 | 0.67
SCO5315 polyketide cyclase | −0.39 | 0.82 | −0.31 | −0.25
SCO5316 acyl carrier protein | 0.42 | −0.29 | 0.75 | 0.03
SCO5318 polyketide beta-ketoacyl synthase alpha | −0.03 | −0.13 | 0.86 | 0.43
SCO5321 polyketide hydroxylase | 0.12 | −0.09 | 1.50 | 0.27
SCO5587 ftsH cell division protein FtsH homolog | −0.05 | 0.31 | 0.23 | 0.21
SCO5621 whiG RNA polymerase sigma factor WhiG | 0.79 | 0.92 | 0.64 | −0.27
SCO5723 bldB putative regulator, BldB | 1.47 | 0.76 | 1.27 | 1.05
SCO5750 ftsK homolog | 0.67 | 0.16 | 0.65 | 2.52
SCO5819 whiH, sporulation transcription factor | 0.68 | 0.12 | 0.16 | 0.13
SCO6029 whiI two-component regulator | 0.14 | −0.26 | 0.77 | 0.92
Mean hybridization score for ribosomal protein genes | 0.05 | −0.05 | 0.27 | −0.02

Bold values indicate that the signal for that gene is more than 2SD below the mean core signal for that species and such a value is suggestive of either gene absence or very low similarity

The genes involved in DNA replication, repair and restriction/modification are shown in Table 6 and only about 20% of these genes are not conserved relatively well across all four species. This is to be expected as DNA replication and repair are core functions. Most of the genes that show higher levels of gene divergence are found in the terminal regions of the linear chromosome and probably perform functions that are not essential to cell survival, because the terminal regions of Streptomyces chromosomes are unstable and liable to deletion without lethality. Of particular interest are SCO0183 and SCO0842 (deoxyribodipyrimidine photolyases); this repair system would seem to be absent in S. lividans and S. maritimus, but a homologue is present in S. avermitilis (confirmed by the genome sequence) and in S. cattleya. This confirms the high variability found for this repair function across the Streptomyces (Kobayashi et al. 1989). A similar situation of high variability is found for the mutT homologues, potential 8-hydroxy-dGTP hydrolases. Knockout of this gene has been shown to increase the A:T to G:T mutation rate and thus it has a possible repair function (Kamiya et al. 2004). The genes for recA (SCO5769), recF (SCO3876) and recR (SCO3618) are present in all four species; however, recX (SCO5770) is more divergent and gives a low signal for S. cattleya and S. maritimus. SCO6405, a putative DNA recombinase, is scored as absent in all four species, suggesting that there is redundancy in the Streptomyces genes concerned with recombination or that this gene is transposon related. The latter is supported by low homology to S. avermitilis putative integrases/recombinases. There are four genes encoding DNA gyrases on the microarray, namely, gyrA DNA gyrase subunit A (SCO3873) and gyrB DNA gyrase subunit B (SCO3874) together with SCO5836 and SCO5822, and the latter two may be TopIV homologues involved in resolving chromosome concatenates. All are conserved, although the conservation of SCO5822 gyrB2 is lower. Thus both sets of gyrase genes would seem to be important. As expected, SCO1518, a ruvB Holliday junction protein gene, and SCO1520, a ruvC crossover junction endonuclease, are conserved across all the species. Unexpectedly, although probably present in all species, SCO1519 ruvA is much more divergent than the other two genes in this Holliday junction complex. This diversity is not easily explicable except by the possibility that recombination in Streptomyces may occur via a more variable mechanism than in other groups of bacteria, which is then reflected in the greater divergence of SCO1519 ruvA. All three genes annotated as a DNA polymerase I homologue are conserved, as are four out of the five DNA polymerase III homologues, suggesting that there are roles for all of these conserved genes in Streptomyces. Two other unclassified DNA polymerase type genes, SCO4495 and SCO6084, are also conserved and thus may have important functions. There is, however, more diversity among the helicases and methylases/methyltransferases. With the helicases, three out of 14 show significant divergence and therefore most of the helicases probably have important cellular roles. Four out of nine methylases/methyltransferases show divergence. As some of these genes may be involved in the DNA modification part of restriction/modification, such diversity across strains is not unexpected. Finally, four out of six ligases show divergence, perhaps reflecting the fact that a number of these ligases might originate from bacteriophages.
Table 6

Conservation across the four species of genes in Streptomyces coelicolor annotated as involved in DNA replication, repair, restriction and modification

Gene | S. avermitilis | S. cattleya | S. maritimus | K. aureofaciens
SCO0183 putative deoxyribodipyrimidine photolyase | 1.38331 | 0.76404 | 1.15786 | 0.78973
SCO0760 putative methyltransferase | −0.23598 | −0.17603 | 0.079613 | −0.16712
SCO0842 putative deoxyribodipyrimidine photolyase | 0.002475 | 0.167429 | −0.05198 | 0.35453
SCO0918 putative excinuclease ABC subunit A | −0.28707 | −0.34813 | 0.010852 | −0.46861
SCO0945 putative formamidopyrimidine-DNA glycosylase | 0.34857 | 0.60698 | 0.4371 | −0.38083
SCO1040 putative DNA repair protein | 0.047315 | 0.568723 | 0.456168 | 0.367071
SCO1050 putative DNA protection protein | −0.47479 | 0.093299 | 0.451487 | 0.92697
SCO1114 uracil DNA glycosylase | 0.34573 | 0.357182 | 0.433902 | 0.950894
SCO1167 putative helicase (fragment) | 0.604813 | 0.072648 | −0.40455 | −0.25074
SCO1180 putative DNA polymerase III beta chain | 0.347 | −0.21511 | 0.42838 | −0.20922
SCO1202 putative DNA ligase | 0.308497 | 0.202188 | −0.15524 | 0.096586
SCO1203 putative MutT-like protein | −0.17791 | −0.3705 | 0.324233 | −0.28871
SCO1255 G/U mismatch-specific DNA glycosylase | 0.521679 | 0.41904 | 0.321208 | 0.429148
SCO1343 uracil-DNA glycosylase | 0.61294 | 0.00339 | 0.311478 | 0.036821
SCO1380 putative DNA damage inducible protein | 0.763064 | 0.313948 | 0.214931 | 0.677262
SCO1395 mutT-like protein | 0.068215 | 0.572527 | 0.17071 | 0.414092
SCO1475 putative primosomal protein | 0.048892 | 0.689232 | 0.53545 | 0.457655
SCO1518 ruvB holliday junction DNA helicase | 1.136489 | 0.638803 | 0.930067 | 1.045593
SCO1519 ruvA holliday junction DNA helicase | 0.57721 | −0.23275 | −0.15374 | 0.008274
SCO1520 ruvC crossover junction endodeoxyribonuclease | 1.079786 | 0.708131 | 0.886363 | 0.979821
SCO1534 putative DNA polymerase III | 0.3296 | 0.371242 | −0.10025 | 0.090739
SCO1739 putative DNA polymerase III | 1.128049 | 0.423158 | 1.111964 | 0.326701
SCO1780 putative DNA repair protein | −0.14578 | −0.09175 | 0.258713 | −0.32746
SCO1792 putative 3-methyladenine DNA glycosylase | −0.20485 | −0.07755 | 0.083499 | −0.2824
SCO1827 putative DNA polymerase III | 0.559715 | 0.711265 | 0.512873 | 0.343849
SCO1966 ABC excision nuclease subunit B | 0.047382 | 0.06896 | 1.085903 | −0.05934
SCO1969 putative DNA-methyltransferase | −0.19025 | −0.12184 | 0.143367 | 0.56545
SCO2003 DNA polymerase I | 1.172176 | 0.188493 | 0.498126 | 0.201411
SCO2468 DNA primase | 0.82715 | 0.520552 | 1.25613 | 0.69797
SCO2626 putative DNA repair hydrolase (fragment) | 0.289916 | 0.337365 | 0.567783 | 0.326821
SCO2863 putative helicase | 0.47935 | 0.68124 | 1.92881 | 0.81052
SCO2952 putative helicase protein | 0.513952 | 0.38864 | 0.750517 | 0.379556
SCO3109 putative transcriptional-repair coupling factor | 0.87255 | −0.29611 | 0.759684 | −0.33637
SCO3351 putative DNA repair protein | −0.95043 | 0.68521 | 0.24637 | 0.48466
SCO3352 putative DNA-binding protein | 0.090569 | −0.09602 | 0.536392 | −0.2275
SCO3434 putative DNA polymerase I | 0.8554 | 1.651768 | 0.874246 | 1.328528
SCO3510 putative DNA methylase | 0.402433 | 1.11393 | 2.15274 | 2.07607
SCO3541 putative DNA polymerase | 0.003597 | 0.40289 | 0.297076 | 0.75003
SCO3543 probable DNA topoisomerase I | 0.798705 | 1.33477 | 0.87884 | 1.226661
SCO3550 putative helicase | 0.263644 | 0.13935 | 0.559872 | 0.216127
SCO3618 putative recombination protein | 0.533404 | 0.052623 | 0.511469 | −0.36893
SCO3873 DNA gyrase subunit A | 1.293458 | −0.20851 | 0.985546 | 1.2494
SCO3874 DNA gyrase subunit B | 0.99333 | 0.958247 | 1.250669 | 0.37466
SCO3878 DNA polymerase III | 0.05375 | 0.026053 | 0.328502 | 0.62343
SCO3879 chromosomal replication initiator protein (fragment) | 1.25983 | 1.572531 | 0.731065 | −0.12482
SCO4092 ATP-dependent helicase | −0.00752 | 0.055194 | 0.978098 | −0.21188
SCO4143 putative mutT-like protein | 0.021035 | 0.435471 | 0.479161 | −0.28423
SCO4272 putative mutT-like protein | −0.1352 | 0.016206 | −0.32968 | 0.527274
SCO4351 putative DNA invertase | 1.26688 | 0.78144 | 1.29284 | 0.68489
SCO4495 putative DNA polymerase related protein | −0.09624 | 0.75937 | 0.582419 | 0.316259
SCO4577 putative helicase | 0.726962 | 0.053196 | 1.044155 | −0.09189
SCO4797 putative ATP-dependent DNA helicase II | 0.323366 | 0.286421 | 0.749868 | 0.026912
SCO5064 putative bifunctional protein | −0.20262 | 0.49259 | 1.90763 | −0.29433
SCO5143 DNA-3-methyladenine glycosylase I | −0.86594 | 0.342489 | 0.276611 | 0.981316
SCO5183 putative ATP-dependent DNA helicase | 0.235633 | 0.120765 | 0.767504 | 0.591935
SCO5184 putative ATP-dependent DNA helicase | 0.180638 | −0.17587 | 0.208839 | 0.666073
SCO5188 putative ATP-dependent DNA helicase | 0.204048 | 0.36005 | 0.808168 | −0.04087
SCO5331 putative DNA methylase | 1.52864 | 2.37483 | 3.76436 | 2.18209
SCO5494 putative DNA ligase | 0.124781 | 0.288429 | 0.215946 | −0.0149
SCO5566 putative ATP-dependent DNA helicase | 0.408995 | 1.186714 | 0.428183 | 0.825946
SCO5567 putative methylase | 0.338198 | 0.82207 | 0.596196 | 0.83241
SCO5573 formamidopyrimidine-DNA glycosylase | 0.387552 | −0.08353 | 0.601585 | 0.435209
SCO5760 DNA glycosylase | 0.846985 | 0.02235 | 0.675876 | 0.46121
SCO5770 RecX protein | −0.1193 | 0.72104 | 0.58075 | 0.079675
SCO5802 putative ATP-dependent helicase | 0.823429 | −0.10433 | 0.511595 | 0.058287
SCO5803 SOS regulatory protein LexA | 0.143322 | 0.78751 | −0.0905 | −0.16946
SCO5805 ribonucleotide reductase | 0.235182 | 0.271514 | 0.98906 | −0.06823
SCO5815 probable ATP-dependent DNA helicase | 0.52023 | −0.31138 | 0.75254 | 0.71096
SCO5822 gyrB2, probable DNA gyrase | 0.167275 | 0.223473 | 0.426849 | 0.393883
SCO5836 DNA gyrase-like protein | 0.725708 | 0.507498 | 0.937531 | 0.073608
SCO6084 putative DNA polymerase | −0.05381 | 0.165634 | 0.084564 | −0.16407
SCO6151 putative methylated-DNA-protein-cysteine methyltransferase | 0.88705 | −0.08112 | 0.375214 | 0.207725
SCO6262 putative helicase | 0.260624 | −0.28836 | 0.753508 | −0.08897
SCO6405 putative DNA recombinase | −0.16958 | 0.65562 | 1.15936 | −0.30768
SCO6462 putative methylated-DNA-protein-cysteine methyltransferase | −0.09961 | −0.00285 | 0.231836 | 0.026866
SCO6640 putative ATP-dependent helicase | 0.56659 | 0.52367 | 1.18266 | −0.38994
SCO6707 putative DNA ligase | −0.25409 | 0.618121 | −0.0252 | −0.3333
SCO6844 putative DNA methylase | 0.487806 | 0.475711 | 0.59387 | 0.230037
SCO6907 putative DNA ligase | 0.71491 | 0.181384 | 0.67564 | 0.69734
SCO7345 probable ATP-dependent DNA ligase | 0.176692 | 0.207606 | 0.340255 | 0.085159
SCO7522 putative DNA ligase | −0.31874 | −0.0533 | 0.63701 | 0.470458
Mean hybridization score | 0.113633 | 0.007441 | 0.153752 | −0.04491

Bold values indicate that the signal for that gene is more than 2SD below the mean core signal for that species and such a value is suggestive of either gene absence or very low similarity

Table 7 shows the genes involved in peptidoglycan and teichoic acid synthesis. In this area of metabolism, there is also a relatively high level of conservation of genes, particularly the murA, murA2, murB, murD, murE, murF, murG and murX genes. Also conserved are the shape-determining genes SCO2609, SCO2610 and SCO2611, which may form an operon. Together with the genes involved in biosynthesis of the cell wall, these probably represent a core of genes that are needed to give a basic structure to the cells of any Streptomyces species. The penicillin binding proteins show a higher degree of variability, except for SCO2897, SCO4013 and SCO5301. The peptidases SCO3580, SCO3596, SCO3011 and SCO4439 and the D-alanine:D-lactate ligase SCO3595 all show a low level of gene conservation, perhaps because they are involved in relatively broad cellular functions and are not under a great deal of selective pressure.
Table 7

Conservation across the four species of genes in Streptomyces coelicolor annotated as involved in peptidoglycan biosynthesis

Gene | S. avermitilis | S. cattleya | S. maritimus | K. aureofaciens
SCO0237 putative oxidoreductase | −0.07526 | 1.14854 | −0.09821 | 0.57347
SCO0286 putative peptidoglycan binding protein | 0.9459 | 0.93949 | 1.83967 | 1.02755
SCO0830 putative penicillin-binding protein | 0.243458 | 0.081341 | 0.556096 | 0.08585
SCO0936 putative oligosaccharide deacetylase | 0.78759 | 1.38608 | 0.58892 | 1.00821
SCO1018 putative isomerase | 0.390534 | 0.519204 | 0.392605 | 0.600019
SCO1875 putative secreted penicillin binding protein | 0.44831 | 0.039598 | 0.320356 | 0.256976
SCO2084 murG | 0.85602 | −0.00677 | 0.485989 | 0.319639
SCO2085 putative cell division protein | 0.816624 | 0.80742 | 0.592214 | 0.502565
SCO2086 murD | 0.580506 | 0.225348 | 0.347674 | 0.449973
SCO2087 murX | 0.405731 | 0.049645 | 0.509959 | −0.3047
SCO2088 murF | 1.175078 | 0.730925 | 0.667383 | 0.006107
SCO2089 murE | 0.734068 | 0.470869 | 0.363458 | 0.308295
SCO2345 putative peptidoglycan-binding membrane protein | −0.0329 | 0.016338 | 0.133355 | 0.069404
SCO2451 putative rod shape-determining protein | 0.604327 | 0.180028 | 0.612243 | 0.205981
SCO2589 putative glycosyl transferase | 0.093285 | 0.41217 | 0.478809 | −0.09528
SCO2590 putative glycosyltransferase | 0.069933 | 0.52938 | 1.18523 | 0.89326
SCO2608 penicillin binding protein | −0.0413 | −0.24681 | 0.445143 | 0.92429
SCO2609 rod shape-determining protein | 0.085879 | 0.06721 | 0.663609 | 0.427758
SCO2610 rod shape-determining protein | 0.41695 | −0.15798 | 0.168038 | 0.52083
SCO2611 rod shape-determining protein | 0.979884 | 0.796403 | 0.675204 | 0.186331
SCO2706 putative transferase | 0.233423 | 0.505652 | 0.55544 | 0.407847
SCO2707 putative transferase | 0.014126 | 0.37225 | 0.088105 | −0.12772
SCO2897 probable penicillin-binding protein | 0.576646 | 0.409388 | 0.693237 | 0.120551
SCO2949 murA | 0.418363 | 0.209024 | 0.335159 | 0.062475
SCO3580 putative transpeptidase | −0.09884 | 0.76666 | 1.13347 | 0.212152
SCO3595 putative D-alanine:D-lactate ligase | 0.77707 | 1.21402 | 1.61266 | 1.31715
SCO3596 putative D-alanine:D-alanine dipeptidase | 1.47294 | 0.65926 | 1.4814 | 0.85496
SCO3811 putative D-alanyl-D-alanine carboxypeptidase | 0.50714 | 0.58617 | 0.155077 | 0.78413
SCO3847 putative penicillin-binding protein | 0.128622 | 0.43643 | 0.640849 | −0.18456
SCO3901 putative penicillin-binding protein | 0.59684 | 0.78306 | −0.26692 | −0.39905
SCO4013 putative penicillin binding protein | −0.09095 | −0.01844 | −0.04797 | 0.164779
SCO4132 putative secreted transglycosylase | 0.237226 | −0.17562 | −0.03522 | −0.2261
SCO4439 putative D-alanyl-D-alanine carboxypeptidase | 0.74865 | 0.93049 | 0.177585 | 1.21329
SCO4643 murB | −0.01505 | −0.20659 | 0.197283 | −0.35551
SCO5039 putative penicillin-binding protein | 0.590106 | −0.70926 | 0.891819 | 0.313533
SCO5301 putative penicillin-binding protein | −0.19347 | −0.13131 | 0.547233 | −0.22028
SCO5365 putative transferase | 1.11236 | −0.27334 | 0.569152 | 1.16844
SCO5467 muramoyl-pentapeptide carboxypeptidase | −0.15448 | 0.387477 | −0.49044 | −0.02243
SCO5560 D-alanine-D-alanine ligase | 0.728112 | 0.209602 | 0.167769 | 0.330255
SCO5998 murA2 | 0.800726 | 0.792975 | 1.291368 | 0.051102
SCO6060 putative UDP-N-acetylmuramoyl-L-alanine ligase | 0.19941 | 0.377004 | 0.817329 | 0.527531
SCO7050 putative D-alanyl-D-alanine carboxypeptidase | 0.605949 | 0.70203 | 0.343827 | 0.524521
Mean hybridization score | 0.125637 | −0.10744 | 0.118866 | −0.14494

Bold values indicate that the signal for that gene is more than 2SD below the mean core signal for that species and such a value is suggestive of either gene absence or very low similarity
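The criterion in this footnote can be illustrated with a minimal Python sketch that flags genes whose signal lies more than 2 SD below the mean core signal for one species. The function name, the choice of the sample standard deviation and the toy numbers are assumptions made for illustration only; they are not taken from the study's analysis pipeline or data.

```python
from statistics import mean, stdev


def flag_low_signals(core_signals, gene_signals, n_sd=2.0):
    """Return the cutoff and the genes whose signal falls below it.

    core_signals: hybridization scores of core-region genes for one species.
    gene_signals: mapping of gene name to hybridization score for that species.
    The cutoff is mean(core) - n_sd * SD(core); the sample SD is an assumption.
    """
    cutoff = mean(core_signals) - n_sd * stdev(core_signals)
    flagged = {gene: s for gene, s in gene_signals.items() if s < cutoff}
    return cutoff, flagged


# Toy values for illustration only (hypothetical, not from the paper).
toy_core = [0.4, 0.6, 0.5, 0.3, 0.7]
toy_genes = {"geneA": 0.45, "geneB": -0.9, "geneC": 0.1}

cutoff, flagged = flag_low_signals(toy_core, toy_genes)
print(f"cutoff = {cutoff:.3f}")
print("flagged:", sorted(flagged))
```

A flagged gene would correspond to a bold value in the tables, i.e. a candidate for gene absence or very low similarity rather than proof of either.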


Conserved genes with no known function

Genes with no known function and no homologue outside of S. avermitilis that are conserved across the other three species analyzed are likely to be important specifically for the lifestyle of a myceliate actinobacterium, and the phenotypes of knockout strains for these genes will be particularly interesting in terms of Streptomyces biology. Based on the dataset here, 936 genes annotated as either conserved hypothetical or non-conserved hypothetical genes can be identified; these are shown in Supplementary Table 1. The proportion of these genes that are conserved across all four species is 9%, 20%, 13%, 16% and 12% for the left terminal region, left Streptomyces specific region, core region, right Streptomyces specific region and right terminal region, respectively. There is also a low frequency of conserved hypothetical genes in the left terminal region and right Streptomyces specific region (0.78% and 0.96%, respectively), compared to 3.4% for the left Streptomyces specific region, 1.80% for the core region and 2.11% for the right terminal region. These genes clearly need to be screened further by increasing the range of Streptomyces species analyzed by microarray hybridization. This would reduce the candidates to a manageable number and allow prioritization of genes for knockout and detailed phenotypic analysis.
Another approach to identifying functionally important genes is to pinpoint groups of such genes that may form a transcriptional unit. Blocks of three or more hypothetical genes that are conserved across all species were identified and are shown in Table 8. It is possible that these groups represent conserved functional groups of genes essential to core functions that make Streptomyces different from other bacteria. They are found mostly in the area between the Streptomyces terminal regions and the central core region.
There are seven groups of conserved hypothetical genes larger than five genes (SCO1407–SCO1413, SCO2362–SCO2370, SCO2911–SCO2919, SCO3846–SCO3854, SCO5536–SCO5543, SCO5762–SCO5767 and SCO6522–SCO6528). Given the proximity of various genes around SCO3846–SCO3854, it is likely that this complex is involved in cell division, development and DNA partitioning. The function of the other groups is unknown. Interestingly, none of these gene groups is upregulated on the shift from exponential phase to stationary phase or under stress, as indicated by Karoonuthaisiri et al. (2005).
Table 8

Hypothetical genes in S. coelicolor conserved as a group in the four species analyzed

Genes (SCO) | Operon structure (a) | Linked function, if any (b)
0614, 0616, 0617, 0618 | None |
1317, 1318, 1319, 1320 | None |
1521, 1522, 1523, 1524 | Possible operon | Recombination
1634, 1635, 1636 | Possible operon |
1650, 1651, 1652, 1653 | Possible operon | Proteasome
1788, 1789, 1790, 1791, 1794, 1795, 1796 | Possible operon | Both flanks of rRNA gene homologues
2030, 2031, 2032 | Possible operon |
2124, 2125, 2127, 2129, 2130 | Possible operon | Glucose kinase
2268, 2269, 2270 | Possible operon | Close to heme oxygenase
2913, 2915, 2916, 2917 | None |
3115, 3117, 3118, 3119 | None |
3150, 3151, 3152, 3153 | None |
3406, 3407, 3408 | Possible operon | Penicillin binding protein
3950, 3951, 3952 | Possible operon | Oxidoreductase
4028, 4029, 4030 | None |
4801, 4803, 4804, 4805 | None |
5307, 5308, 5309, 5310, 5312 | None |
5600, 5601, 5602, 5603, 5604 | Possible operon | Homology to Mycobacterium tuberculosis
5762, 5763, 5764, 5765 | Possible operon | DNA helicase
6413, 6415, 6416, 6417, 6419, 6420, 6421, 6422 | None |
6574, 6575, 6576, 6577, 6578, 6579, 6580 | Possible operon | Possible DNA binding protein
6671, 6672, 6674, 6675, 6676 | Possible operon |
7070, 7071, 7072 | None |

a Gene structure from Artemis v7 is compatible with an operon type structure with possible appropriate ribosome binding sites

b Inside or linked to the conserved genes is a gene(s) of known function
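The grouping of conserved hypothetical genes into blocks, as listed in Table 8, can be sketched in Python as follows. The text states only that blocks contain three or more conserved hypothetical genes; the function name and the `max_gap` parameter (the number of intervening genes tolerated within a block) are assumptions introduced here, chosen because several Table 8 rows skip one or two SCO numbers. The input uses four rows from Table 8 plus a made-up pair (9001, 9002) that does not correspond to real genes.

```python
def group_conserved_blocks(sco_numbers, min_size=3, max_gap=2):
    """Group SCO gene numbers into blocks of neighbouring genes.

    sco_numbers: numbers of hypothetical genes conserved in all species.
    min_size:    smallest block to report (three, as stated in the text).
    max_gap:     intervening genes tolerated inside a block (an assumption).
    """
    blocks, current = [], []
    for n in sorted(set(sco_numbers)):
        # Start a new block when too many genes are skipped since the last one.
        if current and n - current[-1] - 1 > max_gap:
            if len(current) >= min_size:
                blocks.append(current)
            current = []
        current.append(n)
    if len(current) >= min_size:
        blocks.append(current)
    return blocks


# Four rows from Table 8, plus a hypothetical isolated pair for illustration.
conserved = [614, 616, 617, 618, 1317, 1318, 1319, 1320,
             1634, 1635, 1636, 1788, 1789, 1790, 1791, 1794, 1795, 1796,
             9001, 9002]

blocks = group_conserved_blocks(conserved)
for block in blocks:
    print(", ".join(f"{n:04d}" for n in block))
```

With these settings the made-up pair is dropped because it has fewer than three members, and 1788 to 1796 stays one block despite the two skipped numbers. A stricter `max_gap=0` would split that row, so the tolerance chosen affects which groups are reported.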


Conservation of genes involved in secondary metabolism and similar functions

Genes involved in secondary metabolism and antibiotic production are widely distributed in Streptomyces, and many if not most may have been subject to horizontal transfer. However, genes involved in similar pathways share significant similarity, and thus significant cross-hybridization may occur between similar metabolic pathways. A large number of genes in the S. coelicolor genome are involved in secondary metabolism (165) and polyketide synthesis (102). These are grouped together in 23 pathway groups and are displayed in Supplementary Fig. 3. Genes identified as secondary metabolic genes but occurring on their own, rather than within a group of secondary metabolic genes, were eliminated to simplify the analysis, leaving only groups of two or more such genes together. These include specific pathways producing secondary metabolic products, such as the melanin, actinorhodin, CDA and Red pathways. Many of the other potential pathways have not been studied in detail and the functions of their genes are unknown. Because of evolutionary similarity, the presence of genes hybridizing to a particular pathway does not mean that the specific pathway is present, only that a related one may be present. Similarly, a high level of hybridization can mean either a very close relationship between the pathways in the two species or the presence of multiple copies of related pathways.
In general terms, S. maritimus shows the greatest absence of secondary metabolic pathways that are present in S. coelicolor. Interestingly, S. cattleya and K. aureofaciens seem to have pathways related to many of the S. coelicolor secondary metabolic pathways in their genomes, although they are phylogenetically more distant than S. maritimus. The actinorhodin pathway appears to be absent from S. avermitilis (as expected from the genome data), S. cattleya and S. maritimus, although some related genes do seem to be present in K. aureofaciens. The WhiE pathway is conserved in all species, but some genes show a very low level of hybridization in certain cases; these include whiE protein VII and the acyl carrier protein. Genes from the Red pathway show varying levels of hybridization, suggesting that distantly related pathways may be present in these species. The CDA pathway is conserved in all four species, and in certain cases the genes seem to be over-represented, suggesting multiple examples of the same type of pathway in S. cattleya and S. maritimus. The presence of similar pathways at a level of about 50% in K. aureofaciens supports the well-established idea that horizontal gene transfer of secondary metabolic pathways may have played a significant role in the evolution of Streptomyces and any related genus.
Because the natural environment of Streptomyces is the soil, they are thought to play an important role in the recycling of lignocellulose material. However, there is relatively little information on which genes are involved in this process. Interestingly, melC1 and melC2, which encode tyrosinase (monophenol monooxygenase, SCO2700) and its cofactor (SCO2701) (Leu et al. 1992), are conserved across the three Streptomyces species and probably also Kitasatospora (SCO2700 −0.76, SCO2701 0.08). On the other hand, the duplicate MelD1 (SCO2701) and MelD2 (SCO2700) genes found in S. coelicolor are not conserved and are phylogenetically distinct from MelC1 and MelC2 found in other Streptomyces (unpublished results). This perhaps represents a divergence of function between these two gene pairs. S. coelicolor does not produce a detectable amount of black melanin pigment, and these results suggest that these enzymes may be involved in the metabolic conversion of lignocellulose byproducts rather than pigment formation. Evolutionary conservation of these genes to serve this function under particular conditions of induction would make more sense than retention of inducible black pigment formation.
Other enzymes with a possible role in the lignocellulose cycle that are conserved across the species are shown in Table 9. These include many oxygenases that may have a role in producing oxygen radicals capable of attacking lignin, genes involved in sensing and breaking down hydrogen peroxide, cellulose metabolism genes, cellobiose metabolism genes, etc. Those found in the terminal regions may represent gene groups that are not conserved in a syntenic manner and are subject to horizontal gene transfer, while those within the core and intermediate regions may be part of the basic group of genes essential to Streptomyces in the soil environment. Lignocellulose degradation is a difficult topic to study in the Actinomycetales, and these candidate genes may therefore help to solve some of the associated problems.
Table 9

Genes conserved in the four Streptomyces species that are potentially involved in lignocellulose cycling

SCO0333 | Dioxygenase
SCO0560 | Catalase/Peroxidase
SCO0765 | Endoglucanase
SCO1187 | Cellulase
SCO1188 | Cellulose binding protein
SCO1338 | Monooxygenase
SCO1451 | Endoglucanase
SCO1923 | Dioxygenase
SCO2016 | Monooxygenase
SCO2267 | Heme oxygenase
SCO2700 | Tyrosinase (monophenol monooxygenase)
SCO2701 | Tyrosinase cofactor
SCO2783 | Monooxygenase
SCO2798 | Cellobiose hydrolase
SCO2838 | Endoglucanase
SCO3172 | Monooxygenase
SCO3236 | Oxygenase
SCO4416 | Monooxygenase
SCO4870 | Monooxygenase
SCO5033 | Hydrogen peroxide sensing regulator
SCO5293 | Oxygenase
SCO5390 | Alkanal monooxygenase
SCO5773 | Monooxygenase
SCO6545 | Cellulase
SCO7223 | Monooxygenase
SCO7637 | Endoglucanase

Note that the oxygenases included as possible enzymes that may be able to attack lignin are all as yet unclassified as to their real function. The core region is in bold


Conclusions

This study confirms that within the Streptomyces analyzed here there is conservation of a core set of genes in the middle of the linear S. coelicolor/S. avermitilis chromosome structure. This is associated with a much higher diversity of genes in the terminal regions of the linear chromosome. Linking these regions are two intermediate regions where there seems to be conservation of genus-specific genes and gene clusters. This study also identifies candidate genes that may be involved in terminal replication and other myceliate growth-related functions, based on a classification of genes into conserved and non-conserved groups. It also provides insights into which genes play a more significant role in the biochemical network of S. coelicolor, Streptomyces and myceliate Actinobacteria in general. Finally, the degree of gene conservation detected between the four species implies that the genome model of S. coelicolor may extend well beyond the borders of Streptomyces. It includes at least one Kitasatospora species; furthermore, a similar structure by microarray analysis has been found for Saccharomonospora viridis and Streptosporangium roseum, but not Streptomyces rimosus ATCC10970 (unpublished data). Thus, the microarray approach to genome content analysis and exploration of genome evolution may be fairly widely applicable in the various Actinomycete genera close to Streptomyces that undergo complex morphogenesis.
Electronic supplementary material: ESM (PDF 938 kb)
  44 in total

1.  Global analysis of growth phase responsive gene expression and regulation of antibiotic biosynthetic pathways in Streptomyces coelicolor using DNA microarrays.

Authors:  J Huang; C J Lih; K H Pan; S N Cohen
Journal:  Genes Dev       Date:  2001-12-01       Impact factor: 11.361

2.  The Stanford Microarray Database: data access and quality assessment tools.

Authors:  Jeremy Gollub; Catherine A Ball; Gail Binkley; Janos Demeter; David B Finkelstein; Joan M Hebert; Tina Hernandez-Boussard; Heng Jin; Miroslava Kaloper; John C Matese; Mark Schroeder; Patrick O Brown; David Botstein; Gavin Sherlock
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

Review 3.  Once the circle has been broken: dynamics and evolution of Streptomyces chromosomes.

Authors:  Carton W Chen; Chih-Hung Huang; Hsuan-Hsuan Lee; Hsiu-Hui Tsai; Ralph Kirby
Journal:  Trends Genet       Date:  2002-10       Impact factor: 11.639

Review 4.  Genome trees and the tree of life.

Authors:  Yuri I Wolf; Igor B Rogozin; Nick V Grishin; Eugene V Koonin
Journal:  Trends Genet       Date:  2002-09       Impact factor: 11.639

5.  Regional organization of gene expression in Streptomyces coelicolor.

Authors:  Nitsara Karoonuthaisiri; David Weaver; Jianqiang Huang; Stanley N Cohen; Camilla M Kao
Journal:  Gene       Date:  2005-06-20       Impact factor: 3.688

6.  Intraspecific variability of the terminal inverted repeats of the linear chromosome of Streptomyces ambofaciens.

Authors:  Frédéric Choulet; Alexandre Gallois; Bertrand Aigle; Sophie Mangenot; Claude Gerbaud; Chantal Truong; François-Xavier Francou; Frédéric Borges; Céline Fourrier; Michel Guérineau; Bernard Decaris; Valérie Barbe; Jean-Luc Pernodet; Pierre Leblond
Journal:  J Bacteriol       Date:  2006-09       Impact factor: 3.490

7.  Use of an open-reading frame-specific Campylobacter jejuni DNA microarray as a new genotyping tool for studying epidemiologically related isolates.

Authors:  Edward E Leonard; Tohru Takata; Martin J Blaser; Stanley Falkow; Lucy S Tompkins; Erin C Gaynor
Journal:  J Infect Dis       Date:  2003-01-29       Impact factor: 5.226

8.  Complete genome sequence and comparative analysis of the industrial microorganism Streptomyces avermitilis.

Authors:  Haruo Ikeda; Jun Ishikawa; Akiharu Hanamoto; Mayumi Shinose; Hisashi Kikuchi; Tadayoshi Shiba; Yoshiyuki Sakaki; Masahira Hattori; Satoshi Omura
Journal:  Nat Biotechnol       Date:  2003-04-14       Impact factor: 54.908

9.  Genomic comparison of Salmonella enterica serovars and Salmonella bongori by use of an S. enterica serovar typhimurium DNA microarray.

Authors:  Kaman Chan; Stephen Baker; Charles C Kim; Corrella S Detweiler; Gordon Dougan; Stanley Falkow
Journal:  J Bacteriol       Date:  2003-01       Impact factor: 3.490

10.  An experimental evaluation of a loop versus a reference design for two-channel microarrays.

Authors:  V Vinciotti; R Khanin; D D'Alimonte; X Liu; N Cattini; G Hotchkiss; G Bucca; O de Jesus; J Rasaiyaah; C P Smith; P Kellam; E Wit
Journal:  Bioinformatics       Date:  2004-09-16       Impact factor: 6.937

