David Gresham1, Maitreya J Dunham, David Botstein. 1. Lewis-Sigler Institute for Integrative Genomics, Department of Molecular Biology, Carl Icahn Laboratory, Princeton University, Princeton, New Jersey 08544, USA. dgresham@genomics.princeton.edu
Abstract
The rapid accumulation of complete genomic sequences offers the opportunity to carry out an analysis of inter- and intra-individual genome variation within a species on a routine basis. Sequencing whole genomes requires resources that are currently beyond those of a single laboratory and therefore it is not a practical approach for resequencing hundreds of individual genomes. DNA microarrays present an alternative way to study differences between closely related genomes. Advances in microarray-based approaches have enabled the main forms of genomic variation (amplifications, deletions, insertions, rearrangements and base-pair changes) to be detected using techniques that are readily performed in individual laboratories using simple experimental approaches.
The rapid accumulation of complete genomic sequences offers the opportunity to carry out an analysis of inter- and intra-individual genome variation within a species on a routine basis. Sequencing whole genomes requires resources that are currently beyond those of a single laboratory and therefore it is not a practical approach for resequencing hundreds of individual genomes. DNA microarrays present an alternative way to study differences between closely related genomes. Advances in microarray-based approaches have enabled the main forms of genomic variation (amplifications, deletions, insertions, rearrangements and base-pair changes) to be detected using techniques that are readily performed in individual laboratories using simple experimental approaches.
Since the early 1990s a large amount of effort has focused on determining the complete genomic DNA sequence of many diverse organisms. Remarkably, virtually all this sequencing has been done using a single method: chain termination sequencing using dideoxynucleosides[1], usually referred to as Sanger sequencing. From the determination of the first complete genome sequence of an organism, the bacteriophage φX174 (Ref. 2), to the completion of 95% of the human genome sequence[3,4] many technical advances in methodologies, automation and computing rapidly increased the rate at which DNA sequence was obtained[5]. The availability of genomic sequences has led to the development of many genome-scale analytical techniques that have greatly enriched modern biology — for example, techniques to measure global mRNA abundance[6,7], systematically knock out all genes[8], perturb their function[9] and generate comprehensive clone collections[10,11] — and together they constitute the new field of genomics.Studies of sequence variation in the same or similar species have many potential applications, ranging from understanding complex human diseases to analysing the products of experimental evolution. However, to realize fully the potential of this science, the task of characterizing genomes should be reduced to a routine procedure that can be done on hundreds of samples. To this end, the research and commercial community is accelerating towards new approaches to genome sequencing[12,13,14] that are increasingly less expensive, more rapid and efficient, and more widely available. Nevertheless, determining even modest numbers of complete genomic sequences is still a substantial undertaking, entailing equipment, infrastructure and running expenses beyond the resources of most individual laboratories, and the study of sequence variation through direct genome sequencing remains the province of a minority of biologists.The applications that are envisioned for cheap and rapid sequencing technology do not actually require repeated determination of entire genomic sequences. Methods that efficiently detect genomic differences, be they structural rearrangements, polymorphisms or mutations, often suffice to reduce the sequencing requirement to a tiny fraction of the genome, a capability that is routine in most modern biology laboratories. Several technologies that use hybridization to DNA microarrays are effective for detecting genomic variation in closely related samples. Thus, questions in which a researcher aims to compare normal and diseased tissues from the same individual or mutant and wild-type DNA from the same experimental organism can often be addressed by microarray-based experimental comparison as opposed to exhaustive sequencing of entire genomes.This Review is focused on the global characterization of differences between closely related genomes — an approach that is ideally suited to microarrays. We describe the various forms of genomic variation that can be detected using microarray-based approaches and discuss some of the important experimental considerations, ranging from experimental design to data analysis and visualization. We highlight the versatility of these approaches, their applicability to various questions and organisms, and briefly describe how global views of genomic diversity are revealing new biological insights.Hybridization technologyDNA microarrays are a collection of DNA probes that are arrayed on a solid support and are used to assay, through hybridization, the presence of complementary DNA that is present in a sample (see Boxes 1,2). The experimental conditions for annealing complementary strands of DNA was reported[15] within a decade of the determination of the structure of DNA, and it was quickly realized that in vitro hybridization of DNA presented a means for comparing genomes. One initial exemplar was the visualization of hybridization products between two entire bacteriophage genomes using electron microscopy[16]. The development of blotting techniques, which use labelled DNA probes for visualization, presaged the fabrication of synthetic nucleotides on a solid support[17,18]. Moreover, the effect of single mismatches on hybridization efficiency was soon appreciated and was used to detect mutations in bacteriophage[19] and human DNA[20] well before the advent of DNA microarrays.DNA microarrays are made either by chemically synthesizing DNA probes on a solid surface or by attaching pre-made DNA probes to a solid surface. Maksos and Southern[21] first demonstrated the synthesis of arrays of oligonucleotides on a solid support in situ. From these initial experiments, advances in technology and chemistry resulted in increasingly higher density oligonucleotide microarrays synthesized in situ using techniques such as photolithography[22] and ink-jet deposition[23]. Simultaneously, the development of printing techniques[24] allowed the robotic arraying of PCR products, pre-synthesized oligonucleotides, or genomic clones such as cDNA or BAC clones — often referred to as spotted microarrays.For genomic analysis a tiling array design is desirable, in which DNA probes are chosen from contiguous stretches of the genome. Whereas only short-oligonucleotide microarrays are appropriate for detecting sequence changes, all types of microarray can be used to detect structural variation. An important distinction is between microarrays that provide truly comprehensive coverage of the genome (whole genome) and those that provide partial coverage across the genome (genome scale). Until recently, whole-genome coverage using oligonucleotide arrays had only been available for small genomes such as those of viruses[25] or the human mitochondrion[26,27]. At present, whole-genome coverage of larger genomes can only be achieved using large probes such as BACs[28,29]. However, advances in engineering and chemistry, largely made by the commercial manufacturers of microarrays, have enabled the construction of increasingly dense oligonucleotide arrays with 105–106 probes per microarray. Thus, it is already possible to manufacture short-oligonucleotide microarrays that cover the entire (although relatively small) genomes of eukaryotic organisms such as Saccharomyces cerevisiae[30,31] and larger genomes such as Arabidopsis thaliana[32,33]. Using dozens of arrays, complete coverage of even mammalian genome sequences has been achieved quite recently[34].The availability of high-density microarrays has facilitated the development of rapid and comprehensive approaches to characterizing genomes. These methods are being applied to a myriad of questions — from explaining the genetic basis of phenotypic variation to describing the extent and nature of genomic diversity.Detection of structural variationStructural variation in the genome refers to microscopic and submicroscopic alterations of the genome and includes deletions and duplications, copy number variation (CNV), insertions, inversions and chromosomal translocations[35]. This broad class of variants constitute a diverse and pervasive source of variation with known functional consequences, including increased pathogenicity and antibiotic resistance of microorganisms[36], a range of humandevelopmental disorders[37] and association with humancancers[38].In contrast to targeted methods for detecting structural variation, such as fluorescent (FISH) and quantitative PCR, microarray-based approaches allow structural variation to be assessed across the entire genome in an unbiased manner. The approach that is used is known as array comparative genome hybridization (aCGH): here, microarrays consisting of BAC[39] or cDNA clones[40], PCR products[41] or oligonucleotides[42] (Box 1) are used to compare probe signal intensities within or between arrays, corresponding to quantitative differences in the amount of material hybridizing at a particular probe (Fig. 1). Recently, genome-scale approaches using microarrays have revealed the impressive extent and diversity of non-disease-related structural variation in the human genome[43,44], and there is increasing evidence that small-scale structural variation might be associated with a far greater number of developmental disorders than previously appreciated[45,46,47]. Similarly, in model organisms, copy number changes underlie adaptive events in experimental evolution studies[41,48], act as suppressor mutations[49] in yeast, and are an important and pervasive source of variation in worms[50] and mice[51]. Thus, structural variation is of broad importance across all species.
Figure 1
Identifying copy number variation in genomes using array comparative genome hybridization.
a | Resequencing microarrays are designed with short oligonucleotides in which every possible variant is represented at the central position of a probe (shown in coloured font). At least four probes are used to interrogate each nucleotide position (as shown here for two adjacent positions), but often eight or more are used to include both strands and other small insertions and deletions. The probe sequence that is exactly complementary to the sample will result in the greatest hybridization efficiency (indicated by a green letter) and thus a comparison among all probes can be used to determine the nucleotide sequence of the sample. The coloured boxes indicate the relative intensity of hybridization at each probe — yellow being the highest intensity. b | In the absence of resequencing arrays, hybridization of the sample to candidate sequence probes can be used. Mismatches resulting from mutations in sample DNA will result in a lower hybridization efficiency compared with hybridization to a sample with complete sequence complementarity. This approach has the advantage of requiring far fewer probes and is often sufficient to detect sequence variation. If a mismatch is inferred then small-scale sequencing is necessary to identify the variant nucleotide. c | The effect of a SNP on hybridization is related to its corresponding position in a probe. More central positions result in the greatest decrease, whereas SNPs positioned at the end of probes are much less likely to result in a significant decrease in hybridization. d | It is possible to use hybridization data obtained from a mutation detection array to compute a likelihood that a particular site is a sequence variant with respect to the reference genome. This approach facilitates the comparison of related individuals at the sequence level, allowing rapid scanning of the genome. The diagram illustrates the analysis of a drug-resistant mutant in the budding yeast, Saccharomyces cerevisiae. Candidate SNPs are identified by a positive log likelihood value. In this case, a small number of candidate SNPs are detected throughout the genome; one is shown here on chromosome V, which is representative of the entire 13 Mb of the yeast genome. The peak highlighted in the inset is shown at higher resolution in the main figure. A single signal in the CAN1 gene, which is known to confer resistance to the drug canavanine, was identified and subsequently verified using Sanger sequencing. AVT2, amino acid vacuolar transport 2; NPR2, nitrogen permease regulator 2. Images for parts c and d use data originally published in Ref. 31.
Detecting SNP variation using microarrays.
a | Resequencing microarrays are designed with short oligonucleotides in which every possible variant is represented at the central position of a probe (shown in coloured font). At least four probes are used to interrogate each nucleotide position (as shown here for two adjacent positions), but often eight or more are used to include both strands and other small insertions and deletions. The probe sequence that is exactly complementary to the sample will result in the greatest hybridization efficiency (indicated by a green letter) and thus a comparison among all probes can be used to determine the nucleotide sequence of the sample. The coloured boxes indicate the relative intensity of hybridization at each probe — yellow being the highest intensity. b | In the absence of resequencing arrays, hybridization of the sample to candidate sequence probes can be used. Mismatches resulting from mutations in sample DNA will result in a lower hybridization efficiency compared with hybridization to a sample with complete sequence complementarity. This approach has the advantage of requiring far fewer probes and is often sufficient to detect sequence variation. If a mismatch is inferred then small-scale sequencing is necessary to identify the variant nucleotide. c | The effect of a SNP on hybridization is related to its corresponding position in a probe. More central positions result in the greatest decrease, whereas SNPs positioned at the end of probes are much less likely to result in a significant decrease in hybridization. d | It is possible to use hybridization data obtained from a mutation detection array to compute a likelihood that a particular site is a sequence variant with respect to the reference genome. This approach facilitates the comparison of related individuals at the sequence level, allowing rapid scanning of the genome. The diagram illustrates the analysis of a drug-resistant mutant in the budding yeast, Saccharomyces cerevisiae. Candidate SNPs are identified by a positive log likelihood value. In this case, a small number of candidate SNPs are detected throughout the genome; one is shown here on chromosome V, which is representative of the entire 13 Mb of the yeast genome. The peak highlighted in the inset is shown at higher resolution in the main figure. A single signal in the CAN1 gene, which is known to confer resistance to the drug canavanine, was identified and subsequently verified using Sanger sequencing. AVT2, amino acid vacuolar transport 2; NPR2, nitrogen permease regulator 2. Images for parts c and d use data originally published in Ref. 31.The great advantage of resequencing arrays is that they not only detect variation, but they also allow the variant sequence to be inferred. Although probe densities of commercially prepared oligonucleotide arrays have increased markedly over the past 10 years and now contain in the order of 106 features per array, most eukaryotic genomes are in the order of 107–109 nucleotides. Thus, to cover the entire genome of a higher organism using resequencing arrays it is necessary to use many individual arrays, as was done for resequencing the genome of A. thaliana[33] and a significant subset of the human genome[34]. It is unclear what the upper limit is on the number of probes that can be placed on a microarray, but the rate of increase in probe density suggests that comprehensive resequencing arrays could eventually be available for larger genomes.In the absence of resequencing arrays for many organisms, including several important model organisms, microarrays that were designed for gene expression analysis have been used to identify candidate sequence changes. This approach entails the identification of sequence variation via the reduced hybridization signal — relative to some known non-polymorphic hybridization value — which results from mismatched DNA. Unlike resequencing arrays, these methods detect the positions of variation, often to high precision, but they do not provide the variant sequences. The first application of this approach was in S. cerevisiae using an Affymetrix array designed for probing gene expression[64]. This same approach has subsequently been used to probe sequence diversity in the genomes of A. thaliana[65] and mosquitoes[66]. Candidate sequence variants — referred to as single feature polymorphisms — are identified as those probes that differ in intensity by some criterion defined using simple parametric tests (Fig. 2b). These studies offered, for the first time, the ability to rapidly characterize sequence diversity on a genome scale in a single microarray experiment; they also suggested that whole-genome analysis should be feasible with increased coverage.Microarrays that provide complete coverage of the genome by tiling short probes immediately adjacent to one another provide a means of interrogating the entire genomic sequence for variation. Such an array design has been applied to the bacteria Helicobacter pylori[67] and Escherichia coli[68]. However, this design is not sufficient to reliably identify all sequence variation. This limitation can be attributed to the fact that the ability to detect sequence differences that correspond to the end of probes is poor (Fig. 2c).A microarray design that addresses this problem contains overlapping probes spaced at a regular interval such that every nucleotide is covered by more than one probe, and the nucleotide position falls at a different position within each of those probes. The redundancy in coverage of this array format provides multiple measurements of the effect of a SNP on hybridization. Using a microarray that provides coverage of the entire S. cerevisiae genome it has been possible to characterize the loss of signal that is due to a SNP by studying a genome with some 25,000 known SNPs versus a reference sequence[31]. In this study, the authors modelled the loss of signal in the presence of a SNP and found that just four parameters explained >70% of the variance. Thus, it is possible to estimate the expected decrease in intensity for every probe given a mutation at each nucleotide position, and thereby test an unknown sample for the likelihood that it contains polymorphic sites. This approach has proved to be a robust means of comparing strains of yeast that are either closely related — that is, when genomes differ by as little as 0.000,01%[31] (Fig. 2d) — or diverged[69].Following the detection of a likely SNP it is necessary to perform short-scale sequencing to identify the precise base change. This is usually feasible for a small number of mutations of interest. Even in the absence of precise sequence information it is often informative to examine the degree of sequence identity between different genomes to identify regions of high sequence identity and sequence divergence[69].The use of microarrays to assess sequence variation is inherently limited to the analysis of unique regions of the genome. Repetitive features and low-complexity DNA are generally avoided. In this respect, microarray-based mutation detection or resequencing is similar to new sequencing approaches that make use of millions of short sequence reads. One current advantage of a microarray-based approach over new sequencing approaches is the available throughput, although this is certain to change in the future. Moreover, although resequencing microarrays have been successfully applied to the detection of heterozygous mutations[70], it is yet to be demonstrated that a loss-of-signal approach can be reliably applied to this problem.Detecting insertion sequence variationMobile genetic elements provide one of the richest and most poorly understood sources of genetic variation. Transposons vary in copy number, type and location over virtually every genome, both between individuals and across species. In microorganisms, transposons are known to be associated with antibiotic resistance[71]. In humans, there is evidence that long interspersed nuclear element 1 (LINE1) retrotransposition is involved in generating neuronal diversity[72] and variation in gene expression as well as being a source of disease-causing mutation[73]. In Saccharomycesyeasts, the Ty families of retrotransposons show substantial variation across strain backgrounds and species[74,75], and novel transposon insertions can cause both adaptive[76,77] and detrimental[78] mutations. Retrotransposon proximity to genes can also modify gene expression and regulation[79]. Moreover, transposons are potentially a source of CNV as they are correlated with the breakpoints of genomic rearrangements from yeast[41,80] to humans[59].Although these repetitive sequences are biologically important, they are poorly covered by all but the highest quality sequencing approaches. For example, many of the several hundred gaps remaining in the Saccharomyces bayanus sequence correlate with retrotransposon sequences in S. cerevisiae (M. Cherry, personal communication). Because of the technical issues that are involved with sequencing and assembling these repetitive elements they are often excluded from sequencing projects. Thus, methods for mapping these elements have been developed that rely on identifying the unique sequences abutting common repetitive sequences.Global mapping of insertion sites is generally performed using a means of isolating the insertion element and its immediately neighbouring DNA. The DNA is then hybridized to a whole-genome array to identify its genomic location (Fig. 3). Sequence-specific isolation of insertion sites has been successfully demonstrated using oligonucleotide capture probes[75], or by annealing linkers to fragmented DNA and using PCR[81] by adapting a method that was developed to sequence the termini of genomic clones[82]. Thus far, analytical approaches to these data have simply entailed the identification of contiguous probes above some threshold value, although these methods are certain to evolve as the approaches mature.
Figure 3
Genome-wide mapping of loci by selective enrichment and detection using microarrays.
One way of mapping insertion sequence variation is to isolate the insertion element and its immediately neighbouring DNA. Specific regions of the genome are isolated using either a capture probe method (as illustrated in the diagram) or a PCR-based method. This approach is suited to mapping the location in the genome of mobile genetic elements, which are notoriously difficult to characterize using whole-genome sequencing approaches. As shown in the figure, DNA is fragmented in the first step (a). Two separate reactions selectively enrich for the 5′ and 3′ ends of the insertion sequence using sequence-specific capture probes (b). The 5′-enriched and the 3′-enriched fractions are labelled with different fluorophores (Cy5 and Cy3, respectively) and then hybridized to a microarray using a two-colour protocol (c). Finally, insertion sites are mapped on the basis of a transition (indicated by an arrow in d) from positive to negative log2 ratio, corresponding to sample that is enriched for DNA adjacent to the 5′ end of the insertion sequence and sample that is enriched for DNA adjacent to the 3′ end of the insertion sequence, respectively. The site of transition corresponds to the genomic location of the insertion sequence. Each red and blue bar represents a microarray probe in a contiguous region of the genome spanning ∼10 kb. The distance between each probe is ∼200 bp and therefore, in this case, the insertion site is detected between two probes corresponding to a mapping resolution of ∼200 bp.
Genome-wide mapping of loci by selective enrichment and detection using microarrays.
One way of mapping insertion sequence variation is to isolate the insertion element and its immediately neighbouring DNA. Specific regions of the genome are isolated using either a capture probe method (as illustrated in the diagram) or a PCR-based method. This approach is suited to mapping the location in the genome of mobile genetic elements, which are notoriously difficult to characterize using whole-genome sequencing approaches. As shown in the figure, DNA is fragmented in the first step (a). Two separate reactions selectively enrich for the 5′ and 3′ ends of the insertion sequence using sequence-specific capture probes (b). The 5′-enriched and the 3′-enriched fractions are labelled with different fluorophores (Cy5 and Cy3, respectively) and then hybridized to a microarray using a two-colour protocol (c). Finally, insertion sites are mapped on the basis of a transition (indicated by an arrow in d) from positive to negative log2 ratio, corresponding to sample that is enriched for DNA adjacent to the 5′ end of the insertion sequence and sample that is enriched for DNA adjacent to the 3′ end of the insertion sequence, respectively. The site of transition corresponds to the genomic location of the insertion sequence. Each red and blue bar represents a microarray probe in a contiguous region of the genome spanning ∼10 kb. The distance between each probe is ∼200 bp and therefore, in this case, the insertion site is detected between two probes corresponding to a mapping resolution of ∼200 bp.Transposons are also used extensively as large-scale mutagenesis tools, in model organisms ranging from bacteria[83] and yeast[84,85] to zebrafish[86] and mice[87]. Identifying the locations of experimentally mobilized transposons poses a similar challenge to identifying endogenous transposons. Isolation of the insertion sites of these artificial transposons in E. coli[83,88] and H. pylori[89] makes use of a T7 RNA polymerase promoter that is engineered into the transposon construct to selectively transcribe RNA from the region that is adjacent to the insertion site, which is then analysed using microarrays. This method is sufficiently quantitative that the relative abundance of mutants in a complex mixture can be followed over several rounds of genetic selection[88], allowing the simultaneous identification of enriched and depleted mutants.Most genome sequencing methods are not well suited to the problems posed by repetitive sequences. The shorter read lengths of next-generation sequencing methods exacerbate the problems of standard sequencing approaches with respect to coverage and assembly. Combining an enrichment method with sequencing rather than using microarrays presents one possibility. Conceptually, this approach is analogous to chromatin immunoprecipitation (ChIP), but rather than enriching loci using an antibody that is targeted to a DNA-bound protein of interest, an oligonucleotide would be used to enrich specific loci in the genome. As global ChIP has already been adapted to new sequencing platforms[90,91], transposon mapping could in principle also be done this way.Technical considerationsWhen dealing with millions of data points it is important to ensure that data quality and processing are managed effectively. To obtain robust data, experimental and analytical considerations need to be made at all stages of the protocol. A fundamental difference between experimental platforms is whether there is an internal control for each probe, which is achieved by co-hybridizing differentially labelled sample and reference DNA (a two-colour microarray), or whether only a single sample is analysed (a one-colour microarray) (Box 2). Recent comparative studies have indicated that two-colour and one-colour microarray experiments produce concordant results for gene expression analysis[92]. However, in contrast with gene expression experiments (in which data from multiple probes can be used to determine a gene expression value) comparative genomic analyses are more sensitive to spurious probe values.There are two arguments for the use of a co-hybridized reference in order to maximize the sensitivity of individual probe data. The first reason is the need for an internal control for probe quality. Microarray probes can be created in several different ways and the manufacturing of microarrays can result in significant variation in probe quality and quantity. This is particularly acute for the 'homemade' variety of microarrays, such as the BAC arrays used for aCGH, in which probe quality can vary greatly between microarrays. However, even leading microarray manufacturers frequently provide microarrays with variable probe quality[93]. The presence of an internal reference provides a means of controlling the variation that is due to probe quality because the measurement of interest is the relative binding efficiency between the reference (often a standard reference used over and over again) and sample. Therefore, this experimental design enables a ratiometric approach to data analysis rather than a reliance on absolute measurements between microarray experiments.The second experimental concern addressed by a ratiometric approach is geographic variation across the microarray due to the conditions in which the hybridization occurred. Most microarrays are mixed by placing the array in a rotating hybridization oven or some device designed to provide agitation of the reactants. Variation in mixing and the concomitant differential time of exposure to reagents across the microarray can result in variable hybridization efficiency across the array. This too is readily controlled by the presence of an internal reference. An additional requirement for minimizing geographic artefacts in microarray data is the randomization of probe locations on the microarray with respect to genome location. This simple adjustment in array design reduces the chance that experimental artefacts are conflated with biological significance.Several normalization methods that attempt to address inter- and intra-array variation for one-colour microarrays have been developed[94]. The use of a ratiometric approach greatly simplifies the task of extracting signal from noise because each measurement at every probe is internally controlled (Box 2). Nonetheless, two-colour experiments also have a potential limitation: they require the use of two different dyes, which can introduce a dye-specific bias. This effect can be mitigated either by repeating experiments with the dyes interchanged, by performing control experiments that explicitly study the effect of dye bias[95] or by addressing the effect using statistical approaches[96].A first step in data processing is normalizing data for comparison with or between microarrays. The methods that are used in both one-colour and two-colour microarray experiments usually assume that there is an equal quantity of DNA in all samples. However, this assumption is incorrect in the case of aneuploid samples and, to a lesser extent, samples with differing CNV content. Simple linear corrections have been applied in the case of yeast that carry extra chromosomes[97], but most methods for normalization do not account for this possibility thereby underestimating values for regions that differ between the two genomes. Similarly, normalization methods for the sequence-level comparison of genomes using short-oligonucleotide microarrays require approaches that do not assume that total hybridization across the array is always equal between samples.A key to validating any microarray approach is an assessment of false positive and false negative rates. This is challenging insofar as it requires knowledge of variation in another genome from the same species that can be used as a test case. For a small number of organisms, multiple sequenced genomes are available, which facilitates testing of a method against a gold standard. Where this is not possible methods must rely on comparisons between datasets.The value of visualizing and exploring genome-scale data cannot be understated. A number of software options exist for visualizing data (Table 1). These include visualization tools in programs such as MeV and Java TreeView. Modules in the Bioconductor package can also be used for visualizing aCGH data. In our experience the freeware application, Integrated Genome Browser, provides a great deal of flexibility in terms of the data types that can be imported and viewed and is versatile for viewing multiple diverse datasets simultaneously.
Table 1
Software for analyzing and visualizing microarray data for genome comparisons
Authors: Jason S Maydan; Stephane Flibotte; Mark L Edgley; Joanne Lau; Rebecca R Selzer; Todd A Richmond; Nathan J Pofahl; James H Thomas; Donald G Moerman Journal: Genome Res Date: 2007-01-31 Impact factor: 9.043
Authors: T R Hughes; M Mao; A R Jones; J Burchard; M J Marton; K W Shannon; S M Lefkowitz; M Ziman; J M Schelter; M R Meyer; S Kobayashi; C Davis; H Dai; Y D He; S B Stephaniants; G Cavet; W L Walker; A West; E Coffey; D D Shoemaker; R Stoughton; A P Blanchard; S H Friend; P S Linsley Journal: Nat Biotechnol Date: 2001-04 Impact factor: 54.908
Authors: Anirban Maitra; Yoram Cohen; Susannah E D Gillespie; Elizabeth Mambo; Noriyoshi Fukushima; Mohammad O Hoque; Nila Shah; Michael Goggins; Joseph Califano; David Sidransky; Aravinda Chakravarti Journal: Genome Res Date: 2004-05 Impact factor: 9.043
Authors: Laura J Blair; Bryce A Nordhues; Shannon E Hill; K Matthew Scaglione; John C O'Leary; Sarah N Fontaine; Leonid Breydo; Bo Zhang; Pengfei Li; Li Wang; Carl Cotman; Henry L Paulson; Martin Muschol; Vladimir N Uversky; Torsten Klengel; Elisabeth B Binder; Rakez Kayed; Todd E Golde; Nicole Berchtold; Chad A Dickey Journal: J Clin Invest Date: 2013-09-03 Impact factor: 14.808
Authors: Martin R Jones; Shu Yi Chua; Nigel J O'Neil; Robert C Johnsen; Ann M Rose; David L Baillie Journal: Mol Genet Genomics Date: 2009-03-29 Impact factor: 3.291
Authors: David Gresham; Bo Curry; Alexandra Ward; D Benjamin Gordon; Leonardo Brizuela; Leonid Kruglyak; David Botstein Journal: Proc Natl Acad Sci U S A Date: 2010-01-08 Impact factor: 11.205