Literature DB >> 11932250

BLAT--the BLAST-like alignment tool.

W James Kent1.   

Abstract

Analyzing vertebrate genomes requires rapid mRNA/DNA and cross-species protein alignments. A new tool, BLAT, is more accurate and 500 times faster than popular existing tools for mRNA/DNA alignments and 50 times faster for protein alignments at sensitivity settings typically used when comparing vertebrate sequences. BLAT's speed stems from an index of all nonoverlapping K-mers in the genome. This index fits inside the RAM of inexpensive computers, and need only be computed once for each genome assembly. BLAT has several major stages. It uses the index to find regions in the genome likely to be homologous to the query sequence. It performs an alignment between homologous regions. It stitches together these aligned regions (often exons) into larger alignments (typically genes). Finally, BLAT revisits small internal exons possibly missed at the first stage and adjusts large gap boundaries that have canonical splice sites where feasible. This paper describes how BLAT was optimized. Effects on speed and sensitivity are explored for various K-mer sizes, mismatch schemes, and number of required index matches. BLAT is compared with other alignment programs on various test sets and then used in several genome-wide applications. http://genome.ucsc.edu hosts a web-based BLAT server for the human genome.

Entities:  

Mesh:

Substances:

Year:  2002        PMID: 11932250      PMCID: PMC187518          DOI: 10.1101/gr.229202

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  20 in total

1.  The DNA sequence of human chromosome 22.

Authors:  I Dunham; N Shimizu; B A Roe; S Chissoe; A R Hunt; J E Collins; R Bruskiewich; D M Beare; M Clamp; L J Smink; R Ainscough; J P Almeida; A Babbage; C Bagguley; J Bailey; K Barlow; K N Bates; O Beasley; C P Bird; S Blakey; A M Bridgeman; D Buck; J Burgess; W D Burrill; K P O'Brien
Journal:  Nature       Date:  1999-12-02       Impact factor: 49.962

2.  Homology-based gene structure prediction: simplified matching algorithm using a translated codon (tron) and improved accuracy by allowing for long gaps.

Authors:  O Gotoh
Journal:  Bioinformatics       Date:  2000-03       Impact factor: 6.937

3.  Initial sequencing and analysis of the human genome.

Authors:  E S Lander; L M Linton; B Birren; C Nusbaum; M C Zody; J Baldwin; K Devon; K Dewar; M Doyle; W FitzHugh; R Funke; D Gage; K Harris; A Heaford; J Howland; L Kann; J Lehoczky; R LeVine; P McEwan; K McKernan; J Meldrim; J P Mesirov; C Miranda; W Morris; J Naylor; C Raymond; M Rosetti; R Santos; A Sheridan; C Sougnez; Y Stange-Thomann; N Stojanovic; A Subramanian; D Wyman; J Rogers; J Sulston; R Ainscough; S Beck; D Bentley; J Burton; C Clee; N Carter; A Coulson; R Deadman; P Deloukas; A Dunham; I Dunham; R Durbin; L French; D Grafham; S Gregory; T Hubbard; S Humphray; A Hunt; M Jones; C Lloyd; A McMurray; L Matthews; S Mercer; S Milne; J C Mullikin; A Mungall; R Plumb; M Ross; R Shownkeen; S Sims; R H Waterston; R K Wilson; L W Hillier; J D McPherson; M A Marra; E R Mardis; L A Fulton; A T Chinwalla; K H Pepin; W R Gish; S L Chissoe; M C Wendl; K D Delehaunty; T L Miner; A Delehaunty; J B Kramer; L L Cook; R S Fulton; D L Johnson; P J Minx; S W Clifton; T Hawkins; E Branscomb; P Predki; P Richardson; S Wenning; T Slezak; N Doggett; J F Cheng; A Olsen; S Lucas; C Elkin; E Uberbacher; M Frazier; R A Gibbs; D M Muzny; S E Scherer; J B Bouck; E J Sodergren; K C Worley; C M Rives; J H Gorrell; M L Metzker; S L Naylor; R S Kucherlapati; D L Nelson; G M Weinstock; Y Sakaki; A Fujiyama; M Hattori; T Yada; A Toyoda; T Itoh; C Kawagoe; H Watanabe; Y Totoki; T Taylor; J Weissenbach; R Heilig; W Saurin; F Artiguenave; P Brottier; T Bruls; E Pelletier; C Robert; P Wincker; D R Smith; L Doucette-Stamm; M Rubenfield; K Weinstock; H M Lee; J Dubois; A Rosenthal; M Platzer; G Nyakatura; S Taudien; A Rump; H Yang; J Yu; J Wang; G Huang; J Gu; L Hood; L Rowen; A Madan; S Qin; R W Davis; N A Federspiel; A P Abola; M J Proctor; R M Myers; J Schmutz; M Dickson; J Grimwood; D R Cox; M V Olson; R Kaul; C Raymond; N Shimizu; K Kawasaki; S Minoshima; G A Evans; M Athanasiou; R Schultz; B A Roe; F Chen; H Pan; J Ramser; H Lehrach; R Reinhardt; W R McCombie; M de la Bastide; N Dedhia; H Blöcker; K Hornischer; G Nordsiek; R Agarwala; L Aravind; J A Bailey; A Bateman; S Batzoglou; E Birney; P Bork; D G Brown; C B Burge; L Cerutti; H C Chen; D Church; M Clamp; R R Copley; T Doerks; S R Eddy; E E Eichler; T S Furey; J Galagan; J G Gilbert; C Harmon; Y Hayashizaki; D Haussler; H Hermjakob; K Hokamp; W Jang; L S Johnson; T A Jones; S Kasif; A Kaspryzk; S Kennedy; W J Kent; P Kitts; E V Koonin; I Korf; D Kulp; D Lancet; T M Lowe; A McLysaght; T Mikkelsen; J V Moran; N Mulder; V J Pollara; C P Ponting; G Schuler; J Schultz; G Slater; A F Smit; E Stupka; J Szustakowki; D Thierry-Mieg; J Thierry-Mieg; L Wagner; J Wallis; R Wheeler; A Williams; Y I Wolf; K H Wolfe; S P Yang; R F Yeh; F Collins; M S Guyer; J Peterson; A Felsenfeld; K A Wetterstrand; A Patrinos; M J Morgan; P de Jong; J J Catanese; K Osoegawa; H Shizuya; S Choi; Y J Chen; J Szustakowki
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

4.  A greedy algorithm for aligning DNA sequences.

Authors:  Z Zhang; S Schwartz; L Wagner; W Miller
Journal:  J Comput Biol       Date:  2000 Feb-Apr       Impact factor: 1.479

5.  SSAHA: a fast search method for large DNA databases.

Authors:  Z Ning; A J Cox; J C Mullikin
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

6.  SGP-1: prediction and validation of homologous genes based on sequence alignments.

Authors:  T Wiehe; S Gebauer-Jung; T Mitchell-Olds; R Guigó
Journal:  Genome Res       Date:  2001-09       Impact factor: 9.043

7.  Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence.

Authors:  H Roest Crollius; O Jaillon; A Bernot; C Dasilva; L Bouneau; C Fischer; C Fizames; P Wincker; P Brottier; F Quétier; W Saurin; J Weissenbach
Journal:  Nat Genet       Date:  2000-06       Impact factor: 38.330

8.  Hidden Markov models for detecting remote protein homologies.

Authors:  K Karplus; C Barrett; R Hughey
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

9.  Improved tools for biological sequence comparison.

Authors:  W R Pearson; D J Lipman
Journal:  Proc Natl Acad Sci U S A       Date:  1988-04       Impact factor: 11.205

10.  Identification of common molecular subsequences.

Authors:  T F Smith; M S Waterman
Journal:  J Mol Biol       Date:  1981-03-25       Impact factor: 5.469

View more
  2000 in total

1.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

2.  Multiple, dispersed human U6 small nuclear RNA genes with varied transcriptional efficiencies.

Authors:  Angela M Domitrovich; Gary R Kunkel
Journal:  Nucleic Acids Res       Date:  2003-05-01       Impact factor: 16.971

3.  Comparison of whole genome assemblies of the human genome.

Authors:  Eric C Rouchka; Warren Gish; David J States
Journal:  Nucleic Acids Res       Date:  2002-11-15       Impact factor: 16.971

4.  Computational analysis of unassigned high-quality MS/MS spectra in proteomic data sets.

Authors:  Kang Ning; Damian Fermin; Alexey I Nesvizhskii
Journal:  Proteomics       Date:  2010-07       Impact factor: 3.984

5.  Genome analysis of Moraxella catarrhalis strain BBH18, [corrected] a human respiratory tract pathogen.

Authors:  Stefan P W de Vries; Sacha A F T van Hijum; Wolfgang Schueler; Kristian Riesbeck; John P Hays; Peter W M Hermans; Hester J Bootsma
Journal:  J Bacteriol       Date:  2010-05-07       Impact factor: 3.490

6.  Species-specific exon loss in human transcriptomes.

Authors:  Jinkai Wang; Zhi-xiang Lu; Collin J Tokheim; Sara E Miller; Yi Xing
Journal:  Mol Biol Evol       Date:  2014-11-14       Impact factor: 16.240

7.  Polymorphic NumtS trace human population relationships.

Authors:  Martin Lang; Marco Sazzini; Francesco Maria Calabrese; Domenico Simone; Alessio Boattini; Giovanni Romeo; Donata Luiselli; Marcella Attimonelli; Giuseppe Gasparre
Journal:  Hum Genet       Date:  2011-12-08       Impact factor: 4.132

8.  Transcription profiling and identification of infection-related genes in Phytophthora cactorum.

Authors:  Xiao-Ren Chen; Shen-Xin Huang; Ye Zhang; Gui-Lin Sheng; Bo-Yue Zhang; Qi-Yuan Li; Feng Zhu; Jing-You Xu
Journal:  Mol Genet Genomics       Date:  2017-12-07       Impact factor: 3.291

9.  Quantitative sequence characterization for repetitive DNA content in the supernumerary chromosome of the migratory locust.

Authors:  Francisco J Ruiz-Ruano; Josefa Cabrero; María Dolores López-León; Antonio Sánchez; Juan Pedro M Camacho
Journal:  Chromosoma       Date:  2017-09-04       Impact factor: 4.316

10.  Human chromosomal translocations at CpG sites and a theoretical basis for their lineage and stage specificity.

Authors:  Albert G Tsai; Haihui Lu; Sathees C Raghavan; Markus Muschen; Chih-Lin Hsieh; Michael R Lieber
Journal:  Cell       Date:  2008-12-12       Impact factor: 41.582

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.