Literature DB >> 9403055

Comparison of DNA sequences with protein sequences.

W R Pearson1, T Wood, Z Zhang, W Miller.   

Abstract

The FASTA package of sequence comparison programs has been expanded to include FASTX and FASTY, which compare a DNA sequence to a protein sequence database, translating the DNA sequence in three frames and aligning the translated DNA sequence to each sequence in the protein database, allowing gaps and frameshifts. Also new are TFASTX and TFASTY, which compare a protein sequence to a DNA sequence database, translating each sequence in the DNA database in six frames and scoring alignments with gaps and frameshifts. FASTX and TFASTX allow only frameshifts between codons, while FASTY and TFASTY allow substitutions or frameshifts within a codon. We examined the performance of FASTX and FASTY using different gap-opening, gap-extension, frameshift, and nucleotide substitution penalties. In general, FASTX and FASTY perform equivalently when query sequences contain 0-10% errors. We also evaluated the statistical estimates reported by FASTX and FASTY. These estimates are quite accurate, except when an out-of-frame translation produces a low-complexity protein sequence. We used FASTX to scan the Mycoplasma genitalium, Haemophilus influenzae, and Methanococcus jannaschii genomes for unidentified or misidentified protein-coding genes. We found at least 9 new protein-coding genes in the three genomes and at least 35 genes with potentially incorrect boundaries.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9403055     DOI: 10.1006/geno.1997.4995

Source DB:  PubMed          Journal:  Genomics        ISSN: 0888-7543            Impact factor:   5.736


  232 in total

1.  Two classes of genes in plants.

Authors:  N Carels; G Bernardi
Journal:  Genetics       Date:  2000-04       Impact factor: 4.562

2.  Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22.

Authors:  Paul M Harrison; Hedi Hegyi; Suganthi Balasubramanian; Nicholas M Luscombe; Paul Bertone; Nathaniel Echols; Ted Johnson; Mark Gerstein
Journal:  Genome Res       Date:  2002-02       Impact factor: 9.043

3.  Comprehensive analysis of amino acid and nucleotide composition in eukaryotic genomes, comparing genes and pseudogenes.

Authors:  Nathaniel Echols; Paul Harrison; Suganthi Balasubramanian; Nicholas M Luscombe; Paul Bertone; Zhaolei Zhang; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2002-06-01       Impact factor: 16.971

4.  Web-based visualization tools for bacterial genome alignments.

Authors:  L Florea; C Riemer; S Schwartz; Z Zhang; N Stojanovic; W Miller; M McClelland
Journal:  Nucleic Acids Res       Date:  2000-09-15       Impact factor: 16.971

5.  Analysis of histone acetyltransferase and histone deacetylase families of Arabidopsis thaliana suggests functional diversification of chromatin modification among multicellular eukaryotes.

Authors:  Ritu Pandey; Andreas Müller; Carolyn A Napoli; David A Selinger; Craig S Pikaard; Eric J Richards; Judith Bender; David W Mount; Richard A Jorgensen
Journal:  Nucleic Acids Res       Date:  2002-12-01       Impact factor: 16.971

Review 6.  SRY protein function in sex determination: thinking outside the box.

Authors:  Liang Zhao; Peter Koopman
Journal:  Chromosome Res       Date:  2012-01       Impact factor: 5.239

7.  Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome.

Authors:  Zhaolei Zhang; Paul M Harrison; Yin Liu; Mark Gerstein
Journal:  Genome Res       Date:  2003-12       Impact factor: 9.043

8.  Sequence variations of the human MPDZ gene and association with alcoholism in subjects with European ancestry.

Authors:  Victor M Karpyak; Jeong-Hyun Kim; Joanna M Biernacka; Eric D Wieben; David A Mrazek; John L Black; Doo-Sup Choi
Journal:  Alcohol Clin Exp Res       Date:  2009-01-21       Impact factor: 3.455

9.  N6-methyladenosine (m6A) recruits and repels proteins to regulate mRNA homeostasis.

Authors:  Raghu R Edupuganti; Simon Geiger; Rik G H Lindeboom; Hailing Shi; Phillip J Hsu; Zhike Lu; Shuang-Yin Wang; Marijke P A Baltissen; Pascal W T C Jansen; Martin Rossa; Markus Müller; Hendrik G Stunnenberg; Chuan He; Thomas Carell; Michiel Vermeulen
Journal:  Nat Struct Mol Biol       Date:  2017-09-04       Impact factor: 15.369

10.  Isomerization of the intersubunit disulphide-bond in Env controls retrovirus fusion.

Authors:  Michael Wallin; Maria Ekström; Henrik Garoff
Journal:  EMBO J       Date:  2003-12-11       Impact factor: 11.598

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.