| Literature DB >> 17085479 |
Byungwook Lee1, Taehyung Kim, Seon-Kyu Kim, Kwang H Lee, Doheon Lee.
Abstract
With the advent of automated and high-throughput techniques, the number of patent applications containing biological sequences has been increasing rapidly. However, they have attracted relatively little attention compared to other sequence resources. We have built a database server called Patome, which contains biological sequence data disclosed in patents and published applications, as well as their analysis information. The analysis is divided into two steps. The first is an annotation step in which the disclosed sequences were annotated with RefSeq database. The second is an association step where the sequences were linked to Entrez Gene, OMIM and GO databases, and their results were saved as a gene-patent table. From the analysis, we found that 55% of human genes were associated with patenting. The gene-patent table can be used to identify whether a particular gene or disease is related to patenting. Patome is available at http://www.patome.org/; the information is updated bimonthly.Entities:
Mesh:
Year: 2006 PMID: 17085479 PMCID: PMC1781150 DOI: 10.1093/nar/gkl807
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1Overview of the analysis flow in Patome. The downloaded sequences are divided into nucleic acids and amino acids. Nucleic acids are annotated with both RefSeq mRNAs and microbial mRNAs. Amino acids are annotated with RefSeq proteins. The BLAST results are filtered with three filtering methods: E-value & percent identity, organism comparison and coverage. From the mapping between the annotated data and biological database information (Gene, GO, OMIM), a gene–patent association table is built. The Patome database is constructed to store annotation and analysis data and to make this information available to users via web interfaces.
Summary of patent-related genes in the major organisms
| Organism | No. of genes | No. of patent-related genes | Patent-related genes (in %) |
|---|---|---|---|
| 39 216 | 21 478 | 55 | |
| 47 275 | 12 045 | 25 | |
| 21 146 | 8 503 | 40 | |
| 31 386 | 5 262 | 17 | |
| 60 714 | 4 479 | 7 | |
| 23 973 | 3 542 | 15 |
aThe number of genes is from NCBI Entrez Gene.
Figure 2Output of the Patome search. (A) Output of only patent (or application) number search. (B) Output of both patent (or application) number and SEQID search.