| Literature DB >> 24316644 |
Osmat A Jefferson1, Deniz Köllhofer, Thomas H Ehrich, Richard A Jefferson.
Abstract
Entities:
Mesh:
Year: 2013 PMID: 24316644 PMCID: PMC7416664 DOI: 10.1038/nbt.2755
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Figure 1Patent sequences mapped on the human genome (GRCh37 at http://www.ncbi.nlm.nih.gov/projects/genome/assembly/grc/index.shtml).
(a) Mapping was based on various similarity and query length coverage rates (90% 50%, 98% 95%, and 100% 100%). Unique patent sequences refer to sequences with only unique mapping locus. Although the majority of nonredundant sequences were declared human in the patent documents, around 20% were unspecified or nonhuman. (b) The internal chart shows only mapped sequences that are referenced in the granted claims (1% of the data); redundant sequence counts (R), nonredundant sequence counts (NR) and their corresponding patent grants counts. (c) Homology-based human transcriptome and proteome analysis based on two filters of E-value and percentage of minimum hit coverage.
Figure 2Categorization of patent sequences referenced in the claims.
(a,b) We categorized 76,910 unique patent sequences that map with 100% homology to the human genome and are referenced in the claims of 2,685 granted US patents according to the role of the sequence in the claim. In the distribution based on patents (a), if a patent had different sequences in multiple categories, the patent was counted more than once. However, for the distribution based on sequences (b), if the sequences were referenced in two different contexts in the same set of claims, they were categorized only once, according to the broadest category. For example, a sequence referenced both as a primer for use in a method claim and also claimed as a primer sequence would be categorized as 'Sequence claimed.' We used ten categories. 'Sequence claimed' includes claims for isolated nucleotide sequences, sequences specifically claimed as primer or probe sequences and antisense sequences. 'Subpart' includes sequences that are part of a larger sequence, sequences that are one of several sequences claimed as a set, and sequences claimed alongside nonsequence substances, such as a pharmaceutical carrier. 'Alter phenotype' includes sequences in method claims in which the sequence is used to alter a cell, tissue or organism. 'Comparison/target' includes sequences in claims that employ the sequence in a comparison (e.g., of methylation or expression), in a screening assay or as a target for some claimed product or method. 'Probe or primer used in a method claim' includes sequences that are referenced as a probe or primer to be used in a claimed method. 'RNA/double-stranded' includes sequences that are specifically claimed as RNA sequences along with sequences that are claimed as double-stranded. 'Artificial' includes sequences claimed that differ from the wild-type version of the sequence (so that manipulation of the wild-type sequence would be unlikely to infringe the claim), as well as sequences that are identical to the wild-type sequence, but with chemical modifications to the backbone or sugar residues. 'Peptide' includes claims for a peptide made with reference to the coding nucleotide. 'Reference' includes sequences that are generally referenced either as a placeholder or in the negative. 'Submarine' sequences are claims wherein the sequence ID is not referenced in the claim but the patent claims a broad set of sequences (e.g., US 6258540). By way of comparison, Merz et al. write that “Gene patents cover three distinct types of invention: diagnostics, compositions of matter and functional uses.”[10] Our categories do not correspond perfectly to Merz's, but generally Sequence claimed, Subpart, RNA, Artificial, Peptide and Submarine categories all correspond to compositions of matter. Comparison/target corresponds to diagnostics, Alter phenotype corresponds to functional uses, and Probe or primer used in a method corresponds to either diagnostics or functional, depending on the specific claim. (c,d) Profiles of the major categories are depicted based on publication date (c) or filing date (d).
Figure 3Sequences claimed by applicant and year.
Patent sequences, which were categorized as 'Sequence claimed,' are shown by applicant and by the year in which the patent claiming the sequence was granted by the US Patent and Trademark Office. Some relevant legal and regulatory events affecting the patenting of genetic sequence in the US are also shown across the x-axis, and percent inactive patents per year is also depicted on top of each bar. Patent sequences claimed from 2004–2012 are also shown in the internal chart to allow for differences in scale between the number of claimed sequences before and after 2003.
Percentage of claimed sequences in expired patents in the collection of 927 patents that contain claimed sequences and which map with 100% homology onto human genome
| Applicant category | Total number of sequences | Sequences in expired patents | Percentage of sequences in expired patents |
|---|---|---|---|
| Corporation | 7,584 (554 patents) | 982 (154 patents) | 13% |
| Corporation & research institute | 70 (9) | 14 (3) | 20% |
| Corporation & university | 265 (16) | 17 (4) | 6% |
| Government | 210 (38) | 80 (14) | 38% |
| Hospital | 38 (14) | 18 (7) | 47% |
| Hospital & research institute | 289 (18) | 54 (6) | 19% |
| Individual | 228 (27) | 10 (6) | 4% |
| Research institute | 359 (100) | 86 (30) | 24% |
| University | 846 (149) | 367 (55) | 43% |
Figure 4Patent sequences, which map to the human genome with various homology thresholds, can now be investigated using PatSeq Explorer-Human genome (http://www.lens.org/lens/bio/patseqexplorer).
In this example, disclosed sequences in patents with applicant “Myriad” and which map to the human genome with 90–100% similarity and 100% coverage are displayed in PatSeq Explorer. Under 'Filters' option, users can view patenting trends based on either publication or filing dates, or filter based on jurisdiction, sequence length, species or document type. Under 'Search' option, users can interrogate the data based on patent attributes such as claims, applicant, owner, inventor and classification. In the chromosome view, added features include linking to the OMIM database (turquoise panel) for associated disorders and traits on that particular position, viewing the document collection in the Lens at http://www.lens.org (brown panel), and analyzing the data at the loci/gene/sequence regions using PatSeq Analyzer.(orange panel).
Survey results on public availability of biological patent sequences from patent offices in 2011
| Jurisdiction | Public availability of sequence listings | Format provided | Coverage |
|---|---|---|---|
| Australia | Sequence listings can be accessed through AusPat ( | Mostly in pdf format | Since 1978 |
| Belgium | Access is through a paid service of 0.36 euro per (scanned) page. | Scanned images | Since 2005 and only from granted patents |
| Bulgaria | Sequence listings are provided only as a part of a patent or application. It is a paid service. | Scanned pdfs | Since September 1, 2007 |
| Canada | Electronic files and scanned images of sequence listings are available on the internet from the Canadian Patents Database (CPD). There is no fee applicable for the retrieval of files. | Multiple formats | Since October 1, 1996 and maintains full records |
| Chile | No available sequence listings yet. | Not applicable | Not applicable |
| China | Some sequence listings are available as part of patent documents. | Mostly in pdf and scanned images | No response |
| Czech Republic | A sequence listing is made available as part of the patent document (grant or published application), which can be accessed online free of charge ( | Unsearchable pdf files | Since 1991 |
| Denmark | As of July 13, 2011, Danish patent office had 5 granted patents with sequence listings and 25 published applications with sequence listings. | Unsearchable scanned copies | Since 1987 and maintains full records |
| Estonia | Available as part of the patent publication. | Multiple formats | No response yet. |
| Eurasian Patent Organization | Published Eurasian applications and patents may be viewed in EAPO Gazette and on EAPO website ( | Scanned copies | Since 1996 for patent applications and 1997 for grants |
| Finland | Accessible online ( | pdf format | Since 2013 for patent applications |
| France | Sequence listings are part of the description in French patent documents. They are accessible at | pdf format | Since 1978 for patent applications and 1989 for granted patents |
| Germany | Granted patents and published patent applications as of 1991 may be researched through the office's website ( | Computer searchable | Since 1991 with full records kept |
| Great Britain | Sequence listings are published with a patent application as it becomes open to the public inspection on the publication date. | Multiple formats | Since 2006, but sequence listings were accepted well before this date |
| Greece | Sequences are provided as part of patent documents and by a formal request. This is a paid service in a case-by-case decision of the Administrative Council of the Industrial Property Organization. | Multiple formats | Since 1988 for patent applications |
| Hungary | Sequence listings of all published patent applications are available online ( | Searchable pdf format | Since January 1, 2003 with records kept |
| Iceland | If a sequence listing forms part of the issued patent or the application, a paper copy of the list is joining the document and available upon request. | pdf format | Since 1984 with full records kept |
| Ireland | The office only provides the specifications disclosed. The sequences therein are not independently obtainable. | Scanned images | Since 1970 for granted patents and since 1991 for applications |
| Israel | Sequence listings are not published along with the patent. | Not applicable | Not available to the public |
| Japan | JPO provides gazettes for granted patents and published patent applications via the Industrial Property Digital Library (IPDL; | Multiple formats | |
| Liechtenstein | See responses from Switzerland. | ||
| Republic of Lithuania | Sequence listings present in patent specification are available on internet together with the whole specification (espacenet and | Scanned images but no computer readable files | Since 1994 for full-text grants and since January 1, 2010 for applications |
| Korea | Sequence listings are provided as part of patent documents as bulk download. | Multiple formats | |
| Netherlands | Sequence listings form part of the patent dossier and are public from the day the patent is signed in and granted ( | pdf or tif image | |
| Peru | Available on internet for the years 2000–2006 at Indecopi ( | pdf format | There is no registry of the patent applications and/or patents that have sequence listings, but there is a registry for biotech patents |
| Poland | A sequence listing is made available as part of the granted patent document only and can be accessed either by official request or online ( | pdf format | Since 1990 |
| Portugal | A copy of the granted patent (or the published patent) that contains the sequence listings can be provided upon request. | pdf format | No response yet |
| Romania | Patents containing sequence listings are published within 4 months after the grant ( | Multiple formats | No response yet |
| Russia | Information regarding nucleic and amino acid sequences is available only within the description of issued patents. | Scanned copies | No information |
| Slovak Republic | Sequence listings available online ( | pdf format | No response yet |
| Slovenia | Available online ( | pdf format | Since 1992 |
| Spain | Sequence listings can be obtained from published patent documents. | Scanned image | No response yet |
| Switzerland | Available via espacenet or searchable online ( | pdf and scanned images | Since 1995 |
| Taiwan | Available as part of patent specifications or published application specifications from the patent search database. | Scanned image | From May 1, 2003 |
| Turkey | Available as part of patent documents. | Scanned image | Since 1995 for applications and issued patents |
| USA | Available online as part of patent documents, as downloadable bulk, and deposits in GenBank. | Multiple formats | Since 1982 for grants and 2001 for applications |
Number of patent documents, which contain formatted sequence listings, as extracted from United States Patent and Trademark Office (USPTO) full text and bulk listings data sets and compared to those available at GenBank-Patent division (GB-PAT) from 1982–2013
| Publication date | USPTO documents | Gb-PAT documents | Corresponding sequences missing from GB-PAT |
|---|---|---|---|
| Mar 1982–Jun 1990 | Selected grants were sent to NCBI | 1,005 (23,850 sequences) | Not known |
| Jun 1990–Nov 1992 | 2 | 0 | 4 |
| Nov 1992–Dec 2001 | 18,391 grants | 16,220 grants | 127,588 |
| Jan 2001–April 2013 | 97,567 applications | 0 | 78,556,258 |
The last column shows the number of patent sequences that were missing in GB-PAT division and thus not available in machine-searchable form to the public. Individually, an electronic copy of the sequence listing can be requested from USPTO based on a fee set under 37 CFR 1.19(b)(3) (http://www.uspto.gov/web/offices/pac/mpep/s2435.html).