| Literature DB >> 23180790 |
Osamu Ogasawara1, Jun Mashima, Yuichi Kodama, Eli Kaminuma, Yasukazu Nakamura, Kousaku Okubo, Toshihisa Takagi.
Abstract
The DNA data bank of Japan (DDBJ, http://www.ddbj.nig.ac.jp) maintains a primary nucleotide sequence database and provides analytical resources for biological information to researchers. This database content is exchanged with the US National Center for Biotechnology Information (NCBI) and the European Bioinformatics Institute (EBI) within the framework of the International Nucleotide Sequence Database Collaboration (INSDC). Resources provided by the DDBJ include traditional nucleotide sequence data released in the form of 27 316 452 entries or 16 876 791 557 base pairs (as of June 2012), and raw reads of new generation sequencers in the sequence read archive (SRA). A Japanese researcher published his own genome sequence via DDBJ-SRA on 31 July 2012. To cope with the ongoing genomic data deluge, in March 2012, our computer previous system was totally replaced by a commodity cluster-based system that boasts 122.5 TFlops of CPU capacity and 5 PB of storage space. During this upgrade, it was considered crucial to replace and refactor substantial portions of the DDBJ software systems as well. As a result of the replacement process, which took more than 2 years to perform, we have achieved significant improvements in system performance.Entities:
Mesh:
Year: 2012 PMID: 23180790 PMCID: PMC3531146 DOI: 10.1093/nar/gks1152
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
List of large-scale data released by the DDBJ from July 2011 to June 2012
| Type | Organism | Accession number (number of entries) |
|---|---|---|
| Genome | scaffold CON: DF093604–DF097774 (4171 entries) | |
| WGS: BACK01000001–BACK01053640 (53 640 entries) | ||
| Sake yeast ( | scaffold CON: DG000037–DG000052 (14 entries) | |
| WGS: BABQ01000001–BABQ01000705 (705 entries) | ||
| Mitochondrion: AP012028 (1 entry) | ||
| Liver fluke ( | Phase 1 | |
| scaffold CON: DF126616–DF142827 (16 212 entries) | ||
| WGS: BADR01000001–BADR01060778 (60 778 entries) | ||
| Phase 2 | ||
| WGS: BADR02000001–BADR02006190 (6190 entries) | ||
| scaffold CON: DF142828-DF145382 (2555 entries) | ||
| Hitomebore rice ( | scaffold CON: DG000053–DG000064 (12 entries) | |
| WGS: BACJ01000001–BACJ01064745 (64 745 entries) | ||
| Eucaly ( | scaffold CON: DF097775–DF126446 (28 672 entries) | |
| WGS: BADO01000001–BADO01274001 (274 001 entries) | ||
| Full-length cDNA | Silkworm ( | AK377185–AK388575 (11 160 entries; 231 entries dropped) |
| Pig ( | AK389169–AK401026(11 858 entries) | |
| TSA | FX056085–FX112549 (56 465 entries) | |
| EST | FY358876–FY368220 (9,345 entries) | |
| Sea squirt ( | FY844421–FY896670 (52 250 entries) | |
| Silkworm ( | FS724152–FS939542, FY736910–FY762881 (241 363 entries) | |
| Eucaly ( | FY782538–FY841121 (58 584 entries) | |
| Bread wheat ( | HX000001–HX201765, HX247045–HX257200 (211 921 entries) | |
| Honey bee ( | HX282115–HX373155 (91 041 entries) | |
| Human ( | HY000001–HY377477 (377 477 entries) | |
| Asian Swallowtail ( | FY174038–FY210626 (36 589 entries) | |
| Common Mormon ( | FY302525–FY358875 (56 351 entries) | |
| MGA | Human ( | AEAAA0000001–AEAAA0026367, AEAAB0000001–AEAAB0012114, |
| AEAAC0000001–AEAAC0021096, AEAAD0000001–AEAAD0024262, | ||
| AEAAE0000001–AEAAE0023437, AEAAF0000001–AEAAF0030485, | ||
| AEAAG0000001–AEAAG0021798, AEAAH0000001–AEAAH0040734, | ||
| AEAAI0000001–AEAAI0029614, AEAAJ0000001–AEAAJ0030206 (260 113 entries) |
Figure 1.In the new DDBJ keyword search system, the web application was completely reimplementated to resolve the practical scalability limitations of the previous search engine. In the new system, not only normal divisions of INSD, but also EST, GSS, WGS have been made searchable.