Theerapong Krajaejun, Weerayuth Kittichotirat, Preecha Patumcharoenpol, Thidarat Rujirawat, Tassanee Lohnoo, Wanta Yingyong.
Abstract
OBJECTIVES: The oomycete Pythium insidiosum infects humans and animals worldwide and causes a life-threatening condition called pythiosis. Most patients lose infected organs or die from the disease. Comparative genomic analyses of different P. insidiosum strains could provide new insights into its pathobiology and could lead to the discovery of an effective treatment. Several draft genomes of P. insidiosum are publicly available: three from Asia (Thailand), and one each from North America (the United States) and Central America (Costa Rica). We report another draft genome of P. insidiosum, isolated from South America (Brazil), to serve as a resource for comprehensive genomic studies. DATA DESCRIPTION: In this study, we report the genome sequence of P. insidiosum strain CBS 101555, isolated from a horse with pythiosis in Brazil. One paired-end (180-bp insert) library of processed genomic DNA was prepared for sequencing on the Illumina HiSeq 2500 platform. Assembly of the raw reads yielded a genome size of 48.9 Mb, comprising 60,602 contigs. A total of 23,254 genes were predicted and classified into 18,305 homologous gene clusters. Compared with the reference genome (P. insidiosum strain Pi-S), 1,475,337 sequence variants (SNPs and INDELs) were identified in the organism. The genome sequence data have been deposited in DDBJ under the accession numbers BCFP01000001-BCFP01060602.
Keywords: Gene cluster; Genome; Oomycete; Pythiosis; Pythium insidiosum; Sequence variant
Year: 2018 PMID: 30537981 PMCID: PMC6290497 DOI: 10.1186/s13104-018-3968-3
Source DB: PubMed Journal: BMC Res Notes ISSN: 1756-0500
Overview of data files/data sets
| Label | Name of data file/data set | File types (file extension) | Data repository and identifier (DOI or accession number) |
|---|---|---|---|
| Data file 1 | Whole genome sequence | FASTA | DDBJ (Accession numbers: BCFP01000001–BCFP01060602) |
| Data file 2 | Gene clusters | MS Excel file (.xlsx) | Mendeley database (10.17632/yjyzx5gk7s.1) |
| Data file 3 | Clusters of Orthologous Groups of Proteins (COGs) | MS Excel file (.xlsx) | Mendeley database (10.17632/5rhfd4n37k.1) |
| Data file 4 | Sequence variants | MS Excel file (.xlsx) | Mendeley database (10.17632/4y8hdw7tb7.1) |
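
The accession range for Data file 1 can be checked against the contig count given in the abstract. The sketch below is illustrative only and is not part of the authors' pipeline. It assumes each accession is a four-letter project code plus a two-digit version (`BCFP01`) followed by a zero-padded serial number, and that the range is contiguous with one accession per contig. The helper names `split_accession` and `expand_range` are hypothetical.

```python
import re

# Values quoted from the abstract and the data table above.
FIRST_ACCESSION = "BCFP01000001"
LAST_ACCESSION = "BCFP01060602"
REPORTED_CONTIGS = 60_602
GENOME_SIZE_BP = 48_900_000  # 48.9 Mb, rounded as reported

# Assumed layout: 4-letter project code + 2-digit version + serial number.
ACCESSION_PATTERN = re.compile(r"^([A-Z]{4}\d{2})(\d+)$")


def split_accession(accession):
    """Split an accession into its (prefix, zero-padded serial) parts."""
    match = ACCESSION_PATTERN.match(accession)
    if match is None:
        raise ValueError(f"unexpected accession format: {accession!r}")
    return match.group(1), match.group(2)


def expand_range(first, last):
    """List every accession from first to last, inclusive."""
    first_prefix, first_serial = split_accession(first)
    last_prefix, last_serial = split_accession(last)
    if first_prefix != last_prefix or len(first_serial) != len(last_serial):
        raise ValueError("accessions do not belong to the same series")
    width = len(first_serial)
    return [
        f"{first_prefix}{number:0{width}d}"
        for number in range(int(first_serial), int(last_serial) + 1)
    ]


accessions = expand_range(FIRST_ACCESSION, LAST_ACCESSION)

# Under the one-accession-per-contig assumption, the range length
# should equal the reported number of contigs.
range_matches_contigs = len(accessions) == REPORTED_CONTIGS

# Rough mean contig length implied by the rounded genome size.
mean_contig_bp = GENOME_SIZE_BP / REPORTED_CONTIGS

print(len(accessions), range_matches_contigs, round(mean_contig_bp))
```

Under these assumptions the range expands to 60,602 accessions, matching the reported contig count, and the implied mean contig length is roughly 807 bp. Because the genome size is rounded to 48.9 Mb, the mean is only approximate.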