Chang Wan Seo, Sung Hyun Kim, Young Woon Lim, Myung Soo Park.
Abstract
Penicillium species have been actively studied in various fields, and many new and previously unrecorded species continue to be reported in Korea. Moreover, unidentified and misidentified Korean Penicillium records still exist in GenBank. Therefore, the Korean Penicillium inventory needs to be revised on the basis of accurate identification. We collected Korean Penicillium nucleotide sequence records from GenBank using newly developed software, GenMine, and re-identified them based on maximum likelihood trees. A total of 1681 Korean Penicillium nucleotide sequence records were collected. From these records, 1208 strains with sequences of four major genes (internal transcribed spacer rDNA region, β-tubulin, calmodulin, and RNA polymerase II) were selected for re-identification. Among the 1208 strains, 927 were identified as Penicillium species, 82 were identified as other genera, and the remaining 199 were undetermined because of low phylogenetic resolution. The identified strains comprised 206 Penicillium species, including 156 recorded species and 50 new species candidates. However, 37 species recorded in the national list of species in Korea were not found in GenBank. Whether these species are present in Korea needs to be examined through literature investigation, additional sampling, and sequencing. Our study can serve as the basis for updating the Korean Penicillium inventory.
Keywords: GenBank; Penicillium; inventory; re-identification; tree-based identification
Year: 2022 PMID: 36158042 PMCID: PMC9467555 DOI: 10.1080/12298093.2022.2116816
Source DB: PubMed Journal: Mycobiology ISSN: 1229-8093 Impact factor: 1.946
Figure 1. Algorithms and features of GenMine software.
Figure 2. Composition and annual growth of Korean Penicillium records in GenBank. The pie chart shows the composition, by gene type, of Korean Penicillium records collected by GenMine with the options “Korea” and “Penicillium.” The line graph shows the increase in Korean Penicillium records over time.
Figure 3. Diagram of the re-identification process and results for Korean Penicillium ITS and BenA sequences from GenBank. The sequences were re-identified by RAxML tree-based identification and compared with the original annotation of the corresponding GenBank record. Sequences without a scientific name in the GenBank record were labeled “Unassigned,” and sequences that could not be identified because of the low resolution of the phylogenetic tree were labeled “Undetermined.” Numbers in the diagram show the number of records in each category.
Figure 4. Diagram of the re-identification process and results for Korean Penicillium RPB2 and CaM sequences from GenBank. The sequences were re-identified by RAxML tree-based identification and compared with the original annotation of the corresponding GenBank record. Sequences without a scientific name in the GenBank record were labeled “Unassigned,” and sequences that could not be identified because of the low resolution of the phylogenetic tree were labeled “Undetermined.” Numbers in the diagram show the number of records in each category.
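The labeling rule described in the captions of Figures 3 and 4 can be sketched in a few lines of Python. This is only an illustration, not the GenMine implementation: the function names `label_pair` and `tally_flows` are hypothetical, a missing name is assumed to be represented as `None` or an empty string, and the example records are toy data rather than values from the study.

```python
from collections import Counter
from typing import Iterable, Optional, Tuple

Record = Tuple[Optional[str], Optional[str]]


def label_pair(original: Optional[str], reidentified: Optional[str]) -> Tuple[str, str]:
    """Return (original label, re-identified label) for one sequence record.

    Assumption: a record with no scientific name in GenBank has original=None
    or "", and a sequence the tree could not resolve has reidentified=None or "".
    """
    orig_label = original.strip() if original and original.strip() else "Unassigned"
    new_label = reidentified.strip() if reidentified and reidentified.strip() else "Undetermined"
    return orig_label, new_label


def tally_flows(records: Iterable[Record]) -> Counter:
    """Count how many records fall into each (original, re-identified) category."""
    return Counter(label_pair(orig, new) for orig, new in records)


# Toy records (hypothetical, not taken from the paper).
toy_records = [
    ("Penicillium citrinum", "Penicillium citrinum"),  # annotation confirmed
    (None, "Penicillium oxalicum"),                    # unnamed record, now identified
    ("", None),                                        # unnamed and unresolved
]
flows = tally_flows(toy_records)
for (orig, new), count in sorted(flows.items()):
    print(f"{orig} -> {new}: {count}")
```

Counting the pairs in this way yields the per-category record numbers that a flow diagram of this kind displays.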
Number of Korean Penicillium strains and species found in GenBank.
| Strains | |
|---|---|
| Total | 1208 |
| Identified | 927 |
| Undetermined | 199 |
| Other genera | 82 |
| Species | |
| Total | 206 |
| New species candidates | 50 |
| Recorded (worldwide) | 156 |
| Recorded (National list of Korea) | 80 |
| Unrecorded (National list of Korea) | 76 |
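
The counts in the table are internally consistent, which a short arithmetic check makes explicit. The sketch below uses only the numbers reported in the table; the variable and function names are illustrative.

```python
# Counts copied from the table above.
strains = {"Identified": 927, "Undetermined": 199, "Other genera": 82}
strains_total = 1208

species_total = 206
new_candidates = 50
recorded_worldwide = 156
recorded_national = 80
unrecorded_national = 76


def check_tallies() -> dict:
    """Verify that each subtotal in the table adds up to its parent total."""
    return {
        "strains": sum(strains.values()) == strains_total,
        "species": new_candidates + recorded_worldwide == species_total,
        "recorded": recorded_national + unrecorded_national == recorded_worldwide,
    }


checks = check_tallies()
print(checks)

# Share of the selected strains that were identified as Penicillium species.
identified_share = strains["Identified"] / strains_total
print(f"Identified: {identified_share:.1%}")
```

The three checks correspond to 927 + 199 + 82 = 1208 strains, 50 + 156 = 206 species, and 80 + 76 = 156 recorded species.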