| Literature DB >> 30533823 |
William G Miller1, Emma Yee1, James L Bono2.
Abstract
Arcobacter cryaerophilus was originally recovered from aborted bovine and porcine fetuses, but it has been subsequently isolated from meat, water, and human clinical samples. This study describes the complete whole-genome sequences of two A. cryaerophilus strains, ATCC 43158T (=A 169/BT =LMG 24291T) and ATCC 49615 (=CDC D2610 =LMG 10829).Entities:
Year: 2018 PMID: 30533823 PMCID: PMC6256622 DOI: 10.1128/MRA.01463-18
Source DB: PubMed Journal: Microbiol Resour Announc ISSN: 2576-098X
Sequencing metrics and genomic data for Arcobacter cryaerophilus strains ATCC 43158T and ATCC 49615
| Feature | Value(s) | |
|---|---|---|
| ATCC 43158T | ATCC 49615 | |
| Sequencing metrics | ||
| 454 (shotgun) platform | ||
| No. of reads | 215,129 | 87,759 |
| No. of bases | 90,272,116 | 36,134,765 |
| Avg length (bases) | 419.6 | 411.7 |
| Coverage (×) | 45.0 | 17.6 |
| 454 (paired-end) platform | ||
| No. of reads | 118,785 | 161,571 |
| No. of bases | 38,763,675 | 49,505,772 |
| Avg length (bases) | 326.3 | 306.4 |
| Coverage (×) | 19.3 | 24.1 |
| llumina HiSeq 2000 platform | ||
| No. of reads | 18,366,932 | 18,769,966 |
| No. of bases | 1,836,693,200 | 1,895,766,566 |
| Avg length (bases) | 100 | 101 |
| Coverage (×) | 915.2 | 922.1 |
| PacBio platform | ||
| No. of reads | 117,659 | NA |
| No. of bases | 437,690,629 | NA |
| Avg length (bases) | 3,720 | NA |
| Coverage (×) | 218.1 | NA |
| Newbler metrics | ||
| Total no. of contigs | 45 | 32 |
| N50ContigSize (454) | 195,432 | 346,474 |
| Q40PlusBases (454) (%) | 99.90 | 99.87 |
| N50ContigSize (HiSeq pool 1) | 195,198 | 301,090 |
| Q40PlusBases (HiSeq pool 1) (%) | 99.98 | 99.96 |
| N50ContigSize (HiSeq pool 2) | 195,524 | 165,874 |
| Q40PlusBases (HiSeq pool 2) (%) | 99.96 | 99.96 |
| N50ContigSize (HiSeq pool 3) | 195,196 | 301,081 |
| Q40PlusBases (HiSeq pool 3) (%) | 99.97 | 99.99 |
| Genomic data | ||
| Size (bp) | 2,006,909; 101,431 | 2,055,914 |
| G+C content (%) | 27.46; 24.63 | 27.51 |
| No. of CDS | 1,951; 94 | 1,994 |
| Assigned function (% CDS) | 847 (43.4); 3 (3.2) | 857 (43.0) |
| General function annotation (% CDS) | 675 (34.6); 34 (36.2) | 694 (34.8) |
| Domain/family annotation only (% CDS) | 131 (6.7); 4 (4.3) | 136 (6.8) |
| Hypothetical (% CDS) | 298 (15.3); 53 (56.4) | 307 (15.4) |
| Pseudogenes | 23; 6 | 8 |
| Genomic islands/CRISPR | ||
| No. of genetic islands | 8; 0 | 5 |
| No. of CDS in genetic islands | 108 [2]; 0 | 85 [2] |
| No. of CRISPR/Cas loci | 0 | 0 |
| Gene content/pathways | ||
| IS elements, mobile elements, or transposases | 2; 1 | 2 |
| Signal transduction | ||
| No. of Che proteins | 9; 0 | 9 |
| No. of methyl-accepting chemotaxis proteins | 20; 0 | 23 |
| No. of response regulators | 23; 2 | 25 |
| No. of histidine kinases | 25; 1 | 27 |
| No. of response regulator/histidine kinase fusions | 0 | 3 |
| No. of diguanylate cyclases | 14; 0 | 15 |
| No. of diguanylate phosphodiesterases (HD-GYP, EAL) | 4, 1; 0 | 5, 1 |
| No. of diguanylate cyclase/phosphodiesterases | 9; 0 | 9 |
| No. of other | 10; 1 | 10 |
| Motility | ||
| Flagellin genes | ||
| Restriction/modification | ||
| No. of type I systems ( | [2]; 0 | 2 |
| No. of type II systems | 1; 0 | 3 |
| No. of type III systems | 0 | 2 |
| Transcription/translation | ||
| No. of transcriptional regulatory proteins | 32; 4 | 32 |
| Non-ECF σ factors | σ70 | σ70 |
| No. of ECF σ factors | 0 | 0 |
| No. of tRNAs | 49; 0 | 51 |
| No. of ribosomal loci | 5; 0 | 5 |
| CO dehydrogenase ( | No | No |
| Ethanolamine utilization ( | No | No |
| Nitrogen fixation ( | No | No |
| Osmoprotection | No | |
| Pyruvate → acetyl-CoA | ||
| Pyruvate dehydrogenase (E1/E2/E3) | Yes | Yes |
| Pyruvate:ferredoxin oxidoreductase | No | No |
| Urease | No | No |
| Vitamin B12 biosynthesis | No | No |
Numbers in square brackets indicate pseudogenes or fragments. Strain ATCC 43158T values before a semicolon are for the chromosome, while values after the semicolon are for the pACRY43158 plasmid. NA, not applicable.
CDS, coding sequences; ECF, extracytoplasmic function; acetyl-CoA, acetyl coenzyme A.
Maximum length, 25,867 bases.
Features and values taken from largeContigMetrics within 454NewblerMetrics.txt for each assembly. Large contigs were defined as having ≥500 bases. Due to the large number of HiSeq reads, the total reads were split into three pools and assembled independently. N50ContigSize value is in number of bases.
Numbers do not include pseudogenes.