Literature DB >> 35389260

Genome Sequences of Five Arenaviruses from Pygmy Mice (Mus minutoides) in Sierra Leone.

Matej Vučak1, James Bangura2, Bruno M Ghersi3, Jenna Nichols1, Joseph Hughes1, Ana da Silva Filipe1, Alexandre Tremeau-Bravard3, David J Wolking3, Emmanuel Amara2, Abdulai Bangura2, Marilyn C Kanu2, Osman T Kanu2, Dickson Kargbo2, Edwin G Lavalie2, Victor Lungay2, Willie Robert2, Mohamed Turay2, Steven Fornie4, Thomas T Samba4, Bankolay B Sesay4, Patrick Swaray4, Mohamed A Vandi4, Mohamed Alpha Bah5, Andrew A Mansaray5, Brian H Bird3, Andrew J Davison1.   

Abstract

The genome sequences of five strains of a mammarenavirus were assembled from metagenomic data from pygmy mice (Mus minutoides) captured in Sierra Leone. The nearest fully sequenced relatives of this virus, which was named Seli virus, are lymphocytic choriomeningitis virus, Lunk virus, and Ryukyu virus.

Entities:  

Year:  2022        PMID: 35389260      PMCID: PMC9119099          DOI: 10.1128/mra.00095-22

Source DB:  PubMed          Journal:  Microbiol Resour Announc        ISSN: 2576-098X


ANNOUNCEMENT

Members of genus Mammarenavirus in family Arenaviridae have genomes consisting of two single-stranded ambisense RNA segments (L and S) and most have rodents as hosts (1). Among a collection of rodents captured in Sierra Leone that had tested positive for the mammarenavirus Lassa virus (LASV), we used visual recognition, DNA barcoding of the MT-CO1 gene (2), and sequencing of the MT-CO1 and MT-CYB genes to identify five apparently healthy animals from Koinadugu District as pygmy mice (Mus minutoides), in which LASV has been reported occasionally (3). An additional 18 pygmy mice in this collection were LASV negative. The five mice had tested positive for LASV in two assays for the L gene (encoding the RNA-directed RNA polymerase) but not in an assay for the GPC gene (encoding the glycoprotein precursor) (Table 1, see footnotes b and c for assay details). We determined the genome sequences of the mammarenaviruses in these mice.
TABLE 1

Data on SELV in this study

AnimalLocationbqPCR LASV load by sample type (CT)
Size (nt) of:
G+C (%) content of:
NCBI SRA accession no. by sample type
NCBI GenBank accession no. of:
BLcOScUScBLdLeSeLSBLOSUSLS MT-CO1 f MT-CYB f
00094Sengbeh, Gbenikoro31.736.4ND27.87,173 (7,160)3,390 (3,377)4547 SRX13144664 SRX13144665 SRX13144671 OL364850 OL364851 OL415139 OL441084
00127Wara Wara Bafodia, Bafodia38.6NDND27.27,210 (7,171)3,389 (3,384)4447 SRX13144672 SRX13144673 SRX13144674 OL364852 OL364853 OL415140 OL441085
00207Diang, Badala27.531.737.621.77,180 (7,180)3,389 (3,384)4447 SRX13144675 SRX13144676 SRX13144677 OL364854 OL364855 OL415141 OL441086
00367Wara Wara Bafodia, Bafodia31.632.331.627.47,173 (7,173)3,389 (3,389)4547 SRX13144678 SRX13144666 SRX13144667 OL364856 OL364857 OL415142 OL441087
00414Diang, Badala28.628.935.721.57,181 (7,176)3,388 (3,363)4547 SRX13144668 SRX13144669 SRX13144670 OL364858 OL364859 OL415143 OL441088

NCBI BioProject no. PRJNA780418. C, cycle threshold; BL, blood sample; OS, oral swab sample; US, urogenital swab sample; ND, not detected; nt, nucleotide; L, L segment; S, S segment; SRA, sequence read archive; MT-CO1, mitochondrial gene encoding cytochrome c oxidase I; MT-CYB, mitochondrial gene encoding cytochrome b.

Chiefdom and site in Koinadugu District.

Using a method targeting the L gene (16).

Using a RealStar LASV RT-PCR 2.0 kit targeting the L gene (Altona Diagnostics). LASV was not detected in these samples using a RealStar LASV RT-PCR 2.0 kit targeting the GPC gene.

Including undetermined nucleotides at the segment termini. The size without these nucleotides is given in parentheses.

Sequence determined from the combined data set for each animal and by Illumina sequencing of OS DNA from each animal.

Data on SELV in this study NCBI BioProject no. PRJNA780418. C, cycle threshold; BL, blood sample; OS, oral swab sample; US, urogenital swab sample; ND, not detected; nt, nucleotide; L, L segment; S, S segment; SRA, sequence read archive; MT-CO1, mitochondrial gene encoding cytochrome c oxidase I; MT-CYB, mitochondrial gene encoding cytochrome b. Chiefdom and site in Koinadugu District. Using a method targeting the L gene (16). Using a RealStar LASV RT-PCR 2.0 kit targeting the L gene (Altona Diagnostics). LASV was not detected in these samples using a RealStar LASV RT-PCR 2.0 kit targeting the GPC gene. Including undetermined nucleotides at the segment termini. The size without these nucleotides is given in parentheses. Sequence determined from the combined data set for each animal and by Illumina sequencing of OS DNA from each animal. Blood, oral swab, and urogenital swab samples were obtained from each mouse. Nucleic acids were extracted using Applied Biosystems MagMAX CORE and pathogen kits with a KingFisher Duo Prime purification system and treated with Ambion DNase I. RNA was purified using the Beckman Coulter RNAClean XP kit and depleted of rRNA using an Illumina Ribo-Zero Plus kit. A metagenomic method (G-Meta) (4) was used to prepare cDNA and sequencing libraries, which were analyzed on an Illumina NextSeq 550 instrument using a high-output 300-cycle cartridge. Bioinformatic tools were used with default parameters unless specified otherwise. The genome sequences were determined using a pipeline built around SPAdes v3.13 (5) and DIAMOND v2.0.9 (6) (https://github.com/mvvucak/LASV_Sample_Classification/). The 15 data sets (each consisting of 5,076,546 to 34,278,974 paired-end reads) were quality filtered using Trim Galore v0.4.0 (https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/). The three data sets for each mouse were combined and assembled de novo, and contigs related to mammarenavirus sequences were identified. The contigs were joined into genome sequences manually by iteratively incorporating reads sharing short sequences near their ends. Individual and combined data sets were then mapped to the genome sequences using Bowtie 2 v2.3.1 (7) with the ‐‐local parameter. The alignments were visualized using Tablet v1.21.02.08 (8), and final adjustments to the consensus sequences were made. Published information on other mammarenaviruses was used to locate the segment ends when possible and to annotate the genome sequences. The analysis indicated that a single virus, in some instances with a few single-nucleotide variants, was present in each mouse. Phylogenetic analyses showed that these viruses represent strains of a mammarenavirus, which we have named Seli virus (SELV) after a river in Sierra Leone (Fig. 1). Thus, among fully sequenced mammarenaviruses, SELV is most closely related to Lunk virus, which was isolated from a pygmy mouse in Zambia (9); Ryukyu virus, which was isolated from a Ryukyu mouse (Mus caroli) in China (10); and lymphocytic choriomeningitis virus, which is widespread in house mice (Mus musculus) (1) (Fig. 1A and B). SELV is sufficiently distinct to meet the formal criteria for assignment to a new viral species (1), to which additional phylogenetic analyses based on short sequences suggest that Kodoko virus, from pygmy mice in Guinea (11), may also belong (Fig. 1C and D). Further mapping experiments using Bowtie 2 as described above showed that reads originating from recognized LASV clades (12) were completely absent from all 15 data sets. This finding suggests that the L gene assays used are capable of detecting SELV as well as LASV.
FIG 1

Phylogenetic analyses of SELV. The complete amino acid sequences of the L (A) and the NP (B) proteins of Old World mammarenaviruses and partial nucleotide sequences of the L (C) and the NP (D) genes of SELV and its closer relatives were aligned using MAFFT v7.475 (13). The alignments were inspected visually in JalView v2.11 (14). Maximum likelihood phylogenies for each alignment were reconstructed using IQ-Tree v2.1.3 (15) by instituting the best fitting model selected by ModelFinder according to the Bayesian information criterion. The trees in A and B were rooted using a New World mammarenavirus, Junín virus (not shown; GenBank accession no. AY358022 and AY358023, respectively). The trees in C and D were rooted using Lassa virus (not shown; GenBank accession no. U73034 and J04324, respectively). Branch support was assessed using 1,000 bootstrap replicates, with values above 75% shown at the nodes and values of 100% indicated by asterisks. The viruses are denoted by GenBank accession no. and name, and SELV strains are indicated by bold font. The scale bars represent substitutions per site.

Phylogenetic analyses of SELV. The complete amino acid sequences of the L (A) and the NP (B) proteins of Old World mammarenaviruses and partial nucleotide sequences of the L (C) and the NP (D) genes of SELV and its closer relatives were aligned using MAFFT v7.475 (13). The alignments were inspected visually in JalView v2.11 (14). Maximum likelihood phylogenies for each alignment were reconstructed using IQ-Tree v2.1.3 (15) by instituting the best fitting model selected by ModelFinder according to the Bayesian information criterion. The trees in A and B were rooted using a New World mammarenavirus, Junín virus (not shown; GenBank accession no. AY358022 and AY358023, respectively). The trees in C and D were rooted using Lassa virus (not shown; GenBank accession no. U73034 and J04324, respectively). Branch support was assessed using 1,000 bootstrap replicates, with values above 75% shown at the nodes and values of 100% indicated by asterisks. The viruses are denoted by GenBank accession no. and name, and SELV strains are indicated by bold font. The scale bars represent substitutions per site. The work was conducted in partnership with the Government of Sierra Leone Ministry of Agriculture and Forestry representatives working under a ministry-approved wildlife specimen collection permit, and with further approvals from the University of California, Davis Institutional Animal Care and Use Committee.

Data availability.

Sequence information and accession numbers are provided in Table 1.
  16 in total

1.  Using Tablet for visual exploration of second-generation sequencing data.

Authors:  Iain Milne; Gordon Stephen; Micha Bayer; Peter J A Cock; Leighton Pritchard; Linda Cardle; Paul D Shaw; David Marshall
Journal:  Brief Bioinform       Date:  2012-03-24       Impact factor: 11.622

2.  Fast and sensitive protein alignment using DIAMOND.

Authors:  Benjamin Buchfink; Chao Xie; Daniel H Huson
Journal:  Nat Methods       Date:  2014-11-17       Impact factor: 28.547

3.  ICTV Virus Taxonomy Profile: Arenaviridae.

Authors:  Sheli R Radoshitzky; Michael J Buchmeier; Rémi N Charrel; J Christopher S Clegg; Jean-Paul J Gonzalez; Stephan Günther; Jussi Hepojoki; Jens H Kuhn; Igor S Lukashevich; Víctor Romanowski; Maria S Salvato; Manuela Sironi; Mark D Stenglein; Juan Carlos de la Torre
Journal:  J Gen Virol       Date:  2019-06-13       Impact factor: 3.891

4.  Molecular surveillance and phylogenetic analysis of Old World arenaviruses in Zambia.

Authors:  Akihiro Ishii; Yuka Thomas; Ladslav Moonga; Ichiro Nakamura; Aiko Ohnuma; Bernard M Hang'ombe; Ayato Takada; Aaron S Mweene; Hirofumi Sawa
Journal:  J Gen Virol       Date:  2012-07-18       Impact factor: 3.891

5.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

6.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

7.  Identification of mosquito bloodmeals using mitochondrial cytochrome oxidase subunit I and cytochrome b gene sequences.

Authors:  J S Townzen; A V Z Brower; D D Judd
Journal:  Med Vet Entomol       Date:  2008-12       Impact factor: 2.739

8.  Comparative analysis of rodent and small mammal viromes to better understand the wildlife origin of emerging infectious diseases.

Authors:  Zhiqiang Wu; Liang Lu; Jiang Du; Li Yang; Xianwen Ren; Bo Liu; Jinyong Jiang; Jian Yang; Jie Dong; Lilian Sun; Yafang Zhu; Yuhui Li; Dandan Zheng; Chi Zhang; Haoxiang Su; Yuting Zheng; Hongning Zhou; Guangjian Zhu; Hongying Li; Aleksei Chmura; Fan Yang; Peter Daszak; Jianwei Wang; Qiyong Liu; Qi Jin
Journal:  Microbiome       Date:  2018-10-03       Impact factor: 14.650

9.  Comparison of Next-Generation Sequencing Technologies for Comprehensive Assessment of Full-Length Hepatitis C Viral Genomes.

Authors:  Emma Thomson; Camilla L C Ip; Anjna Badhan; Mette T Christiansen; Walt Adamson; M Azim Ansari; David Bibby; Judith Breuer; Anthony Brown; Rory Bowden; Josie Bryant; David Bonsall; Ana Da Silva Filipe; Chris Hinds; Emma Hudson; Paul Klenerman; Kieren Lythgow; Jean L Mbisa; John McLauchlan; Richard Myers; Paolo Piazza; Sunando Roy; Amy Trebes; Vattipally B Sreenu; Jeroen Witteveldt; Eleanor Barnes; Peter Simmonds
Journal:  J Clin Microbiol       Date:  2016-07-06       Impact factor: 11.677

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.