| Literature DB >> 23964903 |
Yuki Iwasaki1, Takashi Abe, Yoshiko Wada, Kennosuke Wada, Toshimichi Ikemura.
Abstract
BACKGROUND: With the remarkable increase of microbial and viral sequence data obtained from high-throughput DNA sequencers, novel tools are needed for comprehensive analysis of the big sequence data. We have developed "Batch-Learning Self-Organizing Map (BLSOM)" which can characterize very many, even millions of, genomic sequences on one plane. Influenza virus is one of zoonotic viruses and shows clear host tropism. Important issues for bioinformatics studies of influenza viruses are prediction of genomic sequence changes in the near future and surveillance of potentially hazardous strains.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23964903 PMCID: PMC3765179 DOI: 10.1186/1471-2334-13-386
Source DB: PubMed Journal: BMC Infect Dis ISSN: 1471-2334 Impact factor: 3.090
Figure 1Mononucleotide BLSOM for influenza A and B genome sequences. (A) Mono. Mono-BLSOM was constructed for 12,067 influenza A and 328 B strains. Lattice points containing sequences from strains isolated from more than one host were indicated in black, and those containing sequences from one host were indicated in a color representing the host: avian (3,752 strains) colored in red, human (7,434 strains) in green, swine (783 strains) in blue, equine (95 strains), bat (3 strains) in grey, and influenza B virus (328 strains) in light blue. (B) Hum Sub. On the Mono-BLSOM, strains of each human subtype were specified in a color representing the subtype (H1N1 colored in light blue, H1N1/09 in dark green, H2N2 in geen, H3N2 in blue, H5N1 in brown, H7 in pink, H9 in pink, and others in green), and strains isolated from other animal hosts were achromatic. (Ci-iv) Occurrences of each mononucleotide at individual lattice points were sorted and classified into 21 groups and differing levels were indicated with different colors presented below the figure: wine red (high), dark green (low) and achromatic (intermediate).
Figure 2Normalized Tetranucleotide BLSOM for all influenza A and B virus genome sequences.(A) NorTetra. Lattice points on NorTetra-BLSOM were colored as described in Figure 1A. Green dots and many black dots in the avian territory primarily corresponded to H5N1 strains isolated from humans. (B) Hum Sub. On the NorTetra-BLSOM, lattice points containing different human subtypes were differentially colored as described in Figure 1B. (Ci-v) Normalized occurrences of each tetranucleotide at individual lattice points were sorted and indicated, as described in Figure 1C. (D) Map locations of avian strains of special interests and human H7N9 strains. Avian strains listed in Table 1 were indicated by green dots and by a-f colored as presented in Table 1, the avian H5N1 strain isolated from turkey in Virginia was indicated by a blue dot specified by pink-colored h, and three human H7N9 strains recently isolated were indicated by blue dots specified by brown-colored i.
Avian strains with high scores
High diagnostic tetranucleotides: AAGU, ACUU, AUCA, AUGA, AUUA, AUUU, CUUU, GGCC, UCAU, UUAA, UUAU, UUCA, and UUUG.
Low diagnostic tetranucleotdes: ACGC, AUUG, CUCC, CUGA, GAGC, GAGG, GCAG, GGAC, UCUU, and UGUG.
The avian strains suspected to be directly transferred from humans or swines were indicated in italic. Other strains were indicated in bold.
Figure 3Retrospective time-series changes observed for human seasonal and H1N1/09 strains on Mono-BLSOM presented in Figure1A. (A) Human seasonal H1N1 and H3N2 strains isolated in the specified time period were indicated in the color representing the subtype (H1N1 and H3N2 colored in brown and blue, respectively), other human strains were left in green, and those from other hosts were achromatic. (B) H1N1/09 strains isolated in the specified time period were indicated in pink, and other human strains were left in green; and those from other hosts were achromatic. H1N1/09 strains isolated at a very early stage (March/09) were arrowed.
Figure 4Time-series changes of mono- and tetranucleotide frequencies (%). The average frequency of each mononucleotide and of one tetranucleotide GAGG among strains belonging to one human subtype isolated in a specific time period was shown with a colored bar specifying the subtype (H1N1 and H3N2 colored in light blue and blue, respectively). In the case of avian (red) and swine (grey) and of human H5N1 (brown) strains, the average frequency of each mononucleotide was calculated for all strains independently of isolated years.
Figure 5Mono- and tetranucleotide frequencies (%). The average frequency of each mononucleotide and of GAGG for all avian and avian H5N1 strains (red) and for human H5N1 strains (brown) was shown with a colored bar. The frequency for the avian H5N1 strain isolated from turkey in Virginia (VA/H5N1, red), the avian H3, H4 and H6 strains listed in Table 1 (blue) and three human H7N9 strains (pink) also was shown with a colored bar.