Literature DB >> 17913743

CoVDB: a comprehensive database for comparative analysis of coronavirus genes and genomes.

Yi Huang¹, Susanna K P Lau, Patrick C Y Woo, Kwok-Yung Yuen.

Abstract

The recent SARS epidemic has boosted interest in the discovery of novel human and animal coronaviruses. By July 2007, more than 3000 coronavirus sequence records, including 264 complete genomes, are available in GenBank. The number of coronavirus species with complete genomes available has increased from 9 in 2003 to 25 in 2007, of which six, including coronavirus HKU1, bat SARS coronavirus, group 1 bat coronavirus HKU2, groups 2c and 2d coronaviruses, were sequenced by our laboratory. To overcome the problems we encountered in the existing databases during comparative sequence analysis, we built a comprehensive database, CoVDB (http://covdb.microbiology.hku.hk), of annotated coronavirus genes and genomes. CoVDB provides a convenient platform for rapid and accurate batch sequence retrieval, the cornerstone and bottleneck for comparative gene or genome analysis. Sequences can be directly downloaded from the website in FASTA format. CoVDB also provides detailed annotation of all coronavirus sequences using a standardized nomenclature system, and overcomes the problems of duplicated and identical sequences in other databases. For complete genomes, a single representative sequence for each species is available for comparative analysis such as phylogenetic studies. With the annotated sequences in CoVDB, more specific blast search results can be generated for efficient downstream analysis.

Entities: Disease Gene Species

Mesh：

Substances：
Viral Proteins

Year: 2007 PMID： 17913743 PMCID： PMC2238867 DOI： 10.1093/nar/gkm754

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

Coronaviruses are found in a wide variety of animals and are associated with respiratory, enteric, hepatic and neurological diseases of varying severity. Based on genotypic and serological characterization, coronaviruses were divided into three distinct groups (1–3). As a result of the unique mechanism of viral replication, coronaviruses have a high frequency of recombination (2,4). The recent severe acute respiratory syndrome (SARS) epidemic, the discovery of SARS coronavirus (SARS-CoV) and identification of SARS-CoV-like viruses from Himalayan palm civets and a raccoon dog from wild live markets in China have led to a boost in interest on discovery of novel coronaviruses in both humans and animals (5–9) (Figure 1). For human coronaviruses, a novel group 1 human coronavirus, human coronavirus NL63 (HCoV-NL63) was reported in 2004 (10,11), while we described the discovery, complete genome sequence and genetic diversity of a novel group 2 human coronavirus, coronavirus HKU1 (CoV-HKU1) in 2005 (4,12–14). As for animal coronaviruses, six group 1 (15–17), four group 2, including bat SARS-CoV and two new subgroups of group 2 coronaviruses (6,8,18,19), and 11 group 3 (20–23) coronaviruses have recently been described.

Figure 1.

Number of coronavirus sequences in GenBank from 1984 to 2006.

Number of coronavirus sequences in GenBank from 1984 to 2006. By July 2007, more than 3000 coronavirus sequence records, including a total of 264 complete genomes, are available in GenBank (24). Among the 25 coronavirus species with complete genome sequence available, six were sequenced by our group, including CoV-HKU1 and bat SARS-CoV (13,16,18,19). Furthermore, we defined two novel subgroups of group 2 coronavirus (18). During the process of batch sequence retrieval for comparative genome analysis of the coronavirus genomes that we sequenced, we encountered several major problems about the coronavirus sequences in GenBank as well as other coronavirus databases (Coronaviridae Bioinformatics Resource, http://athena.bioc.uvic.ca/database.php?db=coronaviridae; PATRIC http://patric.vbi.vt.edu) (25). First, in GenBank, the non-structural proteins in the polyprotein encoded by orf1ab were not annotated. Second, in all databases, for the non-structural proteins encoded by ORFs downstream to orf1ab, the annotations are often confusing because they are not annotated using a standardized system. Third, multiple accession numbers are often present for reference sequences (26). These problems often lead to confusion when sequence retrieval is performed. Fourth, coronaviruses, especially SARS-CoV, amplified from different specimens may contain the same genome or gene sequences. These sequences usually lead to redundant work when they are analyzed. In view of these problems, we started to develop our own database for coronavirus gene and genome sequences in 2005. In this database, CoVDB, we sought to create a user-friendly platform for efficient batch sequence retrieval, which is crucial for comparative genome analysis. In this article, we describe this comprehensive database of annotated coronavirus genes and genomes, which provides a central source of information about coronaviruses. To further increase the usefulness of CoVDB, commonly used bioinformatics tools were also included for analysis of the sequence data.

MATERIALS AND METHODS

Database description

Sequence data

CoVDB is a web-based coronavirus database. Data of CoVDB is stored and managed by MySQL database management system. By July 2007, CoVDB contains 3982 coronavirus sequences and one torovirus genome sequence. Two hundred and sixty-four of them are complete genomes and the rest are partial genomes or genes. All data were retrieved from GenBank using modules of bioperl. We annotated sequences without gene information or non-structural protein boundary and labeled the 5′ and 3′ untranslated regions (UTRs) of the genomes. By July 2007, CoVDB contains 12 344 genes and UTRs.

Information on coronavirus genome characteristics

In addition to the two sequence retrieval pages, CoVDB collects information on coronavirus sequence characteristics, including genome organization, a brief description on each complete coronavirus genome, GC content, polyprotein cleavage sites, transcription regulatory sequences, acidic tandem repeat sequences and known RNA structures. These pieces of information can be accessed by clicking ‘Genome’ in the top menu bar of CoVDB. In the ‘Tools’ page, blast similarity search (27) against annotated coronavirus sequences in CoVDB can be performed and other commonly used tools are also provided.

Functionality of the database

Batch sequence retrieval

The main goal for setting up CoVDB is to provide a convenient and efficient platform for retrieving batches of coronavirus gene sequences. The interfaces of the database are simple and user friendly. All genes and genomes contain links to GenBank and/or pubmed. CoVDB contains two main pages for sequence retrieval. From the homepage, one can enter the first main page for retrieval of complete genomes and their genes by clicking ‘CoVDB’ (Figure 2a). From this page, users can obtain genes from specific coronavirus species by selecting the corresponding check boxes. We defined one representative genome from each species as the ‘Type strain’. Most of the time, this ‘Type strain’ is the one assigned as the reference sequence in GenBank. By choosing the ‘Type strain only’ option, users can obtain one gene sequence per species and construct phylogenetic tree or perform other comparisons. An example of retrieving complete genome or a specific gene of complete genome of selected species is shown in Figure 2b and c.

Figure 2.

Screenshots of CoVDB complete genome retrieval pages. (a) Specific gene can be retrieved using the pull-down list at the left lower corner. The number in brackets indicates the number of complete genomes for that coronavirus. (b) Example of showing genomes of selected species (some group 2a coronaviruses and SARS-CoV-related coronaviruses). Default is to show the ‘Type strain’ for each species only. The columns NCBIacc and PMID link to GenBank and pubmed, respectively. (c) Example of showing S gene of selected species by choosing S in the pull-down list. For genes downstream to orf1ab, sequences upstream to the initiation codons can also be retrieved from this result page. This function is particularly useful for the detection of transcription regulatory sequences. From the page for retrieval of complete genomes and their genes, one can enter the second main page for retrieval of all complete and/or incomplete genes of a coronavirus (Figure 3a) by clicking ‘From all groups of genes’. In this page, all the gene sequences are grouped vertically according to which coronavirus group and subgroup they belong to, and horizontally by the names of the genes. The option ‘Exclude partial CDS’ can be used if only complete genes are required. An example of retrieving all the sequence of a particular gene for a group of coronavirus is shown in Figure 3b. If the translated sequence of a selected gene has more than one stop codon which is probably due to sequencing error, the number in the ‘Length’ column of this gene will be marked in red.

Figure 3.

Screenshots of all gene retrieval pages. (a) Gene sequences are grouped vertically according to which coronavirus group and subgroup they belong to, and horizontally by the name of the genes. The numbers next to each checkbox indicates the number of that gene in CoVDB. The option ‘Exclude partial CDS’ can be used if only complete genes are required. (b) Example of showing the 15 sequences of nsp13 in group 3 coronaviruses. The first column is CoVDB gene id. In the Uniq column, ‘Uniq’ will be shown if there is no other identical sequence in CoVDB. Otherwise, gene id of the sequences identical to it will be shown.

Polyprotein annotation

In all coronavirus genomes, orf1ab occupies two-thirds of the genome and it is translated as a polyprotein. This polyprotein is post-translationally cleaved by 3C-like protease (3CLpro) and papain-like protease (PLpro) into 15–16 non-structural proteins. Some of the non-structural proteins, such as RNA-dependent RNA polymerase, helicase, 3CLpro and PLpro are essential for replication or virulence of the coronavirus, although the functions of others are still unclear. Due to the essentiality of the non-structural proteins, these sequences are often used for evolutionary analysis, primer design, etc. However, except for the reference sequences, detailed cleavage site information is not provided for the non-structural proteins in other sequences in GenBank. Since it has been shown that 3CLpro and PLpro of coronavirus cleave at conserved specific amino acids, the putative cleavage sites of the 15–16 non-structural proteins can be predicted by multiple sequence alignment. Using these pieces of information, we have annotated these non-structural proteins in all the coronavirus sequences for easy retrieval in CoVDB.

Protein/gene name unification

By convention, all non-structural proteins in the polyprotein encoded by orf1ab are named as ‘nsp’, with each protein numbered consecutively starting from the 5′ end (nsp1–nsp16). The structural proteins after the polyprotein are hemagglutinin esterase (HE, in group 2a coronaviruses), spike glycoprotein (S), envelope protein (E), membrane protein (M) and nucleocapsid protein (N). However, there is no unified naming system for the non-structural proteins encoded by ORFs downstream to orf1ab. This lack of a unified system greatly reduces the stability and accuracy of ortholog retrieval. In CoVDB, with the aim of facilitating gene retrieval, we tried to unify the naming of these non-structural proteins from different groups of coronaviruses. On the other hand, we have also tried to avoid radical changes in the names that may lead to confusion. In CoVDB, these non-structural proteins are named as NS2a, NS3x, NS4x, NS5x and NS7x (x = a, b, c,…). NS2a denotes the ORF between orf1ab and HE of group 2a coronaviruses. NS3x denotes the ORFs between S and E of groups 1, 2c, 2d and 3 coronaviruses. In most of these coronaviruses, there are two NS3x, named NS3a and NS3b. However, in group 1 coronaviruses, the genomes of some members (e.g. HCoV-NL63, PEDV) contain only one ORF between S and E. When we compared their putative amino acid sequences to the corresponding ones in other group 1 coronavirus genomes using BLAST, as well as searching for conserved domains using motifscan, results showed that the putative proteins encoded by these ORFs belonged to a protein family in Pfam originally assigned as ‘Corona_NS3b’ (accession number PF03053). Therefore, we named these ORFs as NS3b. NS4x denotes the ORFs between S and E of group 2a coronaviruses. NS5x denotes the ORFs between M and N of group 3 coronaviruses. One exception is NS5a of group 2a coronaviruses. Traditionally, this name denotes an ORF upstream of E in group 2a coronaviruses. Therefore, we have kept this name for that ORF in CoVDB. NS7x denotes the ORFs downstream of N gene. It is important to note that due to variations in genome organizations among different groups of coronaviruses (Table 1), NS genes with the same name in different coronavirus groups may not be orthologs of each other. The complete genome gene search page of CoVDB contains a link to a Gene synonyms page, which includes a list of synonymous names of the various genes in the coronavirus genomes.

Table 1.

Genome organization of different groups of coronavirus

Group	Organizations
1	5′UTR-nsp1-16-S-NS3x-E-M-N-(NS7x)-3′UTR
2a	5′UTR-nsp1-16-(NS2a)-HE-S-(NS4x)-NS5a-E-M-N-3′UTR
2b	5′UTR-nsp1-16-S-sars3x-E-M-sars6-sars7x-sars8x-N-3′UTR
2c	5′UTR-nsp1-16-S-NS3x-E-M-N-3′UTR
2d	5′UTR-nsp1-16-S-NS3x-E-M-N-(NS7x)-3′UTR
3	5′UTR-nsp1-16-S-NS3x-E-M-NS5x-N-(NS7x)-3′UTR

Genome organization of different groups of coronavirus

Identical sequence labeling

Sequence redundancy is another problem of coronavirus sequences in public nucleotide databases. Different strains of the same species from samples collected in different locations or at different times may possess completely or partially identical sequences. These sequences, though containing important epidemiological information, increase the workload during sequence analysis. In CoVDB, we compared all nucleotide sequences and labeled the identical ones to mitigate this problem. Users can choose to show or not to show strains with identical sequences by clicking on the check boxes to the left of the page (Figure 3b).

Blast similarity search

During the process of coronavirus gene sequences analysis, we encountered a major problem when coronavirus gene sequences, especially those of orf1ab, were used for blast search against GenBank or any other coronavirus databases. When part of the orf1ab gene (e.g. nsp5) is used as the query sequence, instead of getting the gene for the specific non-structural protein that the query sequence is homologous to, the results will only show that the hits are within orf1ab, or in some cases, shown to be within the entire coronavirus genome. Much time will be needed for further analyzing the results manually in order to locate the positions of the cleavage sites of the corresponding genes for the non-structural proteins, making it very inefficient for further downstream work. This problem has been overcome by the annotated sequences in CoVDB. The blast search page of CoVDB is an interface for facilitating coronavirus similarity search. The background support program, blastall, is from the NCBI Blast package. The blast search page can be entered by clicking ‘Tools’ in the top menu bar in any page of CoVDB. Since all sequences in CoVDB are annotated, they can be grouped into different datasets for blast search. Users can choose one of the three nucleotide and two protein sequence datasets as the database for comparison (Figure 4). The three nucleotide sequence datasets are: CoV genes (nsp + genes after 1ab), CoV genes (1ab + genes after 1ab) and CoV GenBank strains, which are the original sequences retrieved from GenBank. The two protein sequence datasets are the translated sequences of the first two nucleotide datasets: CoV proteins (nsp + aa after 1ab) and CoV proteins (1ab + aa after 1ab).

Figure 4.

Screenshot of blast similarity search page. Five datasets can be chosen as the database for comparison.

MyBlast

‘MyBlast’ employs the same blast program as the Blast page mentioned above. However, instead of selecting a predefined nucleotide or amino acid sequence database, multiple sequences can be pasted into the second sequence input box to generate a temporary sequence database. One or more query sequences can be pasted into the first sequence input box for blastn or blastp search against the temporary sequence database.

ORF finder for coronavirus

This ORF finder is specifically designed for coronavirus genome analysis. The result page shows the positions and lengths of each putative ORF and the position of the putative ribosomal frameshift site for translation of orf1ab. The nucleotide or amino acid sequences of the ORFs can be shown by selecting the corresponding check boxes. To facilitate genome comparison and annotation, the most closely related coronavirus, which had been annotated in CoVDB, can be chosen from a pull-down list for comparison using blast search. This function is particularly useful for determining the range of nsp in orf1ab.

DISCUSSION

Rapid and accurate batch sequence retrieval is both the cornerstone and bottleneck for comparative gene or genome analysis. During the process of complete genome sequencing and comparative analysis of the various novel human and animal coronavirus genomes in the past 2 years, we have developed a comprehensive database, CoVDB, of annotated coronavirus genes and genomes, which offers efficient batch sequence retrieval and analysis. As shown by our experience in using CoVDB for comparative genome analysis of novel coronaviruses we have discovered (4,13,16,18,19), we find that CoVDB is more rapid and efficient than other existing coronavirus databases for batch sequence retrieval for the following reasons. First, we have performed annotation on all non-structural proteins in the polyprotein encoded by orf1ab of every single sequence. Second, annotation was performed for the non-structural proteins encoded by ORFs downstream to orf1ab using a standardized system, with some exceptions given to some names that have been used for a long time so as to minimize confusion. Third, all sequences with identical nucleotide sequences were labeled where one can choose to show or not to show strains with identical sequences. Fourth, CoVDB contains not only complete coronavirus genome sequences, but also incomplete genomes and their genes. Some genes of coronaviruses, such as pol, spike and nucleocapsid are sequenced much more frequently than others because they are either most conserved or least conserved. These gene sequences are particularly important for evolutionary analysis, single nucleotide polymorphism studies and design of primers for RT-PCR or quantitative RT-PCR amplification.

Availability

CoVDB is constructed by the Department of Microbiology, the University of Hong Kong. It is available at no charge at http://covdb.microbiology.hku.hk.

27 in total

1. Characterization of a novel coronavirus associated with severe acute respiratory syndrome.

Authors: Paul A Rota; M Steven Oberste; Stephan S Monroe; W Allan Nix; Ray Campagnoli; Joseph P Icenogle; Silvia Peñaranda; Bettina Bankamp; Kaija Maher; Min-Hsin Chen; Suxiong Tong; Azaibi Tamin; Luis Lowe; Michael Frace; Joseph L DeRisi; Qi Chen; David Wang; Dean D Erdman; Teresa C T Peret; Cara Burns; Thomas G Ksiazek; Pierre E Rollin; Anthony Sanchez; Stephanie Liffick; Brian Holloway; Josef Limor; Karen McCaustland; Melissa Olsen-Rasmussen; Ron Fouchier; Stephan Günther; Albert D M E Osterhaus; Christian Drosten; Mark A Pallansch; Larry J Anderson; William J Bellini
Journal: Science Date: 2003-05-01 Impact factor: 47.728

2. Coronaviruses from pheasants (Phasianus colchicus) are genetically closely related to coronaviruses of domestic fowl (infectious bronchitis virus) and turkeys.

Authors: D Cavanagh; K Mawditt; D De B Welchman; P Britton; R E Gough
Journal: Avian Pathol Date: 2002-02 Impact factor: 3.378

3. The Genome sequence of the SARS-associated coronavirus.

Authors: Marco A Marra; Steven J M Jones; Caroline R Astell; Robert A Holt; Angela Brooks-Wilson; Yaron S N Butterfield; Jaswinder Khattra; Jennifer K Asano; Sarah A Barber; Susanna Y Chan; Alison Cloutier; Shaun M Coughlin; Doug Freeman; Noreen Girn; Obi L Griffith; Stephen R Leach; Michael Mayo; Helen McDonald; Stephen B Montgomery; Pawan K Pandoh; Anca S Petrescu; A Gordon Robertson; Jacqueline E Schein; Asim Siddiqui; Duane E Smailus; Jeff M Stott; George S Yang; Francis Plummer; Anton Andonov; Harvey Artsob; Nathalie Bastien; Kathy Bernard; Timothy F Booth; Donnie Bowness; Martin Czub; Michael Drebot; Lisa Fernando; Ramon Flick; Michael Garbutt; Michael Gray; Allen Grolla; Steven Jones; Heinz Feldmann; Adrienne Meyers; Amin Kabani; Yan Li; Susan Normand; Ute Stroher; Graham A Tipples; Shaun Tyler; Robert Vogrig; Diane Ward; Brynn Watson; Robert C Brunham; Mel Krajden; Martin Petric; Danuta M Skowronski; Chris Upton; Rachel L Roper
Journal: Science Date: 2003-05-01 Impact factor: 47.728

4. A previously undescribed coronavirus associated with respiratory disease in humans.

Authors: Ron A M Fouchier; Nico G Hartwig; Theo M Bestebroer; Berend Niemeyer; Jan C de Jong; James H Simon; Albert D M E Osterhaus
Journal: Proc Natl Acad Sci U S A Date: 2004-04-08 Impact factor: 11.205

5. Isolation and characterization of viruses related to the SARS coronavirus from animals in southern China.

Authors: Y Guan; B J Zheng; Y Q He; X L Liu; Z X Zhuang; C L Cheung; S W Luo; P H Li; L J Zhang; Y J Guan; K M Butt; K L Wong; K W Chan; W Lim; K F Shortridge; K Y Yuen; J S M Peiris; L L M Poon
Journal: Science Date: 2003-09-04 Impact factor: 47.728

Review 6. Molecular biology of severe acute respiratory syndrome coronavirus.

Authors: John Ziebuhr
Journal: Curr Opin Microbiol Date: 2004-08 Impact factor: 7.934

7. Complete genome sequence of bat coronavirus HKU2 from Chinese horseshoe bats revealed a much smaller spike gene with a different evolutionary lineage from the rest of the genome.

Authors: Susanna K P Lau; Patrick C Y Woo; Kenneth S M Li; Yi Huang; Ming Wang; Carol S F Lam; Huifang Xu; Rongtong Guo; Kwok-Hung Chan; Bo-Jian Zheng; Kwok-Yung Yuen
Journal: Virology Date: 2007-07-06 Impact factor: 3.616

8. Relative rates of non-pneumonic SARS coronavirus infection and SARS coronavirus pneumonia.

Authors: Patrick C Y Woo; Susanna K P Lau; Hoi-wah Tsoi; Kwok-hung Chan; Beatrice H L Wong; Xiao-yan Che; Victoria K P Tam; Sidney C F Tam; Vincent C C Cheng; Ivan F N Hung; Samson S Y Wong; Bo-jian Zheng; Yi Guan; Kwok-yung Yuen
Journal: Lancet Date: 2004-03-13 Impact factor: 79.321

9. Coronavirus infection of spotted hyenas in the Serengeti ecosystem.

Authors: Marion L East; Karin Moestl; Viviane Benetka; Christian Pitra; Oliver P Höner; Bettina Wachter; Heribert Hofer
Journal: Vet Microbiol Date: 2004-08-19 Impact factor: 3.293

10. Identification of a new human coronavirus.

Authors: Lia van der Hoek; Krzysztof Pyrc; Maarten F Jebbink; Wilma Vermeulen-Oost; Ron J M Berkhout; Katja C Wolthers; Pauline M E Wertheim-van Dillen; Jos Kaandorp; Joke Spaargaren; Ben Berkhout
Journal: Nat Med Date: 2004-03-21 Impact factor: 53.440

25 in total

1. Discovery of seven novel Mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus.

Authors: Patrick C Y Woo; Susanna K P Lau; Carol S F Lam; Candy C Y Lau; Alan K L Tsang; John H N Lau; Ru Bai; Jade L L Teng; Chris C C Tsang; Ming Wang; Bo-Jian Zheng; Kwok-Hung Chan; Kwok-Yung Yuen
Journal: J Virol Date: 2012-01-25 Impact factor: 5.103

2. Isolation and characterization of a novel Betacoronavirus subgroup A coronavirus, rabbit coronavirus HKU14, from domestic rabbits.

Authors: Susanna K P Lau; Patrick C Y Woo; Cyril C Y Yip; Rachel Y Y Fan; Yi Huang; Ming Wang; Rongtong Guo; Carol S F Lam; Alan K L Tsang; Kenneth K Y Lai; Kwok-Hung Chan; Xiao-Yan Che; Bo-Jian Zheng; Kwok-Yung Yuen
Journal: J Virol Date: 2012-03-07 Impact factor: 5.103

Review 3. Middle East respiratory syndrome coronavirus: another zoonotic betacoronavirus causing SARS-like disease.

Authors: Jasper F W Chan; Susanna K P Lau; Kelvin K W To; Vincent C C Cheng; Patrick C Y Woo; Kwok-Yung Yuen
Journal: Clin Microbiol Rev Date: 2015-04 Impact factor: 26.132

4. Discovery of a novel coronavirus, China Rattus coronavirus HKU24, from Norway rats supports the murine origin of Betacoronavirus 1 and has implications for the ancestor of Betacoronavirus lineage A.

Authors: Susanna K P Lau; Patrick C Y Woo; Kenneth S M Li; Alan K L Tsang; Rachel Y Y Fan; Hayes K H Luk; Jian-Piao Cai; Kwok-Hung Chan; Bo-Jian Zheng; Ming Wang; Kwok-Yung Yuen
Journal: J Virol Date: 2014-12-31 Impact factor: 5.103

Review 5. Unraveling the web of viroinformatics: computational tools and databases in virus research.

Authors: Deepak Sharma; Pragya Priyadarshini; Sudhanshu Vrati
Journal: J Virol Date: 2014-11-26 Impact factor: 5.103

6. Molecular epidemiology of human coronavirus OC43 reveals evolution of different genotypes over time and recent emergence of a novel genotype due to natural recombination.

Authors: Susanna K P Lau; Paul Lee; Alan K L Tsang; Cyril C Y Yip; Herman Tse; Rodney A Lee; Lok-Yee So; Yu-Lung Lau; Kwok-Hung Chan; Patrick C Y Woo; Kwok-Yung Yuen
Journal: J Virol Date: 2011-08-17 Impact factor: 5.103

7. ORION-VIRCAT: a tool for mapping ICTV and NCBI taxonomies.

Authors: Willy Valdivia-Granda; Francis Larson
Journal: Database (Oxford) Date: 2009-10-12 Impact factor: 3.451

8. Ecoepidemiology and complete genome comparison of different strains of severe acute respiratory syndrome-related Rhinolophus bat coronavirus in China reveal bats as a reservoir for acute, self-limiting infection that allows recombination events.

Authors: Susanna K P Lau; Kenneth S M Li; Yi Huang; Chung-Tong Shek; Herman Tse; Ming Wang; Garnet K Y Choi; Huifang Xu; Carol S F Lam; Rongtong Guo; Kwok-Hung Chan; Bo-Jian Zheng; Patrick C Y Woo; Kwok-Yung Yuen
Journal: J Virol Date: 2010-01-13 Impact factor: 5.103

9. Comparative analysis of complete genome sequences of three avian coronaviruses reveals a novel group 3c coronavirus.

Authors: Patrick C Y Woo; Susanna K P Lau; Carol S F Lam; Kenneth K Y Lai; Yi Huang; Paul Lee; Geraldine S M Luk; Kitman C Dyrting; Kwok-Hung Chan; Kwok-Yung Yuen
Journal: J Virol Date: 2008-10-29 Impact factor: 5.103

10. Complete genomic sequence analysis of infectious bronchitis virus Ark DPI strain and its evolution by recombination.

Authors: Arun Ammayappan; Chitra Upadhyay; Jack Gelb; Vikram N Vakharia
Journal: Virol J Date: 2008-12-22 Impact factor: 4.099