| Literature DB >> 19102764 |
Arun Ammayappan1, Chitra Upadhyay, Jack Gelb, Vikram N Vakharia.
Abstract
An infectious bronchitis virus Arkansas DPI (Ark DPI) virulent strain was sequenced, analyzed and compared with many different IBV strains and coronaviruses. The genome of Ark DPI consists of 27,620 nucleotides, excluding poly (A) tail, and comprises ten open reading frames. Comparative sequence analysis of Ark DPI with other IBV strains shows striking similarity to the Conn, Gray, JMK, and Ark 99, which were circulating during that time period. Furthermore, comparison of the Ark genome with other coronaviruses demonstrates a close relationship to turkey coronavirus. Among non-structural genes, the 5'untranslated region (UTR), 3C-like proteinase (3CLpro) and the polymerase (RdRp) sequences are 100% identical to the Gray strain. Among structural genes, S1 has 97% identity with Ark 99; S2 has 100% identity with JMK and 96% to Conn; 3b 99%, and 3C to N is 100% identical to Conn strain. Possible recombination sites were found at the intergenic region of spike gene, 3'end of S1 and 3a gene. Independent recombination events may have occurred in the entire genome of Ark DPI, involving four different IBV strains, suggesting that genomic RNA recombination may occur in any part of the genome at number of sites. Hence, we speculate that the Ark DPI strain originated from the Conn strain, but diverged and evolved independently by point mutations and recombination between field strains.Entities:
Mesh:
Substances:
Year: 2008 PMID: 19102764 PMCID: PMC2628353 DOI: 10.1186/1743-422X-5-157
Source DB: PubMed Journal: Virol J ISSN: 1743-422X Impact factor: 4.099
Figure 1Classical Genome Organization of IBV-Ark DPI. The genome of Ark DPI is 27,620 nt long, excluding poly (A) tract. Middle: ten genes and its ORFs. Ribosomal frameshift and position of transcriptional regulatory sequences (TRS) of each gene is indicated. Top: putative domains of ORF1a/1b polyprotein: nsp-non-structural protein, Ac-acidic domain, X-unknown domain X, PL1- papain like proteinase1, PL2-papain like proteinase 2; Y-unknown domain Y; HD-hydrophobic domain, 3CL-3C-like proteinase, G-Growth factor like protein, RdRp-RNA dependent RNA polymerase, Hel-helicase, ExoN-exoribonuclease, Ne-nidoviral uridylate-specific endoribonuclease, MT- 2'-O-ribose methyltransferase. Bottom: details of spike protein; SP-signal peptide, RRSRR/S- spike protein cleavage site between 544 and 545aa, TM-transmembrane domain of spike protein.
Percent (%) nucleotide identity of Ark DPI non-structural genes and ORF1ab, ORF2-6 and complete genome with other IBV strainsa, b, c
| IBV Strains | 5'UTR | PLpro | Mpro | RdRp | ORF1 | ORF2-6 | Complete genome |
| 87 | NA | NA | |||||
| A2 | 95 | 83 | 85 | 86 | 85 | 86 | |
| Beaudette | 85 | 93 | 93 | 91 | 91 | 91 | |
| BJ | 83 | 88 | 93 | 86 | 85 | 86 | |
| Cal99* | 94 | ||||||
| CK/CH/LSD/05I | 84 | 88 | 90 | 89 | 90 | 89 | |
| 87 | NA | NA | |||||
| CU-T2* | 94 | NA | 94 | NA | |||
| DE072 | 90 | 90 | NA | NA | NA | ||
| Florida | 87 | NA | NA | NA | |||
| GA98* | NA | NA | NA | ||||
| 87 | NA | NA | NA | ||||
| Jilin | NA | NA | NA | NA | NA | NA | |
| LX4 | 94 | 83 | 85 | 89 | 87 | 84 | 86 |
| M41 | 87 | 91 | 94 | 91 | 91 | 91 | |
| SAIBK | 92 | 85 | 90 | 90 | 90 | 86 | 89 |
a Sequences with > 95% identity are in bold letters
b NA-not analyzed
c Parental strains of Ark DPI are shown in bold letters and immediate derivative of Ark DPI is indicated by asterisk (*).
Percent (%) nucleotide identity of Ark DPI structural genes with other IBV strains a, b, c, d
| BV Strains | S1 | S2 | 3a | 3b | 3c | M | 5a | 5b | N | 3'UTR |
| 92 | 93 | |||||||||
| Beaudette | 81(79) | 95(94) | 91 | 84 | 88(83) | 91(93) | 85 | 93 | 93(95) | |
| BJ | 77(75) | 85(89) | 88 | 76 | 87(79) | 90(93) | NA | NA | 89(93) | 87 |
| Cal99* | 87(84) | 94(93) | 92 | 94(90) | 89 | |||||
| CK/CH/LSD/05I | 78(75) | 88(91) | 89 | 84 | (87) | 90 | ||||
| 81(77) | 92 | NA | ||||||||
| CU-T2* | 93(93) | 88 | 98 | 92(87) | 88(87) | |||||
| DE072 | 62(50) | 75(76) | NA | NA | NA | NA | NA | |||
| 83(80) | NA | NA | ||||||||
| GAV-92 | 94(92) | NA | NA | NA | NA | NA | NA | NA | NA | NA |
| H120 | 81(78) | NA | NA | NA | NA | 92(94) | NA | NA | 83 | |
| HK* | 81(78) | NA | ||||||||
| Holte | 83(80) | 95(95) | NA | NA | NA | NA | NA | NA | NA | NA |
| Ind/TN/92/03 | NA | NA | NA | NA | NA | NA | NA | NA | 92(94) | NA |
| IS/1366 | 78(75) | NA | NA | NA | NA | NA | NA | NA | 92(95) | NA |
| Jilin* | ||||||||||
| 84(82) | NA | NA | NA | NA | NA | NA | NA | |||
| KB8523 | 81(78) | 91(92) | NA | NA | NA | 93(95) | NA | NA | NA | |
| LX4 | 77(76) | 85(88) | 86 | 76 | 88(80) | 91(91) | 82 | 90 | 89(93) | NA |
| M41 | 81(78) | 95(94) | 91 | 85 | 88(83) | 91(95) | 90 | 94(95) | ||
| Qu16 | 84(81) | NA | NA | NA | NA | NA | NA | NA | NA | NA |
| SAIBK | 79(77) | 87(91) | 86 | 83 | 85(80) | 89(93) | 84 | 87(92) | 88 | |
| TW2296/95 | 79(77) | 86(90) | 86 | 85 | (83) | 91(92) | 82 | 89(92) | NA | |
| UK/2/91 | 78(76) | NA | NA | NA | NA | NA | NA | NA | NA | NA |
| Vic | 81(79) | 89(92) | 88 | 88 | 88(88) | 89(95) | 87 | 94 | 90(94) | NA |
a Sequences with > 95% identity are indicated in bold letters
b Amino acid sequences within the parenthesis
c NA-Not Analyzed
d Parental strains of Ark DPI are shown in bold letters and immediate derivative of Ark DPI is indicated by asterisk (*).
Percent (%) amino acid identity of Ark DPI replicase and structural proteins to other coronaviruses c, d
| Coronavirusesa | 3CLpro | S | E | M | N | Complete genomeb | |
| BatCoV | 40 | 22 | 11 | 29 | 25 | 46 | |
| BCoV | 41 | 21 | 13 | 30 | 24 | 47 | |
| ECoV | 41 | 22 | 13 | 30 | 25 | 47 | |
| FCoV | 45 | 22 | 16 | 23 | 23 | 48 | |
| HCoV 229E | 40 | 23 | 12 | 26 | 23 | 49 | |
| TGEV | 46 | 22 | 20 | 25 | 26 | 49 | |
| MHV A59 | 40 | 21 | 14 | 31 | 26 | 46 | |
| SARS CoV | 46 | 21 | 17 | 29 | 22 | 45 | |
| SW1 | 25 | 28 | 36 | 35 | 50 | ||
| 34 | |||||||
a BatCoV, Bat coronavirus; FCoV, feline coronavirus; HCoV, human coronavirus; BCoV, bovine coronavirus; MHV, mouse hepatitis virus; SARS-CoV, severe acute respiratory syndrome coronavirus; SW1, beluga whale coronavirus; TCoV, turkey coronavirus.
b Percent nucleotide identity of entire genome
c Sequences with > 50% identity are in bold letters
d Gene in bold letters (RdRp) is highly conserved; TCoV exhibits significant identity with IBV-Ark DPI (marked in bold letters).
Figure 2Phylogenetic tree analysis of complete Ark DPI genome sequence with other IBV strains. Phylogenetic tree analysis was conducted by neighbor-joining method using bootstrap analysis (100 replications). The scale at the bottom indicates the number of substitution events.
Figure 3Schematic presentation of the structural region of Ark DPI genome. Entire genome of Ark DPI was analyzed for its similarity with other IBV strains. Top panel: 5'UTR & ORF1. Shadowed regions were used for comparative analysis. 5'UTR-5'-untranslated region; PL1-papain like proteinase1; Mpro-main or 3C-like proteinase; RdRp-RNA-dependent-RNA polymerase. Bottom panel: ORF2 to 3'UTR. Structural genes and their ORFs are marked by (●). Conserved sequence TGTGTTGATTATAAT in S1 gene is shown; ◆ denotes plausible recombination site in Ark DPI.