| Literature DB >> 19300512 |
Brian A Fox1, Paul O Sheppard, Patrick J O'Hara.
Abstract
BACKGROUND: Type-I interferons, type-II interferons, and the IL-10 family are helical cytokines with similar three-dimensional folds. However, their homologous relationship is difficult to detect on the basis of sequence alone. We have previously described the discovery of the human type-III interferons (IFN lambda-1, -2, -3 or IL-29, IL-28A, IL-28B), which required a combination of manual and computational techniques applied to predicted protein sequences. PRINCIPALEntities:
Mesh:
Substances:
Year: 2009 PMID: 19300512 PMCID: PMC2654155 DOI: 10.1371/journal.pone.0004933
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Gene structure of human IFNL1, IFNL2 and IFNL3.
Each numbered box represents an exon (not drawn to scale). The shorter extensions at the terminal exons represent noncoding DNA. The letters inside the boxes are protein structural features: (S)ecretion peptide, helix (A), helix (B), helix (C) and helix (D).
Number and gene structure of Interferon Lambda genes in many genomes from ensembl v49.
| Species (brief name) | Maximum number of multi-exon gene loci | Number of single exon gene loci (and chromosomal location, if known) | Number of nearly full length genes (and pseudogene, if confirmed) | genome sequence version (in ENSEMBL 49) |
| Armadillo | - | - | - | ARMA |
| Bat | 5 | 2 | 1 | MICROBAT1 |
| Bush baby | 1 | - | - | BUSHBABY1 |
| Cat | 4 | 1 | 3 | CAT |
| Chimp | 4 (c19) | - | 1 | CHIMP2.1 |
| Cow | - | 1 | 1 | Btau_3.1 |
| Dog | 2 (c1) | 1 (c24) | 3 | BROADD2 |
| Elephant | 2 | - | - | BROADE1 |
| Guinea pig | 3 | - | 1 | GUINEAPIG |
| Hedgehog | 5 | - | - | HEDGEHOG |
| Horse | 3 | 1 (c10) | 3 | EquCab2 |
| Human | 4 (19q13) | - | 3 (1) | NCBI36 |
| Lemur | 3 | - | - | micMur1 |
| Mouse | 3 (7A3) | - | 2 (1) | NCBIM37 |
| Opossum | 2 (c4, c3) | - | - | BROADO5 |
| Orangutan | 6 (c19) | - | 1 | PPYG2 |
| Pika | 5 | - | - | pika |
| Platypus | 3 | - | - | OANA5 |
| Rabbit | 2 | - | - | RABBIT |
| Rat | 4 (c1, c17) | - | 1 | RGSC3.4 |
| Rhesus | 6 | - | 1 | MMUL_1 |
| Shrew | 9 | - | 5 | COMMON_SHREW1 |
| Squirrel | 2 | - | - | SQUIRREL |
| Tree shrew | 2 | - | 1 | TREESHREW |
| Chicken | 1 (c7) | - | - | WASHUC2 |
| Frog | - | - | - | JGI4.1 |
| Fugu | - | - | - | FUGU4 |
| Zebra fish | - | - | - | ZFISH7 |
| C elegans | - | - | - | WS180 |
| Fly | - | - | - | BDGP5.4 |
| Mosquito | - | - | - | AgamP3 |
| Yeast | - | - | - | SGD1.1 |
To be consistent, this total includes genes and pseudogenes. For draft genomes, not finding all exons in a short contig due to assembly and sequencing gaps is indistinguishable from a pseudogene with some exons missing.
the noted species has one gene which needs a frameshift of 1 bp or 2 bp needed in order to stay in the “correct” reading frame. This frameshift might indicate that this is a pseudogene, or it might be a sequencing error due to the draft nature of the genome sequence.
Figure 2Phylogenetic tree using the actual or predicted DNA sequences of spliced interferon lambda genes or pseudogenes in human, mouse, dog and guinea pig.
Human and mouse gene loci are labeled as in Lasfar et. al. [18]. The labels for dog and guinea pig are not intended to be suggestions for permanent names. The “dog1” gene is the single exon gene from dog on chromosome 24, while “dog2” and “dog3” are multi exon genes located near each other on chromosome 1 (see Supplementary Figure S1 for gene sequences). The sequence for the more distant mouse pseudogene, named interferon lambda 4 in Lasfar et. al., was excluded because we were unable to find the locus. The optimal tree is drawn to scale, with bootstrap values shown at branch points and branch lengths in the same units as those of the evolutionary distances used to infer the phylogenetic tree (sum of branch length = 1.0194). All positions containing gaps and missing data were eliminated from the dataset, and the full codons were used. There were a total of 314 nucleotide positions in the final dataset, compared to the longest sequence of 622 nucleotides (the pseudogenes shortened the total number of columns which were available for alignment).