Robert Castelo, Alexandre Reymond, Carine Wyss, Francisco Câmara, Genís Parra, Stylianos E Antonarakis, Roderic Guigó, Eduardo Eyras.
Abstract
The recent availability of the chicken genome sequence poses the question of whether there are human protein-coding genes conserved in chicken that are currently not included in the human gene catalog. Here, we show, using comparative gene finding followed by experimental verification of exon pairs by RT-PCR, that the addition to the multi-exonic subset of this catalog could be as little as 0.2%, suggesting that we may be closing in on the human gene set. Our protocol, however, has two shortcomings: (i) the bioinformatic screening of the predicted genes, applied to filter out false positives, cannot handle intronless genes; and (ii) the experimental verification could fail to identify expression at a specific developmental time. This highlights the importance of developing methods that could provide a reliable estimate of the number of these two types of genes.
Year: 2005 PMID: 15809229 PMCID: PMC1074396 DOI: 10.1093/nar/gki328
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Tissue distribution for the positive cases
| Identifier | Br | He | Ki | Sp | Li | Co | SI | Mu | Lu | St | Te | Pl | Sk | PBL | BM | FB | FL | FK | FH | FU | Th | Pa | MG | Pr |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| chr18_515 | + | |||||||||||||||||||||||
| chr15_51 | + | + | + | + | + | + | ||||||||||||||||||
| chr4_1746 | + | + | ||||||||||||||||||||||
| chr5_400 | + | |||||||||||||||||||||||
| chr4_55 | + | + | + | + | + | + | + | + | ||||||||||||||||
| chr22_143 | + | + |
Two-letter code for the tissues in Table 1
| Code | Tissue |
|---|---|
| Br | Brain |
| He | Heart |
| Ki | Kidney |
| Sp | Spleen |
| Li | Liver |
| Co | Colon |
| SI | Small intestine |
| Mu | Muscle |
| Lu | Lung |
| St | Stomach |
| Te | Testis |
| Pl | Placenta |
| Sk | Skin |
| PBL | Peripheral blood leukocyte |
| BM | Bone marrow |
| FB | Fetal brain |
| FL | Fetal liver |
| FK | Fetal kidney |
| FH | Fetal heart |
| FU | Fetal lung |
| Th | Thymus |
| Pa | Pancreas |
| MG | Mammary gland |
| Pr | Prostate |
Analysis of the six experimentally verified novel human genes
| Identifier | Pfam | ESTs | Homology | Identity (%) | Coverage (%) | Description |
|---|---|---|---|---|---|---|
| Chr15_51 | Mpp10 (PF04006), Neur_chan_LBD (PF02931) | | NP_997588.1 | 74% (amino acid) | 56% (amino acid) | ELMO2 ( |
| Chr18_515 | Ski_Sno (PF02437) | | AK049035 | 80% (amino acid) | 89% (amino acid) | Riken mouse cDNA |
| Chr22_143 | P2X_receptor (PF00864) | BX096265, BE876713 | P2RXL1 | 94% (amino acid) | 94% (amino acid) | Purinergic receptor P2X-like 1, orphan receptor |
| Chr4_1746 | | BQ429300, CN281994, CN281995 | AK006501 | 77% (amino acid) | 98% (amino acid) | Riken mouse cDNA of unknown function |
| Chr5_400 | | BQ428697, BF218453, CN265566, CD248366, BG496466 | BC039102.1 | 99% (nucleotide) | 73% (nucleotide) | IMAGE Clone - SelP precursor |
| Chr4_55 | | BU075833, AW163448, BE047596, AL583585, BM688172 | CAG12806.1 | 83% (amino acid) | 71% (amino acid) | Tetraodon protein of unknown function |
The identifiers of the predictions include the name of the chromosome on which they were predicted and a label that distinguishes the genes predicted on the same chromosome. Three of the genes match Pfam domains and four match genome-specific human ESTs (not all are given). The percentage identity of the alignment with the homologous sequence (Identity) and the proportion of the gene prediction covered by this alignment (Coverage) are also given. We indicate whether each alignment is at the amino acid or the nucleotide level.
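The identifier convention and the two alignment statistics described in this note can be illustrated with a minimal Python sketch. It is not the authors' code. The names `PredictionId`, `parse_prediction_id`, `percent_identity` and `percent_coverage` are hypothetical, and the two formulas are assumptions based on the usual definitions (matching positions over alignment length, and aligned positions over prediction length); the record does not state how the values were computed.

```python
import re
from typing import NamedTuple


class PredictionId(NamedTuple):
    """Hypothetical container for the two parts of a prediction identifier."""
    chromosome: str  # chromosome name, e.g. "15"
    label: int       # label that distinguishes genes on the same chromosome


def parse_prediction_id(identifier: str) -> PredictionId:
    """Split an identifier such as 'chr15_51' or 'Chr15_51' into its parts."""
    match = re.fullmatch(r"chr([0-9A-Za-z]+)_(\d+)", identifier, flags=re.IGNORECASE)
    if match is None:
        raise ValueError(f"unrecognized prediction identifier: {identifier!r}")
    return PredictionId(chromosome=match.group(1), label=int(match.group(2)))


def percent_identity(matching_positions: int, alignment_length: int) -> float:
    """Assumed definition: matching positions as a percentage of the alignment."""
    return 100.0 * matching_positions / alignment_length


def percent_coverage(aligned_positions: int, prediction_length: int) -> float:
    """Assumed definition: share of the gene prediction covered by the alignment."""
    return 100.0 * aligned_positions / prediction_length


if __name__ == "__main__":
    # Identifiers taken from the tables; the lengths below are made-up inputs.
    print(parse_prediction_id("Chr15_51"))
    print(percent_coverage(aligned_positions=56, prediction_length=100))
```

The parser ignores case because the tables write the same predictions as both `chr15_51` and `Chr15_51`.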