OBJECTIVE: An important proportion of coding sequences in genomes, notably in viruses, do not match any sequences in databases and are assigned as ORFan sequences. Nucleocytoplasmic large DNA viruses (NCLDVs) harbor great numbers of ORFs with a high number consisting of ORFans. Thus, we decided to decipher the nature of ORFans in the NCLDVs. METHODS: A genome-wide study was carried out to estimate the ORFan proportion in NCLDV genomes and to analyze their general features compared with non-ORFan. RESULTS: The ORFan percentages comprised between 2.8 and 75.2% of the ORF content according to the virus lineage. We propose to classify ORFans in four categories according to their possible match with metagenomic sequences and their prevalence at different taxonomic ranks. Our results indicate that NCLDV ORFans have overall similar features with non-ORFans, except they are shorter. CONCLUSIONS: An ORFan classification scheme was proposed to decipher their origin and evolution. Most ORFans were likely labeled ORFan owing to the gap of knowledge of the sequence space. ORFans might be true functional genes with likely the same expression potential as non-ORFan genes. Part of them may also correspond to new genes formed de novo through the diverse mechanisms of gene evolution. Copyright 2010 S. Karger AG, Basel.
OBJECTIVE: An important proportion of coding sequences in genomes, notably in viruses, do not match any sequences in databases and are assigned as ORFan sequences. Nucleocytoplasmic large DNA viruses (NCLDVs) harbor great numbers of ORFs with a high number consisting of ORFans. Thus, we decided to decipher the nature of ORFans in the NCLDVs. METHODS: A genome-wide study was carried out to estimate the ORFan proportion in NCLDV genomes and to analyze their general features compared with non-ORFan. RESULTS: The ORFan percentages comprised between 2.8 and 75.2% of the ORF content according to the virus lineage. We propose to classify ORFans in four categories according to their possible match with metagenomic sequences and their prevalence at different taxonomic ranks. Our results indicate that NCLDV ORFans have overall similar features with non-ORFans, except they are shorter. CONCLUSIONS: An ORFan classification scheme was proposed to decipher their origin and evolution. Most ORFans were likely labeled ORFan owing to the gap of knowledge of the sequence space. ORFans might be true functional genes with likely the same expression potential as non-ORFan genes. Part of them may also correspond to new genes formed de novo through the diverse mechanisms of gene evolution. Copyright 2010 S. Karger AG, Basel.
Authors: Dorine Gaëlle Reteno; Samia Benamar; Jacques Bou Khalil; Julien Andreani; Nicholas Armstrong; Thomas Klose; Michael Rossmann; Philippe Colson; Didier Raoult; Bernard La Scola Journal: J Virol Date: 2015-07 Impact factor: 5.103
Authors: Paulo V M Boratto; Graziele P Oliveira; Talita B Machado; Ana Cláudia S P Andrade; Jean-Pierre Baudoin; Thomas Klose; Frederik Schulz; Saïd Azza; Philippe Decloquement; Eric Chabrière; Philippe Colson; Anthony Levasseur; Bernard La Scola; Jônatas S Abrahão Journal: Proc Natl Acad Sci U S A Date: 2020-06-29 Impact factor: 11.205
Authors: Felipe L Assis; Leena Bajrai; Jonatas S Abrahao; Erna G Kroon; Fabio P Dornas; Kétyllen R Andrade; Paulo V M Boratto; Mariana R Pilotto; Catherine Robert; Samia Benamar; Bernard La Scola; Philippe Colson Journal: Viruses Date: 2015-06-26 Impact factor: 5.048
Authors: Sihem Hannat; Pierre Pontarotti; Philippe Colson; Marie-Line Kuhn; Eric Galiana; Bernard La Scola; Sarah Aherfi; Franck Panabières Journal: Front Microbiol Date: 2021-06-01 Impact factor: 5.640