| Literature DB >> 26917942 |
Abstract
Some genetic systems frequently present ambiguous data that cannot be straightforwardly analyzed with common methods of population genetics. Two possibilities arise to analyze such data: one is the arbitrary simplification of the data and the other is the development of methods adapted to such ambiguous data. In this article, we present an attempt at such a development, the uniformat grammar and The gene[rate] tools, highlighting the specific aspects and the adaptations required to analyze ambiguous nominal data in population genetics.Entities:
Keywords: EM algorithm; Hardy-Weinberg; ambiguous genetic data; data manipulation; frequency estimation; linkage disequilibrium
Year: 2016 PMID: 26917942 PMCID: PMC4762493 DOI: 10.4137/EBO.S32415
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
| # | a simple double heterozygous |
| # | for HLA-A and a locus with alleles k− and k+ |
| id | A * 01,A * 02:01 k−,k+ |
| # | an untyped case for first locus |
| id | @ k+,k− |
| # | a homozygous-or-blank-heterozygous for the second locus |
| id | A*02,A*11 k+ |
| # | a real homozygous for first locus and a multiple allele for second locus |
| id | A*01,A*01 B*07,B*14:01&B*14:02&B*14:05 |
| # | a case of multiple allele pairs for a locus |
| id | B*07,B*14:01|B*07,B*14:02|B*07,B*14:05 |
| sample | : data EOF | EOF |
| data | : data case | case |
| case | : IDENT LOCI_SEP full_pheno |
| full_pheno | : full_pheno LOCI_SEP locus_pheno | locus_pheno |
| locus_pheno | : multi_alp | ALLELE | MULTI_AL |
| multi_alp | : multi_alp ALP_SEP basic_alps | basic_alps |
| basic_alps | : ALLELE AL_SEP ALLELE |