Literature DB >> 27294016

Quality standards for DNA sequence variation databases to improve clinical management under development in Australia.

B Bennetts¹, M Caramins², A Hsu³, C Lau⁴, S Mead⁵, C Meldrum⁶, T D Smith⁷, G Suthers⁸, G R Taylor³, R G H Cotton⁷, V Tyrrell⁴.

Abstract

Despite the routine nature of comparing sequence variations identified during clinical testing to database records, few databases meet quality requirements for clinical diagnostics. To address this issue, The Royal College of Pathologists of Australasia (RCPA) in collaboration with the Human Genetics Society of Australasia (HGSA), and the Human Variome Project (HVP) is developing standards for DNA sequence variation databases intended for use in the Australian clinical environment. The outputs of this project will be promoted to other health systems and accreditation bodies by the Human Variome Project to support the development of similar frameworks in other jurisdictions.

Entities: CellLine Disease Gene Species

Keywords: Data quality; Genetic variation databases; Global knowledge sharing; Standards

Year: 2014 PMID： 27294016 PMCID： PMC4888016 DOI： 10.1016/j.atg.2014.07.002

Source DB: PubMed Journal: Appl Transl Genom ISSN： 2212-0661

Introduction

It has now become routine practice to compare sequence variations identified during clinical genetic testing with variants recorded in a wide range of genetic variation databases as well as in the scientific literature to aid in understanding the potential clinical significance and determining a definitive diagnosis. Although numerous genetic variation databases already exist, there are few that meet the accuracy and reproducibility required for clinical diagnostics. Current databases are of variable quality and many contain errors in variant calls, non-standardized nomenclature, incomplete pathogenicity associations and limited phenotypic information linked to genomic data (Saunders et al., 2012). These all represent limitations and risks to the quality of patient care. Based on the current research experience of highly curated mutation data (Thompson et al., 2014, Sosnay et al., 2013) the curation of databases to clinical standards is likely to require a substantial investment of time and effort. The increasing ease of access to technologies such as massively parallel sequencing is producing increasing volumes of genomic data that needs to be recorded in an organized, accurate manner. The integrity of this stored data is critical as there becomes a greater demand for analysis and interpretation in clinical research and diagnostics, a task which now forms a substantial proportion of the genetic diagnostic workload. There are numerous initiatives and white papers, which discuss the steps needed to allow for responsible integration of emerging genomic technologies into mainstream clinical diagnostics, many of which touch on data quality and collection. Some of these are described below. Data to Discovery: Genomes to Health (Ahalt et al., 2014) made recommendations on data provenance, collection, and management; delineation of phenotypes; adjudication of genomic variants; biostatistics and bioinformatics, data sharing; and bioethics and the law. The Global Alliance for Genomics and Health has established data, security, regulatory and ethics, and clinical working groups who have established priorities which include the development of formal data models, application programming interface (API) implementations for submitting, exchanging, querying, and analyzing genomic data (Global Alliance for Genomics and Health, 2014). The British Society for Genetic Medicine made public the outcomes of the BSGM 100,000 Genome Group which made recommendations on the collaborative development of appropriate genomic standards and policies, promotion of data sharing, and further development of the existing NHS Diagnostic Mutation Database (DMuDB) and DECIPHER database to be more readily usable for the clinical laboratory (Burn and Douglas, 2013). This was followed up by reports on recommendations from the United Kingdom appointed working groups (https://www.gov.uk/government/publications/mapping-100000-genomes-strategic-priorities-data-and-ethics). Recent challenges being addressed by the eMERGE network and others include collection of phenotype data, the integration of genomic findings into electronic health records, and the current efforts to extend HL7 Version 2 vocabularies for exome and whole genome sequencing within the context of clinical workflows (Kullo et al., 2014, Chute et al., 2013). In September 2013, the National Human Genome Research Institute (NHGRI) and the Eunice Kennedy Shriver National Institute of Child Health and Development (NICHD) awarded USD25M to support a consortium of three groups to design and implement a framework for evaluating variants, and their role in patient care. This consortium is enabling access to this information through the NCBI ClinVar database. The International Collaboration for Clinical Genomics (ICCG) is a part of this project, and is intended to support data collection and sharing (http://www.iccg.org/about-the-iccg/clingen/). In addition to the white papers and initiatives, there is a growing number of best practice policies and guidelines addressing the responsible integration of genomics into a clinical environment such as those released by the Association for Clinical Genetic Science (ACGS, part of the British Society of Genetic Medicine (BSGM)) and the Dutch Society of Clinical Genetic Laboratory Specialists (VKGL) (Wallis et al., 2013), American College of Medical Genetics and Genomics (ACMG) (Rehm et al., 2013), the Clinical Molecular Genetics Society, UK (CMGS), also part of the British Society of Genetic Medicine (BSGM) (Ellard et al., 2012), Best Practice Guidelines for the use of Next-Generation Sequencing Applications in Genome Diagnostics from a National Collaborative Study of Dutch Genome Diagnostic Laboratories (Weiss et al., 2013), a draft NIH Genomic Data Sharing Policy (Draft NIH Genomic Data Sharing Policy — Request for Public Comments, 2013), and conclusions from a working group of experts in genomic research, analysis and clinical diagnostic sequencing convened by the NHGRI (MacArthur et al., 2014). All of these guidelines partially address data within the clinical genomics workflow, however they do not focus specifically on the area. Collection of information related to genetic variation is not a new concept, with over 2000 locus specific databases established with disease and/or gene specific variation information. There are currently no established ISO standards which govern sequence variation databases. There are however numerous de-facto standards and established best practices (Vihinen et al., 2012). While this aids with providing consistent formats, they are in part outdated as genomic data becomes more readily accessible and available. With regard to guidelines for the establishment of locus specific databases (LSDBs), the Human Variome Project (HVP) has been collaborating with the Human Genome Variation Society, and the GEN2PHEN project, working towards standardizing the way that variation and pathogenicity data is presented. In addition Celli et al. developed a supporting document describing curation of a gene variant database as first step to establishing guidelines for database curation (Celli et al., 2012). The HVP continues to promote global standards and guidelines which encourage the establishment and maintenance of quality-assured sequence variation data repositories. Their ongoing work is described further below.

The standards development project

Despite the initiatives and guidelines described above, there are no specific standards or equivalent mechanisms which concentrate on guiding the accreditation of DNA sequence variation databases to ensure the accuracy, quality, and ongoing maintenance of uploaded data into any central repository to meet the needs of the clinical diagnostics environment. An Australian national project led by the Royal College of Pathologists of Australasia (RCPA) in collaboration with the Human Genetics Society of Australasia (HGSA) and the Human Variome Project (HVP) is developing standards for DNA sequence variation databases intended for use in the clinical environment. This project is being supported by the Australian Department of Health's Quality Use of Pathology Program (QUPP). The standards under development will be a broad reaching set of national standards that are sympathetic to the rapidly changing landscape of genomics in the clinic to seek compliance by both existing and future databases. The fundamental principle of the document is to provide a standard for oversight for DNA sequence variation databases intended to provide utility in clinical diagnostic service delivery, and thereby ensure that they are developed, curated, and maintained as safe, secure, and accurate repositories of genomic data. They are intended to complement existing laboratory standards and accreditation requirements, align with global initiatives and guidelines in existence, act as a guide to identify a quality database, establish new databases as well as improve existing databases that have evolved out of the research environment, and set minimum requirements for clinical purposes within the boundaries of existing legislation both nationally and globally.

The standards framework

The framework within which the standards are being developed consists of nine key areas described in Table 1. The framework is intended to adequately address the accreditation requirements in a systematic order with clearly defined and concise criteria. In each section of the document, points deemed important for practice will be identified as either ‘Standards’ or ‘Guidelines’ in the style of current National Pathology Accreditation Advisory Council (NPAAC) documents (National Pathology Accreditation Advisory Council (NPAAC), 2008). A Standard will be considered the minimum requirement for a procedure, method, staffing resource or laboratory facility that is required before a laboratory can attain accreditation. A Guideline will be a consensus recommendation for best practice and should be used if a higher level of practice is appropriate. A Commentary may also be provided to give clarification to the Standards and Guidelines as well as to provide examples and guidance on interpretation of the statements.

Table 1

Framework for development of standards for DNA sequence variation databases.

Framework areas	Items being considered in each of the areas (include, but not limited to)
Purpose	• Scope of the database • Nature of information being held in the database • Quality parameters • Standard operating procedures
Governance	• Custodian definition, accountability, and responsibility • Mechanisms for complaints, troubleshooting, auditing, and risk mitigation • Ethics committee, advisory board, and multidisciplinary team involvement • Sustainability, and contingency in case of demise • Compliance with jurisdictional legislations and or regulations
Establishment	• Principle hardware and software requirements including web interfacing, networking, infrastructure, storage, backup capabilities • Compatibility — external databases, electronic health/medical records (EMR/EHR), HL7 V2, SNOMED-CT, federated databases.
Protection privacy security	• Content of an information policy (such as how data are collected, used, disclosed, managed, administered, stored, and accessed) • Compliance with local Australian (Privacy Amendment Act 2012) and other jurisdictional legislation/regulation such as HIPAA. • Consent for storage of data, and use of data for diagnostic and or research purposes • Privacy, security through de-identification, data encryption, and protected access • Security breach management
Content	• Data to be collected and submitted including but not limited to data structure, nomenclature and variant description, methodology used to detect the variant, orthogonal method verification, sequence quality data, reference genome, provenance of existing data, variant occurrences, inheritance information, phenotype, and clinical accreditation status of submitting laboratories.
Functionality	• Version control, modifications • Interrogation and return of information from external databases, linkage of variant occurrences and familial grouping • Mechanisms to track de-identified data to facilitate patient management.
Currency of information	• Specific DNA database curation definition and requirements • Filtering and triaging variant calls, determination of relevance and inclusion • Quality controls and evaluation of level of confidence in accuracy • Maintaining relevance and accuracy of data, • Maintaining currency of genome builds and compatibility of variants recorded • Regular audits to assure quality of the database schema and data held within.
Access & sharing	• Policy governing participation through access and sharing • Mechanisms for facilitating access and sharing through secure practices • User registration, and the clinical need to utilize the data • Communication between user and curator/custodian • Quality Control, auditing of access and sharing
Professional use	• Standardizing ontology within a database, or between federated databases • Variant classification, traceability of clinical reports, re-analysis • Skill sets, knowledge base, and experience required • Workforce training and development

Implementation of the standards

Accreditation of pathology laboratories for clinical service delivery in Australia is overseen by the National Pathology Accreditation Advisory Council (NPAAC). NPAAC is an agency within the Commonwealth (Federal) Department of Health. NPAAC plays a key role in ensuring the quality of Australian pathology services, and is responsible for the development and maintenance of standards and guidelines for pathology practices (http://www.health.gov.au/npaac). The National Association of Testing Authorities (NATA) is the authority which provides independent assurance of technical competence in conjunction with the Royal College of Pathologists of Australasia (RCPA) through a proven network of best practice industry experts. NATA/RCPA provides assessment, accreditation, and training services to laboratories and technical facilities throughout Australia and internationally (http://www.nata.asn.au). NATA audits against the standards and guidelines laid down by NPAAC. Laboratories seeking eligibility for Federal government funding for medical tests are required to meet the specified quality standards as expressed by NPAAC in the context of the Australian pathology accreditation framework. There are a number of specialized technical publications that specify requirements in laboratories undertaking specific areas of medical testing in addition to requirements for good medical practice in all pathology laboratories. The DNA Sequence Variation Database Standards under development are intended to be an adjunct to existing NPAAC standards and guidelines such as “Requirements for Medical Pathology Services (Requirements for Medical Pathology Services (First Edition, 2013); National Pathology Accreditation Advisory Council (NPAAC), 2008)” and “Requirements for the Retention of Laboratory Records and Diagnostic Material (Requirements for the Retention of Laboratory Records and Diagnostic Material (Sixth Edition, 2013) National Pathology Accreditation Advisory Council, 2013)”. When completed, the standards will be submitted for potential endorsement by the RCPA and HGSA boards, and will be made available as a tool for laboratories and NATA assessors alike to facilitate accreditation. Further, the RCPA will engage the NPAAC to seek their inclusion of these Standards in the Commonwealth Health Insurance (Accredited Pathology Laboratories) — Approval Principles 2002. It is recognized that there is a need to bridge a gap between the translational research environment and the clinical diagnostic environment, and therefore regulation of the use of data within the scope of the respective environments. To address this, in addition to the NATA/RCPA and NPAAC requirements, the Standards will encourage users to comply with the Australian Government National Health and Medical Research Council National Statement on Ethical Conduct in Human Research 2007 (Updated March 2014) (National Statement on Ethical Conduct in Human Research, 2007).

Challenges to implementation

There are foreseeable challenges to the implementation of a set of standards such as those described above. Initial acceptance and implementation of a new set of standards can be difficult to achieve without end users supporting the accreditation or compliance requirements. Early communication of this initiative is underway, and includes broad consultation with key experts and stakeholders who will be impacted by the introduction of standards, and presentation of the standards in draft form at local scientific meetings for discussion. It is the intention of the project steering committee that the resulting set of standards gains support prior to their final release. Further to this, to ensure the implementation of and compliance with the standards, continued accreditation could be monitored via the development by a professional organization of a time limited license or registration program applied to the databases and operators of those databases. Elements could include an external quality assessment program and automated auditing or review of the elements, functions, and curation of the databases. Online training and certification of database users under a continuing professional development (CPD) or continuing medical education (CME) program could be implemented to ensure the information held in databases is appropriately utilized in a clinical environment. Sequence variation databases are housed both locally and offshore in multiple countries, with ownership existing outside of Australian jurisdiction. It will be difficult for laboratories to apply these standards locally unless they “own” the database. However, the standards will provide them with a tool to judge the integrity and therefore the level of confidence that they might apply to an overseas database, which in turn can be included in their quality systems for future accreditation of bioinformatic pipelines for analysis and interpretation. This project is being undertaken within the context of the Australian healthcare system and its national- and state-based legislation and regulations governing the quality of medical services. However, given the global reach of individual databases, the findings from this project should be applicable to other countries with similar medico-legal frameworks, and perhaps more broadly. Sharing knowledge, experience, and aligning standards globally in a structured and coordinated manner is critical to advancing the successful implementation of genomics testing in the clinical environment.

Broader adoption and the global view

Gaining international consensus and commitment to consistent standards in medical testing represents a major challenge. One mechanism for achieving this outcome is the Human Variome Project, an international initiative to integrate the routine and responsible sharing of genetic variation information into standard clinical practice. The Human Variome Project is a consortium of researchers, diagnosticians and health-care professionals committed to the free and open sharing of genetic variation information generated during clinical testing, thereby leading to better patient outcomes and more accessible genetic health services. The Project is working towards establishing globally acceptable Standards and Guidelines for the collection, curation, interpretation and sharing of genomic knowledge and enabling the sustainable development and operation of a harmonized and federated global knowledge sharing network. A key aspect of this work is harmonizing national and regional efforts around regulatory frameworks and governance of electronic data repositories and knowledge sharing infrastructure. The Project, through its Variant Database Quality Assessment Working Group has specified guidelines for quality parameters that should be assessed in a quality accreditation scheme (in press). In addition to The Human Variome Project global initiatives, Australia is well represented in the Global Alliance for Genomics and Health (http://www.genomicsandhealth.org), with Alliance partners including the Human Variome Project, National Health and Medical Research Council (NHMRC), Australian Genome Research Facility (http://agrf.org.au), Garvan Institute of Medical Research, Melbourne Genomics Health Alliance and other highly regarded groups (http://genomicsandhealth.org/partners).

Conclusion

Regulating the quality, accuracy, and relevance of DNA sequence variation databases and the data held within them through the implementation of standards will reduce the risk of aberrant or uninformative variants being reported, promote the sharing of clinical quality sequencing, and accelerate the delivery of accurate, actionable, and efficient clinical reports to improve patient management and outcomes. The Australian standards development reported above will build on work undertaken to date, and is a promising step towards national and regional harmonization efforts. We hope that the outcome of this project will be of interest to other countries and health systems.

10 in total

1. Guidelines for establishing locus specific databases.

Authors: Mauno Vihinen; Johan T den Dunnen; Raymond Dalgleish; Richard G H Cotton
Journal: Hum Mutat Date: 2011-12-09 Impact factor: 4.878

2. Curating gene variant databases (LSDBs): toward a universal standard.

Authors: Jacopo Celli; Raymond Dalgleish; Mauno Vihinen; Peter E M Taschner; Johan T den Dunnen
Journal: Hum Mutat Date: 2011-11-03 Impact factor: 4.878

Review 3. Best practice guidelines for the use of next-generation sequencing applications in genome diagnostics: a national collaborative study of Dutch genome diagnostic laboratories.

Authors: Marjan M Weiss; Bert Van der Zwaag; Jan D H Jongbloed; Maartje J Vogel; Hennie T Brüggenwirth; Ronald H Lekanne Deprez; Olaf Mook; Claudia A L Ruivenkamp; Marjon A van Slegtenhorst; Arthur van den Wijngaard; Quinten Waisfisz; Marcel R Nelen; Nienke van der Stoep
Journal: Hum Mutat Date: 2013-08-19 Impact factor: 4.878

4. Rapid whole-genome sequencing for genetic disease diagnosis in neonatal intensive care units.

Authors: Carol Jean Saunders; Neil Andrew Miller; Sarah Elizabeth Soden; Darrell Lee Dinwiddie; Aaron Noll; Noor Abu Alnadi; Nevene Andraws; Melanie LeAnn Patterson; Lisa Ann Krivohlavek; Joel Fellis; Sean Humphray; Peter Saffrey; Zoya Kingsbury; Jacqueline Claire Weir; Jason Betley; Russell James Grocock; Elliott Harrison Margulies; Emily Gwendolyn Farrow; Michael Artman; Nicole Pauline Safina; Joshua Erin Petrikin; Kevin Peter Hall; Stephen Francis Kingsmore
Journal: Sci Transl Med Date: 2012-10-03 Impact factor: 17.956

Review 5. Some experiences and opportunities for big data in translational research.

Authors: Christopher G Chute; Mollie Ullman-Cullere; Grant M Wood; Simon M Lin; Min He; Jyotishman Pathak
Journal: Genet Med Date: 2013-09-05 Impact factor: 8.822

6. ACMG clinical laboratory standards for next-generation sequencing.

Authors: Heidi L Rehm; Sherri J Bale; Pinar Bayrak-Toydemir; Jonathan S Berg; Kerry K Brown; Joshua L Deignan; Michael J Friez; Birgit H Funke; Madhuri R Hegde; Elaine Lyon
Journal: Genet Med Date: 2013-07-25 Impact factor: 8.822

7. Guidelines for investigating causality of sequence variants in human disease.

Authors: D G MacArthur; T A Manolio; D P Dimmock; H L Rehm; J Shendure; G R Abecasis; D R Adams; R B Altman; S E Antonarakis; E A Ashley; J C Barrett; L G Biesecker; D F Conrad; G M Cooper; N J Cox; M J Daly; M B Gerstein; D B Goldstein; J N Hirschhorn; S M Leal; L A Pennacchio; J A Stamatoyannopoulos; S R Sunyaev; D Valle; B F Voight; W Winckler; C Gunter
Journal: Nature Date: 2014-04-24 Impact factor: 49.962

8. Application of a 5-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants in the InSiGHT locus-specific database.

Authors: Bryony A Thompson; Amanda B Spurdle; John-Paul Plazzer; Marc S Greenblatt; Kiwamu Akagi; Fahd Al-Mulla; Bharati Bapat; Inge Bernstein; Gabriel Capellá; Johan T den Dunnen; Desiree du Sart; Aurelie Fabre; Michael P Farrell; Susan M Farrington; Ian M Frayling; Thierry Frebourg; David E Goldgar; Christopher D Heinen; Elke Holinski-Feder; Maija Kohonen-Corish; Kristina Lagerstedt Robinson; Suet Yi Leung; Alexandra Martins; Pal Moller; Monika Morak; Minna Nystrom; Paivi Peltomaki; Marta Pineda; Ming Qi; Rajkumar Ramesar; Lene Juel Rasmussen; Brigitte Royer-Pokora; Rodney J Scott; Rolf Sijmons; Sean V Tavtigian; Carli M Tops; Thomas Weber; Juul Wijnen; Michael O Woods; Finlay Macrae; Maurizio Genuardi
Journal: Nat Genet Date: 2013-12-22 Impact factor: 38.330

9. Defining the disease liability of variants in the cystic fibrosis transmembrane conductance regulator gene.

Authors: Patrick R Sosnay; Karen R Siklosi; Fredrick Van Goor; Kyle Kaniecki; Haihui Yu; Neeraj Sharma; Anabela S Ramalho; Margarida D Amaral; Ruslan Dorfman; Julian Zielenski; David L Masica; Rachel Karchin; Linda Millen; Philip J Thomas; George P Patrinos; Mary Corey; Michelle H Lewis; Johanna M Rommens; Carlo Castellani; Christopher M Penland; Garry R Cutting
Journal: Nat Genet Date: 2013-08-25 Impact factor: 38.330

10. Return of results in the genomic medicine projects of the eMERGE network.

Authors: Iftikhar J Kullo; Ra'ad Haddad; Cynthia A Prows; Ingrid Holm; Saskia C Sanderson; Nanibaa' A Garrison; Richard R Sharp; Maureen E Smith; Helena Kuivaniemi; Erwin P Bottinger; John J Connolly; Brendan J Keating; Catherine A McCarty; Marc S Williams; Gail P Jarvik
Journal: Front Genet Date: 2014-03-26 Impact factor: 4.599

10 in total

1 in total

1. Public variant databases: liability?

Authors: Adrian Thorogood; Robert Cook-Deegan; Bartha Maria Knoppers
Journal: Genet Med Date: 2016-12-15 Impact factor: 8.822

1 in total