Literature DB >> 34468209

Microsatellite Markers in Biobanking: A New Multiplexed Assay.

Gretchen Smith1, Debra Mathews2, Samuel Sander-Effron1, Deborah Requesens1,3, Nahid Turan1, Laura Scheinfeldt1.   

Abstract

Microsatellites, or MSATs, offer a fast and cost-effective way for biobanks to establish a biospecimen genetic profile. Importantly, this genetic profile can be used to authenticate multiple submissions derived from the same individual as well as biospecimens derived from the same original sample submission over time. While the Certificate of Confidentiality provided by the National Institutes of Health offers some meaningful protection to prevent the disclosure of potentially identifiable information to entities within the United States, we consider, in this study, the potential to offer additional protection to participants who choose to donate to biobanks by minimizing the use of forensic Combined DNA Index System (CODIS) MSAT markers in biobanking. To this end, we report the design and validation of a new multiplexed MSAT assay that does not include CODIS markers for use in biobanking operations and quality control management.

Entities:  

Keywords:  biobanking; biospecimen authentication; microsatellite

Mesh:

Year:  2021        PMID: 34468209      PMCID: PMC8665806          DOI: 10.1089/bio.2021.0042

Source DB:  PubMed          Journal:  Biopreserv Biobank        ISSN: 1947-5543            Impact factor:   2.300


Introduction

The ubiquity of repetitive elements in eukaryotic genomes has contributed to their extensive characterization and cataloging.[1] Microsatellites (MSATs), also known as short tandem repeats (STRs), comprise the subset of these elements with repeat sequences shorter than 14 bp in length. MSAT methodology emerged in the 1980s through efforts to leverage individual human genetic variation data to construct a unique genetic profile or genetic “fingerprint.”[2] Since that time, MSATs have been utilized in a wide range of applications, including population genetics,[3] forensic testing,[4] cell line authentication,[5,6] paternity testing in humans,[7] animals[8] and plants,[9] genetic mapping,[10,11] and biobanking.[12] MSATs are found across the human genome, exhibit relatively high levels of allelic variability in humans, and are inherited according to known principles of human genetics.[13]

Biospecimen genetic profiling

From the biospecimen perspective, there are two primary applications of MSAT methodology that we consider in more detail, biobanking and forensic testing. Biobanking involves the collection, processing, storage, and distribution of biological samples, including but not limited to blood, tissue, cell cultures, and DNA.[14] There are several advantages to the use of MSATs in biobanking. First, when multiple samples collected from the same individual are submitted to a biobank, MSAT profiling remains the fastest and most cost-effective way to compare genetic variation in DNA extracted from each submission vial to confirm that all of the samples originated from the same individual. Indeed, the data collection, storage, and analysis of MSATs are all more cost effective and time efficient relative to newer array-based or next-generation sequencing-based assays, especially when multiplexed.[15,16] Moreover, this value is especially relevant to older, historical biobanks that include thousands or tens of thousands of submissions with (or without) already established MSAT profiles. Second, biobanks that store MSAT profiles for each biospecimen can also compare profiles between submissions to identify potential duplicate submissions and submissions from related individuals. Third, initial MSAT profiles collected at submission can be stored and compared against MSAT profiles collected from derived products that are produced through downstream sample processing to ensure biospecimen identity and integrity over time. Fourth, MSAT profiles of derived biospecimens that are distributed back to the research community can be compared against the submission MSAT profile to confirm that every externally distributed sample is derived from the intended original submission. These third and fourth use cases are especially relevant for cell culture biobanking. Studies of lymphoblastoid cell line transformation,[17] passaging, and expansion[18] demonstrate negligible impact on genome-wide genetic variation over time. However, the extensive documentation of mischaracterized and misidentified cell line use in research demonstrates the critical importance of maintaining consistent measures of cell line integrity and quality control throughout biospecimen processing.[19-21] Forensic applications benefit from MSAT profiling in a similar manner. In these cases, MSAT profiles are typically compared between crime scene biospecimens and potential victim-derived biospecimens, potential suspect-derived biospecimens, or MSAT profiles already stored in forensic databases.[4,22] The Federal Bureau of Investigation (FBI) has typically used 13 STR loci for forensic profiling in the United States, although this was recently expanded to 20 loci, and Interpol uses 12 loci in the EU.[23,24] The FBI maintains the Combined DNA Index System (CODIS), which comprises MSAT profiles of DNA samples collected from crime scenes and from individuals for potential matches.[25]

Ethical issues raised by MSAT usage

Due to a long history of marginalization, exploitation, and abuse of black communities, indigenous peoples, and other communities of color by researchers, there is understandable mistrust within those communities of those of us who conduct research.[26,27] Relatedly, there is also a lack of representation of these communities in genetic research,[28,29] limiting what we can know and understand of the genetic diversity within these communities and the benefits that research can offer them. When the language we use to describe people/samples in research is the language of law enforcement, trust is more likely to be harmed than rebuilt, and efforts to improve equity in research will be frustrated. Given the increasing use of genetic information from noncriminal contexts by law enforcement to identify crime scene suspects, there is an additional concern about the use and potential misuse of genetic information that has been collected for other applications,[30,31] including research. Some protection of these data and the individuals from whom they were derived is offered by the National Institutes of Health (NIH) through Certificates of Confidentiality (COCs), which are issued automatically for all NIH-funded projects that use identifiable information collected from participants, and can be requested by projects that are not funded by the NIH.[32] The NIH COC protects against the disclosure of identifiable participant information “in any Federal, State, or local civil, criminal, administrative, legislative, or other proceeding”[32]; however, there are very few instances in which this protection has been tested in court.[32] While a COC might minimize the risk of misuse of CODIS MSATs collected from research participant biospecimens, it cannot address the concerns about trust. An additional protection that can be offered by biobanks to research participants who agree to donate biospecimens is the minimization of use of CODIS markers in biospecimen processing, quality control and integrity analyses, and authentication—describing samples in a language that is unique to research and not shared by law enforcement. In this study, we describe a new MSAT assay for use in biobanking operations and quality control management that minimizes risk of data misuse by law enforcement by excluding the CODIS MSATs that are used by law enforcement in the United States.

Methods

In 1997, Coriell scientists implemented an MSAT assay comprising six tetranucleotide repeats that are polymorphic in humans for biobanking applications (THO-1, D5S592, D10S526, vWA31, D22S417, and FES/FPS). Since 2004, Coriell has used an ABI 3730 fluorescent sequencer with 48 capillaries to assay this set of MSATs. This instrument is higher throughput and allows more flexible fluorochrome selection for assaying multiplexed sets of MSATs relative to the older ABI 377 that was previously employed. By using several fluorochromes, labeled primers can be multiplexed at the polymerase chain reaction (PCR) step, or combined in a single capillary run after PCR. This standard set of six MSATs is run with a primer pair targeting the amelogenin allele on the X and Y chromosomes. The amplicon size from the amelogenin gene on the X and Y chromosomes differ; thus, male samples generate two different-sized amplicons, and female samples generate two amplicons of the same size.[33] This feature, therefore, allows a single multiplex PCR assay to generate both a DNA fingerprint for biospecimen identity profiling and a determination of sex. The assay is routinely run on DNA extracted from each submission vial (most commonly whole blood) as well as on each derived biospecimen (most commonly transformed or expanded lymphoblastoid cell lines). This assay can also detect as low as 10% cross-contamination from other samples; in particular, the presence of more than two alleles at a given MSAT locus indicates cross-contamination. We note that when large-scale data, such as genome-wide single nucleotide polymorphism microarray data or next-generation whole genome sequencing data, are available, several more sensitive methods to identify cross-contamination are available.[34-36] For the past decade, Coriell has employed a supplemental panel of MSATs, the “Identifiler Plus” (the AmpFLSTR™ Identifiler™ Plus PCR Amplification Kit by Thermo Fisher Scientific), for added discrimination in the subset of cases where all standard six MSAT loci match between apparently distinct individuals. The Identifiler Plus marker set contains 15 STRs, of which 13 overlap core CODIS loci. More recently, in an effort to minimize the inclusion of CODIS markers in ongoing quality control efforts, Coriell has developed a new in-house supplemental panel of six MSATs.

MSAT assay development

One of the members of the NIGMS Human Genetic Cell Repository (NIGMS Repository) Scientific Advisory Committee, Andrew J. Sharp, PhD, shared a list of over 600 tetranucleotide MSATs with relatively high heterozygosity in human whole genome sequencing data. We prioritized candidates for our new assay based on those with the largest number of observed alleles. Of those 38 candidates, we used the Primer3 tool[37-39] to predict the range of PCR product size, GC content, primer melting temperature (Tm), and the degree of self-complementarity of each primer. We further used the Thermo Fisher Scientific Multiple Primer Analyzer to identify potential primer dimerization between primer pairs. After computational predictions were evaluated, we chose eight MSAT primer pairs to assess in the laboratory. Primers resulting in overlapping allele size ranges were tagged with distinct fluorochromes. Each of these primer pairs were first run in an individual PCR to confirm successful PCR amplification; seven of these primer pairs successfully amplified individually and were tested together in a multiplex PCR; and six of these primer pairs successfully amplified in a multiplexed PCR. A range of annealing temperatures were compared (58°C, 60°C, 62°C, 64°C, and 66°C), and the 64°C annealing temperature resulted in the best PCR performance. Supplementary Table S1 includes additional information on multiplex PCR conditions, and Supplementary Table S2 contains additional details on each primer pair. All of this initial primer testing was performed on three NIGMS Repository DNA samples (NA06990, NA10859, and NA07057). MSAT allele sizes were called using Thermo Fisher Scientific's GeneMapper™ software. As shown in Supplementary Table S2, one MSAT marker pair lies on the same chromosome, and the markers are over 13 Mb apart. When comparing the locations among the standard and new MSAT assay markers, two additional MSAT marker pairs lie on the same chromosome, each over 30 Mb apart. While linkage disequilibrium (LD) in humans has been documented over regions as large as 1 Mb,[40] the majority of LD regions across human genomes are much smaller and consistent with our assumption of relative independence across MSAT markers included in our standard and new MSAT assays. No IRB approval was needed to conduct this study.

Results

Assay implementation

For the current study, we leveraged data from the NINDS Human Genetics Resource Center (NINDS Repository), one of the largest biorepositories at Coriell with over 48,000 unique submissions. Each of the six standard MSAT markers has sufficiently high heterozygosity in the NINDS Repository sample data (0.78, 0.83, 0.90, 0.81, 0.84, and 0.70, respectively), such that the probability of identical “profiles” in unrelated individuals with this marker set is theoretically one in one million (when assuming independence among markers).[41] We calculate similar theoretical discriminatory power for the new set of six MSAT markers using the same approach,[41] assuming unrelated individuals and independence among markers, and using the heterozygosity information shared by Dr. Sharp; however, we do not have enough data to estimate the empirical discriminatory power of this marker set. In practice, all of the NINDS Repository samples have been assayed for the standard MSAT 6-plex, and the Molecular Biology Laboratory at Coriell has documented several matches (Table 1). Two of these matched pairs reflect comparisons between family members, while the remaining eight matched pairs occurred between apparently unrelated individuals. Three of these matched pairs occurred between individuals with distinct self-reported gender, while the remaining seven matched pairs occurred between individuals with the same self-reported gender. The results from the new MSAT assay demonstrate the ability to distinguish all 10 NINDS Repository matched pairs that previously required a second MSAT assay (in this case the Identifiler Plus assay) to differentiate (Table 1). This discrimination power holds even for related individuals. The siblings from family NINDS0760 can be distinguished by one allele for four of the six new MSATs, and the siblings from family NINDS5712 can be distinguished by one allele for two of the six new MSATs. Of the remaining eight pairs of individuals that shared profiles for all six of the standard MSAT assay, no more than one of the new assay's MSATs is shared between individuals. Therefore, in cases where the standard MSAT assay is unable to distinguish apparently unrelated or related but distinct individuals, adding the new MSAT assay offers additional and adequate discrimination power without using any additional CODIS markers.
Table 1.

Microsatellite Comparisons

Repository IDGenderFamily IDRelationship to probandTHO-1D5S592D10S526VWA31D22S417FES/FPSSTR_101818STR_109310STR_1285164STR_222658STR_308935STR_359667
ND08549FNINDS0760Daughter159 159178 186246 254149 153173 185224 228300 356291 303158 194325 351197 213378 390
ND08550MNINDS0760Son159 159178 186246 254149 153173 185224 228300 356303 343158 194351 361161 213390 402
ND36045FNINDS4814Daughter159 163182 186194 254149 153173 185220 224340 350335 343194 194325 351165 217334 378
ND44362F  159 163182 186194 254149 153173 185220 224304 308291 339194 194349 353141 145326 378
ND03713M  159 167178 178202 246153 157185 185224 224312 360303 343194 194325 349165 217330 330
ND14295MNINDS1306Proband159 167178 178202 246153 157185 185224 224344 344295 335194 234325 349197 197362 398
ND10753MNINDS1063Spouse159 175182 186194 246153 157173 173220 224348 356295 347150 194325 349169 169330 364
ND43103F  159 175182 186194 246153 157173 173220 224320 324331 343194 194349 353145 221350 394
ND11464M  163 175178 182198 202149 153173 185228 228348 348295 299194 230349 351141 169382 382
ND43406M  163 175178 182198 202149 153173 185228 228304 340291 331194 194325 349209 217326 382
ND14406F  163 175182 186194 202153 153173 193220 228344 364299 303194 194325 325157 205334 364
ND43372F  163 175182 186194 202153 153173 193220 228340 342299 327194 226325 325165 197350 364
ND38417MNINDS5712Sibling167 175178 182190 190145 149177 177224 224308 312295 299150 194325 349201 225330 382
ND41175MNINDS5712Proband167 175178 182190 190145 149177 177224 224308 312295 299150 194325 349201 217330 376
ND31713M  171 175178 178194 242141 149173 177224 228304 324299 299166 234325 401169 169358 382
ND45055M  171 175178 178194 242141 149173 177224 228340 344299 303154 194349 397165 213378 386
ND36600MNINDS4727Sibling175 175182 190194 194149 153177 189224 228328 340295 331150 226337 349141 165378 386
ND43425F  175 175182 190194 194149 153177 189224 228304 340327 339194 194325 393157 209334 370
ND07492F  175 175186 190238 246149 153185 197224 228320 344295 299194 194325 389193 201390 402
ND14800F  175 175186 190238 246149 153185 197224 228340 360327 335158 210325 393169 229354 374
Microsatellite Comparisons

Discussion

MSAT assays offer a fast and cost-effective way[15,16] for biobanks to establish a genetic fingerprint of each biospecimen submission to serve as a reference against other submissions as well as against biospecimens derived from the same submission to ensure sample authentication and sample integrity over time. We have routinely used a multiplexed assay of six MSATs to establish genetic profiles for each biobank submission. While the theoretical discriminatory power of this standard assay is one in one million, in practice we occasionally identify matching profiles between biospecimens collected from distinct people. We previously utilized the Identifiler Plus assay for additional resolution in these cases; however, this assay includes 13 CODIS markers and the NIGMS Repository Scientific Advisory Committee in collaboration with the NIGMS Repository team at Coriell felt an alternative that did not rely on MSAT markers used by law enforcement would offer added protection to individuals who contribute samples to biorepositories. We developed and validated a new multiplexed MSAT assay to replace our reliance on the Identifiler Plus assay to discriminate samples that could not be distinguished by our standard MSAT 6-plex assay alone. We leveraged existing data from over 48,000 research participants that have donated biospecimens to the NINDS Repository and identified a subset of 10 submission pairs from distinct individuals with matching profiles using our standard MSAT assay. The results from our new MSAT assay demonstrate the ability to distinguish all 10 NINDS Repository matched pairs that previously required the Identifiler Plus assay to differentiate (Table 1). This discrimination power holds even for the subset of related individuals included in Table 1. Therefore, in cases where our standard MSAT assay is unable to distinguish apparently unrelated or related but distinct individuals, adding this new MSAT assay offers additional and adequate discrimination power without reliance on the Identifiler Plus assay. More generally, we have found that there are more than enough polymorphic MSATs in the human genome to use for biobanking applications without the need to rely on CODIS markers. In this study, we share the development and validation of a new multiplexed MSAT assay that demonstrates discriminatory power between apparently unrelated as well as between related individuals without the inclusion of CODIS markers.
  40 in total

1.  Quality standards in Biobanking: authentication by genetic profiling of blood spots from donor's original sample.

Authors:  Sergio Cardoso; Laura Valverde; Adrian Odriozola; Xabier Elcoroaristizabal; Marian M de Pancorbo
Journal:  Eur J Hum Genet       Date:  2010-03-17       Impact factor: 4.246

2.  Centre d'etude du polymorphisme humain (CEPH): collaborative genetic mapping of the human genome.

Authors:  J Dausset; H Cann; D Cohen; M Lathrop; J M Lalouel; R White
Journal:  Genomics       Date:  1990-03       Impact factor: 5.736

3.  Selection and implementation of expanded CODIS core loci in the United States.

Authors:  Douglas R Hares
Journal:  Forensic Sci Int Genet       Date:  2015-03-12       Impact factor: 4.882

4.  ART-DeCo: easy tool for detection and characterization of cross-contamination of DNA samples in diagnostic next-generation sequencing analysis.

Authors:  Alice Fiévet; Virginie Bernard; Henrique Tenreiro; Catherine Dehainault; Elodie Girard; Vivien Deshaies; Philippe Hupe; Olivier Delattre; Marc-Henri Stern; Dominique Stoppa-Lyonnet; Lisa Golmard; Claude Houdayer
Journal:  Eur J Hum Genet       Date:  2019-01-25       Impact factor: 4.246

Review 5.  Patterns of linkage disequilibrium in the human genome.

Authors:  Kristin G Ardlie; Leonid Kruglyak; Mark Seielstad
Journal:  Nat Rev Genet       Date:  2002-04       Impact factor: 53.242

6.  Comprehensive human genetic maps: individual and sex-specific variation in recombination.

Authors:  K W Broman; J C Murray; V C Sheffield; R L White; J L Weber
Journal:  Am J Hum Genet       Date:  1998-09       Impact factor: 11.025

7.  Certificates of confidentiality: legal counsels' experiences with and perspectives on legal demands for research data.

Authors:  Leslie E Wolf; Lauren A Dame; Mayank J Patel; Brett A Williams; Jefferey A Austin; Laura M Beskow
Journal:  J Empir Res Hum Res Ethics       Date:  2012-10       Impact factor: 1.742

8.  Primer3--new capabilities and interfaces.

Authors:  Andreas Untergasser; Ioana Cutcutache; Triinu Koressaar; Jian Ye; Brant C Faircloth; Maido Remm; Steven G Rozen
Journal:  Nucleic Acids Res       Date:  2012-06-22       Impact factor: 16.971

9.  Development of a cost-effective high-throughput process of microsatellite analysis involving miniaturized multiplexed PCR amplification and automated allele identification.

Authors:  Truc T M Nguyen; Shaheen E Lakhan; Barry A Finette
Journal:  Hum Genomics       Date:  2013-03-05       Impact factor: 4.639

10.  How Could the Ethical Management of Health Data in the Medical Field Inform Police Use of DNA?

Authors:  Gaelle Krikorian; Joëlle Vailly
Journal:  Front Public Health       Date:  2018-05-29
View more
  1 in total

1.  A high-throughput real-time PCR tissue-of-origin test to distinguish blood from lymphoblastoid cell line DNA for (epi)genomic studies.

Authors:  Lise M Hardy; Yosra Bouyacoub; Antoine Daunay; Mourad Sahbatou; Laura G Baudrin; Laetitia Gressin; Mathilde Touvier; Hélène Blanché; Jean-François Deleuze; Alexandre How-Kit
Journal:  Sci Rep       Date:  2022-03-18       Impact factor: 4.379

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.