Literature DB >> 28199790

An Integrated Microfluidic Processor for DNA-Encoded Combinatorial Library Functional Screening.

Andrew B MacConnell¹, Alexander K Price¹, Brian M Paegel¹.

Abstract

DNA-encoded synthesis is rekindling interest in combinatorial compound libraries for drug discovery and in technology for automated and quantitative library screening. Here, we disclose a microfluidic circuit that enables functional screens of DNA-encoded compound beads. The device carries out library bead distribution into picoliter-scale assay reagent droplets, photochemical cleavage of compound from the bead, assay incubation, laser-induced fluorescence-based assay detection, and fluorescence-activated droplet sorting to isolate hits. DNA-encoded compound beads (10-μm diameter) displaying a photocleavable positive control inhibitor pepstatin A were mixed (1920 beads, 729 encoding sequences) with negative control beads (58 000 beads, 1728 encoding sequences) and screened for cathepsin D inhibition using a biochemical enzyme activity assay. The circuit sorted 1518 hit droplets for collection following 18 min incubation over a 240 min analysis. Visual inspection of a subset of droplets (1188 droplets) yielded a 24% false discovery rate (1166 pepstatin A beads; 366 negative control beads). Using template barcoding strategies, it was possible to count hit collection beads (1863) using next-generation sequencing data. Bead-specific barcodes enabled replicate counting, and the false discovery rate was reduced to 2.6% by only considering hit-encoding sequences that were observed on >2 beads. This work represents a complete distributable small molecule discovery platform, from microfluidic miniaturized automation to ultrahigh-throughput hit deconvolution by sequencing.

Entities: Chemical Gene

Keywords: DNA-encoded synthesis; combinatorial compound libraries; miniaturized automation

Mesh：

Substances：

Year: 2017 PMID： 28199790 PMCID： PMC5350604 DOI： 10.1021/acscombsci.6b00192

Source DB: PubMed Journal: ACS Comb Sci ISSN： 2156-8944 Impact factor: 3.784

Introduction

Combinatorial chemistry is currently experiencing a renaissance, with DNA-templated[1,2] and -encoded[3,4] synthesis technologies squarely at the epicenter of this movement. DNA-encoded libraries (DELs) store complex small-molecule structural information in an associated DNA sequence,[5] analogous to display-type technologies that associate a peptide with its encoding nucleic acid.[6−8] DELs, however, bridge a significant chemotype divide by moving library content away from the space of naturally occurring biopolymers found in display libraries and into the more drug-like space of high-throughput screening (HTS) compound collections.[4,9,10] The plunging costs of both DNA synthesis and next-generation sequencing (NGS) have positioned DEL technology as a promising companion to industrial HTS platforms[11] because it offers access to larger compound collections and lead identification is more economical and expeditious. Unlike conventional HTS in which compound library members are directly assayed for the desired function, leads from DELs are generated from selections, using binding as a surrogate for all function (e.g., inhibition, agonism). The DEL is incubated with the target, the unbound fraction is washed away, and the bound fraction is eluted, amplified, and sequenced.[4] Ligands have been discovered in this fashion for several targets, including sirtuins,[12] InhA,[13] BCATm,[14] and neurokinin-3 receptor.[15] Analysis of the sequencing data from a DEL selection can predict relative binding affinities between hits,[16] but compounds are ultimately resynthesized and evaluated individually. One can bias the hit pool by eluting target-bound library members using a known functional competitor ligand, but such ligands may not be available for every target. Beyond this measure, evaluating individual DEL members for function other than binding in the primary selection would require a single-molecule process, such as the first selections of catalytic RNA.[17] In vitro compartmentalization,[18] another single-molecule approach, has recently found use in single-molecule DEL binding assays.[19] Combinatorial libraries prepared via solid-phase synthesis[20−22] fundamentally differ from DELs in that individual, microscopic beads each display many copies of a single library member. This one-bead-one-compound (OBOC)[22] format presents the opportunity to screen library members directly for function by liberating them from the bead surface into discrete volumes for analysis. For example, bead libraries have been spread on culture plates to discover antibacterial compounds,[23] distributed into microplates to generate HTS compound library stock solutions,[24] arrayed in various microfabricated ultralow-volume well arrays,[25,26] and even nebulized into Petri dishes.[27] Many of these approaches, however, lack automated strategies to address the bottlenecks associated with screening and structure elucidation, which are currently manual and serial. Solutions to these OBOC handling and screening automation challenges are now within reach after a decade of component-level technology development in the field of microfluidics. The discovery of segmented flow or “droplet” microfluidics[28,29] and its utility in high-throughput reaction monitoring[30,31] led to a flood of components for droplet sorting,[32,33] splitting,[34] incubation,[35] synchronization,[36,37] and sampling[38−40] among many others. Facile droplet-scale component integration has paved the way to advanced circuit architectures for highly sophisticated, multistage biochemical reaction assembly, processing, and analysis.[41−45] In a previous study, we disclosed an integrated microfluidic processor for light-induced and -graduated high-throughput screening after bead release (SABR), wherein droplet-scale assay initiation with photochemical compound dosing control, incubation, and high-sensitivity detection occur in flow.[46] In this study, droplet sorting was fully integrated into the hvSABR processor to perform a functional screen of a model DNA-encoded combinatorial compound bead library. Hit deconvolution by NGS provides a powerful, quantitative means of rejecting noise resulting from the Poisson process of bead encapsulation in droplets.

Results and Discussion

An integrated microfluidic circuit (Figure A) combines assay reagents, encapsulates model library beads into droplets, photochemically cleaves compound from the bead, incubates the dosed droplets, and sorts droplets based on their individual fluorescence intensity. Model library beads are suspended in a capped pipet tip, or “suspension hopper,”[40] filled with bead hopper buffer (BHB) that was supplemented with sucrose (13% w/v) to tune density-driven sedimentation, and thereby bead introduction rate. Two aqueous inputs allow for mixing of assay reagent streams immediately prior to droplet generation. A mixture of assay probe and internal standard (R-phycoerythrin) in BHB is introduced via AQ1, which sweeps library beads from LIB toward the flow-focusing junction for encapsulation. Cathepsin D (CatD) in protease activity assay buffer is introduced via AQ2. Following library bead encapsulation in activity assay-filled droplets, an integrated waveguide precisely irradiates (λ = 365 nm) the droplet flow to induce photochemical cleavage of compound from the bead into the droplet volume. UV LED power was calibrated[46] to maintain consistent UV intensity and thereby compound dosing (1–3 μM) between experiments. Droplets dosed with compound are then incubated (18 min) within a Frenz-type delay line (20 cm long, 1 mm wide, 133 μm deep).[35] At the end of the delay line, droplets are focused back into single file. Additional spacing oil from OIL2 and guiding oil from OIL3 ensure adequate droplet separation and positioning before detection and sorting. Droplets enter the sorting junction (orange box),[33] the confocal laser-induced fluorescence (LIF) detectors measure droplet fluorescence (2000 Hz), and the software makes a sort decision (Figure B). By default, all droplets flow through the sorting junction to OUT. If a droplet is determined to contain an active compound, a salt water electrode (4 M NaCl, orange)[47] delivers a high-voltage AC pulse to the sorting junction, generating an electric field that deflects the desired droplet toward HIT output (Figure C).[32]

Figure 1

Microfluidic circuit schematic. (A) Oil, aqueous phase assay components 1 and 2, and model DNA-encoded library beads enter the device at inputs OIL1, AQ1 and AQ2, and LIB, respectively. OIL1 flow encapsulates library beads in droplets of assay reagent at the flow-focusing junction, where OIL1 meets the combined AQ1 and AQ2 streams. Droplets flow through a serpentine channel, where an integrated waveguide irradiates droplets with UV, photochemically liberating compound from the bead into the droplet volume. Droplets then flow into a deep, 20 cm-long channel for incubation before auxiliary oil inputs (OIL2, OIL3) separate and guide droplets at the sorting junction (orange box). Channel depths are color coded: shallow 32 μm (cyan), standard 57 μm (black, orange, green), and deep 133 μm (dark blue). (B) By default, droplets flow to the primary output (OUT) unless the droplet LIF profile (detection laser spot shown as a blue star) defines the droplet as a hit. When the system detects a hit-containing droplet, it energizes a salt water electrode (VAC, orange) that dielectrophoretically deflects the hit-containing droplet to the collection output (HIT). The gapped sort divider (cyan) facilitates droplet deflection while minimizing droplet splitting. A salt water ground moat (green) shields the incubation circuit. (C) Micrographs show a selected droplet (green, false color) deflecting into the HIT output channel in response to an 8-ms sorting voltage pulse (500 Vpp, 10 kHz). Neighboring droplets do not deflect and continue to OUT. Scale = 500 μm. Model libraries containing two types of DNA-encoded library beads (10-μm diameter,Figure A) were prepared to assess droplet sorting performance and to investigate postsort sequence-based bead counting. A photocleavable linker is appended with either a positive control inhibitor (Glu-pepstatin A bead, 1) or negative control (N-acetyl Glu bead, 2). DNA-encoding sequences are installed modularly by split-and-pool enzymatic cohesive-end ligation onto headpiece DNA (HDNA)[4] functionalized resin, which validated as supporting ligation and qPCR amplification of 52000 DNA molecules/bead.[48] Split-and-pool combinatorial oligonucleotide ligations using 10 different encoding modules in encoding positions 1–4 (Supporting InformationT1–T2) afforded 104 distinct bead-specific barcodes (BSB) onto both 1 and 2 beads.[49] Compound encoding region (ER) modules in encoding positions 5–10 were similarly installed. Bead set 1 was encoded using 3 different encoding modules at positions 5, 7, and 9, and 3 different modules at positions 6, 8, and 10, yielding (3 × 3)3 = 729 distinct ER sequences. Bead set 2 was encoded using 3 different encoding modules at positions 5, 7, and 9, and 4 different modules at positions 6, 8, and 10, yielding (3 × 4)3 = 1728 distinct ER sequences. The modules used to encode bead sets 1 and 2 were distinct from one another; there were no duplicated sequences between the 729 possible ER sequences of bead set 1 and the 1728 ER sequences of bead set 2. Combining the ER sequences for each bead set with the 104 BSB sequences employed, the 10-position encoding strategy produced a total of 7.29 × 106 and 1.73 × 107 unique sequences for bead sets 1 and 2, respectively. Ligation of a reverse primer module (Figure A, inset), displaying a position-specific overhang, randomized 8-nt unique molecular identifier (UMI, green, inset), and reverse primer binding site, completed the DNA encoding tag. Five bead lots (100 beads each) of encoded bead sets 1 and 2 were analyzed by qPCR and average encoding tags/bead were measured (1, 4.3 ± 0.8 × 104 tags/bead; 2, 2.5 ± 0.2 × 104 tags/bead). Positive control resin 1 was labeled with 5(6)-carboxytetramethylrhodamine (TMR) fluorophore (orange), allowing rapid visual differentiation of 1 from 2 in a mixture (Figure B).

Figure 2

Model DNA-encoded library bead structures. Lysine, linker (gray), photocleavable linker, and Glu were sequentially coupled to 10-μm-diameter TentaGel resin. (A) Positive control inhibitor beads 1 display pepstatin A (red) coupled to Glu. The linker is labeled with 5(6)-carboxy TMR fluorophore (orange). Negative control beads 2 were prepared by acetylating the Glu α amine and linker amine (cyan). Bead sets were substoichiometrically functionalized with azido DNA headpiece (HDNA) via CuAAC. The DNA encoding sequence was installed by split-and-pool combinatorial enzymatic ligation. The BSB region contained 10 unique sequence modules at each of 4 positions (1—4, 104 possible BSBs). The encoding regions (ER, 5—10) contained either 729 (Glu-pepstatin A positive control beads, 1) or 1728 (N-acetyl-Glu negative control beads, 2) possible sequences. The DNA sequence terminates with ligation of a reverse primer module containing the reverse PCR primer binding site flanking an internal unique molecular identifier (UMI, green, inset). The UMI is a random 8-mer (65 536 possible sequences). (B) Micrographs of a model library containing positive and negative control beads 1 and 2 visualized in brightfield (left) and brightfield overlay with TMR fluorescence emission (λex = 550 nm; λem = 570 nm; right) illustrate facile differentiation between the two bead types. Scale = 100 μm. DNA-encoded solid-phase synthesis (DESPS) of combinatorial libraries introduces numerous advantages, from synthesis scaling to analysis throughput. The sensitivity of PCR permits synthesis and encoding tag detection on miniaturized resin (10-μm diameter, 0.4 pmol/bead) and NGS massively parallelizes structure elucidation of hit-bead collections. Redundant libraries, in which multiple beads display the same compound library member, are particularly valuable for screening. Despite sacrificing screening throughput, redundancy adds statistical confidence to hit prediction. True positive hits tend to reproduce the desired screening outcome on multiple beads (replicates); false positive hits tend to be observed only once.[50,51] The limited set of 729 encoding sequences for bead set 1 ensured that any given hit collected from screening a model library (∼1500 total 1) would likely share encoding region sequence with another hit bead. Differentiating replicates requires the BSBs, since the ER sequences would be identical otherwise.[49] Downstream quantitative sequence counting requires the incorporation of UMIs. The integrated circuit was first used to investigate droplet sorting performance in a hvSABR screen of a model library. Model library (∼100 000 beads, 1.5% bead set 1) was introduced via suspension hopper into droplets containing CatD, fluorogenic peptide probe, and internal standard. Droplets were irradiated, incubated, and detected using confocal LIF (Figure A). Each droplet’s LIF profile was compared to a predetermined threshold. CatD was uninhibited in both unoccupied droplets and droplets containing a negative control bead 2, efficiently digesting the fluorogenic probe and increasing droplet fluorescence in the 520 nm channel. These droplets flow by default to waste. CatD was inhibited in droplets containing at least one positive control hit bead 1. As a result, fluorogenic probe remained undigested and droplet fluorescence remained low. If the droplet fluorescence was below the inhibition threshold, the software triggered the generation of an electric field, which deflected the inhibitor bead-containing droplet for collection. Real-time data smoothing eliminated bead strike-related spikes in fluorescence intensity that would potentially compromise hit identification (Figure B). Histogram analysis of assay performance (Figure C) revealed two distinct droplet populations. CatD inhibition (positive, red) was observed in 1081 droplets over the 240 min experiment duration. Uninhibited droplet populations (negative, blue hues) were plotted as two 3 min time slices that represented the upper and lower extremes. Uninhibited droplet probe fluorescence values oscillated (period = 20 min, amplitude = 787 counts, Supporting Information S1) as did the internal standard signal (phycoerythrin, λem = 570 nm). Pairing the entire population of positive control droplet fluorescence values with either of the two negative control droplet populations to calculate assay quality score, Z′,[52] gave Z′ = 0.75 and 0.63. A separate negative control experiment was conducted wherein model library beads were introduced into droplets, incubated, and detected without UV photochemical compound cleavage. Droplet fluorescence intensities under the inhibition threshold (1000 counts) were not detected under these conditions (Supporting Information S2).

Figure 3

Droplet-based CatD activity assay. (A) Following incubation, assay droplets approach the sorting junction and traverse the LIF detection point (λex = 488 nm, blue cone). A PMT detects the droplet fluorescence emission (λem = 520 nm) to generate a droplet intensity profile. Droplets containing negative control beads 2 or no bead (empty droplet) exhibited high fluorescence intensity (blue droplet, blue trace). Droplets containing positive control inhibitor beads 1 exhibited low fluorescence intensity due to CatD inhibition (red droplet, red trace). If the profile maximum fell below a set threshold, the droplet was sorted and collected in the HIT output. (B) Raw droplet fluorescence emission signal (blue line) is plotted with real-time median-smoothing (filled). If droplet fluorescence was below the threshold (red dotted line), the droplet was a hit (red fill). (C) Droplet fluorescence maxima are plotted as a histogram to display assay performance. The hit droplet population for the entire run is plotted (0—240 min, 1081 droplets, red) and compared to 2 representative 3 min sections of negative droplet populations (146—149 min, 6161 droplets, dark blue; 156—159 min, 6408 droplets, light blue). Robust assay performance was easily attainable (Z′ > 0.5) in <20 min incubation time due to the sensitivity of LIF detection and the large number of droplets sampled,[46,53] however the experiment illuminated systematic error. The regular oscillation in droplet fluorescence may stem from fluctuations in the low AQ1 and AQ2 flow rates (AQ1, 0.3 μL/min; AQ2, 0.2 μL/min). Fluorescence emission oscillation in the internal standard, which is only present in AQ1, further corroborates this hypothesis. We are currently investigating the cause of this oscillatory behavior and methods to minimize its impact. The model library screen also demonstrated that photochemical cleavage is required in order to observe enzyme inhibition in droplets. Pepstatin A is an extremely potent (IC50 < 1 nM)[54] inhibitor of CatD activity and, despite this fact, library screening under conditions of no UV irradiation yielded no detectable hit beads. One might expect that all of the target would bind the bead surface driven by target-inhibitor interaction. After all, on-bead binding-based screens of OBOC libraries rely precisely on this concept. However, given reasonable affinity of the probe for target, the probe is likely competing for target-bead binding, continually driving the equilibrium toward solution-phase reaction. Droplets with fluorescence below the inhibition threshold were sorted and collected. Droplet sorting performance was visually analyzed by overlaying the bright field and probe fluorescence images (Figure A, FAM), and by overlaying the positive control bead 1 fluorescence (TMR). Of the total 975 droplets in the plane of focus, 871 contained at least one 1 bead, while all 896 bead-occupied droplets contained 1247 1 + 2 beads. Bead set 1 occupancy was 871 droplets containing 890 beads. Co-encapsulation frequencies for droplets containing 1 and 2 were counted, plotted, and compared to a model Poisson distribution (Figure B). The total population of bead set 2 (357) included those coencapsulated with 1, (i.e., “passenger” beads; 252 droplets, 332 passenger beads), and those encapsulated without 1 (16 droplets, 25 false-positive beads). Within collected droplets, droplet occupancy agreed closely with a model Poisson distribution (droplet occupancy occupancy: λdrop = 0.68 beads/droplet). Deviation from Poisson (gray) increased at higher occupancy 5-bead events. Image analysis of droplet fluorescence indicated two high-intensity droplets. Medium-intensity droplets (n = 29) either contained no beads (n = 26) or contained only a negative control bead 2 (3 droplets).

Figure 4

Sorted hit droplet collection. (A) Droplets exhibiting CatD inhibition were sorted, collected, and visually inspected. A brightfield/probe fluorescence overlay (FAM, top left) and positive control bead fluorescence (TMR, top right) confirmed the identity of positive control inhibitor beads 1. Magnified regions (i, ii) contain an overlay of brightfield and both fluorescence channels. Positive control beads 1 (white) are coencapsulated with negative control beads 2 (blue false color). (B) The overlay images were used to measure hit droplet occupancy (n = 975 droplets). For droplets containing at least one positive control inhibitor bead 1, the observed droplet occupancy frequency (red) was plotted against the occupancy frequency predicted by the Poisson distribution function (λdrop = 0.68, yellow). Scale = 100 μm. System performance was evaluated by visual inspection to quantitate sources of negative control beads 2 in the hit bead collection. High-intensity droplets (n = 2) were likely collected as a result of mis-sorting. Medium-intensity droplets (n = 29), suggesting weak inhibition, were observed in the absence of a positive control bead. These droplets may have been contaminated by a bead-fragment that contributed a low dose of the highly potent pepstatin A. All other droplets in the FAM composite image showed pronounced inhibition, even droplets that were apparently empty, indicating that the system maintained a low error rate in sorting (0.2% sort failure). According to the overall sorting data set, the predominant source of 2 within the hit collection stemmed from Poisson-limited coencapsulation. Either increasing droplet generation frequency or slowing bead introduction (by increasing BHB solution density) rate is likely to reduce this source of 2 in the hit collection. Furthermore, we hypothesized that screening redundant libraries (>3 average replicates) and prioritizing replicate hit beads might form an additional mechanism for rejecting these otherwise randomly selected false positive hits. To test this hypothesis, a hvSABR screen for DNA sequencing analysis was performed using another model library (60 000 beads, 3.2% bead set 1) and droplets were collected as described. The system performed 1518 droplet sorts, visual inspection of a subset of the sorted droplets (n = 1188) revealed 1532 total beads comprised of 1 (1166, 76%) and 2 (366, 24%), in agreement with the output of the previous screen (1: 890, 71%; 2: 357, 29%). The exact total bead count was unknown, therefore calibration standards containing positive control bead 1 lots (30, 100, 300, 1000, 3000 beads) were prepared by FACS in triplicate, and analyzed by qPCR (Supporting Information S3). NGS analysis of the counted calibration bead set and the hit collection returned a set of DNA sequences with the number of reads and a list of UMIs for each sequence. Each encoding sequence’s UMI list was distance filtered (see Supporting Information) prior to counting. Sequences were rank-ordered by UMI counts (bead index) and plotted (Figure ). For each lot size, the UMI counts descended to baseline as the bead index approached ∼85% of the expected aliquot size. A clear inflection point is observed near the expected number of beads for each lot sample (gray lines). The inflection point was evaluated by selecting the bead index with a maximum %ΔUMIw, a normalized derivative analysis of the rank-ordered UMI counts. The bead index cutoff values (25, 84, 270, 861, 2681) for the FACS-counted standard sets were consistent between standards (83%, 84%, 90%, 86%, 89%) and in agreement with the Poisson-limited sort yield specification for the instrument (>80%). The %ΔUMIw analysis of the hit bead collection indicated a bead index cutoff (1863) in agreement with the extrapolated estimate from visual inspection (1957).

Figure 5

Hit bead collection quantitation by sequence analysis. Standards of positive control pepstatin A beads 1 were prepared by FACS in known lot sizes (30, 100, 300, 1000, 3000; blue hues). The DNA encoding tags of standard lots and the hit bead collection were each amplified in bulk and sequenced. Sequencing coverage for the hit bead collection and 3000-bead lot were 6-fold lower than other standard lots (30, 100, 300, 1000). Reads were aligned to a degenerate reference sequence, edited for single-base errors, aggregated by sequence, and counted by the N8 UMI after enforcing a Hamming distance of >1 per UMI. (A) Unique encoding sequences were rank ordered and plotted by UMI counts (bead index). Each plot contained an inflection (large data point) as the bead index approached the known lot size. (B) A normal weighted average UMI value (UMIw) was generated for each bead index, and the inflection point for each curve was approximated at the bead index yielding a maximum ΔUMIw/UMIw (% change in UMI; inset). The inflection x coordinate is the bead index cutoff (25, 84, 270, 861, 1863, 2681). The bead index cutoff empirically defines the point at which sequencing reads no longer correspond to bead-derived encoding DNA. Sequencing reads from these “imposter” amplicons constitute a real interference in decoding the hit bead sequence pool as they align to the encoding reference sequence and contain otherwise legitimate ER sequences. The hvSABR output posed a challenge for bead counting because droplets encapsulate multiple beads based on the Poisson distribution,[40] thus the droplet sort event count underestimates the actual number of beads collected. Analysis of the calibration bead lots demonstrated that the bead count can be inferred from the NGS data. By expanding the language to 10 encoding positions (1010 encoding depth) and augmenting the tag structure with both combinatorial BSBs and UMIs for molecule counting,[4,55,56] it was possible to extract bead-derived structure-encoding hits from the NGS data for both control bead sets and hit bead collections from a model library screen. However, the heterogeneous effects of diverse library synthesis reaction conditions on encoding tag integrity may complicate the above approach,[57] necessitating more sophisticated high-speed droplet imaging[58] in order to obtain a direct bead count in situ. NGS analysis of the hit bead collection was consistent with the visual inspection and contained numerous replicate hit sequences. Using the bead index cutoff determined by %ΔUMIw inflection analysis (1863), bead sequences were decoded as either 1 or 2 based only on ER sequence, which represented unique compounds. Unique compounds (ER sequence) were then binned into replicate classes (k, Table ) based on the number of BSBs associated with each ER sequence. The total bead counts of 1 (1380, 74%) and 2 (483, 26%) matched the visual analysis (76% and 24%, respectively). The replicate analysis experimentally demonstrated that Poisson-limited false positives in the hit collection can be readily rejected as randomly selected compounds. Significantly, large redundancy is not necessary to achieve this result. Our model library emulated a 60 000-bead screen at 3.2% hit rate (1920 1 beads) encoded such that hits were only present on average at 2.5-fold redundancy (729 ER sequences), meanwhile the negative control bead population (58 000 2 beads) was oversampled at 33-fold redundancy (1728 ER sequences). We observed a total of 627 of 729 possible 1 ER sequences, and 409 of 1728 possible 2 ER sequences. Partitioning these into replicate classes clearly differentiated authentic positives (1) and false positives (2). Each replicate class is summarized by a false discovery rate (FDR) and the hit recovery as a fraction of unique 1 contained within the class. Even in the limit of this modest hit redundancy and pessimistic oversampling of negative controls, when we only consider replicate classes k > 2, we isolate primarily 1 (222 unique ER) and few 2 (6 unique ER) giving a low FDR (2.6%).

Table 1

Hit Bead Collection by Replicate Class

The validated microfluidic hvSABR lead identification architecture is now ready for analysis of several assays that have already been adapted to droplet scale. Example applications include biochemical assays of HIV protease activity[40] or histone deacetylase activity,[59] two recent enzyme targets that were explored in bids to move functional screening off the bead surface. There is also significant potential in exploring other challenging target classes, such as protein–protein interactions[60,61] or revisiting the miniaturization of phenotypic assays.[25,27] As assay scope expands, it will also likely become advantageous to adapt alternative detection strategies, such as fluorescence polarization,[62] which could result in a new approach to screening combinatorial libraries directly for solution-phase ligand binding. In conclusion, we used a model DNA-encoded compound library to demonstrate that hvSABR-based lead identification can economically identify functional library members with high confidence. The screen consumed 120 μL of mixed assay volume and surveyed up to 100 000 library beads (0.05 mg library beads) over 4 h of automated operation, demonstrating that the platform can interrogate a whole library using little material and without the need for robotic automation. Adding UMI and BSB features to the DNA encoding language represented minimal modifications to the DESPS protocol, but enabled powerful noise rejection at both the sequencing and hit structure identification stages. The hvSABR circuit thus provides distributable and economical library screening automation that matches the efficiency of DNA-encoded solid-phase combinatorial library synthesis and structure elucidation.

Experimental Procedures

Materials Sources

All reagents were obtained from Sigma-Aldrich (St. Louis, MO) unless otherwise specified. 5-Azidopentanoic acid, N-hydroxysuccinimide (NHS), 1,3-bis[tris(hydroxymethyl)methylamino]propane (Bis-Tris), tris(hydroxymethyl)aminomethane (Tris), 2-(N-morpholino)ethanesulfonic acid hydrate (MES), pepstatin A, N,N′-diisopropylcarbodiimide (DIC), ethyl 2-cyano-2-(hydroxyimino)acetate (Oxyma), 1-hydroxy-7-azabenzotriazole (HOAt), N,N′-diisopropylethylamine (DIEA), 2,4,6-trimethylpyridine (TMP), propargylamine, triisopropylsilane (TIPS), α-cyano-4-hydroxycinnamic acid (HCCA), bromoacetic acid (BAA), trifluoroacetic acid (TFA), acetic anhydride, mineral oil, triethylammonium acetate (TEAA, Life Technologies, Carlsbad, CA), M-280 streptavidin-coated magnetic resin (Life Technologies), biotin N-hydroxysulfosuccinimidyl ester (biotin-sNHS, Pierce Biotechnologies, Rockford, IL), Taq DNA polymerase (Taq, New England Biolabs, Ipswich, MA), 2′-deoxyribonucleotide triphosphate (dNTP, set of dATP, dTTP, dGTP, dCTP, Promega Corp., Milwaukee, WI), N-α-Fmoc-Lys(Mtt)–OH (AnaSpec, Inc., Fremont, CA), N-α-Fmoc-Arg(Pbf)–OH (AnaSpec), N-α-Fmoc-Gly-OH (AnaSpec), Fmoc-Glu(OtBu)–OH (AnaSpec), 5-(6)-carboxytetramethylrhodamine (TMR, AnaSpec), R-phycoerythrin (AnaSpec), HiLyte Fluor 488/QXL520-based FRET peptide substrate (AnaSpec), bovine serum albumin (BSA, Roche Diagnostics, Indianapolis, IN), 4-{4-[1-(9-fluorenylmethyloxycarbonylamino)ethyl]-2-methoxy-5-nitrophenoxy}butanoic acid (Fmoc-Photolinker-OH, Advanced ChemTech, Louisville, KY), DMF-A-6CS (Shin-Etsu, Akron, OH), KF-6038 (Shin-Etsu), polydimethylsiloxane (PDMS, Dow Corning, Midland, MI), trimethylsiloxy-terminated PDMS (200 cSt, Gelest Inc., Morrisville, PA), and Brilliant Violet 510 Streptavidin (BV-510, BioLegend, San Diego, CA) were used as provided. Tris[(1-benzyl-1H-1,2,3-triazol-4-yl)methyl]amine (TBTA) was recrystallized three times in t-BuOH/H2O (1:1).[63] Solvents used in solid-phase synthesis were dried over molecular sieves (3 Å, 3.2 mm pellets). All solid-phase synthesis was conducted in a UV-free environment.

Buffers and Oils

Cathepsin D protease activity assay buffer (100 mM NaOAc, 1 M NaCl, 1 mM EDTA, 2% DMSO, 1% BSA, 0.1% Tween 80, pH 4.7), MES buffer (100 mM MES, 1 M NaCl, 1 mM EDTA, 2% DMSO, 1% BSA, pH 6.5), bead hopper buffer (BHB, 1 mM sodium phosphate, 10 mM NaCl, 2% DMSO, 13% sucrose, 1% BSA, 0.1% Tween 80, pH 7.0), bind and wash buffer (BWB, 1 mM EDTA, 2 M NaCl, 10 mM Tris, pH 7.5), bind and wash buffer with Tween (BWBT, 1 mM EDTA, 2 M NaCl, 10 mM Tris, 0.1% Tween 20, pH 7.5), 10× Bis-Tris propane ligation buffer (BTPLB, 500 mM NaCl, 100 mM MgCl2, 10 mM ATP, 0.2% Tween 20, 100 mM Bis-Tris propane, pH 7.6), Bis-Tris propane wash buffer (BTPWB, 50 mM NaCl, 0.04% Tween 20, 10 mM Bis-Tris propane, pH 7.6), Bis-Tris propane breaking buffer (BTPBB 100 mM NaCl, 10 mM EDTA, 1% SDS, 1% Tween 20, 10 mM Bis-Tris propane, pH 7.6), click reaction buffer (CRB, 50% DMSO, 30 mM TEAA, 0.04% Tween 20, pH 7.5), 10X PCR buffer (2 mM each dNTP, 25 mM MgCl2, 500 mM KCl, 100 mM Tris, pH 8.3), 1X GC-PCR buffer (1× PCR buffer, 8% DMSO, 1 M betaine), and denaturing polyacrylamide gel electrophoresis loading buffer (GLB, 6 M urea, 0.5 mg/mL bromophenol blue, 12% w/v Ficoll 400, 1× TBE buffer, pH 8.5) were prepared in DI H2O. Oil for microfluidic droplet formation (4:20:76, w/w/w, KF-6038, DMF-A-6CS, mineral oil) was prepared gravimetrically and mixed with gentle rotation (14 h, 8 rpm) prior to use.

Dual-Scale Photolabile Linker Synthesis

Two aliquots of dual-scale resin were prepared. Each dual-scale aliquot contained TentaGel M NH2 resin (10 μm, 0.23 mmol/g, 30 mg, Rapp-Polymere, Tübingen, Germany) and TentaGel MB Rink amide resin (160 μm, 0.41 mmol/g, 5 mg, Rapp-Polymere). Mixed-scale resin was swelled in DMF (1 h, RT) and transferred into fritted spin-columns (Mobicol Classic, large filter, 10-μm pore size, MoBiTec GmbH, Goettingen, Germany). Linker construction proceeded via iterative cycles of manual solid-phase peptide synthesis. Each cycle included: (1) Fmoc deprotection (20% piperidine in DMF, 2 × 450 μL, 5 min first aliquot, 15 min second aliquot); (2) N-α-Fmoc-amino acid (36 μmol, 450 μL DMF) activation with COMU/DIEA (36 μmol/72 μmol), and incubation (2 min, RT); (3) N-α-Fmoc-amino acid coupling to resin by transferring activated acid (450 μL) to resin and incubating with rotation (15 min, RT, 8 rpm). Each amino acid coupling was repeated once. After each deprotection and coupling step, reactants were expelled and the resin washed (DMF, 3 × 400 μL; DCM, 1 × 400 μL; DMF, 1 × 400 μL). N-α-Fmoc-Lys(Mtt)-OH, N-α-Fmoc-Gly-OH, N-α-Fmoc-Arg(Pbf)-OH, and N-α-Fmoc-Gly-OH were coupled sequentially. The pendant Fmoc-protected amine was deprotected and the resin washed (see above). The deprotected N-terminus was acylated by preparing a solution of BAA (112 μmol) and DIC (112 μmol) in DMF (450 μL) and incubating (2 min, RT), transferring the activated bromoacid (450 μL) to the resin, and incubating (15 min, 40 °C). The bromoacetylated resin was washed, propargylamine solution (1 M in DMF, 450 μL) was transferred to resin, and the resin was incubated with rotation (3 h, 50 °C, 8 rpm). The resin was washed, then N-α-Fmoc-Gly-OH was coupled as other amino acids above. The pendant Fmoc-protected amine was deprotected and the resin washed. Fmoc-photolinker-OH (36 μmol, 450 μL DMF) was activated with DIC/Oxyma/TMP (56.2 μmol/36 μmol/56.2 μmol) in DMF (450 μL), incubated (2 min, RT), then coupled to resin by transferring the activated acid (450 μL) to resin and incubating the resin with rotation (3 h, 37 °C, 8 rpm). Photolinker coupling reaction was repeated once, then the resin was washed (see above). Acetic anhydride (386 μmol) and TMP (386 μmol) were combined in DMF (450 μL), the solution added to resin, and the resin incubated (20 min, RT, 8 rpm). The pendant Fmoc-protected amine was deprotected and the resin washed. Fmoc-Glu(OtBu)-OH was coupled as above, and the resin was washed.

Dual-Scale Photolabile Glu-Pepstatin-A Resin Synthesis

An aliquot of dual-scale photolabile linker resin (35 mg) was deprotected (20% piperidine in DMF, 2 × 450 μL, 5 min first aliquot, 15 min second aliquot) and the resin washed (DMF, 3 × 400 μL; DCM, 1 × 400 μL; DMF, 1 × 400 μL). Pepstatin A (11.3 μmol) was activated with DIC/Oxyma/TMP (72 μmol/18 μmol/18 μmol) in DMF (450 μL), incubated (4 min, 60 °C) then coupled to resin by transferring the activated acid (450 μL) to resin and incubating with rotation (2 h, 60 °C, 8 rpm). Resin was washed (DMF, 3 × 400 μL; DCM, 3 × 400 μL), combined with Mtt deprotection cocktail (TFA/TIPS/DCM, 1:2:97, 400 μL) incubated with rotation (30 min, RT, 8 rpm), and washed (DCM, 2 × 450 μL). Resin was combined with a fresh aliquot of Mtt deprotection cocktail, incubated with rotation (30 min, RT, 8 rpm), and washed (DCM, 2 × 450 μL; DMF, 1 × 450 μL; 1% DIEA in DMF, 1 × 450 μL; DMF, 2 × 450 μL). 5(6)-TMR (39 μmol) was dissolved in DMSO (62 μL), combined with TMP (72 μmol), and COMU (36 μmol) in DMF (388 μL), and incubated (2 min, RT). The resin was combined with the activated 5(6)-TMR solution and incubated with rotation (120 min, 50 °C, 8 rpm). The resin was washed until washes were colorless and stored in DMF (4 °C).

Dual-Scale Photolabile N-Acetyl-Glu Resin Synthesis

An aliquot of dual-scale photolabile linker resin (35 mg) was deprotected (20% piperidine in DMF, 2 × 450 μL, 5 min first aliquot, 15 min second aliquot), washed (DMF, 3 × 400 μL; DCM, 3 × 400 μL), combined with Mtt deprotection cocktail (400 μL), incubated with rotation (30 min, RT, 8 rpm), and washed (DCM, 2 × 450 μL). Resin was combined with a fresh aliquot of Mtt deprotection cocktail, incubated with rotation (30 min, RT, 8 rpm), and washed (DCM, 2 × 450 μL; DMF, 1 × 450 μL; 1% DIEA in DMF, 1 × 450 μL; DMF, 2 × 450 μL). The resin was combined with acetic anhydride (386 μmol) and TMP (386 μmol) in DMF (450 μL), and incubated with rotation (20 min, RT, 8 rpm), washed (DMF, 3 × 400 μL; DCM, 1 × 400 μL; DMF, 1 × 400 μL), and stored in DMF (4 °C).

Photolabile Glu-pepstatin and N-Acetyl-Glu Resin Compound Characterization and Screening Bead Deprotection

Both resin samples were washed (DMF, 2 × 450 μL), sonicated (2 min, RT, B3510DTH, Branson Ultrasonics, Danbury, CT), and filtered (CellTrics 150-μm mesh, Sysmex-Partec, Lincolnshire, IL). The 160-μm resin particle samples (1 mg) were collected into fritted spin columns (Mobicol Classic), washed (DCM, 4 × 400 μL), and dried in vacuo. The dried 160-μm resin samples were combined with cleavage cocktail (90% TFA, 5% DCM, 5% TIPS, 450 uL), and incubated with rotation (1 h, RT, 8 rpm). Cleavage product was expelled into a tube, evaporated to dryness, resuspended (20% DMSO, 0.1% TFA in H2O, 100 μL), and analyzed using reversed-phase HPLC (X-Bridge BEH C18 column, 4.6 mm × 100 mm, 130 Å, 5 μm, Waters Corp., Milford, MA) with gradient elution (mobile phase A: ACN; mobile phase B: 0.1% TFA in H2O; 5%—75% A, 30 min) and absorbance detection (λ = 330 nm). Product fractions were collected, an aliquot (1 μL) was spotted to a MALDI-TOF MS target plate, dried, covered with HCCA matrix solution (1.5 mg/mL HCCA, dissolved in a 2:1 solution of aqueous 0.1% TFA:ACN) dried, and mass analyzed via MALDI-TOF MS (Microflex, Bruker Daltonics Inc., Billerica, MA). Major peaks in the HPLC chromatogram corresponded to photolabile N-acetyl-Glu (retention time = 9 min; theoretical [M + H]+ = 1019.5; observed [M + H]+ = 1019.7) and photolabile Glu-pepstatin TMR-labeled linker (retention time = 17 min; theoretical [M + H]+ = 2058; observed [M + H]+ = 2057). The 10-μm resin samples were collected during filtration into separate fritted spin columns (Mobicol Classic), washed (DCM, 3 × 400 μL), and dried in vacuo. The dried 10-μm resin samples were combined with deprotection cocktail (90% TFA, 5% DCM, 5% TIPS, 400 uL) and incubated (30 min, RT). The deprotection cocktail was drained, fresh cocktail was added (90% TFA, 5% DCM, 5% TIPS, 400 uL), and resin was incubated (30 min, RT). Deprotection cocktail was drained and the resin was washed (DCM, 3 × 400 μL; 1% DIEA in DMF, 2 × 400 μL; DMF, 2 × 400 μL; 1% DIEA in DMF, 1 × 400 μL; DMF, 2 × 400 μL), then incubated in DMF (overnight, RT).

Oligonucleotides

Oligonucleotides (Integrated DNA Technologies, Inc. Coralville, IA) were purchased as desalted lyophilate and used without further purification unless otherwise specified. Oligonucleotide ligation substrates were 5′-phosphorylated (/5Phos/). Amino-modified headpiece DNA (NH2-HDNA, /5Phos/GAGTCA/iSp9//iUniAmM//iSp9/TGACTCCC) was HPLC purified at the manufacturer and used without further purification. Oligonucleotides are indicated by “ ≈ ” followed by a 4-digit numeric identifier and “[+]” or “[−]” strand designation. The first digit groups the oligonucleotides by set: set 0 contains PCR primer sequences and sets 1 and 2 contain ER sequences. The second digit denotes position in the ER. The third and fourth digits index the different coding sequences in each set. Oligonucleotide paired (OP) stock solutions of complementary oligonucleotides (60 μM [+], 60 μM [−], 50 mM NaCl, 1 mM Bis-Tris, pH 7.6) were heated (5 min, 60 °C) and cooled to ambient (5 min, RT) before each use. These reagents are indicated with a “[±]” double-stranded designation. Table T1 is a concise look-up table for generating all oligonucleotide sequences. For example, ≈1302[+] is from set 1, and built by concatenating overhang X3XX[+] “/5Phos/GTT” with encoding sequence 1 × 02[+] “ACGGAGCA” to yield the sequence “/5Phos/GTTACGGAGCA.” The complement, ≈1302[−] is also from set 1, and built by concatenating overhang X3XX[−] “/5Phos/TAG” with encoding sequence 1 × 02[−] “TGCTCCGT” to yield the sequence “/5Phos/TAGTGCTCCGT.” Combining ≈1302[−] and ≈1302[+] and thermally processing as above yields the double-stranded coding module ≈1302[±], a position 3 OP stock solution (OP3) of set 1 parent sequence 02 (1 × 02). All sequences are written in the 5′ to 3′ direction. The new 10-position language positions are indicated in hexidecimal (0, 1, 2, ..., D, E, F), with “B” as the last position used for the 10-position language (Supporting Information T1).

Azido HDNA Synthesis, Purification, and Characterization

NH2–HDNA (300 nmol) was dissolved in phosphate buffer (200 mM, pH 8.0, 240 μL). 5-azidopentanoic acid NHS ester was prepared by dissolving NHS (9.6 μmoles), EDC (9.6 μmoles), and 5-azidopentanoic acid (7.2 μmoles) in DMF (20 μL) and incubating (30 min, 60 °C). The NH2–HDNA acylation reaction was assembled by sparging (N2, 1 min) the phosphate-buffered NH2–HDNA, adding 5-azidopentanoic acid NHS ester solution (22 μL), and incubating (2 h, RT). A fresh solution of 5-azidopentanoic acid NHS ester was prepared as described above, added to the acylation reaction, and the reaction incubated (1 h, RT). The reaction was quenched (1 M Tris, pH 7.6, 100 μL) and incubated (5 min, 60 °C). Azido-HDNA (N3–HDNA) product was precipitated twice in ethanol. The pellet was dried under N2, resuspended (20 mM TEAA, pH 8.0, 100 μL), and purified at semipreparative scale using reversed-phase HPLC (X-Bridge BEH C18 column, 10 mm × 150 mm, 130 Å, 5 μm, Waters Corp.) with gradient elution (mobile phase A H2O, 20 mM TEAA, pH 8; mobile phase B ACN; 5—12% B, 24 min). A product fraction aliquot (1 μL) was spotted to a MALDI-TOF MS target plate, dried, covered with matrix solution (18 mg/mL THAP, 7 mg/mL ammonium citrate dibasic in 1:1 ACN:H2O), dried and mass analyzed via MALDI-TOF MS (Microflex, Bruker Daltonics Inc.). Product-containing fractions (theoretical [M + H]+ = 5064; observed [M + H]+ = 5058) were pooled and evaporated to dryness.

HDNA Functionalization onto Photolabile Compound Resin

Photolabile Glu-pepstatin and photolabile N-acetyl-Glu resin (10 mg 10 μm, 0.5 mg 160 μm resin; 2.2 μmol) were each aliquoted, washed (CRB, 3 × 200 μL), combined with CRB (1 mL) and incubated (1 h, 40 °C). CuSO4 (3.1 μmol), TBTA (5.4 nmol), and ascorbic acid (15.6 μmol) were dissolved (66% v/v DMSO in H2O, 49.2 μL). N3–HDNA (11 nmol) and ascorbic acid (32 nmol) were dissolved in TEAA buffer (200 mM, pH 7.5, 16 μL). Bifunctional resin was washed (CRB, 1 mL), resuspended in CRB (1.3 mL), combined with Cu(II) solution (49 μL), mixed, and incubated with rotation (5 min, 40 °C, 15 rpm). The resin was centrifuged (30 s, 1000 rcf), combined with N3–HDNA solution (16 μL, 0.004 eq. to bead sites), vortexed immediately, and incubated with rotation (6 h, 40 °C, 15 rpm). Resin was centrifuged (2 min, 1,000 rcf), the supernatant removed, and the resin washed (BTPBB, 3 × 1 mL) and incubated with rotation (12 h, RT, 15 rpm). Resin was washed (BTPWB, 3 × 1 mL) then transferred into fritted spin-columns, washed (DI H2O, 3 × 1 mL; DMF, 3 × 1 mL), and stored in DMF (−20 °C).

Photolabile Glu-pepstatin and N-Acetyl-Glu Resin HDNA Coupling Validation

Wells of filtration microtiter plates (0.45 μm Hydrophobic PTFE MultiScreen Solvinert Filtre Plate, Merck Millipore Ltd., Cork, Ireland) were each wetted (DCM, 100 μL) then aliquots of HDNA-functionalized photolabile Glu-pepstatin, HDNA-functionalized photolabile N-acetyl-Glu resin (0.1 mg), and biotin-HDNA magnetic control resin (50 μg, prepared as previously described)[48] were transferred into separate clean wells and washed (BTPWB, 6 × 150 μL, BTPLB; 1 × 100 μL). Reagents for the first enzymatic oligonucleotide ligation reaction, consisting of ≈0001[±] (1.4 nmol), ≈1109[±] (1.4 nmol), ≈2210[±] (1.4 nmol), ≈1301[±] (1.4 nmol), ≈2402[±] (1.4 nmol), ≈1503[±] (1.4 nmol), and T4 DNA ligase (4500 U) were combined (BTPLB, 450 μL) and aliquoted to HDNA-functionalized photolabile Glu-pepstatin resin (150 μL), HDNA functionalized photolabile N-acetyl-Glu resin (150 μL), and biotin-HDNA magnetic resin (150 μL). Plate wells were sealed with foil adhesive, and resin samples were incubated with shaking (3 h, RT, 600 rpm), then washed (BTPBB, 3 × 150 μL; BTPLB, 1 × 100 μL). Reagents for the second ligation reaction, consisting of ≈2604[±] (1.4 nmol), ≈1705[±] (1.4 nmol), ≈2806[±] (1.4 nmol), ≈1907[±] (1.4 nmol), ≈2A08[ ± ] (1.4 nmol), ≈0B01[ ± ] (1.4 nmol), and T4 DNA ligase (4500 U) were combined (BTPLB, 450 μL) and aliquoted to HDNA-functionalized photolabile Glu-pepstatin resin (150 μL), HDNA-functionalized photolabile N-acetyl-Glu resin (150 μL), and biotin-HDNA magnetic resin (150 μL). Plate wells were sealed and resin samples were incubated with shaking (3 h, RT, 600 rpm), then washed (BTPBB, 3 × 150 μL). qPCR mixture contained Taq (0.05 U/μL), oligonucleotide primers 5′-GCCGCCCAGTCCTGCTCGCTTCGCTAC-3′ and 5′-GTGGCACAACAACTGGCGGGCAAAC-3′ (0.3 μM each), and SYBR Green (0.2×, Life Technologies) in GC-PCR buffer (1×). Resin particles (HDNA functionalized photolabile Glu-pepstatin, HDNA functionalized photolabile N-acetyl-Glu) in BTPWB (100 beads/μL, 1 μL) were added to separate amplification reaction wells (20 μL, 10 replicates each). Each resin supernatant (1 μL) was added to respective negative control reaction wells (20 μL, 2 replicates). Biotin-HDNA magnetic resin (2000 beads/μL, 1 μL) was added to a positive control amplification reaction well (20 μL, 10 replicates). Template standards (100 amol, 10 amol, 1 amol, 100 zmol, 10 zmol, 1 zmol, 100 ymol, and 10 ymol in BTPWB) were added to separate reaction wells (20 μL). The reaction plate was thermally cycled (96 °C, 10 s; [95 °C, 8 s; 72 °C, 24 s] × 32 cycles; 72 °C, 120 s; C1000 Touch Thermal Cycler, Bio-Rad, Hercules, CA) with fluorescence monitoring (CFX-96 Real-Time System, Bio-Rad). Samples were quantitated (CFX Manager, version 3.1, Bio-Rad) using single-threshold Cq determination mode (300 RFU). Supernatant background was subtracted from respective single-particle measurements. Background-subtracted replicates were averaged and %RSD calculated.

FACS-Based Bead Lot Preparation and Sequencing

DNA-encoded photolabile Glu-pepstatin beads (1, 100 000 beads) were aliquoted (PBS, 1 mL) and sorted (BD FACS Jazz, BD Biosciences, San Jose, CA) into 96-well skirted PCR plate in replicates for each bead lot size (3 × 3000 beads, 3 × 1000, 3 × 300, 3 × 100, 3 × 30, 3 × 10, 3 × 3, 34 × 1 bead). Forward and side scatter were used to define a gate for the single-bead population. PBS was added to bead-occupied wells and template standard wells to equalize sorting-buffer volume (10 μL total). qPCR mixture contained Taq (0.05 U/μL), oligonucleotide primers 5′-GCCGCCCAGTCCTGCTCGCTTCGCTAC-3′ and 5′-GTGGCACAACAACTGGCGGGCAAAC-3′ (0.3 μM each), and SYBR Green (0.2×, Life Technologies) in GC-PCR buffer (1×). Template standards (100 amol, 10 amol, 1 amol, 100 zmol, 10 zmol, 1 zmol, 100 ymol, and 10 ymol in BTPWB) were added to separate reaction wells (30 μL). The reaction plate was thermally cycled (96 °C, 10 s; [95 °C, 8 s; 72 °C, 24 s] × 32 cycles; 72 °C, 120 s; C1000 Touch Thermal Cycler, Bio-Rad) with fluorescence monitoring (CFX-96 Real-Time System, Bio-Rad). Samples were quantitated using single-threshold Cq determination mode (400 RFU). Supernatant background was subtracted from respective single-particle measurements. Background-subtracted replicates were averaged and %RSD calculated (Supporting Information S3). Plate samples were centrifuged using a swing bucket rotor (2 min, 400 rcf). For each bead population replicate set (n = 3), one sample was transferred into a clean 96-well plate and diluted (1:10,000 in BTPWB). PCR mixture contained Taq DNA polymerase (0.05 U/μL), oligonucleotide primer 5′-CCTCTCTATGGGCAGTCGGTGATGCCGCCCAGTCCTGCTCGCTTCGCTAC-3′ (0.3 μM), SYBR Green (0.2×, Life Technologies), DMSO (6%), betaine (1 M), MgCl2 (1 mM), and PCR buffer (1×). Each amplicon dilution was added (2 μL, 1 μL, 0.5 μL) with a corresponding NGS barcode (XXXXXXXXXX) oligonucleotide primer (5′-CCATCTCATCCCTGCGTGTCTCCGACTCAGXXXXXXXXXXGATGTGGCACAACAACTGGCGGGCAAAC-3′, 12 pmol) to separate amplification reactions (40 μL). The plate was thermally cycled ([95 °C, 8 s; 70 °C, 24 s; 72 °C, 16 s] x 24 cycles; 72 °C, 120 s). Aliquots of barcoded amplicons with similar quantitation (Cq = 12.8 ± 0.2) were pooled (0.1, 0.35, 1.0, 3.5, 10.5, 35, 16.8 μL; 3, 10, 30, 100, 300, 1000, 3000 bead samples, respectively) and purified by native PAGE (6%, 1 × TBE, 4 W, 30 min) with SYBR Gold staining (Life Technologies). Gel slices containing 248-bp DNA products were excised and placed in a tube (0.6 mL) punctured at the bottom using a syringe needle (18 gauge). The punctured tube was placed inside a larger tube (1.5 mL) and centrifuged (5 min, 10 000 rcf). The extruded gel slices were combined with DI H2O (150 μL), incubated (overnight, RT, 8 rpm), and centrifuged (5 min, 14 000 rcf). The supernatant was removed to a clean tube. An aliquot was used for standard NGS sample preparation and sequencing (Ion Proton, Life Technologies).

Confocal LIF Detection System

Droplet fluorescence was detected on-chip using a two-channel confocal LIF microscope that was built in-house using 30-mm cage system components (Thorlabs, Inc., Newton, NJ). A long-pass dichroic mirror (500 nm, 25.2 × 35.6 mm dichroic long-pass filter, Edmund Optics, Barrington, NJ) directs light from an optically pumped semiconductor laser (488 nm, 20 mW, OBIS-488 20LS, Coherent Inc., Santa Clara, CA) into a microscope objective (40×, 0.60 NA, 2.8 mm WD, Motic, Richmond, Canada), which focuses the excitation beam on the microfluidic channel and collects fluorescence emission. The dichroic transmits the fluorescence signal to a short-pass dichroic (550 nm, 25.2 × 35.6 mm, dichroic short-pass filter, Edmund Optics). Emitted light between 500 and 550 nm is spectrally filtered through a bandpass filter (520BP10, Omega Optical Inc., Brattleboro, VT) before being focused by a plano-convex lens (f/D = 30 mm/25.4 mm). A pinhole (8 μm) spatially filters the focused light prior to detection with a photon counting PMT (H7828, Hamamatsu, Middlesex, NJ). Emitted light longer than 550 nm is directed to a long-pass dichroic (600 nm, 25.2 × 35.6 mm, dichroic long-pass filter, Edmund Optics), which reflects light between 550 and 600 nm to a bandpass filter (570BP10, Omega Optical) and an otherwise identical plano-convex lens/pinhole/PMT optical train. LabVIEW code written in-house controls signal processing and droplet sorting decisions. The PMT signals were digitized by a data acquisition board (DAQ, NI USB-6341, National Instruments, Austin, TX), and binned into packets of counts (Δt = 0.5 ms). Median-filter smoothing (window width = 7) is applied to the signal in real-time, droplet signal regions are identified by the 570 nm channel signal (counts > 500) and 520 nm channel signal maxima (max = counts when counts < counts) returned as “droplet fluorescence.” When a hit droplet is detected (droplet fluorescence < 1000 counts), LabVIEW outputs a TTL pulse from the DAQ board to a waveform generator (Agilent 33210A, Agilent Technologies, Santa Clara, CA), triggering a defined square wave pulse output (0—5 V, 10 kHz, 80 cycles) that is amplified (gain = 100 V/V, TREK Model 2210 high-voltage power amplifier, TREK Inc., Lockport, NY) and conducted through needle-fitted tubing filled with salt water (4 M NaCl) into a microfabricated electrode channel (VAC).

Microfluidic Device Fabrication and Calibration

Channel structures were fabricated using soft lithography.[64] Devices were fabricated using two-tone patterned PDMS as previously described.[45] Briefly, avobenzone (34 mg) was dissolved in toluene (200 μL) and mixed into PDMS prepolymer (5.5 g, 10:1 elastomer base/curing agent). Degassed avobenzone-PDMS prepolymer was loaded into a disposable syringe (3 mL, BD Medical, Franklin Lakes, NJ) and applied over the incubation channel and bead-introduction reservoir regions of the master. After partial curing (80 °C, 12 min), native, degassed PDMS prepolymer (44 g, 10:1) was poured on top of the master and cured to completion (80 °C, 1 h). After peeling the PDMS mold from the wafer, fluidic access ports were punched with a biopsy punch (0.75 mm, World Precision Instruments, Inc., Sarasota, FL). Microfluidic devices were prepared and fitted with integrated waveguides.[46] Waveguide performance was calibrated for each device to ensure reproducible compound liberation. To calibrate, a solution of Brilliant Violet 510 (BV510, 4 μg/mL in MES buffer) was pumped through the calibration channel (0.5 μL/min). The dye was excited with UV via integrated waveguide (LED current = 115 mA, 130 mA) and emission was measured in the 520 nm channel (100 Hz). Afterward, the calibration channel was rinsed with water, dried, and filled with trimethylsiloxy-terminated PDMS.

Integrated Circuit Operation

Prior to droplet generation, the incubation channel was backfilled with oil phase. OIL1 (1.5 μL/min) was pumped into the circuit through OIL1 inlet and aqueous phase was pumped into the circuit through AQ1 (0.3 μL/min) and AQ2 (0.2 μL/min). Fluorogenic CatD peptide probe (1.5 nmol, AnaSpec) and R-phycoerythrin (12.5 μg) in BHB (250 μL) were driven into the circuit via AQ1. CatD (610 ng, 0.36 U) in protease assay buffer (400 μL) was driven into the circuit via AQ2. A mixture of DNA-encoded photocleavable pepstatin A positive control inhibitor beads 1 and negative control N-acetyl-Glu beads 2 serves as a model bead library. The DNA-encoded model library was suspended in BHB (180 μL), filtered (CellTrics 20-μm mesh, Sysmex-Partec), and drawn into syringe (1 mL, slip-tip disposable tuberculin syringe, Becton Dickinson, Franklin Lakes, NJ) fitted with a needle (30 gauge, 0.5 in., Precisionglide, Becton Dickinson) while preserving head space. A PDMS plug-sealed pipet tip (200 μL, Rainin BioClean LTS tips, Mettler-Toledo, Columbus, OH) was filled with model library bead suspension by inserting the needle through the PDMS plug and expelling the bead suspension (∼170 μL) to fill the tip without air pockets. The puncture hole in the PDMS was sealed with hot glue, and the tip was inserted into the LIB port of the device.[40] Water-in-oil droplets were generated (190—220 pL, 43—37 Hz) at the flow-focusing channel intersection, directed to the waveguide illumination region for UV irradiation, then driven through the incubation channel (18 min) before detection and sorting. Droplet spacing oil (3.5 μL/min) and droplet guiding oil (2.5 μL/min) enter the device at OIL2 and OIL3, respectively. Droplets were interrogated via confocal laser-induced fluorescence detection (2 kHz acquisition). Droplets were dielectrophoretically deflected in an electric field generated between one saltwater electrode connected to a high-voltage source (500 VAC, 10 kHz square wave, 8 ms pulse) and another electrode connected to ground. Deflected droplets flow to the HIT output for collection. After model library screening, droplets collected through the HIT output are visualized using an inverted epifluorescence microscope (2.5×, 0.075 NA, Axio Observer A1, Zeiss, Thornwood, NY) equipped with a CCD camera (AxioCam ICm1, Zeiss). Droplets were imaged in brightfield mode and fluorescence mode using Zeiss filter set 38HE (FAM channel, λex = 470/40 nm; λdc = 495 nm; λem = 525/50 nm) and Zeiss filter set 20 (TMR channel, λex = 546/12 nm; λdc = 560 nm; λem = 608/65 nm).

NGS Decoding and Bead Index Cutoff

Ion Proton FASTQ files for each screening sample set were imported into R, each sequence was matched to the degenerate 10-position reference sequence “ATGGNNNNNNNNTCANNNNNNNNGTTNNNNNNNNCTANNNNNNNNTTCNNNNNNNNCGCNNNNNNNNGTANNNNNNNNTGGNNNNNNNNTCTNNNNNNNNAAGNNNNNNNNGCCTCCCAAACnnnnnnnnGTT” (overhangs in bold, N = n = any DNA base) allowing for up to 3 mismatched bases, then excess sequence was trimmed. All NNNNNNNN encoding sequences were matched with the known encoding set (“AAGAGGCA”, “ACGGAGCA”, “ACGAGATT”, “AAGGAGGT”, “AGAAAGCA”, “ATAGAGCC”, “CAGAAGGA”, “GAGGAACA”, “TGAAGGAA”, “TTGAGGAT”, “CCTCCTAA”, “AACCTCAA”, “ATTCTCGG”, “AACCCTAC”, “GACTCCGC”, “CATTTCAA”, “CCCTCCGG”, “CGTTCCTG”, “TTCTTCAT”, “TCTCCTCC”). Hamming distances were calculated for all nonmatched NNNNNNNN encoding sequences. Those with Hamming distance = 1 from a member of the known encoding set were replaced with the correct sequence. Any read containing an encoding sequence with Hamming distance >1 was removed. Identical sequences including the UMI (nnnnnnnn) were aggregated as a single sequence, trimmed to terminate at “GCCTCCCAAC”, and both the UMI sequence (nnnnnnnn) and read counts for each UMI were preserved. Identical sequences were aggregated (now without UMI consideration) as a single sequence, and a random sample of up to 5000 UMI sequences for each sequence was selected. For each sequence, the set of UMIs was examined and for any given UMI pair with a Hamming distance = 1, the UMI represented by the fewest read counts was discarded. After this UMI “distance filter”, each sequence’s UMI set was counted. Sequences were rank-ordered by UMI counts, assigned a “Bead Index” number, then plotted. For each bead-lot sample (30, 100, 300, 1000, 3000, hit collection), a weighted mean UMI count (UMIw) was generated at each bead index, using a rolling window (width = 10), and a normal weighted distribution (e–; k = 0.01). The transition from bead-derived sequences to background sequences was approximated by determining the inflection point of each curve, equal to the bead index yielding a maximum %ΔUMIw/UMIw (% change in UMIw). The inflection’s x coordinate is called the bead index cutoff.

hvSABR Hit Bead Recovery and Preparation for NGS

The water-in-oil emulsion (∼0.6 mL) was centrifuged (5 min, 12000 rcf), 2/3 of the oil phase was removed, and fresh DMF-A-6CS silicone oil (Shin-Etsu) was added (400 μL). After oil exchange, the tube contents were agitated by flicking. This process was repeated thrice more and BTPWB (2 μL) was added. The tube was agitated and centrifuged (5 min, 12,000 rcf). BTPWB (100 μL) was added, and the tube was centrifuged (5 min, 12,000 rcf). Most oil was removed, BTPWB (300 μL) was added, and the tube was centrifuged (5 min, 12,000 rcf). The remaining oil and excess aqueous phase were removed with a clean polyurethane swab (Berkshire Corporation, Great Barrington, MA). Beads were transferred to PCR tube (0.2 mL) in BTPWB (4 × 50 μL), then the tube was centrifuged (30 s, 6000 rcf) and all supernatant except ∼10 μL was removed. qPCR mixture contained Taq (0.05 U/μL), oligonucleotide primers 5′-GCCGCCCAGTCCTGCTCGCTTCGCTAC-3′ and 5′-GTGGCACAACAACTGGCGGGCAAAC-3′ (0.3 μM each), and SYBR Green (0.2×, Life Technologies) in GC-PCR buffer (1×). Template standards (100 amol, 10 amol, 1 amol, 100 zmol, 10 zmol, 1 zmol, 100 ymol, and 10 ymol in BTPWB) were added into BTPWB (10 μL) in separate reaction wells of a PCR tube strip (8 tubes), and qPCR mix (30 μL) was added. Hit beads in the PCR tube were lightly vortexed to free bead pellet, sonicated (30 s), then qPCR mix (30 μL) was added prior to thermal cycling. The hit bead PCR tube and standards were thermally cycled (96 °C, 10 s; [95 °C, 8 s; 72 °C, 24 s] × 27 cycles; 72 °C, 120 s; C1000 Touch Thermal Cycler, Bio-Rad) with fluorescence monitoring (CFX-96 Real-Time System, Bio-Rad). The hit bead PCR tube was centrifuged (1 min, 1000 rcf), and supernatant was diluted (1:10000 in BTPWB). The amplicons were barcoded and prepared for NGS as described previously (Ion Proton, Life Technologies).

61 in total

1. Dynamic pattern formation in a vesicle-generating microfluidic device.

Authors: T Thorsen; R W Roberts; F H Arnold; S R Quake
Journal: Phys Rev Lett Date: 2001-04-30 Impact factor: 9.161

2. The generality of DNA-templated synthesis as a basis for evolving non-natural small molecules.

Authors: Z J Gartner; D R Liu
Journal: J Am Chem Soc Date: 2001-07-18 Impact factor: 15.419

3. Directed evolution of an RNA enzyme.

Authors: A A Beaudry; G F Joyce
Journal: Science Date: 1992-07-31 Impact factor: 47.728

4. A new type of synthetic peptide library for identifying ligand-binding activity.

Authors: K S Lam; S E Salmon; E M Hersh; V J Hruby; W M Kazmierski; R J Knapp
Journal: Nature Date: 1991-11-07 Impact factor: 49.962

5. The chemistrode: a droplet-based microfluidic device for stimulation and recording with high temporal, spatial, and chemical resolution.

Authors: Delai Chen; Wenbin Du; Ying Liu; Weishan Liu; Andrey Kuznetsov; Felipe E Mendez; Louis H Philipson; Rustem F Ismagilov
Journal: Proc Natl Acad Sci U S A Date: 2008-10-30 Impact factor: 11.205

6. Screening for novel antimicrobials from encoded combinatorial libraries by using a two-dimensional agar format.

Authors: J L Silen; A T Lu; D W Solas; M A Gore; D MacLean; N H Shah; J M Coffin; N S Bhinderwala; Y Wang; K T Tsutsui; G C Look; D A Campbell; R L Hale; M Navre; C R DeLuca-Flaherty
Journal: Antimicrob Agents Chemother Date: 1998-06 Impact factor: 5.191

7. In-Bead Screening of Hydroxamic Acids for the Identification of HDAC Inhibitors.

Authors: Katrine Qvortrup; Thomas E Nielsen
Journal: Angew Chem Int Ed Engl Date: 2016-03-04 Impact factor: 15.336

8. Rapid Prototyping of Microfluidic Systems in Poly(dimethylsiloxane).

Authors: D C Duffy; J C McDonald; O J Schueller; G M Whitesides
Journal: Anal Chem Date: 1998-12-01 Impact factor: 6.986

9. DNA display I. Sequence-encoded routing of DNA populations.

Authors: David R Halpin; Pehr B Harbury
Journal: PLoS Biol Date: 2004-06-22 Impact factor: 8.029

10. Droplet electrospray ionization mass spectrometry for high throughput screening for enzyme inhibitors.

Authors: Shuwen Sun; Robert T Kennedy
Journal: Anal Chem Date: 2014-08-26 Impact factor: 6.986

22 in total

1. What Do You Get from DNA-Encoded Libraries?

Authors: Alexander L Satz
Journal: ACS Med Chem Lett Date: 2018-04-17 Impact factor: 4.345

2. Activity-Based DNA-Encoded Library Screening.

Authors: Wesley G Cochrane; Marie L Malone; Vuong Q Dang; Valerie Cavett; Alexander L Satz; Brian M Paegel
Journal: ACS Comb Sci Date: 2019-03-29 Impact factor: 3.784

3. Considerations for Achieving Maximized DNA Recovery in Solid-Phase DNA-Encoded Library Synthesis.

Authors: Alexander K Price; Brian M Paegel
Journal: ACS Comb Sci Date: 2020-08-24 Impact factor: 3.784

4. Integrated, Continuous Emulsion Creamer.

Authors: Wesley G Cochrane; Amber L Hackler; Valerie J Cavett; Alexander K Price; Brian M Paegel
Journal: Anal Chem Date: 2017-11-28 Impact factor: 6.986

Review 5. Automating drug discovery.

Authors: Gisbert Schneider
Journal: Nat Rev Drug Discov Date: 2017-12-15 Impact factor: 84.694