Skip to main content
SearchLoginLogin or Signup

Cell-Free DNA Blood Collection Tubes Crosslinking Cellular DNA Impeding Nanopore Long-Read Sequencing

Keywords: buffy coat, crosslinking, DNA, extraction, nanopore sequencing, Streck

Published onMar 19, 2025
Cell-Free DNA Blood Collection Tubes Crosslinking Cellular DNA Impeding Nanopore Long-Read Sequencing
·

ABSTRACT

Long-read DNA and RNA sequencing facilitate genome assembly, haplotyping, complex variant calling, and gene isoform identification. Structural variant calling is integral to the molecular characterization of tumors; thus, long-read nanopore DNA sequencing technology is becoming routinely used in cancer research. As a standard practice, high molecular weight (HMW) DNA is extracted from both tumor and matched normal samples from blood or buffy coat to redact germline variants from somatic mutations. However, we found that buffy coat DNA consistently underperformed compared to DNA extracted from tumor tissue. Furthermore, this observation was unique to DNA extracted from buffy coat cells collected in Streck, but not ethylenediaminetetraacetic acid (EDTA), tubes. We therefore investigated whether the released formaldehyde in Streck tubes resulted in DNA crosslinking, which would explain the low data throughput. Indeed, a decrosslinking step during Streck DNA extraction significantly improved data yield and fragment length without compromising data quality. We therefore recommend a tailored DNA extraction protocol of Streck-derived buffy coat samples for nanopore sequencing.

Address correspondence to: Neeman Mohibullah, Integrated Genomics Operation, Memorial Sloan Kettering Cancer Center, 1250 1st Ave., New York, NY 10065, USA (Tel: 212-639-2853; Fax: 646-888-3407; Email: [email protected]).

Keywords: buffy coat, crosslinking, DNA, extraction, nanopore sequencing, Streck

Conflicts of Interest: The authors declare no conflicts of interest.

Note: Anonymized DNA samples were used from three healthy volunteers under an approved protocol, IRB 06-107, by the Institutional Review Board of Memorial Sloan Kettering Cancer Center.

INTRODUCTION

The study aim was to develop a protocol that allowed the use of DNA derived from buffy coat samples collected in Streck Cell-Free DNA Blood Collection Tubes (cfDNA BCTs). While processing more than 600 samples in our core, we noticed that Streck BCT DNA consistently underperformed in nanopore long-read sequencing, failing to reach target coverage of 30X for human genome, approximately equivalent to data yield of 90 Gb (Fig. 1). However, both buffy coat DNA from Ethylenediaminetetraacetic acid (EDTA) BCTs and the optimal cutting temperature (OCT) compound-derived matched tumor tissue reliably yielded >100 Gb per PromethION flow cell. Streck tubes are coated with imidazolidinyl urea, a formaldehyde-releasing agent, which prevents release of cellular DNA, facilitating analysis of cfDNA.[1],[2] A previous report showed that Streck tubes induce peripheral mononuclear cell protein formaldehyde crosslinking, particularly in cytoskeletal proteins and histones.[3] We hypothesized that Streck buffy coat DNA is also crosslinked by the released formaldehyde, impeding nanopore sequencing. Due to their frequent use in the clinic for liquid biopsy testing, it is important to enable compatibility between Streck-derived genomic DNA (gDNA) and nanopore long-read sequencing, a powerful technology for analyzing cancer genomes.

Figure 1

Streck buffy coat samples are largely incompatible with nanopore sequencing.

(A) Plot showing post-sequencing data yield (Gb) and estimated coverage (X) from 14 ONT samples run: Streck-derived buffy coat samples submitted as Normal (SN; N=5), Streck-matched OCT-derived Tumor (SmT; N=5), and EDTA-derived buffy coat samples submitted as Normal (EN; N=4). Unpaired t-test was used for statistical significance; p<0.001 (***), not significant (ns). Dotted line denotes minimum data yield target per sample (90 Gb). Samples originated from numerous labs across the institution and were collected under IRB protocol 12-245. 

 (B) Pore activity graph from representative samples generated by MinKNOW 23.11.7, showing sample performance and pore status (color-coded) during sequencing. W denotes time-point of flow cell wash. Note rapid pore depletion in Streck sample run, <20% active pores in <24hrs, flow cell washing replenished the pores only temporarily, suggesting pore clogging caused by the loaded sample. 

(C) Plot showing N50 sequence lengths (kb) for Streck-derived buffy coat (SN), Streck-matched Tumor (SmT) and EDTA-derived buffy coat (EN) DNA samples. Unpaired t-test was used for statistical significance; p<0.05 (*), not significant (ns).

SN: Streck-derived Normal, SmT: Streck-matched Tumor, EN: EDTA-derived Normal, W: Wash.   

Here we describe a simple alteration in high molecular weight (HMW) DNA extraction protocol for Streck-derived buffy coat samples, which is necessary to successfully prepare and sequence nanopore libraries.

MATERIALS & METHODS

Sample collection and processing

One 5 mL Streck cfDNA BCT (S) (230470, Streck, La Vista, NE, USA) and two 5 mL BD Vacutainer EDTA tubes (367863, Becton Dickinson, Franklin Lakes, NJ, USA) (untreated: E; treated with final concentration of 0.1% formaldehyde (47608, Sigma, St. Louis, MO, USA) immediately after blood draw: EF) were collected from three healthy volunteers. After a 20-hr incubation, DNA was extracted with (+) and without (–) decrosslinking (Fig. 2).

Figure 2

Workflow schematic of sample collection and preparation.

Blood was collected from three healthy volunteers. One Streck blood collection tube (S), one untreated EDTA blood collection tube (E) and one EDTA treated with 0.1% formaldehyde (EF) were collected from each volunteer. The blood was incubated in the respective collection tubes for ~20 hr at room temperature to simulate real-life use and transport conditions before processing. Buffy coat cells from each blood collection tube were isolated by centrifugation and split into two fresh Eppendorf tubes. High molecular weight DNA from each sample was then extracted with (S+, E+, EF+,) and without (S-, E-, EF-) decrosslinking (65°C overnight incubation). ONT libraries were prepared and sequenced. Created with BioRender.com.

S: Streck, E: EDTA, EF: EDTA + 0.1% formaldehyde, HMW: high molecular weight.

Buffy coat DNA extraction

Whole blood samples were centrifuged at 800 g for 10 min at room temperature. The plasma phase was discarded, and the buffy coat was split into two 1.5 mL Eppendorf DNA LoBind tubes (13698791; Thermo Fisher Scientific, Waltham, MA, USA). Genomic DNA (gDNA) samples were extracted using the Monarch HMW DNA Extraction Kit for Cells & Blood (T3050L; New England Biolabs, Ipswich, MA, USA) following the manufacturer’s protocol except for an initial 1:1 dilution of buffy coat in 1xPBS (10010072; Gibco, Grand Island, NY, USA) to bring the buffy coat volume to 500 µL followed by erythrocyte lysis. For decrosslinked samples only: the addition of Nuclei Lysis Solution (Proteinase K and Nuclei Lysis Buffer) was followed by 56°C incubation for 10 min at 2000 rpm (ThermoMixer C; Eppendorf, Hamburg, Germany) and overnight incubation at 65°C, before continuing with leukocyte lysis. Extracted DNA was quantified by Quant-iT 1X dsDNA High Sensitivity Assay Kit (Q33232; Thermo Fisher Scientific), according to manufacturer’s instructions. Quality and size were analyzed by NanoDrop (ND1000; Thermo Fisher Scientific) and gDNA TapeStation 4200 (Agilent, Santa Clara, CA, USA), respectively.

Oxford Nanopore Technologies (ONT) library preparation and sequencing

3 μg gDNA was normalized to 49 µL of nuclease-free water and transferred to a g-TUBE (520079; Covaris, Woburn, MA, USA). gDNA was sheared at 4600-6000 rpm for 2-4 min following the manufacturer’s guidelines. DNA size was re-analyzed, to ensure ~25 kb fragments, by TapeStation, and DNA purity was re-assessed by NanoDrop, before library preparation.

Libraries were prepared with the Ligation Sequencing Kit (SQK-LSK114; Oxford Nanopore Technologies (ONT), Oxford, UK), according to manufacturer’s guidelines. Libraries were quantified by Quant-iT and 50 fmol were loaded onto a FLO-PRO114M (ONT) flow cell. Run options in the MinKNOW UI included 1 kb minimum read length and 90-hr run limit. The high accuracy (HAC) basecalling model was selected. Flow cells were washed using Flow Cell Wash Kit (EXP-WSH004; ONT) once the active pores reached ~25-30%, and 50 fmol of the library was reloaded according to manufacturer’s guidelines.

Data Analysis

Estimated coverage was calculated using the equation C=LN/GC = LN/G where L = average of N50 and median read length, and N = read number, all of which were reported by MinKNOW 23.11.7, and G = haploid genome size. The sequencing reads were aligned to human genome reference GRCh38 using minimap2 with default Oxford nanopore reads setting. OxoG Metrics stats were collected using CollectOxoGMetrics from Picard tools (https://broadinstitute.github.io/picard/).

RESULTS

In an early study of five tumor and matched normal pairs processed in the core, Streck-derived buffy coat ONT DNA libraries consistently underperformed, defined as data yield below 90 Gb, unlike matched tumor and other EDTA-derived libraries, which had an average yield of 116 Gb, and an average coverage of 47X (Fig. 1A). Moreover, most Streck-derived samples resisted gTube fragmentation, maintaining a >60 kb size as assessed by TapeStation, even after multiple attempts to shear to 25-30 kb (Table 1). While very long DNA can lead to lower data yield and pore clogging, the size alone could not explain the significant difference in data yield and the rapid pore depletion (Fig. 1B), as some of the tumor samples also appeared >55-60 kb. Importantly, even though the Streck buffy coat samples seemingly had the longest fragments after extraction, the post-sequencing average N50 value was the lowest compared to tumor and EDTA buffy coat N50s (Fig. 1C). We therefore questioned whether the Streck preservative, potentially releasing formaldehyde, was responsible for these outcomes. A formaldehyde test strip confirmed that Streck tubes have a formaldehyde concentration of >100 mg/l (0.01%), potentially crosslinking gDNA over prolonged incubation time (Fig. 3). Crosslinked DNA could explain the rapid pore depletion and low data yield. To address our hypothesis and test whether a decrosslinking step could mitigate the outcome, blood samples were collected from each of three healthy volunteers and processed under the conditions shown in the schematic (EDTA, E; EDTA + Formaldehyde, EF; Streck, S) (Fig. 2). All extracted DNA samples were of optimal quality. It is noteworthy that all EF samples resisted shearing, maintaining >50 kb size, and had the lowest library yield (Table 2). Comparison between the samples extracted without decrosslinking showed that S and EF samples behaved similarly with lower library yield, rapid pore depletion, and lower N50s versus E samples, corroborating our previous observations (Fig. 4). Decrosslinking rescued sequencing output for both S and EF samples, resulting in data yield of >100 Gb (Fig. 4A-B). E sample performances were comparable with or without decrosslinking (Fig. 4A-C). It has been previously shown that extended heat exposure causes DNA damage, with 8-oxoguanine (oxoG) being the most frequent base lesion.[4] Analysis of the oxidative DNA lesion oxoG showed that the higher temperature for the decrosslinking step did not induce oxidative DNA damage. Additionally, decrosslinking resulted in improved oxidation error rates in S and EF samples, versus no decrosslinking, probably due to the resolution of crosslinks reducing the number of alternate base calls (Fig. 4D).[5]

In this study, we hypothesize that the released formaldehyde in Streck tubes crosslinks cellular DNA impeding nanopore sequencing. Streck DNA nanopore performance was comparable to formaldehyde treated DNA, suggesting that the released formaldehyde in Streck tubes was responsible for the poor outcome. As such, decrosslinking at 65oC overnight during the HMW DNA extraction process of Streck-derived buffy coat DNA samples is essential for a high-throughput nanopore sequencing run.

Table 1

Sample DNA quality control (QC)

Streck buffy coat - Normal

A260/230

A260/280

DIN

Peak post-shear size (kb)

SN1

2.34

1.87

9.9

60

SN2

1.66

1.86

9.9

60

SN3

2.32

1.86

9.9

60

SN4

2.37

1.86

9.9

60

SN5

2.31

1.87

9.9

20

Streck-matched Tumor

SmT1

2.36

1.82

9.8

56.9

SmT2

1.66

1.86

9.9

51.8

SmT3

2.32

1.86

9.5

26.2

SmT4

2

1.84

9.9

60

SmT5

2.02

1.84

9.9

59.7

EDTA buffy coat - Normal

EN1

2.34

1.91

9.4

34.3

EN2

2.37

1.92

9.8

25.1

EN3

1.51

1.83

9.1

28.8

EN4

1.81

2

9.8

31.1

DNA QC evaluating the purity and size (post-shearing) of the extracted DNA, by NanoDrop and TapeStation, respectively. Note all samples were of optimal quality for nanopore sequencing. DIN=DNA Integrity, SN=Streck buffy coat-Normal, SmT=Streck-matched Tumor, EN=EDTA buffy coat-Normal.

Table 2

Donor sample DNA quality control (QC)

Samples

A260/230

A260/280

DIN

Peak post-shear size (kb)

Library yield (fmol)

⎼ decrosslinking

Streck-BC (S)

S1

2.2

1.86

9.7

31.8

89

S2

2.14

1.85

9.9

19.7

60

S3

2.36

1.88

9.9

24.9

131

EDTA-BC (E)

E1

2.28

1.87

9.9

24

124

E2

1.92

1.9

9.8

18.1

120

E3

2.23

1.86

9.9

25

160

EDTA+FA-BC (EF)

EF1

2.22

1.83

9.9

60

54

EF2

1.97

1.96

9.7

52

40

EF3

2.08

1.89

9.9

54.3

34

+ decrosslinking

Streck-BC (S)

S1

2.28

1.93

9.9

32.9

94

S2

2.24

1.96

9.9

27.3

101

S3

2.73

1.95

9.9

21.2

146

EDTA-BC (E)

E1

2.2

1.9

9.9

19.3

174

E2

2.22

1.91

9.8

27.6

124

E3

2.68

1.91

9.9

21.2

135

EDTA+FA-BC (EF)

EF1

2.21

1.99

9.9

21.8

91

EF2

2.28

2.06

9.9

22.9

147

EF3

3.47

1.98

9.9

18.8

172

DNA quality control (QC) evaluating the purity and size (post-shearing) of the extracted DNA, by NanoDrop and TapeStation, respectively. Note all donor samples were of optimal quality for nanopore sequencing. Library yield from each sample is also shown. DIN=DNA Integrity

Figure 3

Streck tube content releases formaldehyde.

Streck tube content was diluted with 5ml nuclease-free water. The solution was tested for formaldehyde using a formaldehyde test strip (1100360001, Sigma). An EDTA tube was washed with nuclease-free water and was also tested as a negative control. 0.1% formaldehyde solution was used as a positive control. Note that the Streck tube residue released formaldehyde at a concentration of >100mg/l (0.01%). + ctl: positive control, - ctl: negative control.

Figure 4

A decrosslinking step during extraction of Streck-derived DNA improves flow cell performance resulting in higher data yield.

(A) Pore activity graphs showing sample performance and pore status (color-coded) during sequencing of Streck (S), EDTA (E) and EDTA treated with 0.1% formaldehyde (EF) donor samples extracted with (+) and without (-) decrosslinking. Note EF and S samples resulted in rapid pore depletion (<20% active pores in <24hrs) and flow cell washing (W) for S samples replenished the pores only temporarily, suggesting pore clogging. EF DNA did not yield enough library for washing and reloading. Rapid pore depletion was rescued by decrosslinking. Pore status is color-coded.

(B) Plot showing post-sequencing data yield (Gb) and estimated coverage (X) for S, E and EF samples extracted with (+) and without (-) decrosslinking. Note significant data yield improvement for S and EF samples with decrosslinking versus without decrosslinking. Unpaired t-test was used for statistical significance; p<0.01 (**), not significant (ns).

(C) Plot showing N50 sequence lengths (kb) for S, E and EF samples with (+) and without (-) decrosslinking. Note the significant improvement in read lengths in S and EF samples extracted with a decrosslinking step versus without. Unpaired t-test was used for statistical significance; p<0.01 (**), not significant (ns).

(D) Quantification of the error rate resulting from oxidative artifacts. Oxidation error rate was calculated as max(alt_oxo_bases-alt_nonoxo_bases,1)/total_bases. Oxidation_Q = -10*log10 (oxidation_error_rate). Note 1) no increase in oxidation error rate in samples that underwent decrosslinking at 65oC overnight versus samples without decrosslinking, 2) improved oxidation error rate in S and EF samples with decrosslinking versus no decrosslinking.

S: Streck, E: EDTA, EF: EDTA+0.1% formaldehyde, W: Wash.

ACKNOWLEDGMENTS

This work was funded by the MSK Cancer Center Support Grant P30 CA008748; PI: Selwyn Vickers. We are grateful to Ms. Tina M. Alano and Dr. Joseph M. Scandura for the blood collection of volunteers, and Ms. Emily Stockfisch for blood sample information. We thank all members of the Integrated Genomics Operation for helpful insights and support.

Comments
0
comment
No comments here
Why not start the discussion?