Apr 13, 2020

Public workspaceAn error-corrected panel-based next-generation sequencing assay for ultra-sensitive somatic mutation detection in ctDNA

  • 1Department of Medicine, School of Clinical Sciences, Monash University, Australia
Icon indicating open access to content
QR code linking to this content
Protocol CitationHeidi Fettke 2020. An error-corrected panel-based next-generation sequencing assay for ultra-sensitive somatic mutation detection in ctDNA. protocols.io https://dx.doi.org/10.17504/protocols.io.bb3siqne
License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License,  which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
We use this protocol and it's working
Created: February 03, 2020
Last Modified: April 13, 2020
Protocol Integer ID: 32594
Keywords: Hybrid-capture, next-generation sequencing, targeted panel, molecular barcoding, cell-free DNA, cfDNA, circulating tumour DNA, ctDNA,
Abstract
Circulating tumour DNA (ctDNA), fragmented DNA shed into the circulation from tumour cells, holds the potential to become a minimally-invasive biopsy material with which to interrogate the tumour landscape. However, ctDNA can be highly diluted in normal cell-free DNA (cfDNA) (released into the blood primarily by hematopoietic cells), and thus robust liquid biopsy assays with high analytical sensitivity are required for ctDNA to reach its full clinical potential in molecular genotyping.
This protocol uses optimised off-the-shelf reagents in order to generate indexed and molecularly-tagged cfDNA and germline DNA libraries compatible with Illumina short-read sequencing. This protocol involves the hybridisation-capture of targeted regions of the genome using RNA baits - and has been used previously with a 42-gene panel 447 kbp in size. Also described here is the bioinformatic tools used for data processing and variant identification.
The combination of deep-sequencing and error-correction based on molecular barcoding allows high sensitivity and specificity to be generated at low variant frequencies.
Guidelines
Always try to use lobind plasticware to reduce loss of cfDNA during processing (this includes pipette tips)
Always use pure, nuclease-free water (and PBS if necessary)
Always try to keep Pre-PCR components separate from Post-PCR components. Note: Post-PCR = final stage of library generation
You can store your cfDNA in TE buffer rather than water but keep in mind where you may need to concentrate your samples

Blood collection
Blood collection
5m
5m
Collect whole peripheral blood in either an EDTA tube or a dedicated cfDNA stabilising tube such as a Streck Cell-Free DNA BCT tube depending on the estimated time to processing (within 4 hours for an EDTA tube otherwise use a stabilising tube). 21-23 gauge needle or larger preferred to reduce chances of haemolysis. Invert tube ~5 times, then try to prevent any further agitation during transit if possible.
Blood processing
Blood processing
40m
40m
Centrifuge blood tubes at Centrifigation1900 x g, 4°C, 00:10:00 to facilitate the initial separation of plasma from other blood components.

Separate plasma into Amount5 mL eppendorf tubes, being careful not to draw up the buffy coat layer below.
At the same time, collect the buffy coat layer (approx Amount1 mL ) below the plasma into a cryovial and store at Temperature-80 °C until required.

Further clarify the plasma via centrifugation at Centrifigation16000 x g, 4°C, 00:10:00 .
Draw up supernatant into a fresh Amount5 mL eppendorf tube, leaving any pellet behind to be discarded.
Re-centrifuge the supernatant to ensure the absence of a pellet.
Aliquot the plasma into cryovials and store at Temperature-80 °C required.

cfDNA extraction
cfDNA extraction
3h
3h
Thaw plasma on ice.
Extract cfDNA from plasma using the QIAamp circulating nucleic acid kit (Qiagen) as per the manufacturer's instructions. Elute into Amount50 µL buffer AVE. Pass the eluate over the column again to maximise yield after a second on-column incubation period of Duration00:01:00 .

Note: Try to avoid extracting cfDNA from more than 6 patient samples at once in order to avoid cross-contamination as much as possible.
Remove contaminating large molecular weight DNA fragments (that can arise due to cellular lysis during the blood draw or transporting process:

- Incubate the Amount50 µL eluate with Amount25 µL ampureXP paramagnetic beads (Beckman Coulter) (a 0.5:1 v/v ratio) at room temperature for Duration00:05:00 .
- separate beads on a magnetic rack for Duration00:02:00 and draw off the supernatant into a new eppendorf tube. The beads now contain large DNA fragments (over 500 nt) and can be discarded
- to the supernatant, add a further Amount50 µL ampure XP beads (1:1 v/v) to the supernatant and incubate for Duration00:10:00 at room temperature in order to capture the remaining cfDNA fragments
- separate beads on a magnetic rack for Duration00:02:00 and discard the supernatant
- keeping the beads on the magnetic rack, wash twice with Amount200 µL 80% (v/v) pure ethanol
- after discarding the ethanol, allow the beads to dry for between Duration00:05:00 to Duration00:10:00 (do not leave for so long that cracks appear)
- elute the cfDNA from the beads by mixing well with Amount13 µL pure water and incubating at room temperature for Duration00:10:00
- place tube back on the magnetic rack to separate the beads, drawing up the eluate and storing in a new eppendorf tube


Assess size distribution, quality and quantity of resulting cfDNA using automated gel electrophoresis (2200 TapeStation and Expert software, Agilent) and the Qubit dsDNA high-sensitivity assay (Thermo Fisher Scientific). When using Qubit, it is a good idea to always use a control i.e. Amount1 µL of standard #2
Note: cfDNA can be stored at either Temperature4 °C or Temperature-20 °C

gDNA extraction
gDNA extraction
1h
1h
Thaw buffy coat at room temperature.
Extract germline DNA from buffy coat (containing peripheral blood mononuclear cells) using the QIAamp DNA blood mini kit (Qiagen) and elute into Amount200 µL buffer AE.

Shear extracted gDNA to 150-200 bp in length using ultrasonication (Covaris), with the following settings:
Duty Factor: 10%
Peak Incident Power: 175
Cycles per Burst: 200
Treatment Time: 360 seconds
Bath temperature: 4-8ºC
Quantify and qualify fragmented gDNA using automated gel electrophoresis (2200 TapeStation and Expert software, Agilent) and the Qubit dsDNA high-sensitivity assay (Thermo Fisher Scientific).
Library construction
Library construction
2h
2h
Use the ThruPLEX Tag-seq kit as per the manufacturer's protocol to generate molecularly-tagged libraries from both cfDNA and gDNA. The ideal input is the kit maximum of Amount50 ng as it will maximise the number of unique DNA fragments and improve chances of low frequency variant identification; however as little as Amount10 ng can also be used.

Clean-up the library fragments by using a 1:1 (v/v) ratio of AmpureXP beads (Beckman Coulter) and eluting in Amount33 µL pure water. Quantify using the qubit as before, and confirm library size (main peak should have moved from ~170 bp to ~340 bp.

If you have >400 ng of library, proceed to the next step. If not, perform 1-2 cycles of PCR enrichment following the protocol in Step 16 below.
Hybridisation capture
Hybridisation capture
4d
4d
Use the Agilent SureSelect target enrichment system (Integrated Sciences) to capture target regions of interest using a library of biotinylated RNA baits complementary to the regions of interest (Integrated Sciences). In addition to the reagents required, you will also need 4 blocking oligonucleotides (IDT) to allow compatability with the SureSelect libraries:

Thruplex p5-UMI-11nt ext
AATGATACGGCGACCACCGAGATCTACACAGGCGAAGACACTCTTTCCCTACACGACGCTCTTC
CGATCTXXXXXXTCAGTAGCTCA/3block/

Thruplex p5-UMI-8nt ext
AATGATACGGCGACCACCGAGATCTACACAGGCGAAGACACTCTTTCCCTACACGACGCTCTTC
CGATCTXXXXXXGTAGCTCA/3block/

Thruplex p7-UMI-11nt ext
CAAGCAGAAGACGGCATACGAGATXXXXXXXXGTGACTGGAGTTCAGACGTGTGCTCTTCCGAT
CTXXXXXXTCAGTAGCTCA/3block/

Thruplex p7-UMI-8nt ext
CAAGCAGAAGACGGCATACGAGATXXXXXXXXGTGACTGGAGTTCAGACGTGTGCTCTTCCGAT
CTXXXXXXGTAGCTCA/3block/

Note: ‘X’ represents either sample-unique barcode or unique molecular identifier (UMI) sequence regions.


Follow the manufactuer's protocol for the first capture, incubating the baits with the libraries for Duration24:00:00 at Temperature65 °C ; with the following alterations:
1. Replace SureSelect Block 3 with Amount0.5 µL of each of the indexing blockers, and 0Amount0.6 µL pure water (total volume of blocking mix is Amount7.6 µL )
2. Use 1 ul of capture library instead of the Amount2 µL suggested for libraries <3 Mb
3. Perform 4 washes with Amount400 µL of Wash Buffer 2, incubated for Duration00:20:00 each time
4. Re-suspend library in Amount14 µL pure water



Perform PCR enrichment of the capture libraries using all Amount14 µL of the captured library as follows:
Reagent Volume for 1 reaction
Nuclease-free water 19.5µl
5x Herculase Rxn Buffer (clear cap) 10µl
Herculase II Fusion DNA Polymerase (red cap) 1µl
100mM dNTP Mix (green cap) 0.5µl
10µM Illumina P5 primer 2.5µl
10µM Illumina P7 primer 2.5µl
Total 36µl

Run the following steps on the thermal cycler:
Step Number of cycles Temperature Time
Step 1 1 98ᵒC 2 mins
Step 2 16 98ᵒC 30 secs
57ᵒC30 secs
72ᵒC1 min
Step 3 1 72ᵒC 10 mins
Step 4 1 4ᵒC Hold

Complete the capture protocol by cleaning up the PCR enriched libraries with AmpureXP beads and elute in 33 ul pure water. Quantitate libraries using the Qubit as above. Libraries can be stored at Temperature4 °C for up to one week or at Temperature-20 °C for longer periods.

Perform DCAP-Seq by repeating the hybridisation capture and PCR enrichment procedures above with the following changes:
1. Incubate library with probes for Duration05:00:00 at Temperature65 °C instead of 24 hours
2. 5 PCR amplification cycles instead of 16
3. Resuspend final libraries in Amount33 µL pure water

Library sequencing and data processing
Library sequencing and data processing
Generate between 50 -70 million 100-150 bp paired end reads (for cfDNA), and approximately 1 million reads (for gDNA) on the Illumina HiSeq2500 on rapid mode.

Demultiplex libraries using per sample indices and map to the human reference genome (hg19) using BWA-MEM (v 0.7.12) and converted to BAM files for further analysis using Samtools (v 1.6).
Generate read families and perform error-correction using the molecular barcodes using Connor (v 0.5). Only reads with perfectly matching barcodes and at least 3 replicates should be clustered together as a family to increase the validity of each base call. A consensus frequency of at least 0.66 is required to generate a collapsed read.
Variant identification and filtering
Variant identification and filtering
Identify somatic variants by using VarDict (v 1.5.8) in paired samples mode using standard filter settings, with the exception of minmum variant frequency, which is reduced to 0.001%.

Filter resulting VCF files using VarSeq (Golden Helix) using the following settings:

Somatic variants
1. Status matches 'strong somatic'
2. Variant not located in intergenic regions
3. Variant must either be loss-of-function or a missense variant with a CADD PHRED score of 20 or greater
4. Variant not present in gnomAD Exomes Variant Frequencies v 2.1.1 (BROAD) database at greater than 0.001 frequency, or is missing altogether
5. Minimum read depth of 500X
6. Minimum variant allele frequency of 0.0025
7. Minimum of 3 supporting reads
8. Variant must have at least 1 supporting reverse read and 1 supporting forward read

Germline variants
1. Status matches 'germline'
2. Minimum read depth of 8X in the germline sample

Assess all remaining variants on Integrative Genomics Viewer (IGV) in order to remove any sequencing artefacts or common variants that may have been missed by gnomAD
CNV identification and filtering
CNV identification and filtering
Generate CNV calls per patient sample using VarSeq CNV Caller ( VS-CNV) using the following settings:

1. Set up the reference files by importing BAM files from germline (normal) samples that have undergone sequencing using the same targeted panel and methodology as the tumour samples. At least 30 references are preferred. Pre-deduplicated files may be preferred if post-connor depth is low
2. Open VarSeq project and calculate coverage regions using the BED file used for panel design
3. Generate CNV calls: as standard settings are not sensitive enough to allow cfDNA analysis, custom parameters need to be set on the CNV calling algorithm:
- maximum allowable difference between tumour and reference samples <30%
- mean z-score for duplications reduced to 1.2; and mean ratio reduced to 1.3
- mean z-score for heterozygous deletions reduced to -1.2; and mean ratio reduced to 0.85
4. Additional filters used are as follows, but may need to be optimised for different assays:
- Duplications require a p-value <0.0001 AND a z-score of >0.25 AND a minimum target depth of 50
- Deletions required to fulfill one of the following:
Have p-value <0.00001 AND ratio of <0.87
Have p-value <0.05 AND z-score <-0.85 AND ratio of <0.7