License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
Protocol has been used internally, and by external collaborating laboratories.
DISCLAIMER – FOR INFORMATIONAL PURPOSES ONLY; USE AT YOUR OWN RISK
The protocol content here is for informational purposes only and does not constitute legal, medical, clinical, or safety advice, or otherwise; content added to protocols.io is not peer reviewed and may not have undergone a formal approval of any kind. Information presented in this protocol should not substitute for independent professional judgment, advice, diagnosis, or treatment. Any action you take or refrain from taking using or relying upon the information presented here is strictly at your own risk. You agree that neither the Company nor any of the authors, contributors, administrators, or anyone else associated with protocols.io, can be held responsible for your use of the information contained in or linked to this protocol or any of our Sites/Apps and Services.
Abstract
Sequins are synthetic RNA controls that that are ‘spiked-in’ to your RNA sample, and undergo concurrent library preparation, sequencing and analysis. The sequins are then analyzed as internal controls in the output NGS library.
This protocol describes the laboratory steps required to dilute, store and spike the sequins into your RNA sample prior to library preparation for RNA sequencing. We also describe the bioinformatic steps to analyse sequins within your read (.FASTQ) or alignment (.BAM) files.
Materials
MATERIALS
RNA sequins standardsSequins
Before start
Install Anaquin Software.
To analyze sequins, we have developed a software toolkit, named anaquin, that accepts .FASTQ or .BAM formats, and can be integrated into your RNAseq bioinformatic pipeline. When anaquin rna processes the .FASTQ or .BAM files, it performs two main functions:
(i) Calibrate. The number/fraction of sequin reads in a library can be modulated. For example, anaquin rna can calibrate the number of sequins reads to comprise 1% of the library (using the –calibrate 0.01 option). This tool is useful for matching dilution between multiple replicates and samples.
(ii) Report. Anaquin rna generates several useful reports, including on library performance (rna_summary.stats), quantitative accuracy (rna_sequins_table.tsv), and individual sequin performance (rna_sequins.tsv).
Briefly, total RNA was extracted from K562 cell line with sequins (Mix A) was added. Libraries were prepared using the KAPA Stranded mRNA-SeqTM and sequenced using IIlumina HiSeq 2500TM.
Laboratory Steps
Laboratory Steps
Receiving sequins.
Upon receipt of RNA sequins, first check to ensure they have not thawed during shipment. Please contact us if you have any concerns. Immediately transfer the RNA sequins to frozen storage at -80°C (sequins should not be stored in a -20°C frost-free freezer).
Each tube contains RNA sequins in 10 μL solution, which is typically sufficient for ~100 RNAseq libraries. On first thaw, spin the tube down to collect the contents at the bottom of tube, and prepare smaller single-use aliquots to minimize subsequent freeze-thaw cycles.
Figure 1. Example traces of RNA sequins using an 2100 BioAnalyzer with the RNA Nano Kit (Agilent Technologies) for (left upper) neat Sequin Mixture A and (left lower) neat Sequins Mixture B. Also shown are example traces for (right upper) K562 with Sequin Mixture A and (right lower) GM12878 with Sequins Mixture B.
Preparing sequin stocks.
RNA sequins are provided in solution in nuclease-free water at a concentration of 15 ng/μL. Please use the table below to determine the amount and dilution of sequins that should be added to the sample RNA amount (please note users should dilute sequins even further when using with targeted RNA sequencing applications) :
Sample RNA Amount.
Dilution Volume (ddH20)
Diluted Sequins Concentration.
Sequin Volume to
Add to Sample.
Mass of Sequins
Added.
Final Sequin
Concentration
20ng
740ul
0.2ng/ul
1ul
0.2ng
1%
50ng
290ul
0.5ng/ul
1ul
0.5ng
1%
100ng
140ul
1ng/ul
1ul
1.0ng
1%
500ng
20ul
5ng/ul
1ul
5.0ng
1%
1000ng
6ul
10ng/ul
1ul
10.0ng
1%
Table 1. Guidelines for diluting RNA sequins according to sample RNA amounts (recommended 1% spike-in).
NOTE | RNA sequins are provided in two mixtures (A and B). Each mixture contain the same sequin transcripts, but at different molar ratios, thereby emulating fold-change differences in gene expression and alternative splicing between the two mixtures. Mixture A and B can be added to different samples to allow the detection of these known fold-differences.
Sequencing.
The library that is generated from the combined RNA sample and sequins is then sequenced according to manufacturer’s instructions.
Analysis of .FASTQ libraries (Option 1).
Analysis of .FASTQ libraries (Option 1).
Analysis of .FASTQ libraries (option 1).
To directly analyze the sequin from your library .FASTQ files, run the following command:
In this example command, we used the --calibrate 0.005 option to subsample sequin alignments to comprise 0.5% of total alignments from the NGS library.
Output Results.
Output Results.
When anaquin rna is complete, the following files are generated in the output directory:
Analysis.
anaquin.log – Log files recording the usage and execution of sequin processes.
rna_report.html – Useful visual report describing sequin performance in library. Load with browser.
rna_summary.txt - Summary statistics describing sequins and libraries.
rna_sequins.tsv - Detailed statistics on each individual sequin.
rna_sequins_calibrated.tsv - Detailed statistics on individual sequins following calibration.
Libraries.
rna_sequin_gene_table.tsv – gene-level quantification of sequins.
rna_sequin_isoform_table.tsv – isoform-level quantification of sequins.