This method was developed at the FDA’s Center for Food Safety and Applied Nutrition for GenomeTrakr’s pandemic response project, monitoring SARS-CoV-2 variants in wastewater; however, this protocol was written to be broadly applicable for all wastewater sequence data submissions to NCBI. Protocols developed for this project cover wastewater collection, concentration, RNA extraction, RT-qPCR, library prep, genome sequencing, quality control checks, and data submission to NCBI.
This protocol covers the last step: making your data public at NCBI. Specifically, it provides the steps to establish a new NCBI submission environment for your laboratory, including the creation of new BioProject(s) and submission groups. Once these are set up, the protocol walks through the process for submitting raw reads to SRA and sample metadata to BioSample through the Submission Portal.
For new submitters, a fair amount of groundwork must be completed before a laboratory can make its first data submission. We recommend that one person in the laboratory take a few days to get everything set up in advance of the first planned submission.
If you need a pipeline for frequent or large-volume submissions, follow Step 1 in this protocol to get your NCBI submission environment established, then contact gb-admin@ncbi.nlm.nih.gov to set up an account for submitting through the API.
V2: minor edits to the BioSample and SRA templates
V3: Adapted the protocol to be more broadly applicable to submitters outside of FDA's wastewater project. Updates were also made to both metadata templates, including a new attribute in the SRA metadata template called "enrichment_kit".
V4: updates to BioSample and SRA templates: expanded picklists, addition of specimen processing attributes for recording replicate information, and removal of the target_extract attribute, which reported the level of target found in the sample.
V5: includes guidance for submitting BioSamples with no linked sequence data.
V6: Updated templates. BioSample: added picklist for PCR concentration units. SRA: added new quality control attributes.
V7: SRA and BioSample template updates