License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
Created: August 10, 2016
Last Modified: February 24, 2018
Protocol Integer ID: 3390
Abstract
Provides a short introduction to MEGAHIT, IDBA-UD, and SPAdes assemblers, a demo on Prodigal Gene Caller, and determining % of reads and contig coverage using Bowtie2 short read aligner.
Open this protocol inside the virtual machine (details in 'Start Instructions') for easy copy, paste of commands into the command line terminal window.
Before starting, please visit the ECOGEO website for more information on this "Introduction to Environmental 'Omics" training series. The site contains a pre-packaged virtual machine that can be downloaded and used to run all of the protocols in this protocols.io collection. In addition to the VM, the website contains video and presentations from our initial "Intro to Env 'Omics" workshop held at the Univ. of Hawai'i at Manoa on 25-26 Jul 2016.
Please email ‘ecogeo-join@earthcube.org’ to join the ECOGEO listserv for future updates.
Introduction to assemblers
Introduction to assemblers
Move to directory containing assemblers.
Command
$ cd /home/c-debi/ecogeo/assembly
View assembler parameters for MEGAHIT v1.0.3, IDBA-UD v1.1.1, and SPAdes v3.7.1
Command
These commands will show parameters for each assembler.
$ megahit
$ idba_ud
$ spades.py
Trimmomatic Quality Control:
Command
This step has already been completed for you.
PLEASE NOTE: Commands in black on the presentation, video should NOT be executed in the VM (assembly steps require more computational power).
Reference assessment: QUAST can perform comparisons against the reference genomes used to construct artifiial metagenome. Start with a baseline size of contiges (>1kb).