Protocol Citation: Martina Albuja-Quintana, Gabriela Pozo, Milton Gordillo-Romero, Carolina E. Armijos, María e Lourdes Torres 2024. Vaccinium floribundum Genome Assembly and Annotation Script. protocols.io https://dx.doi.org/10.17504/protocols.io.n92ldmo4nl5b/v1
License: This is an open access protocol distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited
Protocol status: Working
We use this protocol and it's working
Created: March 15, 2024
Last Modified: April 03, 2024
Protocol Integer ID: 96735
Keywords: vaccinium floribundum genome assembly, vaccinum floribundum specimen, whole genome of this species, genome, whole genome, genome completeness, illumina short read, sequencing statistics, ab initio gene predictor snap, annotation script oxford nanopore, sequencing, ont long read, short read, long read, oxford nanopore, flye assembly, species, resulting flye assembly, dna
Funders Acknowledgements:
Fondos COCIBA USFQ
Abstract
Oxford Nanopore long reads and Illumina short reads obtained from sequencing the DNA of a Vaccinum floribundum specimen from the Paramo region in Ecuador, were used to assemble and annotate the whole genome of this species. ONT long reads were filtered and trimmed in Nanofilt and Porechop. Sequencing statistics were visualized in Nanoplot. Illumina short reads were evaluated with fastqc. Different assemblers and polishers were used with both long and short reads. The resulting Flye assembly polished with Polca and Medaka was then analyzed for genome completeness and quality with Quast, BUSCO, LAI Index, and Coverage Graph. The assembly was later annotated in Maker in 3 consecutive rounds using the ab initio gene predictor SNAP.
Troubleshooting
Oxford Nanopore Sequencing - Raw Reads Filtering, Trimming, and Statistics
4. Extract 10% of ONT and Illumina reads (total reads were obtained from previously run Nanoplot)
BWA (RRID:SCR_010910)
reformat.sh in1=FC225MJ3LT3_L1_Mort-Illumina_1.fq.gz in2=FC225MJ3LT3_L1_Mort-Illumina_2.fq.gz out1=FC225MJ3LT3_L1_Mort-Illumina_1_subset_10.fq.gz out2=FC225MJ3LT3_L1_Mort-Illumina_2_subset_10.fq.gz reads=# of total reads samplereadstarget=# of 10% reads
5. Map ONT reads and illumina reads, seperately, to the assembly