Next you'll log in to the TSCC with your given username and password.
Minimap2 and Samtools is already installed in the cluster.
#Using minimap2, map out skimming reads that match to your fasta file containing a set of 18S, 28S, or H3 from the most closely related species to your targeted sample. Preferably, you may have Sanger sequence of the targeted gene from the species, and can directly use this in order to obtain the full length of the gene. Deposit your reference sequence (.fasta) into the folder that your interleaved file is in. It is easiest to have them all in once place.
IMPORTANT: You FIRST need to initialize samtools before doing the commands below, as it won't extrtact your file if you dont.
It's really important to make sure you're in a familiar folder so the mapped file will be deposited into a place that's accessible. I would recommend creating a folder and the change directory to that folder so your data is input nicely.
module load samtools/1.13-fd7mbdu
/tscc/nfs/home/nmkoch/Programs/minimap2/minimap2/minimap2 -ax sr /Users/Avery/Dropbox/Polynoidae_Genome_Skimming/phylogenetic-analyses/minimap-skims/Macellicephalinae_18S.fasta /Users/Avery/Dropbox/Polynoidae_Genome_Skimming/A3432_Macellicephalinae_sp2/A3432_Interleaved.fastq | samtools fastq -n -F 4 - > A3432_Macellicephalinae_sp2_18Smapped.fq
It is only one space between the path to reference sequence and path to interleaved file.
path/to/minimap2 -ax sr /path_to_reference_sequence /path_to_interleaved_file \ samtools fastq -n -F 4 - > _sample_name_mapped_gene.fq