... | ... | @@ -62,6 +62,72 @@ plot_len1.pl test.clstr \ |
|
|
```
|
|
|
- however the need to re-run the same samples by swapping was not only cumbersome, but depending on the orientation of the samples, obtained non-overlapping sets of fasta list
|
|
|
|
|
|
## So, I decided to pursue [mmseqs2](https://github.com/soedinglab/MMseqs2) instead
|
|
|
# So, I decided to pursue [mmseqs2](https://github.com/soedinglab/MMseqs2) instead
|
|
|
* The output of this is now incorporated into the [updated_SNAKEFILE](https://git-r3lab.uni.lu/susheel.busi/ont_pilot_gitlab/-/blob/master/2019_GDB/updated_SNAKEFILE) that will be used for the analyses going forward.
|
|
|
* see also [MMSEQ_RULES](https://git-r3lab.uni.lu/susheel.busi/ont_pilot_gitlab/-/blob/master/2019_GDB/rules/MMSEQ_RULES) |
|
|
\ No newline at end of file |
|
|
* see also [MMSEQ_RULES](https://git-r3lab.uni.lu/susheel.busi/ont_pilot_gitlab/-/blob/master/2019_GDB/rules/MMSEQ_RULES)
|
|
|
|
|
|
## And this is where the fun begins..
|
|
|
- Like any good research story - one question always leads to another, and one answer opens up the rabbit hole of unending possibilities
|
|
|
- And this is where the ensuing chapter arrives. Let's call it "[guppy](https://denbi-nanopore-training-course.readthedocs.io/en/latest/basecalling/basecalling.html)"!
|
|
|
|
|
|
## Chapter II - Running 'methylation-aware' basecalling to improve the otherwise dismal sequencing error rates from ONT
|
|
|
- Installed guppy as follows:
|
|
|
```
|
|
|
# Installation
|
|
|
cd /home/users/sbusi/apps
|
|
|
wget https://mirror.oxfordnanoportal.com/software/analysis/ont-guppy_research_rle_0.0.0_linux64.tar.gz
|
|
|
tar -xzvf ont-guppy_research_rle_0.0.0_linux64.tar.gz
|
|
|
export PATH=$PATH:"/home/users/sbusi/apps/ont-guppy/bin/"
|
|
|
```
|
|
|
- Basecalling with methylation-aware latest 'guppy' update
|
|
|
```
|
|
|
# first one has to make the appropriate directories
|
|
|
cd /scratch/users/sbusi/ONT/
|
|
|
mkdir cedric_ont_basecalling/
|
|
|
|
|
|
cd /scratch/users/sbusi/ONT/cedric_ont_basecalling
|
|
|
cp /scratch/users/claczny/ont/fecal_pilot_testing/cluster.json .
|
|
|
cp /scratch/users/claczny/ont/fecal_pilot_testing/config.yaml .
|
|
|
cp /scratch/users/claczny/ont/fecal_pilot_testing/Snakefile .
|
|
|
cp /scratch/users/claczny/ont/fecal_pilot_testing/src/* .
|
|
|
mkdir src && mv *.sh src/.
|
|
|
|
|
|
# edited the above files as needed with correct paths (OR) do the following
|
|
|
ln -sd /scratch/users/claczny/ont/fecal_pilot_testing/data data
|
|
|
ln -sd /scratch/users/claczny/ont/fecal_pilot_testing/dbs dbs
|
|
|
ln -sd /scratch/users/claczny/ont/fecal_pilot_testing/envs envs
|
|
|
ln -sd /scratch/users/claczny/ont/fecal_pilot_testing/scripts scripts
|
|
|
|
|
|
# basecalling run as follows after modifying the "rule all" inputs for basecalling.done
|
|
|
./src/snakemake_run_use_conda.sh
|
|
|
|
|
|
# individual snakemake rules can also be run separately as follows:
|
|
|
snakemake -p --use-conda BASECALL_MERGE_QC
|
|
|
snakemake -p --use-conda COVERAGE_OF_REFERENCES
|
|
|
|
|
|
# LOG can be checked as follows:
|
|
|
less default.log
|
|
|
```
|
|
|
|
|
|
##### GITLAB #####
|
|
|
- copying all changed files to gitlab: https://git-r3lab.uni.lu/susheel.busi/ont_pilot_gitlab
|
|
|
```
|
|
|
cd /scratch/users/sbusi/ONT/cedric_ont_basecalling
|
|
|
git pull https://git-r3lab.uni.lu/susheel.busi/ont_pilot_gitlab.git
|
|
|
|
|
|
cd /scratch/users/sbusi/ONT/cedric_ont_basecalling/ont_pilot_gitlab
|
|
|
cp ../Snakefile .
|
|
|
|
|
|
git checkout work
|
|
|
git add Snakefile
|
|
|
git commit -m "any message of note"
|
|
|
git push
|
|
|
# go to the provided URL, and then submit the merge request, for it to be approved later.
|
|
|
```
|
|
|
|
|
|
##### SLURM files #####
|
|
|
- moving all slurm files into one folder for 'cleaner' appearance
|
|
|
```
|
|
|
mkdir slurm_files
|
|
|
mv slurm* slurm_files/.
|
|
|
``` |
|
|
\ No newline at end of file |