... | ... | @@ -71,7 +71,7 @@ plot_len1.pl test.clstr \ |
|
|
- Like any good research story - one question always leads to another, and one answer opens up the rabbit hole of unending possibilities
|
|
|
- And this is where the ensuing chapter arrives. Let's call it "[guppy](https://denbi-nanopore-training-course.readthedocs.io/en/latest/basecalling/basecalling.html)"!
|
|
|
|
|
|
## Chapter II - Running 'methylation-aware' basecalling to improve the otherwise dismal sequencing error rates from ONT
|
|
|
## Chapter II - Running `methylation-aware` basecalling to improve the otherwise dismal sequencing error rates from ONT
|
|
|
- Installed guppy as follows:
|
|
|
```
|
|
|
# Installation
|
... | ... | @@ -110,14 +110,14 @@ snakemake -p --use-conda COVERAGE_OF_REFERENCES |
|
|
less default.log
|
|
|
```
|
|
|
|
|
|
## Chapter III - What about 'non-methlyation-aware' basecalling, a.k.a. "non-mod" basecalling?
|
|
|
## Chapter III - What about 'non-methlyation-aware' basecalling, a.k.a. `non-mod` basecalling?
|
|
|
- The question arose as to what effect this might have
|
|
|
- What better way to answer this, than test it?!
|
|
|
```
|
|
|
#########################
|
|
|
### NO_MOD_basecalled ###
|
|
|
#########################
|
|
|
- Running the basecalled data that was generated without any "modifications" or "methylation-awareness"
|
|
|
- Running the basecalled data that was generated `without` any "modifications" or "methylation-awareness"
|
|
|
- To do this, we first have to move the "results" folder to a new name
|
|
|
- then create a new "results" and keep the "basecalled_NO_MOD", but rename this folder to "basecalled"
|
|
|
- Can be done via script as follows:
|
... | ... | @@ -190,8 +190,8 @@ cat lr/GDB_2018_metaT_reads-x-barcode07-flye_contigs.avg_cov.txt | awk '{x+=$2; |
|
|
#30.7486
|
|
|
```
|
|
|
|
|
|
## Chapter VI - the "big kahuna"
|
|
|
- This part of the story is very much "serendipitous"
|
|
|
## Chapter VI - the ``big kahuna``
|
|
|
- This part of the story is very much `serendipitous`
|
|
|
- Having a 1200 line Snakefile can be amazing; unless, for some unknown reason the entire run is triggered even when files are present
|
|
|
- Due to this slight inconvenience of time, we decided to break down the Snakefile into "modular" sections
|
|
|
- All the rules were broken down to [rules](https://git-r3lab.uni.lu/susheel.busi/ont_pilot_gitlab/-/tree/checkpoint_snakefile/2019_GDB%2Frulesurl) and [workflows](https://git-r3lab.uni.lu/susheel.busi/ont_pilot_gitlab/-/tree/checkpoint_snakefile/2019_GDB%2Fworkflowsurl)
|
... | ... | |