Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
Genome_Evaluation_Pipeline
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Requirements
Code
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Locked files
Deploy
Releases
Package registry
Container registry
Model registry
Operate
Terraform modules
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
Repository analytics
Issue analytics
Insights
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
cmazzoni
Genome_Evaluation_Pipeline
Commits
a98a21e5
Commit
a98a21e5
authored
2 years ago
by
james94
Browse files
Options
Downloads
Patches
Plain Diff
renamed envs
parent
f9b15972
Branches
Branches containing commit
No related tags found
No related merge requests found
Changes
3
Expand all
Hide whitespace changes
Inline
Side-by-side
Showing
3 changed files
rules/build_hifi.smk
+41
-39
41 additions, 39 deletions
rules/build_hifi.smk
rules/build_illumina.smk
+55
-49
55 additions, 49 deletions
rules/build_illumina.smk
rules/evaluate.smk
+128
-125
128 additions, 125 deletions
rules/evaluate.smk
with
224 additions
and
213 deletions
rules/build_hifi.smk
+
41
−
39
View file @
a98a21e5
localrules: symlinkUnzippedHifi, symlinkfornotSmartTrimmed, multiqc_hifi
localrules: symlink_UnzippedFastq_hifi, \
symlink_noSMRTBellAdaptTrim_hifi, \
multiQC_hifi
...
@@ -20,62 +22,62 @@ def hifi_notgzipped(wildcards):
...
@@ -20,62 +22,62 @@ def hifi_notgzipped(wildcards):
rule unzip
H
ifi:
rule unzip
Fastq_h
ifi:
input:
input:
fastq=hifi_gzipped,
fastq=hifi_gzipped,
output:
output:
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/{readCounter}.{smrtornot}.fastq"),
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/{readCounter}.{smrtornot}.fastq"),
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/
logs/
hifiReads/{readCounter}.{smrtornot}
_pigzUnzip
.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/
logs/pigzUnzip.
{readCounter}.{smrtornot}.log")
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
threads:
threads:
resource['unzip
H
ifi']['threads']
resource['unzip
Fastq_h
ifi']['threads']
resources:
resources:
mem_mb=resource['unzip
H
ifi']['mem_mb'],
mem_mb=resource['unzip
Fastq_h
ifi']['mem_mb'],
time=resource['unzip
H
ifi']['time'],
time=resource['unzip
Fastq_h
ifi']['time'],
shell:
shell:
"""
"""
pigz -p {threads} -c -d -k {input.fastq} > {output} 2> {log}
pigz -p {threads} -c -d -k {input.fastq} > {output} 2> {log}
"""
"""
rule symlinkUnzipped
H
ifi:
rule symlink
_
Unzipped
Fastq_h
ifi:
input:
input:
fastq=hifi_notgzipped,
fastq=hifi_notgzipped,
output:
output:
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/{readCounter}.{smrtornot}.fastq"),
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/{readCounter}.{smrtornot}.fastq"),
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/
logs/
hifiReads/{readCounter}.{smrtornot}
_pigzUnzip
.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/
logs/pigzUnzip.
{readCounter}.{smrtornot}.log")
container:
container:
None
None
shell:
shell:
"""
"""
ln -s {input.fastq} {output}
ln -s {input.fastq} {output}
echo "{input.fastq} no gzipped. Symlink created in place of expected decompressed file." > {log}
echo "{input.fastq} no
t
gzipped. Symlink created in place of expected decompressed file." > {log}
"""
"""
rule trimSMRT
b
ell:
rule trimSMRT
B
ell
Adapters_hifi
:
input:
input:
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/{readCounter}.smrtTrimmed.fastq"),
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/{readCounter}.smrtTrimmed.fastq"),
output:
output:
outputFile=os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_trimReads/{readCounter}.smrtTrimmed.fastq")
outputFile=os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_trimReads/{readCounter}.smrtTrimmed.fastq")
threads:
threads:
resource['trimSMRT
b
ell']['threads']
resource['trimSMRT
B
ell
Adapters_hifi
']['threads']
resources:
resources:
mem_mb=resource['trimSMRT
b
ell']['mem_mb'],
mem_mb=resource['trimSMRT
B
ell
Adapters_hifi
']['mem_mb'],
time=resource['trimSMRT
b
ell']['time'],
time=resource['trimSMRT
B
ell
Adapters_hifi
']['time'],
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/
logs/
hifiReads/{readCounter}
_trimSMRTbell
.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/
logs/trimSMRTbell.
{readCounter}.log")
priority:
priority:
15
15
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
shell:
shell:
"""
"""
(cutadapt -j {threads} -o {output.outputFile} {input} -b A
TCTCTCTC
AA
C
AA
C
AA
C
AA
CGGAGGAGGAGGAAAA
GAGAG
AGAT
-b ATCTCTCTCTTTTCCTCCTCCTCCGTTGTTGTTGTTGAGAGAGAT --discard-trimmed) &> {log}
(cutadapt -j {threads} -o {output.outputFile} {input} -b AAAAAAAAA
AAAAAAAAATTAACG
GAG
G
AG
GAGGA --overlap 35
-b ATCTCTCTCTTTTCCTCCTCCTCCGTTGTTGTTGTTGAGAGAGAT
--overlap 45 --revcomp -e 0.1
--discard-trimmed) &> {log}
"""
"""
rule symlink
fornotSmartTrimmed
:
rule symlink
_noSMRTBellAdaptTrim_hifi
:
input:
input:
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/{readCounter}.notsmrtTrimmed.fastq"),
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/{readCounter}.notsmrtTrimmed.fastq"),
output:
output:
...
@@ -87,7 +89,7 @@ rule symlinkfornotSmartTrimmed:
...
@@ -87,7 +89,7 @@ rule symlinkfornotSmartTrimmed:
ln -s {input} {output.outputFile}
ln -s {input} {output.outputFile}
"""
"""
rule fast
qc
_hifi:
rule fast
QC
_hifi:
input:
input:
os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_trimReads/{readCounter}.{smrtornot}.fastq")
os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_trimReads/{readCounter}.{smrtornot}.fastq")
params:
params:
...
@@ -95,20 +97,20 @@ rule fastqc_hifi:
...
@@ -95,20 +97,20 @@ rule fastqc_hifi:
output:
output:
os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/QC/fastqc/{readCounter}.{smrtornot}_fastqc.html")
os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/QC/fastqc/{readCounter}.{smrtornot}_fastqc.html")
threads:
threads:
resource['
trimSMRTbell
']['threads']
resource['
fastQC_hifi
']['threads']
resources:
resources:
mem_mb=resource['fast
qc
_hifi']['mem_mb'],
mem_mb=resource['fast
QC
_hifi']['mem_mb'],
time=resource['fast
qc
_hifi']['time'],
time=resource['fast
QC
_hifi']['time'],
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/logs/{readCounter}.{smrtornot}.
FastQC.
log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/logs/
fastQC.hifi.
{readCounter}.{smrtornot}.log")
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
shell:
shell:
"""
"""
(fastqc {input} -o {params.folder2out} -t {threads}) &> {log}
(fastqc {input} -o {params.folder2out} -t {threads}) &> {log}
"""
"""
rule multi
qc
_hifi:
rule multi
QC
_hifi:
input:
input:
lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/QC/fastqc/{readCounter}.{smrtornot}_fastqc.html"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], smrtornot=dictSamples[wildcards.sample][1])
lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/QC/fastqc/{readCounter}.{smrtornot}_fastqc.html"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], smrtornot=dictSamples[wildcards.sample][1])
params:
params:
...
@@ -118,37 +120,37 @@ rule multiqc_hifi:
...
@@ -118,37 +120,37 @@ rule multiqc_hifi:
output:
output:
os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/QC/multiqc/{sample}.multiqcReport.html")
os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/QC/multiqc/{sample}.multiqcReport.html")
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/logs/{sample}
.multiqc
.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/logs/
multiQC.
{sample}.log")
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
shell:
shell:
"(multiqc {params.folder2qc} -o {params.folder2OUT} -n {params.filename}) &> {log}"
"(multiqc {params.folder2qc} -o {params.folder2OUT} -n {params.filename}) &> {log}"
rule meryl_hifi
_count
:
rule meryl
Count
_hifi:
input:
input:
reads=os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_trimReads/{readCounter}.{smrtornot}.fastq")
reads=os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_trimReads/{readCounter}.{smrtornot}.fastq")
params:
params:
kmer = "{kmer}"
kmer = "{kmer}"
threads:
threads:
resource['meryl_hifi
_count
']['threads']
resource['meryl
Count
_hifi']['threads']
resources:
resources:
mem_mb=resource['meryl_hifi
_count
']['mem_mb'],
mem_mb=resource['meryl
Count
_hifi']['mem_mb'],
time=resource['meryl_hifi
_count
']['time'],
time=resource['meryl
Count
_hifi']['time'],
output:
output:
temp(directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/merylDb/" + "{readCounter}" + "_hifi_dB.{smrtornot}.{kmer}.meryl"))),
temp(directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/merylDb/" + "{readCounter}" + "_hifi_dB.{smrtornot}.{kmer}.meryl"))),
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/logs/{readCounter}
_hifi_
{kmer}.{smrtornot}.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/logs/
meryl_hifi_count.
{readCounter}
.
{kmer}.{smrtornot}.log")
priority:
priority:
10
10
conda:
conda:
os.path.join(workflow.basedir, "envs/
merylMerq_2
.yaml")
os.path.join(workflow.basedir, "envs/
MERYL_MERQURY
.yaml")
shell:
shell:
"""
"""
(meryl count k={params.kmer} threads={threads} {input.reads} output {output}) &> {log}
(meryl count k={params.kmer} threads={threads} {input.reads} output {output}) &> {log}
"""
"""
rule meryl_hifi
_build
:
rule meryl
Union
_hifi:
input:
input:
lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/merylDb/{readCounter}_hifi_dB.{smrtornot}.{kmer}.meryl/"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], kmer=dictSamples[wildcards.sample][0], smrtornot=dictSamples[wildcards.sample][1])
lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/merylDb/{readCounter}_hifi_dB.{smrtornot}.{kmer}.meryl/"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], kmer=dictSamples[wildcards.sample][0], smrtornot=dictSamples[wildcards.sample][1])
params:
params:
...
@@ -156,18 +158,18 @@ rule meryl_hifi_build:
...
@@ -156,18 +158,18 @@ rule meryl_hifi_build:
removeReadDIR_trimmed=os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_trimReads/"),
removeReadDIR_trimmed=os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_trimReads/"),
removeReadDIR_unzipped=os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/")
removeReadDIR_unzipped=os.path.join(config['Results'],"0_buildDatabases/{sample}/hifiReads/temp_unzipFastqs/")
threads:
threads:
resource['meryl_hifi
_build
']['threads']
resource['meryl
Union
_hifi']['threads']
resources:
resources:
mem_mb=resource['meryl_hifi
_build
']['mem_mb'],
mem_mb=resource['meryl
Union
_hifi']['mem_mb'],
time=resource['meryl_hifi
_build
']['time'],
time=resource['meryl
Union
_hifi']['time'],
output:
output:
directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/merylDb/complete_hifi.{sample}.{kmer}.meryl")),
directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/merylDb/complete_hifi.{sample}.{kmer}.meryl")),
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/logs/
{sample}.
meryl_hifi.{kmer}.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/hifiReads/logs/meryl_hifi
_combine.{sample}
.{kmer}.log")
priority:
priority:
10
10
conda:
conda:
os.path.join(workflow.basedir, "envs/
merylMerq_2
.yaml")
os.path.join(workflow.basedir, "envs/
MERYL_MERQURY
.yaml")
shell:
shell:
"""
"""
(meryl union-sum {input} output {output}) &> {log}
(meryl union-sum {input} output {output}) &> {log}
...
...
This diff is collapsed.
Click to expand it.
rules/build_illumina.smk
+
55
−
49
View file @
a98a21e5
...
@@ -2,7 +2,13 @@
...
@@ -2,7 +2,13 @@
localrules: symlinkUnzippedFastq_R1, symlinkUnzippedFastq_R2, symLink_trim10xbarcodes_notrimAdapt, symlinks_no10xwithAdaptTrim, symlinks_no10xOrAdaptTrim, symlink_trim10xbarcodesR2, multiqc_hifi
localrules: symlink_UnzippedFastq_R1_illumina,\
symlink_UnzippedFastq_R2_illumina, \
symLink_Trim10xBarcodes_noSequencingAdaptTrim_illumina, \
symlink_No10xWithSequencingAdaptTrim_illumina, \
symlink_No10xOrSequencingAdaptTrim_illumina, \
symlink_Trim10xBarcodes_R2_illumina, \
multiQC_illumina
...
@@ -19,7 +25,7 @@ def R2_gzipped(wildcards):
...
@@ -19,7 +25,7 @@ def R2_gzipped(wildcards):
def R2_notgzipped(wildcards):
def R2_notgzipped(wildcards):
return noGzip_R2.loc[(wildcards.sample, wildcards.readCounter), "Library_R2"]
return noGzip_R2.loc[(wildcards.sample, wildcards.readCounter), "Library_R2"]
rule unzipFastq_R1:
rule unzipFastq_R1
_illumina
:
input:
input:
assembly=R1_gzipped,
assembly=R1_gzipped,
output:
output:
...
@@ -27,18 +33,18 @@ rule unzipFastq_R1:
...
@@ -27,18 +33,18 @@ rule unzipFastq_R1:
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.{trim10x}.{trimAdapters}_R1_pigzUnzip.log"),
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.{trim10x}.{trimAdapters}_R1_pigzUnzip.log"),
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
threads:
threads:
resource['unzipFastq_R1']['threads']
resource['unzipFastq_R1
_illumina
']['threads']
resources:
resources:
mem_mb=resource['unzipFastq_R1']['mem_mb'],
mem_mb=resource['unzipFastq_R1
_illumina
']['mem_mb'],
time=resource['unzipFastq_R1']['time']
time=resource['unzipFastq_R1
_illumina
']['time']
shell:
shell:
"""
"""
pigz -p {threads} -c -d -k {input.assembly} > {output} 2> {log}
pigz -p {threads} -c -d -k {input.assembly} > {output} 2> {log}
"""
"""
rule symlinkUnzippedFastq_R1:
rule symlink
_
UnzippedFastq_R1
_illumina
:
input:
input:
assembly=R1_notgzipped,
assembly=R1_notgzipped,
output:
output:
...
@@ -50,7 +56,7 @@ rule symlinkUnzippedFastq_R1:
...
@@ -50,7 +56,7 @@ rule symlinkUnzippedFastq_R1:
ln -s {input} {output}
ln -s {input} {output}
"""
"""
rule unzipFastq_R2:
rule unzipFastq_R2
_illumina
:
input:
input:
assembly=R2_gzipped,
assembly=R2_gzipped,
output:
output:
...
@@ -58,18 +64,18 @@ rule unzipFastq_R2:
...
@@ -58,18 +64,18 @@ rule unzipFastq_R2:
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.{trim10x}.{trimAdapters}_R2_pigzUnzip.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.{trim10x}.{trimAdapters}_R2_pigzUnzip.log")
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
threads:
threads:
resource['unzipFastq_R2']['threads']
resource['unzipFastq_R2
_illumina
']['threads']
resources:
resources:
mem_mb=resource['unzipFastq_R2']['mem_mb'],
mem_mb=resource['unzipFastq_R2
_illumina
']['mem_mb'],
time=resource['unzipFastq_R2']['time']
time=resource['unzipFastq_R2
_illumina
']['time']
shell:
shell:
"""
"""
pigz -p {threads} -c -d -k {input.assembly} > {output} 2> {log}
pigz -p {threads} -c -d -k {input.assembly} > {output} 2> {log}
"""
"""
rule symlinkUnzippedFastq_R2:
rule symlink
_
UnzippedFastq_R2
_illumina
:
input:
input:
assembly=R2_notgzipped,
assembly=R2_notgzipped,
output:
output:
...
@@ -82,27 +88,27 @@ rule symlinkUnzippedFastq_R2:
...
@@ -82,27 +88,27 @@ rule symlinkUnzippedFastq_R2:
"""
"""
rule trim10x
b
arcodes:
rule trim10x
B
arcodes
_illumina
:
input:
input:
read1=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.10xTrimmed.{trimAdapters}_R1.fastq"),
read1=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.10xTrimmed.{trimAdapters}_R1.fastq"),
output:
output:
read1=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.10xTrimmed.{trimAdapters}_Read1.fastq"),
read1=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.10xTrimmed.{trimAdapters}_Read1.fastq"),
threads:
threads:
resource['trim10x
b
arcodes']['threads']
resource['trim10x
B
arcodes
_illumina
']['threads']
resources:
resources:
mem_mb=resource['trim10x
b
arcodes']['mem_mb'],
mem_mb=resource['trim10x
B
arcodes
_illumina
']['mem_mb'],
time=resource['trim10x
b
arcodes']['time']
time=resource['trim10x
B
arcodes
_illumina
']['time']
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.10xTrimmed.10BarcodeRemoval_Trimmomatic.{trimAdapters}.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.10xTrimmed.10BarcodeRemoval_Trimmomatic.{trimAdapters}.log")
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
shell:
shell:
"""
"""
(trimmomatic SE -threads {threads} {input.read1} {output.read1} HEADCROP:23) &> {log}
(trimmomatic SE -threads {threads} {input.read1} {output.read1} HEADCROP:23) &> {log}
"""
"""
rule symlink_
t
rim10x
b
arcodesR2:
rule symlink_
T
rim10x
B
arcodes
_
R2
_illumina
:
input:
input:
read2=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.10xTrimmed.{trimAdapters}_R2.fastq")
read2=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.10xTrimmed.{trimAdapters}_R2.fastq")
output:
output:
...
@@ -116,7 +122,7 @@ rule symlink_trim10xbarcodesR2:
...
@@ -116,7 +122,7 @@ rule symlink_trim10xbarcodesR2:
rule symLink_
t
rim10x
b
arcodes_no
trimAdapt
:
rule symLink_
T
rim10x
B
arcodes_no
SequencingAdaptTrim_illumina
:
input:
input:
read1=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.10xTrimmed.notAdaptTrimmed_Read1.fastq"),
read1=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.10xTrimmed.notAdaptTrimmed_Read1.fastq"),
read2=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.10xTrimmed.notAdaptTrimmed_Read2.fastq"),
read2=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.10xTrimmed.notAdaptTrimmed_Read2.fastq"),
...
@@ -131,7 +137,7 @@ rule symLink_trim10xbarcodes_notrimAdapt:
...
@@ -131,7 +137,7 @@ rule symLink_trim10xbarcodes_notrimAdapt:
ln -s {input.read2} {output.read2}
ln -s {input.read2} {output.read2}
"""
"""
rule symlink
s_n
o10xOr
AdaptTrim
:
rule symlink
_N
o10xOr
SequencingAdaptTrim_illumina
:
input:
input:
read1=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.not10xTrimmed.notAdaptTrimmed_R1.fastq"),
read1=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.not10xTrimmed.notAdaptTrimmed_R1.fastq"),
read2=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.not10xTrimmed.notAdaptTrimmed_R2.fastq")
read2=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.not10xTrimmed.notAdaptTrimmed_R2.fastq")
...
@@ -146,7 +152,7 @@ rule symlinks_no10xOrAdaptTrim:
...
@@ -146,7 +152,7 @@ rule symlinks_no10xOrAdaptTrim:
ln -s {input.read2} {output.read2}
ln -s {input.read2} {output.read2}
"""
"""
rule symlink
s_n
o10x
w
ith
AdaptTrim
:
rule symlink
_N
o10x
W
ith
SequencingAdaptTrim_illumina
:
input:
input:
read1=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.not10xTrimmed.AdaptTrimmed_R1.fastq"),
read1=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.not10xTrimmed.AdaptTrimmed_R1.fastq"),
read2=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.not10xTrimmed.AdaptTrimmed_R2.fastq")
read2=os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/temp_unzipFastqs/{readCounter}.not10xTrimmed.AdaptTrimmed_R2.fastq")
...
@@ -161,7 +167,7 @@ rule symlinks_no10xwithAdaptTrim:
...
@@ -161,7 +167,7 @@ rule symlinks_no10xwithAdaptTrim:
ln -s {input.read2} {output.read2}
ln -s {input.read2} {output.read2}
"""
"""
rule trim
Adapters
:
rule trim
SequencingAdapters_illumina
:
input:
input:
read1= os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.AdaptTrimmed_Read1.fastq"),
read1= os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.AdaptTrimmed_Read1.fastq"),
read2= os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.AdaptTrimmed_Read2.fastq"),
read2= os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.AdaptTrimmed_Read2.fastq"),
...
@@ -172,20 +178,20 @@ rule trimAdapters:
...
@@ -172,20 +178,20 @@ rule trimAdapters:
read1=temp(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.AdaptTrimmed_val_1.fq")),
read1=temp(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.AdaptTrimmed_val_1.fq")),
read2=temp(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.AdaptTrimmed_val_2.fq"))
read2=temp(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.AdaptTrimmed_val_2.fq"))
threads:
threads:
resource['trim
Adapters
']['threads']
resource['trim
SequencingAdapters_illumina
']['threads']
resources:
resources:
mem_mb=resource['trim
Adapters
']['mem_mb'],
mem_mb=resource['trim
SequencingAdapters_illumina
']['mem_mb'],
time=resource['trim
Adapters
']['time'],
time=resource['trim
SequencingAdapters_illumina
']['time'],
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.{trim10x}.AdaptTrimmed_tGalore.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.{trim10x}.AdaptTrimmed_tGalore.log")
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
shell:
shell:
"""
"""
(trim_galore -j {threads} --basename {params.r1_prefix} --dont_gzip --length 65 -o {params.outputDir} --paired {input.read1} {input.read2}) &> {log}
(trim_galore -j {threads} --basename {params.r1_prefix} --dont_gzip --length 65 -o {params.outputDir} --paired {input.read1} {input.read2}) &> {log}
"""
"""
rule fast
qc_I
llumina:
rule fast
QC_i
llumina:
input:
input:
read1=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_1.fq"),
read1=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_1.fq"),
read2=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_2.fq")
read2=os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_2.fq")
...
@@ -197,16 +203,16 @@ rule fastqc_Illumina:
...
@@ -197,16 +203,16 @@ rule fastqc_Illumina:
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.{trim10x}.{trimAdapters}_fastqc.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{readCounter}.{trim10x}.{trimAdapters}_fastqc.log")
threads:
threads:
resource['fast
qc_I
llumina']['threads']
resource['fast
QC_i
llumina']['threads']
resources:
resources:
mem_mb=resource['fast
qc_I
llumina']['mem_mb'],
mem_mb=resource['fast
QC_i
llumina']['mem_mb'],
time=resource['fast
qc_I
llumina']['time'],
time=resource['fast
QC_i
llumina']['time'],
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
shell:
shell:
"(fastqc {input} -o {params.folder2out} -t {threads}) &> {log}"
"(fastqc {input} -o {params.folder2out} -t {threads}) &> {log}"
rule multi
qc_hifi
:
rule multi
QC_illumina
:
input:
input:
read1=lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/QC/fastqc/{readCounter}.{trim10x}.{trimAdapters}_val_1_fastqc.html"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], trim10x=dictSamples[wildcards.sample][1], trimAdapters=dictSamples[wildcards.sample][2]),
read1=lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/QC/fastqc/{readCounter}.{trim10x}.{trimAdapters}_val_1_fastqc.html"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], trim10x=dictSamples[wildcards.sample][1], trimAdapters=dictSamples[wildcards.sample][2]),
read2=lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/QC/fastqc/{readCounter}.{trim10x}.{trimAdapters}_val_2_fastqc.html"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], trim10x=dictSamples[wildcards.sample][1], trimAdapters=dictSamples[wildcards.sample][2])
read2=lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/QC/fastqc/{readCounter}.{trim10x}.{trimAdapters}_val_2_fastqc.html"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], trim10x=dictSamples[wildcards.sample][1], trimAdapters=dictSamples[wildcards.sample][2])
...
@@ -219,22 +225,22 @@ rule multiqc_hifi:
...
@@ -219,22 +225,22 @@ rule multiqc_hifi:
log:
log:
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{sample}.multiqc.log")
os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/logs/{sample}.multiqc.log")
conda:
conda:
os.path.join(workflow.basedir, "envs/
pigz
.yaml")
os.path.join(workflow.basedir, "envs/
UNZIP_and_QC
.yaml")
shell:
shell:
"(multiqc {params.folder2qc} -o {params.folder2out} -n {params.filename}) &> {log}"
"(multiqc {params.folder2qc} -o {params.folder2out} -n {params.filename}) &> {log}"
rule meryl
_R1
:
rule meryl
Count_R1_illumina
:
input:
input:
read1= os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_1.fq")
read1= os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_1.fq")
params:
params:
kmer = "{kmer}",
kmer = "{kmer}",
threads:
threads:
resource['meryl
_R1
']['threads']
resource['meryl
Count_R1_illumina
']['threads']
resources:
resources:
mem_mb=resource['meryl
_R1
']['mem_mb'],
mem_mb=resource['meryl
Count_R1_illumina
']['mem_mb'],
time=resource['meryl
_R1
']['time'],
time=resource['meryl
Count_R1_illumina
']['time'],
output:
output:
temp(directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/merylDb/{readCounter}.{trim10x}.{trimAdapters}_R1.{kmer}.meryl")))
temp(directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/merylDb/{readCounter}.{trim10x}.{trimAdapters}_R1.{kmer}.meryl")))
log:
log:
...
@@ -242,23 +248,23 @@ rule meryl_R1:
...
@@ -242,23 +248,23 @@ rule meryl_R1:
priority:
priority:
10
10
conda:
conda:
os.path.join(workflow.basedir, "envs/
merylMerq_2
.yaml")
os.path.join(workflow.basedir, "envs/
MERYL_MERQURY
.yaml")
shell:
shell:
"""
"""
export OMP_NUM_THREADS={threads}
export OMP_NUM_THREADS={threads}
(meryl count k={params.kmer} threads={threads} {input.read1} output {output}) &> {log}
(meryl count k={params.kmer} threads={threads} {input.read1} output {output}) &> {log}
"""
"""
rule meryl
_R2
:
rule meryl
Count_R2_illumina
:
input:
input:
read2= os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_2.fq")
read2= os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_2.fq")
params:
params:
kmer = "{kmer}",
kmer = "{kmer}",
threads:
threads:
resource['meryl
_R2
']['threads']
resource['meryl
Count_R2_illumina
']['threads']
resources:
resources:
mem_mb=resource['meryl
_R2
']['mem_mb'],
mem_mb=resource['meryl
Count_R2_illumina
']['mem_mb'],
time=resource['meryl
_R2
']['time'],
time=resource['meryl
Count_R2_illumina
']['time'],
output:
output:
temp(directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/merylDb/{readCounter}.{trim10x}.{trimAdapters}_R2.{kmer}.meryl")))
temp(directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/merylDb/{readCounter}.{trim10x}.{trimAdapters}_R2.{kmer}.meryl")))
log:
log:
...
@@ -266,7 +272,7 @@ rule meryl_R2:
...
@@ -266,7 +272,7 @@ rule meryl_R2:
priority:
priority:
10
10
conda:
conda:
os.path.join(workflow.basedir, "envs/
merylMerq_2
.yaml")
os.path.join(workflow.basedir, "envs/
MERYL_MERQURY
.yaml")
shell:
shell:
"""
"""
export OMP_NUM_THREADS={threads}
export OMP_NUM_THREADS={threads}
...
@@ -274,7 +280,7 @@ rule meryl_R2:
...
@@ -274,7 +280,7 @@ rule meryl_R2:
"""
"""
rule meryl_illumina
_build
:
rule meryl
Union
_illumina:
input:
input:
# removeReads1=lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_1.fq"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], trim10x=dictSamples[wildcards.sample][1], trimAdapters=dictSamples[wildcards.sample][2]),
# removeReads1=lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_1.fq"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], trim10x=dictSamples[wildcards.sample][1], trimAdapters=dictSamples[wildcards.sample][2]),
# removeReads2=lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_2.fq"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], trim10x=dictSamples[wildcards.sample][1], trimAdapters=dictSamples[wildcards.sample][2]),
# removeReads2=lambda wildcards: expand(os.path.join(config['Results'],"0_buildDatabases/{sample}/illuminaReads/temp_trimReads/{readCounter}.{trim10x}.{trimAdapters}_val_2.fq"), sample=wildcards.sample, readCounter=dictReadCounter[wildcards.sample], trim10x=dictSamples[wildcards.sample][1], trimAdapters=dictSamples[wildcards.sample][2]),
...
@@ -286,10 +292,10 @@ rule meryl_illumina_build:
...
@@ -286,10 +292,10 @@ rule meryl_illumina_build:
kmer = "{kmer}",
kmer = "{kmer}",
path= os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/merylDb/")
path= os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/merylDb/")
threads:
threads:
resource['meryl_illumina
_build
']['threads']
resource['meryl
Union
_illumina']['threads']
resources:
resources:
mem_mb=resource['meryl_illumina
_build
']['mem_mb'],
mem_mb=resource['meryl
Union
_illumina']['mem_mb'],
time=resource['meryl_illumina
_build
']['time'],
time=resource['meryl
Union
_illumina']['time'],
output:
output:
directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/merylDb/complete_illumina.{sample}.{kmer}.meryl")),
directory(os.path.join(config['Results'], "0_buildDatabases/{sample}/illuminaReads/merylDb/complete_illumina.{sample}.{kmer}.meryl")),
log:
log:
...
@@ -297,7 +303,7 @@ rule meryl_illumina_build:
...
@@ -297,7 +303,7 @@ rule meryl_illumina_build:
priority:
priority:
10
10
conda:
conda:
os.path.join(workflow.basedir, "envs/
merylMerq_2
.yaml")
os.path.join(workflow.basedir, "envs/
MERYL_MERQURY
.yaml")
shell:
shell:
"""
"""
export OMP_NUM_THREADS={threads}
export OMP_NUM_THREADS={threads}
...
...
This diff is collapsed.
Click to expand it.
rules/evaluate.smk
+
128
−
125
View file @
a98a21e5
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment