Skip to content
GitLab
Explore
Sign in
Primary navigation
Search or go to…
Project
T
tasks
Manage
Activity
Members
Labels
Plan
Issues
Issue boards
Milestones
Wiki
Code
Merge requests
Repository
Branches
Commits
Tags
Repository graph
Compare revisions
Snippets
Build
Pipelines
Jobs
Pipeline schedules
Artifacts
Deploy
Releases
Container Registry
Model registry
Operate
Environments
Monitor
Incidents
Analyze
Value stream analytics
Contributor analytics
CI/CD analytics
Repository analytics
Model experiments
Help
Help
Support
GitLab documentation
Compare GitLab plans
Community forum
Contribute to GitLab
Provide feedback
Keyboard shortcuts
?
Snippets
Groups
Projects
Show more breadcrumbs
biowdl
tasks
Commits
ebba8927
Commit
ebba8927
authored
4 years ago
by
Ruben Vorderman
Browse files
Options
Downloads
Plain Diff
update changelog
parents
d0207b7f
fdab6172
No related branches found
Branches containing commit
No related tags found
Tags containing commit
No related merge requests found
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
CHANGELOG.md
+6
-2
6 additions, 2 deletions
CHANGELOG.md
picard.wdl
+15
-7
15 additions, 7 deletions
picard.wdl
with
21 additions
and
9 deletions
CHANGELOG.md
+
6
−
2
View file @
ebba8927
...
...
@@ -18,11 +18,15 @@ version 4.0.0-develop
+
Hisat2 task has added controls for samtools.
+
Alignment tasks no longer produce BAM indexes as these are not needed
by the markduplicates step.
+
Picard Markduplicates now uses 7G of RAM just like in GATK's best practice
example pipeline.
+
Picard SortSam added as a task.
+
Md5 files are no longer created by default on Picard tasks that generate
BAM files.
+
Changed PicardMarkduplicates to use COMPRESSION_LEVEL=1 by default
speeding up execution by 2x at the cost of a 20% larger BAM file.
+
Changed PicardMarkduplicates to use COMPRESSION_LEVEL=1 by default with
the htsjdk deflater.
This makes the task finish in 32% less time at the cost of a 8% larger BAM
file.
+
Added sambamba markdup and sambamba sort. NOTE: samtools sort is more
efficient and is recommended.
+
Correctly represent samtools inconsistent use of the threads flag.
...
...
This diff is collapsed.
Click to expand it.
picard.wdl
+
15
−
7
View file @
ebba8927
...
...
@@ -467,9 +467,15 @@ task MarkDuplicates {
String metricsPath
Int compressionLevel = 1
Boolean createMd5File = false
Boolean useJdkInflater = true # Slightly faster than the intel one.
# Better results for compression level 1 (much smaller). Higher compression levels similar to intel deflater.
Boolean useJdkDeflater = true
# In GATK Best practices pipeline MarkDuplicates is given a 7G VM.
# https://github.com/gatk-workflows/broad-prod-wgs-germline-snps-indels/blob/d2934ed656ade44801f9cfe1c0e78d4f80684b7b/PairedEndSingleSampleWf-fc-hg38.wdl#L1040
Int javaXmxMb = 6656 # 6.5G
String memoryMb = javaXmxMb + 512
String memory = "9G"
String javaXmx = "8G"
Int timeMinutes = 1 + ceil(size(inputBams, "G") * 8)
String dockerImage = "quay.io/biocontainers/picard:2.20.5--0"
...
...
@@ -488,7 +494,7 @@ task MarkDuplicates {
command {
set -e
mkdir -p "$(dirname ~{outputBamPath})"
picard -Xmx~{javaXmx
}
-XX:ParallelGCThreads=1 \
picard -Xmx~{javaXmx
Mb}M
-XX:ParallelGCThreads=1 \
MarkDuplicates \
INPUT=~{sep=' INPUT=' inputBams} \
OUTPUT=~{outputBamPath} \
...
...
@@ -500,7 +506,9 @@ task MarkDuplicates {
CLEAR_DT="false" \
CREATE_INDEX=true \
ADD_PG_TAG_TO_READS=false \
CREATE_MD5_FILE=~{true="true" false="false" createMd5File}
CREATE_MD5_FILE=~{true="true" false="false" createMd5File} \
USE_JDK_INFLATER=~{true="true" false="false" useJdkInflater} \
USE_JDK_DEFLATER=~{true="true" false="false" useJdkDeflater}
}
output {
...
...
@@ -513,7 +521,7 @@ task MarkDuplicates {
runtime {
docker: dockerImage
time_minutes: timeMinutes
memory: memory
memory:
"~{
memory
Mb}M"
}
parameter_meta {
...
...
@@ -523,8 +531,8 @@ task MarkDuplicates {
metricsPath: {description: "The location where the output metrics file should be written.", category: "required"}
read_name_regex: {description: "Equivalent to the `READ_NAME_REGEX` option of MarkDuplicates.", category: "advanced"}
memory: {description: "The amount of memory this job will use.", category: "advanced"}
javaXmx: {description: "The maximum memory available to the program. Should be lower than `memory` to accommodate JVM overhead.",
memory
Mb
: {description: "The amount of memory this job will use
in megabytes
.", category: "advanced"}
javaXmx
Mb
: {description: "The maximum memory available to the program
in megabytes
. Should be lower than `memory
Mb
` to accommodate JVM overhead.",
category: "advanced"}
timeMinutes: {description: "The maximum amount of time the job will run in minutes.", category: "advanced"}
dockerImage: {description: "The docker image used for this task. Changing this may result in errors which the developers may choose not to address.",
...
...
This diff is collapsed.
Click to expand it.
Preview
0%
Loading
Try again
or
attach a new file
.
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Save comment
Cancel
Please
register
or
sign in
to comment