Skip to content
Snippets Groups Projects
  1. Aug 24, 2020
  2. Aug 21, 2020
  3. Aug 20, 2020
  4. Aug 12, 2020
    • van den Berg's avatar
      Remove directory output from fastqc · 89da42b6
      van den Berg authored
      The directory output for the fastqc tasks is causing issues on the
      shared file system of the cluster, since it cannot properly determine
      the age of the folder. As a result, it re-runs the fastqc tasks every
      time a workflow is restarted, regardless of whether the task has already
      completed.
      
      To prevent this, a single dummy output file '.done' has been added to
      the fastqc tasks which will be written when fastqc exits successfully.
      89da42b6
    • van den Berg's avatar
      e76c3b7b
  5. Aug 07, 2020
    • van den Berg's avatar
      Add an option to restrict BaseRecalibration · e33b4498
      van den Berg authored
      The base recalibration (BQSR) step of the pipeline can take up to 7 hours for
      WGS samples, which is a significant part of the total run time.
      
      The developers of GATK state that BQSR requires at least 100M bases per read
      group: "We usually expect to see more than 100M bases per read group; as a rule
      of thumb, larger numbers will work better."
      
      A human WGS sample with an average read depth of 43x has almost 1300 times that
      amount of bases. The analysis of these samples would be sped up greatly by
      restricting BQSR to a single chromosome.
      e33b4498
    • van den Berg's avatar
      Add an option to restrict BaseRecalibration · 095305f0
      van den Berg authored
      The base recalibration step of the pipeline can take up to 7 hours for
      WGS samples, which is a significant part of the total run time.
      
      The developers of GATK state that BQSR requires at least 100M bases per
      read group: "We usually expect to see more than 100M bases per read
      group; as a rule of thumb, larger numbers will work better."
      
      A human WGS sample with an average read depth of 43x has almost 1300
      times that amount of bases. The analysis of these samples would be sped
      up greatly by restricting BQSR to a single chromosome.
      095305f0
    • van den Berg's avatar
      Speed up cutadapt · b585ff80
      van den Berg authored
      Use 8 cores instead of just 1, according to the documentation speed
      should almost scale linear with the cores provided.
      
      Also reduce the compression level on the output file, since most time
      for cutadapt is spent re-compressing the data after trimming the
      adapters.
      b585ff80
  6. Jul 29, 2020
  7. Jul 28, 2020
  8. Jul 24, 2020
Loading