From b853933e731c468cee7265ae7cf311d034acc3ab Mon Sep 17 00:00:00 2001 From: Sander Bollen <a.h.b.bollen@lumc.nl> Date: Wed, 29 Nov 2017 12:15:09 +0100 Subject: [PATCH] features in readme --- README.md | 19 +++++++++++++++++++ 1 file changed, 19 insertions(+) diff --git a/README.md b/README.md index 174d289..66418b3 100644 --- a/README.md +++ b/README.md @@ -1,5 +1,24 @@ # hutspot +This is a multisample DNA variant calling pipeline based on Snakemake, bwa and the +GATK HaplotypeCaller. + +## Features +* Any number of samples is supported +* Whole-genome calling, regardless of sample protocol used. +* Follows modern best practices + * Each sample is individually called as as a GVCF. + * A multisample VCF is then produced by genotyping the collection of GVCFs. +* Data parallelization for calling and genotyping steps. + * Using ~100 chunks, we call an entire exome in ~15 minutes! +* Reasonably fast. + * 96 exomes in < 24 hours. +* No unnecessary jobs +* Coverage metrics for any number of bed files. +* Separate conda environments for **every** step. No more dependency hell! +Every job can potentially use different versions of the same package. +* Optionally sub-sample inputs when number of bases exceeds a user-defined +threshold. # graph -- GitLab