Subsample fastq files when over user-specified limit
Sometimes supplied fast files are too large for proper analysis (e.g. a single individual on an entire flowcell). To remedy this, we subsample (using seqtk) before we align.
-
Add config value to set max amount of reads per sample -
None
is no subsampling
-
-
add rule to count fastq length -
Based on this rule, and rule that either subsamples or simply creates a link to the raw fastq file.