Skip to content

Bcl2fastq2

The Illumina bcl2fastq2 conversion software demultiplexes sequencing data and converts base call (BCL) files into FASTQ files. For every cycle of a sequencing run, the Real-Time Analysis (RTA) software generates a BCL file containing base calls and associated quality scores (Q-scores).

Bcl2fastq2 is available as a module on Apocrita.

Usage

To run the default installed version of Bcl2fastq2, simply load the bcl2fastq2 module:

$ module load bcl2fastq2
$ bcl2fastq -h

Usage:
      bcl2fastq [options]

For usage documentation, run bcl2fastq2 -h.

Example job

Serial job

Threading options

By default, Bcl2fastq2 will run on as many threads as installed CPUs during conversion/demultiplexing, plus additional threads for reading/writing data. Therefore, to prevent overloading a compute node, you must set the number of processing threads to the number of cores requested, or simply use the NSLOTS variable, as shown in the example job below.

The file i/o threads (loading/writing) are typically inactive and consume minimal processing time. By default, each will use 4 threads but we recommend using 1 for reading and 1 for writing unless you need to perform a lot of i/o. If your job is i/o intensive, increasing the loading/writing threads will improve the overall job performance, especially if paired with using your scratch space.

Here is an example job running on 4 cores and 8GB of memory:

#!/bin/bash
#$ -cwd
#$ -j y
#$ -pe smp 4
#$ -l h_rt=1:0:0
#$ -l h_vmem=2G

module load bcl2fastq2

bcl2fastq --runfolder-dir <runfolder_dir> \
          --output-dir <output_dir> \
          --processing-threads ${NSLOTS} \
          --loading-threads 1 \
          --writing-threads 1

References