featureCounts Benchmark

RustQC includes built-in gene-level read counting that produces output compatible with the Subread featureCounts format. This page compares the counting performance and output accuracy against standalone Rsubread featureCounts.

Performance

Large benchmark input: GM12878 REP1 — a 10 GB paired-end RNA-seq BAM aligned to GRCh38 (63,086 genes).

Tool	Runtime
Rsubread featureCounts	3m 39s
RustQC (all tools, single pass)	3m 56s

RustQC’s single pass includes featureCounts-compatible counting alongside dupRadar duplication analysis and all 7 RSeQC tools. The traditional workflow requires running each tool separately. Standalone featureCounts timing is from Docker with x86 emulation on ARM Mac.

Output equivalence

RustQC’s read counting uses the same algorithm as Subread featureCounts:

Feature type: exon-level features grouped by gene_id
Overlap detection: at least 1 base pair overlap
Strand awareness: configurable via the -s / --strandedness flag
Multi-mapping: tracked separately for unique and multi-mapper columns

Count comparison (large benchmark)

Metric	Rsubread featureCounts	RustQC	Exact match
allCounts (unique)	14,654,579	14,654,579	100%
filteredCounts (unique)	3,599,832	3,599,832	100%
allCountsMulti	16,089,488	16,089,488	100%
filteredCountsMulti	4,503,920	4,503,920	100%

Gene-level read counts are identical across all 63,086 genes. Assignment statistics (Assigned, Unassigned_NoFeatures, Unassigned_Ambiguous) match exactly.

The output format is directly compatible with downstream tools such as DESeq2 and MultiQC.

Additional outputs

Beyond the standard featureCounts counts file and summary, RustQC also produces:

Biotype counts (.biotype_counts.tsv) — per-biotype read count summaries
Biotype MultiQC bargraph (.biotype_counts_mqc.tsv) — ready for MultiQC visualization
rRNA percentage (.biotype_counts_rrna_mqc.tsv) — rRNA fraction for MultiQC general statistics

Generating these in the traditional workflow requires additional scripting after the featureCounts run.