Flow Map流程地图
RNA-seq flow family
RNA-seq 流程家族
The TAFFISH RNA-seq suite is a family of small, versioned, auditable flow apps. The reference route starts from a genome and annotation; the de novo route starts from reads and builds the transcript feature space first. Both routes keep command provenance, logs, output contracts, and static reports explicit.
TAFFISH RNA-seq 套件由一组小而清楚、可版本化、可审计的流程应用组成。有参路线从基因组和注释开始;无参路线从 reads 开始,先构建转录本特征空间。两条路线都显式保留命令溯源、日志、输出契约和静态报告。
How the flows connect
这些流程如何连接
Reference mode remains the default for projects with a reliable genome and annotation. The optional alignment lane adds genome-aware evidence. De novo mode is an explicit no-reference route that replaces the reference index and expression segment with assembly, de novo quantification, and annotation.
有参模式仍是拥有可靠基因组和注释项目的默认路线。可选比对分支补充基因组位置证据。无参模式是显式 no-reference 路线,用组装、无参定量和注释替代有参 index 与 expression 段。
rnaseq-standard-flow
one command that orchestrates reference or de novo mode
根据参数编排有参或无参路线的一站式入口
Reference expression route
有参表达路线
FASTQ plus genome and annotation to expression, DE, enrichment, and report
从 FASTQ、基因组和注释到表达、差异、富集和报告
Optional alignment and count evidence route
可选比对和计数证据路线
enabled by --route both; DE can switch to featureCounts with --de-source featurecounts
通过 --route both 启用;差异分析可用 --de-source featurecounts 切换到 featureCounts
Explicit de novo route
显式无参路线
enabled by --mode denovo; no genome or annotation is required, but annotation resources must be supplied for functional interpretation
通过 --mode denovo 启用;不需要基因组或注释,但功能解释需要用户提供注释资源
The standard flow is an umbrella, not a black box
standard-flow 是总入口,不是黑盒
rnaseq-standard-flow is designed for users who want to start from local FASTQ files and receive a coherent analysis directory plus a bilingual HTML report. In reference mode, the inputs are FASTQ, genome FASTA, annotation, metadata, and optional gene sets. In de novo mode, the inputs are FASTQ, metadata, a protein database for homology annotation, and an optional protein-to-GO map for enrichment.
rnaseq-standard-flow 面向希望从本地 FASTQ 开始,并得到一个完整分析目录和双语 HTML 报告的用户。有参模式输入 FASTQ、基因组 FASTA、注释、元数据和可选基因集;无参模式输入 FASTQ、元数据、用于同源注释的蛋白数据库,以及用于富集分析的可选蛋白到 GO 映射表。
The default behavior remains reference Salmon-first. De novo mode is never entered silently just because a genome was omitted; users must select --mode denovo. This protects routine reference analyses from accidental mode switches and makes no-reference interpretation boundaries visible in the final report.
默认行为仍然是有参 Salmon-first。流程不会因为用户漏传 genome 就悄悄切到无参;必须显式选择 --mode denovo。这样可以避免常规有参分析被意外切换,也能让无参解释边界在最终报告中清楚呈现。
How to choose a route
如何选择路线
Use reference mode when
这些情况使用有参模式
- the organism has a reliable genome and annotation;
- 研究物种有可靠基因组和注释;
- gene-level expression, DE, and enrichment are the primary deliverables;
- 主要交付物是基因层面表达、差异分析和富集;
- speed, interpretability, and compatibility with existing databases matter.
- 速度、可解释性以及与现有数据库兼容性更重要。
Use de novo mode when
这些情况使用无参模式
- there is no trusted reference genome or annotation;
- 没有可信参考基因组或注释;
- the project can tolerate transcript-level feature interpretation;
- 项目可以接受转录本层面特征解释;
- users can provide offline protein and GO resources for annotation.
- 用户能够提供离线蛋白数据库和 GO 映射资源。
Use --route both only in reference mode. De novo mode does not create genome-aligned BAM, featureCounts matrices, or reference-based BAM QC.
只有有参模式才使用 --route both。无参模式不会生成基因组比对 BAM、featureCounts 矩阵或基于参考的 BAM 质控。