TAFFISH phylogeny report

Phylogeny Report系统发育分析报告

Generated by phylogeny-flow 0.1.0-r1 at 2026-05-28T17:32:53Z UTC. This static report follows the analysis path from homologous input sequences through alignment, trimming, tree inference, static visualization, methods and provenance.本报告由 phylogeny-flow 0.1.0-r1 于 2026-05-28T17:32:53Z UTC 生成,按同源序列输入、多序列比对、比对修剪、系统发育树推断、静态可视化、方法记录和溯源信息组织结果。

Offline static HTML离线静态报告Language switchable中英切换Command-level provenance命令级溯源
Project view项目视图

Executive Summary项目总览

These metrics summarize the computational route and give a first-pass view before biological interpretation. They do not by themselves validate homology, orthology, or species-tree conclusions.这些指标概括本次计算路线,用于在正式生物学解释前快速了解分析状态。它们本身不自动证明同源性、直系同源关系或物种树结论。

Sequences序列数24
Sequence type序列类型protein
Tree engine建树工具iqtree
Model模型LG
Raw alignment length原始比对长度116
Trimmed length修剪后长度104
Variable sites变异位点71
Plot layouts树图布局2
Biological logic生物学逻辑

Workflow and Biological Questions分析流程与生物学问题

The workflow is organized as a chain of biological evidence transformations rather than a file dump. Each step answers a different question about the same sequence set.报告按生物学证据转换链组织,而不是简单堆叠文件。每一步都围绕同一组序列回答不同问题。

Homologous input同源序列输入Are these sequences comparable?这些序列是否可比较?
->
ID normalizationID 标准化Can labels survive Newick?标签能否安全进入 Newick?
->
MSA多序列比对mafft
->
Trimming比对修剪trimal
->
Tree inference系统发育推断iqtree
->
Visualization可视化rectangular / circular矩形 / 环形
Tree views树图视图

Tree Layout Gallery树形布局图集

The same Newick tree is rendered in two complementary default layouts: a readable rectangular tree and a compact circular overview. The top-level 03_results/tree/plots/tree.* files keep the primary layout selected by --plot-layout; if fan is selected as the primary layout, that extra view is also retained.同一棵 Newick 树默认渲染成两种互补布局:便于阅读的矩形树图,以及适合总览的环形树图。顶层 03_results/tree/plots/tree.* 保留 --plot-layout 指定的主图;如果显式选择 fan 作为主布局,则额外保留扇形图。

Rectangular tree / 矩形树图
Rectangular tree矩形树图SVGPNGPDF

Best for reading branch lengths and tip labels in a methods/report setting.适合阅读分支长度和末端标签,是方法报告中最稳妥的主图。

Circular tree / 环形树图
Circular tree环形树图SVGPNGPDF

Useful when many tips need a compact overview and label direction is less central.适合在末端较多时提供紧凑总览,但不一定最适合逐个阅读标签。

Reading layer阅读层

How to Read This Report如何阅读本报告

A phylogenetic report should be read as evidence plus assumptions. Start from input identity and alignment quality, then review trimming, model choice, support values, and only then discuss biological interpretation.系统发育报告应被理解为“证据 + 假设”。建议先确认输入序列身份和比对质量,再查看修剪、模型选择、支持率,最后再讨论生物学解释。

1. Check sequence meaning1. 先确认序列含义

A tree is only meaningful if the input sequences are homologous and represent the intended gene, protein, marker, or locus.只有输入序列同源并代表预期基因、蛋白、marker 或位点时,树才有解释意义。

2. Inspect the alignment2. 查看比对

MSA columns define which residues are treated as comparable evidence. Poor alignment can dominate topology.多序列比对的列决定哪些残基被当作可比较证据;差的比对会主导拓扑。

3. Treat trimming as a decision3. 把修剪当作决策

Trimming can remove noisy regions, but aggressive trimming can also remove real biological signal.修剪可以去掉噪声区域,但过强修剪也可能去掉真实生物学信号。

4. Separate gene tree and species tree4. 区分基因树和物种树

This flow infers a gene/protein tree; species history needs orthology, sampling, and often multi-gene evidence.本流程推断的是基因/蛋白树;物种历史需要直系同源判断、采样设计,通常还需要多基因证据。

Interpretation boundary解读边界
Reproducibility makes the analysis auditable, but it does not turn every topology into biological truth. Homolog selection, paralogy, recombination, taxon sampling, alignment uncertainty and model choice remain biological decisions.可复现性让分析可审计,但不会把每一种拓扑自动变成生物学真相。同源序列选择、旁系同源、重组、物种采样、比对不确定性和模型选择仍然是生物学决策。
Biology and bioinformatics primer生物学与生信背景

What Each Step Means每一步意味着什么

Homology同源性

Phylogeny starts from a claim that sequences share ancestry. Database hits, marker identity, orthogroups, or curated protein families are common ways to justify that claim.系统发育分析始于“这些序列共享祖先”的判断。数据库命中、marker 身份、orthogroup 或人工整理蛋白家族都可以为这一判断提供依据。

Multiple sequence alignment多序列比对

The alignment converts sequences of different histories and lengths into site-wise comparable evidence. Insertions, deletions and divergent regions are where many errors enter.比对把不同长度和演化历史的序列转换成逐位点可比较证据;插入、缺失和高度分化区域往往是误差来源。

Tree inference建树推断

IQ-TREE performs model-based maximum-likelihood inference, while FastTree provides a faster approximate route. Speed, model richness and support interpretation differ.IQ-TREE 进行基于模型的最大似然推断,FastTree 提供更快的近似路线。二者在速度、模型丰富度和支持率解释上不同。

Support values支持率

Support values describe stability under a method and dataset, not absolute truth. Low support suggests fragile splits; high support still depends on input correctness.支持率描述某种方法和数据集下的稳定性,不是绝对真理。低支持提示分支脆弱;高支持也仍依赖输入是否正确。

Branch lengths分支长度

Branch lengths usually reflect inferred substitutions per site. They are not calendar time unless a time-calibrated model is explicitly used.分支长度通常表示每个位点的推断替换数;除非显式使用时间校准模型,否则它不是日历时间。

Visualization可视化

Rectangular and circular layouts are complementary views of the same tree. A prettier layout can help communication, but the canonical result remains the Newick tree.矩形和环形布局是同一棵树的互补视图。更漂亮的布局有助于沟通,但规范结果仍是 Newick 树。

Machine-readable evidence机器可读证据

Evidence Tables证据表格

The tables below are the audit layer behind the report. They make the HTML readable by humans while keeping the run inspectable by scripts.下面这些表格是报告背后的审计层。HTML 便于人阅读,TSV 表格则让脚本也能检查结果。

Flow summary流程摘要

metricvalue
flow_version0.1.0-r1
input_moderaw_fasta
sequence_count24
seq_typeprotein
alignermafft
trimmertrimal
tree_engineiqtree
final_tree03_results/tree/tree.nwk
tree_plot_statusgenerated
tree_plot_dir03_results/tree/plots
tree_plot_layoutrectangular
tree_plot_gallery_layoutsrectangular,circular
tree_plot_formatspdf,png,svg
tree_plot_stylejournal
tree_plot_label_alignnone
tree_plot_label_transformpretty
tree_plot_scale_barauto
trimmed_alignment03_results/alignment/trimmed.fa

Input summary输入摘要

metricvalue
input_moderaw_fasta
sequence_count24
seq_typeprotein
min_length104
median_length105
max_length112

Alignment statistics比对统计

metricvalue
sequence_count24
alignment_length116
non_gap_columns116
gap_fraction0.085848
variable_sites71

Trimming summary修剪摘要

metricvalue
trimmertrimal
modeautomated1
input_alignment_length116
trimmed_alignment_length104
columns_removed12

Model summary模型摘要

metricvalue
tree_engineiqtree
modelLG
model_sourceLG

Support summary支持率摘要

metricvalue
tree_engineiqtree
bootstrap_typeUFBoot
bootstrap_replicates0
alrt_replicates0

Plot gallery index树图索引

layoutsvgpngpdfsummaryannotations
rectangular03_results/tree/plots/rectangular/tree.svg03_results/tree/plots/rectangular/tree.png03_results/tree/plots/rectangular/tree.pdf03_results/tree/plots/rectangular/tree_plot_summary.tsv03_results/tree/plots/rectangular/tree_tip_annotations.tsv
circular03_results/tree/plots/circular/tree.svg03_results/tree/plots/circular/tree.png03_results/tree/plots/circular/tree.pdf03_results/tree/plots/circular/tree_plot_summary.tsv03_results/tree/plots/circular/tree_tip_annotations.tsv

Primary plot summary主图摘要

metricvalue
tree_filecytc-out/03_results/tree/tree.nwk
tip_count24
internal_node_count22
metadata_fileNA
metadata_id_columnNA
metadata_rows0
metadata_matched_tips0
metadata_unmatched_tipshuman_P99999_Homo,chimpanzee_P99998_Pan,rhesus_macaque_P00002_Macaca,horse_P00004_Equus,bovine_P62894_Bos,pig_P62895_Sus,rabbit_P00008_Oryctolagus,mouse_P62897_Mus,rat_P62898_Rattus,chicken_P67881_Gallus,king_penguin_P00017_Aptenodytes,mallard_P00020_Anas,alligator_P81280_Alligator,snapping_turtle_P00022_Chelydra,bullfrog_P00024_Aquarana,zebrafish_Q6IQM2_Danio,common_carp_P00026_Cyprinus,kangaroo_P00014_Macropus,honeybee_P00038_Apis,c_elegans_P19974_Caenorhabditis,budding_yeast_P00044_Saccharomyces,arabidopsis_P29380_Arabidopsis,rice_Q0DI31_Oryza,gorilla_Q6WUX8_Gorilla
label_map_filecytc-out/00_inputs/sequence_id_map.tsv
label_map_fromsafe_id
label_map_tooriginal_id
label_map_rows24
layoutrectangular
stylejournal
label_alignnone
label_transformpretty
scale_barauto
scale_bar_width0.1
scale_bar_label0.1
titlephylogeny-flow tree - rectangular
formatspdf,png,svg
plot_width10.36
plot_height7.26
output_filestree.pdf,tree.png,tree.svg

Aligned variable sites: 71. Alignment gap fraction: 0.085848.比对中的变异位点:71。gap 比例:0.085848。

Files to reuse可复用文件

Deliverables and Output Structure交付文件与输出结构

The output directory is designed to be copied or archived as a complete result package. The HTML is the entry point, while TSV, FASTA, Newick and log files remain reusable.输出目录可以作为完整结果包复制或归档。HTML 是入口,TSV、FASTA、Newick 和日志文件仍可被复用。

03_results/tree/tree.nwk

Canonical final Newick tree.规范最终 Newick 树。

03_results/tree/plots/

Primary plot plus rectangular and circular layout subdirectories; optional fan output appears only when requested as the primary layout.主图以及矩形、环形布局子目录;扇形图只在显式设为主布局时出现。

03_results/alignment/

Aligned and trimmed FASTA plus alignment statistics.比对、修剪后的 FASTA 以及比对统计。

04_reports/

Bilingual HTML report, methods, commands, versions and flow summary.双语 HTML 报告、方法、命令、版本和流程摘要。

Final Newick tree最终 Newick 树
(human_P99999_Homo:0.0000000000,(chimpanzee_P99998_Pan:0.0000010000,(rhesus_macaque_P00002_Macaca:0.0000010000,((((horse_P00004_Equus:0.0322821101,(bovine_P62894_Bos:0.0000010000,pig_P62895_Sus:0.0000010000):0.0000010000):0.0204671439,((rabbit_P00008_Oryctolagus:0.0202661987,(mouse_P62897_Mus:0.0000010000,rat_P62898_Rattus:0.0000010000):0.0000010000):0.0000010000,((((chicken_P67881_Gallus:0.0100558661,king_penguin_P00017_Aptenodytes:0.0100063188):0.0099341418,mallard_P00020_Anas:0.0100988562):0.0133800352,(alligator_P81280_Alligator:0.0299760376,snapping_turtle_P00022_Chelydra:0.0124674457):0.0371726893):0.0213745361,(bullfrog_P00024_Aquarana:0.0516719492,(zebrafish_Q6IQM2_Danio:0.0000010000,common_carp_P00026_Cyprinus:0.0305873732):0.0554760330):0.0215599525):0.0249352129):0.0101557716):0.0000010000,kangaroo_P00014_Macropus:0.0413654074):0.0197299328,((honeybee_P00038_Apis:0.0517514743,c_elegans_P19974_Caenorhabditis:0.4538441075):0.0943831015,((budding_yeast_P00044_Saccharomyces:0.2724551317,arabidopsis_P29380_Arabidopsis:0.2443780200):0.1432699350,rice_Q0DI31_Oryza:0.3161014928):0.0892957878):0.1083095371):0.0552937499):0.0101050190):0.0000010000,gorilla_Q6WUX8_Gorilla:0.0000000000);
Audit trail审计线索

Provenance and Reuse溯源与复用

Commands, versions, methods and manifest files are preserved so the run can be reviewed, cited, or reproduced later.命令、版本、方法和 manifest 会被保留,便于后续审计、引用或复现。

Versions版本

toolversionsource
phylogeny-flow0.1.0-r1taffish flow
taf-seqkit2.13.0-r2taffish dependency
taf-mafft7.525-r3taffish dependency
taf-trimal1.5.1-r1taffish dependency
taf-iqtree3.1.2-r1taffish dependency
taf-fasttree2.2.0-r1taffish dependency
taf-clipkit2.12.0-r1taffish dependency
taf-muscle5.3-r4taffish dependency
taf-clustal-omega1.2.4-r1taffish dependency
taf-bioconductor-bioviz3.23-r2taffish dependency
Final boundary最终边界
This report preserves a reproducible command-level tree analysis. It does not search homologs, determine orthology, detect recombination, time-calibrate divergence, or replace expert review of the biological system.本报告保留一条命令级可复现的建树分析。它不负责搜索同源序列、判定直系同源、检测重组、做分歧时间校准,也不替代对具体生物系统的专业审阅。