feat: add optional FINE_TUNE_MODELS step for DL model fine-tuning #54
When `--enable_fine_tuning` is enabled, the pipeline:

1. Runs the standard flow through ASSEMBLE_EMPIRICAL_LIBRARY
2. Fine-tunes the RT/IM (optionally fragment) models on the empirical library
3. Re-generates the in-silico library using the tuned models (`--tokens`, `--rt-model`, `--im-model`)
4. Uses the tuned library for INDIVIDUAL_ANALYSIS and FINAL_QUANTIFICATION

New parameters:

- `--enable_fine_tuning` (default: false): enable the fine-tuning step
- `--tune_fr` (default: false): also fine-tune the fragmentation model
- `--tune_lr` (default: DIA-NN's 0.0005): fine-tuning learning rate

New module: `modules/local/diann/fine_tune_models/`

- Takes the empirical library + FASTA + diann_config
- Runs `diann --tune-lib --tune-rt --tune-im`
- Outputs: `dict.txt`, `rt.d0.pt`, `im.d0.pt` (optionally `fr.d0.pt`)

INSILICO_LIBRARY_GENERATION is updated with optional tuned-model inputs (pass `[]` when not fine-tuning, the actual files when fine-tuning).

Version guard: requires DIA-NN >= 2.0. Cannot be combined with `--skip_preliminary_analysis`.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
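A rough sketch of what the new module could look like, based only on the flags and outputs listed above. This is not the PR's actual code: the input/output declarations are assumptions, and the learning-rate and fragment-model flags are omitted because their exact DIA-NN spellings are not shown here.

```nextflow
// Hypothetical sketch of modules/local/diann/fine_tune_models/main.nf.
// Input/output wiring is assumed; only the flags quoted in the PR are used.
process FINE_TUNE_MODELS {
    input:
    path empirical_library
    path fasta
    path diann_config

    output:
    path "dict.txt", emit: dict
    path "rt.d0.pt", emit: rt_model
    path "im.d0.pt", emit: im_model
    path "fr.d0.pt", emit: fr_model, optional: true

    script:
    """
    diann --cfg ${diann_config} \\
          --lib ${empirical_library} \\
          --fasta ${fasta} \\
          --tune-lib --tune-rt --tune-im
    """
}
```

The tuned `rt.d0.pt` / `im.d0.pt` outputs would then be passed to the library-generation step via `--rt-model` / `--im-model`.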
Per Vadim Demichev's recommendation, fine-tuning should run as a separate phase BEFORE the main analysis, not mid-pipeline.

Phase 0 (when `--enable_fine_tuning`):

1. INSILICO_LIBRARY_GENERATION (default models)
2. TUNE_PRELIMINARY_ANALYSIS (on the `--tune_n_files` subset)
3. TUNE_ASSEMBLE_LIBRARY (empirical library from the subset)
4. FINE_TUNE_MODELS (train RT/IM/fragment models)
5. TUNED_LIBRARY_GENERATION (re-generate with tuned models)

Phase 1 (always, the standard pipeline) uses the tuned in-silico library (or the default one if no fine-tuning): PRELIMINARY_ANALYSIS → ASSEMBLE → INDIVIDUAL → FINAL

New parameter: `--tune_n_files` (default: 3): number of files for the tuning search subset.

Uses Nextflow process aliases (TUNE_PRELIMINARY_ANALYSIS, TUNE_ASSEMBLE_LIBRARY, TUNED_LIBRARY_GENERATION) to avoid duplicate process invocation conflicts.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
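The alias mechanism mentioned above can be sketched roughly as follows. Module paths, channel names, and process signatures are assumptions for illustration, not the PR's actual wiring; only the alias names come from the description.

```nextflow
// Sketch of the Phase 0 wiring using Nextflow process aliases, so the same
// process definitions can be invoked again in Phase 1 without name conflicts.
// All include paths and channel shapes below are hypothetical.
include { PRELIMINARY_ANALYSIS as TUNE_PRELIMINARY_ANALYSIS } from '../modules/local/diann/preliminary_analysis'
include { ASSEMBLE_EMPIRICAL_LIBRARY as TUNE_ASSEMBLE_LIBRARY } from '../modules/local/diann/assemble_empirical_library'
include { INSILICO_LIBRARY_GENERATION as TUNED_LIBRARY_GENERATION } from '../modules/local/diann/insilico_library_generation'
include { FINE_TUNE_MODELS } from '../modules/local/diann/fine_tune_models'

workflow PHASE_0_FINE_TUNING {
    take:
    ms_files        // channel of raw MS files
    insilico_lib    // library generated with the default models

    main:
    // Restrict the tuning search to the first --tune_n_files files
    tune_subset = ms_files.take(params.tune_n_files)

    TUNE_PRELIMINARY_ANALYSIS(tune_subset, insilico_lib)
    TUNE_ASSEMBLE_LIBRARY(TUNE_PRELIMINARY_ANALYSIS.out.collect())
    FINE_TUNE_MODELS(TUNE_ASSEMBLE_LIBRARY.out)
    TUNED_LIBRARY_GENERATION(FINE_TUNE_MODELS.out.rt_model, FINE_TUNE_MODELS.out.im_model)

    emit:
    library = TUNED_LIBRARY_GENERATION.out
}
```

Aliasing is the standard Nextflow way to call one process definition from two places; without it, invoking PRELIMINARY_ANALYSIS in both Phase 0 and Phase 1 would raise a duplicate-invocation error.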
Summary
Adds an optional model fine-tuning step to the pipeline, eliminating the need for two separate pipeline runs when working with non-standard modifications.
How it works
When `--enable_fine_tuning` is enabled:

- FINE_TUNE_MODELS trains RT/IM models on the empirical library (`--tune-lib --tune-rt --tune-im`)
- TUNED_LIBRARY_GENERATION regenerates the in-silico library with the tuned models (`--tokens`, `--rt-model`, `--im-model`)

New parameters
| Parameter | Default |
| --- | --- |
| `--enable_fine_tuning` | `false` |
| `--tune_fr` | `false` |
| `--tune_lr` | `null` |

When to use
Fine-tuning is beneficial for non-standard modifications that the default deep-learning models were not trained on. Standard modifications (Phospho, Oxidation, Acetylation, etc.) do not need fine-tuning.
Version requirement
Requires DIA-NN >= 2.0. Cannot be combined with `--skip_preliminary_analysis`.

Test plan
- `--enable_fine_tuning false` (default): verify no FINE_TUNE step appears
- `--enable_fine_tuning true` on DIA-NN 2.x: verify tuning + library regeneration
- `--enable_fine_tuning true` on DIA-NN 1.8.1: verify the version guard error
- `--enable_fine_tuning true --skip_preliminary_analysis true`: verify the conflict error

🤖 Generated with Claude Code
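The version guard and parameter conflict exercised by the test plan above could be implemented roughly like this. This is a plausible sketch, not the PR's actual validation code; in particular, how `diann_version` is detected is assumed to happen elsewhere.

```nextflow
// Hypothetical parameter validation for the fine-tuning step.
// diann_version is assumed to be parsed from the DIA-NN binary elsewhere.
def validateFineTuningParams(Map params, String diann_version) {
    if (!params.enable_fine_tuning) return
    def major = diann_version.tokenize('.')[0].toInteger()
    if (major < 2) {
        error "--enable_fine_tuning requires DIA-NN >= 2.0, found ${diann_version}"
    }
    if (params.skip_preliminary_analysis) {
        error "--enable_fine_tuning cannot be combined with --skip_preliminary_analysis"
    }
}
```

With this shape, the DIA-NN 1.8.1 and `--skip_preliminary_analysis` test cases would each fail fast with a clear error before any process runs.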