Migration guide for workflow outputs #6162

bentsherman · 2025-06-05T00:42:41Z

No description provided.

Signed-off-by: Ben Sherman <[email protected]>

netlify · 2025-06-05T00:42:45Z

✅ Deploy Preview for nextflow-docs-staging ready!

| Name | Link |
|---|---|
| 🔨 Latest commit | `107409c` |
| 🔍 Latest deploy log | https://app.netlify.com/projects/nextflow-docs-staging/deploys/685ec035b95c600008f0020c |
| 😎 Deploy Preview | https://deploy-preview-6162--nextflow-docs-staging.netlify.app |


christopher-hakkaart

I've suggested headings to the bullet points as I think it's easier to scan.

Before I make suggestions on the example migration: did you want it to be a follow-along, or more of a demonstration with explanations?

This will impact some of the language:

"Declare an output for each channel..." vs "An output channel for each must be declared..."

docs/guides/workflow-outputs.md

bentsherman · 2025-06-05T21:47:59Z

Thanks Chris, all great improvements.

It is mostly a follow-along, with some explanation sprinkled throughout. I need to guide the reader through a specific example while also pausing at times to make more general points. The current version is my best attempt to balance these things.

christopher-hakkaart · 2025-06-05T21:59:56Z

Great. The steps you have included are already great, so I'll make suggestions that tweak the language to make it consistent as a follow-along.

bentsherman · 2025-06-16T16:03:11Z

From our discussion last week, we will follow up this PR with a separate tutorial about the rnaseq-nf pipeline to give more context.

Also, this guide is really more of a tutorial, but we already renamed the section from "Tutorials" to "Guides". Once we move to the Seqera docs, we should have explicit sections for both.

I also wouldn't be opposed to having both a "Guides" and a "Tutorials" section in the current docs. I just didn't want to keep renaming the current section back and forth.

christopher-hakkaart · 2025-06-22T21:50:27Z

I agree with your points. Catching up on some backlog today and will review the current PR in the next day or so.

christopher-hakkaart

Sorry it took me so long to get to this.

I've added suggestions to make this an example of a migration, written in the past tense: "this is what was done to change this pipeline".

Some sections were multi-step, i.e., they showed something and then showed a better way of doing it, and then they switched to "you could do it this way, but it was done this way for these reasons."

Let me know what you think. Happy to keep iterating if you have suggestions.

Some of the suggestions got a little messy, so if you're happy and accept, I'll give it another check to make sure everything is consistent.

docs/guides/workflow-outputs.md

christopher-hakkaart · 2025-06-26T22:17:37Z

docs/guides/workflow-outputs.md

+
+### Replacing `publishDir` with workflow outputs
+
+We'll start by removing each `publishDir` directive and publishing the corresponding process output channel in the entry workflow.


Suggested change

We'll start by removing each `publishDir` directive and publishing the corresponding process output channel in the entry workflow.

The `publishDir` directive is not required when you publish process outputs in the entry workflow. Instead, outputs are emitted in the entry workflow.

christopher-hakkaart · 2025-06-26T22:26:59Z

docs/guides/workflow-outputs.md

+
+We'll start by removing each `publishDir` directive and publishing the corresponding process output channel in the entry workflow.
+
+First, emit the `QUANT` and `FASTQC` outputs separately in the `RNASEQ` workflow:


Suggested change

First, emit the `QUANT` and `FASTQC` outputs separately in the `RNASEQ` workflow:

For example, the `QUANT` and `FASTQC` outputs are emitted in the `RNASEQ` workflow:

christopher-hakkaart · 2025-06-26T23:52:41Z

docs/guides/workflow-outputs.md

+}
+```
+
+We use maps instead of tuples so that we can access fields by name, and so that the index file can use the map keys as column names.


Suggested change

We use maps instead of tuples so that we can access fields by name, and so that the index file can use the map keys as column names.

Maps were used instead of tuples so that fields are accessible by name and the index file can use the map keys as column names.
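
To make the point about map keys becoming column names concrete, here is a minimal Python sketch. It is not Nextflow code and does not reproduce how Nextflow writes index files; it only models the idea that a keyed record carries its own column names, whereas a tuple would need them supplied separately. The key names `id`, `fastqc`, and `quant` and the helper `write_index` are assumptions for illustration; the row values are taken from the index file shown in the guide.

```python
import csv
import io

# Hypothetical per-sample records. The key names are assumptions;
# the paths mirror the "spleen" row shown in the guide's index file.
samples = [
    {
        "id": "spleen",
        "fastqc": "results/fastqc/spleen",
        "quant": "results/quant/spleen",
    },
]


def write_index(records):
    """Serialize records as CSV, using the keys of the first record as the header."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=list(records[0].keys()),
        quoting=csv.QUOTE_ALL,   # quote every field, as in the guide's example row
        lineterminator="\n",
    )
    writer.writeheader()         # header comes straight from the map keys
    writer.writerows(records)
    return buffer.getvalue()


index_text = write_index(samples)
print(index_text, end="")
```

With tuples, the header would have to be maintained by hand and kept in sync with the field order; with maps, adding a key adds a column.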

christopher-hakkaart · 2025-06-26T23:53:17Z

docs/guides/workflow-outputs.md

+
+We use maps instead of tuples so that we can access fields by name, and so that the index file can use the map keys as column names.
+
+Declare the `samples` output with an index file:


Suggested change

Declare the `samples` output with an index file:

The `samples` output is declared with an index file:

christopher-hakkaart · 2025-06-26T23:54:03Z

docs/guides/workflow-outputs.md

+}
+```
+
+Since each channel value now contains multiple files that were going to different subdirectories, we must use *publish statements* in the `path` directive to route each file to the appropriate location.


Suggested change

Since each channel value now contains multiple files that were going to different subdirectories, we must use *publish statements* in the `path` directive to route each file to the appropriate location.

Since each channel value contains multiple files that were going to different subdirectories, the *publish statements* in the `path` directive were used to route each file to the appropriate location.
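
The routing idea can be modeled outside Nextflow as well. The Python sketch below is only an analogy, under the assumption that each sample record has an `id` plus one file per tool; the `ROUTES` table, the `route_files` helper, and the work-directory paths are hypothetical. It shows each file field of one record being sent to its own subdirectory, which is the role the publish statements play in the `path` directive.

```python
from pathlib import PurePosixPath

# Hypothetical mapping from a record field to its target subdirectory.
ROUTES = {"fastqc": "fastqc", "quant": "quant"}


def route_files(sample, output_dir="results"):
    """Return the publish location for each routed field of one sample record."""
    root = PurePosixPath(output_dir)
    return {
        field: str(root / subdir / sample["id"])
        for field, subdir in ROUTES.items()
    }


# The source paths are made up; only the destinations matter here.
sample = {
    "id": "spleen",
    "fastqc": "work/ab/12/fastqc_spleen",
    "quant": "work/cd/34/spleen",
}
destinations = route_files(sample)
print(destinations)
```

The resulting destinations correspond to the locations recorded for the `spleen` sample in the index file.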

christopher-hakkaart · 2025-06-26T23:55:06Z

docs/guides/workflow-outputs.md

+
+Since each channel value now contains multiple files that were going to different subdirectories, we must use *publish statements* in the `path` directive to route each file to the appropriate location.
+
+Finally, run the pipeline to verify the index file:


Suggested change

Finally, run the pipeline to verify the index file:

This would produce the following index file:

christopher-hakkaart · 2025-06-26T23:56:59Z

docs/guides/workflow-outputs.md

+"spleen","results/fastqc/spleen","results/quant/spleen"
+```
+
+In the future, if we add a tool with per-sample outputs, we only need to join the tool output into the `samples_ch` channel and update the output `path` directive accordingly. This approach keeps our output definition concise as we add more tools to the pipeline. Additionally, a single unified index file for all per-sample outputs is easier for downstream pipelines to consume, rather than cross-referencing multiple related index files.


Suggested change

In the future, if we add a tool with per-sample outputs, we only need to join the tool output into the `samples_ch` channel and update the output `path` directive accordingly. This approach keeps our output definition concise as we add more tools to the pipeline. Additionally, a single unified index file for all per-sample outputs is easier for downstream pipelines to consume, rather than cross-referencing multiple related index files.

In the future, if a tool with per-sample outputs were added, its output could be joined into the `samples_ch` channel and the output `path` directive updated accordingly. This approach keeps the output definition concise as more tools are added to the pipeline. Additionally, a single unified index file for all per-sample outputs is easier for downstream pipelines to consume than multiple related index files that must be cross-referenced.

bentsherman · 2025-06-27T16:07:42Z

I split the Guides section into Tutorials and Guides. Based on our previous discussion, this migration guide seems to be a "tutorial", since it is now a strict step-by-step guide, but also contains some explanations. Whereas I think your page on spot retries is the only guide in this list?

I don't think the past-tense suggestions are a good fit for what I'm trying to do. It feels too brittle and distant, whereas the present-tense imperative feels more engaging and interactive. A similar approach is taken in the "tutorials" for data lineage and Flux, for similar reasons. So I would prefer to try to make it work with the current approach.

Migration guide for workflow outputs [ci fast]

38d8fe8

bentsherman requested a review from a team as a code owner June 5, 2025 00:42

bentsherman added the docs label Jun 5, 2025

christopher-hakkaart reviewed Jun 5, 2025

bentsherman added 2 commits June 5, 2025 16:44

Apply suggestions from review

6c433f0

Merge branch 'master' into docs-workflow-outputs-guide

ccdeba4

bentsherman mentioned this pull request Jun 6, 2025

New RFC: Switching from publishDir to workflow output nf-core/proposals#47

Open

1 task

Merge branch 'master' into docs-workflow-outputs-guide

0771108

christopher-hakkaart reviewed Jun 27, 2025

bentsherman added 7 commits June 27, 2025 10:07

Merge branch 'master' into docs-workflow-outputs-guide

03831cf

Reorganize tutorials and guides

8271807

Apply suggestions from review

cbe24e4

minor edit

66ea88e

Merge branch 'master' into docs-workflow-outputs-guide

ff1d2d5

Apply suggestion from review [ci fast]

8374d43

minor edits

107409c

