What is a good pattern for aggregating the results from Kubeflow Pipelines kfp.ParallelFor?
Aggregate results when using Kubeflow Pipelines kfp.ParallelFor
Not exactly what you asked for, but our workaround was to have each ParallelFor task write its result to S3 and then simply collect the results afterwards in a single postprocessing task:
with dsl.ParallelFor(preprocessing_task.output) as plant_item:
    predict_plant = '{}'.format(plant_item)
    forecasting_task = forecasting_op(predict_plant, ....).after(preprocessing_task)
postprocessing_task = postprocessing_op(...).after(forecasting_task)
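The collect-afterwards idea can be sketched in plain Python, using a local directory as a stand-in for the shared S3 prefix (bucket layout, file names, and the `aggregate` helper are all hypothetical; in the real pipeline each forecasting task would upload its result object to S3 and the postprocessing task would list and download that prefix instead):

```python
import json
import tempfile
from pathlib import Path

# Stand-in for the shared S3 prefix the ParallelFor tasks write into,
# e.g. s3://<bucket>/forecasts/<plant_item>.json (names hypothetical).
results_dir = Path(tempfile.mkdtemp())

# Each loop iteration (the forecasting_op equivalent) writes its own result file.
for plant_item in ["plant_a", "plant_b", "plant_c"]:
    (results_dir / f"{plant_item}.json").write_text(
        json.dumps({"plant": plant_item, "forecast": [1.0, 2.0]})
    )

# The single postprocessing_op equivalent runs after the whole loop and
# aggregates everything it finds under the shared prefix.
def aggregate(results_dir):
    return [json.loads(p.read_text()) for p in sorted(results_dir.glob("*.json"))]

aggregated = aggregate(results_dir)
print(len(aggregated))  # one entry per loop iteration, here 3
```

The key point is that the postprocessing step takes no per-item input from the loop; it depends only on the shared storage location, which is why a single node suffices.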
(After multiple suggested edits: no, the postprocessing step is not inside the loop; it comes afterwards. That is exactly what collects the results.) – Culch
Are you aware of any documentation supporting that? When I try to recreate this approach, one postprocessing_task node appears in the graph for each of my forecasting_task equivalents. – Spahi

Did you put plant_item or predict_plant as an input to it? If so, that would explain it. But no, I couldn't find any documentation, just trial and error. – Culch
At the moment this might not be supported: