
Remove stages from Pipeline API #5244

Merged
merged 15 commits into NVIDIA:main on Jan 29, 2024

Conversation

mzient
Contributor

@mzient mzient commented Dec 13, 2023

Category:

New feature (non-breaking change which adds functionality)
Refactoring (Redesign of existing code that doesn't affect functionality)

Description:

What

Get rid of the separate RunCPU/RunGPU (and RunMixed) entry points in the Pipeline and Executor interfaces

Why

Because it doesn't make sense to have separate APIs for running stages when we want to get rid of stages (or, if you prefer, have a flexible number of stages)

Why we can do it

These APIs are NOT present in Python or C, and the C++ API is not considered official or stable

How

  • Remove the APIs from Pipeline entirely
  • Add a new Prefetch API that fills the queues according to the buffer sizes
  • Add a new feed_input_count API that tells the user/frontend how many times to fill a buffer for a particular input (much easier to use and harder to get wrong with separated queues - we were using it incorrectly before!)
  • Hide the prefetching pattern from the user/frontend behind Prefetch and feed_input_count (see the sketch after this list)
  • Remove the APIs from the executor's interface
  • Keep the APIs in the executor as private members (they are invoked from Run and Prefetch)
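
To make the new pattern concrete, here is a minimal sketch (in C++) of how a frontend could drive a pipeline after this change. Pipeline::Prefetch() comes from this PR and InputFeedCount(name) from the doc comment quoted later in this thread; the header path, exact signatures, input names and the FeedOneBatch helper are assumptions used only for illustration.

#include <string>
#include <vector>

#include "dali/pipeline/pipeline.h"  // header path assumed

// Hypothetical helper: feeds one batch to the named external input
// (e.g. via the SetExternalInput / feed_input machinery).
void FeedOneBatch(dali::Pipeline &pipe, const std::string &name);

void StartPipeline(dali::Pipeline &pipe) {
  // Ask the pipeline how many batches each external input needs up front.
  // With separated queues the counts may differ per input, so the frontend
  // no longer has to know the prefetching pattern to get this right.
  std::vector<std::string> inputs = {"images", "labels"};  // illustrative names
  for (const auto &name : inputs) {
    int count = pipe.InputFeedCount(name);
    for (int i = 0; i < count; i++)
      FeedOneBatch(pipe, name);
  }
  // Fill the execution queues according to the buffer sizes; from here on,
  // the regular run/outputs loop takes over.
  pipe.Prefetch();
}

This is the same pattern that the Python frontend now hides behind feed_input_count and the reworked prefetching logic.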

Additional information:

Affected modules and functionalities:

Executor, Pipeline, Python frontend for the pipeline, C API, TF plugin

Key points relevant for the review:

Tests:

Existing tests apply - ALL OF THEM except those that called the removed APIs from C++ directly.

  • Existing tests apply
  • New tests added
    • Python tests
    • GTests
    • Benchmark
    • Other
  • N/A

Checklist

Documentation

  • Existing documentation applies
  • Documentation updated
    • Docstring
    • Doxygen
    • RST
    • Jupyter
    • Other
  • N/A

DALI team only

Requirements

  • Implements new requirements
  • Affects existing requirements
  • N/A

REQ IDs: N/A

JIRA TASK: DALI-3743

@dali-automaton
Collaborator

CI MESSAGE: [11867574]: BUILD STARTED

dali/python/nvidia/dali/pipeline.py (fixed)
dali/python/nvidia/dali/pipeline.py (fixed)
@dali-automaton
Collaborator

CI MESSAGE: [11867574]: BUILD FAILED

* @return The number of calls to be made
*/
DLL_PUBLIC int
daliInputFeedCount(daliPipelineHandle *pipe_handle, const char *input_name);
Member

Since this corresponds to the daliSetExternalInput... function set, maybe it would be good to call it daliExternalInputFeedCount? Alternatively, if this function can be used on any input (not only the external one), maybe the documentation should be modified, since now it points to the daliSetExternal... function only?

Contributor Author

I'm not entirely satisfied with this name. I was thinking about something like InputPrefetchCount or InputPrefetchFeedCount.

Alternatively, if this function can be used on any input (not only the external one),

What other inputs are there? Everything that's not external just feeds itself. Even if you called it on some other operator, the information would be non-actionable and even misleading.
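
For illustration only, a hedged sketch of how this function is meant to be used at the C API level, whatever its final name ends up being. Only daliInputFeedCount's signature comes from this diff; daliPrefetch and feed_one_batch are assumed names standing in for the new prefetch entry point and for the application's existing daliSetExternalInput... calls.

/* Sketch only; see the assumptions above. */
void warm_up(daliPipelineHandle *handle, const char *input_name) {
  int count = daliInputFeedCount(handle, input_name);
  for (int i = 0; i < count; i++)
    feed_one_batch(handle, input_name);  /* wraps a daliSetExternalInput... call */
  daliPrefetch(handle);                  /* assumed name of the new prefetch entry point */
}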

Comment on lines 778 to 779
* @param fill_queue If true, the inputs are fed `InputFeedCount(name)` times;
* otherwise it's fed once.
Member

Should it be plural?

Suggested change
- * @param fill_queue If true, the inputs are fed `InputFeedCount(name)` times;
- *                   otherwise it's fed once.
+ * @param fill_queue If true, the inputs are fed `InputFeedCount(name)` times;
+ *                   otherwise they're fed once.

Contributor Author

Yeah, I guess...

@dali-automaton
Collaborator

CI MESSAGE: [11881000]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [11881000]: BUILD FAILED

@dali-automaton
Collaborator

CI MESSAGE: [11886704]: BUILD STARTED

@mzient mzient marked this pull request as ready for review January 4, 2024 16:33
@dali-automaton
Collaborator

CI MESSAGE: [11888104]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [11886704]: BUILD FAILED

@dali-automaton
Collaborator

CI MESSAGE: [11888104]: BUILD FAILED

@dali-automaton
Collaborator

CI MESSAGE: [11907275]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [11907275]: BUILD FAILED

@dali-automaton
Collaborator

CI MESSAGE: [11908302]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [11908302]: BUILD FAILED

@dali-automaton
Collaborator

CI MESSAGE: [11961717]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [11961717]: BUILD PASSED

@NVIDIA NVIDIA deleted a comment from dali-automaton Jan 9, 2024
Comment on lines -74 to -76
DLL_PUBLIC virtual void RunCPU() = 0;
DLL_PUBLIC virtual void RunMixed() = 0;
DLL_PUBLIC virtual void RunGPU() = 0;
Contributor Author

These functions go away from the interface...

Comment on lines +169 to +171
DLL_PUBLIC virtual void RunCPU();
DLL_PUBLIC virtual void RunMixed();
DLL_PUBLIC virtual void RunGPU();
Contributor Author

...and re-emerge as an implementation detail.
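
A minimal sketch of the resulting shape, based only on the description in this PR; the class name, the queue handling and everything other than Run/Prefetch/RunCPU/RunMixed/RunGPU are illustrative.

class SketchExecutor {
 public:
  // One logical iteration; callers no longer drive individual stages.
  void Run() {
    RunCPU();
    RunMixed();
    RunGPU();
  }
  // Fills the queues according to the configured depths before the first outputs.
  void Prefetch() {
    for (int i = 0; i < queue_depth_; i++)
      Run();
  }

 private:
  // Gone from the public interface; kept as implementation details that are
  // invoked only from Run() and Prefetch().
  virtual void RunCPU()   { /* execute the CPU stage   */ }
  virtual void RunMixed() { /* execute the mixed stage */ }
  virtual void RunGPU()   { /* execute the GPU stage   */ }

  int queue_depth_ = 2;  // illustrative; the real value comes from the queue sizes
};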

@stiepan stiepan (Member) left a comment

I have yet to read a few files, but I am leaving a few comments that crossed my mind so far.

dali/c_api/c_api.cc (resolved; outdated)
dali/python/nvidia/dali/pipeline.py (resolved; outdated)
dali/python/nvidia/dali/pipeline.py (resolved)
@szkarpinski szkarpinski mentioned this pull request Jan 23, 2024
Add Pipeline::Prefetch.
Rework prefetching mechanism in Python.
Fix tests.

Signed-off-by: Michal Zientkiewicz <michalz@nvidia.com>
@dali-automaton
Collaborator

CI MESSAGE: [12427612]: BUILD STARTED

@dali-automaton
Collaborator

CI MESSAGE: [12427612]: BUILD FAILED

@dali-automaton
Collaborator

CI MESSAGE: [12427612]: BUILD PASSED

@mzient mzient merged commit 6fec3f1 into NVIDIA:main Jan 29, 2024
6 checks passed
@dali-automaton
Collaborator

CI MESSAGE: [12433588]: BUILD STARTED
