PR #10489: [GPU] Fix command buffer support for cuDNN fusions. (f152cc8d)

Commit f152cc8d authored 1 year ago by Ilia Sergachev. Committed by TensorFlower Gardener 1 year ago.

PR #10489: [GPU] Fix command buffer support for cuDNN fusions.

Imported from GitHub PR https://github.com/openxla/xla/pull/10489

CuDnnCmd is constructed before the DnnGraph in CuDnnThunk is initialized, so CuDnnCmd has to take a unique_ptr<DnnGraph>& instead of a DnnGraph& at construction.

Accordingly, cuDNN thunks have to be initialized before command buffer thunks, so that the graphs are built before they get captured.
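The lifetime issue described above can be illustrated with a minimal sketch. The types GraphThunk, GraphCmd and Graph, and the function RecordAfterInitialize, are hypothetical stand-ins invented for this example; they are not the XLA classes and do not mirror their real interfaces. The sketch only shows why a command built early can hold a reference to the owning unique_ptr and still see the graph once the thunk is initialized.

```cpp
#include <cassert>
#include <memory>
#include <string>

// Hypothetical stand-in for a lazily built graph (not the real DnnGraph).
struct Graph {
  std::string name;
};

// Hypothetical stand-in for the thunk that owns the graph.
class GraphThunk {
 public:
  // Builds the graph; it does not exist before this call.
  void Initialize() {
    if (!graph_) graph_ = std::make_unique<Graph>(Graph{"fusion"});
  }

  // Exposes the owning pointer itself, so holders observe later initialization.
  const std::unique_ptr<Graph>& graph() const { return graph_; }

 private:
  std::unique_ptr<Graph> graph_;
};

// Hypothetical stand-in for the command that captures the graph.
class GraphCmd {
 public:
  // A Graph& parameter would force dereferencing a still-null pointer here,
  // so the command keeps a reference to the unique_ptr instead.
  explicit GraphCmd(const std::unique_ptr<Graph>& graph) : graph_(graph) {}

  // Recording can only succeed once the owning thunk has built the graph.
  bool Record() const { return graph_ != nullptr; }

 private:
  const std::unique_ptr<Graph>& graph_;
};

// Returns true if recording fails before initialization and succeeds after it.
bool RecordAfterInitialize() {
  GraphThunk thunk;
  GraphCmd cmd(thunk.graph());  // constructed before the graph exists
  const bool before = cmd.Record();
  thunk.Initialize();           // must run before the command is recorded
  const bool after = cmd.Record();
  return !before && after;
}
```

In this sketch the command is only usable while the thunk outlives it, which is the same ordering constraint the commit message states: initialize the owning thunk first, then capture.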

The test CommandBuffersAreSupported did not previously exercise command buffers: the corresponding command buffer call was inlined, so no command buffers were created. This is now cleaned up, and the test works as expected with the minimal CUDA graph size set to 1 via a flag.
Copybara import of the project:

--
8547c674f3e0858efca9763bed586f1d796184d7 by Ilia Sergachev <isergachev@nvidia.com>:

[GPU] Fix command buffer support for cuDNN fusions.

Merging this change closes #10489

FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/openxla/xla/pull/10489 from openxla:fix_cudnn_cmd_buffers 8547c674f3e0858efca9763bed586f1d796184d7
PiperOrigin-RevId: 615965989

parent 255e14e5

Showing with 1081 additions and 64 deletions