[XLA] Fix a bug in SchedulingAnnotationCrossesOverlapLimit. (1ff32b86) · Commits · gitlab-org / build / omnibus-mirror / tensorflow

Commit 1ff32b86 authored 5 months ago by Seher Ellis; committed by TensorFlower Gardener 5 months ago

[XLA] Fix a bug in SchedulingAnnotationCrossesOverlapLimit.

Before this code change, we checked whether scheduling each instruction in a given group individually would cross any limits. This check was not strong enough because
i) async instructions in the group are supposed to overlap each other and
ii) total resource usage of the group can still exceed the limit while individual usages do not.

For example, if the all-gather limit is 1, we should not allow a group with 2 async all-gathers to be scheduled.

With this code change, we compute the "accumulated" resource usage of the annotation group and compare that against the limit.
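
The difference between the two checks can be sketched with a small toy model. This is not the XLA implementation: the function names, the dictionary-based resource representation, and the `in_use` parameter are all hypothetical, chosen only to illustrate why a per-instruction check can pass while the accumulated usage of the group exceeds the limit.

```python
from collections import Counter

# Toy model only: resources are plain strings and each instruction's
# usage is a dict {resource: amount}. None of these names come from XLA.

def crosses_limit_per_instruction(group, in_use, limits):
    """Pre-fix style check: test each instruction's usage on its own."""
    for usage in group:
        for resource, amount in usage.items():
            limit = limits.get(resource, float("inf"))
            if in_use.get(resource, 0) + amount > limit:
                return True
    return False

def crosses_limit_accumulated(group, in_use, limits):
    """Post-fix style check: sum usage over the whole group first."""
    total = Counter()
    for usage in group:
        total.update(usage)  # adds amounts per resource
    for resource, amount in total.items():
        limit = limits.get(resource, float("inf"))
        if in_use.get(resource, 0) + amount > limit:
            return True
    return False

# The example from the commit message: an all-gather limit of 1 and a
# group containing 2 async all-gathers that are meant to overlap.
limits = {"all-gather": 1}
group = [{"all-gather": 1}, {"all-gather": 1}]
in_use = {}  # assumption: nothing else is in flight

old_result = crosses_limit_per_instruction(group, in_use, limits)
new_result = crosses_limit_accumulated(group, in_use, limits)
print(old_result, new_result)  # False True
```

In this model the per-instruction check lets the group through because each all-gather alone stays within the limit, while the accumulated check rejects it because the group as a whole needs 2 units of a resource limited to 1.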

PiperOrigin-RevId: 726684858

parent bef0b9e3

Showing with 54 additions and 30 deletions