-
Notifications
You must be signed in to change notification settings - Fork 497
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[XLA:CPU] Extend the custom algorithm for transposed convolutions
This commit adds support for a case with multiple input and output channels at the same time. Performance of the already supported cases is not impacted. New cases show expected performance improvement. Results: name old cpu/op new cpu/op delta BM_Conv1DTransposedStrided/129/1/process_time 34.0ms ±15% 34.7ms ±17% ~ (p=0.548 n=5+5) BM_Conv1DTransposedStrided/129/3/process_time 15.4s ±21% 0.1s ±13% -99.52% (p=0.008 n=5+5) BM_Conv1DTransposedStridedNonDefaultLayout/129/1/process_time 32.5ms ±15% 32.4ms ±17% ~ (p=1.000 n=5+5) BM_Conv1DTransposedStridedNonDefaultLayout/129/3/process_time 16.2s ±18% 0.1s ±14% -99.55% (p=0.008 n=5+5) BM_Conv2DTransposedStrided/process_time 36.1ms ±16% 34.9ms ±19% ~ (p=0.841 n=5+5) name old time/op new time/op delta BM_Conv1DTransposedStrided/129/1/process_time 9.58ms ±22% 9.56ms ±21% ~ (p=1.000 n=5+5) BM_Conv1DTransposedStrided/129/3/process_time 732ms ±26% 15ms ±19% -97.91% (p=0.008 n=5+5) BM_Conv1DTransposedStridedNonDefaultLayout/129/1/process_time 8.96ms ±18% 8.91ms ±23% ~ (p=0.841 n=5+5) BM_Conv1DTransposedStridedNonDefaultLayout/129/3/process_time 783ms ±24% 14ms ±18% -98.21% (p=0.008 n=5+5) BM_Conv2DTransposedStrided/process_time 10.2ms ±22% 9.9ms ±22% ~ (p=0.690 n=5+5) Planned improvements of this algorithm: - support feature_group_size > 1 (grouped convolution), - parallel packing of the patches (second algorithm step), - explore input kernel rotation possibilities & perf impact, PiperOrigin-RevId: 710297666
- Loading branch information
1 parent
82d1bb3
commit 5042012
Showing
2 changed files
with
92 additions
and
49 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters