What's Changed
- [misc] Add latest cutlass 3.7.0 submodule by @DefTruth in #62
- [Bugfix] fix macro typo by @DefTruth in #63
- [Misc] Update launch templates configs for small d by @DefTruth in #64
- [misc] remove some wrong comments by @DefTruth in #65
- [test] refactor ffpa-l1 multi-stages tests by @DefTruth in #66
- Revert "[test] refactor ffpa-l1 multi-stages tests" by @DefTruth in #67
- [test] refactor ffpa-l1 multi-stages tests by @DefTruth in #68
- [test] Add official flash-attn -> test cases by @DefTruth in #69
- [feat] support ffpa-l1 registers double buffers by @DefTruth in #70

Full Changelog: v0.0.2...v0.0.2.post1