I automated generating fused attention kernels, and it works for tons of variants! Watch 2min demo:
1,94K