


default search action
"FlashMask: Efficient and Rich Mask Extension of FlashAttention."
Guoxia Wang et al. (2024)
- Guoxia Wang, Jinle Zeng, Xiyuan Xiao, Siming Wu, Jiabin Yang, Lujing Zheng, Zeyu Chen, Jiang Bian, Dianhai Yu, Haifeng Wang:
FlashMask: Efficient and Rich Mask Extension of FlashAttention. CoRR abs/2410.01359 (2024)

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.