Alibaba Cloud Big Data AI Platform
Jul 11, 2022 · Artificial Intelligence
How Structure-Aware Sparse Attention Boosts Long-Code Transformers
The SASA model, a structure‑aware sparse‑attention Transformer developed by Alibaba Cloud PAI and Prof. Gao Ming’s team, improves long‑code sequence processing by sparsifying self‑attention using top‑k frequency and AST pattern matrices, achieving higher performance and lower memory/computation costs on CodeXGLUE benchmarks.
ASTCode UnderstandingLong Sequences
0 likes · 8 min read
