Fast implementations of commonly used multi-op functions.
Functions
- Layer normalization.
- Layer normalization.
- Root Mean Square normalization (RMS norm).
- Root Mean Square normalization (RMS norm).
- Optimized implementation of NN.RoPE.
- Optimized implementation of NN.RoPE.
- A fast implementation of multi-head attention: O = softmax(Q @ K.T, dim=-1) @ V
- A fast implementation of multi-head attention: O = softmax(Q @ K.T, dim=-1) @ V
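
For orientation, the math behind the normalization and attention entries can be sketched in plain Rust using only the standard library. This is a reference sketch, not this module's API: the function names (`layer_norm_ref`, `rms_norm_ref`, `softmax_ref`, `attention_ref`) are hypothetical, the learned weight and bias of the normalization layers are left out, and the `scale` parameter on attention is an assumption (pass `1.0` to get the formula exactly as written above). The fused implementations in this module exist to compute the same results in fewer passes over the data.

```rust
/// Layer normalization over one feature vector: (x - mean) / sqrt(var + eps).
/// The learned weight and bias are omitted to keep the sketch short.
fn layer_norm_ref(x: &[f32], eps: f32) -> Vec<f32> {
    let n = x.len() as f32;
    let mean = x.iter().sum::<f32>() / n;
    let var = x.iter().map(|v| (v - mean).powi(2)).sum::<f32>() / n;
    let inv_std = 1.0 / (var + eps).sqrt();
    x.iter().map(|v| (v - mean) * inv_std).collect()
}

/// RMS normalization: x / sqrt(mean(x^2) + eps). Unlike layer norm,
/// the mean is not subtracted.
fn rms_norm_ref(x: &[f32], eps: f32) -> Vec<f32> {
    let n = x.len() as f32;
    let mean_sq = x.iter().map(|v| v * v).sum::<f32>() / n;
    let inv_rms = 1.0 / (mean_sq + eps).sqrt();
    x.iter().map(|v| v * inv_rms).collect()
}

/// Numerically stable softmax over one row (subtracts the row maximum first).
fn softmax_ref(row: &[f32]) -> Vec<f32> {
    let max = row.iter().cloned().fold(f32::NEG_INFINITY, f32::max);
    let exps: Vec<f32> = row.iter().map(|v| (v - max).exp()).collect();
    let sum: f32 = exps.iter().sum();
    exps.iter().map(|e| e / sum).collect()
}

/// Single-head attention: O = softmax(scale * Q @ K.T, dim=-1) @ V.
/// Shapes: q is [n_q][d], k is [n_k][d], v is [n_k][d_v]; v must be non-empty.
/// `scale` is an assumption of this sketch, not part of the formula above.
fn attention_ref(
    q: &[Vec<f32>],
    k: &[Vec<f32>],
    v: &[Vec<f32>],
    scale: f32,
) -> Vec<Vec<f32>> {
    q.iter()
        .map(|q_row| {
            // One score per key: scaled dot product of the query with that key.
            let scores: Vec<f32> = k
                .iter()
                .map(|k_row| {
                    scale * q_row.iter().zip(k_row).map(|(a, b)| a * b).sum::<f32>()
                })
                .collect();
            let weights = softmax_ref(&scores);
            // Output row is the attention-weighted sum of the value rows.
            let mut out = vec![0.0f32; v[0].len()];
            for (w, v_row) in weights.iter().zip(v) {
                for (o, x) in out.iter_mut().zip(v_row) {
                    *o += w * x;
                }
            }
            out
        })
        .collect()
}
```

A multi-head version would apply `attention_ref` independently to each head's slice of Q, K, and V. Transformer implementations commonly set `scale` to `1 / sqrt(d)`.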