Aug 23, 2025 Implementing Flash-Attention with Softmax Offset (Sinks)
Nov 22, 2024 Rounding Errors of FP16 and its impact in Machine Learning