Reads Like Teen Spirit
Blogs about GenAI LLM

Implementing Flash-Attention with Softmax Offset (Sinks)
3 min read · August 23, 2025 · Infrastructure

Rounding Errors of FP16 and its impact in Machine Learning
3 min read · November 22, 2024 · Infrastructure

LLMs are secretly lossless text compressor and how to use it like one
7 min read · November 22, 2024 · LLM