d-Matrix interview question

LLM Quantization methods. Flash Attention