Accelerating Gemma 4: faster inference with multi-token prediction drafters 668 points by amrrs 1 days ago 326 comments story