Completely overhauled the attention implementation. Using the existing Gemma-3 attention implementation rather than custom monkey-patched implementation. (#10) 17d96ff verified AshwinSankar psidharth567 commited on Jun 18