Would there ever be a GGUF for the gemma-3n models?

#11
by bdutta - opened

I understand that Gemma 3n is designed specifically for edge devices, but a good, efficient, highly optimized, open, lightweight model could be useful for non-edge cases as well. So I'm wondering if we might expect to see a GGUF for this family of models anytime soon?

Yes, this repository is just a preview. Stay tuned!


would really love to try it on my macbook locally

Yes, I want to run this on Ollama. Gemma 3n seems like a perfect fit! Thanks, team :)

Gemma 3n is based on the MatFormer architecture, so the inference engine probably needs significant updates beyond just converting the weights to GGUF format.

> would really love to try it on my macbook locally

You can, using MediaPipe. I built an iOS/macOS app yesterday. It downloads both models (2B and 4B) from Hugging Face (authenticate first to get a token and accept Google's TOS), and it runs very well on Macs and high-end iPhones. https://ai.google.dev/edge/mediapipe/solutions/genai/llm_inference/ios
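For anyone curious what the MediaPipe route looks like, here is a minimal Swift sketch based on the LLM Inference API guide linked above. The model file name is illustrative, and the exact class and module names may differ slightly from the shipping SDK, so treat this as a starting point rather than a drop-in implementation.

```swift
// Sketch: running a Gemma 3n model bundle with the MediaPipe LLM Inference API.
// Assumes the MediaPipeTasksGenAI framework is installed via CocoaPods and a
// .task model bundle has already been downloaded (file name below is hypothetical).
import MediaPipeTasksGenAI

// Point at the downloaded model bundle.
let modelPath = Bundle.main.path(forResource: "gemma-3n-E2B-it", ofType: "task")!

// Configure inference options (token budget, sampling, etc.).
let options = LlmInference.Options(modelPath: modelPath)
options.maxTokens = 512

// Create the engine and run a synchronous one-shot generation.
let llm = try LlmInference(options: options)
let answer = try llm.generateResponse(inputText: "Explain GGUF in one sentence.")
print(answer)
```

The same API surface is what makes the "Designed for iPad" path discussed below work: the iOS app, including its MediaPipe inference code, runs unmodified on Apple-silicon Macs.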

So while the documentation says iOS, it works on macOS too? Interesting option. I'd still prefer GGUF (if feasible), as that would let me slot it into my current tooling and workflow.

Sorry, I meant Mac (via "Designed for iPad"), not native macOS. It works great for testing, btw.

Does anyone know roughly how long we'll have to wait for a GGUF?
