Skip to content

Pre-Quantized Checkpoints: Gemma 4 models #1237

@rnett

Description

@rnett

Detailed description of the requested feature

It would be great to have gemma 4 models in your published optimized models (i.e. https://huggingface.co/collections/nvidia/inference-optimized-checkpoints-with-model-optimizer).

Timeline

Soon, but not blocking.

Describe alternatives you've considered

Target hardware/use case

For me, a 5080 for local inference. But it would be broadly useful for anyone running gemma 4 models.

Metadata

Metadata

Assignees

Labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions