
Pull requests: vllm-project/llm-compressor


Pull requests list

add: GQA support to AWQ
Labels: awq, transforms, two-reviews
#2818 opened Jun 10, 2026 by soyr-redhat Contributor
4 tasks done
feat: add handle_cache_error decorator to IntermediatesCache
Labels: enhancement
#2817 opened Jun 9, 2026 by kylesayrs Collaborator
4 tasks done
model/checkpoint mapping for ignore list + gemma4 example
Labels: fp8, moe, ready
#2816 opened Jun 9, 2026 by HDCharles Collaborator
[QuIP] Change default to only use input (v) rotations
Labels: enhancement, transforms
#2815 opened Jun 9, 2026 by kylesayrs Collaborator
fix: cpu offload hits OOM
Labels: awq, bug, model_free_ptq, two-reviews
#2813 opened Jun 9, 2026 by JINO-ROHIT Contributor
Add Mistral 4 MoE calibration support
Labels: enhancement, moe, needs-rebase, tracing, transforms, two-reviews
#2812 opened Jun 9, 2026 by robertforbes68 Draft
Add load_context unified context manager
Labels: enhancement, moe, ready
#2811 opened Jun 9, 2026 by kylesayrs Collaborator
Update dep versions for release
Labels: enhancement, two-reviews
#2810 opened Jun 9, 2026 by dhuangnm Collaborator
[Agents] Configure coderabbit
Labels: enhancement
#2809 opened Jun 9, 2026 by kylesayrs Collaborator
[Skill] Automated PR Validator
#2804 opened Jun 5, 2026 by brian-dellabetta Collaborator Draft
3 of 5 tasks
Add OlmoForCausalLM (v1/v2) AWQ mapping using exaone4-style
Labels: awq, requires-validate, transforms, two-reviews
#2802 opened Jun 4, 2026 by HumphreySun98
Add ArceeForCausalLM mappings for AWQ and SmoothQuant
Labels: awq, enhancement, requires-validate, smoothquant, transforms, two-reviews
#2801 opened Jun 4, 2026 by HumphreySun98
Add MoE architectures to SmoothQuant registry (attention-only)
Labels: enhancement, moe, requires-validate, smoothquant, transforms, two-reviews
#2800 opened Jun 4, 2026 by HumphreySun98
Add StablelmForCausalLM mappings for AWQ and SmoothQuant
Labels: awq, enhancement, requires-validate, smoothquant, transforms, two-reviews
#2799 opened Jun 4, 2026 by HumphreySun98
fix: re-tie word embeddings before save to avoid duplicate lm_head
Labels: bug, quality-failed, ready, transforms, two-reviews
#2798 opened Jun 4, 2026 by EdalatiAli
Add GraniteForCausalLM mappings for AWQ and SmoothQuant
Labels: awq, enhancement, requires-validate, smoothquant, transforms, two-reviews
#2797 opened Jun 4, 2026 by HumphreySun98
[model_free_ptq] Perform validation on meta device
Labels: model_free_ptq
#2796 opened Jun 4, 2026 by kylesayrs Collaborator
[Distributed] Module parallel calibration for QuantizationModifier
#2785 opened Jun 2, 2026 by kylesayrs Collaborator
[Lifecycle] Add modules argument to on_sequential_epoch_end
#2784 opened Jun 2, 2026 by kylesayrs Collaborator
[Lifecycle] Make calibration events first-class citizens
Labels: awq, gptq, ready, smoothquant, transforms, two-reviews
#2783 opened Jun 2, 2026 by kylesayrs Collaborator
[Misc] Remove oneshot output_dir warning
Labels: enhancement
#2782 opened Jun 2, 2026 by kylesayrs Collaborator
[Tests] Sampling params for E2E tests
Labels: enhancement
#2780 opened Jun 1, 2026 by kylesayrs Collaborator Draft
[XPU] Add torch.cuda linter
Labels: Refactor
#2776 opened Jun 1, 2026 by kylesayrs Collaborator Draft
refactor: modernize utils, pipeline, and observers modules with Python 3.10+ type hints
Labels: needs-rebase, Refactor, two-reviews
#2775 opened Jun 1, 2026 by AsadShahid04 Contributor