Fix issue in mul_mat_id for OpenVINO backend by zhaixuejun1993 · Pull Request #163 · ravi9/llama.cpp

zhaixuejun1993 · 2026-05-14T03:28:37Z

This pull request refines the handling and shape management of expert weights and activations in the OpenVINO backend for the MUL_MAT_ID operation. The core improvements ensure that multi-expert weight tensors retain their full dimensionality, and that input/output tensor shapes are rebuilt more robustly, improving compatibility with dynamic or reshaped input graphs.

Key changes include:

Shape Handling and Tensor Materialization:

In ggml-decoder.cpp, when materializing non-quantized expert weights, the code now preserves the full reversed 4D shape, ensuring that the expert dimension is not collapsed. This prevents issues where later operations (like Gather/MatMul) would only see a single expert slice.
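To illustrate why the full reversed shape matters, here is a minimal standalone sketch. It does not reproduce the code in ggml-decoder.cpp. It assumes the usual ggml convention that `ne[0]` is the innermost dimension and that MUL_MAT_ID expert weights are laid out as `ne = [cols, rows, n_expert, 1]`. The helper names (`full_reversed_shape`, `collapsed_2d_shape`, `visible_experts`) are hypothetical, and the collapsed 2D form is only an assumed model of the earlier behavior.

```cpp
#include <array>
#include <cstddef>
#include <cstdint>
#include <vector>

using Shape = std::vector<int64_t>;

// ggml lists dimensions innermost-first in ne[0..3]; reversing them gives an
// outermost-first shape that keeps every axis, including the expert axis.
Shape full_reversed_shape(const std::array<int64_t, 4>& ne) {
    return {ne[3], ne[2], ne[1], ne[0]};
}

// Assumed model of a collapsed materialization: only {rows, cols} survive,
// so the expert axis is gone.
Shape collapsed_2d_shape(const std::array<int64_t, 4>& ne) {
    return {ne[1], ne[0]};
}

// Number of expert slices a later Gather could index. The expert axis is the
// third axis from the end; if the shape has no such axis, one slice is visible.
int64_t visible_experts(const Shape& s) {
    return s.size() >= 3 ? s[s.size() - 3] : 1;
}
```

With hypothetical sizes `ne = {4096, 14336, 8, 1}`, the full shape is `{1, 8, 14336, 4096}` and all 8 experts stay addressable, while the collapsed shape `{14336, 4096}` exposes a single slice.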

Dynamic Shape Reconstruction and Reshaping:

In mul_mat_id.cpp, the logic for squeezing singleton axes from weights, activations, and ids is replaced with explicit dynamic shape reconstruction using ShapeOf and Reshape. This makes the code robust to input tensors that may have undergone reshaping or view operations, ensuring correct logical ranks regardless of input shape permutations.
The output shape of the MatMul result is now explicitly constructed using dynamic shape information and the expected output rank, replacing previous fixed unsqueeze/squeeze logic. This ensures that the output tensor always matches the required 4D shape, with checks for static rank and row dimension.
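The difference between squeezing singleton axes and rebuilding a shape to a known rank can be sketched in plain C++. This is a simplified model of the idea under stated assumptions, not the graph code in mul_mat_id.cpp: `squeeze_all` and `rebuild_to_rank` are hypothetical helpers, and folding the leading axes into one is just one plausible reconstruction rule. In an OpenVINO graph the same effect would be built from runtime shape information (ShapeOf feeding a Reshape) rather than from static vectors.

```cpp
#include <cstddef>
#include <cstdint>
#include <stdexcept>
#include <vector>

using Shape = std::vector<int64_t>;

// Models a blanket squeeze: every axis of extent 1 is dropped, so the
// resulting rank depends on the data sizes.
Shape squeeze_all(const Shape& s) {
    Shape out;
    for (int64_t d : s) {
        if (d != 1) out.push_back(d);
    }
    return out;
}

// Rebuilds a shape with exactly `rank` axes: the trailing rank-1 dimensions
// are kept as they are and all leading dimensions are folded into axis 0.
Shape rebuild_to_rank(const Shape& s, std::size_t rank) {
    if (rank == 0 || s.size() < rank) {
        throw std::invalid_argument("input rank is smaller than target rank");
    }
    const std::size_t lead = s.size() - rank + 1;  // leading axes to fold
    int64_t folded = 1;
    for (std::size_t i = 0; i < lead; ++i) folded *= s[i];

    Shape out(rank);
    out[0] = folded;
    for (std::size_t i = 1; i < rank; ++i) out[i] = s[lead + i - 1];
    return out;
}
```

For a hypothetical activation shape `{1, 1, 1, 4096}` (one token, one expert slot), `squeeze_all` yields rank 1 while `rebuild_to_rank(s, 3)` yields `{1, 1, 4096}`, so the logical rank no longer depends on which dimensions happen to equal 1.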

General Improvements:

Added the missing include for concat.hpp to support the new shape concatenation logic.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure:

zhaixuejun1993 requested review from cavusmustafa and wine99 as code owners May 14, 2026 03:28

github-actions Bot added OpenVINO ggml labels May 14, 2026

OpenVINO backend: fix issue in mul_mat_id

a86baaf

zhaixuejun1993 merged commit 5f58d5d into ravi9:dev_backend_openvino May 15, 2026
3 of 12 checks passed

zhaixuejun1993 merged 1 commit into ravi9:dev_backend_openvino from zhaixuejun1993:xuejun/arch-llama
