Skip to content

[BUG]: IPC-enabled pinned pool uses a fixed host NUMA node #1603

@rwgk

Description

@rwgk

Is this a duplicate?

Type of Bug

Runtime Error

Component

cuda.core

Describe the bug

PinnedMemoryResource(ipc_enabled=True) currently routes pinned IPC pools to host NUMA location ID 0 instead of the NUMA node closest to the active CUDA device. On multi-NUMA systems (especially when the current device is attached to a non-zero host NUMA node), this can cause pinned IPC pool creation/allocation failures or incorrect NUMA placement. The expected behavior is to select the current device's host NUMA ID (with a safe fallback only when the attribute/context is unavailable).

(Generated by Cursor(GPT-5.3 Codex Extra High) based on the production code changes under #1575)

How to Reproduce

See nvbug 5823243

Expected behavior

See test under #1575

Operating System

No response

nvidia-smi output

No response

Metadata

Metadata

Assignees

Labels

bugSomething isn't workingcuda.coreEverything related to the cuda.core module

Type

Projects

No projects

Relationships

None yet

Development

No branches or pull requests

Issue actions