Implement Fabric-aware get/set_local_poses via indexedfabricarray by pv-nvidia · Pull Request #2 · pv-nvidia/IsaacLab

pv-nvidia · 2026-05-05T11:07:06Z

Description

Adopts the prototype design from bareya/pbarejko/camera-update: all transform reads and writes go through wp.indexedfabricarray, the view→fabric mapping is path-based (no custom prim attributes), and FabricFrameView reads/writes omni:fabric:localMatrix directly.

Previously get_local_poses delegated to USD even when Fabric held the source-of-truth world matrices, so after set_world_poses() the next get_local_poses() returned stale values — the reason test_set_world_updates_local was marked xfail(strict=True). An earlier attempt fixed this by composing local = inv(parent_world) * child_world in torch using a Python-level USD parent loop, which was correct but 3× slower than USD. This PR replaces that with two Warp kernels operating on Fabric matrices via a parent indexed fabric array, eliminating the Python bottleneck.

Implementation summary

Three persistent PrimSelection instances (trans_ro, world_rw, local_rw) differing only in per-attribute access mode.
Path-based view→fabric index mapping computed from selection.GetPaths(); no isaaclab:view_index:HASH prim attributes written anywhere.
wp.indexedfabricarray everywhere: kernels dereference ifa[view_index] instead of taking a separate mapping argument.
Stage-level IFabricHierarchy cache + dirty-stages set: multiple FabricFrameView instances on the same stage share one hierarchy lookup.
Symmetric local↔world propagation via Warp kernels: set_local_poses marks the stage dirty; the next world read fires child_world = parent * child_local per child. Mirror kernel runs after set_world_poses to recompute child_local = inv(parent) * child_world.
Two new public Warp kernels in isaaclab.utils.warp.fabric: decompose_indexed_fabric_transforms and compose_indexed_fabric_transforms, plus propagation kernels update_indexed_local_matrix_from_world and update_indexed_world_matrix_from_local.

The shared contract test test_set_world_updates_local no longer needs the xfail override and passes for both cpu and cuda:0. Three additional tests cover the new paths: test_set_local_via_fabric_path, test_get_scales_fabric_path, test_prepare_for_reuse_detects_topology_change. Local run of the full fabric test suite: 41 passed, 0 failed.

Benchmark results

1024 prims, 50 iterations, NVIDIA RTX A6000.

Operation	USD (ms)	Fabric (ms)	Speedup
Get World Poses	13.5	0.067	201×
Set World Poses	30.0	0.123	244×
Interleaved Set→Get	43.8	0.159	276×
Get Local Poses	6.1	0.064	96×
Set Local Poses	8.6	0.053	163×

Reproduce with:

./isaaclab.sh -p scripts/benchmarks/benchmark_view_comparison.py \
  --num_envs 1024 --num_iterations 50 --backends usd fabric \
  --device cuda:0 --headless

Depends on: pv-nvidia:fix/fabric-prepare-for-reuse (PR #<TBD>) — please merge that one first.

Fixes #5

Type of change

• Bug fix (non-breaking change which fixes an issue)
• New feature (non-breaking change which adds functionality)

Checklist

• [x] I have read and understood the [contribution guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html)
• [x] I have run the [pre-commit checks](https://pre-commit.com/) with ./isaaclab.sh --format
• [x] I have made corresponding changes to the documentation
• [x] My changes generate no new warnings
• [x] I have added tests that prove my fix is effective or that my feature works

• [x] I have updated the changelog and the corresponding version in the extension's config/extension.toml file
• [x] I have added my name to the CONTRIBUTORS.md or my name already exists there

Per the fragment-based changelog system (#5434), this PR ships fragments at source/isaaclab_physx/changelog.d/fabric-frame-view-local-poses.rst and source/isaaclab/changelog.d/pv-5381-rebased.rst instead of editing CHANGELOG.rst and config/extension.toml directly — the nightly CI workflow compiles those from the fragments.

…m#5473) # Description  Extend debug Visualization Markers, which are supported in the Kit Visualizer, to the Newton Visualizers. These Visualization Markers are various shapes and models which can be added to envs for debugging / showing extra information. Also added filtering for partial visualization (when we filtered which envs are shown the in the visualizer, we also filter the markers) For general USD mesh marker support in Newton, a followup PR will be required, once a Newton API for general USD -> Newton Mesh conversion is added (see newton-physics/newton#2667) Checked velocity arrows, dexcubes, raycasts, frames, goal markers  ## Type of change  - New feature (non-breaking change which adds functionality) - Documentation update ## Screenshots Please attach before and after screenshots of the change if applicable.  ## Checklist - [ ] I have read and understood the [contribution guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html) - [ ] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format` - [ ] I have made corresponding changes to the documentation - [ ] My changes generate no new warnings - [ ] I have added tests that prove my fix is effective or that my feature works - [ ] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file - [ ] I have added my name to the `CONTRIBUTORS.md` or my name already exists there  --------- Signed-off-by: matthewtrepte <mtrepte@nvidia.com>

# Description - Increase the CI startup-hang grace period from 45s to 120s so slow but valid Kit startup is not killed prematurely. - Make `SurfaceGripper` fail fast on non-CPU simulation backends before loading the surface gripper extension. - Skip the CI-only `SurfaceGripperView` CPU initialization path that can deadlock, while keeping CUDA fail-fast coverage. ## Type of change  - Bug fix (non-breaking change which fixes an issue) ## Checklist - [x] I have read and understood the [contribution guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html) - [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format` - [x] I have made corresponding changes to the documentation - [x] My changes generate no new warnings - [ ] I have added tests that prove my fix is effective or that my feature works - [ ] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file - [ ] I have added my name to the `CONTRIBUTORS.md` or my name already exists there

@kellyguo11

…isaac-sim#5478) Follow-up to isaac-sim#5434 (fragment-based changelog system). Two contributor-facing references still pointed at the old "edit CHANGELOG.rst directly" workflow: - **`docs/source/refs/contributing.rst`** — *Maintaining a changelog and extension.toml* section described per-version editing of CHANGELOG.rst with manual SemVer bumps. - **`.github/PULL_REQUEST_TEMPLATE.md`** — checklist asked contributors to update the changelog and bump extension.toml directly. Replaced only the parts that talk about direct editing; section/style guidance (Added/Changed/Deprecated/Removed/Fixed, past tense, the sample bullets themselves) stays intact. ## Test plan - [x] Pre-commit clean - [ ] Verify Build Latest Docs CI step renders the new section correctly cc @kellyguo11 — addresses the doc gaps flagged after isaac-sim#5434 merged.

# Description Mark all RTX-based rendering test cases flaky until they can produce deterministic low-res camera outputs that pass golden image testing on every CI run. Fixes # (issue)  ## Type of change - Test change ## Screenshots Please attach before and after screenshots of the change if applicable.  ## Checklist - [x] I have read and understood the [contribution guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html) - [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format` - [x] I have made corresponding changes to the documentation - [x] My changes generate no new warnings - [x] I have added tests that prove my fix is effective or that my feature works - [x] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file - [x] I have added my name to the `CONTRIBUTORS.md` or my name already exists there  --------- Signed-off-by: HuiDong Chen <huidongc@nvidia.com> Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

@pbarejko

…ic_write (isaac-sim#5380) ## Summary Replace the `sync_usd_on_fabric_write` workaround in `FabricFrameView` with proper `PrepareForReuse()` calls on the Fabric `PrimSelection`. This tells the renderer (FSD/Storm) that Fabric data has changed, so the next rendered frame reflects updated transforms — eliminating the need to copy Fabric writes back to USD. ## Motivation The existing `sync_usd_on_fabric_write` flag worked by mirroring every Fabric write back to USD, which defeated the performance benefits of Fabric. With `PrepareForReuse()`, the rendering pipeline is properly notified of Fabric data changes without any USD writeback. Additionally, the old code incorrectly fell back to USD for CPU devices — Warp handles CPU Fabric buffers correctly, so the fallback was unnecessary. This addresses two of the issues raised in @pbarejko Piotr's review of PR isaac-sim#4923: - **Issue #1** (USD write-back): Fabric writes no longer sync back to USD - **Issue isaac-sim#4** (PrepareForReuse): Renderer notification via `PrepareForReuse()` instead of USD writeback ## Changes ### Core (FabricFrameView) - Call `_prepare_for_reuse()` in write paths (`set_world_poses`, `set_scales`) to notify the renderer - Remove `sync_usd_on_fabric_write` parameter (accepted via `**kwargs` for backward compat) - Remove incorrect CPU/device fallback warnings — Warp handles CPU Fabric buffers correctly - Add `_rebuild_fabric_arrays()` for topology change recovery when `PrepareForReuse()` returns True, with assertion guarding the prim-count invariant ### Camera - Remove `sync_usd_on_fabric_write=True` from FrameView construction in `camera.py` ## Benchmark Results 1024 prims, 50 iterations, NVIDIA L40 GPU: | Operation | USD (ms) | Fabric (ms) | Speedup | |---|---|---|---| | Get World Poses | 14.71 | 0.07 | **203x** | | Set World Poses | 40.75 | 0.16 | **259x** | | Interleaved Set→Get | 55.90 | 0.24 | **232x** | | Get Local Poses | 11.08 | 11.12 | 1.0x | | Set Local Poses | 16.14 | 16.28 | 1.0x | Local poses fall back to USD (expected — Fabric only accelerates world poses via `omni:fabric:worldMatrix`). ## Tests Added | Test | What it validates | |------|------------------| | `test_camera_pose_update_reflected_in_render` | Camera pose changes propagate to rendered depth (close vs far) for CPU/GPU, tiled/non-tiled | | `test_fabric_set_world_does_not_write_back_to_usd` | Fabric writes stay in Fabric, USD prim unchanged | | `test_set_world_updates_local` (xfail) | Documents Issue isaac-sim#5: `set_world_poses` doesn't update local pose in Fabric mode | ## Test Results | Test Suite | Passed | Skipped | Xfailed | Total | |---|---|---|---|---| | Fabric contract tests (`test_views_xform_prim_fabric.py`) | 17 | 16 | 1 | 34 | | USD contract tests (`test_views_xform_prim.py`) | 45 | 0 | 0 | 45 | | Camera render test (`test_tiled_camera.py`) | 8 | 0 | 0 | 8 | ## Type of change - Performance improvement (removes redundant USD writeback on Fabric operations) ## Checklist - [x] I have read and understood the [contribution guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html) - [x] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format` - [ ] I have made corresponding changes to the documentation - [x] My changes generate no new warnings - [x] I have added tests that prove my fix is effective or that my feature works - [x] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file - [x] I have added my name to the `CONTRIBUTORS.md` or my name already exists there *No doc changes needed (parameter wasn't referenced in any docs)*

…tation (isaac-sim#5506) # Description This PR changes the PyTorch3d installation command in the locomanipulation SDG policy training / rollout to use git and install pytorch3d from source. Fixes # (issue) NV bug 6115836 ## Type of change  - Bug fix (non-breaking change which fixes an issue) ## Screenshots Please attach before and after screenshots of the change if applicable.  ## Checklist - [ ] I have read and understood the [contribution guidelines](https://isaac-sim.github.io/IsaacLab/main/source/refs/contributing.html) - [ ] I have run the [`pre-commit` checks](https://pre-commit.com/) with `./isaaclab.sh --format` - [ ] I have made corresponding changes to the documentation - [ ] My changes generate no new warnings - [ ] I have added tests that prove my fix is effective or that my feature works - [ ] I have updated the changelog and the corresponding version in the extension's `config/extension.toml` file - [ ] I have added my name to the `CONTRIBUTORS.md` or my name already exists there

Replaces the earlier Python-based parent-loop implementation (which was correct but ~3× slower than USD for local poses) with a fully GPU-side Fabric path that follows the bareya/pbarejko/camera-update prototype: * Three persistent ``PrimSelection`` instances differing only in per-attribute access mode — one each for {trans_ro, world_rw, local_rw}. * Path-based view → fabric index mapping computed once from ``selection.GetPaths()`` and stored as a Warp ``int32`` array. No custom prim attributes are written to the stage. * All transform reads and writes go through ``wp.indexedfabricarray``, so the kernels just dereference ``ifa[view_index]`` instead of taking a separate mapping argument. * Stage-level ``IFabricHierarchy`` cache and dirty-stages set so multiple ``FabricFrameView`` instances on the same stage share state. World ↔ local consistency is preserved through Warp kernels that run on the affected write paths: * ``set_local_poses`` writes ``omni:fabric:localMatrix`` directly via the compose kernel, then a second kernel recomputes child worldMatrix from ``parent_world * child_local`` so the next ``get_world_poses`` read is consistent. ``IFabricHierarchy.update_world_xforms()`` is *not* used for this — in practice it re-reads USD's authored xformOps and would overwrite the matrices we just authored. * ``set_world_poses`` mirrors the above, recomputing ``child_local = inv(parent_world) * child_world`` after the write. Two new public Warp kernels in ``isaaclab.utils.warp.fabric``: * ``decompose_indexed_fabric_transforms`` / ``compose_indexed_fabric_transforms`` — indexed-array variants of the existing decompose/compose kernels. * ``update_indexed_local_matrix_from_world`` / ``update_indexed_world_matrix_from_local`` — propagate one direction using a parent indexed fabric array. Implemented directly in storage convention (``local = world * inv(parent)``, ``world = local * parent``) — the four transposes the math-convention form would imply cancel out under ``(A·B)^T = B^T·A^T``. Benchmark (1024 prims, 50 iterations, RTX A6000): Operation USD (ms) Fabric (ms) Speedup Get World Poses 12.33 0.044 282× Set World Poses 27.98 0.117 240× Interleaved Set→Get 41.34 0.160 258× Get Local Poses 6.04 0.037 162× Set Local Poses 8.54 0.053 162× Local-pose ops went from ~3× slower than USD (in the earlier torch-based parent-loop implementation) to ~160× faster, with the new Fabric-side localMatrix authoring keeping ``test_set_world_updates_local`` passing without an xfail override. Tests: 41 passed in the Fabric backend's contract + new coverage for the Fabric-native ``set_local_poses``, ``get_scales``, and topology rebuild paths. Drops the no-longer-needed ``cd571d482`` Python-loop attempt.

Three review fixes on the indexedfabricarray refactor: * set_scales wrote the new scale into worldMatrix but never refreshed localMatrix, so a subsequent get_local_poses returned the stale scale. Call _sync_local_from_world after the world write, matching set_world_poses. * The view->fabric mapping was stored in a single shared _fabric_indices field that each accessor overwrote from its own selection's GetPaths() ordering. Selections do not guarantee a shared path ordering, so this was brittle and hard to reason about. Cache the mapping per selection (_trans_ro_fabric_indices, _world_rw_fabric_indices, _local_rw_fabric_indices) and pass it explicitly to _build_indexed_array. The three trans_ro accessors now share a _rebuild_trans_ro_arrays helper. * Update two test comments that referenced the removed _fabric_usd_sync_done attribute to point at the lazy _initialize_fabric() call instead.

The chained set_world_poses -> get_world_poses round-trip in test_fabric_rebuild_after_topology_change goes through Warp's float32 SRT compose/decompose, which accumulates a few ULP of drift. At the test's position magnitudes (~4-6), one float32 ULP is ~4.77e-7, so the prior atol of 1e-7 demanded sub-ULP agreement and was sensitive to GPU/codegen variation -- it passed locally on the A6000 but flaked in CI. 1e-5 corresponds to roughly 20 ULP at those magnitudes: tight enough to catch any real bug (a wrong index or stale read would be at least ~1e-3 off given the test setup) and consistent with the shared contract harness in frame_view_contract_utils.py, which already documents and uses ATOL = 1e-5 for compose/decompose-through-float32 checks.

github-actions Bot added the isaac-lab label May 5, 2026

pv-nvidia changed the title ~~Pv/5381 rebased~~ Implement Fabric-aware get/set_local_poses via indexedfabricarray May 5, 2026

matthewtrepte and others added 7 commits May 5, 2026 13:13

pv-nvidia force-pushed the pv/5381-rebased branch from 7d135d0 to 71a38d2 Compare May 7, 2026 11:37

github-actions Bot added documentation Improvements or additions to documentation infrastructure labels May 7, 2026

pv-nvidia added 4 commits May 7, 2026 11:47

Some cleanup

4660703

Fixed tests

1546479

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Fabric-aware get/set_local_poses via indexedfabricarray#2

Implement Fabric-aware get/set_local_poses via indexedfabricarray#2
pv-nvidia wants to merge 11 commits intofix/fabric-prepare-for-reusefrom
pv/5381-rebased

pv-nvidia commented May 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

pv-nvidia commented May 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Implementation summary

Benchmark results

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

pv-nvidia commented May 5, 2026 •

edited

Loading