
[core] Improve, test and document the DSL#35

Merged
kpouget merged 10 commits into openshift-psap:main from kpouget:008-toolbox-dsl
Apr 21, 2026

Conversation

@kpouget
Contributor

@kpouget kpouget commented Apr 20, 2026

Summary by CodeRabbit

  • New Features

    • Introduced Toolbox DSL for pipeline-style commands with task decorators, conditional execution, retries, and guaranteed cleanup.
  • Documentation

    • Added comprehensive authoring guides, specs, quickstart, data model, plan, and checklists.
  • Tests

    • Added end-to-end unit tests and an isolation fixture validating DSL semantics and behavior.
  • Chores

    • Added CI workflow and narrowed test discovery to the core tests directory.
  • Refactor

    • Standardized logging to module-scoped loggers and removed global logging side effects.

@openshift-ci

openshift-ci Bot commented Apr 20, 2026

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
Once this PR has been reviewed and has the lgtm label, please assign avasilevskii for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@coderabbitai

coderabbitai Bot commented Apr 20, 2026

Warning

Rate limit exceeded

@kpouget has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 0 minutes and 47 seconds before requesting another review.

Your organization is not enrolled in usage-based pricing. Contact your admin to enable usage-based pricing to continue reviews beyond the rate limit, or try again in 0 minutes and 47 seconds.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: fdb43027-f5de-4d6b-aa54-bdaeadff3074

📥 Commits

Reviewing files that changed from the base of the PR and between f50b06f and 5353d91.

📒 Files selected for processing (31)
  • .github/workflows/test_toolbox_dsl.yml
  • docs/toolbox/dsl.md
  • projects/cluster/toolbox/rebuild_image/main.py
  • projects/core/ci_entrypoint/github/pr_args.py
  • projects/core/ci_entrypoint/prepare_ci.py
  • projects/core/ci_entrypoint/run_ci.py
  • projects/core/dsl/__init__.py
  • projects/core/dsl/log.py
  • projects/core/dsl/runtime.py
  • projects/core/dsl/shell.py
  • projects/core/dsl/task.py
  • projects/core/launcher/README.md
  • projects/core/library/config.py
  • projects/core/library/run.py
  • projects/core/notifications/github/api.py
  • projects/core/notifications/send.py
  • projects/core/notifications/slack/api.py
  • projects/core/tests/conftest.py
  • projects/core/tests/test_dsl_toolbox.py
  • projects/fournos_launcher/orchestration/cli.py
  • projects/fournos_launcher/orchestration/submit.py
  • projects/skeleton/orchestration/prepare_skeleton.py
  • pyproject.toml
  • specs/008-toolbox-dsl/checklists/requirements.md
  • specs/008-toolbox-dsl/contracts/toolbox_dsl_authoring.md
  • specs/008-toolbox-dsl/data-model.md
  • specs/008-toolbox-dsl/plan.md
  • specs/008-toolbox-dsl/quickstart.md
  • specs/008-toolbox-dsl/research.md
  • specs/008-toolbox-dsl/spec.md
  • specs/008-toolbox-dsl/tasks.md
📝 Walkthrough
🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

  • Docstring Coverage (⚠️ Warning): docstring coverage is 41.98%, below the required 80.00% threshold. Resolution: write docstrings for the functions missing them.

✅ Passed checks (4 passed)

  • Description Check (✅ Passed): check skipped because CodeRabbit's high-level summary is enabled.
  • Title check (✅ Passed): the title '[core] Improve, test and document the DSL' directly summarizes the main changes: enhancements to the DSL implementation, comprehensive test coverage, and documentation additions across multiple files.
  • Linked Issues check (✅ Passed): check skipped because no linked issues were found for this pull request.
  • Out of Scope Changes check (✅ Passed): check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

Comment @coderabbitai help to get the list of available commands and usage tips.


@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 4

🧹 Nitpick comments (9)
projects/core/dsl/runtime.py (2)

144-158: Comment scope — this block also runs on the success path.

After the success path, task_index == len(file_tasks) so pending == [] and the whole block becomes a no-op — correct behavior, but the leading comment “After a failure, skip pending non-@Always tasks; still run pending @always tasks” suggests it’s failure-only. A small tweak would make the intent obvious to readers tracking control flow:

📝 Doc-only tweak
-            # After a failure, skip pending non-@always tasks; still run pending `@always` tasks
+            # On failure: skip pending non-@always tasks, still run pending `@always` tasks.
+            # On success: task_index == len(file_tasks), so `pending` is empty and this is a no-op.
             pending = file_tasks[task_index + 1 :] if task_index < len(file_tasks) else []
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/dsl/runtime.py` around lines 144 - 158, The leading comment is
misleading because this block also executes on the success path (when task_index
== len(file_tasks) it becomes a no-op); update the comment above the
pending/always_pending logic to reflect the true intent — e.g., say that for any
remaining tasks after the current index we skip non-@always tasks and still run
`@always` tasks (and note it will be a no-op when there are no pending tasks) — so
change the comment near variables task_index, file_tasks, pending,
always_pending and the check on current_task_info.get("always_execute") to make
the behavior explicit.

135-142: Redundant exception classes in tuple.

A bare Exception already catches TaskExecutionError, ConditionError, and RetryFailure, so the explicit list is purely informational and is dead code from the interpreter's point of view. Either drop the specific classes, or drop the bare Exception if the intent was to document the expected set (and add a separate fallback handler).

♻️ Minimal cleanup
-            except (TaskExecutionError, ConditionError, RetryFailure, Exception) as e:
+            except Exception as e:
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/dsl/runtime.py` around lines 135 - 142, The except clause
currently lists TaskExecutionError, ConditionError, RetryFailure and Exception
together which is redundant; split into two handlers: one that explicitly
catches (TaskExecutionError, ConditionError, RetryFailure) as e and performs the
existing logging, log_completion_banner call and sets execution_error, and a
separate fallback except Exception as e that does the same fallback handling so
the specific expected exceptions remain documented while still preserving a
generic catch-all; refer to the existing symbols logger.fatal,
log_completion_banner and execution_error when making the changes.
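The two-handler split the prompt describes can be sketched in isolation like this; the exception classes and `run_task` helper below are stand-ins mirroring the DSL's names, not imports from the real module:

```python
# Stand-in exception classes mirroring the DSL's names (assumed, for
# illustration only; the real ones live in projects/core/dsl).
class TaskExecutionError(Exception): pass
class ConditionError(Exception): pass
class RetryFailure(Exception): pass

def run_task(task):
    """Run one task, distinguishing documented DSL failures from
    anything unexpected."""
    try:
        return ("ok", task())
    except (TaskExecutionError, ConditionError, RetryFailure) as e:
        # Expected DSL failures: the explicit tuple documents them
        # without being subsumed by a bare Exception in the same clause.
        return ("expected", e)
    except Exception as e:
        # Separate generic fallback for anything else.
        return ("unexpected", e)
```

This keeps the expected set visible to readers while the second handler preserves the catch-all behavior of the original single clause.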
projects/core/tests/conftest.py (1)

9-24: autouse=True binds every test to DSL setup — narrow when the suite grows.

This fixture currently changes cwd to FORGE_HOME, sets ARTIFACT_DIR, and calls env.init() for every test collected under projects/core/tests. That is fine today because the suite is DSL-focused, but once non-DSL tests are added they will pay this cost and be forced to run with cwd pinned to FORGE_HOME. Consider gating via a marker (e.g. @pytest.mark.dsl) or splitting into a non-autouse fixture you opt into, once the suite diversifies.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/tests/conftest.py` around lines 9 - 24, The _dsl_isolation
fixture is currently autouse and forces all tests to run with cwd set to
FORGE_HOME and ARTIFACT_DIR set, which is undesirable for non-DSL tests; change
it from autouse=True to a non-autouse fixture (remove autouse) and provide an
alternate marker-based approach so only tests that need DSL setup opt in (e.g.,
add a pytest config to register a "dsl" marker and create a helper fixture or
require tests to use `@pytest.mark.dsl` to invoke _dsl_isolation), while keeping
the existing behavior inside _dsl_isolation (monkeypatch.setenv("ARTIFACT_DIR",
...), monkeypatch.chdir(env.FORGE_HOME), env.init(), reset_script_manager()) and
ensuring teardown still calls reset_script_manager().
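One way to sketch the marker-gated alternative: keep the fixture autouse but make it a no-op unless the test opts in. The `dsl` marker name, `needs_dsl_setup` helper, and the use of `tmp_path` as the artifact directory are all illustrative assumptions, not the repo's actual code:

```python
import pytest

def needs_dsl_setup(keywords) -> bool:
    """Opt-in check: only tests carrying the (hypothetical) `dsl`
    marker get the heavyweight DSL environment."""
    return "dsl" in keywords

@pytest.fixture(autouse=True)
def _dsl_isolation(request, monkeypatch, tmp_path):
    # Non-DSL tests skip the setup entirely instead of paying for it.
    if not needs_dsl_setup(request.keywords):
        yield
        return
    # Stand-in for the real setup (ARTIFACT_DIR, chdir to FORGE_HOME,
    # env.init(), reset_script_manager()), using tmp_path for isolation.
    monkeypatch.setenv("ARTIFACT_DIR", str(tmp_path))
    yield
```

Registering the marker (e.g. a `markers` entry under `[tool.pytest.ini_options]` in pyproject.toml) would keep `--strict-markers` runs clean.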
.github/workflows/test_toolbox_dsl.yml (1)

11-13: Concurrency group key is fragile for non-PR events.

github.event.pull_request.number is null for push and workflow_dispatch, so the group collapses to ${{ github.workflow }}-${{ github.ref }} — usually fine, but if both a push-on-main and a manually dispatched run on main coexist, the latter will cancel the former. Not a blocker; consider including github.run_id as a fallback for workflow_dispatch.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.github/workflows/test_toolbox_dsl.yml around lines 11 - 13, Update the
concurrency group key so it falls back to github.run_id when
github.event.pull_request.number is null (e.g., for workflow_dispatch or push)
instead of collapsing to just github.ref; change the group expression at the
concurrency block to prefer github.event.pull_request.number, then
github.run_id, then github.ref so manual dispatches don’t cancel unrelated runs
while keeping cancel-in-progress: true unchanged.
projects/core/dsl/task.py (3)

51-73: Function parameters silently overridden by func._retry_config.

_execute_with_retry accepts attempts, delay, backoff, retry_on_exceptions as explicit parameters, but Lines 69-73 then overwrite all of them from getattr(func, "_retry_config", {}). Since retry always populates every key of _retry_config (Lines 329-334), the .get(..., <param>) fallback is effectively never used — the passed-in arguments are dead. This is confusing, and if runtime.py ever passes values that differ from what's attached to func (e.g., future call sites, dynamic overrides), those values will be silently ignored.

Pick one source of truth: either drop the per-call parameters and have _execute_with_retry read exclusively from func._retry_config, or drop the getattr lookup and trust the parameters. The current hybrid hides bugs.

♻️ Suggested simplification (trust parameters passed by runtime)
-def _execute_with_retry(func, attempts, delay, backoff, retry_on_exceptions, *args, **kwargs):
+def _execute_with_retry(func, attempts, delay, backoff, retry_on_exceptions, *args, **kwargs):
     ...
-    retry_config = getattr(func, "_retry_config", {})
-    retry_attempts = retry_config.get("attempts", attempts)
-    retry_delay = retry_config.get("delay", delay)
-    retry_backoff = retry_config.get("backoff", backoff)
-    retry_on_exc = retry_config.get("retry_on_exceptions", retry_on_exceptions)
-
-    current_delay = retry_delay
+    current_delay = delay
+    retry_attempts = attempts
+    retry_backoff = backoff
+    retry_on_exc = retry_on_exceptions
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/dsl/task.py` around lines 51 - 73, The function
_execute_with_retry currently overwrites the explicit parameters (attempts,
delay, backoff, retry_on_exceptions) with values from func._retry_config, making
caller-passed values ignored; fix this by removing the getattr(func,
"_retry_config", {}) lookup and the subsequent assignment to
retry_attempts/retry_delay/retry_backoff/retry_on_exc so the function uses the
parameters passed into _execute_with_retry directly; also remove or adjust any
code that relies on reading _retry_config inside _execute_with_retry (leave the
decorator/ retry that sets func._retry_config untouched) and ensure callers
(e.g., runtime) pass the desired retry values into _execute_with_retry instead
of relying on func._retry_config.

143-201: task_only simple-decorator path now requires the exact parameter name func.

The new gate params[0].name == "func" cleanly disambiguates @when(lambda: ...) from a bare @decorator application, which is a real fix. Worth noting for future maintainers: any future simple decorator wrapped with @task_only that doesn't name its sole parameter func will silently fall through to the factory path and break at apply-time in a non-obvious way. The docstring hints at this, but a brief inline assertion (or an explicit error on “looks like a simple decorator but param isn't named func”) would make the contract harder to violate accidentally.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/dsl/task.py` around lines 143 - 201, The task_only wrapper
currently treats a single-parameter decorator as a simple decorator only if that
parameter is literally named "func", which can silently misroute valid simple
decorators that use a different parameter name; update task_only to detect the
one-parameter case and raise a clear error if params[0].name != "func"
(including decorator_func.__name__ in the message) so maintainers get an
explicit failure rather than an obscure factory-path break; reference the
task_only wrapper and _ensure_is_task when adding this assertion/error.

107-140: Minor: use bare raise and note the unreachable fallback.

Two small nits in the new exception-retry path:

  1. Line 114: raise exc re-raises but anchors the traceback to this line; a bare raise inside the except block preserves the original traceback more cleanly.
  2. Lines 137-140: this trailing raise RetryFailure is unreachable whenever retry_attempts >= 1 — every branch of the loop body either returns, raises on falsy exhaustion (Lines 100-102), or raises on exception exhaustion (Lines 119-122). It is only reached when retry_attempts == 0, which @retry(attempts=0) currently accepts silently. Consider either validating attempts >= 1 at decoration time or keeping the fallback with a comment that it guards the zero-attempts corner case.
♻️ Proposed small cleanup
         except Exception as exc:
             if not retry_on_exc:
                 logger.error(f"==> TASK EXCEPTION: {func.__name__} failed with exception")
                 logger.info("")
-                raise exc
+                raise
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/dsl/task.py` around lines 107 - 140, Replace the anchored
re-raise "raise exc" inside the except block with a bare "raise" to preserve the
original traceback (affecting the except that checks retry_on_exc and references
func.__name__). Also prevent the unreachable trailing fallback by validating the
retry count early: in the retry decorator/wrapper (the code path that sets
retry_attempts / accepts `@retry`(attempts=...)), assert or raise a ValueError if
retry_attempts < 1, or alternatively keep the final raise RetryFailure as an
explicit guard and add a comment noting it only runs when retry_attempts == 0;
reference the symbols retry_attempts, retry_on_exc, RetryFailure, and the
wrapper handling func to locate where to apply these changes.
projects/core/tests/test_dsl_toolbox.py (2)

15-18: Exercise the public RetryFailure export.

The PR exports RetryFailure from projects.core.dsl, but this test imports it from the internal projects.core.dsl.task module, so a broken public export would not be caught.

♻️ Proposed import update
-from projects.core.dsl import always, execute_tasks, retry, task, when
+from projects.core.dsl import RetryFailure, always, execute_tasks, retry, task, when
 from projects.core.dsl.runtime import TaskExecutionError
 from projects.core.dsl.script_manager import reset_script_manager
-from projects.core.dsl.task import RetryFailure
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/tests/test_dsl_toolbox.py` around lines 15 - 18, The test
imports RetryFailure from the internal module projects.core.dsl.task instead of
exercising the public export; update the import in test_dsl_toolbox.py to import
RetryFailure from the public package (projects.core.dsl) alongside always,
execute_tasks, retry, task, when so the test will fail if the public export is
missing or broken.

186-198: Assert the original exception remains chained under RetryFailure.

_execute_with_retry raises RetryFailure from the last exception on exhaustion, but this test only checks the outer cause. Add the nested cause assertion so that regression is caught.

🧪 Proposed assertion strengthening
     with pytest.raises(TaskExecutionError) as ei:
         execute_tasks(locals())
-    assert isinstance(ei.value.__cause__, RetryFailure)
+    retry_failure = ei.value.__cause__
+    assert isinstance(retry_failure, RetryFailure)
+    assert isinstance(retry_failure.__cause__, ValueError)
+    assert str(retry_failure.__cause__) == "nope"
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/tests/test_dsl_toolbox.py` around lines 186 - 198, The test
test_retry_on_exceptions_exhausted needs to assert that the original exception
is preserved under the RetryFailure chain: after execute_tasks raises
TaskExecutionError and you assert its __cause__ is a RetryFailure, also assert
that ei.value.__cause__.__cause__ (the nested cause from _execute_with_retry /
RetryFailure) is the original ValueError (or isinstance(..., ValueError)) so the
test fails if the original exception is not chained correctly.
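The chaining being asserted comes from `raise ... from`; a minimal standalone illustration (the RetryFailure class and `exhaust` function here are stand-ins, not the DSL imports):

```python
class RetryFailure(Exception): pass

def exhaust():
    """Mimic retry exhaustion: wrap the last failure in RetryFailure
    while keeping it reachable via __cause__."""
    try:
        raise ValueError("nope")
    except ValueError as exc:
        # `raise ... from exc` sets __cause__, so callers (and test
        # assertions) can still inspect the original exception.
        raise RetryFailure("retries exhausted") from exc
```

Asserting on `__cause__.__cause__` in the test pins down exactly this chain, so a regression that drops the `from exc` clause is caught.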
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@projects/core/dsl/runtime.py`:
- Around line 144-168: The warning messages are being emitted to the root logger
instead of the DSL file logger, so replace calls to logging.warning(...) with
the module-level logger.warning(...) (the same logger used elsewhere from .log)
so the messages are captured in the per-run task.log; specifically update the
"Executing the `@always` tasks …" warning in the pending-task handling block
(where _execute_single_task and execution_error are used) and the "Path
{co_filename} isn't relative ..." warning in the error-wrapping path to use
logger.warning; no other behavior changes are needed since
_setup_execution_logging attaches the file handler to the DSL logger.
- Around line 162-168: The current handler in the always-task loop stores only
the first always_exc into execution_error and discards further exceptions;
change this to accumulate all always-task exceptions into an ExceptionGroup
(e.g., collect them into a list and, when raising, wrap them with
ExceptionGroup) instead of overwriting execution_error, update the logging to
still log each failure (logger.error) as now, and when preparing to re-raise at
the end of the task-run logic (where execution_error is currently raised), raise
an ExceptionGroup that includes any previously stored execution_error plus all
collected always-task exceptions so callers receive all cleanup failures; look
for the variables execution_error, always_exc and the block that currently
raises execution_error and replace that raise with raising an ExceptionGroup
built from the collected exceptions.

In `@projects/foreign_testing/orchestration/cli.py`:
- Around line 10-12: Remove the unused and duplicate imports: delete the unused
"sys" import and remove the duplicate "types" and "pathlib" import lines so only
the original imports remain; specifically, ensure there is a single import
statement for "types" and a single for "pathlib" (remove the extra occurrences
you added on lines importing sys, types, pathlib) so the module no longer
imports sys and does not duplicate types/pathlib.

In `@pyproject.toml`:
- Line 99: The pyproject.toml change narrows pytest discovery by setting
testpaths = ["projects/core/tests"], which will omit tests under
projects/llm_d/orchestration and projects/skeleton/orchestration; update the
testpaths value (e.g., include "projects/llm_d/orchestration" and
"projects/skeleton/orchestration" or broader roots like "projects") or add a
comment/documentation entry near the testpaths key explaining why those
subprojects are intentionally excluded so test discovery behavior is explicit
and reproducible.

---

Nitpick comments:
In @.github/workflows/test_toolbox_dsl.yml:
- Around line 11-13: Update the concurrency group key so it falls back to
github.run_id when github.event.pull_request.number is null (e.g., for
workflow_dispatch or push) instead of collapsing to just github.ref; change the
group expression at the concurrency block to prefer
github.event.pull_request.number, then github.run_id, then github.ref so manual
dispatches don’t cancel unrelated runs while keeping cancel-in-progress: true
unchanged.

In `@projects/core/dsl/runtime.py`:
- Around line 144-158: The leading comment is misleading because this block also
executes on the success path (when task_index == len(file_tasks) it becomes a
no-op); update the comment above the pending/always_pending logic to reflect the
true intent — e.g., say that for any remaining tasks after the current index we
skip non-@always tasks and still run `@always` tasks (and note it will be a no-op
when there are no pending tasks) — so change the comment near variables
task_index, file_tasks, pending, always_pending and the check on
current_task_info.get("always_execute") to make the behavior explicit.
- Around line 135-142: The except clause currently lists TaskExecutionError,
ConditionError, RetryFailure and Exception together which is redundant; split
into two handlers: one that explicitly catches (TaskExecutionError,
ConditionError, RetryFailure) as e and performs the existing logging,
log_completion_banner call and sets execution_error, and a separate fallback
except Exception as e that does the same fallback handling so the specific
expected exceptions remain documented while still preserving a generic
catch-all; refer to the existing symbols logger.fatal, log_completion_banner and
execution_error when making the changes.

In `@projects/core/dsl/task.py`:
- Around line 51-73: The function _execute_with_retry currently overwrites the
explicit parameters (attempts, delay, backoff, retry_on_exceptions) with values
from func._retry_config, making caller-passed values ignored; fix this by
removing the getattr(func, "_retry_config", {}) lookup and the subsequent
assignment to retry_attempts/retry_delay/retry_backoff/retry_on_exc so the
function uses the parameters passed into _execute_with_retry directly; also
remove or adjust any code that relies on reading _retry_config inside
_execute_with_retry (leave the decorator/ retry that sets func._retry_config
untouched) and ensure callers (e.g., runtime) pass the desired retry values into
_execute_with_retry instead of relying on func._retry_config.
- Around line 143-201: The task_only wrapper currently treats a single-parameter
decorator as a simple decorator only if that parameter is literally named
"func", which can silently misroute valid simple decorators that use a different
parameter name; update task_only to detect the one-parameter case and raise a
clear error if params[0].name != "func" (including decorator_func.__name__ in
the message) so maintainers get an explicit failure rather than an obscure
factory-path break; reference the task_only wrapper and _ensure_is_task when
adding this assertion/error.
- Around line 107-140: Replace the anchored re-raise "raise exc" inside the
except block with a bare "raise" to preserve the original traceback (affecting
the except that checks retry_on_exc and references func.__name__). Also prevent
the unreachable trailing fallback by validating the retry count early: in the
retry decorator/wrapper (the code path that sets retry_attempts / accepts
`@retry`(attempts=...)), assert or raise a ValueError if retry_attempts < 1, or
alternatively keep the final raise RetryFailure as an explicit guard and add a
comment noting it only runs when retry_attempts == 0; reference the symbols
retry_attempts, retry_on_exc, RetryFailure, and the wrapper handling func to
locate where to apply these changes.

In `@projects/core/tests/conftest.py`:
- Around line 9-24: The _dsl_isolation fixture is currently autouse and forces
all tests to run with cwd set to FORGE_HOME and ARTIFACT_DIR set, which is
undesirable for non-DSL tests; change it from autouse=True to a non-autouse
fixture (remove autouse) and provide an alternate marker-based approach so only
tests that need DSL setup opt in (e.g., add a pytest config to register a "dsl"
marker and create a helper fixture or require tests to use `@pytest.mark.dsl` to
invoke _dsl_isolation), while keeping the existing behavior inside
_dsl_isolation (monkeypatch.setenv("ARTIFACT_DIR", ...),
monkeypatch.chdir(env.FORGE_HOME), env.init(), reset_script_manager()) and
ensuring teardown still calls reset_script_manager().

In `@projects/core/tests/test_dsl_toolbox.py`:
- Around line 15-18: The test imports RetryFailure from the internal module
projects.core.dsl.task instead of exercising the public export; update the
import in test_dsl_toolbox.py to import RetryFailure from the public package
(projects.core.dsl) alongside always, execute_tasks, retry, task, when so the
test will fail if the public export is missing or broken.
- Around line 186-198: The test test_retry_on_exceptions_exhausted needs to
assert that the original exception is preserved under the RetryFailure chain:
after execute_tasks raises TaskExecutionError and you assert its __cause__ is a
RetryFailure, also assert that ei.value.__cause__.__cause__ (the nested cause
from _execute_with_retry / RetryFailure) is the original ValueError (or
isinstance(..., ValueError)) so the test fails if the original exception is not
chained correctly.

ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: fb48fffb-be5e-4e73-88de-f1c218bd6fcd

📥 Commits

Reviewing files that changed from the base of the PR and between fb6fb08 and 0278cd8.

📒 Files selected for processing (18)
  • .github/workflows/test_toolbox_dsl.yml
  • docs/toolbox/dsl.md
  • projects/core/dsl/__init__.py
  • projects/core/dsl/runtime.py
  • projects/core/dsl/task.py
  • projects/core/launcher/README.md
  • projects/core/tests/conftest.py
  • projects/core/tests/test_dsl_toolbox.py
  • projects/foreign_testing/orchestration/cli.py
  • pyproject.toml
  • specs/008-toolbox-dsl/checklists/requirements.md
  • specs/008-toolbox-dsl/contracts/toolbox_dsl_authoring.md
  • specs/008-toolbox-dsl/data-model.md
  • specs/008-toolbox-dsl/plan.md
  • specs/008-toolbox-dsl/quickstart.md
  • specs/008-toolbox-dsl/research.md
  • specs/008-toolbox-dsl/spec.md
  • specs/008-toolbox-dsl/tasks.md

Comment thread projects/core/dsl/runtime.py
Comment thread projects/core/dsl/runtime.py
Comment thread projects/foreign_testing/orchestration/cli.py Outdated
Comment thread pyproject.toml

@coderabbitai coderabbitai Bot left a comment


Actionable comments posted: 5

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
projects/core/dsl/task.py (1)

174-188: ⚠️ Potential issue | 🟠 Major

The simple-decorator branch is not dead code; always actively uses it, but the brittleness risk remains.

The new params[0].name == "func" heuristic correctly disambiguates when(lambda: ...) from a simple @decorator(func) application on a lambda. However, the original claim that "@task_only is currently applied only to factories when and retry; always doesn't use it" is incorrect—always (line 155) is decorated with @task_only and is a simple decorator with func as its parameter, so it successfully uses the simple-decorator path.

That said, the brittleness concern is real: any simple decorator (or future variant of always) whose wrapped-callable parameter is not literally named func (e.g. fn, function) would silently fall through to the factory branch and break at runtime. The naming invariant should either be enforced with a clear error message or documented explicitly.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/dsl/task.py` around lines 174 - 188, The simple-decorator
branch currently only accepts a single-parameter decorator when that parameter
is literally named "func", which is brittle (used by always decorated with
`@task_only`); change behavior to detect a simple decorator attempt more robustly
and raise a clear error if the parameter name is not "func". Concretely, in the
branch that checks len(params) == 1 and callable(args[0]) (refer to
params[0].name and decorator_func.__name__), if the parameter name is not "func"
fail fast with a TypeError that references decorator_func.__name__ and instructs
that simple decorators must name their callable parameter "func" (or call the
decorator as a factory), rather than silently falling through; keep the
_ensure_is_task(func, decorator_func.__name__) and return decorator_func(func)
behavior when the name is "func".
🧹 Nitpick comments (2)
projects/core/ci_entrypoint/run_ci.py (1)

480-482: Use the module-scoped logger for consistency.

logging.exception(...) goes through the root logger, unlike the rest of this file which uses the module logger. Minor nit:

-        logging.exception("Unexpected exception")
+        logger.exception("Unexpected exception")
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/ci_entrypoint/run_ci.py` around lines 480 - 482, The except
block currently calls logging.exception("Unexpected exception") which logs to
the root logger; replace that call with the module-scoped logger
(logger.exception("Unexpected exception")) so the message goes through the same
logger used elsewhere in this file (ensure the module-level variable logger is
imported/defined and used in the except Exception handler).
projects/core/dsl/task.py (1)

109-139: Minor polish in the exception branch.

Two small things worth considering in _execute_with_retry:

  1. Line 113 uses raise exc; prefer the bare raise inside the except block so the traceback is preserved verbatim without re-binding.
  2. The trailing raise RetryFailure(...) at lines 136-139 is only reachable when retry_attempts <= 0 (the for loop body never runs). In that misconfiguration the message "(exceptions retried until exhausted)" is misleading since nothing was actually retried. Consider either validating attempts >= 1 at decorator time or tweaking the message for that case.
Proposed tweak
-        except Exception as exc:
-            if not retry_on_exc:
-                logger.error(f"==> TASK EXCEPTION: {func.__name__} failed with exception")
-                logger.info("")
-                raise exc
+        except Exception as exc:
+            if not retry_on_exc:
+                logger.error(f"==> TASK EXCEPTION: {func.__name__} failed with exception")
+                logger.info("")
+                raise
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@projects/core/dsl/task.py` around lines 109 - 139, In _execute_with_retry:
replace the re-raised exception "raise exc" with a bare "raise" inside the
except block so the original traceback is preserved (locate the re-raise in the
except where retry_on_exc is false in the _execute_with_retry function). Also
handle the unreachable/misleading final RetryFailure: either validate
retry_attempts >= 1 at decorator/configuration time and raise a clear
configuration error earlier, or change the final RetryFailure message produced
at the end of _execute_with_retry to reflect that no attempts were made (e.g.,
indicate retry_attempts was <= 0) so the message is accurate when the for-loop
never ran.
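Both suggestions can be sketched together. This is illustrative only; `execute_with_retry` here is a simplified stand-in for the real `_execute_with_retry`:

```python
import logging

logger = logging.getLogger(__name__)

class RetryFailure(Exception):
    pass

def execute_with_retry(func, retry_attempts, retry_on_exc=False):
    """Sketch of the two review suggestions: validate the attempt count up
    front, and re-raise with a bare `raise` so the traceback is preserved."""
    if retry_attempts < 1:
        # Fail at configuration time instead of emitting a misleading
        # "retried until exhausted" message when the loop never ran.
        raise ValueError(f"retry_attempts must be >= 1, got {retry_attempts}")
    for attempt in range(1, retry_attempts + 1):
        try:
            return func()
        except Exception:
            if not retry_on_exc:
                logger.error(f"==> TASK EXCEPTION: {func.__name__} failed with exception")
                raise  # bare raise: original traceback preserved verbatim
            logger.warning(f"attempt {attempt}/{retry_attempts} failed, retrying")
    raise RetryFailure(f"{func.__name__}: all {retry_attempts} attempts failed")
```

With the up-front validation, the trailing `RetryFailure` is only reachable after every attempt genuinely ran and failed, so its message stays accurate.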
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@projects/cluster/toolbox/rebuild_image/main.py`:
- Line 23: Replace the module-level logger creation so it uses the DSL-named
logger instead of module name: change the logger = logging.getLogger(__name__)
instance to logging.getLogger("DSL") so CLI-facing info messages emitted by this
module (the logger variable used throughout main.py) are handled by the DSL
infrastructure (the logger configured in execute_tasks()). No other behavior
changes needed—just switch the logger reference to "DSL" so INFO output appears
consistently like in task.py/shell.py/runner.py/script_manager.py.

In `@projects/core/ci_entrypoint/github/pr_args.py`:
- Around line 524-526: The error log call currently double-prefixes the message
("ERROR: ERROR: ..."); update the except block by removing the manual "ERROR: "
prefix so the logger uses the configured level prefix only—replace the call to
logger.error(f"ERROR: {e}") with logger.error(str(e)) (or logger.exception(e) if
a traceback is desired) and keep the existing sys.exit(1).
- Line 465: The module/function docstring was corrupted by a blanket
print→logger.info change—replace the malformed single-line text "Reads
environment variables and logger.infos the parsed configuration to stdout." with
a proper triple-quoted docstring that describes the function's behavior (e.g.,
"Reads environment variables and prints the parsed configuration to stdout." or
similar), ensuring the docstring belongs to the correct function/class in
pr_args.py and is a literal string (not code invoking logger.info); keep the
original semantic wording and formatting so the docstring is not executed as a
logging call.
- Around line 469-473: The --help-directives informational output is being sent
via logger.info and can be dropped when pr_args.py is run standalone because
setup_logging() isn't called; change the user-facing informational emissions in
pr_args.py (the block that calls get_supported_directives() and
get_directive_prefixes() and the other banner/info lines emitted by main()) to
use print(...) to write to stdout instead of logger.info(...), leaving logger
usage intact for internal/parsed flows (e.g., when invoked via
prepare_ci.py/parse_pr_arguments()) so standalone main() prints reliably.
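The print-for-users / logger-for-internals split can be sketched like this. Names such as `show_help_directives` are hypothetical, not the real `pr_args.py` API:

```python
import logging

logger = logging.getLogger(__name__)

def show_help_directives(directives, prefixes):
    """User-facing output for a standalone --help-directives run: print()
    writes to stdout regardless of whether setup_logging() was ever called,
    so the banner cannot be silently dropped."""
    print("Supported directives:")
    for directive in directives:
        print(f"  {directive}")
    print("Directive prefixes:", ", ".join(prefixes))
    # Internal/parsed flows keep using the logger, which may be unconfigured.
    logger.debug("help-directives output emitted")
```

The rule of thumb: output that *is* the program's result goes to `print`/stdout; diagnostics about what the program is doing go through the logger.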

In `@projects/fournos_launcher/orchestration/cli.py`:
- Around line 72-74: The code currently logs raw user-provided data (args_list
and extra_overrides) via logger.info; change these logger calls so they do not
print full values — instead log safe summaries such as the number of args
(len(args_list)) or the set of override keys (e.g., extra_overrides.keys()) and
keep the existing config.project.set_config calls unchanged; update the
logger.info calls referencing args_list and extra_overrides to log only counts
or keys and a brief context message.
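A sketch of the safe-summary logging the comment suggests; the helper name and signature are hypothetical, not from the real `cli.py`:

```python
import logging

logger = logging.getLogger(__name__)

def log_overrides_safely(args_list, extra_overrides):
    """Log a summary of user-provided data instead of the raw values,
    so potentially sensitive override values never reach the logs."""
    logger.info(f"Received {len(args_list)} positional args")
    logger.info(f"Override keys: {sorted(extra_overrides.keys())}")
```

Only counts and key names appear in the log output; the values themselves stay out of it.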

---

Outside diff comments:
In `@projects/core/dsl/task.py`:
- Around line 174-188: The simple-decorator branch currently only accepts a
single-parameter decorator when that parameter is literally named "func", which
is brittle (used by always decorated with `@task_only`); change behavior to detect
a simple decorator attempt more robustly and raise a clear error if the
parameter name is not "func". Concretely, in the branch that checks len(params)
== 1 and callable(args[0]) (refer to params[0].name and
decorator_func.__name__), if the parameter name is not "func" fail fast with a
TypeError that references decorator_func.__name__ and instructs that simple
decorators must name their callable parameter "func" (or call the decorator as a
factory), rather than silently falling through; keep the _ensure_is_task(func,
decorator_func.__name__) and return decorator_func(func) behavior when the name
is "func".

---

Nitpick comments:
In `@projects/core/ci_entrypoint/run_ci.py`:
- Around line 480-482: The except block currently calls
logging.exception("Unexpected exception") which logs to the root logger; replace
that call with the module-scoped logger (logger.exception("Unexpected
exception")) so the message goes through the same logger used elsewhere in this
file (ensure the module-level variable logger is imported/defined and used in
the except Exception handler).

In `@projects/core/dsl/task.py`:
- Around line 109-139: In _execute_with_retry: replace the re-raised exception
"raise exc" with a bare "raise" inside the except block so the original
traceback is preserved (locate the re-raise in the except where retry_on_exc is
false in the _execute_with_retry function). Also handle the
unreachable/misleading final RetryFailure: either validate retry_attempts >= 1
at decorator/configuration time and raise a clear configuration error earlier,
or change the final RetryFailure message produced at the end of
_execute_with_retry to reflect that no attempts were made (e.g., indicate
retry_attempts was <= 0) so the message is accurate when the for-loop never ran.
ℹ️ Review info
⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: 29aaf8c2-b06e-434a-9531-db216d65fcb8

📥 Commits

Reviewing files that changed from the base of the PR and between 0278cd8 and f50b06f.

📒 Files selected for processing (15)
  • projects/cluster/toolbox/rebuild_image/main.py
  • projects/core/ci_entrypoint/github/pr_args.py
  • projects/core/ci_entrypoint/prepare_ci.py
  • projects/core/ci_entrypoint/run_ci.py
  • projects/core/dsl/log.py
  • projects/core/dsl/runtime.py
  • projects/core/dsl/shell.py
  • projects/core/dsl/task.py
  • projects/core/library/config.py
  • projects/core/library/run.py
  • projects/core/notifications/github/api.py
  • projects/core/notifications/send.py
  • projects/core/notifications/slack/api.py
  • projects/fournos_launcher/orchestration/cli.py
  • projects/skeleton/orchestration/prepare_skeleton.py
💤 Files with no reviewable changes (2)
  • projects/core/dsl/shell.py
  • projects/core/dsl/log.py
✅ Files skipped from review due to trivial changes (6)
  • projects/skeleton/orchestration/prepare_skeleton.py
  • projects/core/ci_entrypoint/prepare_ci.py
  • projects/core/notifications/slack/api.py
  • projects/core/library/config.py
  • projects/core/notifications/github/api.py
  • projects/core/notifications/send.py
🚧 Files skipped from review as they are similar to previous changes (1)
  • projects/core/dsl/runtime.py

@kpouget kpouget force-pushed the 008-toolbox-dsl branch 3 times, most recently from d7b36e5 to 5bf8e85 Compare April 21, 2026 07:45
@kpouget
Contributor Author

kpouget commented Apr 21, 2026

/test fournos skeleton
/cluster cluster-1
/var forge.namespace: psap-automation-wip

@topsail-bot

topsail-bot Bot commented Apr 21, 2026

🔴 Test of 'fournos_launcher submit' failed after 00 hours 08 minutes 52 seconds 🔴

• Link to the test results.

• No reports index generated...

Test configuration:

/test fournos skeleton
/cluster cluster-1
/var forge.namespace: psap-automation-wip

Failure indicator:

## /logs/artifacts/FAILURE 
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
~~ projects/fournos_launcher/toolbox/submit_and_wait/main.py:167
~~ TASK: wait_for_job_completion: Wait for FOURNOS job to complete
~~ ARTIFACT_DIR: /logs/artifacts/001__submit_and_wait
~~ LOG_FILE: /logs/artifacts/001__submit_and_wait/task.log
~~ ARGS:
~~     cluster_name: cluster-1
~~     project: skeleton
~~     args: []
~~     variables_overrides:
~~       forge.namespace: psap-automation-wip
~~     job_name: ''
~~     namespace: psap-automation
~~     owner: FORGE/fournos-launcher
~~     display_name: Forge Job
~~     pipeline_name: forge-test-only
~~     env:
~~       JOB_TYPE: presubmit
~~       JOB_NAME: pull-ci-openshift-psap-forge-main-fournos
~~       JOB_SPEC: '{"type":"presubmit","job":"pull-ci-openshift-psap-forge-main-fournos","buildid":"2046497368961978368","prowjobid":"76287593-89db-4211-bc56-80d212c2d7a1","refs":{"org":"openshift-psap","repo":"forge","repo_link":"https://github.com/openshift-psap/forge","base_ref":"main","base_sha":"fb6fb080aad35ae46a3a453396d0561d4791f91f","base_link":"https://github.com/openshift-psap/forge/commit/fb6fb080aad35ae46a3a453396d0561d4791f91f","pulls":[{"number":35,"author":"kpouget","sha":"5bf8e857073d4d107b70569a43e478d36bd35e1a","title":"[core]
~~         Improve, test and document the DSL","head_ref":"008-toolbox-dsl","link":"https://github.com/openshift-psap/forge/pull/35","commit_link":"https://github.com/openshift-psap/forge/pull/35/commits/5bf8e857073d4d107b70569a43e478d36bd35e1a","author_link":"https://github.com/kpouget"}]},"decoration_config":{"timeout":"23h0m0s","grace_period":"15s","utility_images":{"clonerefs":"us-docker.pkg.dev/k8s-infra-prow/images/clonerefs:v20260414-6691f5aff","initupload":"us-docker.pkg.dev/k8s-infra-prow/images/initupload:v20260414-6691f5aff","entrypoint":"us-docker.pkg.dev/k8s-infra-prow/images/entrypoint:v20260414-6691f5aff","sidecar":"us-docker.pkg.dev/k8s-infra-prow/images/sidecar:v20260414-6691f5aff"},"resources":{"clonerefs":{"limits":{"memory":"3Gi"},"requests":{"cpu":"100m","memory":"500Mi"}},"initupload":{"limits":{"memory":"200Mi"},"requests":{"cpu":"100m","memory":"50Mi"}},"place_entrypoint":{"limits":{"memory":"100Mi"},"requests":{"cpu":"100m","memory":"25Mi"}},"sidecar":{"limits":{"memory":"2Gi"},"requests":{"cpu":"100m","memory":"250Mi"}}},"gcs_configuration":{"bucket":"test-platform-results","path_strategy":"single","default_org":"openshift","default_repo":"origin","mediaTypes":{"log":"text/plain"},"compress_file_types":["txt","log","json","tar","html","yaml"]},"gcs_credentials_secret":"gce-sa-credentials-gcs-publisher","skip_cloning":true,"censor_secrets":true,"censoring_options":{"minimum_secret_length":6}}}'
~~       OPENSHIFT_CI: 'true'
~~       JOB_NAME_SAFE: fournos
~~       BUILD_ID: '2046497368961978368'
~~       PULL_PULL_SHA: 5bf8e857073d4d107b70569a43e478d36bd35e1a
~~       PULL_NUMBER: '35'
~~       PULL_BASE_REF: main
~~       REPO_NAME: forge
~~       REPO_OWNER: openshift-psap
~~       PULL_BASE_SHA: fb6fb080aad35ae46a3a453396d0561d4791f91f
~~       PULL_TITLE: '[core] Improve, test and document the DSL'
~~       PULL_REFS: main:fb6fb080aad35ae46a3a453396d0561d4791f91f,35:5bf8e857073d4d107b70569a43e478d36bd35e1a
~~       PULL_HEAD_REF: 008-toolbox-dsl
~~     status_dest: /logs/artifacts
~~     artifact_dir: /logs/artifacts/001__submit_and_wait
~~ CONTEXT:
~~     final_job_name: forge-skeleton-20260421-075326
~~     manifest_file: /logs/artifacts/001__submit_and_wait/src/forge-skeleton-20260421-075326-manifest.yaml
~~
~~ EXCEPTION: RuntimeError
~~     Job forge-skeleton-20260421-075326 not found in namespace psap-automation
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx


[...]

Execution logs

@kpouget
Contributor Author

kpouget commented Apr 21, 2026

/test fournos skeleton
/cluster cluster-1
/var fournos.namespace: psap-automation-wip

@topsail-bot

topsail-bot Bot commented Apr 21, 2026

🟢 Test of 'fournos_launcher submit' succeeded after 00 hours 00 minutes 27 seconds 🟢

• Link to the test results.

• No reports index generated...

Test configuration:

/test fournos skeleton
/cluster cluster-1
/var fournos.namespace: psap-automation-wip

Execution logs

@kpouget
Contributor Author

kpouget commented Apr 21, 2026

/test fournos llm_d
/cluster cluster-1
/var fournos.namespace: psap-automation-wip

@topsail-bot

topsail-bot Bot commented Apr 21, 2026

🟢 Test of 'fournos_launcher submit' succeeded after 00 hours 00 minutes 28 seconds 🟢

• Link to the test results.

• No reports index generated...

Test configuration:

/test fournos llm_d
/cluster cluster-1
/var fournos.namespace: psap-automation-wip

Execution logs

@kpouget
Contributor Author

kpouget commented Apr 21, 2026

/test fournos llm_d
/cluster psap-mgmt
/var fournos.namespace: psap-automation-wip

@topsail-bot

topsail-bot Bot commented Apr 21, 2026

🟢 Test of 'fournos_launcher submit' succeeded after 00 hours 09 minutes 30 seconds 🟢

• Link to the test results.

• No reports index generated...

Test configuration:

/test fournos llm_d
/cluster psap-mgmt
/var fournos.namespace: psap-automation-wip

Execution logs

@kpouget
Contributor Author

kpouget commented Apr 21, 2026

I merge this one 👍🏻

@kpouget kpouget merged commit 543e5bb into openshift-psap:main Apr 21, 2026
6 of 7 checks passed
@kpouget kpouget deleted the 008-toolbox-dsl branch April 21, 2026 09:01
