fix: address 6 confirmed review issues on PR #514 by colehurwitz · Pull Request #520 · akashgit/remote-factory

colehurwitz · 2026-06-12T22:20:46Z

Closes #519. Addresses review feedback from @osilkin98 on PR #514.

Summary

Fixes all 6 issues confirmed in the code review:

1. EvalFragment.passed semantic overload → add coverage_pct field
2. Undisclosed scoring changes → stub new dimensions to return None (behavioral parity with main)
3. cargo test --workspace → cargo test (restore main behavior)
4. TypeScript detection regression → NodeEvaluator.name returns typescript unconditionally
5. NodeEvaluator.name statefulness → resolved by fix 4 (no more self._project_path reference)
6. Characterization tests updated to reflect all fixes

This PR is a behavior-preserving architecture refactor. New eval dimensions for Go/Rust/Node will be enabled in a follow-up PR with explicit disclosure.
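To illustrate the first, second, and fourth fixes, here is a minimal sketch and not the repository's actual code. Only EvalFragment, passed, coverage_pct, NodeEvaluator, and name come from this PR. The dimension field, the run_lint method, the use of a dataclass, and the choice to expose name as a property are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class EvalFragment:
    """Hypothetical shape: only `passed` and `coverage_pct` are named in the PR."""

    dimension: str                        # assumed label, e.g. "tests" or "coverage"
    passed: Optional[bool] = None         # pass/fail only
    coverage_pct: Optional[float] = None  # coverage lives in its own field


class NodeEvaluator:
    """Sketch of a stateless evaluator."""

    @property
    def name(self) -> str:
        # Returned unconditionally, so there is no dependence on
        # per-project state such as self._project_path.
        return "typescript"

    def run_lint(self, project_path: str) -> Optional[EvalFragment]:
        # Hypothetical new dimension, stubbed to None so scoring
        # stays at parity with main until it is explicitly enabled.
        return None
```

With this shape, a coverage result can be reported as EvalFragment(dimension="coverage", coverage_pct=81.5) without assigning any meaning to passed.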

Refs: PR #514, #513

…ompat

The multi-language eval refactor required source files in detect() methods, breaking discovery, which only needs marker files (package.json, Cargo.toml, etc.) to identify the project language. Phantom detection prevention is already handled by each run_*() method returning None when no source files exist.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
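The split described in this commit message, marker-only detection with the source-file check deferred to the run methods, might look roughly like the sketch below. The class layout, the run_tests name, and the dict return value are assumptions. Only detect(), the run_*() naming pattern, and Cargo.toml as a marker file come from the message.

```python
from pathlib import Path
from typing import Optional


class RustEvaluator:
    """Illustrative evaluator; not the project's real implementation."""

    MARKER = "Cargo.toml"

    def detect(self, project_path: Path) -> bool:
        # Discovery only needs the marker file, not any source files.
        return (project_path / self.MARKER).is_file()

    def run_tests(self, project_path: Path) -> Optional[dict]:
        # Phantom results are prevented here instead: with no .rs files
        # there is nothing to evaluate, so return None.
        if not any(project_path.rglob("*.rs")):
            return None
        # Placeholder result; a real evaluator would invoke `cargo test`.
        return {"dimension": "tests"}
```

A directory containing only Cargo.toml is therefore detected as a Rust project but contributes no test fragment.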

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

…or paths Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

- EvalFragment: add coverage_pct field to fix passed semantic overload
- NodeEvaluator.name: return typescript unconditionally (fix detection regression + statefulness)
- Rust: revert cargo test --workspace to cargo test
- Go/Rust/Node: stub new eval dimensions to None (behavioral parity with main)
- Tests: update characterization tests to reflect all fixes

Closes #519

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

codecov · 2026-06-12T22:22:30Z

Codecov Report

❌ Patch coverage is 97.76952% with 6 lines in your changes missing coverage. Please review.
✅ Project coverage is 87.92%. Comparing base (5985563) to head (e161326).

Files with missing lines	Patch %	Lines
factory/eval/languages/node.py	95.12%	2 Missing ⚠️
factory/eval/languages/rust.py	94.11%	2 Missing ⚠️
factory/eval/languages/go.py	96.29%	1 Missing ⚠️
factory/eval/languages/python.py	98.18%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #520      +/-   ##
==========================================
+ Coverage   86.77%   87.92%   +1.14%     
==========================================
  Files          64       70       +6     
  Lines       10027    10120      +93     
==========================================
+ Hits         8701     8898     +197     
+ Misses       1326     1222     -104

Filter out EvalFragment entries where coverage_pct is None before summing, preventing a TypeError crash if a future evaluator returns a coverage fragment without setting coverage_pct. Also handle the empty-list edge case to avoid ZeroDivisionError.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
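The guard described in this commit can be sketched as follows. This is a hedged illustration: the mean_coverage function name, the single-field EvalFragment, and the choice to return None for an empty input are assumptions, since the commit message does not say what the real code returns in that case.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass
class EvalFragment:
    # Reduced to the one field this sketch needs.
    coverage_pct: Optional[float] = None


def mean_coverage(fragments: Sequence[EvalFragment]) -> Optional[float]:
    # Drop fragments without a coverage value, so sum() never sees None
    # (which would raise TypeError).
    values = [f.coverage_pct for f in fragments if f.coverage_pct is not None]
    # Nothing left to average: bail out before dividing by zero.
    if not values:
        return None
    return sum(values) / len(values)
```

For example, fragments with coverage values 80.0, None, and 90.0 average to 85.0, because the None entry is skipped and is not counted as zero.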

…priority

Swap node and rust registration so detection priority is python → node/typescript → rust → go, matching the original _detect_language behavior. Without this, projects with both Cargo.toml and package.json would incorrectly detect as 'rust'.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
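Why registration order matters can be shown with a first-match registry. This is a simplified sketch: the REGISTRY list and detect_language function are hypothetical stand-ins for the project's registry, and pyproject.toml and go.mod are assumed marker files (the PR itself names only package.json and Cargo.toml).

```python
from pathlib import Path
from typing import Optional

# (language name, marker file) pairs in registration order.
# Detection returns the first match, so this order is the priority.
REGISTRY = [
    ("python", "pyproject.toml"),   # assumed marker
    ("typescript", "package.json"),
    ("rust", "Cargo.toml"),
    ("go", "go.mod"),               # assumed marker
]


def detect_language(project_path: Path) -> Optional[str]:
    for name, marker in REGISTRY:
        if (project_path / marker).is_file():
            return name
    return None
```

With node/typescript registered ahead of rust, a project holding both package.json and Cargo.toml resolves to typescript. Swapping those two entries would make the same project resolve to rust.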

colehurwitz · 2026-06-12T23:02:25Z

✅ Factory Review: KEEP

Verdict: KEEP
Reason: Fixes all 6 confirmed review issues on PR #514: EvalFragment semantic overload, undisclosed scoring changes, cargo test --workspace, TypeScript detection regression, NodeEvaluator statefulness, circular tests. Score +0.0054. All checks pass.

Experiment: #20
Hypothesis: Fix all 6 confirmed review issues on PR #514

Score Comparison

Metric	Value
Before	0.7356
After	0.7410
Delta	+0.0054
Threshold	0.6000

Guard Checks

Check	Result
scope	✅ PASS
eval_immutable	✅ PASS

Posted by Factory CEO

colehurwitz · 2026-06-13T01:07:25Z

✅ Factory Review: KEEP

Verdict: KEEP
Reason: All 6 confirmed review issues from PR #514 resolved with behavioral parity to main

Experiment: #20
Hypothesis: Address all 6 confirmed review issues on PR #514: coverage_pct semantic fix, None returns for unsupported dimensions, cargo test revert, stateless NodeEvaluator.name, simplified detect(), registration order fix

Score Comparison

Metric	Value
Before	0.7356
After	0.7410
Delta	+0.0054
Threshold	0.0100

Guard Checks

Check	Result
scope	✅ PASS
eval_immutable	✅ PASS

Posted by Factory CEO

colehurwitz · 2026-06-13T01:17:54Z

@osilkin98 it made a new PR. I filed a bug report about this. But please take a look

RobotSail · 2026-06-13T04:56:12Z

@colehurwitz Thank you for addressing the changes, sorry it's been such a hassle. Let's merge this and fix the other bug.

colehurwitz and others added 5 commits June 9, 2026 17:10

feat: add LanguageEvaluator Protocol + Registry for multi-language eval

aa5e5be

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

chore: remove unused _rust_env method from RustEvaluator

0019926

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

test: add coverage for Go/Rust/Node new eval methods and _run_cmd err…

42b34ec

…or paths Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

colehurwitz and others added 2 commits June 12, 2026 18:23

colehurwitz marked this pull request as ready for review June 12, 2026 23:02

colehurwitz mentioned this pull request Jun 13, 2026

factory ceo: --focus on PR review should push fixes to the original PR branch #521

Open

osilkin98 approved these changes Jun 13, 2026


osilkin98 merged commit 17e224f into main Jun 13, 2026
6 checks passed

osilkin98 merged 7 commits into main from factory/run-d8c85613


3 participants
