Skip to content

Worker enabled but Hub SSE reconnects every few seconds and task pipeline never closes #380

@q7793527

Description

@q7793527

Summary

On Windows, after enabling Worker locally and also enabling the Worker toggle in the EvoMap web UI, the worker pipeline does not reach task intake / claim / complete closure.

The local loop stays healthy, but the Hub event stream repeatedly does:

[SSE] Event stream connected
[SSE] Stream error, will reconnect in 5s

This happens over and over, so the worker never reaches a stable event-stream state and no task claim / completion evidence appears locally.

Environment

  • Evolver version: 1.65.0
  • Platform: Windows
  • Loop command:
    node C:\Users\Administrator\<redacted>\skills\evolver\index.js --loop
    
  • Local worker enabled: WORKER_ENABLED=1
  • Worker domains: empty (WORKER_DOMAINS=) to allow default / all-domain behavior
  • Max load: 5
  • Hub URL: https://evomap.ai
  • Web UI Worker toggle: enabled during this test

What We Verified

  • The local loop is alive and healthy.
  • lifecycle status reports the loop correctly.
  • lifecycle check reports healthy.
  • The Worker toggle in the web UI was enabled for the matching node.
  • Despite that, the runtime log continuously shows SSE connect/error/reconnect cycles.
  • No task claim / completion artifacts appeared during the test window.

Relevant Evidence

Local runtime status:

{
  "running": true,
  "pids": [
    {
      "pid": 2184,
      "cmd": "node C:\\Users\\Administrator\\<redacted>\\skills\\evolver\\index.js --loop"
    }
  ],
  "log": "logs\\evolver_loop.log"
}

Health check:

{
  "healthy": true,
  "pids": [2184]
}

Repeated runtime log pattern:

[SSE] Event stream connected
[SSE] Stream error, will reconnect in 5s
[SSE] Event stream connected
[SSE] Stream error, will reconnect in 5s
... repeats continuously ...

During the test, no local evidence of task closure appeared:

  • no new task-claim / task-complete markers
  • no obvious mailbox/task state changes
  • no new worker-completion trail in local artifacts

Expected Behavior

When both of these are true:

  1. local .env has WORKER_ENABLED=1
  2. the EvoMap web UI Worker toggle is enabled for the same node

the worker should be able to maintain a stable Hub event-stream connection, receive task availability, claim a task, execute it, and submit the result or terminal state.

Actual Behavior

The worker loop stays alive, but the Hub event stream reconnects every few seconds and the task pipeline never reaches a verified claim/complete cycle.

Why This Matters

This blocks real worker-pool usage even though:

  • local configuration is correct
  • the node is online
  • the web toggle is enabled
  • the loop itself is healthy

From an operator perspective, everything looks enabled, but the actual worker pipeline never closes.

Suggested Investigation Areas

  • why the Hub SSE connection terminates immediately after connect
  • whether the server is closing the stream due to auth, capability registration, node state, or malformed worker metadata
  • whether the Windows client path behaves differently from other platforms
  • whether an additional local proxy / mailbox component is required but not documented for this path
  • whether the worker toggle path in the web UI is correctly bound to the same node identity as A2A_NODE_ID

Privacy Note

Local filesystem paths were manually redacted before submission.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions