Rework map_async to handle failures better #495
martindurant merged 5 commits into python-streamz:master
Conversation
streamz/core.py
Outdated
```python
while self.running:
    task, metadata = await self.work_queue.get()
    self.work_queue.task_done()
```
@martindurant I thought more about your feedback on the original PR last night and worked this up this morning. The new example file shows off the failure modes when map/map_async raise so I added this to give the users a better handle on failures.
`map` stops the flow of items in the stream when the mapped function raises, but `map_async` sits outside the direct line of return, so an exception leaves it failing in surprising ways. To address that, I added the option of stopping the stream (or not) on error: unless the stream deliberately invokes `stop` during an exception, it continues to process inputs afterwards.

Since `map_async` now has a notion of stopping, I added a boolean to the node state that controls the loop inside the worker task. On an exception during mapping, `map_async` now releases the references held on the metadata for the offending input.

I also added an example file that demonstrates the failure modes of `map` and `map_async`, showing plainly how exceptions can leave the stream in a bad state.
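To make the mechanism concrete, here is a minimal sketch of the worker-task pattern described above. The names (`work_queue`, `_release_refs`, `stop_on_error`) mirror this PR's vocabulary, but the class is illustrative, not the actual streamz implementation:

```python
import asyncio


class AsyncMapper:
    """Illustrative sketch, not streamz code.

    `self.running` gates the worker loop, and `_release_refs` runs in a
    finally block so metadata references are released even when the
    mapped function raises.
    """

    def __init__(self, func, stop_on_error=False):
        self.func = func
        self.stop_on_error = stop_on_error
        self.running = True
        self.work_queue = asyncio.Queue()

    def _release_refs(self, metadata):
        # Stand-in for streamz's reference-count bookkeeping.
        for ref in metadata:
            ref["refs"] -= 1

    async def worker(self, results):
        while self.running:
            item, metadata = await self.work_queue.get()
            self.work_queue.task_done()
            try:
                results.append(await self.func(item))
            except Exception:
                # Only stop the stream if the node was configured to.
                if self.stop_on_error:
                    self.running = False
            finally:
                # Release references whether mapping succeeded or raised.
                self._release_refs(metadata)
```

With `stop_on_error=False` the loop keeps consuming after a bad input; with `stop_on_error=True` the exception flips `running` and the worker exits after releasing the offending input's refs.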
Force-pushed from e6f1400 to 09969c5
```python
if results:
    await asyncio.gather(*results)
self._release_refs(metadata)
```
`map_async` calls `_retain_refs` when inserting into the work queue, so making sure that we call `_release_refs` even during an exception seems better.
Correct; probably the assumption is that the exception simply stops the whole pipeline, but we can do better. Nodes that filter in/out on exceptions would be reasonable.
I actually had this idea for the next improvement. It would be better for `map`/`starmap`/`map_async` to flow exceptions downstream (probably paired with the offending input) so that the graph can fork successes one way and route failures to a logging/recovery flow.
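A synchronous sketch of that idea, with illustrative names (`safe_map`, tagged tuples) rather than anything from the streamz API:

```python
def safe_map(func, items):
    """Tag each output instead of raising: successes flow as
    ('ok', result), failures as ('err', (exception, input))."""
    for x in items:
        try:
            yield ("ok", func(x))
        except Exception as exc:
            yield ("err", (exc, x))


def successes(tagged):
    # One branch of the fork: results only.
    return [value for tag, value in tagged if tag == "ok"]


def failures(tagged):
    # Other branch: (exception, offending input) pairs for
    # logging or recovery.
    return [value for tag, value in tagged if tag == "err"]
```

In a real stream graph the two filters would be downstream nodes rather than list comprehensions, but the forking shape is the same.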
streamz/core.py
Outdated
```python
stream_name = kwargs.pop('stream_name', None)
self.kwargs = kwargs
self.args = args
self.running = True
```
Isn't starting the stream optional?
I rebuilt the stop/start mechanism so that it tolerates restarting from upstream or downstream.
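A minimal sketch of what a restart-tolerant stop/start might look like, assuming `stop` propagates downstream and `start` propagates upstream; the names and propagation rules here are my guess at the mechanism, not the PR's code:

```python
class Node:
    """Illustrative node with a restartable stopped flag."""

    def __init__(self, upstream=None):
        self.upstream = upstream
        self.downstreams = []
        if upstream is not None:
            upstream.downstreams.append(self)
        self.stopped = True

    def start(self):
        # Starting a node also (re)starts its upstream, so the whole
        # pipeline can be restarted from any point after a stop.
        self.stopped = False
        if self.upstream is not None and self.upstream.stopped:
            self.upstream.start()

    def stop(self):
        # Stopping propagates downstream so dependent nodes quiesce.
        self.stopped = True
        for d in self.downstreams:
            if not d.stopped:
                d.stop()
```

The stopped-check in each direction makes the calls idempotent and keeps propagation from looping.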
Force-pushed from 4787deb to 9a7b3ed
```python
if self.stopped:
    break
```
Without checking `self.stopped` after coming back from the `gather`, the source over-consumes the underlying iterable and loses an element.
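A small sketch of the hazard: re-checking the flag after the `await` keeps the loop from advancing the iterator one step too far. `emit_from_iterable`, `sink`, and the mutable `stopped` flag are all illustrative, not the streamz source node:

```python
import asyncio


async def emit_from_iterable(iterable, sink, stopped):
    """`stopped` is a one-element list standing in for the node's flag;
    returns the iterator so a caller can see what was left unconsumed."""
    it = iter(iterable)
    for x in it:
        if stopped[0]:
            break
        await asyncio.gather(sink(x))
        # Re-check after the gather: the sink (or a peer task) may have
        # stopped the stream while we were awaiting. Without this second
        # check, the for-loop would pull the next element and lose it.
        if stopped[0]:
            break
    return it
```

If the second check is removed, the loop header pulls one more element from `it` before the first check fires, and that element is silently dropped.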
Support for py3.9 should be dropped.

Happy to oblige, because I just figured out why this failed and it is deep in the dark depths of asyncio. Trying to get this to work on 3.9 would be misery.

I empathize.

Thanks for this, going in!