FEAT normalize messages before sending#1613

Open
hannahwestra25 wants to merge 17 commits into microsoft:main from hannahwestra25:hawestra/normalize_send_prompt

Conversation

@hannahwestra25
Contributor

Description

Utilize the Normalization Pipeline in the Target Send Path

PR 4 of the TargetConfiguration roadmap

Problem

The TargetConfiguration.normalize_async pipeline (system-squash, history-squash, etc.) was fully built in this PR but never called. Every target independently fetched conversation history, appended the current message, and sent it to the API — some with ad-hoc normalization (AzureMLChatTarget), most with none at all. This meant the centralized normalization pipeline was dead code, and normalization behavior was inconsistent across targets.

Solution

Wire the normalization pipeline into the send path so that every prompt passes through configuration.normalize_async() before reaching the target's API call. This is done by making send_prompt_async a concrete template method on PromptTarget that validates, fetches conversation from memory, runs the normalization pipeline, and delegates to a new _send_prompt_target_async abstract method for wire-format-specific logic.
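As a rough sketch of the template-method shape described above (the type names, the validation, and the memory helper below are simplified stand-ins, not PyRIT's actual signatures):

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List


# Minimal stand-ins; the real Message carries message_pieces,
# conversation ids, labels, etc.
@dataclass
class Message:
    role: str
    content: str


class TargetConfiguration:
    async def normalize_async(self, *, messages: List[Message]) -> List[Message]:
        # Placeholder for the real pipeline (system-squash, history-squash, ...).
        return messages


class PromptTarget(ABC):
    def __init__(self, configuration: TargetConfiguration) -> None:
        self.configuration = configuration

    async def send_prompt_async(self, *, message: Message) -> Message:
        # Template method: validate, fetch history from memory, run the
        # normalization pipeline, then delegate to the wire-format hook.
        if not message.content:
            raise ValueError("Message must not be empty.")
        conversation = self._fetch_conversation_from_memory() + [message]
        normalized = await self.configuration.normalize_async(messages=conversation)
        return await self._send_prompt_target_async(conversation=normalized)

    def _fetch_conversation_from_memory(self) -> List[Message]:
        return []  # the real implementation reads prior turns from memory

    @abstractmethod
    async def _send_prompt_target_async(self, *, conversation: List[Message]) -> Message:
        """Wire-format-specific logic; subclasses override this, not send_prompt_async."""
```

Target authors then only implement `_send_prompt_target_async` and always receive a conversation that has already been through normalization.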

Changes

  • PromptTarget.send_prompt_async: Now a concrete method that calls self.configuration.normalize_async(messages=...) and passes the result to _send_prompt_target_async
  • All 20 target subclasses: Renamed send_prompt_async to _send_prompt_target_async, removed duplicated validation/memory-fetch boilerplate; they now receive the pre-normalized conversation directly
  • AzureMLChatTarget: message_normalizer parameter deprecated with auto-translation to TargetConfiguration(policy={SYSTEM_PROMPT: ADAPT}); will be removed in v0.14.0
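The AzureMLChatTarget deprecation path could be sketched roughly as below. All names here (PolicyKey, PolicyAction, the constructor signatures) are illustrative stand-ins, not PyRIT's actual API:

```python
import warnings
from enum import Enum


# Hypothetical minimal stand-ins for the real policy types.
class PolicyKey(Enum):
    SYSTEM_PROMPT = "system_prompt"


class PolicyAction(Enum):
    ADAPT = "adapt"


class TargetConfiguration:
    def __init__(self, *, policy=None):
        self.policy = policy or {}


class AzureMLChatTarget:
    def __init__(self, *, message_normalizer=None, configuration=None):
        if message_normalizer is not None:
            # Deprecated path: auto-translate the old parameter into the
            # new configuration policy and warn the caller.
            warnings.warn(
                "message_normalizer is deprecated and will be removed in "
                "v0.14.0; use TargetConfiguration(policy=...) instead.",
                DeprecationWarning,
                stacklevel=2,
            )
            configuration = TargetConfiguration(
                policy={PolicyKey.SYSTEM_PROMPT: PolicyAction.ADAPT}
            )
        self.configuration = configuration or TargetConfiguration()
```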

Breaking Changes

  • Target authors must override _send_prompt_target_async instead of send_prompt_async

Tests and Documentation

  • Tests: Updated all mocks/stubs to new signature; added test_normalize_async_integration.py (395 lines) covering normalize-is-called, normalized-conversation-is-used, memory-not-mutated, and legacy deprecation paths

wip: running integration tests

Comment thread pyrit/prompt_target/azure_ml_chat_target.py
Contributor

@romanlutz romanlutz left a comment


Don't we need tests for this?

Nvm didn't render first time I looked!

Comment thread pyrit/prompt_target/azure_blob_storage_target.py Outdated
Comment thread pyrit/prompt_target/http_target/http_target.py Outdated
Comment thread pyrit/prompt_target/http_target/http_target.py Outdated
Comment thread pyrit/prompt_target/openai/openai_response_target.py
Comment thread pyrit/prompt_target/common/prompt_target.py
Comment thread pyrit/prompt_target/openai/openai_realtime_target.py Outdated
Comment thread pyrit/prompt_target/openai/openai_response_target.py Outdated
@hannahwestra25 hannahwestra25 marked this pull request as ready for review April 16, 2026 15:15
Comment thread pyrit/prompt_target/common/prompt_target.py Outdated
"""
if not message.message_pieces:
raise ValueError("Message must contain at least one message piece. Received: 0 pieces.")
normalized_conversation = await self._get_normalized_conversation_async(message=message)
Contributor

@rlundeen2 rlundeen2 Apr 16, 2026


I think there is a bug here. A nasty one if I understand it correctly.

_get_normalized_conversation can create new messages. As an example, HistorySquashNormalizer creates a new message with a new conversation_id, dropped labels, a new attack_identifier, etc. Targets then use this to construct the response, so the response inherits the garbage metadata. The implication is that the response is not added to memory as part of the conversation, and we lose other metadata as well.

One fix might be in _get_normalized_conversation_async if we re-stamp all the original message metadata.

     if normalized:
         self._stamp_lineage(source=message, target_message=normalized[-1])

Where stamp_lineage copies all the metadata onto every piece in the target message.
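A minimal sketch of that re-stamping, assuming a simplified MessagePiece with only the lineage fields mentioned above (the real type carries more):

```python
from dataclasses import dataclass, field
from typing import Dict, List


# Hypothetical minimal shapes for illustration only.
@dataclass
class MessagePiece:
    content: str
    conversation_id: str = ""
    labels: Dict[str, str] = field(default_factory=dict)
    attack_identifier: str = ""


@dataclass
class Message:
    message_pieces: List[MessagePiece]


def stamp_lineage(*, source: Message, target_message: Message) -> None:
    """Copy lineage metadata from the original message onto every piece
    of the normalizer-produced message, in place."""
    src = source.message_pieces[0]
    for piece in target_message.message_pieces:
        piece.conversation_id = src.conversation_id
        piece.labels = dict(src.labels)
        piece.attack_identifier = src.attack_identifier
```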

Either way, can we have a test with a conversation, run a normalizer that changes it (like squash_messages) and verify that the conversation_id and metadata of the response is accurate? It might be good to have this test first to verify the bug exists. And then run the test again to verify the fix

Contributor Author


Ahh yes, so I added a few tests to test_prompt_target for this scenario (which initially repro'd the issue) and added a propagate_lineage function. One caveat: right now all the normalizers update the last message, so I'm only updating that one with the lineage data. That's technically not an invariant of normalizers (to only produce one message), so if a user (or we) created a normalizer that produced multiple messages, the garbage metadata would still exist on all but the last message. I think to fix that we'd need some way of distinguishing the new normalized messages from the rest of the conversation. I'm not sure how much of an issue this could be and figured we could just propagate onto the last message for now, but curious if you have thoughts.

Contributor


I like it! Also good job seeing the edge case.

But I could also see a message_normalizer that splits things into multiple messages... We may want a defense in depth that logger.warns if the message_normalizer result has more messages than the input. And maybe we still stamp the conversation_id so it's at least associated.
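A minimal sketch of that defense-in-depth warning (function name and shapes hypothetical):

```python
import logging
from typing import List

logger = logging.getLogger(__name__)


def warn_if_expanded(original: List[object], normalized: List[object]) -> None:
    """Hypothetical check: warn when a normalizer produced more messages
    than it received, since lineage is only re-stamped onto the last one."""
    if len(normalized) > len(original):
        logger.warning(
            "Normalizer expanded the conversation from %d to %d messages; "
            "lineage metadata is only re-stamped on the last message.",
            len(original),
            len(normalized),
        )
```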

WDYT?

Comment thread pyrit/prompt_target/openai/openai_realtime_target.py Outdated
Contributor

@rlundeen2 rlundeen2 left a comment


Approved, but I have one suggestion for an insidious bug; not a blocker but worth thinking through

@rlundeen2 rlundeen2 self-assigned this Apr 16, 2026