
Add segmented merge algorithm with tests #7126

Open
abhishek593 wants to merge 3 commits into TheHPXProject:master from abhishek593:distributed-merge

Conversation

@abhishek593
Contributor

Proposed Changes

This PR introduces a segmented hpx::merge, allowing stable merging of two sorted distributed ranges across multiple localities. The algorithm employs the co-rank technique: a binary search that identifies which sub-ranges of A and B fall into the current destination slice.
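For readers unfamiliar with the technique, the co-rank search can be sketched as follows. This is an illustrative sketch of the general idea only, not the PR's actual code; the names `co_rank`, `a`, and `b` are mine, and I assume `int` elements compared with `<`, ties resolved toward the first range for stability:

```cpp
#include <algorithm>
#include <cstddef>
#include <vector>

// Sketch of the co-rank binary search: given a destination position k in
// the merged output, find i such that the first k merged elements consist
// of a[0..i) and b[0..k-i). Ties are resolved toward 'a', which keeps the
// merge stable.
std::size_t co_rank(std::size_t k,
    std::vector<int> const& a, std::vector<int> const& b)
{
    std::size_t lo = k > b.size() ? k - b.size() : 0;
    std::size_t hi = std::min(k, a.size());
    while (lo < hi)
    {
        std::size_t i = lo + (hi - lo) / 2;    // candidate split in a
        std::size_t j = k - i;                 // implied split in b
        if (j > 0 && i < a.size() && b[j - 1] >= a[i])
            lo = i + 1;    // a[i] must precede b[j-1]: take more from a
        else if (i > 0 && j < b.size() && a[i - 1] > b[j])
            hi = i;        // b[j] must precede a[i-1]: take less from a
        else
            return i;      // both boundary conditions hold
    }
    return lo;
}
```

Each locality can run this search for the two endpoints of its destination slice and then merge only the resulting sub-ranges, which is what makes the decomposition embarrassingly parallel.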

Future Refinement

The current implementation resides in a single detail/merge.hpp header. Once I start working on set_* algorithms, the shared infrastructure (slice decomposition, handle registry, collective context, co-rank, payload batching, etc.) will be organised into reusable headers. As of now, the exact API boundaries are not clear to me, so premature splitting would not be appropriate.

Signed-off-by: Abhishek Bansal <abhibansal593@gmail.com>
@abhishek593 abhishek593 requested a review from hkaiser as a code owner March 29, 2026 12:24
@StellarBot

Can one of the admins verify this patch?

@hkaiser
Contributor

hkaiser commented Mar 29, 2026

@kollanur This may be interesting for you. Please have a look.

Contributor

@hkaiser hkaiser left a comment


A couple of superficial comments so far. I will need to invest more time in understanding the proposed solution in detail.

One thing that puzzled me right away: why didn't you use the existing (local) hpx::merge, possibly its parallelized variant? We have spent a significant amount of time optimizing that.

Signed-off-by: Abhishek Bansal <abhibansal593@gmail.com>
@abhishek593
Contributor Author

@hkaiser The reason I didn't use HPX's parallel merge is that it could only be applied in Phase 6 (the remaining phases still have to run for the distributed case), and even then we would first have to copy the received fragments into a buffer before launching the parallel merge. That copy would defeat any optimization gain from the parallel merge. Currently, I am doing a standard merge directly from the fragments.

But this distributed merge implementation uses the same co-rank idea (Phase 2). In the parallel merge it's called diagonal_intersection, but it's based on the same binary search.

Signed-off-by: Abhishek Bansal <abhibansal593@gmail.com>
@codacy-production

codacy-production bot commented Apr 4, 2026

Up to standards ✅

🟢 Issues 0 issues

Results:
0 new issues



@hkaiser
Contributor

hkaiser commented Apr 5, 2026

> @hkaiser The reason I didn't use HPX's parallel merge is that it can only be used in Phase 6 (We have to do rest of the phases for the distributed case), and even then, we would have to first copy the received fragments into a buffer, then launch the parallel merge. This would defeat any optimization gain that we may have from parallel merge. Currently, I am just doing a standard merge directly from fragments.
>
> But this distributed merge implementation uses the same co_rank idea (Phase 2). There, it's called diagonal_intersection, but it's based on the same binary-search idea.

If you can call a sequential merge, you should be able to invoke a parallel one as well. At least use hpx::merge(seq) for the fragments; even our sequential merge was heavily optimized beyond a simple loop.

@abhishek593
Contributor Author

@hkaiser Sorry for any misunderstanding, but I am not saying that we can't call the parallel merge. We can definitely call it, but we would first have to convert fragments, which is of type vector<resolved_fragment>, into a vector (or something else that exposes an iterator we can pass). That conversion alone has time complexity O(N), which would eliminate any gain from the parallel merge and would in fact be worse.

The existing parallel merge doesn't have this pre-step of converting to a vector first to obtain an iterator. If I am missing something completely obvious, please let me know, since I am not seeing any way to achieve this. I can work on a benchmark in the next couple of days.
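To illustrate the trade-off under discussion, here is a minimal sketch of what merging directly from non-contiguous fragments could look like, avoiding the O(N) flattening pre-step. `fragment` is a hypothetical stand-in for the PR's resolved_fragment (whose real layout this sketch does not know), and the code assumes `int` elements:

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for resolved_fragment: one sorted slice
// received from a locality.
struct fragment
{
    std::vector<int> data;
};

// Walks an ordered list of fragments as if it were one sorted
// sequence, skipping empty fragments.
struct cursor
{
    std::vector<fragment> const* frags;
    std::size_t f = 0;    // current fragment
    std::size_t i = 0;    // index within it

    bool done()
    {
        while (f < frags->size() && i >= (*frags)[f].data.size())
        {
            ++f;
            i = 0;
        }
        return f == frags->size();
    }
    int value() const { return (*frags)[f].data[i]; }
    void advance() { ++i; }
};

// Stable two-way merge performed directly on the fragments, without
// first flattening either side into one contiguous vector.
std::vector<int> merge_fragments(
    std::vector<fragment> const& as, std::vector<fragment> const& bs)
{
    cursor a{&as};
    cursor b{&bs};
    std::vector<int> out;
    while (!a.done() && !b.done())
    {
        if (b.value() < a.value())    // strict <: ties favour 'a' (stable)
        {
            out.push_back(b.value());
            b.advance();
        }
        else
        {
            out.push_back(a.value());
            a.advance();
        }
    }
    while (!a.done())
    {
        out.push_back(a.value());
        a.advance();
    }
    while (!b.done())
    {
        out.push_back(b.value());
        b.advance();
    }
    return out;
}
```

The alternative being weighed is copying both fragment lists into contiguous vectors first (an O(N) pass) so that an iterator-pair parallel merge can run over them; whether the parallel merge recoups that copy is exactly what a benchmark would settle.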

@hkaiser
Contributor

hkaiser commented Apr 5, 2026

> @hkaiser Sorry for any misunderstanding. But, I am not saying that we can't call parallel merge. We can definitely call it, but we would have to convert fragments which is of type vector<resolved_fragment>, to a vector first, or something that has an iterator we can pass. Just to do this operation, we would have time complexity O(N). This would eliminate any gains that we may have from parallel merge, and infact would be worse.
>
> For parallel merge, they didn't have this pre-step of converting to a vector first to get the iterator. If I am missing something completely obvious, please let me know, since I am not seeing any way to achieve this. I can work on a benchmark in the next couple of days.

I will have to have a closer look at your code to understand this. This is a big PR, please be patient with me.

