
Conversation

@zhangfengcdt
Member

This PR introduces an optional sequential index building mode for spatial joins as an alternative to the default parallel implementation.

A new use_sequential_index_build boolean flag (default: false) controls the index building strategy. The sequential implementation via build_index_sync collects partitions one-by-one without spawning async tasks, supporting execution contexts that lack full async runtime support.
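The sequential strategy can be sketched in a minimal, dependency-free form (all names here are illustrative, not SedonaDB code; the tiny executor only works because the sequential path never suspends on I/O):

```rust
use std::future::Future;
use std::pin::pin;
use std::task::{Context, Poll, Waker};

// Minimal executor for futures that never suspend; enough to drive the
// sequential path, which only awaits inline work.
fn run<F: Future>(fut: F) -> F::Output {
    let mut fut = pin!(fut);
    let mut cx = Context::from_waker(Waker::noop());
    loop {
        if let Poll::Ready(v) = fut.as_mut().poll(&mut cx) {
            return v;
        }
    }
}

// Stand-in for building an index over one partition (illustrative only).
async fn build_partition_index(part: usize) -> usize {
    part * 10
}

// Sequential mode: collect partitions one by one, awaiting inline, with
// no JoinSet::spawn and no dependence on a multi-threaded runtime.
async fn collect_sequential(parts: &[usize]) -> Vec<usize> {
    let mut out = Vec::new();
    for &p in parts {
        out.push(build_partition_index(p).await);
    }
    out
}

fn main() {
    let indexes = run(collect_sequential(&[0, 1, 2]));
    println!("{:?}", indexes); // [0, 10, 20]
}
```

Because no task is ever handed to a scheduler, this shape works even when the surrounding runtime cannot (or should not) spawn worker tasks.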

Key types and functions are now publicly exported: build_index, build_index_sync, SpatialIndex, SpatialIndexBuilder, SpatialJoinBuildMetrics, and SpatialPredicate, enabling external components to programmatically build and manage spatial indexes.

This PR supports external use of the spatial join executor in other query engine projects (single-node or distributed), such as Apache DataFusion Comet.

Resolved conflicts after major refactoring:
- Moved build_index and build_index_sync to new build_index.rs module
- Updated to use new refactored module structure (index submodules)
- Fixed OnceAsync usage pattern (lock -> get_or_insert -> try_once)
- Made SpatialIndex, SpatialJoinBuildMetrics, SpatialIndexBuilder public for Comet
- Preserved build_index_sync for JNI execution contexts
- Added timing instrumentation for index building performance

The key feature preserved: using build_index_sync (sequential collection)
instead of build_index (parallel with JoinSet) to avoid deadlocks in
Comet/JNI synchronous execution contexts.
Comment on lines 36 to 38
/// Synchronous version of build_index that doesn't spawn tasks
/// Used in execution contexts without async runtime support (e.g., Spark/Comet JNI)
pub async fn build_index_sync(
Member

This newly added build_index_sync is also an async function. Why can we not directly use build_index, which is also an async function?

For the use case of Comet, I believe that the DataFusion physical plans constructed by Comet are all single partition. What problem have you encountered when using build_index?

Member Author

@Kontinuation Yes, I agree the function name is a bit confusing; I can rename it to build_index_sequential to better convey that it avoids task spawning.

The reason we need this (instead of using build_index directly) is a limitation with JoinSet::spawn() in JNI contexts: when running in a Spark/Comet JNI context, JoinSet::spawn() fails because:

  1. The tokio runtime in JNI contexts may be single-threaded or have limited threading capabilities
  2. JoinSet::spawn() requires a multi-threaded runtime to spawn new tasks

Even with single-partition DataFusion plans (which is typical for Comet), collect_all still calls JoinSet::spawn():

  // In collect_all:
  let mut join_set = JoinSet::new();
  for ... {
      join_set.spawn(async move { ... });  // <- This fails in JNI contexts
  }
  join_set.join_all().await;  // Wait for all tasks

The issue isn't about parallelism benefits (single partition means nothing to parallelize anyway), but about JoinSet::spawn() itself not working in the JNI execution environment.

When I called build_index directly in a JNI context, it hung forever. I suspect this was caused by a deadlock involving JoinSet::spawn().

In a JNI context:

  1. When JoinSet::spawn() is called, it schedules new tasks to run on the runtime's worker threads
  2. Then join_all().await blocks the current thread waiting for those spawned tasks
  3. But if the current thread IS the only worker thread (or all workers are blocked), no one can execute the spawned tasks
  4. Deadlock: the current thread waits for tasks that need the current thread to execute
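The deadlock structure above can be illustrated with a toy single-worker scheduler (names and types are illustrative, not SedonaDB or Tokio code): spawn only queues work, so if the sole worker blocks waiting for results instead of draining the queue, nothing ever runs. Draining inline, as the sequential path does, always completes:

```rust
use std::collections::VecDeque;

// Toy single-worker scheduler: spawn() only queues a task. If the one
// worker thread then blocks waiting for results, no thread is left to
// drain the queue -- the deadlock described above.
struct ToyRuntime {
    queue: VecDeque<Box<dyn FnOnce() -> i32>>,
}

impl ToyRuntime {
    fn spawn(&mut self, task: Box<dyn FnOnce() -> i32>) {
        self.queue.push_back(task);
    }

    // The sequential fix: the caller drains the queue itself instead of
    // blocking on worker threads that may not exist.
    fn run_inline(&mut self) -> Vec<i32> {
        self.queue.drain(..).map(|task| task()).collect()
    }
}

fn main() {
    let mut rt = ToyRuntime { queue: VecDeque::new() };
    for k in 0..3 {
        rt.spawn(Box::new(move || k * 2));
    }
    println!("{:?}", rt.run_inline()); // [0, 2, 4]
}
```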

It would be ideal to reuse build_index if there is a workaround, but I also think a clean build_index_sequential makes sense, because this is called from a distributed framework where parallelism is provided and dictated by the distributed scheduler itself.

Member

Comet uses a multi-threaded Tokio runtime according to jni_api.rs, and the runtime uses the default configuration if the COMET_* environment variables are not present.

The problem may not be the use of a single-threaded runtime; more likely there are too many in-flight queries and the worker pool has been exhausted. No matter what the actual problem is, a sequential index-building option is always a good thing to have.

Member

I tried to reproduce this by spawning some async tasks in the native Comet scan operator, but Comet tests involving the native scan operator finished successfully and did not hang. I also tried setting the env COMET_WORKER_THREADS=1 and still could not observe hanging. The problem could be somewhere else and not related to spawning async tasks.

impl Stream for ScanStream<'_> {
    type Item = DataFusionResult<RecordBatch>;

    fn poll_next(self: Pin<&mut Self>, ctx: &mut Context<'_>) -> Poll<Option<Self::Item>> {
        // ...

        println!("polling batches from scan: spawn an async task");
        let mut join_set = JoinSet::new();
        for k in 0..10 {
            let res = join_set.spawn(async move {
                println!("hello {}", k);
                k
            });
            println!("spawned async task: {:?}", res);
        }

        let fut = join_set.join_all();
        let pin_fut = pin!(fut);
        let res = ready!(pin_fut.poll(ctx));
        println!("async task result: {:?}", res);

        // ...
    }

    // ...
}

Member Author

I see your test places JoinSet::spawn directly inside poll_next. The hang I observed might not be caused by JoinSet::spawn itself. The difference might be:

  • Your test: Spawns directly inside poll_next using the provided Context
  • Spatial join: Uses OnceAsync for lazy initialization, which has its own coordination logic

The interaction between block_on(async { poll!(...) }), OnceAsync, and JoinSet might behave differently from direct spawning in poll_next.

But in SpatialJoinExec, build_index is called through OnceAsync, which has its own coordination logic. The execution path is:

  1. JNI calls get_runtime().block_on(async { poll!(stream.next()) })
  2. Stream's poll_next triggers OnceAsync
  3. OnceAsync calls build_index → collect_all → JoinSet::spawn
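As a rough std-only sketch of the lazy-init step in that path (using std::sync::OnceLock as a synchronous stand-in for OnceAsync; names are illustrative): the first poll triggers the build, and the build is where the spawning would happen.

```rust
use std::sync::OnceLock;

// Synchronous stand-in for OnceAsync-style lazy init: the first caller
// builds the index; later callers reuse the cached result.
static INDEX: OnceLock<Vec<u64>> = OnceLock::new();

fn get_or_build_index() -> &'static Vec<u64> {
    INDEX.get_or_init(|| {
        // In the real path, this is where build_index -> collect_all
        // -> JoinSet::spawn would run, inside the stream's poll_next.
        vec![1, 2, 3]
    })
}

fn main() {
    println!("{:?}", get_or_build_index()); // [1, 2, 3]
}
```

The extra coordination layer is what distinguishes this path from spawning directly inside poll_next.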

@zhangfengcdt zhangfengcdt marked this pull request as ready for review December 1, 2025 18:57
Member

@Kontinuation Kontinuation left a comment

build_index_sync and build_index have a lot of duplicated code. Can we add a parameter to build_index to switch between concurrent mode and sequential mode?

- rename to build_index_seq
- created a private build_index_impl function with a concurrent: bool parameter
- remove duplicated logic from the code
@zhangfengcdt
Member Author

build_index_sync and build_index have a lot of duplicated code. Can we add a parameter to build_index to switch between concurrent mode and sequential mode?

@Kontinuation I have refactored build_index_sync to (1) rename it to build_index_seq, (2) add a new flag to control concurrency, and (3) remove the duplicated logic.

Could you please take a look again? Thanks!
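The resulting shape could be sketched as follows (a dependency-free illustration of the single-implementation-plus-flag pattern; std threads stand in for Tokio's JoinSet tasks, and everything besides the build_index/build_index_seq/build_index_impl names is hypothetical):

```rust
use std::thread;

type Partition = Vec<u64>;

// Stand-in for building the index contribution of one partition.
fn build_partition(p: &Partition) -> u64 {
    p.iter().sum()
}

// Single shared implementation with a `concurrent` flag, as suggested in
// the review; std threads stand in for tokio's JoinSet tasks.
fn build_index_impl(parts: &[Partition], concurrent: bool) -> Vec<u64> {
    if concurrent {
        thread::scope(|s| {
            let handles: Vec<_> = parts
                .iter()
                .map(|p| s.spawn(move || build_partition(p)))
                .collect();
            handles.into_iter().map(|h| h.join().unwrap()).collect()
        })
    } else {
        // Sequential mode: no spawning at all.
        parts.iter().map(build_partition).collect()
    }
}

fn build_index(parts: &[Partition]) -> Vec<u64> {
    build_index_impl(parts, true)
}

fn build_index_seq(parts: &[Partition]) -> Vec<u64> {
    build_index_impl(parts, false)
}

fn main() {
    let parts = vec![vec![1, 2], vec![3, 4]];
    assert_eq!(build_index(&parts), build_index_seq(&parts));
    println!("{:?}", build_index_seq(&parts)); // [3, 7]
}
```

Both public entry points share one body, so the two modes cannot drift apart, which is the point of the review request.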
