As referenced in #680, we have some tests that use spatial joins involving OvertureMaps datasets (address, building, landuse). I also have a few that I put together for the release post for 0.3.0 at apache/sedona#2675 (and there are some in our overture docs guide that mostly exercise pruning).
We should keep track of these in the benchmarks directory (even if informally) since coming up with these queries is non-trivial!