Skip to content

fix: remove unnecessary IgnoreCometNativeDataFusion tags from 3.5.8 diff#3831

Open
andygrove wants to merge 2 commits intomainfrom
fix/spark-358-remove-ignore-native-datafusion
Open

fix: remove unnecessary IgnoreCometNativeDataFusion tags from 3.5.8 diff#3831
andygrove wants to merge 2 commits intomainfrom
fix/spark-358-remove-ignore-native-datafusion

Conversation

@andygrove
Copy link
Copy Markdown
Member

@andygrove andygrove commented Mar 28, 2026

Which issue does this PR close?

Closes #3312, closes #3313, closes #3314, closes #3315, closes #3319, closes #3320, closes #3401.

Rationale for this change

Several tests in the 3.5.8 Spark SQL test diff were tagged with IgnoreCometNativeDataFusion but actually pass when run with COMET_PARQUET_SCAN_IMPL=native_datafusion. These tags were only present in the 3.5.8 diff and not in the 3.4.3 or 4.0.1 diffs. In some cases the tests just needed their plan node pattern matches updated to also handle CometNativeScanExec.

What changes are included in this PR?

Removed IgnoreCometNativeDataFusion from tests that pass as-is (verified with COMET_PARQUET_SCAN_IMPL=native_datafusion):

Fixed ExtractPythonUDFsSuite (#3312) to match CometNativeScanExec in plan node pattern matches for column pruning and filter pushdown checks.

Fixed BucketedReadSuite (#3319) by adding CometNativeScanExec to getFileScan(), getBucketScan(), and the coalesced bucket scan pattern match.

Fixed DisableUnnecessaryBucketedScanSuite (#3319) by adding CometNativeScanExec to checkNumBucketedScan().

Updated DynamicPartitionPruningSuite issue reference from #3313 to #3442 for consistency with the 3.4.3 and 4.0.1 diffs.

How are these changes tested?

Each test was run individually with ENABLE_COMET=true ENABLE_COMET_ONHEAP=true COMET_PARQUET_SCAN_IMPL=native_datafusion against Apache Spark 3.5.8 with the updated diff applied. All tests passed.

Remove IgnoreCometNativeDataFusion from tests that pass with
native_datafusion scan in the 3.5.8 Spark SQL test diff. Also fix
ExtractPythonUDFsSuite to match CometNativeScanExec in plan checks,
and update DPP test issue reference from #3313 to #3442 for
consistency with other diffs.

Tests that still need the tag (bucketed read/scan suites) are kept
as they require helper method updates to support CometNativeScanExec.
Add CometNativeScanExec to plan node pattern matches in
BucketedReadSuite and DisableUnnecessaryBucketedScanSuite helper
methods, allowing all #3319 tests to pass with native_datafusion scan.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

1 participant