Add test coverage for DSv2 table refresh and pinning design doc #55033

Draft
longvu-db wants to merge 1 commit into apache:master from longvu-db:dsv2-refresh-pinning-test-coverage

@longvu-db
Contributor

33 new tests covering gaps identified in the "Refreshing and pinning tables in Spark" design doc across all 5 sections:

  • Section 1: Temp views with stored plans (drop+add column same/different type, type widening, external changes, multiple column additions, subquery refresh, filter pushdown)
  • Section 2: Repeated table access regression tests (external data writes, schema changes, drop/recreate)
  • Section 3: Incrementally constructed queries (join with drop+add column, three-way join with version drift)
  • Section 4: Dataset show/collect consistency (QE reuse behavior, schema changes via external catalog API, interleaved actions)
  • Section 5: CACHE TABLE scenarios (external schema pinning, session write re-pinning, REFRESH TABLE, external drop/recreate)
  • Edge cases: nested views, self-join version alignment, REFRESH no-op for DataFrame temp views, cached temp view invalidation
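To make the "drop+add column, different type" scenario from Section 1 concrete, here is a toy model in plain Python. It is only a sketch under stated assumptions: `ToyCatalog`, `TableVersion`, and `StoredPlanView` are hypothetical names invented for illustration, not Spark or DSv2 APIs, and the model does not claim to reproduce what Spark or the design doc actually does on refresh. It only shows why a view that stores a resolved plan can disagree with the catalog after the table changes underneath it.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class TableVersion:
    """Immutable snapshot of a table: a version id plus (column, type) pairs."""
    version: int
    schema: tuple


@dataclass
class ToyCatalog:
    """Hypothetical in-memory catalog; every schema change creates a new version."""
    tables: dict = field(default_factory=dict)

    def create(self, name, schema):
        self.tables[name] = TableVersion(1, tuple(schema))

    def load(self, name):
        return self.tables[name]

    def drop_column(self, name, column):
        cur = self.tables[name]
        kept = tuple(c for c in cur.schema if c[0] != column)
        self.tables[name] = TableVersion(cur.version + 1, kept)

    def add_column(self, name, column, type_name):
        cur = self.tables[name]
        new_schema = cur.schema + ((column, type_name),)
        self.tables[name] = TableVersion(cur.version + 1, new_schema)


@dataclass
class StoredPlanView:
    """Hypothetical view that keeps the snapshot resolved when it was defined."""
    catalog: ToyCatalog
    table: str
    pinned: TableVersion

    def is_stale(self):
        # The stored plan is stale once the catalog has moved to another version.
        return self.catalog.load(self.table).version != self.pinned.version

    def refresh(self):
        # Re-resolve the table by name and pin the current snapshot.
        self.pinned = self.catalog.load(self.table)


catalog = ToyCatalog()
catalog.create("t", [("id", "int"), ("v", "int")])
view = StoredPlanView(catalog, "t", catalog.load("t"))

# Drop column v, then add it back with a different type.
catalog.drop_column("t", "v")
catalog.add_column("t", "v", "string")

stale_before = view.is_stale()
pinned_type = dict(view.pinned.schema)["v"]   # type seen by the stored plan
view.refresh()
current_type = dict(view.pinned.schema)["v"]  # type after re-resolving
```

In this toy, the column name `v` survives the drop+add, but the stored snapshot still carries the old type until `refresh()` runs, which is the kind of mismatch the Section 1 tests are described as exercising.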

Co-authored-by: Isaac

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

@longvu-db force-pushed the dsv2-refresh-pinning-test-coverage branch from 62e3e76 to ed536cf on March 26, 2026 13:06
@dongjoon-hyun marked this pull request as draft on March 26, 2026 17:08
Member

@dongjoon-hyun left a comment

Please file a JIRA issue so this PR has a proper JIRA ID before converting it back from Draft status, @longvu-db.
