
Tests: GenericArchive cassette migration (unit tier) #298

Open
jeandet wants to merge 97 commits into SciQLop:main from jeandet:modernisation/pr8-genericarchive-cassettes


jeandet (Member) commented May 11, 2026

Summary

Eighth PR of the modernisation effort. Fifth per-provider cassette migration.

Plan: docs/superpowers/plans/2026-05-11-pr8-genericarchive-cassettes.md.

Stacked on PR #296 (SSC), which stacks on #295, #294, #293, #292, #291, and #290. This PR's diff includes all predecessors until they merge in order.

What this PR does

  • Promotes tests/test_direct_archive_downloader.py (22 tests, ddt-driven) and tests/test_file_access.py (13 tests) from contract tier to unit tier with cassette-backed replay.
  • Records 29 new cassettes against the live LPP/CDA-mirror/THEMIS/Arase endpoints, uploads them to https://sciqlop.lpp.polytechnique.fr/data/speasy_cassettes/, updates the manifest.
  • Adds tests/test_genericarchive_contract.py (4 daily-cron drift probes).
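
The replay idea behind a cassette-backed unit test can be sketched without any HTTP library. This only illustrates the concept; it is not vcrpy's API or this PR's test code, and `Cassette`, `UnrecordedRequest`, and the example URL are hypothetical names.

```python
from dataclasses import dataclass, field


class UnrecordedRequest(RuntimeError):
    """Raised when replay is asked for a request that was never recorded."""


@dataclass
class Cassette:
    """Hypothetical in-memory cassette: maps (method, url) to a recorded body."""

    recordings: dict = field(default_factory=dict)

    def record(self, method: str, url: str, body: bytes) -> None:
        self.recordings[(method.upper(), url)] = body

    def replay(self, method: str, url: str) -> bytes:
        key = (method.upper(), url)
        if key not in self.recordings:
            # Replay-only mode: never fall through to the network.
            raise UnrecordedRequest(f"{method} {url} is not in the cassette")
        return self.recordings[key]


cassette = Cassette()
cassette.record("GET", "https://example.invalid/data/file.txt", b"recorded payload")
replayed = cassette.replay("get", "https://example.invalid/data/file.txt")
```

Failing on an unrecorded request, rather than silently going to the network, is what keeps a replay-based tier deterministic.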

Tests dropped from unit tier

Four heavyweight or vcrpy-incompatible tests are skipped from the unit tier; where an equivalent exists, it lives in the contract probe file:

| Test | Reason | Replacement |
|---|---|---|
| test_get_data ddt cases 3, 4 (MMS FPI burst) | 369 MB and 644 MB cassettes, over budget | test_mms_fpi_burst_returns_data probe (smaller time range) |
| test_cached_remote_txt_file | 137 MB HTML response | test_remote_text_resource_assertion_still_holds probe (first 1 KB only) |
| test_remote_file_request_deduplication | Multiprocess workers have independent VCR contexts; vcrpy cannot intercept reliably | No direct probe (multiprocess dedup is hard to validate without real upstream load) |
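
The multiprocess limitation can be illustrated with a plain monkeypatch. This is a sketch under the assumption that worker processes start as fresh interpreters (the PR does not say how the workers are launched); `fake_urlopen` is a hypothetical stand-in for an interception hook and no network call is made.

```python
import subprocess
import sys
import urllib.request

# Code run by the child: report which function urllib.request.urlopen points at.
CHILD_CODE = "import urllib.request; print(urllib.request.urlopen.__name__)"


def fake_urlopen(*args, **kwargs):
    """Stand-in for an interception hook; never touches the network."""
    raise RuntimeError("intercepted")


original = urllib.request.urlopen
urllib.request.urlopen = fake_urlopen  # the patch lives in this process only
try:
    parent_sees = urllib.request.urlopen.__name__
    # A fresh interpreter imports its own, unpatched copy of the module.
    child_sees = subprocess.run(
        [sys.executable, "-c", CHILD_CODE],
        capture_output=True,
        text=True,
        check=True,
    ).stdout.strip()
finally:
    urllib.request.urlopen = original
```

The parent sees the patched function while the child sees the original one, which is the kind of gap that makes in-process interception unreliable across workers.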

Net effect

  • Unit tier: +24 (now 645 total) — most GenericArchive tests run on every PR, deterministic, no network.
  • Contract tier: -29 +4 — 4 GenericArchive drift probes hit upstream daily.
  • Compressed cassette storage: +34 MB (cumulative for the modernisation: 89 MB across AMDA, CDA, CSA, SSC, GenericArchive).

Test plan

  • CI: unit.yml green — GenericArchive tests replay from cassettes
  • CI: contract.yml (manually triggered) — 4 GenericArchive probes pass against real upstream

jeandet and others added 30 commits May 8, 2026 15:55
Captures decisions on UV adoption, hatchling build backend, ruff/basedpyright
tooling, and three-tier test strategy (unit/contract/e2e). Sequences the work
as 17 small PRs ending with a mass reformat.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
Per-task implementation plan for the first PR of the modernisation effort.
Covers pyproject.toml updates, uv.lock generation, CI/RTD switch to uv,
deletion of requirements*.txt / tox.ini / setup.cfg, and developer-doc
updates to drop the PYTHONPATH=. pattern.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
- Add SPEASY_CORE_HTTP_REWRITE_RULES env to PRs.yml non-3.10 pytest step
  (previously only on push/scheduled tests.yml — would have hit a
  non-existent server on PR builds for non-3.10 matrix entries).
- Add --with wheel to PRs.yml build step for parity with tests.yml.
- Scope flake8 to 'speasy tests' in both workflows (matches Makefile
  lint target). Avoids silently broadening lint to docs/conf.py and
  removes the .venv exclusion workaround that was needed when
  flake8 ran from repo root.
Without UV_PROJECT_ENVIRONMENT, uv creates .venv/ inside the project
and RTD's sphinx step (which calls $READTHEDOCS_VIRTUALENV_PATH/bin/python
directly) fails with 'python: not found'. Point uv at RTD's venv so the
install lands where the runner looks for it.
Classified via devtools/apply_test_markers.py:
- 12 files marked unit (pure-logic, no network)
- 19 files marked contract (real-server, will be migrated to cassettes in PRs 4-9)

Reclassifications during manual review:
- test_cache.py: contract -> unit (pure cache-logic, no network or speasy provider use)
- test_file_access.py: unit -> contract (uses HTTP via any_loc_open against live servers)

test_wasm.py was manually adjusted to place pytestmark at module level (the
file's body lives inside a try/except ImportError block, so the script's
naive insertion landed at wrong indentation).
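
One way to avoid the indentation problem is to look only at statements directly in the module body. The following is a minimal sketch, not the code of devtools/apply_test_markers.py; `insert_pytestmark` and the sample module are hypothetical.

```python
import ast


def insert_pytestmark(source: str, tier: str) -> str:
    """Insert `pytestmark = pytest.mark.<tier>` after the last top-level import."""
    tree = ast.parse(source)
    insert_at = 0
    for node in tree.body:
        # Only module-level statements are visited, so imports nested inside
        # try/except or functions never move the insertion point.
        is_docstring = (
            node is tree.body[0]
            and isinstance(node, ast.Expr)
            and isinstance(node.value, ast.Constant)
            and isinstance(node.value.value, str)
        )
        if is_docstring or isinstance(node, (ast.Import, ast.ImportFrom)):
            insert_at = node.end_lineno
    lines = source.splitlines()
    lines.insert(insert_at, f"pytestmark = pytest.mark.{tier}")
    return "\n".join(lines) + "\n"


example = '''import pytest

try:
    import some_optional_dependency

    def test_something():
        pass
except ImportError:
    pass
'''

marked = insert_pytestmark(example, "contract")
```

Here the marker lands at column 0 right after `import pytest`, even though the last import in the file sits inside the `try` block.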
…le path and sample across the inventory

The flat_inventories.generic_archive lookup uses module attribute access on
an instance, not a submodule import. Also, the first N parameters in the
flat inventory are clustered by mission, so a fixed time range can miss all
of them; sample across the full list instead.
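
Sampling across the full list rather than taking the first N can be sketched as follows. The helper name `sample_evenly` and the parameter names are hypothetical, not taken from the test file.

```python
def sample_evenly(items, count):
    """Return up to `count` items spread across the whole sequence."""
    items = list(items)
    if count <= 0 or not items:
        return []
    if count >= len(items):
        return items
    step = len(items) / count
    return [items[int(i * step)] for i in range(count)]


# Parameters clustered by mission, as in a flat inventory listing.
parameters = [f"mission_a/p{i}" for i in range(6)] + [f"mission_b/p{i}" for i in range(6)]

first_n = parameters[:4]                 # all four come from mission_a
spread = sample_evenly(parameters, 4)    # two from each mission
```

With a fixed time range that only suits one mission, `first_n` can miss every candidate, while `spread` reaches both clusters.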
- test_e2e_smoke.test_generic_archive: fail loudly if every candidate
  raises (was silently skipping, defeating the e2e tier's purpose).
- pyproject.toml: drop dead --ignore=setup.py from addopts and document
  the -m unit override semantics so future contributors don't trip on
  'pytest tests/test_amda.py' silently collecting nothing.
- contract.yml / e2e.yml: add concurrency groups so a manual run can't
  overlap with a cron run hammering the same upstream servers.
- CONTRIBUTING.rst: add a short note explaining the three test tiers
  and how to invoke each from local dev.
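
The "fail loudly if every candidate raises" behaviour can be sketched as below. `first_working` and `fake_fetch` are hypothetical names; the real test may be structured differently.

```python
def first_working(candidates, fetch):
    """Return (candidate, result) for the first candidate that fetches cleanly."""
    errors = []
    for candidate in candidates:
        try:
            return candidate, fetch(candidate)
        except Exception as exc:  # remember the failure, try the next candidate
            errors.append(f"{candidate}: {exc!r}")
    # Every candidate raised: fail the test instead of skipping it.
    raise AssertionError("no candidate returned data:\n" + "\n".join(errors))


def fake_fetch(name):
    """Stand-in for a real data request; only 'good' succeeds."""
    if name != "good":
        raise ValueError("no data")
    return [1, 2, 3]


picked, data = first_working(["bad", "good"], fake_fetch)
```

Collecting the per-candidate errors keeps the failure message useful when the whole list is exhausted.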
CI failure (blocking):
- unit.yml: 'make doctest' was using system Python (no sphinx in scope).
  Prefix with 'uv run' so make uses the project venv.

Reviewer findings:
- wasm_tests.yml: pytest tests/test_wasm.py without -m collected 0 tests
  under the new addopts default (test_wasm.py is contract-marked). Add
  -m '' to override.
- CLAUDE.md: examples like 'uv run pytest tests/test_amda.py' silently
  collected 0 tests under -m unit default. Replaced with tier-aware
  examples and added -m '' for the all-tests case.
- unit.yml: only sync --group docs on the coverage runner that needs it,
  not on every matrix entry.
nbsphinx requires the system pandoc binary (not the Python pandoc
wrapper that's in the docs dependency group). PR 1's tests.yml had
'sudo apt install -y texlive pandoc' before make doctest; my unit.yml
rewrite in PR 2 dropped that line, so the doctest job failed with
'nbsphinx.NotebookError: PandocMissing in examples/AMDA.ipynb'.
Restored as a separate apt step on the coverage runner.
The doctest step's examples reference all data providers (cdpp3dview
included) and live inventories. The job-level
SPEASY_CORE_DISABLED_PROVIDERS='cdpp3dview' makes the inventory tree's
cdpp3dview attribute missing during doctest, surfacing as
'types.SimpleNamespace object has no attribute cdpp3dview' and a chain
of NameErrors for variables defined in earlier doctest blocks.

Original tests.yml overrode SPEASY_CORE_DISABLED_PROVIDERS="" on the
combined pytest+doctest step, plus set HTTP_REWRITE_RULES (re-routes
the placeholder URL used in some examples to LPP's mirror) and
USER_AGENT. My PR 2 rewrite dropped the env block; restoring it on
the doctest step.
Pandas now prints its public name in type() repr ('pandas.DataFrame')
rather than the internal module path ('pandas.core.frame.DataFrame').
The user/numpy.rst doctest was written against the old form.
Surfaced now that uv.lock pins a recent pandas; pip-installed envs
were getting older pandas where the old form still applied.
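
The fragility comes from printing a type's repr, which embeds the module path. A small sketch with stand-in classes (no pandas involved; `make_class` is a hypothetical helper) shows that the class name stays stable when only the module path changes:

```python
def make_class(module_name):
    """Build a stand-in class named DataFrame that claims to live in `module_name`."""
    return type("DataFrame", (), {"__module__": module_name})


old_style = make_class("pandas.core.frame")
new_style = make_class("pandas")

old_repr = repr(old_style)   # includes the internal module path
new_repr = repr(new_style)   # includes the public name only
same_name = old_style.__name__ == new_style.__name__
```

A doctest that checks `type(obj).__name__` or uses `isinstance` would therefore be less sensitive to this kind of change than one that prints `type(obj)`.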
jeandet added 27 commits May 13, 2026 14:45
…chive

4 probes covering the capabilities that the heavyweight/skipped
unit-tier tests would otherwise exercise:
- LPP cache server still serves text via any_loc_open
- LPP data server still serves binary via any_loc_open
- MMS FPI burst products still resolve via the CDA mirror (smaller
  time range than the dropped unit-tier test to keep daily cron cheap)
- Vbias HTML resource still returns a parseable HTML document
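
A cheap probe of this kind can read only the head of a response and check that it parses as HTML. This is a sketch, not the probe in tests/test_genericarchive_contract.py; `head_looks_like_html`, `TagCollector`, and the 1024-byte limit are assumptions, and an in-memory stream stands in for the remote resource.

```python
import io
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Collects the names of opening tags seen while parsing."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def head_looks_like_html(stream, limit=1024):
    """Read at most `limit` bytes and report whether an <html> tag appears."""
    head = stream.read(limit)
    parser = TagCollector()
    parser.feed(head.decode("utf-8", errors="replace"))
    return "html" in parser.tags


# Stand-in for a large remote document; only the first 1 KB is consumed.
fake_response = io.BytesIO(
    b"<!DOCTYPE html><html><head><title>x</title></head><body>" + b"x" * 5000
)
ok = head_looks_like_html(fake_response)
consumed = fake_response.tell()
```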
jeandet force-pushed the modernisation/pr8-genericarchive-cassettes branch from 0c0c137 to 4ae2a22 on May 13, 2026 12:46
sonarqubecloud commented

Quality Gate failed

Failed conditions
9 Security Hotspots
E Security Rating on New Code (required ≥ A)

