CDM Task Service

This is currently a prototype

Enables running jobs on remote compute from the KBase CDM cluster.

Nomenclature

CDM: Central Data Model
- The KBase data model
CTS: CDM Task Service

Usage notes

The CTS uses CRC64/NVME checksums on all input and output files for data integrity checks and to ensure files do not unexpectedly change over time. As such, all input files in S3 must have the checksums stored in their metadata.

A user can check this via various methods, but here we use the AWS CLI.

When using an S3-compatible store rather than AWS S3 itself, configure a profile:

export AWS_PROFILE=<your preferred profile name>
$ aws configure set endpoint_url <s3 url here>
$ aws configure
AWS Access Key ID [None]: <access key>
AWS Secret Access Key [None]: <secret key>
Default region name [None]: default
Default output format [None]:

Checking a file's checksum:

$ aws s3api head-object --bucket test-bucket --key test-file-with-checksum --checksum-mode ENABLED
{
    "AcceptRanges": "bytes",
    "LastModified": "2026-04-13T22:10:40+00:00",
    "ContentLength": 507,
    "ChecksumCRC64NVME": "4YocWC/iHHg=",
    "ChecksumType": "FULL_OBJECT",
    "ETag": "\"b49d8511ffc4c158395f4b920763f80b\"",
    "ContentType": "binary/octet-stream",
    "Metadata": {}
}

Files without the CRC64NVME checksum will be rejected by the service.

The AWS CLI automatically includes a CRC64NVME checksum on upload:

$ aws s3 cp job_ids s3://test-bucket/test-file-with-checksum
upload: ./job_ids to s3://test-bucket/test-file-with-checksum

If a file already exists in S3 but has no checksum, it may need to be downloaded and re uploaded. Copying also may be an option, but be aware that some S3 implementations, in particular Ceph, do not currently add checksums on a copy.

aws s3 cp s3://test-bucket/test-file-no-checksum s3://test-bucket/new-file --checksum-algorithm CRC64NVME

Further usage documentation

The service OpenAPI documentation
- https://ci.kbase.us/services/cts/docs
Image setup and running documentation / examples:
- ./docs/

Service Requirements

Python 3.12+
crane
An S3 instance for use as a file store, but see "S3 requirements" below
MongoDB 7+
Kafka 2.1+
If submitting jobs to HTCondor, see HTCondor requirements below

S3 requirements

Path style access is required.
The service does not support objects encrypted with customer supplied keys or with the AWS key management service.
The provided credentials must enable listing readable buckets, as the service performs that operation to check the host and credentials on startup.

HTCondor requirements

Due to the multitude of ways HTCondor (HTC) connectivity and authentication can be configured, the service does not expect any particular HTC configuration other than calling

import htcondor2

schedd = htcondor2.Schedd()

... should Just Work (TM) and jobs should be able to be submitted with that Schedd instance. At minimum some sort of authentication must be set up and COLLECTOR_HOST must be supplied in a config file or the environment (via _CONDOR_COLLECTOR_HOST).

The service administrator is expected to set up the HTCondor configuration so that the above is true. If using the Docker image, a configuration file and / or credentials will likely need to be mounted into the container, e.g.

An IDTOKEN and, if necessary, the _CONDOR_SEC_TOKEN_DIRECTORY to tell HTC the token's location.
A condor configuration file and password file if using PASSWORD authentication.

Furthermore each HTCondor worker must have the necessary secrets provided in files for the executor to read:

a KBase token with an auth2 role indicating the user is a CTS external executor
the S3 access secret

See the HTCondor section of cdmtaskservice_config.toml.jinja.

Powered by

Development

Adding code

In this alpha / prototype stage, we will be PRing (do not push directly) to main. In the future, once we want to deploy beyond CI, we will add a develop branch.
The PR creator merges the PR and deletes branches (after builds / tests / linters complete).

Code requirements for prototype code

Any code committed must at least have a test file that imports it and runs a noop test so that the code is shown with no coverage in the coverage statistics. This will make it clear what code needs tests when we move beyond the prototype stage.
Each module should have its own test file. Eventually these will be expanded into unit tests (or integration tests in the case of app.py)
Any code committed must have regular code and user documentation so that future devs converting the code to production can understand it.
Release notes are not strictly necessary while deploying to CI, but a concrete version (e.g. no -dev* or -prototype* suffix) will be required outside of that environment. On a case by case basis, add release notes and bump the prototype version (e.g. 0.1.0-prototype3 -> 0.1.0-prototype4) for changes that should be documented.

Running tests

Docker must be installed.

Copy test.cfg.example to test.cfg and fill it in appropriately.

uv sync --dev  # only the first time or when uv.lock changes
PYTHONPATH=. uv run pytest test

Exit from prototype status

Coverage badge in Readme
Run through all code, refactor to production quality
Add tests where missing (which is a lot) and inspect current tests for completeness and quality
- E.g. don't assume existing tests are any good
- Async testing help https://tonybaloney.github.io/posts/async-test-patterns-for-pytest-and-unittest.html

Name		Name	Last commit message	Last commit date
Latest commit History 959 Commits
.github		.github
cdmtaskservice		cdmtaskservice
design		design
docker_compose/htcondor		docker_compose/htcondor
docs		docs
images		images
scripts		scripts
test		test
test_common		test_common
test_manual		test_manual
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
LICENSE.md		LICENSE.md
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
cdmtaskservice_config.toml.jinja		cdmtaskservice_config.toml.jinja
cdmtaskservice_refdata_config.toml.jinja		cdmtaskservice_refdata_config.toml.jinja
docker-compose.yaml		docker-compose.yaml
pyproject.toml		pyproject.toml
test.cfg.example		test.cfg.example
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CDM Task Service

Nomenclature

Usage notes

Further usage documentation

Service Requirements

S3 requirements

HTCondor requirements

Powered by

Development

Adding code

Code requirements for prototype code

Running tests

Exit from prototype status

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

CDM Task Service

Nomenclature

Usage notes

Further usage documentation

Service Requirements

S3 requirements

HTCondor requirements

Powered by

Development

Adding code

Code requirements for prototype code

Running tests

Exit from prototype status

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages