Goal
Design a JSON-LD schema that extends Croissant / schema.org to represent interrogatory model cards, maximizing interoperability with existing tooling.
Why Croissant/schema.org?
- JSON-LD foundation - machine-readable, web-native, supports linked data
- Existing adoption - Hugging Face, Kaggle, and Google Dataset Search use Croissant
- Extensible - designed for domain-specific vocabularies
- Future-proof - places the fewest constraints on future users; can be rendered to other formats
Design Questions
Namespace & Vocabulary
CAN/SHOULD/MUST Representation
Relationship to CycloneDX ML-BOM
Evidence Linking Schema
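As a starting point for the Namespace & Vocabulary question, a context could be assembled roughly as follows. This is only a sketch: the `imc` prefix and its IRI are placeholders invented for illustration, the `cr` prefix follows the namespace Croissant uses in its own context, and the four type names are the ones that appear in the notes for this question.

```python
# Hypothetical namespace IRI and prefix; the real choice is an open design question.
IMC_NS = "https://example.org/interrogatory-model-card#"

# Type names taken from the Namespace & Vocabulary notes.
CARD_TYPES = [
    "InterrogatoryModelCard",
    "ModelClaim",
    "EvidenceLink",
    "InterrogatoryPrompt",
]


def build_context() -> dict:
    """Return a JSON-LD document that holds only an @context."""
    context = {
        "@vocab": "https://schema.org/",          # unprefixed terms resolve to schema.org
        "sc": "https://schema.org/",
        "cr": "http://mlcommons.org/croissant/",  # prefix used by Croissant
        "imc": IMC_NS,                            # placeholder prefix (assumption)
    }
    # Map each new type name to a compact IRI under the placeholder prefix.
    for type_name in CARD_TYPES:
        context[type_name] = f"imc:{type_name}"
    return {"@context": context}
```

Serializing the result with `json.dumps(build_context(), indent=2)` would give a candidate body for the `.jsonld` context file, pending a decision on the real namespace.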
Deliverables
- schema/interrogatory-model-card.jsonld - JSON-LD context definition
- schema/interrogatory-model-card.schema.json - JSON Schema for validation
- Example card in both formats demonstrating all field types
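To make the JSON Schema deliverable more concrete, here is one possible shape for it, written as a Python dict. The property names `name` and `claims` are assumptions, not agreed fields, and `check_card` is a deliberately tiny stand-in that understands only the `required` and `const` keywords. It is not a JSON Schema validator.

```python
# Hypothetical minimal schema; "name" and "claims" are placeholder property names.
CARD_SCHEMA = {
    "$schema": "https://json-schema.org/draft/2020-12/schema",
    "title": "InterrogatoryModelCard",
    "type": "object",
    "required": ["@context", "@type", "name", "claims"],
    "properties": {
        "@type": {"const": "InterrogatoryModelCard"},
        "name": {"type": "string"},
        "claims": {"type": "array"},
    },
}


def check_card(card: dict, schema: dict = CARD_SCHEMA) -> list[str]:
    """Check only `required` and `const`; every other keyword is ignored."""
    errors = [
        f"missing required field: {key}"
        for key in schema["required"]
        if key not in card
    ]
    for key, rule in schema["properties"].items():
        if "const" in rule and key in card and card[key] != rule["const"]:
            errors.append(f"{key} must equal {rule['const']!r}")
    return errors
```

In practice a full validator, such as the third-party `jsonschema` package, would likely take the place of `check_card`. The hand-rolled check is here only to keep the sketch free of dependencies.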
Resources
Open Questions
- Should the schema be strict (all MUST fields required) or permissive (flag missing fields)?
- How to handle proprietary models with limited disclosure?
- Versioning strategy for the schema itself?
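The strict versus permissive question can be explored with a small sketch in which one function supports both behaviours. The field names and their MUST/SHOULD/CAN levels below are invented for illustration, and treating an absent CAN field as silent is an assumption as well.

```python
# Hypothetical field names and requirement levels, for illustration only.
FIELD_LEVELS = {
    "name": "MUST",
    "claims": "MUST",
    "evidence": "SHOULD",
    "prompts": "CAN",
}


class MissingFieldError(ValueError):
    """Raised in strict mode when a MUST field is absent."""


def validate(card: dict, strict: bool = True) -> list[str]:
    """Return warnings for absent MUST and SHOULD fields.

    In strict mode an absent MUST field raises MissingFieldError instead.
    Absent CAN fields are never reported.
    """
    missing = [(name, level) for name, level in FIELD_LEVELS.items() if name not in card]
    missing_must = [name for name, level in missing if level == "MUST"]
    if strict and missing_must:
        raise MissingFieldError("missing MUST fields: " + ", ".join(missing_must))
    return [f"{level} field absent: {name}" for name, level in missing if level != "CAN"]
```

A permissive default might suit proprietary models with limited disclosure, since a partial card would still load and carry its warnings. Whether that trade-off is acceptable remains one of the open questions.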
Design Questions
Namespace & Vocabulary
- @context extending schema.org and Croissant
- schema.org types: SoftwareSourceCode, Dataset, CreativeWork
- InterrogatoryModelCard, ModelClaim, EvidenceLink, InterrogatoryPrompt
Relationship to CycloneDX ML-BOM
- modelCard field - should we output compatible fragments?
Evidence Linking Schema
- dataset_version, eval_script_commit, run_hash?
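One way to picture the evidence-linking fields (dataset_version, eval_script_commit, run_hash) is as a small record that serializes to a JSON-LD node. This is a sketch under assumptions: the `imc:` prefix and the camelCase property names are hypothetical, and so is the idea that one link describes a single evaluation run.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvidenceLink:
    """Provenance identifiers for one evaluation run (assumed granularity)."""

    dataset_version: str
    eval_script_commit: str
    run_hash: str

    def to_jsonld(self) -> dict:
        # The "imc:" prefix and property names are placeholders, not an agreed vocabulary.
        return {
            "@type": "imc:EvidenceLink",
            "imc:datasetVersion": self.dataset_version,
            "imc:evalScriptCommit": self.eval_script_commit,
            "imc:runHash": self.run_hash,
        }
```

For example, `EvidenceLink("1.2.0", "abc1234", "deadbeef").to_jsonld()` yields a node that a claim could reference. The three argument values here are made-up placeholders.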