✨ add files and assay model by Christina-J-Diaz · Pull Request #21 · include-dcc/common-access-model

Christina-J-Diaz · 2026-03-18T18:26:10Z

Adding modeling for a files access model and assay access model. still WIP

RobertJCarroll

Hi Christina. It looks like you haven't been able to test this yet. I think the github action for running the build may have not triggered as it was converted from a draft, but I'm not sure. Happy to help get that set up, You won't need copier, but the other prerequisites here should get you started:
https://github.com/dalito/linkml-project-copier?tab=readme-ov-file#prerequisites

I tested locally, and the build is failing. One part that stood out is that the slot definitions should all happen in the slot section (ie, line 637 and beyond in your version). Looking at your yaml, none of the information about descriptions should appear there, for example, this description is already present (broadly) in the slot definition here. We also want to use slot_usage as infrequently as possible. IE, where the names, titles, types, and descriptions can reasonably be the same, we want to preserve that. It looks like you might be trying to define everything within the classes, which is understandable, but not the "linkml way". Slot_usage would mainly be to specify something as the unique key of a class/table or to set something as required in that specific class/table's context.

To get this building, I would recommend starting by moving all of these "slot definition details", eg, the descriptions nested in the class slots section and the other definitions in the class slot_usage section into the high level slots. You'd then only have the names of those slots in the class, eg, like DOI and it's slots do_id and bibliographic_reference.

We can dig a bit deeper on the field-level specifics once it's building- I see some duplicate slots in here, and it'll be easier to review when they are cleaned up. Please let me know if you need more guidance on the "how".

github-actions · 2026-03-27T21:41:17Z

PR Preview Action v1.8.1
🚀 View preview at https://include-dcc.github.io/include-access-model/pr-preview/pr-21/
Built to branch `gh-pages` at 2026-04-17 19:22 UTC. Preview will be ready when the GitHub Pages deployment is complete.

torstees

There are some string types that might be better suited as enums to ensure we are consistent across studies/programs. @RobertJCarroll can say for sure whether these might be better suited as strings, but I thought it was worth flagging.

torstees · 2026-04-02T20:14:19Z

+    description: Identifier for a specific version of the object
+    range: string
+    required: true
+  access_type: 


Should this not be an enumeration?

yea @torstees I agree - i set them as strings for now and was hoping to add some initial Enums I had in mind this week. I have an idea of the KF enums we typically use, but wasn't sure about INCLUDE ones we'd need. Would love to discuss further.

torstees · 2026-04-02T20:16:23Z

+    range: string
+    required: true
+  experimental_strategy: 
+    title: Experimental Strategy


Some of these look like suitable candidates for Enumerations: Exp Strategy, Assay Center, Platform.

Christina-J-Diaz · 2026-04-03T21:29:16Z

@torstees @RobertJCarroll - I added some initial ideas for Enums. I'd like to discuss further enums for data_category and data_types. We have a lot of these enums on the KF side that our bix team has maintained but it doesn't align with what is currently listed in this model.

RobertJCarroll · 2026-04-06T21:09:45Z

We should have a deeper conversation about the enumerations. Some of these are complex enough that they likely require a separate management process if we can't use external sources. EG, the EnumPlatform has two different type of Illumina. One is more specific, but they aren't nested. We also likely want to avoid "other" terms if possible. Up front I'd prefer permitting nulls and adding them in ASAP (or keeping it required with the supports for rapid response changes). Untangling "Did we pick other before we added that one" is pernicious.

With regards to FileAdmin, we probably want to either just have it reference a File or inherit from File- every FileAdmin should have a File, too, right?

Similar with FileAssay- If we want it to extend File that's ok probably, but we should think about where things belong. Would it make more sense for it to just be "Assay" and point to files generated? I do worry a bit about having too many places to link files to individuals or samples, but maybe that's ok?

Christina-J-Diaz · 2026-04-07T23:47:02Z

We should have a deeper conversation about the enumerations. Some of these are complex enough that they likely require a separate management process if we can't use external sources. EG, the EnumPlatform has two different type of Illumina. One is more specific, but they aren't nested. We also likely want to avoid "other" terms if possible. Up front I'd prefer permitting nulls and adding them in ASAP (or keeping it required with the supports for rapid response changes). Untangling "Did we pick other before we added that one" is pernicious.

agree on having a larger convo about the enums - we have a lot of standardization on our end for file enums so curious how that fits into this model and wondering if others have initial feedback too - I don't think I added every single enum (as we have a ton in a master file). @calkinsh @chris-s-friedman @awarkow @allisonheath

With regards to FileAdmin, we probably want to either just have it reference a File or inherit from File- every FileAdmin should have a File, too, right?

Hm yea that'a good point. I took another look at it - and I agree. I think in my head I mixed up some of the logic with inheritance there (since file is a subset of fileAdmin - i got the inheritance direction confused 😅 ).

Similar with FileAssay- If we want it to extend File that's ok probably, but we should think about where things belong. Would it make more sense for it to just be "Assay" and point to files generated? I do worry a bit about having too many places to link files to individuals or samples, but maybe that's ok?

For FileAssay I was imagining a similar scenario with File / FileAdmin. In the sense there is some universal/operational table with the all the relevant FileAssay information we track, and then FileAssay is a subset of that table. I think we can remove the subject_id from that model and have it point to files generated and samples. I think it's okay in this case, so we don't have to join this file assay table to a files table just to see that file --> sample mapping. But yea curious to get more feedback on this one.

…ic.py

Linting requires a specific EFO URI: > warning Schema maps prefix 'EFO' to namespace 'http://www.ebi.ac.uk/efo' instead of namespace 'http://identifiers.org/efo/' (canonical_prefixes)

…ic.py

Somewhat hierarchical platforms with meanings pointing to EFO

…lude-dcc/include-access-model into d3b-2559-add-files-and-assay

RobertJCarroll · 2026-04-24T19:39:31Z

Looking over this PR, it's currently up to speed with main, but we still need some changes on the file metadata organization. @Christina-J-Diaz are you working on those updates?

There's also some changes on the enum side, but that can be in a pass after we have the structure aligned.

Christina-J-Diaz added 4 commits March 16, 2026 19:36

🚧 Add files model ideas

f6527f1

🚧 Adding stuff

038d9b7

♻️ Update files model

b8a58ed

🚧 Working on assay model

93f9b17

Christina-J-Diaz marked this pull request as ready for review March 26, 2026 15:38

Christina-J-Diaz requested review from RobertJCarroll, allisonheath, awarkow, calkinsh and chris-s-friedman March 26, 2026 15:39

💡 Updating comments

fe360b3

RobertJCarroll requested changes Mar 26, 2026

View reviewed changes

🔀 Rebasing with main

48f0bb3

Christina-J-Diaz and others added 4 commits April 2, 2026 09:30

♻️ Move to global slots

51ef4a0

✨ Fixing errors

51b528d

🐛 Fixing errors

c66de5c

🐛 Change file_id slot in file to string

8f135bd

torstees requested changes Apr 2, 2026

View reviewed changes

✨ Add enums

c7db003

Somewhat hierarchical platforms with meanings pointing to EFO

daeb5f5

RobertJCarroll mentioned this pull request Apr 10, 2026

✨ Add Access Policy #33

Merged

RobertJCarroll added 5 commits April 14, 2026 13:36

Delete docs/schema/include_access_model.yaml

3d93201

Delete src/include_access_model/datamodel/include_access_model.py

6574ad1

Delete src/include_access_model/datamodel/include_access_model_pydant…

d740458

…ic.py

🔧 Update EFO URI

cb56705

Linting requires a specific EFO URI: > warning Schema maps prefix 'EFO' to namespace 'http://www.ebi.ac.uk/efo' instead of namespace 'http://identifiers.org/efo/' (canonical_prefixes)

Delete src/include_access_model/datamodel/include_access_model.py

555a7c5

RobertJCarroll added 7 commits April 17, 2026 14:08

Delete src/include_access_model/datamodel/include_access_model_pydant…

59d0d1d

…ic.py

Merge pull request #32 from include-dcc/platform-thoughts

7f0322d

Somewhat hierarchical platforms with meanings pointing to EFO

📝 Add FileAdmin title

39c8e21

Merge branch 'd3b-2559-add-files-and-assay' of https://github.com/inc…

4f31fe7

…lude-dcc/include-access-model into d3b-2559-add-files-and-assay

Delete docs/.DS_Store

9adeeee

Merge branch 'main' into d3b-2559-add-files-and-assay

72f0cbf

✨ Ignore DS_Store files

8389894

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

✨ add files and assay model#21

✨ add files and assay model#21
Christina-J-Diaz wants to merge 24 commits into
mainfrom
d3b-2559-add-files-and-assay

Christina-J-Diaz commented Mar 18, 2026

Uh oh!

RobertJCarroll left a comment

Uh oh!

github-actions Bot commented Mar 27, 2026 •

edited

Loading

Built to branch `gh-pages` at 2026-04-17 19:22 UTC.
Preview will be ready when the GitHub Pages deployment is complete.

Uh oh!

torstees left a comment

Uh oh!

torstees Apr 2, 2026

Uh oh!

Christina-J-Diaz Apr 2, 2026

Uh oh!

torstees Apr 2, 2026

Uh oh!

Christina-J-Diaz commented Apr 3, 2026

Uh oh!

RobertJCarroll commented Apr 6, 2026

Uh oh!

Christina-J-Diaz commented Apr 7, 2026

Uh oh!

RobertJCarroll commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Christina-J-Diaz commented Mar 18, 2026

Uh oh!

RobertJCarroll left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Mar 27, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Built to branch gh-pages at 2026-04-17 19:22 UTC. Preview will be ready when the GitHub Pages deployment is complete.

Uh oh!

torstees left a comment

Choose a reason for hiding this comment

Uh oh!

torstees Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

Christina-J-Diaz Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

torstees Apr 2, 2026

Choose a reason for hiding this comment

Uh oh!

Christina-J-Diaz commented Apr 3, 2026

Uh oh!

RobertJCarroll commented Apr 6, 2026

Uh oh!

Christina-J-Diaz commented Apr 7, 2026

Uh oh!

RobertJCarroll commented Apr 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

github-actions Bot commented Mar 27, 2026 •

edited

Loading

Built to branch `gh-pages` at 2026-04-17 19:22 UTC.
Preview will be ready when the GitHub Pages deployment is complete.