Skip to content

Add 5 new formats: CDF, FITS, ASDF, Zarr, and GPX#46

Merged
mbjones merged 5 commits into
DataONEorg:developfrom
rebeccaringuette:main
Jan 5, 2026
Merged

Add 5 new formats: CDF, FITS, ASDF, Zarr, and GPX#46
mbjones merged 5 commits into
DataONEorg:developfrom
rebeccaringuette:main

Conversation

@rebeccaringuette
Copy link
Copy Markdown
Contributor

See issues #41, #42, #43, #44, and #45 for discussion of the new formats, particularly the MIME mediaTypes.

Added GPX, CDF, FITS, ASDF, and Zarr. Those without registered MIME types were added with the mediaType fields left blank (GPX, ASDF, and Zarr).
updating MIME types based on discussions on the open issues.
Copy link
Copy Markdown
Member

@mbjones mbjones left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This looks great! Thanks @rebeccaringuette so much. A few things we should consider and changes to be made. See my inline code comments too:

  1. We need to update the total object count in the root element <d1v2:objectFormatList count="155" start="0" total="155" to a count of 160
  2. For the x-* mime types, I like your choices of mime type and formatId. But for the FITS standard, because the MIME type is an accepted and registered type (application/fits), we generally try to use that as the formatId as well. How does that sound?
  3. Can you cluster all of the new formats at the end, as we have been just appending them in the past in chronological order.

Comment thread objectFormatListV2.xml Outdated
<extension>xml</extension>
</objectFormat>
<objectFormat>
<formatId>CDF</formatId>
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is application/x-cdf well-enough established to make a good formatId as well? see general comments in review.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It'd be more useful if the formatID could stay as CDF for user-friendliness when this gets plugged into Zenodo.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@datadavev thoughts on this decision? In the past we;ve tried to use the mime type when it makes sense (e.g., text/csv) and left the human-readable name to the formatName field.

In general, @rebeccaringuette , I think UIs should be using formatName in displays and formatId as the unique identifier of the format.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh yeah, that makes sense. I'll change it for CDF and FITS.

Comment thread objectFormatListV2.xml Outdated
<formatName>Common Data Format (CDF)</formatName>
<formatType>DATA</formatType>
<mediaType name="application/x-cdf"/>
<extension>ncdf</extension>
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

is this the correct extension, or is it cdf?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

no, cdf is correct. Typo fixed.

Comment thread objectFormatListV2.xml Outdated
<extension>xml</extension>
</objectFormat>
<objectFormat>
<formatId>FITS</formatId>
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I suggest this formatId should be application/fits. See review comments.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am expecting Zenodo to use the formatID as the item in the drop down list. It would be more user-friendly if the formatID could stay as FITS. Will that work?

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

same discussion as above -- maybe Zenodo can/should use formatName for display? Let's see what Dave and others pitch in...

Comment thread objectFormatListV2.xml Outdated
<extension>nc</extension>
</objectFormat>
<objectFormat>
<formatId>Zarr</formatId>
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is application/x-zarr well-enough established to use it as the formatId?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, they are still debating this, but also see response on the FITS formatID.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ok, sounds good.

Comment thread objectFormatListV2.xml Outdated
<extension>zarr</extension>
</objectFormat>
<objectFormat>
<formatId>GPX-1.1</formatId>
Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the 1.1 version of GPX not self-describing? Can we generalize this to GPX? Generally its better to only have one formatId if it suffices across versions. @iannesbitt thoughts?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I have no idea on this one. I have never heard of a GPX file before.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@iannesbitt can you weigh in on this GPX question please?

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I do need these new items included before February for Zenodo to build with. If including the GPX term will delay this, then I (or you) could remove it easily and proceed. Github's notification emails for this thread aren't being sent to me anymore, but I will check again in January. Thanks.

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@mbjones @rebeccaringuette Sorry for the delay, I didn't get a notification for this for some reason. Yes, GPX files are self-describing. I think we should go ahead with generalizing in this case. The formatId should be GPX and not GPX-1.1.

Copy link
Copy Markdown
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Got it. I updated the formatID and name to remove the 1.1. Is this ready?

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@rebeccaringuette yes, that looks correct! Thanks

@rebeccaringuette
Copy link
Copy Markdown
Contributor Author

Items 1 and 3 done, comments left on item 2.

addressed items 1 and 3.
updating the formatID for CDF and FITS to be the MIME type
updating the GPX details per feedback
@mbjones mbjones changed the base branch from main to develop January 5, 2026 18:17
mbjones added a commit that referenced this pull request Jan 5, 2026
Modify the name for ASDF to remove the commentary.
Modify the formatId for GPX to use the mime type.

Merged PR #46 into develop from @rebeccaringuette.
@mbjones mbjones merged commit acfd485 into DataONEorg:develop Jan 5, 2026
1 check passed
@mbjones
Copy link
Copy Markdown
Member

mbjones commented Jan 5, 2026

@rebeccaringuette I merged your PR into the develop branch, but changed a couple of things:

  • changed the name of ASDF to remove the commentary
  • changed the formatId of gpx to use the mime-type

You can see the final proposed changes in the develop branch objectFormatListV2.xml

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants