Skip to content

Conversation

@tjacovich
Copy link
Contributor

@tjacovich tjacovich commented Apr 25, 2022

  • Sanitize DOIs that have trailing characters or are merged duplicates
  • Modify maintenance_reevaluate to check for bad DOIs in database
  • Added function to update any alternate bibcodes to uppercase in parsed_metadata
  • Modified populate_bibcode_column to respect curated_metadata but not overwrite parsed_metadata.
  • Fixed alembic revisions to respect column contents on downgrade.

…odes are handled for maintenance_curation and maintenance_metadata.
…and then create new citations to the sanitized records.
…cond regex to catch malformed records of the form zenodo/123456.
…data. Added check on populate_bibcode_column to make sure it is not run on a db with curated metadata already present.
@tjacovich tjacovich marked this pull request as ready for review April 27, 2022 13:59
@tjacovich
Copy link
Contributor Author

Removed alt_bibcode fix for alembic from this PR. Modified alembic revisions that change 'status' column so they preserve column values.

tjacovich added 9 commits May 19, 2022 20:16
… applied to software with versions but none that exist in ADS.
…s_expanded_raw_data() to look at raw_content column instead of content column for reconstruction.
…anitize new dois and to only mark registered citations as sanitized. Added sanitized targets to db in order to facilitate downgrade. Modified reevaluate to handle sanitizing records in a way consistent with new citation processing.
@tjacovich tjacovich marked this pull request as draft June 26, 2025 20:45
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant