Conversation

@J535D165
Collaborator

Usually, I'm not a big fan of solutions like this, but given the importance of performance, I think this pragmatic solution can be acceptable. I utilize large, real-world datasets to benchmark the parser's performance and frequently need to switch between branches.

Btw, I'm making nice progress on the PubMed parsing PR, but there are still some open challenges. Performance is one of them.
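
For context, a minimal sketch of what such a branch-to-branch timing run could look like (the `export.ris` path and repeat count are placeholders, not the actual benchmark setup):

```python
# Rough timing harness: parse the same file a few times and keep the best run.
import time

import rispy

def benchmark(path="export.ris", repeats=5):
    timings = []
    for _ in range(repeats):
        with open(path, encoding="utf-8") as f:
            start = time.perf_counter()
            entries = rispy.load(f)
            timings.append(time.perf_counter() - start)
    print(f"parsed {len(entries)} records; best of {repeats}: {min(timings):.3f}s")

if __name__ == "__main__":
    benchmark()
```

Running the same script from each branch against the same file gives a crude but repeatable comparison.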

@J535D165 requested a review from shapiromatron May 23, 2025 22:13
@shapiromatron self-assigned this May 28, 2025
Collaborator

@shapiromatron left a comment

Seems like an ok solution, though I wonder if it'd be better to see whether a synthetic benchmark dataset could be generated instead of hidden data that can only be used by one of our repository collaborators.

# created from tests
export.ris

# extra benchmark data only for internal use (because of copyright)
Collaborator

Any chance we could create some synthetic data using something like faker? https://github.com/joke2k/faker
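
To sketch that idea (the file name, tag set, and record count below are just illustrative), something like this could emit an arbitrarily large synthetic RIS file:

```python
# Sketch: generate fake RIS journal records with Faker for benchmarking.
from faker import Faker

fake = Faker()
Faker.seed(0)  # reproducible output

def fake_record():
    lines = [
        "TY  - JOUR",
        *[f"AU  - {fake.last_name()}, {fake.first_name()}" for _ in range(3)],
        f"TI  - {fake.sentence(nb_words=10)}",
        f"JO  - {fake.company()}",
        f"PY  - {fake.year()}",
        f"AB  - {fake.paragraph(nb_sentences=5)}",
        "ER  - ",
    ]
    return "\n".join(lines)

with open("synthetic.ris", "w", encoding="utf-8") as f:
    f.write("\n".join(fake_record() for _ in range(10_000)) + "\n")
```

The caveat is that synthetic records won't reproduce the field-length and tag-frequency quirks of real exports, so the timings may not track the internal benchmark data exactly, but it would let anyone run the benchmark.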
