Hi, I have a couple of questions about fine-tuning the UDOP model:
-
For key-value extraction, a sample CORD dataset is used. Is there any resource or guideline available to understand how this dataset is structured so that we can format our own data accordingly?
-
The current notebook supports single-page document classification. What modifications would be needed to extend it for multi-page document classification during fine-tuning?
Looking forward to your insights. Thanks!
Hi, I have a couple of questions about fine-tuning the UDOP model:
For key-value extraction, a sample CORD dataset is used. Is there any resource or guideline available to understand how this dataset is structured so that we can format our own data accordingly?
The current notebook supports single-page document classification. What modifications would be needed to extend it for multi-page document classification during fine-tuning?
Looking forward to your insights. Thanks!