hi Abraham! i ran your algorithm with some series subtitles (in italian), but the result is quite akward... surely due to the small dimension of my dataset (60k lines)... i was wondering, maybe i could try helping the training process with embeddings...
i found some embeddings file in italian... but most of it are not in the form you required (TF checkpoints)
they come with .m extensions or .npy ect... nothing seems to fit the one your algorithm can process
do you think is possibile, in a few lines (i don't want to excessively bother you) to explain to me how to create a brand new embedding checkpoint (from scratch or converting one already built), or tell me where i can check in github projects?
thank you!!
Matteo
hi Abraham! i ran your algorithm with some series subtitles (in italian), but the result is quite akward... surely due to the small dimension of my dataset (60k lines)... i was wondering, maybe i could try helping the training process with embeddings...
i found some embeddings file in italian... but most of it are not in the form you required (TF checkpoints)
they come with .m extensions or .npy ect... nothing seems to fit the one your algorithm can process
do you think is possibile, in a few lines (i don't want to excessively bother you) to explain to me how to create a brand new embedding checkpoint (from scratch or converting one already built), or tell me where i can check in github projects?
thank you!!
Matteo