Skip to content

embeddings #27

@matteogabella

Description

@matteogabella

hi Abraham! i ran your algorithm with some series subtitles (in italian), but the result is quite akward... surely due to the small dimension of my dataset (60k lines)... i was wondering, maybe i could try helping the training process with embeddings...
i found some embeddings file in italian... but most of it are not in the form you required (TF checkpoints)
they come with .m extensions or .npy ect... nothing seems to fit the one your algorithm can process
do you think is possibile, in a few lines (i don't want to excessively bother you) to explain to me how to create a brand new embedding checkpoint (from scratch or converting one already built), or tell me where i can check in github projects?
thank you!!
Matteo

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions