The purpose of this issue is to brainstorm what should be implemented to get Simplexity into better working shape for general use, with a focus on reproducibility, organization, and ease of use. Some things that come to mind off the top of my head:
- Seems like everything should go through MLflow, including artifacts, saved checkpoints, and maybe even analysis results?
- @ealt mentioned a tagging system for deciding what should be saved (or not).
- Currently we have to set seq_len, n_ctx, vocab size, bos, etc., and it's hard to think through how they all relate. In the standard case, defining one or two of those should set the rest.
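
To make the last point concrete, here is a rough sketch of what "define one or two, derive the rest" could look like. Everything in it is hypothetical: the `SequenceConfig` name, the `base_vocab_size` field, and especially the relationships themselves (BOS gets the next free token id, and the model sees all tokens but the last as inputs) are assumptions for discussion, not how Simplexity currently works.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SequenceConfig:
    """Hypothetical config: the user sets n_ctx and the generator's vocab size."""

    n_ctx: int               # model context length (number of input positions)
    base_vocab_size: int     # tokens emitted by the generative process itself
    use_bos: bool = True     # whether a BOS token is prepended

    @property
    def seq_len(self) -> int:
        # Assumption: we need n_ctx inputs plus one final target token,
        # and a prepended BOS occupies one of the input positions.
        return self.n_ctx + 1 - int(self.use_bos)

    @property
    def vocab_size(self) -> int:
        # Assumption: BOS adds one extra token to the model vocabulary.
        return self.base_vocab_size + int(self.use_bos)

    @property
    def bos_token(self) -> Optional[int]:
        # Assumption: BOS takes the first id after the generator's tokens.
        return self.base_vocab_size if self.use_bos else None


cfg = SequenceConfig(n_ctx=8, base_vocab_size=3)
print(cfg.seq_len, cfg.vocab_size, cfg.bos_token)
```

If the real relationships differ, the derived properties are the only place that would need to change, which is the main appeal of this kind of layout.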