Currently, this program just receives parameters, transcribe, and then exit. It's not very convenient if I want to call it from my program. And the model loading, resources initializing costs are necessary every time.
If it provides a REPL mode, other programs can interacts with it via STDIO.
I have already built a usable version in my fork, if you need, I can help to implement the REPL mode.