Hey, interesting repo. Unfortunately it looks like OpenAI's gpt-3.5-turbo-1106 model limits output to 4096 tokens, as does the gpt-4-turbo model.
Claude has also capped every model's output at 4096 tokens.
OpenAI's 0613 models will be deprecated in July this year, on Azure as well. Soon no major provider (other than OSS LLMs) will offer >4096 output tokens. IMO this is going to hit a lot of use cases hard.
Have you had any thoughts on this?
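One common workaround for the 4096-token output cap is to re-prompt the model to continue whenever it stops with `finish_reason == "length"` and stitch the chunks together. Here's a minimal sketch; `call_model` is a hypothetical stand-in for any chat-completion call (e.g. OpenAI's `client.chat.completions.create`), and the function name and continuation prompt are illustrative, not part of this repo:

```python
# Sketch: stitch a long generation together by re-prompting whenever the
# model hits its output cap (signaled by finish_reason == "length").
# `call_model` is a hypothetical callable: (messages) -> (text, finish_reason).

def generate_long(call_model, messages, max_rounds=8):
    """Keep asking the model to continue until it stops naturally."""
    parts = []
    for _ in range(max_rounds):
        text, finish_reason = call_model(messages)
        parts.append(text)
        if finish_reason != "length":
            break
        # Feed the truncated output back and ask for the continuation.
        messages = messages + [
            {"role": "assistant", "content": text},
            {"role": "user", "content": "Continue exactly where you left off."},
        ]
    return "".join(parts)
```

This doesn't fully solve it (the model can drift or repeat at the seam, and it multiplies input-token cost since the prefix is resent each round), but it's the usual stopgap when a single completion can't exceed 4096 tokens.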