@open-thoughts

OpenThoughts

Open collaborations on data-centric research

https://openthoughts.ai

A community effort to curate the best open post-training datasets.

We are currently working on OpenThoughts-Agent, a collaboration building the best open agent training datasets.

Our first project was curating open reasoning data recipes. OpenThoughts3, our best reasoning dataset recipe, is detailed in our release blog and the full paper.

About us

We are a team of researchers and engineers from Bespoke Labs, Stanford, University of California, Berkeley, University of Washington, Juelich Supercomputing Center (JSC), LAION, UCLA, UNC Chapel Hill, and Toyota Research Institute, united around building the best datasets (and thus the best models). See our previous work at datacomp.ai and mlfoundations.

Open Thoughts is supported by Bespoke Labs, Lambda Labs, NSF IFML, Juelich Supercomputing Center, and Toyota Research Institute.

Pinned

  1. open-thoughts (Public)

    Fully open data curation for reasoning models

    Python, 2.2k stars, 179 forks

Top languages

Python MDX