---
# Thank you for contributing!
# In filling out this yaml file, please follow the criteria as described here:
# https://osai-index.eu/contribute
# You're free to build on this work and reuse the data. It is licensed under CC-BY 4.0, with the
# stipulation that attribution should come in the form of a link to https://osai-index.eu/
# and a citation to the peer-reviewed paper in which the dataset & criteria were published:
# Liesenfeld, A. and Dingemanse, M., 2024. Rethinking open source generative AI: open-washing and the EU AI Act. In Proceedings of the 2024 ACM Conference on Fairness, Accountability, and Transparency (pp. 1774-1787).
# Organization tags:
# - National origin: United States
# - Contributor type: Academic (Research institution)
system:
    name: OLMo
    link: https://huggingface.co/allenai/Olmo-3.1-32B-Think
    type: text
    performanceclass: full
    basemodelname: Olmo-3-1125-32B
    endmodelname: Olmo-3.1-32B-Think
    endmodellicense: Apache-2.0
    releasedate: 2025-12
    notes: Open LLM trained from scratch by Allen AI.
org:
    name: Ai2
    link: https://allenai.org
    notes: Allen Institute for AI (non-profit research institute)
# availability:
datasources_basemodel:
    class: open
    link: ["https://huggingface.co/datasets/allenai/dolma3_mix-5.5T-1125", "https://huggingface.co/datasets/allenai/dolma3_longmino_mix-100B-1125", "https://huggingface.co/datasets/allenai/dolma3_dolmino_mix-100B-1125"]
    notes: Training data for base model released and use documented.
datasources_endmodel:
    class: open
    link: https://huggingface.co/datasets/allenai/Dolci-Think-RL-32B
    notes: Data for fine-tuning published in a well-organized manner.
weights_basemodel:
    class: open
    link: https://huggingface.co/allenai/Olmo-3-1125-32B
    notes: Model weights made available on HuggingFace.
weights_endmodel:
    class: open
    link: https://huggingface.co/allenai/Olmo-3.1-32B-Think
    notes: Model weights made available on HuggingFace.
trainingcode:
    class: open
    link: ["https://github.com/allenai/open-instruct", "https://github.com/allenai/OLMo-core"]
    notes: Multiple repos with training, architecture and fine-tuning code available.
# documentation:
code:
    class: open
    link: ["https://github.com/allenai/open-instruct", "https://github.com/allenai/OLMo-core"]
    notes: Repositories and code well-described, commented and documented.
hardware_architecture:
    class: open
    link: https://aclanthology.org/2024.acl-long.841/
    notes: Architecture documented in requisite detail.
preprint:
    class: open
    link: https://huggingface.co/papers/2512.13961
    notes: Pre-print goes into impressive detail about the data, training process, architecture, and evaluation.
paper:
    class: partial
    link: https://openreview.net/forum?id=2ezugTT9kU#discussion
    notes: Conference paper published at COLM 2025. Further papers may be under review.
modelcard:
    class: open
    link: https://huggingface.co/allenai/Olmo-3.1-32B-Think
    notes: Model card provides broad overview and links to full details.
datasheet:
    class: open
    link: ["https://huggingface.co/datasets/allenai/dolma3_mix-5.5T-1125", "https://huggingface.co/datasets/allenai/dolma3_longmino_mix-100B-1125", "https://huggingface.co/datasets/allenai/dolma3_dolmino_mix-100B-1125", "https://huggingface.co/datasets/allenai/Dolci-Think-RL-32B", "https://huggingface.co/datasets/allenai/Dolci-Instruct-RL-7B", "https://huggingface.co/datasets/allenai/Dolci-Think-DPO-32B", "https://huggingface.co/datasets/allenai/Dolci-Instruct-DPO-7B", "https://huggingface.co/datasets/allenai/Dolci-Think-SFT-32B", "https://huggingface.co/datasets/allenai/Dolci-Instruct-SFT-7B"]
    notes: Data sheets are well-documented and provide requisite info.
# access:
licenses:
    class: open
    link: https://huggingface.co/allenai/Olmo-3.1-32B-Think#model-description
    notes: Apache 2.0, an OSI-approved license.