Mapping 1,000+ Language Models via the Log-Likelihood Vector
Momose Oyama, Hiroaki Yamagiwa, Yusuke Takase, Hidetoshi Shimodaira
arXiv:2502.16173 | Accepted to ACL 2025 (main conference)

- `models_1018.yaml`
  A list of the 1,018 model names used in our research.
- `model-data-1018.pkl`
  Collected metadata for the 1,018 models, including `model_type`, `model_size`, and other model attributes.
  Usage examples can be found in `load_model-data.ipynb`.
- `texts-10k-pile.jsonl`
  A JSONL file containing 10,000 text chunks from the Pile dataset. Each line in the file is a JSON object representing one chunk, with fields such as `text`, `pile_set_name`, and indexing metadata.
- `raw_log-likelihood_1018.pkl`
  Log-likelihood data for the 1,018 language models, computed on the `texts-10k-pile.jsonl` dataset.
  Usage examples can be found in `load_log-likelihood.ipynb`.
- `clipped_log-likelihood_1018.pkl`
  Log-likelihood data for the 1,018 language models on the same dataset, derived from `raw_log-likelihood_1018.pkl` by clipping the bottom 2% of the values.
  Usage examples can be found in `load_log-likelihood.ipynb`.
- `calculate-log-likelihood.ipynb`
  Jupyter notebook containing sample code for calculating log-likelihood.
- For the code that predicts model performance from model coordinates (Section 5) and generates the LaTeX file listing the 1,018 models (Appendix L), see `code_for_Section_5_and_Appendix_L/`.
- For the code for the weight-interpolation experiments (Section 6.3 and Appendix J), see `code_for_weight_interpolation`.
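
As a rough sketch of how the released data files might be read: the authoritative usage examples are the `load_model-data.ipynb` and `load_log-likelihood.ipynb` notebooks, and the dict layout below is purely an illustrative assumption, so miniature stand-in files are created here instead of the real ones.

```python
import json
import pickle
import tempfile
from pathlib import Path

tmp = Path(tempfile.mkdtemp())

# Stand-in for model-data-1018.pkl: assumed here to be a dict keyed by
# model name (the real structure is shown in load_model-data.ipynb).
metadata = {"example/model-1b": {"model_type": "decoder", "model_size": 1_000_000_000}}
(tmp / "model-data.pkl").write_bytes(pickle.dumps(metadata))

# Stand-in for texts-10k-pile.jsonl: one JSON object per line.
chunk = {"text": "Hello world.", "pile_set_name": "Wikipedia (en)"}
(tmp / "texts.jsonl").write_text(json.dumps(chunk) + "\n")

# Read both back the way the released files would be read.
meta = pickle.loads((tmp / "model-data.pkl").read_bytes())
chunks = [json.loads(line) for line in (tmp / "texts.jsonl").open()]

print(meta["example/model-1b"]["model_type"])  # decoder
print(chunks[0]["pile_set_name"])              # Wikipedia (en)
```

The same `pickle.loads` / line-by-line `json.loads` pattern applies to the real `model-data-1018.pkl` and `texts-10k-pile.jsonl` files.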

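The clipping that produces `clipped_log-likelihood_1018.pkl` from the raw data can be sketched as follows. This is a minimal reconstruction on synthetic data, assuming the log-likelihoods form a models × texts matrix and that "clipping the bottom 2%" means raising each model's values below its 2nd percentile up to that percentile; the released notebooks define the exact procedure.

```python
import numpy as np

def clip_bottom(ll, q=2.0):
    """Clip values below the q-th percentile of each model's row up to
    that percentile (an assumed per-model reading of 'bottom 2%')."""
    lo = np.percentile(ll, q, axis=1, keepdims=True)  # per-row threshold
    return np.maximum(ll, lo)

# Synthetic stand-in: 4 models x 1,000 texts of log-likelihood values.
rng = np.random.default_rng(0)
ll = rng.normal(loc=-3.0, scale=1.0, size=(4, 1000))
clipped = clip_bottom(ll)

# Clipping only raises extreme low values; everything else is unchanged.
print((clipped >= ll).all())  # True
```

Clipping the extreme low tail keeps a few badly modeled text chunks from dominating distances between the models' log-likelihood vectors.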