Awesome papers on Large Language Models (LLMs), covering state-of-the-art methods in algorithms, systems, SFT, RL, multimodal LLMs, MoE, quantization, and applications (RAG, agents, coding).
- 2017 (OpenAI) (NIPS) [RLHF] Deep Reinforcement Learning from Human Preferences
- 2018 (OpenAI) (Arxiv) [GPT-1] Improving Language Understanding by Generative Pre-Training
- 2019 (OpenAI) (Arxiv) [GPT-2] Language Models are Unsupervised Multitask Learners
- 2019 (OpenAI) (Arxiv) [Sparse Transformers] Generating Long Sequences with Sparse Transformers
- 2020 (OpenAI) (Arxiv) [GPT-3] Language Models are Few-Shot Learners
- 2020 (OpenAI) (Arxiv) [Scaling laws] Scaling Laws for Neural Language Models
- 2021 (OpenAI) (Arxiv) [Code] Evaluating Large Language Models Trained on Code
- 2021 (OpenAI) (Arxiv) [DALL-E] Zero-Shot Text-to-Image Generation
- 2021 (OpenAI) (ICML) [CLIP] Learning Transferable Visual Models From Natural Language Supervision
- 2020 (OpenAI) (NIPS) Learning to summarize from human feedback
- 2022 (OpenAI) (Arxiv) [DALL-E-2] Hierarchical Text-Conditional Image Generation with CLIP Latents
- 2022 (OpenAI) (Arxiv) [InstructGPT] [RLHF] Training language models to follow instructions with human feedback
- 2022 (OpenAI) (Arxiv) [WebGPT] WebGPT - Browser-assisted question-answering with human feedback
- 2022 (OpenAI) (ICML) [GLIDE] GLIDE - Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
- 2023 (Microsoft) (Arxiv) Sparks of Artificial General Intelligence - Early experiments with GPT-4
- 2023 (OpenAI) (Arxiv) [DALL-E-3] Improving Image Generation with Better Captions
- 2023 (OpenAI) (Arxiv) [GPT4] GPT-4 Technical Report
- 2023 (OpenAI) GPT-4V(ision) System Card
- 2023 (OpenAI) [Math] Let’s Verify Step by Step
- 2024 (OpenAI) GPT-4o System Card
- 2024 (OpenAI) OpenAI o1 System Card
- 2025 (OpenAI) Competitive Programming with Large Reasoning Models
- 2025 (OpenAI) Deep Research System Card
- 2025 (OpenAI) GPT-4.5 System Card
- 2025 (OpenAI) GPT-5 System Card
- 2025 (OpenAI) OpenAI o3 and o4-mini System Card
- 2025 (OpenAI) OpenAI o3-mini System Card
- 2013 (Google) (NIPS) [Word2vec] Distributed Representations of Words and Phrases and their Compositionality
- 2014 (Google) (NIPS) [Seq2Seq] Sequence to Sequence Learning with Neural Networks
- 2017 (Google) (NIPS) [Transformer] Attention Is All You Need
- 2019 (Google) (NAACL) [BERT] BERT - Pre-training of Deep Bidirectional Transformers for Language Understanding
- 2020 (Google) (ICLR) [ALBERT] ALBERT - A Lite BERT for Self-supervised Learning of Language Representations
- 2020 (Google) (JMLR) [T5] Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
- 2021 (Google) (ICLR) [ViT] An Image is Worth 16x16 Words - Transformers for Image Recognition at Scale
- 2022 (Google) (Arxiv) [PaLM] PaLM - Scaling Language Modeling with Pathways
- 2022 (Google) (Arxiv) [Retro] Improving language models by retrieving from trillions of tokens
- 2022 (Google) (JMLR) [SwitchTransformers] Switch Transformers - Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
- 2022 (Google) (NIPS) [CoT] Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
- 2022 (Google) (TMLR) [Emergent] Emergent Abilities of Large Language Models
- 2022 (Google) (NIPS) Large Language Models are Zero-Shot Reasoners
- 2023 (DeepMind) (NIPS) [ToT] Tree of Thoughts - Deliberate Problem Solving with Large Language Models
- 2023 (Google) (Arxiv) [SIGLIP] Sigmoid Loss for Language Image Pre-Training
- 2023 (Google) (Arxiv) PaLM 2 Technical Report
- 2023 (Google) (ICLR) Self-Consistency Improves Chain of Thought Reasoning in Language Models
- 2023 (Google) (ICLR) [ReAct] ReAct - Synergizing Reasoning and Acting in Language Models
- 2023 (Google) (ICML) [PaLM-E] PaLM-E - An Embodied Multimodal Language Model
- 2024 (Google) (Arxiv) Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
- 2024 (Google) (Arxiv) [Gemini 1.5] Gemini 1.5 - Unlocking multimodal understanding across millions of tokens of context
- 2024 (Google) (Arxiv) [Gemini] Gemini - A Family of Highly Capable Multimodal Models
- 2024 (Google) (Arxiv) [Gemma] Gemma - Open Models Based on Gemini Research and Technology
- 2024 (Google) (ICLR) [OPRO] Large Language Models as Optimizers
- 2025 (Google) (Arxiv) [SIGLIP2] SigLIP 2 - Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
- 2025 (Google) (Arxiv) [Gemini 2.5] Gemini 2.5 - Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
- 2025 (Google) (Arxiv) [Gemma3] Gemma 3 Technical Report
- 2024.01 (DeepSeek) DeepSeek LLM - Scaling Open-Source Language Models with Longtermism
- 2024.01 (DeepSeek) [DeepSeekMoE] DeepSeekMoE - Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
- 2024.03 (DeepSeek) [DeepSeek-VL] DeepSeek-VL - Towards Real-World Vision-Language Understanding
- 2024.04 (DeepSeek) [DeepSeekMath] [GRPO] DeepSeekMath - Pushing the Limits of Mathematical Reasoning in Open Language Models
- 2024.06 (DeepSeek) [DeepSeek-Coder-V2] DeepSeek-Coder-V2 - Breaking the Barrier of Closed-Source Models in Code Intelligence
- 2024.06 (DeepSeek) [DeepSeek-Coder] DeepSeek-Coder - When the Large Language Model Meets Programming - The Rise of Code Intelligence
- 2024.06 (DeepSeek) [DeepSeek-V2] DeepSeek-V2 - A Strong, Economical, and Efficient Mixture-of-Experts Language Model
- 2025.01 (DeepSeek) [DeepSeek-R1] DeepSeek-R1 - Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
- 2025.02 (DeepSeek) [DeepSeek-V3] DeepSeek-V3 Technical Report
- 2025.09 (DeepSeek) (Nature) [DeepSeek-R1] DeepSeek-R1 incentivizes reasoning in LLMs through reinforcement learning
- 2025.10 (DeepSeek) [DeepSeek-OCR] DeepSeek-OCR - Contexts Optical Compression
- 2025.11 (DeepSeek) [DeepSeekMath-V2] DeepSeekMath-V2 - Towards Self-Verifiable Mathematical Reasoning
- 2025.12 (DeepSeek) Insights into DeepSeek-V3 - Scaling Challenges and Reflections on Hardware for AI Architectures
- 2025.12 (DeepSeek) [DeepSeek-V3.2] DeepSeek-V3.2 - Pushing the Frontier of Open Large Language Models
- 2026.01 (DeepSeek) DeepSeek-R1 Thoughtology - Let’s think about LLM reasoning
- 2026.01 (DeepSeek) [mHC] mHC - Manifold-Constrained Hyper-Connections
- 2026.04 (DeepSeek) [DeepSeek-V4] DeepSeek-V4 - Towards Highly Efficient Million-Token Context Intelligence
- 2023 (Alibaba) (Arxiv) [Qwen] Qwen Technical Report
- 2023 (Alibaba) (Arxiv) [Qwen-VL] Qwen-VL - A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
- 2024 (Alibaba) (Arxiv) [Qwen2] Qwen2 Technical Report
- 2025 (Alibaba) (Arxiv) [Qwen2.5] Qwen2.5 Technical Report
- 2025 (Alibaba) (Arxiv) [Qwen2.5-VL] Qwen2.5-VL Technical Report
- 2025 (Alibaba) (Arxiv) [Qwen3 Embedding] Qwen3 Embedding - Advancing Text Embedding and Reranking Through Foundation Models
- 2025 (Alibaba) (Arxiv) [Qwen3-VL] Qwen3-VL Technical Report
- 2025 (Alibaba) (Arxiv) [Qwen3] Qwen3 Technical Report
- 2025 (Alibaba) (NIPS) [Gated Attention] Gated Attention for Large Language Models - Non-linearity, Sparsity, and Attention-Sink-Free
- 2020 (Meta) (NIPS) [RAG] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
- 2023 (Meta) (Arxiv) [LLaMA-2] Llama 2 - Open Foundation and Fine-Tuned Chat Models
- 2023 (Meta) (Arxiv) [LLaMA] LLaMA - Open and Efficient Foundation Language Models
- 2023 (Meta) (Arxiv) [Toolformer] Toolformer - Language Models Can Teach Themselves to Use Tools
- 2024 (Arxiv) [TinyLlama] TinyLlama - An Open-Source Small Language Model
- 2024 (Meta) (Arxiv) [Code Llama] Code Llama - Open Foundation Models for Code
- 2024 (Meta) (Arxiv) [LLaMA3] The Llama 3 Herd of Models
- 2022 (Zhipu) (ACL) GLM - General Language Model Pretraining with Autoregressive Blank Infilling
- 2023 (Zhipu) (ICLR) GLM-130B - An Open Bilingual Pre-trained Model
- 2024 (Zhipu) ChatGLM - A Family of Large Language Models from GLM-130B to GLM-4 All Tools
- 2025 (Zhipu) GLM-4.5 - Agentic, Reasoning, and Coding (ARC) Foundation Models
- 2026 (Zhipu) GLM-4.5V and GLM-4.1V-Thinking - Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
- 2026 (Zhipu) GLM-5 - From Vibe Coding to Agentic Engineering
- 2023 (Microsoft) (NIPS) [LLaVA] Visual Instruction Tuning
- 2023 (Mistral) (Arxiv) Mistral 7B
- 2025 (Moonshot AI) (Arxiv) Kimi Linear - An Expressive, Efficient Attention Architecture
- 2019 (EMNLP) [Sentence-BERT] Sentence-BERT - Sentence Embeddings using Siamese BERT-Networks
- 2019 (Google) (Arxiv) [MQA] Fast Transformer Decoding - One Write-Head is All You Need
- 2019 (NIPS) [RMSNorm] Root Mean Square Layer Normalization
- 2019 (OpenAI) (Arxiv) [Sparse Transformers] Generating Long Sequences with Sparse Transformers
- 2020 (Google) (Arxiv) GLU Variants Improve Transformer
- 2020 (Microsoft) (ICML) On Layer Normalization in the Transformer Architecture
- 2022 (Microsoft) [DeepNorm] DeepNet - Scaling Transformers to 1,000 Layers
- 2023 (Google) (EMNLP) [GQA] GQA - Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints
- 2024 (Meta) [MTP] Better & Faster Large Language Models via Multi-token Prediction
- 2025 (ICLR) [Gated DeltaNet] Gated Delta Networks - Improving Mamba2 with Delta Rule
- 2020 (Google) (ICLR) [Reformer] Reformer - The Efficient Transformer
- 2020 (Arxiv) [Longformer] Longformer - The Long-Document Transformer
- 2023 (Meta) (Arxiv) Effective Long-Context Scaling of Foundation Models
- 2023 (Arxiv) [YaRN] YaRN - Efficient Context Window Extension of Large Language Models
- 2024 (Alibaba) (ICML) [DCA] Training-Free Long-Context Scaling of Large Language Models
- 2021 (Arxiv) [RoPE] RoFormer - Enhanced Transformer with Rotary Position Embedding
- 2022 (ICLR) [ALiBi] Train Short, Test Long - Attention with Linear Biases Enables Input Length Extrapolation
- 2016 (ACL) [BPE] Neural Machine Translation of Rare Words with Subword Units
- 2018 (Google) (Arxiv) SentencePiece - A simple and language independent subword tokenizer and detokenizer for Neural Text Processing
- 2019 (Nvidia) (Arxiv) Megatron-LM - Training Multi-Billion Parameter Language Models Using Model Parallelism
- 2020 (Microsoft) [ZeRO] ZeRO - Memory Optimizations Toward Training Trillion Parameter Models
- 2021 (Nvidia) (Arxiv) Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM
- 2022 (Google) (Arxiv) [KVcache] Efficiently Scaling Transformer Inference
- 2022 (Nvidia) (Arxiv) Reducing Activation Recomputation in Large Transformer Models
- 2022 (Stanford) (Arxiv) [FlashAttention] FlashAttention - Fast and Memory-Efficient Exact Attention with IO-Awareness
- 2023 (Princeton) (Arxiv) [FlashAttention2] FlashAttention-2 - Faster Attention with Better Parallelism and Work Partitioning
- 2023 (SOSP) [vLLM] Efficient Memory Management for Large Language Model Serving with PagedAttention
- 2024 (Princeton) (Arxiv) [FlashAttention3] FlashAttention-3 - Fast and Accurate Attention with Asynchrony and Low-precision
- 2019 (Google) (ICML) [Adapter] Parameter-Efficient Transfer Learning for NLP
- 2021 (Microsoft) (Arxiv) [LoRA] LoRA - Low-Rank Adaptation of Large Language Models
- 2023 (ACL) [UltraChat] Enhancing Chat Language Models by Scaling High-quality Instructional Conversations
- 2017 (OpenAI) (Arxiv) [PPO] Proximal Policy Optimization Algorithms
- 2022 (Anthropic) (Arxiv) Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
- 2023 (Stanford) (NIPS) [DPO] Direct Preference Optimization - Your Language Model is Secretly a Reward Model
- 2024 (ACL) [ORPO] ORPO - Monolithic Preference Optimization without Reference Model
- 2024 (DeepSeek) (Arxiv) [GRPO] DeepSeekMath - Pushing the Limits of Mathematical Reasoning in Open Language Models
- 2014 (ICLR) [VAE] Auto-Encoding Variational Bayes
- 2014 (NIPS) [GAN] Generative Adversarial Nets
- 2017 (NIPS) [VQ-VAE] Neural Discrete Representation Learning
- 2020 (Google) (ICLR) [ALBERT] ALBERT - A Lite BERT for Self-supervised Learning of Language Representations
- 2020 (NIPS) [Diffusion] Denoising Diffusion Probabilistic Models
- 2021 (Google) (ICLR) [ViT] An Image is Worth 16x16 Words - Transformers for Image Recognition at Scale
- 2021 (Google) (ICML) [ALIGN] Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
- 2021 (OpenAI) (Arxiv) [DALL-E] Zero-Shot Text-to-Image Generation
- 2021 (OpenAI) (ICML) [CLIP] Learning Transferable Visual Models From Natural Language Supervision
- 2022 (CVPR) [Stable Diffusion] High-Resolution Image Synthesis with Latent Diffusion Models
- 2022 (OpenAI) (Arxiv) [DALL-E-2] Hierarchical Text-Conditional Image Generation with CLIP Latents
- 2022 (Salesforce) (ICML) [BLIP] BLIP - Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
- 2023 (Alibaba) (Arxiv) [Qwen-VL] Qwen-VL - A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond
- 2023 (Google) (Arxiv) [SIGLIP] Sigmoid Loss for Language Image Pre-Training
- 2023 (Microsoft) (NIPS) [LLaVA] Visual Instruction Tuning
- 2023 (OpenAI) (Arxiv) [DALL-E-3] Improving Image Generation with Better Captions
- 2023 (OpenAI) GPT-4V(ision) System Card
- 2023 (Salesforce) (ICML) [BLIP-2] BLIP-2 - Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
- 2024 (Google) (Arxiv) [Gemini] Gemini - A Family of Highly Capable Multimodal Models
- 2024 (Google) (Arxiv) [Gemma] Gemma - Open Models Based on Gemini Research and Technology
- 2025 (Alibaba) (Arxiv) [Qwen2.5-VL] Qwen2.5-VL Technical Report
- 2025 (Alibaba) (Arxiv) [Qwen3-VL] Qwen3-VL Technical Report
- 2025 (Google) (Arxiv) [SIGLIP2] SigLIP 2 - Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features
- 2017 (Google) (ICLR) [Sparsely-Gated MoE] Outrageously Large Neural Networks - The Sparsely-Gated Mixture-of-Experts Layer
- 2018 (Google) (KDD) [MMoE] Modeling Task Relationships in Multi-task Learning with Multi-gate Mixture-of-Experts
- 2022 (Arxiv) MegaBlocks - Efficient Sparse Training with Mixture-of-Experts
- 2022 (Google) (JMLR) [SwitchTransformers] Switch Transformers - Scaling to Trillion Parameter Models with Simple and Efficient Sparsity
- 2022 (Meta) (EMNLP) Efficient Large Scale Language Modeling with Mixtures of Experts
- 2023 (Google) (ICLR) Sparse Upcycling - Training Mixture-of-Experts from Dense Checkpoints
- 2024 (Mistral) (Arxiv) [Mixtral] Mixtral of Experts
- 2024 (DeepSeek) (ACL) [DeepSeekMoE] DeepSeekMoE - Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
- 2024 (Google) (ICLR) [SoftMoE] From Sparse to Soft Mixtures of Experts
- 2025 (ICLR) [ReMoE] ReMoE - Fully Differentiable Mixture-of-Experts with ReLU Routing
- 2020 (Meta) (NIPS) [RAG] Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
- 2023 (Google) (TMLR) [PoT] Program of Thoughts Prompting - Disentangling Computation from Reasoning for Numerical Reasoning Tasks
- 2023 (Meta) (Arxiv) [Toolformer] Toolformer - Language Models Can Teach Themselves to Use Tools
- 2023 (ICML) [PAL] PAL - Program-aided Language Models
- 2024 (Arxiv) [TinyLlama] TinyLlama - An Open-Source Small Language Model
- 2024 (Meta) (Arxiv) [Code Llama] Code Llama - Open Foundation Models for Code
- 2024 (Microsoft) (ICLR) [ToRA] ToRA - A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
- 2024 (Princeton) (ICLR) [Llemma] Llemma - An Open Language Model For Mathematics
- 2002 (ACL) BLEU - a Method for Automatic Evaluation of Machine Translation
- 2004 (ACL) ROUGE - A Package for Automatic Evaluation of Summaries
- 2019 (DeepMind) (ICLR) [GLUE] GLUE - A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
- 2021 (OpenAI) (Arxiv) Evaluating Large Language Models Trained on Code
- 2022 (OpenAI) (ACL) [TruthfulQA] TruthfulQA - Measuring How Models Mimic Human Falsehoods
- 2023 (NIPS) Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
- 2024 (Arxiv) Chatbot Arena - An Open Platform for Evaluating LLMs by Human Preference