Sample Pages (Top 50 by confidence)
[2503.13423] SuperBPE: Space Travel for Language Models
https://arxiv.org/pdf/2503.13423.pdf
1 user
Last: Jan 07, 2026
100% confidence
Steering Language Models with Weight Arithmetic
https://arxiv.org/pdf/2511.05408
1 user
Last: Jan 07, 2026
100% confidence
Inference-Time Reward Hacking in Large Language Models
https://arxiv.org/pdf/2506.19248
1 user
Last: Jan 07, 2026
100% confidence
Pre-Finetuning/Domain-Adaptive Pre-training of Language Models | by Chien-Sheng (Jason) Wu | Process My Language | Medium
https://medium.com/jasonwu0731/pre-finetuning-domain-adaptive-pre-training-of-la...
1 user
Last: Jan 07, 2026
100% confidence
[2207.05221] Language Models (Mostly) Know What They Know
https://arxiv.org/pdf/2207.05221.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2207.07061] Confident Adaptive Language Modeling
https://arxiv.org/pdf/2207.07061.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2211.15458] Validating Large Language Models with ReLM
https://arxiv.org/pdf/2211.15458.pdf
1 user
Last: Jan 07, 2026
100% confidence
Holistic Evaluation of Language Models (HELM)
https://crfm.stanford.edu/helm/latest/?group=core_scenarios
1 user
Last: Jan 07, 2026
100% confidence
[2501.09223] Foundations of Large Language Models
https://arxiv.org/pdf/2501.09223.pdf
1 user
Last: Jan 07, 2026
100% confidence
Simple distribution approximation: When sampled 100 times, can language models yield 80% A and 20% B? — AI Alignment Forum
https://www.alignmentforum.org/posts/iaHk9DMCbrYsKuqgS/simple-distribution-appro...
1 user
Last: Jan 07, 2026
100% confidence
Can Large Language Models Develop Gambling Addiction?
https://arxiv.org/pdf/2509.22818
1 user
Last: Jan 07, 2026
100% confidence
The Origins of Representation Manifolds in Large Language Models
https://arxiv.org/pdf/2505.18235
1 user
Last: Jan 07, 2026
100% confidence
[2502.00873] Language Models Use Trigonometry to Do Addition
https://arxiv.org/pdf/2502.00873.pdf
1 user
Last: Jan 07, 2026
100% confidence
Foundations of Large Language Models
https://arxiv.org/pdf/2501.09223v2
1 user
Last: Jan 07, 2026
100% confidence
Teaching language models to support answers with verified quotes.pdf
https://storage.googleapis.com/deepmind-media/Teaching%20language%20models%20to%...
1 user
Last: Jan 07, 2026
100% confidence
A Trainable Spaced Repetition Model for Language Learning
https://research.duolingo.com/papers/settles.acl16.pdf
1 user
Last: Jan 07, 2026
100% confidence
Machine Learning–Driven Language Assessment
https://research.duolingo.com/papers/settles.tacl20.pdf
1 user
Last: Jan 07, 2026
100% confidence
Improving language models by retrieving.pdf
https://storage.googleapis.com/deepmind-media/research/language-research/Improvi...
1 user
Last: Jan 07, 2026
100% confidence
COMS 6998-7 (Spring 2025): “Theoretical Foundations of Large Language Models”
https://djhsu.notion.site/COMS-6998-7-Spring-2025-Theoretical-Foundations-of-Lar...
1 user
Last: Jan 07, 2026
100% confidence
Wordcraft: Story Writing With Large Language Models
https://dl.acm.org/doi/fullHtml/10.1145/3490099.3511105
1 user
Last: Jan 07, 2026
100% confidence
contents | Build a Large Language Model (From Scratch)
https://learning.oreilly.com/library/view/build-a-large/9781633437166/OEBPS/Text...
1 user
Last: Jan 07, 2026
100% confidence
Historical analogies for large language models
https://dynomight.substack.com/p/llms?s=r
1 user
Last: Jan 07, 2026
100% confidence
Foundations of Large Language Models: Tools, Techniques, and Applications | WatSPEED | University of Waterloo
https://uwaterloo.ca/watspeed/programs-and-courses/foundations-large-language-mo...
1 user
Last: Jan 07, 2026
100% confidence
Program Synthesis with Large Language Models
https://arxiv.org/pdf/2108.07732.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2112.02969] Jigsaw: Large Language Models meet Program Synthesis
https://arxiv.org/pdf/2112.02969.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2302.03169] Data Selection for Language Models via Importance Resampling
https://arxiv.org/pdf/2302.03169.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2004.10964] Don't Stop Pretraining: Adapt Language Models to Domains and Tasks
https://arxiv.org/pdf/2004.10964.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2302.07842] Augmented Language Models: a Survey
https://arxiv.org/pdf/2302.07842.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2305.17333] Fine-Tuning Language Models with Just Forward Passes
https://arxiv.org/pdf/2305.17333.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2304.05128] Teaching Large Language Models to Self-Debug
https://arxiv.org/pdf/2304.05128.pdf
1 user
Last: Jan 07, 2026
100% confidence
Efficient Guided Generation for Large Language Models
https://arxiv.org/pdf/2307.09702
1 user
Last: Jan 07, 2026
100% confidence
Bridging the data gap between children and large language models - ScienceDirect
https://www.sciencedirect.com/science/article/pii/S1364661323002036
1 user
Last: Jan 07, 2026
100% confidence
Esoteric Language Models
https://arxiv.org/pdf/2506.01928
2 users
Last: Jan 07, 2026
100% confidence
[2310.07820] Large Language Models Are Zero-Shot Time Series Forecasters
https://arxiv.org/pdf/2310.07820.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2208.03299] Few-shot Learning with Retrieval Augmented Language Models
https://arxiv.org/pdf/2208.03299.pdf
1 user
Last: Jan 07, 2026
100% confidence
[2002.08909] REALM: Retrieval-Augmented Language Model Pre-Training
https://arxiv.org/pdf/2002.08909.pdf
1 user
Last: Jan 07, 2026
100% confidence
LanguageGuessr
https://languageguessr.io/quick-game
1 user
Last: Jan 07, 2026
100% confidence
Bridging the data gap between children and large language models - ScienceDirect
https://www.sciencedirect.com/science/article/abs/pii/S1364661323002036
1 user
Last: Jan 07, 2026
100% confidence
The Hitchhiker’s Guide to Instruction Tuning Large Language Models | by Viraj Shah | Medium
https://medium.com/@veer15/the-hitchhikers-guide-to-instruction-tuning-large-lan...
1 user
Last: Jan 07, 2026
100% confidence
Explicitly unbiased large language models still form biased associations | PNAS
https://www.pnas.org/doi/10.1073/pnas.2416228122
1 user
Last: Jan 07, 2026
100% confidence
Beyond Linear Steering: Unified Multi-Attribute Control for Language Models
https://arxiv.org/pdf/2505.24535
1 user
Last: Jan 07, 2026
100% confidence