curius graph
☾
Dark
all pages
search
showing 35701-35750 of 160880 pages (sorted by popularity)
« prev
1
...
713
714
715
716
717
...
3218
next »
Interactive Scalable Interfaces for Machine Learning Interpretability — Fred Hohman
1 user ▼
Simple Long Convolutions for Sequence Modeling · Hazy Research
1 user ▼
[2303.03846] Larger language models do in-context learning differently
1 user ▼
MosaicBERT: Pretraining BERT from Scratch for $20
1 user ▼
[2303.08112] Eliciting Latent Predictions from Transformers with the Tuned Lens
1 user ▼
Observability | Practical Observability
1 user ▼
Decision Transformer Interpretability - AI Alignment Forum
1 user ▼
What Is Bfloat16 Arithmetic? – Nick Higham
1 user ▼
[2303.05119] Entropic Wasserstein Component Analysis
1 user ▼
TRAK
1 user ▼
Why didn't we get GPT-2 in 2005?
1 user ▼
Announcing OpenFlamingo: An open-source framework for training vision-language models with in-context learning | LAION
1 user ▼
[2303.11249] What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement
1 user ▼
Actually, Othello-GPT Has A Linear Emergent World Representation — Neel Nanda
1 user ▼
Reverse engineering the NTK
1 user ▼
On AI Deployment: AI supply chains (and why they matter)
1 user ▼
rnoti-p1034.pdf
1 user ▼
The Independent Compositional Subspace Hypothesis for the Structure of CLIP's Last Layer | OpenReview
1 user ▼
PureJaxRL
1 user ▼
When are Neural Networks more powerful than Neural Tangent Kernels? – Off the convex path
1 user ▼
AI for General Science - Large language models for scientific hypothesis/research ideas generation | Xinming Tu
1 user ▼
Parfit: A Philosopher and His Mission to Save Morality by David Edmonds - review by Jane O’Grady
1 user ▼
[2304.03843] Why think step-by-step? Reasoning emerges from the locality of experience
1 user ▼
Scaling, emergence, and reasoning (Jason Wei, NYU) - Google Slides
1 user ▼
Revisiting the classics: Jensen’s inequality – Machine Learning Research Blog
1 user ▼
[2103.00564] An Introduction to Johnson-Lindenstrauss Transforms
1 user ▼
[2303.14177] Scaling Expert Language Models with Unsupervised Domain Discovery
1 user ▼
PsyArXiv Preprints | Surprisal does not explain syntactic disambiguation difficulty: evidence from a large-scale benchmark
1 user ▼
[2303.17951] FP8 versus INT8 for efficient deep learning inference
1 user ▼
Seurat CCA? It's just a simple extension of PCA! | Xinming Tu
1 user ▼
Niels Bohr's Memorandum to President Roosevelt | The Manhattan Project | Historical Documents | atomicarchive.com
1 user ▼
Public Policy for Realists - by Pradyumna Prasad
1 user ▼
What Is Iterative Refinement? – Nick Higham
1 user ▼
[2305.08809] Interpretability at Scale: Identifying Causal Mechanisms in Alpaca
1 user ▼
PsyArXiv Preprints | How hard is cognitive science?
1 user ▼
Direct Approach Interactive Model
1 user ▼
Playing Doc’s Games—I | The New Yorker
1 user ▼
Six Experiments in Action Minimization
1 user ▼
Aligning Faithful Interpretations with their Social Attribution - ACL-TACL-2021_2021.tacl-1.18
1 user ▼
The longest training run
1 user ▼
Learning explanations that are hard to vary | OpenReview
1 user ▼
What We Get Wrong About AI & China—Asterisk
1 user ▼
What is the curl of a vector field, really? – theHigherGeometer
1 user ▼
Lens
1 user ▼
What does it mean to understand how a scientific literature is put together? - Marginal REVOLUTION
1 user ▼
Inside Argentina's currency exchange black markets | devonzuegel.com
1 user ▼
[2307.05599] AlephZero and Mathematical Experience
1 user ▼
Can we develop theoretical explanations for today’s AI systems? - generally intelligent
1 user ▼
Anthropic \ Studying Large Language Model Generalization with…
1 user ▼
Embroid: Correcting and Improving LLM Predictions Without Labels · Hazy Research
1 user ▼
« prev
1
...
713
714
715
716
717
...
3218
next »