curius graph
☾
Dark
all pages
search
showing 24951-25000 of 160880 pages (sorted by popularity)
« prev
1
...
498
499
500
501
502
...
3218
next »
SGD's Bias — LessWrong
1 user ▼
Minimax - Wikipedia
1 user ▼
Chunking and data compression in verbal short-term memory - ScienceDirect
1 user ▼
Frontier LLMs Attempt to Persuade into Harmful Topics
1 user ▼
AI models that lie, cheat and plot murder: how dangerous are LLMs really?
1 user ▼
Kraft–McMillan inequality - Wikipedia
1 user ▼
Alma Deutscher - Wikipedia
1 user ▼
Thoughts on The Curve - by Nathan Lambert - Interconnects
1 user ▼
Transparency is Surveillance - Nguyen - 2022 - Philosophy and Phenomenological Research - Wiley Online Library
1 user ▼
Contra Dwarkesh on RL sample-efficiency via information theory
1 user ▼
The Swimming of the Ginkgo Sperm - Arnold Arboretum
1 user ▼
Zephaniah Roe
1 user ▼
EconEvals: Benchmarks and Litmus Tests for LLM Agents in Unknown Environments
1 user ▼
[2411.17693] Adaptive Deployment of Untrusted LLMs Reduces Distributed Threats
1 user ▼
The Multi-Armed Bandit Problem and Its Solutions | Lil'Log
1 user ▼
Requiem (Mozart) - Wikipedia
1 user ▼
Recommendation: reports on the search for missing hiker Bill Ewasko — LessWrong
1 user ▼
Fundamental attribution error - Wikipedia
1 user ▼
Mirrors and Paintings — LessWrong
1 user ▼
the void — LessWrong
1 user ▼
Open Problems in AIXI Agent Foundations — AI Alignment Forum
1 user ▼
20 Great Articles and Essays about Artificial Intelligence - The Electric Typewriter
1 user ▼
The Copernican Revolution from the Inside — LessWrong
1 user ▼
CAT(0) group - Wikipedia
1 user ▼
Bloom: an open source tool for automated behavioral evaluations
1 user ▼
alignment_pretraining_feedback_draft_12_20_25.pdf - Google Drive
1 user ▼
RLHF 及其变体 Iterative DPO/RLOO/GRPO/REINFORCE 算法和工程分析 - 知乎
1 user ▼
Check digit - Wikipedia
1 user ▼
Embedded Universal Predictive Intelligence — LessWrong
1 user ▼
[2511.22226] Embedded Universal Predictive Intelligence: a coherent framework for multi-agent learning
1 user ▼
HPMOR
1 user ▼
[1506.02438] High-Dimensional Continuous Control Using Generalized Advantage Estimation
1 user ▼
Apply for Alignment Mentorship From TurnTrout and Alex Cloud
1 user ▼
Beyond Kolmogorov and Shannon — LessWrong
1 user ▼
Richard Sutton – Father of RL thinks LLMs are a dead end
1 user ▼
Fashion MNIST
1 user ▼
unrot your brain - by Kylee - plum pits
1 user ▼
The Coding Gopher - YouTube
1 user ▼
How do you look for pain points? : r/Entrepreneur
1 user ▼
Flight-to-Safety-Critical-AI.pdf
1 user ▼
Rubrics as Rewards: Reinforcement Learning Beyond Verifiable Domains
1 user ▼
OptiTree: Hierarchical Thoughts Generation with Tree Search for LLM Optimization Modeling
1 user ▼
Started playing with TickTick today after using Things for 6 months. Open up the gallery to see some of the differences. I like it a lot so far, its got some features I wish things had. More in the comments… : r/thingsapp
1 user ▼
Chi tiết tin - Quảng Bình
1 user ▼
Vụ 573 nhãn hiệu sữa giả: Chi 150.000 USD để "chạy án" | Báo Dân trí
1 user ▼
A Science-based Guide to Thinking Creatively—With LLMs
1 user ▼
Burhan Sönmez Asks the Question "Who Owns a Book?" - PEN America
1 user ▼
Exclusive: Inside Trump’s First 100 days | TIME
1 user ▼
Food consumption, habitus and the embodiment of social change: Making class and doing gender in urban Vietnam - Judith Ehlert, 2021
1 user ▼
Diagnosis as Self-Understanding & Self-Alienation
1 user ▼
« prev
1
...
498
499
500
501
502
...
3218
next »