curius graph
☾
Dark
all pages
search
showing 23101-23150 of 170283 pages (sorted by popularity)
« prev
1
...
461
462
463
464
465
...
3406
next »
Falling off - by Vivian Loh - circles
1 user ▼
[2511.21654] EvilGenie: A Reward Hacking Benchmark
1 user ▼
[2511.21654] EvilGenie: A Reward Hacking Benchmark
1 user ▼
Indo-European Explorer: A 6,000-Year Journey
1 user ▼
Opinion | What if Labor Becomes Unnecessary? - The New York Times
1 user ▼
The best new novels to read this spring
1 user ▼
The engine of Germany's wealth is blocking its future | The European Correspondent
1 user ▼
30hr Open Weight Safety Projects - Google Docs
1 user ▼
Why Harry Styles Loves Running: “It’s Just You and a Pair of Shoes.”
1 user ▼
Insider Journalism - by Robin Hanson - Overcoming Bias
1 user ▼
Statisticism: How Cluster-Thinking About Data Creates Blind Spots
1 user ▼
Distilling Replacing Guilt — LessWrong
1 user ▼
Moral Reality Check – Unstable Ontology
1 user ▼
[2603.05414] Dissociating Direct Access from Inference in AI Introspection
1 user ▼
Our Team | until
1 user ▼
The Repugnant Conclusion (Stanford Encyclopedia of Philosophy)
1 user ▼
🟡 Iran War continues, Strait of Hormuz remains closed, sharp drop in Chinese aircraft flying near Taiwan, Alibaba AI agent mystery || Global Risks Weekly Roundup #10/2026
1 user ▼
Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation — AI Alignment Forum
1 user ▼
Log In | Robinhood
1 user ▼
[2603.05414] Dissociating Direct Access from Inference in AI Introspection
1 user ▼
What a time to be an oncologist - by Olivia Webb Kosloff
1 user ▼
Amy Tam on X: "When code is free, research is all that matters" / X
1 user ▼
Partial Lean formalization of Analysis I — Verso
1 user ▼
How To Become a Mathematical Genius - by Sinéad O’Sullivan
1 user ▼
The Untold Chaos Behind a $3 Billion AI Startup Launch - YouTube
1 user ▼
The First Crusade | The Salahuddin Generation | Ep. 2 | Dr. Hassan Elwan - YouTube
1 user ▼
Versa Diary - Google Docs
1 user ▼
The Mog Language Guide | Mog
1 user ▼
Evidence that recurrent circuits are critical to the ventral stream’s execution of core object recognition behavior | Nature Neuroscience
1 user ▼
Fantastic Beasts and How to Rank Them | The New Yorker
1 user ▼
We Should Revisit Literate Programming in the Agent Era | silly business
1 user ▼
[2510.16062] Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
1 user ▼
[2510.16062] Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs
1 user ▼
Feedback eaters and how to spot them - by Carmen - Altered
1 user ▼
Rohan Paul on X: "Self‑Correction Bench shows 1 word can flip 64% failure into success. Large language models often spot errors in a user prompt yet ignore identical errors in their own output. This paper measures that gap and shows a simple prompt tweak almost erases it. The authors build https://t.co/r8i7OfO5Py" / X
1 user ▼
CorrectBench: A Benchmark of Self-Correction in LLMs
1 user ▼
"Clean" Code, Horrible Performance - YouTube
1 user ▼
Why Escalation Favors Iran | Foreign Affairs
1 user ▼
Cursor's Third Era: Cloud Agents — ft. Sam Whitmore, Jonas Nelle, Cursor - YouTube
1 user ▼
Times New Roman Turns Right - McSweeney’s Internet Tendency
1 user ▼
How are MPs passing so many bills without voting? | CBC News
1 user ▼
Canadian military personnel identified on white supremacist dating site | CBC Accessibility
1 user ▼
Discover – type.lol
1 user ▼
Kaluza–Klein theory
1 user ▼
luck/luck.md at main · soleio/luck
1 user ▼
justinzwu.com
1 user ▼
The-Complete-Guide-to-Building-Skill-for-Claude.pdf
1 user ▼
FrameBook.
1 user ▼
Daniel Kokotajlo's Shortform — LessWrong
1 user ▼
Compensation as a Reflection of Values / Oxide
1 user ▼
« prev
1
...
461
462
463
464
465
...
3406
next »