curius graph
showing 140451-140500 of 160880 pages (sorted by popularity)
« prev
1
...
2808
2809
2810
2811
2812
...
3218
next »
[2005.00060] Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness (1 user)
[1805.12185] Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks (1 user)
[CCAI All-Hands] Generative AI Models - Google Slides (1 user)
lecture01.pdf (1 user)
lamini-ai/lamin (1 user)
[2210.01790] Goal Misgeneralization: Why Correct Specifications Aren't Enough For Correct Goals (1 user)
Creative Good: The Google Glass feature no one is talking about (1 user)
Why Claims “Brainstorming Doesn’t Work” Are So Silly | LinkedIn (1 user)
Brainstorming – IDEO U (1 user)
Relaxed adversarial training for inner alignment - AI Alignment Forum (1 user)
Outer vs inner misalignment: three framings - AI Alignment Forum (1 user)
"Inner Alignment Failures" Which Are Actually Outer Alignment Failures - AI Alignment Forum (1 user)
Search - Obsidian Help (1 user)
8 Factors for Effective Use of Obsidian Tags, Links, and Folders | by Denise Todd | Medium (1 user)
Two Obsidian Plugins You’ll Wonder How You Lived Without | by Denise Todd | Medium (1 user)
Folder, Note, Tag use? : r/ObsidianMD (1 user)
Yet Another Hot Take on "Folders versus Tags" (1 user)
loen/alpaca-lora: Instruct-tune LLaMA on consumer hardware (1 user)
Categorizing failures as “outer” or “inner” misalignment is often confused - AI Alignment Forum (1 user)
[2105.14111] Goal Misgeneralization in Deep Reinforcement Learning (1 user)
Air Safety to Combat Global Catastrophic Biorisks_Web version (1 user)
First clean water, now clean air - EA Forum (1 user)
Automation bias (1 user)
Jon Uleis on Twitter: "My new favorite thing - Bing's new ChatGPT bot argues with a user, gaslights them about the current year being 2022, says their phone might have a virus, and says "You have not been a good user" Why? Because the person asked where Avatar 2 is showing nearby https://t.co/X32vopXxQG" / Twitter (1 user)
model-written-evals.pdf (1 user)
[2210.10760] Scaling Laws for Reward Model Overoptimization (1 user)
Miles Brundage on the world's desperate need for AI strategists and policy experts - 80,000 Hours (1 user)
The case for building expertise to work on US AI policy, and how to do it - 80,000 Hours (1 user)
Blog Home (1 user)
Reinforcement Learning: An Introduction to the Concepts, Applications and Code | by Ryan Wong | Towards Data Science (1 user)
The Extent and Consequences of P-Hacking in Science | PLOS Biology (1 user)
[2302.05441] Project and Probe: Sample-Efficient Domain Adaptation by Interpolating Orthogonal Features (1 user)
[2304.15004] Are Emergent Abilities of Large Language Models a Mirage? (1 user)
Opinion | We Need a Manhattan Project for AI Safety - POLITICO (1 user)
AI-box experiment - RationalWiki (1 user)
Language models can explain neurons in language models (1 user)
Neuron viewer (1 user)
openai/automated-interpretability (1 user)
Is power-seeking AI an existential risk? [draft] - Google Docs (1 user)
15 Questions About Remote Work, Answered (1 user)
Virtual communication curbs creative idea generation | Nature (1 user)
What to Know Before You Take a Job in a Hybrid Workplace | WIRED (1 user)
The effects of remote work on collaboration among information workers | Nature Human Behaviour (1 user)
Athletics at the 1904 Summer Olympics – men's marathon (1 user)
Quick meta note about STS 10SI - Google Docs (1 user)
AI Safety Community: Momentum | Lakera – Protecting AI teams that disrupt the world. (1 user)
How this Vietnamese refugee became Uber's CTO (1 user)
The Art of Daily Ritual: Keeping Sane in an Insane World | The On Being Project (1 user)
About Us — The Everyday Projects (1 user)