curius graph

all pages

showing 35801-35850 of 160880 pages (sorted by popularity)

« prev 1...715 716717718 719...3218 next »

Cumulants, and Cumulant Generating Functions

cuda_mode_lecture2 - Google Slides

A Reply to Makelov et al. (2023)’s “Interpretability Illusion” Arguments

start leveraging einx get_at for extra clarity · lucidrains/vector-quantize-pytorch@3af4110

Distillation Walkthrough

Grand-master Level Chess without Search

Social science research topics for global health and wellbeing | Open Philanthropy

Notes on control evaluations for safety cases — AI Alignment Forum

Portable Evaluation Tasks via the METR Task Standard - METR

Some costs of superposition — AI Alignment Forum

task-standard/examples at main · METR/task-standard

Fine-Grained Human Feedback | Databricks Blog

Further notes on Birkhoff-von Neumann decomposition of doubly stochastic matrices

Universal Jailbreak Backdoors from Poisoned Human Feedback | SPY Lab

Simple distribution approximation: When sampled 100 times, can language models yield 80% A and 20% B? — AI Alignment Forum

How Can We Harness Pre-Training to Develop Robust Models? – gradient science

How to compute Hessian-vector products? | ICLR Blogposts 2024

Prompt Injection Defenses Should Suck Less

FACT SHEET: Vice President Harris Announces OMB Policy to Advance Governance, Innovation, and Risk Management in Federal Agencies’ Use of Artificial Intelligence | The White House

kronfluence/DOCUMENTATION.md at main · pomonam/kronfluence

A Fire Upon the Deep

How to compute Hessian-vector products? | ICLR Blogposts 2024

Making a SOTA Adversarial Attack on LLMs 38x Faster | Haize Labs Blog 🕊️

Andy Jones

Empirical Evidence Against "The Longest Training Run" — LessWrong

Calibrating the Mosaic Evaluation Gauntlet | Databricks Blog

Manifest AI - Compute-Optimal Context Size

epochai.org/files/direct-approach.pdf

Fabien's Shortform — LessWrong

The Limited Benefit of Recycling Foundation Models – Epoch AI

ZebraLogic: Benchmarking the Logical Reasoning Ability of Language Models

Faith and Fate: Transformers as fuzzy pattern matchers – Answer.AI

nvlpubs.nist.gov/nistpubs/ai/NIST.AI.800-1.ipd.pdf

An update on our general capability evaluations - METR

Reason for eigenvalues

III: Scaling to deep learning

There Are No Magic Outcome Variables | Elements of Evolutionary Anthropology

Hybrid Heterogeneous Clusters Can Lower the Energy Consumption of LLM Inference Workloads

Does Robustness Improve with Scale? | FAR AI

Demystifying AI Inference Deployments for Trillion Parameter Large Language Models | NVIDIA Technical Blog

Can You Trust An AI Press Release?—Asterisk

The Decline in Writing About Progress - by Matt Clancy

Behavioral red-teaming is unlikely to produce clear, strong evidence that models aren't scheming

If I wanted to spend WAY more on AI, what would I spend it on? — LessWrong

Scaling Automatic Neuron Description | Transluce AI

Memorandum on Advancing the United States’ Leadership in Artificial Intelligence; Harnessing Artificial Intelligence to Fulfill National Security Objectives; and Fostering the Safety, Security, and Trustworthiness of Artificial Intelligence | The White House

Trendlines in AIxBio evals – Lennart Justen

On LLM prompt optimization and amortization

Critical batch-size and effective dimension in Ordinary Least Squares

Tyler Cowen's Ethnic Dining Guide | All food is ethnic food.

« prev 1...715 716717718 719...3218 next »