curius graph
☾
Dark
all pages
search
showing 120301-120350 of 160880 pages (sorted by popularity)
« prev
1
...
2405
2406
2407
2408
2409
...
3218
next »
RoFormer
1 user ▼
Sparse Distributed Memory and Hopfield Networks – Trenton Bricken
1 user ▼
1940's a Summer evening sitting on a porch and it's raining (oldies music from another room) ASMR - YouTube
1 user ▼
[2401.06118] Extreme Compression of Large Language Models via Additive Quantization
1 user ▼
[2405.04517] xLSTM: Extended Long Short-Term Memory
1 user ▼
Considering C99 for curl | daniel.haxx.se
1 user ▼
CUTLASS Tutorial: Mastering the NVIDIA® Tensor Memory Accelerator (TMA) – Colfax Research
1 user ▼
christine-sites
1 user ▼
jzarnett/ece459: ECE 459: Programming for Performance
1 user ▼
efeslab/Nanoflow: A throughput-oriented high-performance serving framework for LLMs
1 user ▼
Footer | Typographic Footers — The only footer gallery on earth.
1 user ▼
LLMs: Understanding Code Syntax and Semantics for Code Analysis
1 user ▼
tianyi-lab/Reflection_Tuning: [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
1 user ▼
zml/zml: High performance AI inference stack. Built for production. @ziglang / @openxla / MLIR / @bazelbuild
1 user ▼
a phoneless summer - YouTube
1 user ▼
[1911.01547] On the Measure of Intelligence
1 user ▼
Design patterns — Gordon Brander
1 user ▼
The Practitioner’s Guide to the Maximal Update Parameterization - Cerebras
1 user ▼
[2409.14586] Backtracking Improves Generation Safety
1 user ▼
Skeleton loading trick CSS
1 user ▼
William Gedney's Timelessly Intimate Photographs of San Francisco in the 1960s
1 user ▼
slyd0g/SwiftSpy: macOS keylogger, clipboard monitor, and screenshotter
1 user ▼
[2409.20370] The Perfect Blend: Redefining RLHF with Mixture of Judges
1 user ▼
Fully Sharded Data Parallel - CUDA
1 user ▼
expose ports in Colab
1 user ▼
Emmett Shear on X: "This sent me down a rabbit hole. The first three were clear, but the fourth was hard to pick out… https://t.co/wJbMGoG9lg" / X
1 user ▼
the friendship theory of everything - by Ava
1 user ▼
Stanford University CS236: Deep Generative Models
1 user ▼
I am Flora Guo
1 user ▼
cde-small-v1, the best BERT-sized text embedding model in the world
1 user ▼
xjdr-alt/entropix: Entropy Based Sampling and Parallel CoT Decoding
1 user ▼
[2409.17066] VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models
1 user ▼
Grammatical Notations
1 user ▼
McMaster University Archive for the History of Economic Thought
1 user ▼
20th WCP: Plato on Education as the Development of Reason
1 user ▼
Compute by hyperspace -- agent-driven research engine
1 user ▼
sam-paech/antislop-sampler
1 user ▼
sam-paech/antislop-sampler
1 user ▼
samuel-vitorino/lm.rs: Minimal LLM inference in Rust
1 user ▼
[2104.10350] Carbon Emissions and Large Neural Network Training
1 user ▼
modern friendship - by Nix 🕊 - starting from nix ꩜
1 user ▼
google/pyglove: Manipulating Python Programs
1 user ▼
SJ's Talent List
1 user ▼
Jeremy Goldberg
1 user ▼
The Most Beautiful Equation in Math - YouTube
1 user ▼
“You know something has gone wrong when he switches to Chinese” - YouTube
1 user ▼
Dynamics of optimization in high dimensions: summary statistics, effective dynamics and...
1 user ▼
What is Risograph Printing? | RISOTTO Studio
1 user ▼
[Public, Approved] Intro to Transformers - Google Slides
1 user ▼
Dick’s Tricks - Leonard Susskind
1 user ▼
« prev
1
...
2405
2406
2407
2408
2409
...
3218
next »