Thoughts
Interests · 06/2024
Research
Spatial Competence Benchmark · 03/2026
Preview: Visual Geometry Bench · 11/2025
Intuiting Policy Gradient methods · 07/2025
Neural Network precision pitfalls in the wild · 05/2025
Multi-Class Boundary Extraction from Implicit Representations · 01/2025
Code
List of open source contributions to Inspect AI · 03/2026
Hill climb on MBPP using verifiers · 09/2025
From scratch: SFT and GRPO on Qwen 2.5 · 08/2025
LLM Agent ~ Reddit Consensus · 07/2025