Research
I work on the theory and practice of recurrent sequence models: selective state-space models, what it costs to learn state tracking, and where subquadratic models pay off. Much of the work is trained locally on Apple Silicon with MLX.