Akshit Pareek

i write deep-learning kernels (layernorm, softmax, etc.) for TI's DL library, targeting inference on TDAx edge accelerators. on the side, i train small robot models and work on the infrastructure to deploy them.

Two weeks of fine-tuning VLAs in sim: what worked, what broke

ACT → SmolVLA → harder scene on one 5070 Ti. Peaked at 60% on a 4-color shelf, dropped to 17.5% when I shuffled the books (0/54 on non-canonical positions). The textbook fix made it strictly worse, and the harder scene went 0/10. Three architectures, four scenes, zero clean wins. Full log.