I am currently the post-training science lead at Oracle AI labs for LLM-based code and SQL generation. Previously at Amazon AGI I worked on post-training optimization of large language models using reinforcement learning and supervised fine-tuning. My past projects include GRPO for verifiable code generation, safety models for browser agents (Amazon Nova Act), and data mixing strategies for robust, domain-adapted LLMs.
Leading post-training efforts for LLM-based code and SQL generation.
Large Language Model fine-tuning, reinforcement learning, and LLM-based agent safety.
Developed multi-modal ML solutions for Echo devices and always-on audio sensing.
Audio ML for smart devices.
Personalized audio experiences using sensor fusion.