LLM fine-tuning

⚡️Every product of the future will be a living system  — Ronak Malde, Trajectory.ai

Ronak Malde, CEO of Trajectory.ai, discusses his path from building AI coding agents at Windsurf to his current focus on continual learning for enterprise AI. He shares insights on leveraging real-world user data, the unique challenges of model acquisition, and how Trajectory.ai's platform, powered by innovations like scaled SDPO and a novel training stack, enables dynamic, always-learning AI models for industries ranging from legal to finance.

Stop Making Models Bigger, Make Them Behave — Kobie Crawford, Snorkel

Snorkel.ai's research demonstrates how a 4-billion-parameter model, fine-tuned with reinforcement learning for under $500, significantly outperformed a 235-billion-parameter model on financial analysis tool-use tasks. The key was cultivating 'tool discipline' and error-correction capabilities rather than relying on sheer model size or deeper reasoning. Training on single-table problems generalized effectively to harder multi-table problems, which underscores the value of targeted behavioral fixes identified through detailed evaluation rubrics.