Introduction to Supervised and Reinforcement Finetuning - Sachin Dharashivkar

Premiered Aug 21, 2023

Sachin Dharashivkar will speak about LLM finetuning and RLHF.
Sachin is a founder who is exploring use cases of AI agents. He enjoys training Reinforcement Learning agents and exploring novel applications of Large Language Models.

Topics: the three steps of training ChatGPT-style models, how to perform supervised finetuning, why Reinforcement Learning from Human Feedback (RLHF) is important, and how to train reward and policy models.

More at has.gy/rEcp
