Introduction to Supervised and Reinforcement Finetuning - Sachin Dharashivkar
Hasgeek TV Hasgeek TV
34.6K subscribers
179 views
0

 Published On Premiered Aug 21, 2023

Sachin Dharashivkar will speak about LLM Finetuning and RLHF
Sachin is a founder who is exploring use cases of AI agents. He enjoys training Reinforcement Learning agents and exploring novel applications of Large Language Models.

Three steps of training chatGPT style models. How to perform supervised finetuning. Why is Reinforcement Learning from Human Feedback important and How to train Reward and Policy models.

More at has.gy/rEcp

show more

Share/Embed