- 30:12
用RLHF的方法解读论语_哔哩哔哩_bilibili
- 08:32
TensorFlow 2.0深度学习入门-中文书-免费!!!_哔哩哔哩_bilibili
- 09:35
你知道什么是红队吗?它是ChatGPT中关键技术之一_哔哩哔哩_bilibili
- 16:38
【StatQuest】Recurrent Neural Networks 详解_哔哩哔哩_bilibili
- 01:00:02
什么是基于人类反馈的强化学习 What is RLHF?_哔哩哔哩_bilibili
- 12:21
ChatGPT背后的技术(1/2)IFT SFT COT RLHM你知道吗?_哔哩哔哩_bilibili
- 01:16:55
PPO@RLHF ChatGPT原理解析_哔哩哔哩_bilibili
- 01:18:36
OpenAI研究员讲解指令微调和RLHF_哔哩哔哩_bilibili
- 06:34
19 How LLMs follow instructions- Instruction tuning and RLHF (optional)_哔哩哔哩_bilibili
- 01:00:38
chatGPT: 源自人类反馈的强化学习 | HuggingFace: RL from Human Feedback- From Zero to chatGPT_哔哩哔哩_bilibili