Reinforcement Learning with Human Feedback (RLHF) - How to train and fine-tune Transformer Models
15:31
30.2K views · Feb 12, 2024
YouTube · Serrano.Academy
Reinforcement Learning from Human Feedback explained with math derivations and the PyTorch code.
2:15:13
61.9K views · Feb 27, 2024
YouTube · Umar Jamil
[Introduction to Generative AI 2024] Lecture 8: The Training History of Large Language Models, Stage 3: Entering Real-World Practice and Honing Skills (Reinforcement Learning from Human Feedback, RLHF)
36:59
79K views · Apr 12, 2024
YouTube · Hung-yi Lee
Fine-tuning LLMs on Human Feedback (RLHF + DPO)
28:53
18.1K views · 10 months ago
YouTube · Shaw Talebi
Reinforcement Learning from Human Feedback (RLHF) Explained
11:29
72.2K views · Aug 7, 2024
YouTube · IBM Technology
RLHF Visualizer | Hands-on Reinforcement Learning
45:51
2.9K views · 3 months ago
YouTube · Vizuara
Reproducing the RLHF Training Method from Scratch: Hands-on Code, Large Language Model Training
24:29
20.3K views · May 8, 2024
bilibili · 蓝斯诺特
[6-Hour Tutorial] Complete Hands-on LLM Course: The Full Pipeline from Transformer to RLHF
6:06:21
3.1K views · 3 months ago
bilibili · AIDeepCoder
RLHF from scratch, step-by-step, in code
3:14:37
129 views · 6 months ago
YouTube · Ashwani Kumar
5. Post-Training [Fine-Tuning, Reinforcement Learning (RLHF)] (Part 1)
16:33
25 views · 1 month ago
YouTube · 大模型-十一