All
Search
Images
Videos
Shorts
Maps
News
More
Shopping
Flights
Travel
Notebook
Report an inappropriate content
Please select one of the options below.
Not Relevant
Offensive
Adult
Child Sexual Abuse
LLM Videotutorial Full-Course
GPT On My Files Relevance Ai
Ai Chat Box for PDF Using FloWise
FloWise Ai
Tutorials
Rlfh
LLM
Tutorial
Reinforcement Learning IBM
Reinforcement Learning LLM
Huggingface Pipelines
Rlhf
Explained for Beginners
Lm Models
SLM Fine-Tuning
LLM Course
Rlhf
Huggingface
Rlhf
Algorithm
Rlhf
Reinforcement Learning
LLM Fundamentals
Machine Learning without Rag
AI Engine Meow Fine-Tunes
Fine-Tuning
How to Do Fine-Tuning
Fine-Tune
How to Fine Tune an LLM
Length
All
Short (less than 5 minutes)
Medium (5-20 minutes)
Long (more than 20 minutes)
Date
All
Past 24 hours
Past week
Past month
Past year
Resolution
All
Lower than 360p
360p or higher
480p or higher
720p or higher
1080p or higher
Source
All
Dailymotion
Vimeo
Metacafe
Hulu
VEVO
Myspace
MTV
CBS
Fox
CNN
MSN
Price
All
Free
Paid
Clear filters
SafeSearch:
Moderate
Strict
Moderate (default)
Off
Filter
LLM Videotutorial Full-Course
GPT On My Files Relevance Ai
Ai Chat Box for PDF Using FloWise
FloWise Ai
Tutorials
Rlfh
LLM
Tutorial
Reinforcement Learning IBM
Reinforcement Learning LLM
Huggingface Pipelines
Rlhf
Explained for Beginners
Lm Models
SLM Fine-Tuning
LLM Course
Rlhf
Huggingface
Rlhf
Algorithm
Rlhf
Reinforcement Learning
LLM Fundamentals
Machine Learning without Rag
AI Engine Meow Fine-Tunes
Fine-Tuning
How to Do Fine-Tuning
Fine-Tune
How to Fine Tune an LLM
1:48
ChatGPT: Yes-Man atau Analisis Kritis?
113.8K views
11 months ago
TikTok
regrezan
2:21
What are the three phases of the RLHF pipeline — Frontier Path #29 | ML Interview Prep
637 views
1 week ago
YouTube
moot-vs-the-rubric
1:59
How does ChatGPT technically work? When receiving user input, it undergoes preprocessing and tokenization to convert text into a machine-readable format. These tokens are then embedded into vectors and processed by the transformer neural network, which uses mechanisms to understand contextual nuances. With ChatGPT, a large aspect of its functionality is Reinforcement Learning from Human Feedback (RLHF), where it's fine-tuned with human input to ensure the responses are not only contextually appr
15.1K views
Jan 27, 2024
TikTok
tiffintech
1:01
How AI Actually Learns From Human Feedback (RLHF Explained) #Shorts
375 views
2 weeks ago
YouTube
AI Bytes Shorts
0:51
Skip RLHF! Align LLMs natively with DPO 🧠⚡
212 views
2 weeks ago
YouTube
DevPulse
3:34
Google finally claps back to OpenAI dominating the market with a seemingly incredible all-in-one model named Gemini. The middle tier of this model is live on Bard right now, the ultra version to topple gpt 4 is coming next year after more RLHF. #technology #techtok #ai #artificialintelligence #openai #gpt #gpt3 #aitools #aibusiness #chatgpt #chatgpt3 #google #bard #machinelearning #gpt4 #googlebard #bardai #multimodal
20K views
Dec 6, 2023
TikTok
timcarambat
0:06
This lecture provides a concise overview of building a ChatGPT-like model, covering both pretraining (language modeling) and post-training (SFT/RLHF). For each component, it explores common practices in data collection, algorithms, and evaluation methods. This guest lecture was delivered by Yann Dubois in Stanford’s CS229: Machine Learning course, in Summer 2024. #DevLife #WebDev #CodingTeam #StartupLife
6.4K views
May 24, 2025
TikTok
ai_devbytes
1:43
How AI models are really trained: RLHF
1.3K views
1 month ago
YouTube
Garrit Wilson
0:59
Que es el Reinforcement Learning From Human Feedback o RLHF es la forma actual en la que muchas empresas estan alineando sus modelos de inteligencia artificial para que estos puedan dar respuestas utiles y que no den informacion perjudicial #rlhf #openai #machinelearning #deeplearning #ai #inteligenciaartificial
16.9K views
Mar 31, 2023
TikTok
fazttech
0:53
The AI Explained How It Learns to Please Humans
299 views
1 month ago
YouTube
The BlackVeil Files Clips
0:36
RLHF Is a Proxy for Human Judgment #ai #podcast
793 views
2 weeks ago
YouTube
The MAD Podcast with Matt Turck
1:08
Meta ซื้อบริษัทด้าน AI สัมผัสอนาคตการลงทุน
3.7K views
Jun 27, 2025
TikTok
stockcurious
1:20
RLHF explained simply
2.5K views
6 months ago
YouTube
What's AI by Louis-François Bouchard
1:28
RLHF: What is it and how does it work? Reinforcement Learning from Human Feedback is being used a lot recently to refine the answers of large language models after the supervised learning stage. Check out my YouTube series to learn more about supervise learning vs. unsupervised learning vs. reinforcement learning, and check out my 10 Days of AI Basics series here on Instagram for an overview of AI fundamentals in ten 90-second segments. Please let me know in the comments if you have any addition
2.5K views
Feb 6, 2025
TikTok
harpercarrollai
0:53
Ep. 17 RLHF #artificialintelligence #machinelearning #educational
408 views
1 month ago
TikTok
papertrailai
2:29
Reinforcement Learning with Human Feedback (RLHF)| AI Concepts for Everyone - Day 26 #rlhf #ai #llm
581 views
2 weeks ago
YouTube
Code With Shukla Ji
4:48
Deep dive on how to improve large language models. I provide an introduction to zero-shot and few-shot learning methods. I also discuss the role of in-context learning and emergence. For fine-tuning, the video explains instruction tuning, reinforcement learning with human feedback (rlhf), reinforcement learning with AI feedback (rlaif, and parameter efficient fine tuning (peft). I will also have a larger version of this video on my youtube, where it's easier to see the slides. #datascience #mach
8.4K views
Apr 28, 2023
TikTok
rajistics
2:11
¿La Tierra es plana o redonda? 🌍 Si entrenas una IA con ambas… ¡puede responder cualquiera de las dos! 4 técnicas para reducir los sesgos: 1️⃣ Ponderar fuentes (Wikipedia > Reddit) 2️⃣ Guardarraíles (filtros de seguridad) 3️⃣ RLHF (personas que califican respuestas) 4️⃣ Datos sintéticos (contenido “de confianza” generado por IA) 💡 Aun así, los sesgos no desaparecen. Por eso necesitas entenderlos para usar bien la IA. 👉 Dime en comentarios: ¿Qué respuesta rara te ha dado una IA? #IA #Artificia
2.5K views
11 months ago
TikTok
fer.pilot
3:00
Inversión de Meta en Scale.AI y el Poder de los Datos
1.9K views
Jun 29, 2025
TikTok
arcadim
0:54
Three Stages of Training | RLHF
140 views
1 month ago
YouTube
SN ByteNexus
See more
More like this
Short videos
1:48
ChatGPT: Yes-Man atau Analisis Kritis?
113.8K views
11 months ago
TikTok
regrezan
2:21
What are the three phases of the RLHF pipeline — Frontier Path #29 | ML Interview Prep
637 views
1 week ago
YouTube
moot-vs-the-rubric
1:59
How does ChatGPT technically work? When receiving user input, it undergoes
15.1K views
Jan 27, 2024
TikTok
tiffintech
1:01
How AI Actually Learns From Human Feedback (RLHF Explained) #Shorts
375 views
2 weeks ago
YouTube
AI Bytes Shorts
0:51
Skip RLHF! Align LLMs natively with DPO 🧠⚡
212 views
2 weeks ago
YouTube
DevPulse
3:34
Google finally claps back to OpenAI dominating the market with a seemingly incredible all
20K views
Dec 6, 2023
TikTok
timcarambat
0:06
This lecture provides a concise overview of building a ChatGPT-like model, covering
6.4K views
May 24, 2025
TikTok
ai_devbytes
1:43
How AI models are really trained: RLHF
1.3K views
1 month ago
YouTube
Garrit Wilson
0:59
Que es el Reinforcement Learning From Human Feedback o RLHF es la forma
16.9K views
Mar 31, 2023
TikTok
fazttech
0:53
The AI Explained How It Learns to Please Humans
299 views
1 month ago
YouTube
The BlackVeil Files Clips
0:36
RLHF Is a Proxy for Human Judgment #ai #podcast
793 views
2 weeks ago
YouTube
The MAD Podcast with Matt
1:08
Meta ซื้อบริษัทด้าน AI สัมผัสอนาคตการลงทุน
3.7K views
Jun 27, 2025
TikTok
stockcurious
1:20
RLHF explained simply
2.5K views
6 months ago
YouTube
What's AI by Louis-François
1:28
RLHF: What is it and how does it work? Reinforcement Learning from Human
2.5K views
Feb 6, 2025
TikTok
harpercarrollai
0:53
Ep. 17 RLHF #artificialintelligence #machinelearning #educationa
408 views
1 month ago
TikTok
papertrailai
2:29
Reinforcement Learning with Human Feedback (RLHF)| AI Concepts for Everyone - Day
581 views
2 weeks ago
YouTube
Code With Shukla Ji
4:48
Deep dive on how to improve large language models. I provide an introduction to zero
8.4K views
Apr 28, 2023
TikTok
rajistics
2:11
¿La Tierra es plana o redonda? 🌍 Si entrenas una IA con ambas… ¡puede responder
2.5K views
11 months ago
TikTok
fer.pilot
3:00
Inversión de Meta en Scale.AI y el Poder de los Datos
1.9K views
Jun 29, 2025
TikTok
arcadim
0:54
Three Stages of Training | RLHF
140 views
1 month ago
YouTube
SN ByteNexus
More like this
Feedback