Reinforcement Learning - ChatGPT, Playing Games & More • Dean Wampler • GOTO 2023
This presentation was recorded at GOTO Chicago 2023. #GOTOcon #GOTOchgo
Dean Wampler - Product Engineering Director for Accelerated Discovery at IBM Research @kdeanwampler
RESOURCES
Dean
@deanwampler
ABSTRACT
Reinforcement Learning (RL) trains an agent to maximize a cumulative reward in an environment. It rocketed to fame as the tool to achieve expert level performance in Atari games and the game of Go. It is also used for robotics, autonomous vehicles, process automation, and more recently, making ChatGPT more effective.
I will begin with why RL is important and how it supports the applications listed above, including “Reinforcement Learning with Human Feedback“, an essential tool used to develop ChatGPT. Then I will discuss how RL requires a variety of computational patterns: data management and processing, large-scale simulations and model training, and even model serving.
Finally, I will show how Ray RLlib seamlessly and efficiently supports RL, providing an ideal platform for building Python-based, RL applications with an intuitive, flexible API. [...]
TIMECODES
00:00 Intro
01:52 Agenda
03:18 What is reinforcement learning?
05:08 Video:
06:02 What is RL? continued
10:18 Ray RLlib
17:36 RL & ChatGPT
24:10 RL for recommendations
29:08 Outro
Download slides and read the full abstract here:
RECOMMENDED BOOKS
Phil Winder • Reinforcement Learning •
Kelleher & Tierney • Data Science (The MIT Press Essential Knowledge series) •
Enes Bilgin • Mastering Reinforcement Learning with Python •
Aske Plaat • Deep Reinforcement Learning •
Miguel Morales • Grokking Deep Reinforcement Learning •
#ReinforcementLearning #RL #RLHF #AI #ML #AlphaGo #OpenAI #ChatGPT #Ray #RLlib #RayRLlib #Programming #ArtificialIntelligence #MachineLearning #DataScience #DeanWampler
Looking for a unique learning experience?
Attend the next GOTO conference near you! Get your ticket at
Sign up for updates and specials at
SUBSCRIBE TO OUR CHANNEL - new videos posted almost daily.
1 view
0
0
9 years ago 01:47:00 48
Machine learning for neuroscience: HMMs, reinforcement learning, and deep learning
6 years ago 03:55:27 52
Reinforcement Learning Course - Full Machine Learning Tutorial
5 years ago 01:31:18 7
Reinforcement Learning | IJCAI Macao 2019
5 years ago 02:14:58 32
News Sentiment & Reinforcement Learning in Finance & Algorithmic Trading
11 months ago 01:00:19 15
MIT : Reinforcement Learning
7 years ago 00:36:04 73
How reinforcement learning works in Becca 7
8 years ago 01:27:30 152
MIT : Deep Reinforcement Learning for Motion Planning
3 years ago 00:44:27 2
Panel: The future of reinforcement learning
6 years ago 01:01:39 37
Deep Reinforcement Learning in Robotics with NVIDIA Jetson
2 years ago 00:08:40 40
AI Learns to Walk (deep reinforcement learning)
6 years ago 03:55:27 9
Reinforcement Learning Crash Course | Complete Deep Learning Course