Reinforcement Learning - ChatGPT, Playing Games & More • Dean Wampler • GOTO 2023

This presentation was recorded at GOTO Chicago 2023. #GOTOcon #GOTOchgo Dean Wampler - Product Engineering Director for Accelerated Discovery at IBM Research @kdeanwampler RESOURCES Dean @deanwampler ABSTRACT Reinforcement Learning (RL) trains an agent to maximize a cumulative reward in an environment. It rocketed to fame as the tool to achieve expert level performance in Atari games and the game of Go. It is also used for robotics, autonomous vehicles, process automation, and more recently, making ChatGPT more effective. I will begin with why RL is important and how it supports the applications listed above, including “Reinforcement Learning with Human Feedback“, an essential tool used to develop ChatGPT. Then I will discuss how RL requires a variety of computational patterns: data management and processing, large-scale simulations and model training, and even model serving. Finally, I will show how Ray RLlib seamlessly and efficiently supports RL, providing an ideal platform for building Python-based, RL applications with an intuitive, flexible API. [...] TIMECODES 00:00 Intro 01:52 Agenda 03:18 What is reinforcement learning? 05:08 Video: 06:02 What is RL? continued 10:18 Ray RLlib 17:36 RL & ChatGPT 24:10 RL for recommendations 29:08 Outro Download slides and read the full abstract here: RECOMMENDED BOOKS Phil Winder • Reinforcement Learning • Kelleher & Tierney • Data Science (The MIT Press Essential Knowledge series) • Enes Bilgin • Mastering Reinforcement Learning with Python • Aske Plaat • Deep Reinforcement Learning • Miguel Morales • Grokking Deep Reinforcement Learning • #ReinforcementLearning #RL #RLHF #AI #ML #AlphaGo #OpenAI #ChatGPT #Ray #RLlib #RayRLlib #Programming #ArtificialIntelligence #MachineLearning #DataScience #DeanWampler Looking for a unique learning experience? Attend the next GOTO conference near you! Get your ticket at Sign up for updates and specials at SUBSCRIBE TO OUR CHANNEL - new videos posted almost daily.

1 view