Dr. Minqi Jiang and Dr. Marc Rigter explain an innovative new method to make the intelligence of agents more general-purpose by training them to learn many worlds before their usual goal-directed training, which we call “reinforcement learning“.
Their new paper is called “Reward-free curricula for training robust world models“
Interviewer: Dr. Tim Scarfe
Please support us on Patreon, Tim is now doing MLST full-time and taking a massive financial hit. If you love MLST and want this to continue, please show your support! In return you get access to shows very early and private discord and networking.
We are also looking for show sponsors, please get in touch if interested mlstreettalk at gmail.
MLST Discord:
00:00:00 - Intro
00:01:05 - Mod
1 view
64
17
2 weeks ago 00:43:28 2
Stop Learning French & Start Speaking! | French Speaking Tips for Beginners | Frenchy Tales
3 weeks ago 00:21:20 5
Can you paint Chrome NMM using CONTRAST PAINTS? - Mechazoid Sokorentai @CorvusBelliOfficial
3 weeks ago 00:05:30 4
DIRKSCHNEIDER - Winter Dreams (feat. Doro Pesch) (Official Music Video)
3 weeks ago 00:20:15 2
Karoline Leavitt’s Lifestyle 2025★ House Tour, Husband, Children, Cars, Net Worth....
4 weeks ago 00:06:52 10
Rentarou’s love speech with ENG subs | Hyakkano/100 girlfriends