’Show Your Working’: ChatGPT Performance Doubled w/ Process Rewards (+Synthetic Data Event Horizon)
I have not only read the Let’s Verify step by Step paper released less than 24 hours ago, I have combed the release notes and appendix, read most of the linked papers and done my own tests. It’s true, performance is massively boosted, and not just for mathematics but science and other domains too. I’ll show you comparisons with GPT 3 and PaLM 2, and demonstrate that new records are coming soon.
I will also cover the ’synthetic data event horizon’ and what might have gone into GPT 4’s training. I’ll show you how PRM works vs ORM, and why finetuning is still relevant. Plus I’ll cover reaction from Jan Leike, Ilya Sutskever, Sam Altman and more. I will also feature the highly relevant paper ’Language Models Don’t Always Say What They Think: Unfaithful Explanations in Chain-of-Thought Prompting’. I’ll also give a glimpse from Rob Miles about just how weirdly GPT 4 might think.
1 view
417
105
1 week ago 00:10:15 1
Maria Daines - Ain’t You The Man [Relaxing Blues Music 2020]
2 weeks ago 00:02:36 1
“Resurrection of Perfection“ Shawn Spears AEW Entrance Theme | AEW Music
2 weeks ago 00:02:00 1
FLIZER’S WITHER STORM ADDON Showcase || Released
2 weeks ago 00:21:14 4
We’re LOSING The Tariff War With China: How Our Elites Sold Us Out
2 weeks ago 00:01:11 1
Chibiverse Theme Song | NEW Series | Disney Channel Animation
2 weeks ago 00:02:47 11
Kian’s Bizarre B&B | House Tour | Netflix [ENG SUB]
2 weeks ago 00:00:23 1
ZorayPT Custom Bone Marrow Needle | Precision OEM/ODM Showcase