My Q3 2023 Plan: Embarking on Reinforcement…

Jun 21, 2023

(Coauthored with ChatGPT)

6 Comments

Jul 6, 2023

Haha very nice to read your latest post! I've also started in RL from a slight different aspect of interest, in real-time gradient descent for meta learning. Have just done all the courses from https://community.deeplearning.ai. Thanks for the pointers of https://www.coursera.org/learn/fundamentals-of-reinforcement-learning, will start on it.

Expand full comment

Reply (1)

Zheng Shao

Aug 1, 2023

All courses from deeplearning.ai ? That's impressive!

Expand full comment

Yuxi Li

Jun 22, 2023Edited

Great to see that you plan to focus on RL, during the storm of LLMs.

RL is “guaranteed” to make (academic) progress, given enough resources, e.g., there is a chance to innovate all manually designed methods or previous algorithms with RL, like AlphaTensor, AlphaDev, and works in DB, compiler, chip design, magnetic control of tokamak plasmas, stratospheric balloons, even learning algorithms themselves.

ICML/NeurIPS workshops on reinforcement learning for real life

https://sites.google.com/view/RL4RealLife

(Invited talks and panel discussions from top experts and great papers.)

A survey in early 2022.

Reinforcement Learning in Practice: Opportunities and Challenges

https://arxiv.org/abs/2202.11296

Recently I spend much time on LLMs.

The following blog is at an abstract level.

Reinforcement learning is all you need, for next generation language models.

https://yuxili.substack.com/p/reinforcement-learning-is-all-you

I am working on a perspective paper with more concrete ideas.

Hopefully I will finish it by the end of this month.

I am always glad to join discussions about RL.

Expand full comment

Zheng Shao

Aug 1, 2023

First milestone reached: https://coursera.org/share/63a584bd7bbbee386be6145220058df7

Expand full comment

Gratus

Jun 21, 2023

I am going to try joining union this journey- I‘be in software engineering for over a decade and this looks like a great opportunity to dive in to RL.

Expand full comment

Reply (1)

Zheng Shao

Jun 22, 2023

Great! The first step: Start Auditing (it's free) https://www.coursera.org/learn/fundamentals-of-reinforcement-learning !

Expand full comment

Zheng’s Substack

My Q3 2023 Plan: Embarking on Reinforcement…