Haha, very nice to read your latest post! I've also started in RL, from a slightly different angle of interest: real-time gradient descent for meta-learning. I have just done all the courses from https://community.deeplearning.ai. Thanks for the pointer to https://www.coursera.org/learn/fundamentals-of-reinforcement-learning, I will start on it.
All the courses from deeplearning.ai? That's impressive!
Great to see that you plan to focus on RL, during the storm of LLMs.
RL is “guaranteed” to make (academic) progress, given enough resources. For example, there is a chance to improve on manually designed methods or previous algorithms with RL, as in AlphaTensor, AlphaDev, and work in databases, compilers, chip design, magnetic control of tokamak plasmas, stratospheric balloons, and even learning algorithms themselves.
ICML/NeurIPS workshops on reinforcement learning for real life
https://sites.google.com/view/RL4RealLife
(Invited talks and panel discussions from top experts and great papers.)
A survey in early 2022.
Reinforcement Learning in Practice: Opportunities and Challenges
https://arxiv.org/abs/2202.11296
Recently I have been spending much of my time on LLMs.
The following blog post is at an abstract level.
Reinforcement learning is all you need, for next generation language models.
https://yuxili.substack.com/p/reinforcement-learning-is-all-you
I am working on a perspective paper with more concrete ideas.
Hopefully I will finish it by the end of this month.
I am always glad to join discussions about RL.
First milestone reached: https://coursera.org/share/63a584bd7bbbee386be6145220058df7
I am going to try joining you on this journey. I've been in software engineering for over a decade, and this looks like a great opportunity to dive into RL.
Great! The first step: start auditing (it's free) at https://www.coursera.org/learn/fundamentals-of-reinforcement-learning