This link [1] contains Ilya Sutskever's [2] curated machine learning paper list. The original story behind the list was told in a series of tweets.
I rearranged the sequence so that similar items are grouped together, while keeping each item's original index, rendered as digit emojis, at the beginning of its title for reference. I do not know of any specific reason behind the ordering of the original list. There are 27 papers and materials at the moment; I added placeholders at the end in case the list is extended with additional references.
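As a side note on the formatting, here is a minimal Python sketch of how the zero-padded digit-emoji prefixes used below can be generated. The `emoji_index` helper is purely illustrative, not part of any tooling behind this post:

```python
# Minimal sketch: render an item's original index as keycap-digit emojis,
# e.g. 14 -> 1️⃣4️⃣. Each keycap is digit + U+FE0F (VS-16) + U+20E3 (keycap).
DIGIT_EMOJI = {d: d + "\ufe0f\u20e3" for d in "0123456789"}

def emoji_index(n: int, width: int = 2) -> str:
    """Zero-pad n to `width` digits and map each digit to its keycap emoji."""
    return "".join(DIGIT_EMOJI[d] for d in str(n).zfill(width))

print(emoji_index(14))  # 1️⃣4️⃣
```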
Most of the papers are by authors from Google Brain. Besides papers, the list includes a few blog posts and one CS course. To my surprise, not all of the papers are highly cited. I will try to learn from the blog posts and hope to write blogs of comparable quality myself.
Ilya’s List
- 0️⃣1️⃣ The Annotated Transformer [3]
- 0️⃣2️⃣ The First Law of Complexodynamics [4]
- 0️⃣3️⃣ The Unreasonable Effectiveness of Recurrent Neural Networks [5]
- 0️⃣4️⃣ Understanding LSTM Networks [6]
- 0️⃣5️⃣ Recurrent Neural Network Regularization [7]
- 0️⃣6️⃣ Keeping Neural Networks Simple by Minimizing the Description Length of the Weights [8]
- 0️⃣7️⃣ Pointer Networks [9]
- 0️⃣8️⃣ ImageNet Classification with Deep Convolutional Neural Networks [10]
- 0️⃣9️⃣ Order Matters: Sequence to Sequence for Sets [11]
- 1️⃣0️⃣ GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism [12]
- 1️⃣1️⃣ Deep Residual Learning for Image Recognition [13]
- 1️⃣2️⃣ Multi-Scale Context Aggregation by Dilated Convolutions [14]
- 1️⃣3️⃣ Neural Message Passing for Quantum Chemistry [15]
- 1️⃣4️⃣ Attention Is All You Need [16]
- 1️⃣5️⃣ Neural Machine Translation by Jointly Learning to Align and Translate [17]
- 1️⃣6️⃣ Identity Mappings in Deep Residual Networks [18]
- 1️⃣7️⃣ A Simple Neural Network Module for Relational Reasoning [19]
- 1️⃣8️⃣ Variational Lossy Autoencoder [20]
- 1️⃣9️⃣ Relational Recurrent Neural Networks [21]
- 2️⃣0️⃣ Quantifying the Rise and Fall of Complexity in Closed Systems: The Coffee Automaton [22]
- 2️⃣1️⃣ Neural Turing Machines [23]
- 2️⃣2️⃣ Deep Speech 2: End-to-End Speech Recognition in English and Mandarin [24]
- 2️⃣3️⃣ Scaling Laws for Neural Language Models [25]
- 2️⃣4️⃣ A Tutorial Introduction to the Minimum Description Length Principle [26]
- 2️⃣5️⃣ Machine Super Intelligence [27]
- 2️⃣6️⃣ Kolmogorov Complexity and Algorithmic Randomness [28]
- 2️⃣7️⃣ CS231n: Convolutional Neural Networks for Visual Recognition [29]
- 2️⃣8️⃣–4️⃣0️⃣ (placeholders reserved for future additions)
A Reinforcement Learning List
This is a bonus section: OpenAI has also released a curated paper list for reinforcement learning [30].
Footnotes
- [1] Ilya 30u30
- [2] Google Scholar (Ilya Sutskever's profile)
- [3] The Annotated Transformer
- [4] The First Law of Complexodynamics
- [5] The Unreasonable Effectiveness of Recurrent Neural Networks
- [6] colah's blog, "Understanding LSTM Networks," August 27, 2015
- [7] Zaremba, Wojciech, Ilya Sutskever, and Oriol Vinyals. "Recurrent neural network regularization." arXiv preprint arXiv:1409.2329 (2014).
- [8] Hinton, Geoffrey E., and Drew Van Camp. "Keeping the neural networks simple by minimizing the description length of the weights." Proceedings of the sixth annual conference on Computational learning theory. 1993.
- [9] Vinyals, Oriol, Meire Fortunato, and Navdeep Jaitly. "Pointer networks." Advances in neural information processing systems 28 (2015).
- [10] Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E. Hinton. "ImageNet classification with deep convolutional neural networks." Advances in neural information processing systems 25 (2012).
- [11] Vinyals, Oriol, Samy Bengio, and Manjunath Kudlur. "Order matters: Sequence to sequence for sets." arXiv preprint arXiv:1511.06391 (2015).
- [12] Huang, Yanping, et al. "GPipe: Efficient training of giant neural networks using pipeline parallelism." Advances in neural information processing systems 32 (2019).
- [13] He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. "Deep residual learning for image recognition." Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770-778. 2016.
- [14] Yu, Fisher, and Vladlen Koltun. "Multi-scale context aggregation by dilated convolutions." arXiv preprint arXiv:1511.07122 (2015).
- [15] Gilmer, Justin, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, and George E. Dahl. "Neural message passing for quantum chemistry." International conference on machine learning, pp. 1263-1272. PMLR, 2017.
- [16] Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems 30 (2017).
- [17] Bahdanau, Dzmitry, Kyunghyun Cho, and Yoshua Bengio. "Neural machine translation by jointly learning to align and translate." arXiv preprint arXiv:1409.0473 (2014).
- [18] He, Kaiming, et al. "Identity mappings in deep residual networks." Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11–14, 2016, Proceedings, Part IV. Springer International Publishing, 2016.
- [19] Santoro, Adam, et al. "A simple neural network module for relational reasoning." Advances in neural information processing systems 30 (2017).
- [20] Chen, Xi, et al. "Variational lossy autoencoder." arXiv preprint arXiv:1611.02731 (2016).
- [21] Santoro, Adam, et al. "Relational recurrent neural networks." Advances in neural information processing systems 31 (2018).
- [22] Aaronson, Scott, Sean M. Carroll, and Lauren Ouellette. "Quantifying the rise and fall of complexity in closed systems: the coffee automaton." arXiv preprint arXiv:1405.6903 (2014).
- [23] Graves, Alex, Greg Wayne, and Ivo Danihelka. "Neural Turing machines." arXiv preprint arXiv:1410.5401 (2014).
- [24] Amodei, Dario, et al. "Deep Speech 2: End-to-end speech recognition in English and Mandarin." International conference on machine learning. PMLR, 2016.
- [25] Kaplan, Jared, et al. "Scaling laws for neural language models." arXiv preprint arXiv:2001.08361 (2020).
- [26] Grünwald, Peter. "A tutorial introduction to the minimum description length principle." Advances in Minimum Description Length: Theory and Applications. MIT Press, 2005.
- [27] Legg, Shane. "Machine super intelligence." PhD thesis, University of Lugano, 2008.
- [28] Shen, Alexander, Vladimir A. Uspensky, and Nikolay Vereshchagin. Kolmogorov Complexity and Algorithmic Randomness. Vol. 220. American Mathematical Society, 2022.
- [29] CS231n: Convolutional Neural Networks for Visual Recognition, Stanford University course.
- [30] Key Papers in Deep RL, OpenAI Spinning Up.