Bahdanau et al. (2014) - Attention MechanismsBahdanau et al.'s 2014 paper introduced neural attention mechanisms, enabling …
Evolution of Model ArchitecturesFrom the Perceptron to Transformers, this article traces the historical …
Hochreiter & Schmidhuber (1997) - Long Short-Term Memory (LSTM)LSTM networks, introduced by Hochreiter and Schmidhuber in 1997, solved the …
LeCun et al. - Convolutional Neural Networks (CNNs)Yann LeCun's early work on convolutional neural networks established the …
Radford et al. - Generative Pretrained Transformers (GPT)OpenAI's GPT series expanded the Transformer architecture into a generative …
Rosenblatt (1958) - The PerceptronFrank Rosenblatt's 1958 introduction of the Perceptron marked the birth of …
Rumelhart et al. (1986) - BackpropagationThe 1986 revival of backpropagation by Rumelhart, Hinton, and Williams enabled …
Sutskever et al. (2014) - Encoder-Decoder ArchitecturesThe encoder-decoder framework, introduced by Sutskever et al. (2014), enabled …
Turing (1950) - Computing Machinery and IntelligenceAlan Turing's 1950 paper proposed the imitation game - now known as the Turing …
Vaswani et al. (2017) - TransformersTransformers, introduced by Vaswani et al. in 2017, revolutionized sequence …