Advanced AI / Reinforcement Learning

DQN to Double DQN

A reinforcement learning report and slide deck explaining how Atari DQN made value learning from pixels practical, and how Double DQN improved target reliability by reducing overestimation bias.

DQN to Double DQN presentation title slide

Overview

This artifact compares the original DQN breakthrough with the Double DQN correction. DQN shows that convolutional Q-networks can learn control directly from raw Atari frames, while Double DQN addresses the way DQN can overestimate action values when action selection and action evaluation use the same noisy estimates.

Key Features

Evidence

Interactive Slide Deck

DQN to Double DQN Slides

Download Slides PDF
DQN to Double DQN Slides slide 1 of 13
Slide 1 of 13

The main technical takeaway is that deep reinforcement learning progress came from both better visual representations and better target design. DQN made pixel-based value learning practical; Double DQN made the same value learning more trustworthy.