Tagged articles
15 articles
Page 1 of 1
Data Party THU
Data Party THU
Oct 21, 2025 · Artificial Intelligence

Why DQN Overestimates Q‑Values and How Double DQN Fixes It

The article explains how DQN’s use of the max operator introduces a maximization bias that leads to overestimated Q‑values, and shows how Double DQN separates action selection from value evaluation to eliminate this bias, improving stability and performance in Atari benchmarks.

DQNDouble DQNalgorithm analysis
0 likes · 7 min read
Why DQN Overestimates Q‑Values and How Double DQN Fixes It
Python Programming Learning Circle
Python Programming Learning Circle
Jul 10, 2025 · Artificial Intelligence

Build a DQN Autonomous Driving Agent with gym and highway‑env

This tutorial walks through installing gym and highway‑env, configuring six driving scenarios, processing observations (kinematics, images, occupancy grids), defining actions and rewards, constructing a DQN network, training it with a reinforcement‑learning loop, and analyzing collision, time, and reward metrics.

DQNautonomous drivinggym
0 likes · 10 min read
Build a DQN Autonomous Driving Agent with gym and highway‑env
DataFunSummit
DataFunSummit
Apr 2, 2024 · Artificial Intelligence

Reinforcement Learning: Fundamentals, Classic Algorithms, and Applications in Short Video Recommendation

This article provides an in-depth overview of reinforcement learning, covering its goals, mathematical foundations such as Markov Decision Processes, classic algorithms like DQN, and practical applications including short‑video recommendation systems that aim to improve user retention through RL‑based ranking.

DQNMarkov Decision ProcessRL applications
0 likes · 12 min read
Reinforcement Learning: Fundamentals, Classic Algorithms, and Applications in Short Video Recommendation
Python Programming Learning Circle
Python Programming Learning Circle
Dec 16, 2023 · Artificial Intelligence

Using highway‑env with OpenAI Gym for Reinforcement Learning: Installation, Configuration, and DQN Training

This tutorial explains how to install the gym and highway‑env packages, configure the highway‑v0 environment, explore its observation types, and implement a DQN agent in Python to train and evaluate autonomous driving policies, complete with code snippets and performance visualizations.

DQNPythongym
0 likes · 9 min read
Using highway‑env with OpenAI Gym for Reinforcement Learning: Installation, Configuration, and DQN Training
DaTaobao Tech
DaTaobao Tech
Aug 18, 2022 · Artificial Intelligence

Introduction to Deep Reinforcement Learning: Theory, Algorithms, and Applications

This article introduces deep reinforcement learning by explaining its Markov decision process foundations, then categorizes the main algorithm families—value‑based methods like DQN, policy‑based approaches such as PG/DPG/DDPG, and actor‑critic techniques including A3C, PPO, and DDPG—detailing their architectures, training procedures, and key advantages.

DQNMDPactor-critic
0 likes · 14 min read
Introduction to Deep Reinforcement Learning: Theory, Algorithms, and Applications
58 Tech
58 Tech
Jun 24, 2022 · Artificial Intelligence

Reinforcement Learning for Lead Generation in Task‑Oriented Dialogue Systems

This article presents a reinforcement‑learning‑based approach to improve lead‑capture efficiency of a task‑oriented chatbot used in local services, detailing the system architecture, RL algorithms (DQN/DDQN), data construction, model training, offline and online evaluation, and the resulting commercial gains.

DQNLead Generationcustomer-service
0 likes · 27 min read
Reinforcement Learning for Lead Generation in Task‑Oriented Dialogue Systems
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Aug 20, 2021 · Artificial Intelligence

Training a DQN AI to Master 2048: Step-by-Step Guide

This article walks through using reinforcement learning with a Deep Q‑Network in PyTorch to train an AI agent that plays the 2048 puzzle game, covering environment setup, algorithm implementation, network design, and a short training run that achieves a score of 256.

2048AIDQN
0 likes · 8 min read
Training a DQN AI to Master 2048: Step-by-Step Guide
DataFunTalk
DataFunTalk
Dec 10, 2019 · Artificial Intelligence

Applying Deep Reinforcement Learning (DQN) to the 2048 Game: Experiments and Insights

This article details a series of reinforcement‑learning experiments on the 2048 game, from random baselines through DQN implementations, classical value‑iteration methods, network redesigns, and Monte‑Carlo tree search, highlighting challenges such as reward design, over‑estimation, and exploration while achieving scores up to 34 000 and tiles of 2048.

2048AIDQN
0 likes · 8 min read
Applying Deep Reinforcement Learning (DQN) to the 2048 Game: Experiments and Insights