Tagged articles
Machine Learning Algorithms & Natural Language Processing
Apr 25, 2026 · Artificial Intelligence

ICLR 2026 Award Winners: Outstanding Papers and Alec Radford’s Test‑of‑Time Honor

ICLR 2026 announced two Outstanding Paper awards, an Honorable Mention, and two Test‑of‑Time awards, the latter going to the seminal DCGAN and DDPG papers. The article also notes a 19,000‑paper submission pool with a 28% acceptance rate and covers new theoretical insights on Transformers and multi‑turn LLM evaluation.

DCGAN · DDPG · ICLR
Bighead's Algorithm Notes
Mar 24, 2026 · Artificial Intelligence

How an Interactive Imitation‑Learning Agent Framework Trains Robust Trading Strategies

The article analyzes the simulation‑reality gap in algorithmic trading and proposes an interactive market simulator that combines a pool of imitation‑learning agents, an action‑synthesis network, and a DDPG‑based reinforcement‑learning trader, showing superior robustness and downside protection on QQQ data.

Agent-Based Modeling · DDPG · Financial AI
Network Intelligence Research Center (NIRC)
Feb 3, 2026 · Artificial Intelligence

INCS: A DRL‑Based Intent‑Driven Network‑Wide Configuration Synthesis Framework

The article presents INCS, a framework that combines graph neural networks and deep reinforcement learning to achieve protocol‑agnostic, millisecond‑level, globally optimized network configuration synthesis. It targets the scalability limits, protocol dependence, and lack of optimization of traditional SMT‑based methods, and reports superior performance on large‑scale topologies.

DDPG · Graph Neural Network · Network Synthesis
Meituan Technology Team
Nov 15, 2018 · Artificial Intelligence

Reinforcement Learning for Meituan's "Guess You Like" Recommendation Ranking

Meituan enhanced its homepage “Guess You Like” recommendation slot by modeling user‑item interactions as a Markov Decision Process and applying an improved DDPG reinforcement‑learning agent that adjusts the ranking trade‑off parameter. The agent uses advantage‑based Q decomposition, shares actor‑critic weights, and runs in a real‑time TensorFlow pipeline, delivering consistent lifts in click‑through, dwell time, and depth.

DDPG · MDP Modeling · Online Learning
Sohu Tech Products
Oct 10, 2018 · Artificial Intelligence

Optimizing News Recall with DDPG Reinforcement Learning and Transformer Architecture

This article explains how reinforcement learning, specifically the DDPG algorithm combined with Transformer-based networks, is applied to improve large‑scale news recall systems, detailing the business scenario, algorithm selection, model architecture, speed optimizations, training challenges, and observed online performance gains.

AI · DDPG · Transformer