How Deep Meta‑Learning Boosts Spatio‑Temporal Sales Forecasting for Retail

This article summarizes a AAAI 2021 paper that introduces a deep meta‑learning framework with an amortization network to generate spatio‑temporal representations, enabling accurate retail sales predictions across regions and time periods, especially during high‑volume shopping festivals.

JD Cloud Developers
JD Cloud Developers
JD Cloud Developers
How Deep Meta‑Learning Boosts Spatio‑Temporal Sales Forecasting for Retail

Sales forecasting for online and offline retail is crucial for inventory planning, especially during large shopping festivals, but historical data are scarce.

Background

Spatial distribution of stores, population attributes, and date types (weekday, weekend, festival) lead to diverse shopping behaviors. Limited data per region and time make it hard to train accurate models; heterogeneity of population density and evolving regional features require comprehensive spatial features such as POI distribution and demographics.

Spatio‑Temporal Representation Generation

The authors propose an Amortization Network that models the approximate posterior of spatio‑temporal representations for target regions and date types, considering both sales and related spatial features.

Temporal patterns, including linear and nonlinear correlations and weekly cycles, are captured using a linear transformation module and a special skip‑LSTM layer, followed by a feature‑fusion module that combines temporal and static features.

Spatio‑Temporal Sales Prediction

A generative model leverages shared statistical structure to learn across tasks. It combines two inputs: (1) a recent‑sales representation extracted via feature extraction and fusion, and (2) a task‑specific spatio‑temporal feature sampled from the amortization network. A fully‑connected network merges them to predict future sales orders.

Spatio‑Temporal Alternating Training

Inspired by multi‑view learning, two views—spatial and temporal—are constructed. Alternating training switches between views, allowing the model to learn complementary information and integrate the shared generative model, improving prediction accuracy.

Experiments

Evaluations on different time intervals (weekday, weekend, shopping festivals) and ablation studies show that the proposed STMP model outperforms baselines, reducing MSE by at least 30 % and achieving superior performance in bursty sales scenarios.

Further analysis of model variants demonstrates the importance of task‑specific spatio‑temporal distributions and modeling multiple periodic patterns.

Visualization on four representative regions across 2019 confirms that STMP accurately captures explosive sales peaks that baseline methods miss.

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

Deep LearningMeta LearningRetailSales Forecastingspatio-temporal
JD Cloud Developers
Written by

JD Cloud Developers

JD Cloud Developers (Developer of JD Technology) is a JD Technology Group platform offering technical sharing and communication for AI, cloud computing, IoT and related developers. It publishes JD product technical information, industry content, and tech event news. Embrace technology and partner with developers to envision the future.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.