Probabilistic Pair Recommendations & IRGAN: Boosting E‑commerce Click‑Through

This article summarizes two SIGIR 2017 papers: one introduces a probabilistic latent‑class model for shopping‑pair push recommendations that improves e‑commerce click‑through rates by leveraging co‑purchase and view‑then‑purchase graphs, and the other presents IRGAN, a GAN‑based framework that unifies generative and discriminative information‑retrieval models, achieving state‑of‑the‑art results across web search, recommendation, and QA tasks.

Alibaba Cloud Developer
Alibaba Cloud Developer
Alibaba Cloud Developer
Probabilistic Pair Recommendations & IRGAN: Boosting E‑commerce Click‑Through

1. Probabilistic Latent‑Class Model for Shopping‑Pair Push

In e‑commerce marketing push scenarios, click‑through rate (CTR) is heavily influenced by copy and limited to a single display slot. To improve CTR, the authors propose a “shopping‑pair” recommendation that pushes a complementary item together with a previously purchased product, increasing user attachment and opening rates.

Shopping pair example
Shopping pair example

Two metrics are defined for pairing items:

Co‑Purchase Graph (simultaneous purchase score)

View‑and‑then‑Purchase Graph (substitutivity score)

Co‑Purchase graph
Co‑Purchase graph
View‑and‑then‑Purchase graph
View‑and‑then‑Purchase graph

The pairing score combines high complementarity and low substitutivity, serving as an important feature in the recommendation model.

Pairing score formula
Pairing score formula

User behavior is represented by embeddings learned via a probabilistic latent‑class model. The model consists of two layers: a hidden layer that captures user segmentation using multidimensional logistic regression, and a click‑through prediction layer (a two‑dimensional logistic regression, extendable to deep neural networks).

User embedding model
User embedding model

Parameters are estimated with the Expectation‑Maximization (EM) algorithm. The Q‑function is derived and optimized via gradient descent.

EM Q‑function
EM Q‑function

Online experiments show a ~50% CTR improvement for shopping‑pair push recommendations, and the approach can be generalized to other targets such as items, news, or videos.

2. GAN for Information Retrieval – IRGAN

IRGAN integrates generative and discriminative IR models using a minimax game. The discriminator p_φ(r|q,d) learns to predict relevance from labeled data, while the generator p_θ(d|q,r) produces challenging document samples for the discriminator.

The discriminator is trained via logistic regression to maximize its objective, and the generator outputs a probability distribution over documents using a softmax function.

Discriminator objective
Discriminator objective

Training alternates between updating D (gradient descent) and G (policy gradient for discrete outputs).

Parameter update
Parameter update

Experiments on three IR tasks—Web Search, Item Recommendation, and Question Answering—demonstrate that IRGAN outperforms strong baselines.

Source code is available at https://github.com/geek-ai/irgan .

Original Source

Signed-in readers can open the original source through BestHub's protected redirect.

Sign in to view source
Republication Notice

This article has been distilled and summarized from source material, then republished for learning and reference. If you believe it infringes your rights, please contactadmin@besthub.devand we will review it promptly.

e‑commerceGANinformation retrievalprobabilistic modeling
Alibaba Cloud Developer
Written by

Alibaba Cloud Developer

Alibaba's official tech channel, featuring all of its technology innovations.

0 followers
Reader feedback

How this landed with the community

Sign in to like

Rate this article

Was this worth your time?

Sign in to rate
Discussion

0 Comments

Thoughtful readers leave field notes, pushback, and hard-won operational detail here.