Tagged articles

Seq2Seq

17 articles · Page 1 of 1

Aug 6, 2025 · Artificial Intelligence

How Transformers Revolutionize Sequence Modeling: From RNN Limits to Self‑Attention Mastery

This article explains why Transformer models surpass traditional RNN‑based seq2seq architectures by introducing self‑attention, multi‑head attention, and positional encoding, detailing the inner workings of encoders, decoders, and attention mechanisms, and comparing their advantages and limitations across NLP and vision tasks.

GRULSTMRNN

0 likes · 30 min read

How Transformers Revolutionize Sequence Modeling: From RNN Limits to Self‑Attention Mastery

Tencent Technical Engineering

Apr 16, 2025 · Artificial Intelligence

Understanding Transformer Architecture for Chinese‑English Translation: A Practical Guide

This practical guide walks through the full Transformer architecture for Chinese‑to‑English translation, detailing encoder‑decoder structure, tokenization and embeddings, batch handling with padding and masks, positional encodings, parallel teacher‑forcing, self‑ and multi‑head attention, and the complete forward and back‑propagation training steps.

Machine TranslationPositional EncodingPyTorch

0 likes · 26 min read

Understanding Transformer Architecture for Chinese‑English Translation: A Practical Guide

Baidu Geek Talk

Oct 26, 2022 · Artificial Intelligence

Exploring Automatic Advertising Copy Generation: Techniques, Practices, and Future Directions

The article surveys automatic advertising copy generation, detailing why optimization is needed, the fundamentals of neural text generation with Seq2Seq and attention, extractive versus abstractive approaches, modern embeddings and MASS pre‑training, practical data and evaluation methods, and future enhancements such as multi‑stage attention, knowledge integration, and large pre‑trained models.

AIAdvertisingMASS

0 likes · 21 min read

Exploring Automatic Advertising Copy Generation: Techniques, Practices, and Future Directions

Zuoyebang Tech Team

Jul 14, 2022 · Artificial Intelligence

Enhancing Speech Keyword Detection Using Prefix Automaton Beam Search

This article presents a method to improve keyword detection in large‑scale speech recognition by integrating a prefix automaton into the beam‑search decoding of seq2seq models, enabling real‑time addition of new terms while reducing computational overhead compared to traditional approaches.

Beam SearchSeq2Seqkeyword detection

0 likes · 12 min read

Enhancing Speech Keyword Detection Using Prefix Automaton Beam Search

Youku Technology

Feb 28, 2022 · Artificial Intelligence

Seq2Path: Generating Sentiment Tuples as Paths of a Tree

Seq2Path treats each sentiment tuple as an independent tree path, training with average path loss and decoding via constrained beam search with a discriminative token, achieving state‑of‑the‑art results on five aspect‑based sentiment analysis datasets and deployment in Alibaba Entertainment AI Brain.

ACLBeam SearchSentiment Analysis

0 likes · 3 min read

Seq2Path: Generating Sentiment Tuples as Paths of a Tree

58 Tech

Oct 12, 2021 · Artificial Intelligence

Seq2Seq Approaches for Phone Number Extraction from Two‑Speaker Voice Dialogues

This article presents a practical study of extracting phone numbers from two‑speaker voice dialogues using Seq2Seq models—including LSTM, GRU with attention and feature fusion, and Transformer—detailing data characteristics, model architectures, training strategies, experimental results, and comparative analysis showing the GRU‑Attention approach achieving the best performance.

GRULSTMNLP

0 likes · 13 min read

Seq2Seq Approaches for Phone Number Extraction from Two‑Speaker Voice Dialogues

TiPaiPai Technical Team

Aug 2, 2021 · Artificial Intelligence

How Attention Boosts Text Recognition: From CNN‑Seq2Seq to Multi‑Scale Models

This article explains how attention mechanisms are applied to text recognition, covering the basic CNN‑Seq2Seq‑Attention architecture, multi‑scale attention extensions, and a 2D attentional irregular scene text recognizer with detailed network components, training loss, and experimental results.

CNNDeep LearningMulti-Scale

0 likes · 8 min read

How Attention Boosts Text Recognition: From CNN‑Seq2Seq to Multi‑Scale Models

JD Tech

Jun 17, 2021 · Artificial Intelligence

MTrajRec: Map-Constrained Trajectory Recovery via Seq2Seq Multi‑Task Learning

The paper introduces MTrajRec, a Seq2Seq multi‑task learning framework that simultaneously restores low‑sampling‑rate GPS trajectories to high‑sampling‑rate and aligns them to the road network, achieving more accurate and efficient trajectory recovery for downstream applications such as navigation and travel‑time estimation.

Deep LearningKDD 2021Multi-Task Learning

0 likes · 9 min read

MTrajRec: Map-Constrained Trajectory Recovery via Seq2Seq Multi‑Task Learning

New Oriental Technology

Feb 1, 2021 · Artificial Intelligence

Neural Machine Translation: Seq2Seq, Beam Search, BLEU, Attention Mechanisms, and GNMT Improvements

This article explains key concepts of neural machine translation, covering Seq2Seq encoder‑decoder models, beam search strategies, BLEU evaluation, various attention mechanisms, and the enhancements introduced in Google's Neural Machine Translation system to improve speed, OOV handling, and translation quality.

BLEUBeam SearchGNMT

0 likes · 11 min read

Neural Machine Translation: Seq2Seq, Beam Search, BLEU, Attention Mechanisms, and GNMT Improvements

DataFunSummit

Dec 18, 2020 · Artificial Intelligence

Complex Semantic Representation in Voice Assistants: NLP Layers, DIS Limitations, and the CMRL Schema

This article explains how voice assistants rely on a three‑layer NLP pipeline (lexical, syntactic, and semantic analysis), discusses the shortcomings of the traditional DIS (Domain‑Intent‑Slot) structure for complex commands, and introduces the hierarchical CMRL schema along with two neural models (copy‑write seq2seq and seq2tree) for converting natural language into structured logical expressions.

CMRLNLPSeq2Seq

0 likes · 14 min read

Complex Semantic Representation in Voice Assistants: NLP Layers, DIS Limitations, and the CMRL Schema

Sohu Tech Products

Nov 18, 2020 · Artificial Intelligence

Understanding Sequence‑to‑Sequence (seq2seq) Models and Attention Mechanisms

This article explains the fundamentals of seq2seq neural machine translation models, covering encoder‑decoder architecture, word embeddings, context vectors, RNN processing, and the attention mechanism introduced by Bahdanau and Luong, with visual illustrations and reference links for deeper study.

Deep LearningEmbeddingNeural Machine Translation

0 likes · 11 min read

Understanding Sequence‑to‑Sequence (seq2seq) Models and Attention Mechanisms

DataFunTalk

Aug 3, 2020 · Artificial Intelligence

Advances in Sequence‑to‑Sequence Text Generation: Attention, Pointer, Copy, and Transformer Models

This article reviews the evolution of encoder‑decoder based text generation, covering classic seq2seq with attention, pointer networks, copy mechanisms, knowledge‑enhanced models, convolutional approaches, and the latest Transformer‑based pre‑training such as MASS, highlighting their architectures, key innovations, and practical considerations.

NLPSeq2SeqText Generation

0 likes · 17 min read

Advances in Sequence‑to‑Sequence Text Generation: Attention, Pointer, Copy, and Transformer Models

58 Tech

Mar 13, 2019 · Artificial Intelligence

Design and Implementation of the 58.com Intelligent Article Writing Robot

The article describes the design, workflow, and two‑stage model improvements of 58.com’s intelligent writing robot, which uses template matching, seq2seq with attention and BeamSearch, and slot‑replacement techniques to automatically generate titles and body content for real‑estate and used‑car promotions, achieving high publishing volume and readership.

AI writingBLEUBeamSearch

0 likes · 9 min read

Design and Implementation of the 58.com Intelligent Article Writing Robot

Alibaba Cloud Developer

Aug 22, 2018 · Artificial Intelligence

How Smart Copy Generation Boosts 1688 B2B Sales: From Seq2Seq to Coverage Attention

This article analyzes the challenges of generating product copy for the 1688 B2B platform, proposes enhancements to attention‑based Seq2Seq and Pointer‑Generator models—including TextCNN classification, convolutional inputs, coverage attention, and beam‑search constraints—and demonstrates significant gains in accuracy, diversity, and reduced repetition through extensive experiments.

AICopy GenerationCoverage Attention

0 likes · 17 min read

How Smart Copy Generation Boosts 1688 B2B Sales: From Seq2Seq to Coverage Attention

Qunar Tech Salon

Mar 1, 2018 · Artificial Intelligence

Open-Domain Chatbot Implementation: Retrieval and Generative Approaches

This article explains the implementation of open-domain chatbots for customer service, comparing retrieval‑based and generative seq2seq approaches, describing hybrid methods that first attempt constrained retrieval before falling back to generation, and discusses training data, keyword extraction, and performance observations.

AIChatbotSeq2Seq

0 likes · 6 min read

Open-Domain Chatbot Implementation: Retrieval and Generative Approaches

Hulu Beijing

Dec 20, 2017 · Artificial Intelligence

How Attention Mechanisms Transform Seq2Seq Models for Better Translation

This article explains why attention mechanisms were introduced into Seq2Seq models, how they address the limitations of fixed‑length encoding, the role of bidirectional RNNs, and showcases their impact on machine translation and image captioning with illustrative diagrams.

Attention MechanismMachine TranslationRNN

0 likes · 10 min read

How Attention Mechanisms Transform Seq2Seq Models for Better Translation

Alibaba Cloud Developer

Nov 9, 2017 · Artificial Intelligence

How an Alibaba Engineer Built an AI Hip‑Hop Lyric Generator for Double 11

An Alibaba engineer created MusicGo, an AI program that scrapes hip‑hop lyrics, trains a seq2seq LSTM model, and generates rap songs themed around Double 11 and Alibaba's smart logistics, illustrating the practical steps, challenges, and creative adjustments needed for AI‑driven music creation.

AIAlibabaLSTM

0 likes · 7 min read

How an Alibaba Engineer Built an AI Hip‑Hop Lyric Generator for Double 11