Tag

machine translation

0 views collected around this technical thread.

Tencent Technical Engineering
Tencent Technical Engineering
Apr 16, 2025 · Artificial Intelligence

Understanding Transformer Architecture for Chinese‑English Translation: A Practical Guide

This practical guide walks through the full Transformer architecture for Chinese‑to‑English translation, detailing encoder‑decoder structure, tokenization and embeddings, batch handling with padding and masks, positional encodings, parallel teacher‑forcing, self‑ and multi‑head attention, and the complete forward and back‑propagation training steps.

EmbeddingPositional EncodingPyTorch
0 likes · 26 min read
Understanding Transformer Architecture for Chinese‑English Translation: A Practical Guide
vivo Internet Technology
vivo Internet Technology
Feb 12, 2025 · Artificial Intelligence

Bidirectional Optimization of NLLB-200 and ChatGPT for Low-Resource Language Translation

The paper proposes a bidirectional optimization framework that fine‑tunes the low‑resource NLLB‑200 translation model with LoRA using data generated by ChatGPT, while also translating low‑resource prompts with NLLB before feeding them to LLMs, thereby improving multilingual translation quality yet requiring careful validation of noisy synthetic data.

Fine-tuningLLMLoRA
0 likes · 28 min read
Bidirectional Optimization of NLLB-200 and ChatGPT for Low-Resource Language Translation
Baidu Tech Salon
Baidu Tech Salon
Jun 24, 2024 · Artificial Intelligence

Paperpolisher: AI-Powered Academic Paper Translation and Polishing Assistant

Paperpolisher is an AI-powered tool using Baidu's ERNIE large model and Comate to translate and polish Chinese academic papers into high-quality English, leveraging large paper datasets and retrieval augmentation, streamlining code generation and improving acceptance chances for submissions to top conferences.

AI coding assistantArtificial IntelligenceBaidu Comate
0 likes · 9 min read
Paperpolisher: AI-Powered Academic Paper Translation and Polishing Assistant
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 12, 2024 · Artificial Intelligence

A Simple Introduction to the Transformer Model

This article provides a comprehensive, beginner-friendly explanation of the Transformer architecture, covering its encoder‑decoder structure, self‑attention, multi‑head attention, positional encoding, residual connections, decoding process, final linear and softmax layers, and training considerations, illustrated with numerous diagrams and code snippets.

Self-AttentionTransformerdeep learning
0 likes · 24 min read
A Simple Introduction to the Transformer Model
DataFunSummit
DataFunSummit
Mar 3, 2024 · Artificial Intelligence

Instruction Fine-Tuning Practices for Huawei's Pangu Large Language Model

This presentation details the concepts, methodologies, and experimental results of instruction fine‑tuning for Huawei's Pangu large language model, covering model scale, architecture, training strategies, data quality, parallelism techniques, and case studies on Chinese‑English translation and Thai language adaptation.

AI infrastructureEfficient Fine-TuningInstruction Fine-Tuning
0 likes · 19 min read
Instruction Fine-Tuning Practices for Huawei's Pangu Large Language Model
DataFunTalk
DataFunTalk
Sep 19, 2023 · Artificial Intelligence

Simultaneous Speech Translation: Technical Background, System Architecture, and Key Challenges

This article reviews the technical background of simultaneous speech translation, compares offline and real‑time scenarios, details ASR and MT technologies, describes the system architecture and design strategies, and discusses the major challenges and solutions for deploying robust, low‑latency translation services.

ASRHuaweideep learning
0 likes · 16 min read
Simultaneous Speech Translation: Technical Background, System Architecture, and Key Challenges
NetEase LeiHuo Testing Center
NetEase LeiHuo Testing Center
Mar 31, 2023 · Artificial Intelligence

Comparative Evaluation of Deepl and ChatGPT Machine Translation for Game Localization

This article investigates the translation quality of Deepl and ChatGPT for the game 'Naraka: Bladepoint' by comparing their outputs against professional human translations across Chinese‑English, Chinese‑Spanish, and English‑Spanish pairs using BLEU scores and manual assessment, revealing strengths and limitations of each system.

AIGCBLEUChatGPT
0 likes · 12 min read
Comparative Evaluation of Deepl and ChatGPT Machine Translation for Game Localization
Model Perspective
Model Perspective
Nov 17, 2022 · Artificial Intelligence

How Mathematics Sparked the Rise of Modern Linguistics and NLP

This article traces the historical convergence of mathematics and linguistics, from 19th‑century pioneers to post‑war computer‑driven research, highlighting how statistical, probabilistic, and formal methods laid the foundation for machine translation, morphological analysis, and contemporary natural language processing.

Natural Language Processinghistory of linguisticsmachine translation
0 likes · 7 min read
How Mathematics Sparked the Rise of Modern Linguistics and NLP
DataFunTalk
DataFunTalk
Sep 27, 2022 · Artificial Intelligence

Contrastive Learning for Text Generation: Motivation, Methodology, Experiments, and Discussion (CoNT Framework)

This article reviews the integration of contrastive learning into text generation, explains why it helps mitigate exposure bias, introduces the CoNT framework with three key improvements, presents extensive experiments on translation, summarization, code comment and data‑to‑text tasks, and discusses practical deployment considerations.

AICoNTNatural Language Processing
0 likes · 21 min read
Contrastive Learning for Text Generation: Motivation, Methodology, Experiments, and Discussion (CoNT Framework)
DataFunTalk
DataFunTalk
Jul 30, 2022 · Artificial Intelligence

Technical Analysis of Huawei’s Offline Speech‑to‑Text and Length‑Constrained Speech Translation Systems in IWSLT 2022

This article reviews the IWSLT 2022 competition tasks, explains Huawei’s cascade offline speech‑to‑text translation pipeline, details four major technical innovations—including ensemble‑based ASR de‑noise, context‑aware re‑ranking, domain‑controlled training, and length‑control strategies—and presents experimental results that demonstrate Huawei’s leading performance across multiple language directions.

ASRHuaweiIWSLT
0 likes · 18 min read
Technical Analysis of Huawei’s Offline Speech‑to‑Text and Length‑Constrained Speech Translation Systems in IWSLT 2022
DataFunTalk
DataFunTalk
Jan 16, 2022 · Artificial Intelligence

DeltaLM: A Multilingual Pretrained Encoder‑Decoder Model for Neural Machine Translation and Zero‑Shot Transfer

DeltaLM is a new multilingual pretrained encoder‑decoder model that leverages a pretrained encoder and a novel decoder to improve multilingual neural machine translation, offering efficient training, strong cross‑language transfer, zero‑shot translation, and superior performance on various translation and summarization tasks.

DeltaLMNMTZero-shot
0 likes · 13 min read
DeltaLM: A Multilingual Pretrained Encoder‑Decoder Model for Neural Machine Translation and Zero‑Shot Transfer
DataFunSummit
DataFunSummit
Nov 18, 2021 · Artificial Intelligence

Enterprise Applications and Research of Speech Translation

This article reviews recent advances in speech translation, discusses ByteDance's practical deployments, compares cascade and end‑to‑end modeling approaches, introduces improved encoder‑decoder architectures and training strategies, and reports state‑of‑the‑art results on the IWSLT 2021 benchmark.

AIByteDancecascade model
0 likes · 15 min read
Enterprise Applications and Research of Speech Translation
DataFunTalk
DataFunTalk
Oct 5, 2021 · Artificial Intelligence

From Technology to Experience: Vivo Machine Translation Deployment Practice

This article presents a comprehensive guide to deploying machine translation at Vivo, covering business analysis, algorithm choices beyond standard NMT, language detection challenges, data collection and cleaning, scientific evaluation methods, and engineering optimizations to deliver a seamless user experience.

AINMTdata quality
0 likes · 20 min read
From Technology to Experience: Vivo Machine Translation Deployment Practice
Tencent Tech
Tencent Tech
Jul 22, 2021 · Artificial Intelligence

How Tencent Dominated WMT2021: Winning Five News‑Track Translation Tasks

Tencent’s machine‑translation teams clinched five first‑place wins in the WMT2021 news track—covering Chinese‑English, Japanese‑English and English‑German limited‑resource tasks—outperforming 82 competing teams and showcasing the impact of its AI‑driven translation engine across its products.

AI competitionBLEUTencent
0 likes · 4 min read
How Tencent Dominated WMT2021: Winning Five News‑Track Translation Tasks
iQIYI Technical Product Team
iQIYI Technical Product Team
Jul 9, 2021 · Artificial Intelligence

iQIYI Multi‑Language Subtitle Machine Translation: Practice, Model Exploration, and Deployment

iQIYI’s multi‑language subtitle machine‑translation system combines a one‑to‑many transformer, context‑fusion encoding, four custom attention masks, masked language modeling, global decoding loss, reconstruction and error‑correction modules, plus pronoun, idiom and name‑handling tricks, achieving higher quality than third‑party services and even surpassing human translation for several languages.

One-to-Many ModelSubtitle TranslationTransformer
0 likes · 17 min read
iQIYI Multi‑Language Subtitle Machine Translation: Practice, Model Exploration, and Deployment
DataFunTalk
DataFunTalk
Feb 20, 2021 · Artificial Intelligence

Industrial-Scale Machine Translation at Bytedance: Applications, Demos, and Research Advances

This article presents Bytedance's industrial machine‑translation platform, describing its global deployment, diverse product demos, underlying sequence‑to‑sequence models, BERT‑enhanced training strategies, prune‑tune sparsity techniques, multilingual pre‑training, document translation, and a high‑performance inference engine.

AI applicationsBERTByteDance
0 likes · 19 min read
Industrial-Scale Machine Translation at Bytedance: Applications, Demos, and Research Advances
DataFunTalk
DataFunTalk
Feb 9, 2021 · Artificial Intelligence

Multimodal AI Research: Video-Aware Dialog, Dual-Channel Reasoning, and Multimodal Machine Translation

This article surveys recent multimodal AI research, covering video scene‑aware dialog with a GPT‑2 based unified pre‑training framework, dual‑channel multi‑hop reasoning for visual dialog, capsule‑network‑enhanced multimodal machine translation, and graph‑neural‑network‑driven multimodal translation, highlighting experimental results and future directions.

graph neural networkmachine translationmultimodal AI
0 likes · 12 min read
Multimodal AI Research: Video-Aware Dialog, Dual-Channel Reasoning, and Multimodal Machine Translation
New Oriental Technology
New Oriental Technology
Feb 1, 2021 · Artificial Intelligence

Neural Machine Translation: Seq2Seq, Beam Search, BLEU, Attention Mechanisms, and GNMT Improvements

This article explains key concepts of neural machine translation, covering Seq2Seq encoder‑decoder models, beam search strategies, BLEU evaluation, various attention mechanisms, and the enhancements introduced in Google's Neural Machine Translation system to improve speed, OOV handling, and translation quality.

AttentionBLEUGNMT
0 likes · 11 min read
Neural Machine Translation: Seq2Seq, Beam Search, BLEU, Attention Mechanisms, and GNMT Improvements
DataFunTalk
DataFunTalk
Jan 10, 2021 · Artificial Intelligence

Didi's Machine Translation System: Architecture, Techniques, and WMT2020 Competition Experience

This article presents a comprehensive overview of Didi's machine translation platform, covering its evolution from statistical to neural models, the Transformer architecture with relative position and larger FFN, data preparation, training strategies such as back‑translation and knowledge distillation, deployment optimizations with TensorRT, and the team's successful participation in the WMT2020 news translation task.

BLEUTensorRTTransformer
0 likes · 14 min read
Didi's Machine Translation System: Architecture, Techniques, and WMT2020 Competition Experience