Tagged articles
507 articles
Page 1 of 6
Geek Labs
Geek Labs
May 11, 2026 · Artificial Intelligence

Why AI Translators Feel Mechanical and This Tool Makes PDFs Read Like Chinese

Most AI translation services produce literal, word‑for‑word output that reads like machine‑generated text, but the open‑source Cuimao Translator on GitHub analyzes the source’s style, tone, and domain before rendering English PDFs into fluent, context‑aware Chinese using five translation modes and six built‑in quality checks.

AI translationChinese localizationClaude Code
0 likes · 8 min read
Why AI Translators Feel Mechanical and This Tool Makes PDFs Read Like Chinese
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 1, 2026 · Artificial Intelligence

GPT-5.6 Leaked? Inside GPT-5.5’s Goblin Obsession and OpenAI’s Overnight Ban

The article analyzes how internal logs revealed a GPT‑5.6 route, how GPT‑5.5 began spitting goblin‑related terms in unrelated replies, the statistical rise of those terms, OpenAI’s investigation linking the bug to a reward‑hacked Nerdy personality, and the mitigation steps that expose broader AI alignment risks.

AI AlignmentGPT-5.5Goblin bug
0 likes · 13 min read
GPT-5.6 Leaked? Inside GPT-5.5’s Goblin Obsession and OpenAI’s Overnight Ban
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 4, 2026 · Artificial Intelligence

Automate a Year of PhD Research in 7 Days with DeepScientist

DeepScientist V1.5, an open‑source AI system from Westlake University, claims to automate the full research pipeline—from literature review and code debugging to experiment analysis and paper writing—delivering SOTA results in weeks and offering one‑click deployment on Windows, Linux and macOS.

AI automationDeepScientistNLP
0 likes · 8 min read
Automate a Year of PhD Research in 7 Days with DeepScientist
JakartaEE China Community
JakartaEE China Community
Apr 1, 2026 · Artificial Intelligence

Top Java AI Development Tools for 2025

This guide reviews eight leading AI development tools for Java in 2025, explaining how each library or framework—such as DJL, TensorFlow Java, Hugging Face, LangChain, Apache Kafka, Ray, Deeplearning4j, and Neo4j—enables Java developers to build, train, and deploy intelligent applications without switching languages.

AIDeep LearningJava
0 likes · 9 min read
Top Java AI Development Tools for 2025
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 24, 2026 · Artificial Intelligence

China’s Tech Circle Wars Over the Chinese Name for AI Tokens – Trends and Aesthetics

Amid a heated debate over the proper Chinese translation of “Token,” China’s AI community examines the term’s technical origins, massive global consumption—30 trillion daily tokens worldwide, 4.69 trillion from China alone—and its economic impact, while proposing names like CiYuan, MoYuan, and ZhiYuan to reflect cultural aesthetics.

Chinese NamingNLPToken
0 likes · 12 min read
China’s Tech Circle Wars Over the Chinese Name for AI Tokens – Trends and Aesthetics
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Mar 20, 2026 · Artificial Intelligence

Why Kimi Dropped Residual Connections: A First‑Person Deep Dive into Attention Residuals

This article explains how Attention Residuals (AttnRes) replace traditional residual shortcuts with layer‑wise attention, details the mathematical reformulation, design constraints, static‑Q trick, full and block variants, and presents experimental evidence of significant accuracy gains with modest overhead.

NLPNeural NetworksRMSNorm
0 likes · 11 min read
Why Kimi Dropped Residual Connections: A First‑Person Deep Dive into Attention Residuals
Code Wrench
Code Wrench
Mar 3, 2026 · Artificial Intelligence

Unlocking High‑Performance Chinese Segmentation: Inside Go’s gse Library

This article deeply examines the source code of Go’s high‑performance segmentation library gse, revealing its Double‑Array Trie, shortest‑path dynamic programming, and HMM‑Viterbi implementations, and demonstrates practical usage for Chinese tokenization, part‑of‑speech tagging, keyword extraction, and custom dictionary management.

GoHMMNLP
0 likes · 13 min read
Unlocking High‑Performance Chinese Segmentation: Inside Go’s gse Library
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Feb 23, 2026 · Artificial Intelligence

How COMI Achieves 32× Compression and Boosts Performance by 25 Points

The COMI framework introduces a marginal information gain metric and a coarse‑to‑fine two‑stage compression strategy that preserves relevance and diversity, enabling 32× context reduction while improving Exact Match on NaturalQuestions by nearly 25 points and more than doubling inference speed.

Long-Context RetrievalMarginal Information GainNLP
0 likes · 7 min read
How COMI Achieves 32× Compression and Boosts Performance by 25 Points
AI Waka
AI Waka
Feb 23, 2026 · Artificial Intelligence

Essential Books to Master Generative AI: From NLP to Multimodal Apps

This guide outlines the key competencies for generative AI professionals and curates a focused reading list—covering NLP fundamentals, software engineering, LLM libraries, vector databases, and multimodal AI—to help readers build practical expertise and deploy impactful AI solutions.

AI learningBook RecommendationsLangChain
0 likes · 9 min read
Essential Books to Master Generative AI: From NLP to Multimodal Apps
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
Jan 14, 2026 · Artificial Intelligence

From Black‑Box Guessing to Quantitative Deconstruction: Unveiling the Mystery Inside Large Language Models

At EMNLP 2025, the BUPT NIRC team presented a paper that introduces the ARR metric to quantitatively separate latent reasoning from factual shortcuts in LLMs, using Logit Lens and Attention Knockout to reveal distinct internal pathways and shares their conference experience.

ARR metricAttention KnockoutEMNLP2025
0 likes · 6 min read
From Black‑Box Guessing to Quantitative Deconstruction: Unveiling the Mystery Inside Large Language Models
AI Cyberspace
AI Cyberspace
Jan 13, 2026 · Artificial Intelligence

From Symbolic AI to LLMs: A Complete NLP History and Model Guide

This article provides a comprehensive overview of natural language processing, tracing its evolution from early symbolic and statistical stages through deep learning breakthroughs, detailing sequence models, key NLP tasks, text representation methods, and the development of modern architectures like RNN, LSTM, GRU, Transformer, and GPT series.

Deep LearningGPTLSTM
0 likes · 60 min read
From Symbolic AI to LLMs: A Complete NLP History and Model Guide
php Courses
php Courses
Dec 9, 2025 · Artificial Intelligence

How to Supercharge Your PHP Apps with AI: A Practical Guide

This guide explains why PHP applications need AI, outlines core AI use cases such as intelligent content processing, computer vision, personalization, and chatbots, and provides step‑by‑step implementation paths, tools, best‑practice recommendations, real‑world case studies, and future trends for developers.

AI integrationComputer VisionNLP
0 likes · 10 min read
How to Supercharge Your PHP Apps with AI: A Practical Guide
BirdNest Tech Talk
BirdNest Tech Talk
Oct 20, 2025 · Artificial Intelligence

How Embedding Models Power Semantic Search: A Hands‑On LangChain Guide

This article explains what embeddings are, how LangChain’s Embeddings interface abstracts various providers, compares common models, and walks through a complete Python example that uses a Chinese‑optimized HuggingFace model to generate document and query vectors, compute cosine similarity, and identify the most relevant text.

LangChainNLPPython
0 likes · 9 min read
How Embedding Models Power Semantic Search: A Hands‑On LangChain Guide
Code Mala Tang
Code Mala Tang
Oct 8, 2025 · Artificial Intelligence

Quickly Add NLP to Node Apps with Hugging Face Transformers.js

This tutorial shows how to integrate Hugging Face's open‑source Transformers.js library into Node.js projects, covering setup, the Pipeline API, and practical code examples for sentiment analysis, zero‑shot classification, text generation, translation, and question answering, while also discussing when to prefer Python alternatives.

NLPPipelineTransformers.js
0 likes · 15 min read
Quickly Add NLP to Node Apps with Hugging Face Transformers.js
Architect
Architect
Sep 16, 2025 · Artificial Intelligence

Why Transformers Outperform RNNs: A Beginner’s Guide to Attention and Architecture

This article introduces the Transformer architecture, explaining its attention mechanism, encoder‑decoder design, training and inference processes, and why it surpasses RNN‑based models, while also covering common applications and variations in natural language processing.

Deep LearningModel architectureNLP
0 likes · 13 min read
Why Transformers Outperform RNNs: A Beginner’s Guide to Attention and Architecture
Model Perspective
Model Perspective
Sep 8, 2025 · Artificial Intelligence

How to Build a Dynamically Updating Knowledge Graph for Mathematical Modeling

This article explains how to construct and continuously update a knowledge graph from mathematical modeling solutions, detailing extraction of entities, relations, attributes, and strategies, and showing how dynamic graphs enable intelligent recommendation, research support, and teaching assistance.

Dynamic UpdateKnowledge GraphNLP
0 likes · 9 min read
How to Build a Dynamically Updating Knowledge Graph for Mathematical Modeling
Instant Consumer Technology Team
Instant Consumer Technology Team
Aug 15, 2025 · Artificial Intelligence

Master the iFLYTEK Prohibited Words Classification Challenge: Baselines & BERT

This article introduces the iFLYTEK AI Developer Competition on prohibited‑word classification, outlines the task, dataset, evaluation metric, and provides three baseline solutions—including a logistic‑regression model, a BERT fine‑tuning approach, and a large‑model prompt method—along with code snippets and performance notes.

BERTNLPcompetition
0 likes · 15 min read
Master the iFLYTEK Prohibited Words Classification Challenge: Baselines & BERT
21CTO
21CTO
Jul 1, 2025 · Artificial Intelligence

Large Language Models Explained: Evolution, Architecture & Future Trends

This comprehensive guide traces the origins and development of large language models, explains their transformer-based architecture and self‑attention mechanisms, reviews major models such as GPT, BERT and T5, and discusses practical applications, ethical challenges, resource demands, and future research directions.

AI ethicsNLP
0 likes · 27 min read
Large Language Models Explained: Evolution, Architecture & Future Trends
DataFunSummit
DataFunSummit
Jun 18, 2025 · Artificial Intelligence

How to Upload, Test, and Deploy MiniLM on Modelers.cn: A Step‑by‑Step Guide

This article walks through uploading a MiniLM model to the Modelers.cn community, explains why testing is essential, demonstrates both usability and local tests with openMind, and provides complete Python code for classification and simple question‑answering, enabling developers to quickly deploy and evaluate MiniLM in practice.

MiniLMModel DeploymentNLP
0 likes · 9 min read
How to Upload, Test, and Deploy MiniLM on Modelers.cn: A Step‑by‑Step Guide
Instant Consumer Technology Team
Instant Consumer Technology Team
May 16, 2025 · Artificial Intelligence

Smart AI‑Powered Push Copy: From Templates to Sampling Strategies

This article explores how high‑quality content assets—text, images, and video—drive SEO and user engagement, then delves into the challenges of crafting push‑notification copy and presents an intelligent copy system that uses template and keyword generation, transformer models, BLEU and semantic similarity evaluation, and various sampling strategies to improve relevance and diversity.

AIEvaluation MetricsNLP
0 likes · 30 min read
Smart AI‑Powered Push Copy: From Templates to Sampling Strategies
Didi Tech
Didi Tech
Apr 24, 2025 · Artificial Intelligence

Algorithmic Foundations and Evolution of Natural Language Processing

The article surveys the Algorithmic Foundations of Engineering R&D series, tracing NLP’s evolution from rule‑based systems to today’s multimodal large‑model era, reviewing core machine‑learning and deep‑learning techniques, transformer breakthroughs, representation learning, optimization methods, and emerging research such as retrieval‑augmented generation and AI agents.

AINLPTransformer
0 likes · 43 min read
Algorithmic Foundations and Evolution of Natural Language Processing
JavaEdge
JavaEdge
Mar 30, 2025 · Artificial Intelligence

How GenAI Can Transform E‑Commerce Product Review Analysis

This article examines the critical role of product reviews for buyers and sellers, outlines the limitations of traditional review processing, and proposes a GenAI‑powered solution—including platform and model choices, batch inference, and semantic search—to efficiently analyze large‑scale e‑commerce feedback.

Batch ProcessingGenAINLP
0 likes · 12 min read
How GenAI Can Transform E‑Commerce Product Review Analysis
Code Mala Tang
Code Mala Tang
Mar 27, 2025 · Artificial Intelligence

How Do BPE, WordPiece, and SentencePiece Shape Modern NLP Tokenization?

This article explains the fundamentals, workflows, examples, and trade‑offs of three major subword tokenization algorithms—Byte Pair Encoding, WordPiece, and SentencePiece—helping practitioners choose the right method for their large language model pipelines.

BPENLPSentencePiece
0 likes · 12 min read
How Do BPE, WordPiece, and SentencePiece Shape Modern NLP Tokenization?
Baidu Tech Salon
Baidu Tech Salon
Mar 21, 2025 · Artificial Intelligence

Semantic Embedding with Large Language Models: A Comprehensive Survey

This survey reviews the evolution of semantic embedding—from Word2vec and GloVe to BERT, Sentence‑BERT, and recent contrastive methods—then examines how large language models improve embeddings via synthetic data generation and backbone architectures, detailing techniques such as contrastive prompting, in‑context learning, knowledge distillation, and discussing resource, privacy, and interpretability challenges.

In-Context LearningNLPcontrastive learning
0 likes · 27 min read
Semantic Embedding with Large Language Models: A Comprehensive Survey
Architect
Architect
Mar 19, 2025 · Artificial Intelligence

Choosing the Best Embedding Model for RAG: A Practical Guide Using MTEB Rankings

This guide explains how to leverage the Massive Text Embedding Benchmark (MTEB) to identify high‑performing embedding models for Retrieval‑Augmented Generation (RAG) and outlines key factors such as model size, dimension, language support, resource requirements, inference speed, domain suitability, long‑text handling, scalability, and cost.

AIEmbeddingMTEB
0 likes · 12 min read
Choosing the Best Embedding Model for RAG: A Practical Guide Using MTEB Rankings
JavaEdge
JavaEdge
Mar 16, 2025 · Artificial Intelligence

Boost NLP Data Quality with Multi‑Stage Back‑Translation Augmentation

This article explains the core principles, implementation steps, and practical challenges of using multi‑language back‑translation to enrich text data, provides Python code for a configurable augmentation pipeline, showcases e‑commerce and financial use cases, and presents evaluation metrics that demonstrate significant gains in semantic fidelity and model performance.

NLPPythonText Generation
0 likes · 9 min read
Boost NLP Data Quality with Multi‑Stage Back‑Translation Augmentation
JavaEdge
JavaEdge
Mar 15, 2025 · Artificial Intelligence

Boost NLP Model Performance with n-gram Feature Engineering

This article explains why feature engineering is crucial for NLP tasks, introduces n‑gram enhancements, provides Python implementations for generating bi‑gram and higher‑order features, demonstrates dynamic padding for text length standardization, and offers practical deployment tips such as feature dimension control and monitoring.

Deep LearningN-gramNLP
0 likes · 7 min read
Boost NLP Model Performance with n-gram Feature Engineering
JavaEdge
JavaEdge
Mar 12, 2025 · Artificial Intelligence

How to Analyze Chinese Sentiment Text Data: From Stats to Word Clouds

This article guides Java developers through a complete Chinese sentiment‑analysis dataset exploration, covering label distribution, sentence length statistics, vocabulary counts, adjective extraction, and visual word‑cloud generation using Python libraries such as pandas, seaborn, jieba, and wordcloud.

Data visualizationNLPPython
0 likes · 10 min read
How to Analyze Chinese Sentiment Text Data: From Stats to Word Clouds
Cognitive Technology Team
Cognitive Technology Team
Mar 10, 2025 · Artificial Intelligence

Understanding Transformers: From NLP Challenges to Architecture and Core Mechanisms

This article explains the evolution of natural language processing, the limitations of rule‑based, statistical, and recurrent neural network models, and then introduces the Transformer architecture—covering word and position embeddings, self‑attention, multi‑head attention, Add & Norm, feed‑forward layers, and encoder‑decoder design—to help beginners grasp why Transformers solve key NLP problems.

AINLPSelf-Attention
0 likes · 15 min read
Understanding Transformers: From NLP Challenges to Architecture and Core Mechanisms
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 10, 2025 · Artificial Intelligence

Why Transformers Revolutionized NLP: From Problems to Solutions

This article explains the historical challenges of natural language processing, from rule‑based and statistical models to recurrent networks and their limitations, then introduces the Transformer architecture, its self‑attention mechanism, multi‑head attention, and supporting layers, illustrating how it overcomes previous issues and enables efficient parallel training.

NLPSelf-AttentionTransformer
0 likes · 16 min read
Why Transformers Revolutionized NLP: From Problems to Solutions
DataFunTalk
DataFunTalk
Mar 2, 2025 · Artificial Intelligence

Top 10 AI Research Papers of 2024: Summaries, Contributions, and Practical Uses

This article presents a curated selection of ten groundbreaking 2024 AI research papers, detailing each model’s abstract, key contributions, and practical application scenarios across computer vision, multimodal learning, NLP, and efficient inference, offering readers inspiration and actionable insights for real‑world projects.

2024 researchAINLP
0 likes · 18 min read
Top 10 AI Research Papers of 2024: Summaries, Contributions, and Practical Uses
Test Development Learning Exchange
Test Development Learning Exchange
Jan 22, 2025 · Artificial Intelligence

Comprehensive Guide to Python Data Science Libraries with Code Examples

This article presents a concise tutorial on essential Python data science libraries, covering data cleaning with Pandas, numerical analysis with NumPy and SciPy, visualization with Matplotlib and Seaborn, machine learning with scikit‑learn, NLP with NLTK and spaCy, time‑series modeling, image processing, database access, and parallel computing, each illustrated with ready‑to‑run code examples.

Data ScienceData visualizationNLP
0 likes · 7 min read
Comprehensive Guide to Python Data Science Libraries with Code Examples
Test Development Learning Exchange
Test Development Learning Exchange
Jan 17, 2025 · Artificial Intelligence

Essential Python Libraries for Data Processing, Visualization, and Machine Learning

This article introduces ten essential Python libraries—including SciPy, Matplotlib, Plotly, Scikit‑learn, TensorFlow, spaCy, BeautifulSoup, OpenPyXL, Feather/Parquet, and SQLAlchemy—detailing their primary uses for scientific computing, visualization, machine learning, deep learning, NLP, web scraping, Excel handling, efficient data storage, and ORM, with practical code examples.

Data ScienceNLPPython
0 likes · 8 min read
Essential Python Libraries for Data Processing, Visualization, and Machine Learning
DaTaobao Tech
DaTaobao Tech
Nov 13, 2024 · Artificial Intelligence

Understanding Neural Networks and Transformers: Principles, Implementation, and Applications

The article surveys neural networks from basic neuron operations and loss functions through deep architectures to the Transformer model, detailing embeddings, positional encoding, self‑attention, multi‑head attention, residual links, and encoder‑decoder design, and includes PyTorch code examples for linear regression, translation, and fine‑tuning Hugging Face’s MiniRBT for text classification.

AIAttention MechanismDeep Learning
0 likes · 44 min read
Understanding Neural Networks and Transformers: Principles, Implementation, and Applications
Infra Learning Club
Infra Learning Club
Oct 31, 2024 · Artificial Intelligence

What Is a Token in Large Language Models?

The article explains that a token is the unit processed by large language models, describes three common tokenizer methods—word‑level, character‑level, and sub‑word level—with English and Chinese examples, discusses their advantages and limitations, and shows how OpenAI’s tokenizer varies across model versions.

NLPTokencharacter-level
0 likes · 5 min read
What Is a Token in Large Language Models?
Tencent Cloud Developer
Tencent Cloud Developer
Oct 30, 2024 · Artificial Intelligence

Comprehensive Survey of AIGC Research: Papers, Resources, and Technical Overview

This survey acts as a comprehensive portal that organizes AIGC research across seven domains—text, image, and audio generation, cross‑modal association, text‑guided image and audio synthesis, and supporting resources—detailing seminal models such as GPT, Diffusion, CLIP, DALL·E, Stable Diffusion, MusicLM, and key papers that shaped each field.

AIGCCLIPComputer Vision
0 likes · 19 min read
Comprehensive Survey of AIGC Research: Papers, Resources, and Technical Overview
Architecture Breakthrough
Architecture Breakthrough
Oct 28, 2024 · R&D Management

Organize Your Work Files with an NLP‑Based Role Mapping Method

This article explains why traditional folder structures and tag tools often fail, introduces Neuro‑Linguistic Programming concepts to clarify personal work roles, and shows how mapping those roles to an enterprise‑architecture framework creates a sustainable, searchable file‑organization system.

NLPfile organizationknowledge management
0 likes · 7 min read
Organize Your Work Files with an NLP‑Based Role Mapping Method
DevOps
DevOps
Oct 8, 2024 · Artificial Intelligence

Top 20+ Retrieval‑Augmented Generation (RAG) Interview Questions and Answers

This article presents over twenty essential Retrieval‑Augmented Generation (RAG) interview questions with detailed answers, covering fundamentals, applications, architecture, training, limitations, ethical considerations, and integration, offering AI enthusiasts and job candidates a comprehensive guide to mastering RAG concepts.

AI InterviewNLPRAG
0 likes · 15 min read
Top 20+ Retrieval‑Augmented Generation (RAG) Interview Questions and Answers
58 Tech
58 Tech
Sep 23, 2024 · Artificial Intelligence

Enhancing Commercial Search with Knowledge Graphs and Large‑Model Techniques

This article describes how a commercial search platform iteratively upgrades its system by structuring business knowledge into a knowledge graph, applying multi‑stage entity extraction (CRF, Electra‑CRF, GLM‑3, OCR), and leveraging large language models to improve relevance, user experience, and revenue.

AIKnowledge GraphNLP
0 likes · 14 min read
Enhancing Commercial Search with Knowledge Graphs and Large‑Model Techniques
JD Tech Talk
JD Tech Talk
Aug 29, 2024 · Artificial Intelligence

Content Compliance Domain Overview and Technical Solutions for Price Governance

The article outlines the role of the content compliance domain in e‑commerce, detailing user‑facing issues, business responsibilities, challenges in detection and mitigation, and technical solutions such as comparable‑price models, large‑scale price prediction, and merchant outreach, while also offering personal growth advice for compliance engineers.

AINLPcontent compliance
0 likes · 9 min read
Content Compliance Domain Overview and Technical Solutions for Price Governance
JD Cloud Developers
JD Cloud Developers
Aug 29, 2024 · Artificial Intelligence

How AI Powers E‑Commerce Content Compliance and Price Governance

This article explains how e‑commerce platforms use AI‑driven content compliance to detect malicious products, price manipulation, and counterfeit goods, outlining the technical challenges, core business metrics, model‑based solutions for price over‑pricing, and personal growth advice for compliance engineers.

AIComputer VisionNLP
0 likes · 9 min read
How AI Powers E‑Commerce Content Compliance and Price Governance
DataFunSummit
DataFunSummit
Aug 27, 2024 · Artificial Intelligence

Applying Large Models to Xiao AI Assistant: Intent Routing, Understanding, and Response Generation

This article presents a comprehensive technical overview of how large language models are integrated into Xiaomi's Xiao AI assistant, detailing the architecture for intent routing, domain‑specific intent understanding, function‑calling mechanisms, fine‑tuning strategies, performance gains, and future research directions.

AI AssistantFunction CallingIntent Routing
0 likes · 14 min read
Applying Large Models to Xiao AI Assistant: Intent Routing, Understanding, and Response Generation
Open Source Tech Hub
Open Source Tech Hub
Aug 22, 2024 · Artificial Intelligence

Unlock AI Power in PHP: A Hands‑On Guide to TransformersPHP

TransformersPHP brings Hugging Face’s Transformer models to PHP, enabling developers to run thousands of pre‑trained NLP models locally for tasks like text generation, summarisation, and translation, with simple installation, ONNX‑based execution, and a Python‑like pipeline API.

AINLPONNX
0 likes · 8 min read
Unlock AI Power in PHP: A Hands‑On Guide to TransformersPHP
JD Retail Technology
JD Retail Technology
Aug 16, 2024 · Artificial Intelligence

Interview with JD Retail AI Director Zhai Zhouwei on the Evolution and Future of E‑commerce Search Powered by Large Models

In this interview, JD Retail’s AI director Zhai Zhouwei outlines the four historical stages of e‑commerce search, explains how large‑model AI is reshaping user interaction, retrieval and content generation, discusses practical challenges and solutions, and shares his vision and advice for enterprises adopting these technologies.

AIJD.comNLP
0 likes · 9 min read
Interview with JD Retail AI Director Zhai Zhouwei on the Evolution and Future of E‑commerce Search Powered by Large Models
AntTech
AntTech
Aug 13, 2024 · Artificial Intelligence

Ant Group Contributions to ACL 2024: Summaries of 14 Accepted Papers Across NLP and AI

From August 11‑16, 2024 the ACL conference in Bangkok featured 14 Ant Group papers covering large‑scale information extraction, decomposed LLMs for semantic search, multimodal hallucination detection, long‑context attention mechanisms, concept‑reasoning datasets, knowledge‑graph alignment, and more, highlighting the group's breadth in natural language processing and AI research.

ACL2024Information ExtractionNLP
0 likes · 20 min read
Ant Group Contributions to ACL 2024: Summaries of 14 Accepted Papers Across NLP and AI
Kuaishou Tech
Kuaishou Tech
Jul 23, 2024 · Artificial Intelligence

Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models

This paper introduces Parrot, a system that enhances large language models' (LLMs) multi-turn instruction following capabilities through context-aware preference optimization (CaPO) and synthetic data generation, achieving significant performance improvements with limited training data.

CaPONLPdata synthesis
0 likes · 9 min read
Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models
JD Tech Talk
JD Tech Talk
Jul 23, 2024 · Artificial Intelligence

Intelligent Parcel Identification Using Large Language Models in JD Express Logistics

This article examines how JD Express applies large‑language‑model‑based natural language processing to accurately recognize and classify shipped items, addressing low matching rates, improving packaging recommendations, reducing damage and claims, and outlining architecture, model selection criteria, caching strategies, and future operational benefits.

AIJD ExpressNLP
0 likes · 21 min read
Intelligent Parcel Identification Using Large Language Models in JD Express Logistics
Ops Development & AI Practice
Ops Development & AI Practice
Jun 22, 2024 · Artificial Intelligence

Why Transformers Revolutionized AI: From NLP to Vision and Speech

Transformers, introduced in 2017, have reshaped neural networks by leveraging attention mechanisms to outperform RNNs and CNNs across NLP, computer vision, and speech tasks, offering parallel processing, long‑range dependency capture, and versatile applications such as translation, text generation, image classification, and speech recognition.

Attention MechanismComputer VisionDeep Learning
0 likes · 6 min read
Why Transformers Revolutionized AI: From NLP to Vision and Speech
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 11, 2024 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation: Challenges, Paradigms, and Engineering Best Practices

This article explores Retrieval‑Augmented Generation (RAG) by outlining its background, inherent challenges such as knowledge limits and hallucinations, describing the Naïve, Advanced, and Modular RAG paradigms, and presenting practical engineering strategies for pre‑retrieval, retrieval, and post‑retrieval optimization.

Knowledge RetrievalNLPRAG
0 likes · 25 min read
Mastering Retrieval‑Augmented Generation: Challenges, Paradigms, and Engineering Best Practices
Model Perspective
Model Perspective
May 29, 2024 · Artificial Intelligence

How to Build Word Vectors from Scratch: A Step‑by‑Step Guide

This article explains the fundamentals of word vectors in NLP, walks through constructing them via co‑occurrence matrices and dimensionality reduction, demonstrates the process with a concrete example and Python code, and evaluates the resulting embeddings using cosine similarity.

NLPPythonSVD
0 likes · 7 min read
How to Build Word Vectors from Scratch: A Step‑by‑Step Guide
Test Development Learning Exchange
Test Development Learning Exchange
Apr 20, 2024 · Artificial Intelligence

Implementing a Simple University Paper Plagiarism Detection System in Python

This article outlines the design and implementation of a basic university paper plagiarism detection system using Python, covering text preprocessing with NLTK, TF‑IDF weighting, cosine similarity calculation, and a sample in‑memory paper database, while also discussing scalability, UI, and legal considerations.

Cosine SimilarityNLPPython
0 likes · 10 min read
Implementing a Simple University Paper Plagiarism Detection System in Python
Top Architect
Top Architect
Apr 18, 2024 · Artificial Intelligence

Understanding Transformers: Architecture, Attention Mechanism, Training and Inference

This article provides a comprehensive overview of Transformer models, covering their attention-based architecture, encoder-decoder structure, training procedures including teacher forcing, inference workflow, advantages over RNNs, and various applications in natural language processing such as translation, summarization, and classification.

Attention MechanismDeep LearningInference
0 likes · 11 min read
Understanding Transformers: Architecture, Attention Mechanism, Training and Inference
Alimama Tech
Alimama Tech
Apr 17, 2024 · Artificial Intelligence

Applying Large Language Models to Advertising Copy Generation

The article examines how large language models can streamline advertising copy creation by addressing format diversity, creativity, and new media demands, detailing model evaluation, fine‑tuning of Chinese‑adapted LLMs—ultimately selecting QWen 1.5‑7B—and showing that deployment boosts copy quality, click‑through and conversion rates while outlining future personalization and data‑efficient scaling.

AICopy GenerationFine-tuning
0 likes · 18 min read
Applying Large Language Models to Advertising Copy Generation
Airbnb Technology Team
Airbnb Technology Team
Apr 15, 2024 · Artificial Intelligence

Airbnb's Attribute Prioritization System: Machine Learning for Extracting Guest Preferences from Unstructured Text

Airbnb’s Attribute Prioritization System uses a machine‑learning pipeline called LATEX to extract and map guest‑mentioned amenities, activities and places from reviews, messages and tickets, then predicts and ranks the most important attributes per listing, giving hosts personalized suggestions to improve listings and match traveler needs.

AirbnbNERNLP
0 likes · 9 min read
Airbnb's Attribute Prioritization System: Machine Learning for Extracting Guest Preferences from Unstructured Text
DaTaobao Tech
DaTaobao Tech
Mar 29, 2024 · Artificial Intelligence

Text-to-SQL with Large Language Models: DIN-SQL Approach

The DIN‑SQL approach enhances Text‑to‑SQL performance by using large language models in a decomposed in‑context learning framework with schema linking, query classification, SQL generation, and self‑correction modules, achieving state‑of‑the‑art 85.3% execution accuracy on the Spider benchmark by breaking complex queries into manageable sub‑tasks.

AI researchDatabase QueryingNLP
0 likes · 34 min read
Text-to-SQL with Large Language Models: DIN-SQL Approach
Architect
Architect
Mar 26, 2024 · Artificial Intelligence

Why Transformers Outperform RNNs: A Deep Dive into Architecture and Training

This article explains the Transformer model’s core architecture, self‑attention mechanism, encoder‑decoder workflow, training with teacher forcing, inference steps, and why it surpasses RNNs and CNNs, while also outlining its major NLP applications.

Attention MechanismInferenceModel Training
0 likes · 14 min read
Why Transformers Outperform RNNs: A Deep Dive into Architecture and Training
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Mar 22, 2024 · Artificial Intelligence

Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration

This article explains how traditional keyword search with Elasticsearch often yields inaccurate or incomplete results for document retrieval, introduces vectorization and semantic search using NLP embeddings, and demonstrates a practical workflow that combines these techniques with the Milvus vector database to achieve more accurate and efficient document search.

AIElasticsearchMilvus
0 likes · 13 min read
Improving Document Search with Vector Search: From Elasticsearch Limitations to Milvus Integration
Architect
Architect
Mar 19, 2024 · Artificial Intelligence

How Transformers Power Modern NLP: A Deep Dive into Encoder‑Decoder Mechanics

This article explains the core principles of Transformer models—covering input embeddings, self‑attention, multi‑head attention, positional encoding, feed‑forward networks, and decoder strategies—using concrete examples like "The cat sat on the mat" and "The quick brown fox jumps over the lazy dog" to illustrate each step.

Encoder-DecoderFeed-Forward NetworkNLP
0 likes · 13 min read
How Transformers Power Modern NLP: A Deep Dive into Encoder‑Decoder Mechanics
Open Source Tech Hub
Open Source Tech Hub
Mar 12, 2024 · Artificial Intelligence

Step-by-Step Guide to Install ModelScope and Perform NLP Inference in Python & PHP

This guide walks you through setting up a Conda Python environment, installing PyTorch and the ModelScope library, running NLP pipelines for tasks like word segmentation and text classification, and calling ModelScope models from PHP using the PHPY extension, complete with code examples and troubleshooting tips.

AI inferenceModelScopeNLP
0 likes · 14 min read
Step-by-Step Guide to Install ModelScope and Perform NLP Inference in Python & PHP
DevOps
DevOps
Mar 5, 2024 · Artificial Intelligence

Understanding GPT‑4, ChatGPT, and the Foundations of Large Language Models

This article explains the fundamentals of AI, machine learning, deep learning, and natural language processing, describes how Transformer architectures and attention mechanisms power large language models such as GPT‑4 and ChatGPT, and walks through tokenization, prediction, and practical development with Python.

ChatGPTGPT-4NLP
0 likes · 16 min read
Understanding GPT‑4, ChatGPT, and the Foundations of Large Language Models
NewBeeNLP
NewBeeNLP
Feb 25, 2024 · Interview Experience

Comprehensive Interview Question Cheat Sheet for Top Tech Companies

This article compiles a detailed list of interview question topics from leading tech firms—including search, algorithm engineering, NLP, multimodal LLMs, advertising, recommendation, risk control, and big‑data domains—covering algorithms, system design, machine‑learning concepts, and practical coding challenges.

AlgorithmsBig DataNLP
0 likes · 10 min read
Comprehensive Interview Question Cheat Sheet for Top Tech Companies
JD Tech
JD Tech
Feb 23, 2024 · Operations

Optimizing IT Ticket Management: Process Reengineering, Automation, Knowledge‑Base Pre‑position and Intelligent Q&A Integration

This article presents a comprehensive case study of how an IT operations team reduced weekly ticket volume from over 150 to fewer than 20 by classifying issues, redesigning workflows, introducing automation tools, deploying a knowledge‑base pre‑position strategy, and integrating an intelligent question‑answering chatbot.

ElasticsearchIT OperationsIntelligent QA
0 likes · 11 min read
Optimizing IT Ticket Management: Process Reengineering, Automation, Knowledge‑Base Pre‑position and Intelligent Q&A Integration
Model Perspective
Model Perspective
Feb 13, 2024 · Big Data

Mastering Noisy Data: From Cleaning to Visualization and NLP with Python

This article reviews the key concepts from the Bad Data Handbook, covering noise identification, data validation, human readability, web data restructuring, special domain challenges, and data quality analysis, while also presenting practical data visualization techniques, popular analysis tools, Python web‑scraping libraries, and a basic NLP workflow with code examples.

Data visualizationNLPPython
0 likes · 20 min read
Mastering Noisy Data: From Cleaning to Visualization and NLP with Python
Airbnb Technology Team
Airbnb Technology Team
Jan 31, 2024 · Artificial Intelligence

Airbnb’s Listing Attribute Extraction Platform (LAEP): End-to-End Structured Information Extraction Using Machine Learning and NLP

Airbnb’s Listing Attribute Extraction Platform (LAEP) uses a custom NER model, word‑embedding mapping, and a BERT‑based scorer to automatically pull, normalize, and validate structured attributes from hosts’ unstructured text, boosting coverage for downstream tools and enhancing guest‑host matching at scale.

AirbnbBERTNER
0 likes · 11 min read
Airbnb’s Listing Attribute Extraction Platform (LAEP): End-to-End Structured Information Extraction Using Machine Learning and NLP
Bilibili Tech
Bilibili Tech
Jan 16, 2024 · Artificial Intelligence

Design and Implementation of Bilibili's Intelligent Customer Service System

Bilibili created an AI‑powered customer‑service platform that integrates a WeChat Work chat UI, a conversation state machine, Elasticsearch and LLM‑based FAQ retrieval, and a low‑code admin console, replacing a thousand‑person manual support team, handling thousands of queries, boosting resolution rates and enabling cross‑team reuse.

ChatbotElasticsearchIntelligent Customer Service
0 likes · 13 min read
Design and Implementation of Bilibili's Intelligent Customer Service System
High Availability Architecture
High Availability Architecture
Jan 9, 2024 · Operations

AIOps Practices for Incident Management at Meituan: From Risk Prevention to Post‑Operation

This article presents Meituan's two‑year exploration of AIOps in incident management, detailing risk‑prevention change detection, real‑time anomaly discovery, automated root‑cause diagnosis, multi‑dimensional KPI analysis, and similar‑event recommendation, while sharing architectural designs, algorithmic techniques, performance results, and future directions.

NLPOperationsRoot Cause Analysis
0 likes · 24 min read
AIOps Practices for Incident Management at Meituan: From Risk Prevention to Post‑Operation
政采云技术
政采云技术
Dec 19, 2023 · Backend Development

Principles and Simple Implementation of a Search Engine in Go

This article explains the fundamental concepts of search engine technology—including forward and inverted indexes, tokenizers, stop words, synonym handling, ranking algorithms, and NLP integration—and provides a concise Go implementation with code examples and performance testing.

GoNLPTokenizer
0 likes · 21 min read
Principles and Simple Implementation of a Search Engine in Go
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Dec 13, 2023 · Artificial Intelligence

Comprehensive Overview of BERT: Architecture, Pre‑training Tasks, and Applications

This article provides a detailed introduction to BERT, covering its bidirectional transformer encoder design, pre‑training objectives such as Masked Language Modeling and Next Sentence Prediction, model configurations, differences from GPT/ELMo, and a wide range of downstream NLP applications.

BERTMasked Language ModelNLP
0 likes · 17 min read
Comprehensive Overview of BERT: Architecture, Pre‑training Tasks, and Applications
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Dec 11, 2023 · Artificial Intelligence

How Hyperbolic Space and Contrastive Learning Boost Domain-Specific Language Models

This article introduces the KANGAROO model, which injects hierarchical semantic information via hyperbolic space and leverages contrastive learning on dense subgraph structures to overcome global sparsity in vertical‑domain knowledge‑enhanced pre‑trained language models, and evaluates its performance on finance and medical tasks.

NLPcontrastive learningdomain adaptation
0 likes · 10 min read
How Hyperbolic Space and Contrastive Learning Boost Domain-Specific Language Models
HomeTech
HomeTech
Dec 6, 2023 · Artificial Intelligence

Metaverse-Based Virtual Humans: Technologies and Applications in Intelligent Q&A

This article explores the concept of the metaverse and virtual humans, detailing 3D modeling techniques, NLP-driven language understanding, streaming TTS, VR/AR interaction, AIGC content generation, and the deployment of a large‑model intelligent Q&A system with real‑time facial expression synthesis for virtual anchors.

3D ModelingAIGCMetaverse
0 likes · 8 min read
Metaverse-Based Virtual Humans: Technologies and Applications in Intelligent Q&A
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Dec 4, 2023 · Artificial Intelligence

An Overview of BERT: Architecture, Pre‑training Tasks, Comparisons, and Applications

This article provides a comprehensive English overview of BERT, covering its original paper, model architecture, pre‑training objectives (Masked Language Model and Next Sentence Prediction), differences from ELMo, GPT and vanilla Transformers, parameter counts, main contributions, and a range of NLP application scenarios such as text classification, sentiment analysis, NER, and machine translation.

BERTNLPNext Sentence Prediction
0 likes · 16 min read
An Overview of BERT: Architecture, Pre‑training Tasks, Comparisons, and Applications
DaTaobao Tech
DaTaobao Tech
Dec 1, 2023 · Artificial Intelligence

Design, Evaluation, and Production of a VOC Tagging System for Taobao User Experience

Taobao’s Technical Industry Data team designed a four‑level VOC tagging hierarchy to unify fragmented user‑feedback sources, evaluated label similarity with vector‑based distance matrices, optimized tag groups via entropy‑driven re‑grouping, built a stacking ensemble of FastText and TextCNN achieving over 90% accuracy, and deployed an automated production pipeline that generates tags, maintains ODPS tables, and provides APIs for rapid experimentation.

Data ScienceNLPTagging
0 likes · 18 min read
Design, Evaluation, and Production of a VOC Tagging System for Taobao User Experience
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 26, 2023 · Artificial Intelligence

Overview of T5 (Text-to-Text Transfer Transformer): Architecture, Variants, Experiments, and Applications

This article provides a comprehensive overview of Google's T5 model, detailing its unified text‑to‑text formulation, encoder‑decoder architecture, three model variants, attention mask designs, training strategies, model sizes, experimental results, and key contributions to natural language processing.

NLPT5Text-to-Text
0 likes · 14 min read
Overview of T5 (Text-to-Text Transfer Transformer): Architecture, Variants, Experiments, and Applications
DataFunSummit
DataFunSummit
Nov 20, 2023 · Artificial Intelligence

Personalized Title Generation and Automatic Cover Image Synthesis for Content Feeds

This article presents a comprehensive overview of personalized title generation—covering keyword‑based, click‑sequence‑based, and author‑style‑based methods using transformer and LSTM models—and describes an end‑to‑end pipeline for automatic cover image synthesis that combines image restoration, Seq2Seq key‑phrase extraction, object detection, and layout generation to improve user engagement in information‑flow scenarios.

AINLPTransformer
0 likes · 12 min read
Personalized Title Generation and Automatic Cover Image Synthesis for Content Feeds
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Nov 12, 2023 · Artificial Intelligence

A Comprehensive Introduction to RNN, LSTM, Attention Mechanisms, and Transformers for Large Language Models

This article provides a thorough overview of large language models, explaining the relationship between NLP and LLMs, the evolution from RNN to LSTM, the fundamentals of attention mechanisms, and the architecture and operation of Transformer models, all illustrated with clear examples and diagrams.

LSTMNLPRNN
0 likes · 25 min read
A Comprehensive Introduction to RNN, LSTM, Attention Mechanisms, and Transformers for Large Language Models
Test Development Learning Exchange
Test Development Learning Exchange
Nov 11, 2023 · Artificial Intelligence

Python Techniques for Comprehensive Text Data Analysis

This guide demonstrates how to use Python for end‑to‑end text data analysis, covering preprocessing, word‑frequency visualization, classification, sentiment detection, similarity measurement, entity recognition, keyword extraction, summarization, translation, and generation with clear code examples.

NLPPythonSentiment Analysis
0 likes · 6 min read
Python Techniques for Comprehensive Text Data Analysis
ZhongAn Tech Team
ZhongAn Tech Team
Oct 20, 2023 · Artificial Intelligence

Document Analytics & Anti‑Fraud Support Platform for Hong Kong Virtual Banking

This article describes the design and implementation of a Document Analytics & Anti‑Fraud Support platform for Hong Kong virtual banking, detailing its OCR/NLP‑driven pipeline, dynamic rule engine, multi‑template PDF processing, model training, and the resulting improvements in fraud detection and operational efficiency.

NLPOCRanti-fraud
0 likes · 18 min read
Document Analytics & Anti‑Fraud Support Platform for Hong Kong Virtual Banking
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Oct 19, 2023 · Artificial Intelligence

NLP Basics: Word Embeddings, Word2Vec, and Hand‑crafted RNN Implementation in PyTorch

This article introduces word‑level representations—from one‑hot encoding to dense word embeddings via Word2Vec—explains cosine similarity, then walks through the structure, limitations, and PyTorch implementation of a vanilla RNN, including a custom forward function and verification against the library API.

Cosine SimilarityNLPPyTorch
0 likes · 19 min read
NLP Basics: Word Embeddings, Word2Vec, and Hand‑crafted RNN Implementation in PyTorch
Architect
Architect
Oct 12, 2023 · Artificial Intelligence

Evolution of Language Models: From Statistical N‑grams to GPT‑4

This article provides a comprehensive overview of natural language processing and language‑model research, tracing the historical development from early rule‑based and statistical N‑gram models through neural network approaches such as RNN, LSTM, ELMo, and Transformer, and detailing the architectures, strengths, and limitations of the GPT series up to GPT‑4, while also discussing evaluation metrics, practical applications, and future challenges.

GPTNLPartificial intelligence
0 likes · 34 min read
Evolution of Language Models: From Statistical N‑grams to GPT‑4
Sohu Tech Products
Sohu Tech Products
Oct 11, 2023 · Artificial Intelligence

EcomGPT: Training an E-commerce Domain Large Language Model via Instruction Tuning

EcomGPT, an Alibaba‑trained e‑commerce large language model, uses a 1.5 million‑sample instruction dataset (EcomInstruct) to demonstrate that domain‑specific instruction tuning dramatically outperforms general‑purpose models on e‑commerce tasks, reducing hallucinations and improving task accuracy, with performance scaling as data diversity increases.

Alibaba NLPDomain-Specific AIEcomGPT
0 likes · 7 min read
EcomGPT: Training an E-commerce Domain Large Language Model via Instruction Tuning
DataFunSummit
DataFunSummit
Oct 8, 2023 · Artificial Intelligence

NLP Techniques for Financial Risk Control: Text Modeling, Non‑Text Modeling, Long‑Text Handling, Multi‑Modal Fusion and Sample Optimization

This article presents a comprehensive overview of how natural language processing is applied to financial risk control, covering text and non‑text sequence modeling, tokenization strategies, transformer‑based long‑text architectures, multi‑modal fusion methods, pre‑training techniques and practical sample‑optimization approaches.

AINLPText Modeling
0 likes · 22 min read
NLP Techniques for Financial Risk Control: Text Modeling, Non‑Text Modeling, Long‑Text Handling, Multi‑Modal Fusion and Sample Optimization
DataFunSummit
DataFunSummit
Oct 2, 2023 · Artificial Intelligence

WeChat NLP Algorithm Microservice Governance: Challenges and Solutions

This article examines the governance of WeChat's NLP algorithm microservices, outlining the management, performance, and scheduling challenges they face and presenting solutions such as automated CI/CD pipelines, dynamic scaling, DAG‑based service composition, a custom tracing system, the PyInter interpreter, and an improved load‑balancing algorithm.

MicroservicesModel ServingNLP
0 likes · 12 min read
WeChat NLP Algorithm Microservice Governance: Challenges and Solutions
Zhuanzhuan Tech
Zhuanzhuan Tech
Sep 28, 2023 · Artificial Intelligence

Evolution of Language Models and an Overview of the GPT Series

This article surveys the development of natural language processing from early rule‑based systems through statistical n‑gram models, neural language models, RNNs, LSTMs, ELMo, Transformers and BERT, and then details the architecture, training methods, advantages and limitations of the GPT‑1, GPT‑2, GPT‑3, ChatGPT and GPT‑4 models, concluding with a discussion of future challenges and references.

Deep LearningGPTNLP
0 likes · 30 min read
Evolution of Language Models and an Overview of the GPT Series
Alibaba Cloud Developer
Alibaba Cloud Developer
Sep 4, 2023 · Artificial Intelligence

Hands‑On Building a Transformer from Scratch with PyTorch

This tutorial walks you through implementing a full Transformer model in PyTorch, starting from basic linear‑regression code, adding attention mechanisms, multi‑head attention, encoder‑decoder architecture, training loops, and inference, all reinforced with practical debugging tips.

Deep LearningNLPPyTorch
0 likes · 17 min read
Hands‑On Building a Transformer from Scratch with PyTorch