Tag

BERT

0 views collected around this technical thread.

Architects' Tech Alliance
Architects' Tech Alliance
Jun 11, 2025 · Artificial Intelligence

From Transformers to DeepSeek‑R1: The 2017‑2025 Evolution of Large Language Models

This article chronicles the rapid development of large language models from the 2017 Transformer breakthrough through the rise of BERT, GPT‑3, ChatGPT, multimodal systems like GPT‑4V/o, and the recent cost‑efficient DeepSeek‑R1, highlighting key architectural innovations, scaling trends, alignment techniques, and their transformative impact on AI research and industry.

AI alignmentBERTGPT
0 likes · 26 min read
From Transformers to DeepSeek‑R1: The 2017‑2025 Evolution of Large Language Models
ZhongAn Tech Team
ZhongAn Tech Team
Dec 28, 2024 · Artificial Intelligence

Weekly AI Digest Issue 8: OpenAI Robotics, ModernBERT Upgrade, Spatial Cognition, LLM Agent Evolution, and GNN‑LLM Fusion

This issue surveys recent AI developments, covering OpenAI's renewed robot program, the ModernBERT encoder upgrade, spatial reasoning advances in multimodal models, automated environment generation for LLM agents, and a novel GNN‑LLM approach for label‑free node classification.

Artificial IntelligenceBERTGraph Neural Networks
0 likes · 10 min read
Weekly AI Digest Issue 8: OpenAI Robotics, ModernBERT Upgrade, Spatial Cognition, LLM Agent Evolution, and GNN‑LLM Fusion
DataFunSummit
DataFunSummit
Jul 22, 2024 · Artificial Intelligence

From BERT to LLM: Language Model Applications in 360 Advertising Recommendation

This talk explores how 360's advertising recommendation system leverages language models—from BERT to large‑scale LLMs—to improve user interest modeling, feature extraction, and conversion‑rate prediction, detailing practical challenges, engineering solutions, experimental results, and future research directions.

BERTLLMRecommendation systems
0 likes · 18 min read
From BERT to LLM: Language Model Applications in 360 Advertising Recommendation
Airbnb Technology Team
Airbnb Technology Team
Jan 31, 2024 · Artificial Intelligence

Airbnb’s Listing Attribute Extraction Platform (LAEP): End-to-End Structured Information Extraction Using Machine Learning and NLP

Airbnb’s Listing Attribute Extraction Platform (LAEP) uses a custom NER model, word‑embedding mapping, and a BERT‑based scorer to automatically pull, normalize, and validate structured attributes from hosts’ unstructured text, boosting coverage for downstream tools and enhancing guest‑host matching at scale.

AirbnbBERTNER
0 likes · 11 min read
Airbnb’s Listing Attribute Extraction Platform (LAEP): End-to-End Structured Information Extraction Using Machine Learning and NLP
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Dec 20, 2023 · Artificial Intelligence

BERT Model Overview: Inputs, Encoder, Fine‑tuning, and Variants

This article explains BERT's WordPiece tokenization, input embeddings (token, segment, and position embeddings), encoder architecture for Base and Large models, fine‑tuning strategies for various NLP tasks, and introduces popular variants such as RoBERTa and ALBERT.

BERTFine-tuningNLP
0 likes · 12 min read
BERT Model Overview: Inputs, Encoder, Fine‑tuning, and Variants
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Dec 13, 2023 · Artificial Intelligence

Comprehensive Overview of BERT: Architecture, Pre‑training Tasks, and Applications

This article provides a detailed introduction to BERT, covering its bidirectional transformer encoder design, pre‑training objectives such as Masked Language Modeling and Next Sentence Prediction, model configurations, differences from GPT/ELMo, and a wide range of downstream NLP applications.

BERTMasked Language ModelNLP
0 likes · 17 min read
Comprehensive Overview of BERT: Architecture, Pre‑training Tasks, and Applications
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Dec 4, 2023 · Artificial Intelligence

An Overview of BERT: Architecture, Pre‑training Tasks, Comparisons, and Applications

This article provides a comprehensive English overview of BERT, covering its original paper, model architecture, pre‑training objectives (Masked Language Model and Next Sentence Prediction), differences from ELMo, GPT and vanilla Transformers, parameter counts, main contributions, and a range of NLP application scenarios such as text classification, sentiment analysis, NER, and machine translation.

BERTMasked Language ModelNLP
0 likes · 16 min read
An Overview of BERT: Architecture, Pre‑training Tasks, Comparisons, and Applications
Baidu Geek Talk
Baidu Geek Talk
Nov 2, 2023 · Artificial Intelligence

AI-Powered Code Defect Detection: Leveraging Code Knowledge Graphs and Large Language Models

The paper presents an AI‑driven static analysis framework that builds code knowledge graphs to extract relevant slices and leverages large language models for multilingual defect prediction, achieving up to 80% F1, detecting 662 defects across 1,100 C++ modules with a 26.9% recall gain over traditional rule‑based scanners.

BERTStatic Analysiscode defect detection
0 likes · 9 min read
AI-Powered Code Defect Detection: Leveraging Code Knowledge Graphs and Large Language Models
Zhuanzhuan Tech
Zhuanzhuan Tech
Oct 11, 2023 · Artificial Intelligence

Building a ChatGPT‑Based Intelligent Customer Service System with BERT Classification and Knowledge Filtering

This article describes how to construct an intelligent customer‑service assistant using ChatGPT for natural‑language understanding, BERT for user‑question classification, and Sentence‑BERT for knowledge‑selection, detailing system architecture, prompt design, model training, performance results, and practical cost reductions.

BERTChatGPTIntelligent Customer Service
0 likes · 16 min read
Building a ChatGPT‑Based Intelligent Customer Service System with BERT Classification and Knowledge Filtering
Baidu Tech Salon
Baidu Tech Salon
Sep 20, 2023 · Artificial Intelligence

Live Session: Introduction to NVIDIA Nsight Systems and Compute for AI Performance Analysis

In a live session, NVIDIA senior deep‑learning solutions architect Zhai Jian demonstrates how to use Nsight Systems and Nsight Compute to analyze a simple neural‑network training workload, accelerate BERT with mixed precision, and examine matrix‑transpose kernels, with registration via QR code and a detailed event schedule.

AI toolsBERTGPU performance
0 likes · 2 min read
Live Session: Introduction to NVIDIA Nsight Systems and Compute for AI Performance Analysis
HelloTech
HelloTech
Sep 13, 2023 · Artificial Intelligence

AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition

The Haro AI platform automates customer‑service ticket routing by applying a four‑step pipeline—feature processing, model training, evaluation, and deployment—using BERT/ALBERT‑based intent recognition, configurable feature storage, AutoML or expert modes, and Faas‑style deployment, as demonstrated in the Universal Ticket System case study, dramatically improving accuracy and efficiency.

AI PlatformALBERTBERT
0 likes · 11 min read
AI Platform‑Powered Automated Ticket Routing: Modeling Workflow, Feature Engineering, and Intent Recognition
Sohu Tech Products
Sohu Tech Products
Jul 26, 2023 · Artificial Intelligence

Attention Mechanism, Transformer Architecture, and BERT: An In-Depth Overview

This article provides a comprehensive overview of the attention mechanism, its mathematical foundations, the transformer model architecture—including encoder and decoder components—and the BERT pre‑training model, detailing their principles, implementations, and applications in natural language processing.

BERTEncoder-DecoderNLP
0 likes · 13 min read
Attention Mechanism, Transformer Architecture, and BERT: An In-Depth Overview
HomeTech
HomeTech
Jul 7, 2023 · Artificial Intelligence

Multi-Modal Video Understanding and AIGC Video Generation at Autohome

This article presents a comprehensive multi-modal video understanding system for AIGC video generation, detailing technical architecture, GCN-based semi-supervised learning, and practical applications across automotive content scenarios.

AIGCBERTNeXtVLAD
0 likes · 8 min read
Multi-Modal Video Understanding and AIGC Video Generation at Autohome
Xianyu Technology
Xianyu Technology
Feb 22, 2023 · Artificial Intelligence

Integrating Retrieval and Generation Tasks for Deep Semantic Matching in Xianyu Search

The paper introduces SimBert, a later‑fusion model that jointly trains a dual‑tower retrieval component and an auxiliary generation task on the item tower, using a two‑stage pre‑training and fine‑tuning pipeline, which yields a 3.6% relevance boost and reduces bad‑case rates in Xianyu search.

BERTSearch Relevancedeep learning
0 likes · 8 min read
Integrating Retrieval and Generation Tasks for Deep Semantic Matching in Xianyu Search
DataFunSummit
DataFunSummit
Feb 3, 2023 · Artificial Intelligence

Interactive BERT for Relevance in Health E‑commerce Search

This article presents an in‑depth exploration of an interactive BERT‑based relevance model for health e‑commerce search, detailing the business context, query and product feature extraction, domain‑specific sample generation, model architecture enhancements, offline and online performance gains, and practical deployment through knowledge distillation.

AIBERTSearch Relevance
0 likes · 14 min read
Interactive BERT for Relevance in Health E‑commerce Search
DataFunTalk
DataFunTalk
Jan 11, 2023 · Artificial Intelligence

Exploring Interactive BERT for Relevance in Health E‑commerce Search

This article presents a comprehensive overview of Alibaba Health's interactive BERT approach for improving relevance in health e‑commerce search, covering business background, model design, domain‑specific data construction, knowledge‑distilled twin‑tower deployment, experimental results, and a detailed Q&A session.

AIBERTSearch Relevance
0 likes · 14 min read
Exploring Interactive BERT for Relevance in Health E‑commerce Search
Ctrip Technology
Ctrip Technology
Nov 10, 2022 · Artificial Intelligence

Improving Search Intent Recognition and Term Weighting with Deep Learning and Model Distillation at Ctrip

This article describes how Ctrip's R&D team applied deep‑learning models, BERT‑based embeddings, knowledge distillation, and term‑weighting techniques to enhance e‑commerce search intent recognition and term importance estimation, achieving high accuracy while meeting sub‑10 ms latency requirements.

BERTModel Distillationdeep learning
0 likes · 12 min read
Improving Search Intent Recognition and Term Weighting with Deep Learning and Model Distillation at Ctrip
DataFunTalk
DataFunTalk
Sep 9, 2022 · Artificial Intelligence

AI-Powered Music Comment Moderation and Ranking: Models, Challenges, and Business Impact

This article presents a comprehensive overview of AI-driven music comment moderation and ranking systems, detailing business scenarios, model architectures, data processing techniques, performance improvements, and future directions for both QQ Music and K‑Song platforms.

AIBERTNLP
0 likes · 17 min read
AI-Powered Music Comment Moderation and Ranking: Models, Challenges, and Business Impact
DataFunSummit
DataFunSummit
Jul 7, 2022 · Artificial Intelligence

Discovering and Enhancing Robustness in Low‑Resource Information Extraction

This article examines the robustness challenges of information extraction tasks such as NER and relation extraction, introduces the Entity Coverage Ratio metric, analyzes why pretrained models like BERT may “take shortcuts,” and proposes evaluation tools and training strategies—including mutual‑information‑based methods, negative‑training, and flooding—to improve model robustness across diverse scenarios.

BERTInformation ExtractionNamed entity recognition
0 likes · 12 min read
Discovering and Enhancing Robustness in Low‑Resource Information Extraction
Ctrip Technology
Ctrip Technology
Jun 16, 2022 · Artificial Intelligence

Entity Linking System for Travel Knowledge Graph at Ctrip AI R&D

The article presents Ctrip's travel AI team's end‑to‑end entity linking solution built on a large‑scale tourism knowledge graph, detailing its background, technical architecture, core modules—including mention detection, candidate generation, and disambiguation using BERT and prefix‑tree techniques—and real‑world applications such as search, intelligent客服, and POI data maintenance.

BERTNLPentity linking
0 likes · 18 min read
Entity Linking System for Travel Knowledge Graph at Ctrip AI R&D