Tag

AI architecture

2 views collected around this technical thread.

DataFunSummit
DataFunSummit
Jun 2, 2025 · Artificial Intelligence

Enterprise Knowledge Brain Powered by Large Models and Knowledge Graphs

This article explains how the rapid development of large language models and knowledge graph technologies creates new opportunities for enterprise knowledge management, outlines the challenges of massive unstructured data, describes the architecture and core data flow of a corporate knowledge brain, and showcases key technologies and real‑world applications.

AI architectureLarge Modelsdata integration
0 likes · 13 min read
Enterprise Knowledge Brain Powered by Large Models and Knowledge Graphs
Tencent Technical Engineering
Tencent Technical Engineering
Apr 14, 2025 · Artificial Intelligence

MCP Protocol: Technical Principles and Business Applications

The article examines the Model Context Protocol (MCP), detailing its microkernel‑based technical architecture, development timeline from Anthropic’s 2024 release to industry adoption, hands‑on implementation examples, and business use cases such as multi‑agent QQ robots, highlighting MCP’s potential to standardize AI tool integration across industries.

AI applicationsAI architectureBusiness Implementation
0 likes · 14 min read
MCP Protocol: Technical Principles and Business Applications
Tencent Cloud Developer
Tencent Cloud Developer
Apr 10, 2025 · Artificial Intelligence

The Magic of GPT‑4o: Technical Overview and Speculated Architecture

GPT‑4o combines extremely long‑form text generation, high‑quality image creation and interactive editing by likely using an autoregressive multimodal transformer that tokenizes visuals via VQ‑VAE/GAN pipelines, trained on massive data and refined through fine‑tuning and RLHF, offering a unified model for generation, editing, and understanding.

AI architectureGPT-4oVQ-VAE
0 likes · 17 min read
The Magic of GPT‑4o: Technical Overview and Speculated Architecture
Architects' Tech Alliance
Architects' Tech Alliance
Feb 24, 2025 · Artificial Intelligence

NSA: Hardware‑Optimized Sparse Attention Mechanism from DeepSeek, Peking University and University of Washington

The NSA mechanism introduces a three‑branch hardware‑optimized sparse attention architecture—token compression, token selection, and sliding window—combined with learnable gating to balance global and local context, dramatically improving inference speed and efficiency for long‑context large language models.

AI architectureDeepSeekSparse Attention
0 likes · 5 min read
NSA: Hardware‑Optimized Sparse Attention Mechanism from DeepSeek, Peking University and University of Washington
Architect
Architect
Feb 16, 2025 · Artificial Intelligence

DeepSeek-V3, DeepSeek-R1, and Janus‑Pro: Architecture, Training Techniques, and Performance Insights

This article provides an in‑depth technical overview of DeepSeek‑V3, DeepSeek‑R1 and Janus‑Pro models, covering their Mixture‑of‑Experts architecture, novel MLA attention, auxiliary‑loss‑free load balancing, multi‑token prediction, FP8 mixed‑precision training, efficient cross‑node communication, reinforcement‑learning pipelines, multimodal modeling strategies, performance comparisons, cost statistics, and current limitations.

AI architectureDeepSeek-V3FP8 Training
0 likes · 18 min read
DeepSeek-V3, DeepSeek-R1, and Janus‑Pro: Architecture, Training Techniques, and Performance Insights
Airbnb Technology Team
Airbnb Technology Team
Dec 12, 2024 · Artificial Intelligence

Airbnb Automation Platform v2: Enabling LLM‑Driven Conversational AI

Airbnb’s Automation Platform v2 replaces the rigid, workflow‑driven architecture of v1 with an LLM‑centric design that orchestrates context gathering, chain‑of‑thought reasoning, tool execution, and guardrails, enabling more natural, scalable, and safe conversational AI while preserving the reliability of traditional workflows.

AI architectureAirbnbAutomation Platform
0 likes · 11 min read
Airbnb Automation Platform v2: Enabling LLM‑Driven Conversational AI
DataFunSummit
DataFunSummit
Nov 27, 2024 · Artificial Intelligence

Applying Large Language Models in Data Management and Risk Control at Ping An One Wallet

This presentation details how Ping An One Wallet leverages large language models across five key areas—current application status, data management, risk control, technical architecture, and a Q&A session—highlighting strategies such as vectorized rule storage, prompt engineering, RAG enhancements, and workflow agents to improve efficiency and accuracy in data governance and fraud detection.

AI architecturePrompt Engineeringdata governance
0 likes · 16 min read
Applying Large Language Models in Data Management and Risk Control at Ping An One Wallet
Tencent Docs Tech Team
Tencent Docs Tech Team
Nov 13, 2024 · Artificial Intelligence

Technical Architecture and Practices of the AI Document Assistant

This article explores the challenges large language models bring to efficiency tools, outlines the AI document assistant's technical thinking and architecture, and details both application‑side and model‑side practices such as retrieval‑augmented generation, intent recognition, and code‑driven table handling, concluding with key lessons.

AIAI architectureDocument Automation
0 likes · 16 min read
Technical Architecture and Practices of the AI Document Assistant
DataFunTalk
DataFunTalk
Mar 21, 2024 · Artificial Intelligence

A Detailed Technical Analysis of Sora: Architecture, Key Components, and Potential Implementation

This article provides a comprehensive, easy‑to‑understand breakdown of Sora’s possible architecture—including its visual encoder‑decoder, Spacetime Latent Patch, transformer‑based diffusion model, long‑time consistency strategies, training techniques, and how it supports variable resolution and duration video generation.

AI architectureSoraSpacetime Patch
0 likes · 49 min read
A Detailed Technical Analysis of Sora: Architecture, Key Components, and Potential Implementation
JD Retail Technology
JD Retail Technology
Feb 28, 2024 · Artificial Intelligence

Edge AI at JD Retail: Architecture, Challenges, and Business Practices

This article details JD Retail's edge AI (on‑device intelligence) platform, covering its definition, performance and security challenges, three‑layer cloud‑edge‑device architecture, key components such as high‑performance inference engine, data pipeline, Python VM container, and real‑world applications in traffic distribution and image recognition.

AI architectureJD RetailOn‑device Intelligence
0 likes · 15 min read
Edge AI at JD Retail: Architecture, Challenges, and Business Practices
HomeTech
HomeTech
Jul 26, 2023 · Artificial Intelligence

Practical Implementation of ChatGPT Technology Products: Architecture, Prompt Engineering, and Future Challenges

This article explores the practical deployment of ChatGPT‑based products, detailing the model fundamentals, technical architecture, engineering‑focused prompt design, real‑world application scenarios, and the challenges of model generalization, resource consumption, data privacy, interpretability, and ethical considerations.

AI architectureChatGPTJava
0 likes · 15 min read
Practical Implementation of ChatGPT Technology Products: Architecture, Prompt Engineering, and Future Challenges
DataFunSummit
DataFunSummit
Mar 30, 2023 · Artificial Intelligence

An Overview of ChatGPT’s Software Architecture and Technology Stack

The article examines ChatGPT’s underlying software architecture, detailing its cloud deployment on AWS and Azure, database choices like PostgreSQL and Redis, front‑end technologies such as TypeScript and React, core AI frameworks including PyTorch and Triton, as well as its container orchestration, monitoring, and programming language ecosystem.

AI architectureChatGPTKubernetes
0 likes · 6 min read
An Overview of ChatGPT’s Software Architecture and Technology Stack