Tagged articles
Efficient Ops
Oct 24, 2024 · Operations

How Migu’s AI‑Powered Observability Boosts Cloud Gaming Operations

During the 24th GOPS Global Operations Conference, Migu Interactive Entertainment’s Vice President Su Yi discussed how the company’s AI‑driven AIOps observability framework, validated against ITU standards, enhances cloud gaming platform stability, accelerates issue detection, and supports China Mobile’s 5G‑based digital transformation.

Tags: AI, Digital Transformation, Observability
19 min read

System Architect Go
Oct 24, 2024 · Artificial Intelligence

How to Fine‑Tune Translation Models on Kubernetes Docs with LoRA

This article walks through the complete process of fine‑tuning both domain‑specific and large‑language translation models on Kubernetes documentation, covering data preparation, model selection, training configurations, the differences between Seq2Seq and CausalLM, and how LoRA can dramatically reduce resource usage while improving performance.

Tags: AI, Fine-tuning, LLM
7 min read

Efficient Ops
Oct 23, 2024 · Databases

How NineData Boosts R&D Collaboration 5× with Multi‑Cloud Database Management

The NineData presentation at the 2024 GOPS Global Operations Conference in Shanghai detailed multi‑cloud, multi‑source database architecture trends, showcased their intelligent data management platform, explained data replication principles, DevOps challenges and AI‑enhanced solutions, and highlighted real‑world customer success stories across industries.

Tags: AI, Cloud Native, DevOps
11 min read

DaTaobao Tech
Oct 23, 2024 · Artificial Intelligence

Retrieval-Augmented Generation (RAG): Principles, Applications, Limitations and Challenges

Retrieval-Augmented Generation (RAG) combines a retriever that fetches relevant external documents and a generator that uses them, improving LLM accuracy, relevance, privacy, and up-to-date information, but faces challenges such as retrieval latency, computational cost, chunking strategies, embedding selection, and system integration complexity.

Tags: AI, Knowledge Retrieval, LLM
13 min read

Baidu Geek Talk
Oct 23, 2024 · Artificial Intelligence

Integrating Yuan 2.0 Large Model with PaddleNLP: Overview, Usage Steps, and Interaction Examples

The open‑source Yuan 2.0 large model is fully integrated into Baidu’s PaddleNLP, offering quick inference for tasks like code generation, translation, and reasoning, along with efficient distributed training and fine‑tuning features such as Zero Padding optimization, enabling developers to easily deploy and customize the model via simple setup steps and example interactions.

Tags: AI, LLM, PaddleNLP
10 min read

58UXD
Oct 22, 2024 · Artificial Intelligence

Boost Webtoon Production: How AI Powers Fast Comic Creation

This article explains how AI tools like GPT and Midjourney can streamline the entire webtoon creation process—from extracting core policy content to generating high‑quality comic panels—showing a complete workflow that reduces production time from weeks to days.

Tags: AI, Midjourney, Webtoon
8 min read

DataFunSummit
Oct 21, 2024 · Artificial Intelligence

Retrieval‑Augmented Generation (RAG) for Office Applications: Architecture, Challenges, and Practices

This article introduces Retrieval‑Augmented Generation (RAG) as a solution to the hallucination, freshness, and data‑privacy issues of large language models, details its modular architecture, explains the layered system design and hybrid retrieval pipeline, and shares the practical challenges and engineering tricks encountered when deploying RAG in enterprise office scenarios.

Tags: AI, Hybrid Retrieval, RAG
19 min read

OPPO Amber Lab
Oct 21, 2024 · Information Security

How OPPO’s AI Private Computing Cloud Secures Your Data End‑to‑End

OPPO’s AI Private Computing Cloud leverages hardware‑based TEE, end‑to‑end encryption, and trusted sandbox technologies to protect user data across both cloud and device, while its terminal AI confidential computing system and new Security & Privacy Trust Center provide certified, high‑assurance privacy safeguards for AI‑driven applications.

Tags: AI, Confidential Computing, Mobile Security
10 min read

Baidu Intelligent Cloud Tech Hub
Oct 21, 2024 · Big Data

How Baidu’s Data Lake Acceleration 2.0 Supercharges Big Data and AI Workloads

Baidu's latest data lake acceleration 2.0 replaces HDFS with a scalable object‑storage foundation, introduces a hierarchical Namespace 2.0, a high‑throughput streaming engine, RapidFS caching, and a fully HDFS‑compatible BOS‑HDFS layer, delivering up to 70% higher throughput and dramatically lower costs for big data and AI pipelines.

Tags: AI, Cloud Native, object storage
12 min read

Efficient Ops
Oct 19, 2024 · Operations

How Migu’s Cloud Gaming Platform Achieved Leading AIOps Observability Standards

An interview with Migu Interactive Entertainment reveals how its cloud gaming platform leveraged AI, 5G, and standardized observability practices to pass both international and domestic AIOps assessments, highlighting the strategic importance of intelligent operations for business continuity in complex, distributed systems.

Tags: AI, Digital Transformation, Intelligent Operations
17 min read

DaTaobao Tech
Oct 18, 2024 · Artificial Intelligence

Taobao AI Virtual Try-On: Offline Data Processing and Performance Optimization

Taobao’s AI virtual‑try‑on system pre‑computes fitting results offline, writes them into the Item Center via scalable ScheduleX tasks, optimizes pagination, locking and flow‑control, and thereby processes millions of apparel items in under thirty minutes with 99.9% success and reliable checkpoint‑resume monitoring.

Tags: AI, Big Data, offline processing
16 min read

Meituan Technology Team
Oct 17, 2024 · Artificial Intelligence

Meituan Robotics Research Institute 2024 Call for Research Proposals

The Meituan Robotics Research Institute (MARS) invites full‑time university scholars and researchers to submit independent research proposals for 2024 projects on topics chosen from a predefined list. Meituan and external experts evaluate proposals on novelty, business value, and feasibility. Selected projects are eligible for up to ¥200,000 in funding, on‑site interns, fast‑track graduate hiring, and de‑identified data. Applications are due 10 Nov 2024, and projects start in Dec 2024 or Jan 2025.

Tags: AI, Research Funding, drone
4 min read

Alimama Tech
Oct 17, 2024 · Artificial Intelligence

FLUX ControlNet Inpainting and 8-Step Turbo Acceleration Models

Alimama’s Intelligent Creation team has open‑sourced a FLUX‑based ControlNet inpainting model that leverages a DiT‑backed Interleave design for superior repair quality, and an 8‑step LoRA‑Turbo model that cuts inference time three‑fold while preserving near‑original image fidelity, both now available on Hugging Face and ModelScope.

Tags: AI, ControlNet, Flux
9 min read

Baidu Tech Salon
Oct 17, 2024 · Artificial Intelligence

How to Deploy Yuan 2.0 LLM with PaddleNLP: A Step‑by‑Step Guide

This article explains how the open‑source Yuan 2.0 large language model is fully integrated with Baidu’s PaddleNLP, covering its capabilities, fine‑tuning optimizations, step‑by‑step deployment instructions, interaction examples, and training/finetuning results with loss‑curve visualizations.

Tags: AI, Distributed Training, Fine-tuning
10 min read

Wukong Talks Architecture
Oct 17, 2024 · Operations

A Retrospective on DevOps System Design and Platform Engineering (2008‑2022)

From 2008 onward, the author chronicles the development of multiple DevOps systems, examining their origins, design choices, challenges, and evolution—including CI tools like CruiseControl, Hudson, Jenkins, custom plugins, metrics, platform engineering, and the impact of AI—offering insights for modern continuous integration and delivery practices.

Tags: AI, DevOps, Jenkins
34 min read

Model Perspective
Oct 17, 2024 · Artificial Intelligence

Visualizing How Neural Networks Approximate Any Function

This article explains the universal approximation theorem, showing how even a simple neural network with one hidden layer can approximate any continuous function by adjusting weights and biases, and illustrates the process with visual examples of step and bump functions, linking theory to recent Nobel recognitions.

Tags: AI, Neural Networks, function approximation
9 min read

AntData
Oct 16, 2024 · Artificial Intelligence

Building a Data Assistant Application with DB‑GPT V0.6.0

This tutorial walks through the end‑to‑end process of creating a data‑assistant application using DB‑GPT V0.6.0, covering prerequisite deployment, knowledge‑base construction, sub‑agent creation, RAG‑based QA, AWEL workflow installation, intent‑recognition knowledge base, and unified multi‑agent orchestration.

Tags: AI, DB-GPT, Data Assistant
12 min read

AntTech
Oct 16, 2024 · Artificial Intelligence

CNCC2024 Forum: Industry‑Academia Collaboration for Scientific Exploration and the CCF‑Ant Research Fund Release

The CNCC2024 forum held on October 24‑26 in Hangzhou showcases industry‑academia‑research integration through keynote speeches, speaker introductions, and detailed abstracts on topics such as AI, privacy computing, graph machine learning, and elastic cloud scheduling, while announcing the CCF‑Ant Research Fund and related initiatives.

Tags: AI, elastic scheduling, graph-ml
10 min read

21CTO
Oct 15, 2024 · Artificial Intelligence

Why Mojo Could Redefine AI Programming: Insights from Chris Lattner

The article explores Chris Lattner’s vision for Mojo—a Python‑compatible language designed for AI, GPU, and accelerator workloads—detailing its performance claims, SIMD support, complex‑number handling, and the growing developer community behind it.

Tags: AI, GPU, Mojo
9 min read

AntTech
Oct 15, 2024 · Artificial Intelligence

AI Large Model Technology Exploration and Application Forum (CNCC2024)

The AI Large Model Technology Exploration and Application Forum, held on October 24‑26, 2024 in Hengdian, Zhejiang, gathers leading experts from Ant Group, universities and research institutes to discuss challenges, knowledge enhancement, data infrastructure, diffusion models, multimodal and medical large models through a series of keynote talks and panel sessions.

Tags: AI, Large Language Models, conference
12 min read

JD Retail Technology
Oct 15, 2024 · Artificial Intelligence

Large‑Model‑Driven Evolution of E‑commerce Search and Recommendation at JD Retail

The article examines how large language models are reshaping JD Retail's e‑commerce search and recommendation pipelines, detailing industry evolution, technical challenges such as knowledge hallucination, intent understanding, personalization, cost, and safety, and presenting JD's end‑to‑end AIGC architecture, data preprocessing, alignment, evaluation, and next‑generation AI search solutions.

Tags: AI, Multimodal, e‑commerce
36 min read

DaTaobao Tech
Oct 14, 2024 · Artificial Intelligence

MNN Stable Diffusion: On‑Device Deployment and Performance Optimizations

The article presents Alibaba’s open‑source MNN inference engine, demonstrating how quantization, operator fusion (including fused multi‑head attention, GroupNorm/SplitGeLU, Winograd convolutions), optimized GEMM and memory‑paging enable on‑device Stable Diffusion with 1‑second‑per‑step performance on Snapdragon 8 Gen3 and Apple M3 GPUs, and outlines future speed‑up directions.

Tags: AI, MNN, Stable Diffusion
11 min read

Baidu Intelligent Cloud Tech Hub
Oct 14, 2024 · Databases

How Baidu’s New Cloud‑Native Databases Power Enterprise AI in 2024

At the 2024 Baidu Cloud Summit, the speaker detailed recent breakthroughs across Baidu’s cloud‑native database suite—including PegaDB KV, GaiaDB relational, VDB vector, and the integrated DBSC, EDAP, and DBStack platforms—highlighting performance, cost, scalability, and AI‑ready features that address enterprise data challenges.

Tags: AI, Big Data, Enterprise Data
11 min read

Java Tech Enthusiast
Oct 13, 2024 · Industry Insights

China’s ‘Luo Bo’ Pushes the First Global AI‑Powered Autonomous Driving Platform

From the 1925 driverless “American Wonder” to today’s AI‑driven robotaxi wars, the article traces the historic roots, recent breakthroughs by Waymo, Tesla and Baidu, and analyzes China’s Luo Bo platform, market forecasts, competitive dynamics, and the strategic challenges facing the global autonomous‑driving industry.

Tags: AI, China, Market Analysis
12 min read

JD Tech
Oct 13, 2024 · Artificial Intelligence

Building a Simple Local AI Question‑Answer System with Java, LangChain4J, Ollama, and ChromaDB

This article guides readers through the concepts of large language models, embeddings, vector databases, and Retrieval‑Augmented Generation, then demonstrates step‑by‑step how to set up Ollama, install a local Chroma vector store, configure Maven dependencies, and write Java code using LangChain4J to build and test a functional AI Q&A application.

Tags: AI, LLM, LangChain4j
22 min read

Architects' Tech Alliance
Oct 11, 2024 · Industry Insights

Why Common Network Misconceptions Hurt AI Performance and How to Fix Them

The article explains how prevalent misunderstandings in data‑center network design—such as altering end‑to‑end link speeds, overlooking switch radix, and choosing inappropriate buffering architectures—can increase latency and reduce AI workload efficiency, and it outlines the benefits of InfiniBand, cut‑through switching, scalable radix, and resilient AI‑cloud management solutions.

Tags: AI, Buffer Architecture, Cut-through Switching
9 min read

Xiaohongshu Tech REDtech
Oct 11, 2024 · Artificial Intelligence

Harmonized Speculative Sampling (HASS): Aligning Training and Decoding for Efficient Large Language Model Inference

HASS aligns training and decoding contexts and objectives for speculative sampling, using harmonized objective distillation and multi-step context alignment, achieving 2.81–4.05× speedup and 8%–20% improvement over EAGLE‑2 while preserving generation quality in real-world deployments at Xiaohongshu.

Tags: AI, HASS, Inference Acceleration
11 min read

DataFunTalk
Oct 11, 2024 · Artificial Intelligence

E‑commerce Innovation and Data Governance: Summaries of Recent Research Topics

This article compiles concise overviews of recent e‑commerce research, covering real‑time online learning re‑ranking models, causal inference for user growth, full‑link data lineage, TikTok's data governance and attribution solutions, Volcano Engine's metric management, AI Agent applications on 1688, and XinXuan Group's live‑stream data architecture.

Tags: AI, Data Governance, Data Lineage
5 min read

21CTO
Oct 10, 2024 · Artificial Intelligence

5 Practical AI Projects to Build Your Skills with Python

This article presents five hands‑on AI project ideas—from resume optimization to multimodal search—complete with step‑by‑step instructions, required Python libraries, and code snippets, helping beginners and intermediate developers quickly build valuable AI applications.

Tags: AI, LLM, Python
12 min read

Architect
Oct 10, 2024 · Artificial Intelligence

Algorithmic Practices for Meituan's Intelligent Content Distribution

This article summarizes Meituan's content search system, detailing the challenges of heterogeneous, high‑frequency local content, the multi‑modal tagging and representation pipeline, recall and ranking optimizations, satisfaction metrics, multi‑objective fusion, heterogeneous mixing, and future directions for improving user experience in local life services.

Tags: AI, Meituan, content search
18 min read

Java Tech Enthusiast
Oct 10, 2024 · Artificial Intelligence

Google Rehires AI Pioneer Noam Shazeer for Gemini Development

Google has signed a $2.7 billion agreement to rehire AI pioneer Noam Shazeer—co‑author of the seminal “Attention is All You Need” paper and creator of the Meena chatbot—bringing him back from his Character.AI venture to serve as vice president overseeing the Gemini generative‑AI project alongside DeepMind leaders, thereby bolstering Google’s competitive edge in the field.

Tags: AI, Character AI, Gemini
8 min read

DaTaobao Tech
Oct 9, 2024 · Artificial Intelligence

Building a Vertical Domain QA Bot with Vector Search, RAG, and SFT

This guide walks entry‑level developers through building a logistics‑focused QA bot by first embedding documents for vector similarity search, then adding retrieval‑augmented generation, fine‑tuning a small model, integrating hybrid checks, and optimizing deployment with feedback loops to achieve fast, accurate, out‑of‑scope‑aware answers.

Tags: AI, Chatbot, Fine-tuning
15 min read

JD Retail Technology
Oct 9, 2024 · Frontend Development

AIGCDesign: Open-Source Cross‑Platform AI Component Solution – Design, Architecture, and Implementation

The article introduces AIGCDesign, an open‑source cross‑platform AI component solution that combines traditional frontend libraries with large‑language‑model capabilities, outlines its design goals, technical architecture, lifecycle hooks, configuration examples, multi‑framework support, and real‑world business integration cases.

Tags: AI, AIGC, Component Library
12 min read

JD Tech Talk
Oct 8, 2024 · Artificial Intelligence

Building a Retrieval‑Augmented Generation (RAG) System with Rust and Qdrant

This article explains how to construct a Retrieval‑Augmented Generation pipeline in Rust, covering knowledge‑base creation with Qdrant, model loading and embedding using the candle library, data ingestion, and integration of a Rust‑based inference service based on mistral.rs, while also discussing resource usage and common pitfalls.

Tags: AI, Embedding, LLM
16 min read

Architects' Tech Alliance
Oct 7, 2024 · Industry Insights

What AMD Unveiled at Computex 2024: Zen 5, XDNA NPU, Ryzen 9000 and AI‑Focused Innovations

At Computex 2024, AMD showcased its latest CPU, GPU, and AI‑accelerated technologies, including the high‑performance Zen 5 core, the second‑generation XDNA NPU with 50 TOPS, the Ryzen 9000 consumer processors, the AI‑PC Strix Point platform, Versal AI Edge Gen 2, the upcoming MI‑series AI GPUs, and the new UALink interconnect, and outlined the company’s roadmap for next‑generation computing and AI workloads.

Tags: AI, AMD, CPU
5 min read

Baobao Algorithm Notes
Oct 7, 2024 · Artificial Intelligence

Mastering LLM Supervised Fine‑Tuning: Practical Tips, Data Strategies, and Debugging

This article provides a comprehensive, experience‑driven guide to supervised fine‑tuning (SFT) of large language models, covering special tokens, latency considerations, data diversity and production, training frameworks and hyper‑parameters, over‑/under‑fitting diagnostics, and evaluation metrics such as helpfulness, honesty, and harmlessness.

Tags: AI, LLM, SFT
40 min read

JavaEdge
Oct 2, 2024 · Artificial Intelligence

Boost RAG Retrieval Accuracy with Contextual Embeddings and BM25

This article presents a contextual retrieval technique that combines contextual embeddings and contextual BM25 to reduce RAG miss rates by up to 67%, explains the underlying methods, implementation steps, cost considerations, experimental results, and practical deployment guidance.

Tags: AI, BM25, Contextual Retrieval
17 min read

DataFunTalk
Oct 1, 2024 · Artificial Intelligence

From Early AI to Superintelligence: Challenges and Prospects

The article reviews the evolution of artificial intelligence from early statistical models through deep learning and Transformer architectures, examines current breakthroughs like multimodal models, and discusses the technical, computational, and safety challenges that must be overcome before achieving artificial superintelligence (ASI).

Tags: AI, Artificial Intelligence, Multimodal
8 min read

Java Tech Enthusiast
Sep 30, 2024 · Artificial Intelligence

The AI Smile Curve: Profit Distribution and Future Outlook

The AI industry’s profit landscape mirrors a smile curve, with upstream GPU manufacturers and downstream application developers capturing most returns while costly large‑model R&D yields low margins, prompting predictions of GPU valuation corrections, a push for consumer‑facing killer apps, and massive application turnover through creative destruction.

Tags: AI, GPU, Industry Analysis
11 min read

JD Tech Talk
Sep 30, 2024 · Artificial Intelligence

Yunli XiaoZhi: An AI‑Powered Intelligent Assistant for Knowledge Q&A and Data Analysis in Logistics Operations

The document describes the design, implementation, and operational results of Yunli XiaoZhi, an AI‑driven portable knowledge‑base and data‑analysis chatbot that consolidates SOPs, manuals, and real‑time information for logistics staff, using LangChain‑based RAG, vector databases, and large‑model prompting to improve query efficiency, proactive alerts, and reporting across multiple user groups.

Tags: AI, Chatbot, Knowledge Base
19 min read

Code Mala Tang
Sep 30, 2024 · Artificial Intelligence

Will AI Assistants Erode Our Core Professional Skills?

Amid rapid AI adoption across programming, design, law, and medicine, experts warn that over‑reliance on AI assistants may erode foundational expertise, urging professionals to balance short‑term efficiency gains with sustained skill development to remain indispensable in an AI‑augmented future.

Tags: AI, Professional Skills, Skill development
9 min read

Architects' Tech Alliance
Sep 29, 2024 · Industry Insights

Why Super‑Heterogeneous Computing Is the Next Frontier in Computing Architecture

The article analyzes the limits of the von Neumann model and Moore's law, explains how instruction set complexity defines processor categories, and argues that integrating CPUs, GPUs, FPGAs, DPUs and ASICs into a super‑heterogeneous ecosystem—driven by Intel, NVIDIA, ARM and emerging trends—will shape the future of computing through diverse workloads, AI demand, green efficiency and a global compute network by 2030.

Tags: AI, ARM, CPU
12 min read

Alibaba Cloud Infrastructure
Sep 29, 2024 · Cloud Native

Building a Production‑Grade Observability System for Alibaba Cloud ACK Container Service

The presentation outlines Alibaba Cloud's ACK container service observability framework, covering its architecture, key capabilities such as eBPF‑based tracing, GPU profiling, network diagnostics, storage monitoring, and FinOps integration, and demonstrates how these features support AI workloads, large‑scale production stability, and automated incident response.

Tags: AI, Cloud Native, Container Service
15 min read

JD Cloud Developers
Sep 29, 2024 · Artificial Intelligence

Build a Local AI Q&A System with Java, Ollama, and LangChain4J

This article walks through building a local AI question‑answer system using Java, Ollama, LangChain4J, embeddings, and a Chroma vector database, covering LLM fundamentals, embedding techniques, RAG architecture, setup steps, Maven dependencies, and sample code to retrieve and answer queries.

Tags: AI, Embedding, LLM
19 min read

DataFunSummit
Sep 28, 2024 · Artificial Intelligence

Seat Copilot: Design, Large‑Model Architecture, and Business Impact in Financial Services

This article introduces the Seat Copilot developed by Qifu Technology, explains its composition, design, and core large‑model architecture, details data engineering, training and evaluation processes, and presents quantitative results showing improvements in operator efficiency, conversion rates, and management productivity.

Tags: AI, call center automation, financial technology
18 min read

Baidu Tech Salon
Sep 27, 2024 · Artificial Intelligence

PaddleScience Presents Geometry-Informed Neural Operator Model for Aerodynamic Drag Prediction at 2024 CAE Aerodynamics Branch Academic Annual Meeting

At the 2024 CAE Aerodynamics Branch Academic Annual Meeting, PaddleScience unveiled a geometry‑informed neural operator model that predicts automotive drag with 2.1% average error, runs three orders of magnitude faster than full CFD, earned an Outstanding Paper Award, and showcases the toolkit’s AI‑driven workflow for rapid vehicle‑shape optimization.

Tags: AI, CFD, Drag Prediction
7 min read

Model Perspective
Sep 27, 2024 · Artificial Intelligence

Modeling Everyday Learning: From Reinforcement to Social Learning

The article explores how everyday decision‑making can be modeled using reinforcement learning and social learning frameworks, illustrating their strengths, limitations, and combined insights for understanding individual and collective behavior.

Tags: AI, Reinforcement Learning, behavioral modeling
8 min read

AntData
Sep 26, 2024 · Artificial Intelligence

DB-GPT: Open-Source AI-Native Data Application Development Framework

DB‑GPT is an open‑source AI‑native data‑application framework that provides multi‑model management, Text‑to‑SQL optimization, RAG, multi‑agent collaboration, and intelligent workflow orchestration, enabling developers to build scalable large‑model database applications, with proven enterprise adoption, community growth, and academic publications.

Tags: AI, Large Language Models, RAG
6 min read

21CTO
Sep 25, 2024 · Artificial Intelligence

Reid Hoffman on Generative AI’s Future, Human Collaboration & Ethics

In this extensive interview, Reid Hoffman reflects on his entrepreneurial journey, the rise of LinkedIn, venture investing, the rapid evolution of generative AI, human‑AI collaboration, scaling laws, data ownership, copyright, AI governance, and the broader societal impact of intelligent agents.

Tags: AI, Ethics, Technology Governance
48 min read

DaTaobao Tech
Sep 25, 2024 · Artificial Intelligence

Consistent Style Generation in AIGC: Style Aligned and Story Diffusion

The article reviews two AIGC techniques—Style Aligned, which shares self‑attention across a batch to keep style consistent, and Story Diffusion, which uses a training‑free Consistent Self‑Attention module followed by a transformer to generate coherent image sequences—showing promising results in home‑decoration scenarios while noting remaining challenges in fine‑grained spatial and detail alignment.

Tags: AI, AIGC, Consistent Self-Attention
5 min read

Architects' Tech Alliance
Sep 25, 2024 · Fundamentals

NVIDIA Quantum‑2 InfiniBand Platform: Technical Overview, Q&A, and Deployment Guidance

This article explains the growing demand for high‑performance computing, introduces NVIDIA's Quantum‑2 InfiniBand platform with its high‑speed, low‑latency capabilities, provides a curated list of related technical articles, and offers an extensive Q&A covering compatibility, cabling, UFM, PCIe limits, and best‑practice deployment for AI and HPC workloads.

Tags: AI, GPU, InfiniBand
11 min read

Data Thinking Notes
Sep 24, 2024 · Artificial Intelligence

Leveraging Large Models to Transform Data Governance: Quality, Cost, Efficiency

This article explains how large language models enhance data governance by improving data quality, reducing implementation costs, and increasing operational efficiency through knowledge bases and interactive prompt libraries, and it also outlines practical empowerment pathways for organizations seeking to leverage AI-driven analytics.

AI · Cost reduction · Data Governance
0 likes · 3 min read
Xiaohongshu Tech REDtech
Sep 23, 2024 · Artificial Intelligence

AlignRec: A Joint Training Framework for Aligning Multimodal Representations with Personalized Recommendation

AlignRec is a joint‑training framework that synchronizes multimodal encoders with personalized recommendation models through a staged alignment strategy and three specialized loss functions, preserving both content and ID signals, and achieving state‑of‑the‑art performance on multiple datasets while releasing superior Amazon multimodal features.

AI · Evaluation Metrics · joint training
0 likes · 11 min read
JD Tech Talk
Sep 23, 2024 · Artificial Intelligence

JD Advertising R&D: AI‑Driven Solutions for Traffic Valuation, Multimodal Understanding, Auction Mechanisms, Generative Recommendation, and Large‑Model Engineering

The JD Advertising R&D team applies cutting‑edge AI techniques—including query intent models, multimodal representation pipelines, reinforcement‑learning‑based auction mechanisms, generative recommendation with quantized product tokens, and large‑model infrastructure—to boost traffic valuation, ad relevance, revenue, and creative generation across the platform.

AI · Advertising · Multimodal
0 likes · 19 min read
58 Tech
Sep 23, 2024 · Artificial Intelligence

Enhancing Commercial Search with Knowledge Graphs and Large‑Model Techniques

This article describes how a commercial search platform iteratively upgrades its system by structuring business knowledge into a knowledge graph, applying multi‑stage entity extraction (CRF, Electra‑CRF, GLM‑3, OCR), and leveraging large language models to improve relevance, user experience, and revenue.

AI · NLP · commercial search
0 likes · 14 min read
DataFunSummit
Sep 22, 2024 · Artificial Intelligence

Large Language Models for Intelligent Financial Report Writing: Applications, Implementation, and Future Outlook

This article examines how large language models are currently applied to financial report creation, outlines their technical implementation and challenges, and explores future directions such as multimodal data fusion, personalization, and lightweight deployment on consumer devices.

AI · Document Automation · financial reporting
0 likes · 12 min read
AntTech
Sep 21, 2024 · Artificial Intelligence

Insights from the 2024 Inclusion·Bund Conference: From Data for AI to AI for Data

The 2024 Inclusion·Bund conference brought together academia and industry leaders to discuss how data technologies are evolving and aligning with AI, covering trends in large‑model storage, synthetic data generation, AI‑enhanced databases, and Ant Group's emerging AI‑centric data ecosystem.

AI · AI Alignment · data strategy
0 likes · 7 min read
DataFunSummit
Sep 20, 2024 · Artificial Intelligence

Exploring and Applying Large Language Models in Recommendation Systems

Professor Wang Yichao from Huawei Noah's Ark Lab presents a comprehensive exploration of large language models in recommendation systems, covering background, challenges, two key projects (LLM4Rec and Uni-CTR), experimental results, and future directions for open, knowledge‑enhanced, generative recommendation pipelines.

AI · LLM4Rec · Uni-CTR
0 likes · 13 min read
Senior Brother's Insights
Sep 19, 2024 · Artificial Intelligence

Rule Engines vs AI Models: Choosing the Right Approach for Product Logic

The article compares traditional rule‑engine architectures with AI‑driven models, explains their differing characteristics, outlines when deterministic rule matching is preferable over flexible AI inference, and recommends practical technologies such as Drools for rule‑based solutions and LLM‑based RAG/Agent frameworks for AI‑centric scenarios.

AI · Drools · LLM
0 likes · 9 min read
JavaEdge
Sep 19, 2024 · Artificial Intelligence

Unlock Java LLM Power: A Deep Dive into LangChain4j Features and Architecture

LangChain4j streamlines the integration of large language models into Java applications by offering a standardized API, extensive support for over a dozen LLM providers and vector stores, a rich toolbox for RAG, chat memory, and tool calling, plus two abstraction layers that cater to both low‑level control and high‑level convenience.

AI · Integration · LLM
0 likes · 10 min read
Alibaba Cloud Developer
Sep 19, 2024 · Artificial Intelligence

How Generative AI Will Transform the Physical World – Alibaba Cloud 2024 Outlook

At the 2024 Cloud Expo, Alibaba Cloud’s CEO Wu Yongming highlighted the rapid evolution of generative AI over the past 22 months, emphasizing its shift from mobile apps to reshaping both digital and physical realms, the exponential drop in inference costs, and the accelerating demand for AI‑driven computing infrastructure.

AI · Digital Transformation · generative AI
0 likes · 9 min read
DataFunSummit
Sep 18, 2024 · Artificial Intelligence

Multi‑Scenario Modeling for NetEase Cloud Music Recommendation: Architecture, Challenges, and Results

This article presents NetEase Cloud Music's multi‑scenario recommendation modeling work, covering background, overall system architecture, key modules such as unified and private domain networks, modeling objectives and difficulties, experimental results, future outlook, and a detailed Q&A session.

AI · NetEase Cloud Music · large-scale systems
0 likes · 13 min read
AntTech
Sep 18, 2024 · Artificial Intelligence

2024 Inclusion·Bund Conference: Insights Forum on AI Era Creativity

The 2024 Inclusion·Bund Conference hosted a multidisciplinary forum on AI-era creativity, featuring scholars from Fudan University, East China Normal University, Tongji University, and iFlytek who presented reports on AI's impact on culture, ethics, and creative practice, followed by a round‑table on human‑AI symbiosis.

AI · ART · Ethics
0 likes · 6 min read
Bilibili Tech
Sep 18, 2024 · Artificial Intelligence

Index-1.9B-32K: A 2% GPT-Size Model with Powerful Long-Context Capabilities

Index-1.9B-32K is a 1.9B-parameter model with a 32K token context window, achieving strong long‑text performance comparable to larger models while using only about 2% of GPT‑4’s compute, trained via long pre‑training and supervised fine‑tuning, with a trade‑off of reduced short‑context ability.

AI · Evaluation · Fine-tuning
0 likes · 12 min read
phodal
Sep 18, 2024 · Industry Insights

How AI Agents Could Revolutionize Software Development with Shire

The article examines the challenges of applying generative AI to real‑world software development, proposes a collective‑wisdom Copilot that blends team knowledge, AI agents, and IDE integration, and details how Shire’s new version enables sharing, installing, and executing AI‑driven agents across diverse development workflows.

AI · IDE · Shire
0 likes · 11 min read
AntTech
Sep 16, 2024 · Artificial Intelligence

Opportunities and Challenges in the Era of Large Models: Technology Integration and Industry Leap

In his keynote at the 2024 Inclusion·Bund Conference, HKUST Board Chair Shen Xiangyang discusses how large‑model AI reshapes human‑computer interaction, introduces the concept of Intelligent Augmentation, emphasizes responsible AI governance, and outlines the practical steps needed to deploy AI agents in industry.

AI · AI Governance · Human-Computer Interaction
0 likes · 4 min read
Architect's Alchemy Furnace
Sep 16, 2024 · Artificial Intelligence

Why Transformers Revolutionize AI: From Basics to Advanced Applications

This article explains what AI Transformers are, why they matter, their key components and mechanisms, various applications ranging from language processing to bioinformatics, and how they differ from traditional neural networks, providing a comprehensive overview of Transformer architecture and its impact on modern AI research.

AI · Deep Learning · Self-Attention
0 likes · 20 min read
21CTO
Sep 15, 2024 · Artificial Intelligence

Will Generative AI Follow the Internet Bubble? Insights from MongoDB’s CEO

MongoDB CEO Dev Ittycheria likens today’s generative AI surge to the early internet era, outlining three AI use cases—chatbots, research summarization, and automation—while warning of potential hype cycles and urging businesses to seek sustainable commercial models.

AI · Business strategy · Chatbots
0 likes · 6 min read
AntTech
Sep 15, 2024 · Artificial Intelligence

Dr. Wang Jian’s Keynote on AI, AI+, and AI Infrastructure at the 2024 Inclusion·Bund Conference

In his 2024 Inclusion·Bund Conference keynote, Dr. Wang Jian traces the short yet intense history of artificial intelligence, explains the emergence of AI+, discusses the pivotal role of transformer‑based models and AI infrastructure, and reflects on how cloud computing and innovative business models are reshaping the AI ecosystem.

AI · AI+ · Cloud Computing
0 likes · 16 min read
AntTech
Sep 12, 2024 · Artificial Intelligence

Knowledge‑Enhanced Large Model Service Framework (KAG): Integrating Knowledge Graphs with LLMs for Vertical Domain Applications

The KAG framework combines knowledge‑graph‑driven symbolic reasoning with large language model generation to improve accuracy, reduce hallucinations, and enable controllable, domain‑specific AI services such as government and medical Q&A, with open‑source support via OpenSPG and TuGraph‑DB.

AI · Framework · knowledge graph
0 likes · 13 min read
Efficient Ops
Sep 11, 2024 · Artificial Intelligence

How AI Large Models Can Automate DevOps Pipeline Failure Analysis

This article explores how AI large‑model technology can be integrated into DevOps pipelines to automatically detect, classify, and resolve interruption events, dramatically reducing manual troubleshooting time and improving overall software development and operations efficiency.

AI · DevOps · Pipeline
0 likes · 11 min read
Baidu Geek Talk
Sep 11, 2024 · Databases

Why Vector Databases Are the Next Big Thing in AI: A Deep Dive into RAG and Baidu’s VectorDB

This article examines the 70‑year evolution of databases, explains how large‑model AI drives the rise of vector databases and Retrieval‑Augmented Generation (RAG), outlines the four‑stage RAG workflow, compares Baidu’s self‑built VectorDB with open‑source alternatives, and showcases real‑world deployments that highlight performance, scalability, and enterprise benefits.

AI · Database Architecture · Industry Insights
0 likes · 16 min read
AntTech
Sep 11, 2024 · Artificial Intelligence

AI-Driven Human-Like Robots: Insights from the 2024 Inclusion·Bund Conference

The 2024 Inclusion·Bund Conference highlighted how AI is driving humanoid robots toward greater human likeness, with industry leaders discussing AI agents, medical robotics, and embodied intelligent robots, forecasting a booming market and outlining future personalized, intelligent robot applications.

AI · Embodied Intelligence · Humanoid Robots
0 likes · 4 min read
AntTech
Sep 11, 2024 · Artificial Intelligence

2024 Inclusion·Bund Conference Forum: Exploring the Creative Boundaries and Application Imagination of Large Models

The 2024 Inclusion·Bund Conference hosted a forum on "Large Model Creativity Boundaries and Application Imagination," featuring leading AI experts who discussed agents, multimodal technology, knowledge graphs, announced a new industry alliance, unveiled three major model products, and presented a trustworthy AI framework report for finance, healthcare, and government sectors.

AI · Financial AI · industry alliance
0 likes · 6 min read
JD Cloud Developers
Sep 11, 2024 · Artificial Intelligence

How AI Agents Transform Industrial B2B Order Fulfillment in 30 Seconds

This article explains how the AutoBots platform leverages AI agents to streamline industrial B2B order fulfillment, eliminate multi‑step communication, enable instant 30‑second issue resolution, improve address verification, reduce tax risks, and deliver measurable performance gains across logistics and operations.

AI · AutoBots · address matching
0 likes · 8 min read
DevOps
Sep 10, 2024 · Artificial Intelligence

2024 Analysis of China’s Large‑Model “Six Tigers”: Slowing Model Gains, Funding Rounds, and Market Strategies

The article reviews the rapid rise and recent slowdown of Chinese large‑model startups—often called the “six tigers”—examining their product differentiation, heavy marketing spend, funding achievements, internal personnel changes, and the broader challenges they face in both B‑side and C‑side AI markets in 2024.

AI · China · Funding
0 likes · 15 min read
ITPUB
Sep 9, 2024 · Databases

What Oracle’s 2024 Global Survey Reveals About Top Databases, Cloud, and AI

Oracle’s 2024 global user survey uncovers preferred databases, key factors for database selection, the advantages of Oracle Cloud Infrastructure, and emerging AI use cases, providing a comprehensive view of current enterprise priorities and future expectations in data management and cloud services.

AI · Cloud Computing · Database Survey
0 likes · 9 min read
Baobao Algorithm Notes
Sep 9, 2024 · Artificial Intelligence

How MoSLoRA Reinvents Low‑Rank Adaptation with Mixer Matrices

This article analyzes the Mixture‑of‑Subspaces in Low‑Rank Adaptation (MoSLoRA) paper, explaining its motivation, design choices that replace LoRA's gate with a mixer matrix, connections to multi‑head attention, experimental findings on LLaMA‑3 fine‑tuning, and theoretical proofs of its re‑parameterization properties.

AI · LoRA · Mixture of Experts
0 likes · 12 min read
Baidu Geek Talk
Sep 9, 2024 · Big Data

TDS Platform Overview: Architecture, Modules, and Features of Baidu MEG's Turing 3.0 Data Ecosystem

The TDS platform, central to Baidu MEG’s Turing 3.0 ecosystem, unifies data development, warehouse management, monitoring, and resource control through Spark‑based TDE, a visual studio, and AI‑enhanced tools like Smart Diagnosis and Text2SQL, enabling standardized workflows, scalable scheduling, and handling over 30 k daily tasks.

AI · Big Data · Data Development
0 likes · 21 min read
21CTO
Sep 9, 2024 · Artificial Intelligence

How AI Tools Like Copilot, Uizard, and ChatGPT Are Transforming Web Development

This article explores three powerful AI-driven tools—GitHub Copilot, Uizard, and ChatGPT—that streamline web development workflows, enhance functionality, and accelerate design by automating code, converting sketches to prototypes, and providing instant coding assistance, ultimately boosting productivity for developers.

AI · ChatGPT · GitHub Copilot
0 likes · 5 min read
JD Tech Talk
Sep 9, 2024 · Frontend Development

AIGCDesign: A Cross‑Platform Frontend AI Component Library and Its Technical Implementation

The article introduces AIGCDesign, a cross‑platform frontend component library that leverages AI generation capabilities, explains its motivation, research of existing solutions, architectural layers, lifecycle hooks, configuration examples, multi‑framework support, business integration cases, and future stream‑processing enhancements.

AI · AIGC · React
0 likes · 15 min read
NewBeeNLP
Sep 9, 2024 · Artificial Intelligence

Can Real‑Time Learning at Serving Time Transform Recommendation Re‑ranking?

This article introduces LAST, a novel online learning approach that updates recommendation models instantly at serving time, addressing real‑time learning challenges, re‑ranking complexities, and demonstrating superior offline and online performance in industrial e‑commerce scenarios.

AI · LAST · Online Learning
0 likes · 12 min read
phodal
Sep 7, 2024 · Industry Insights

Bridging Cloud and IDE AI Agents for Seamless DevOps Collaboration

This article analyzes the current state and challenges of AI‑assisted development tools, proposes strategies to integrate cloud‑based and IDE‑embedded intelligent agents, and presents practical examples—including Shire language scripts and tool orchestration—to improve end‑to‑end software engineering efficiency.

AI · DevOps · IDE
0 likes · 10 min read
DataFunSummit
Sep 6, 2024 · Artificial Intelligence

Knowledge Graph and RAG Applications in 360 Document Cloud: Challenges and Solutions

This article presents a comprehensive overview of 360's document cloud knowledge management and Q&A scenarios, discussing business pain points, large‑model challenges, the advantages of the intelligent document solution, and how knowledge graphs enhance retrieval‑augmented generation and document standardization for AI‑driven enterprise applications.

AI · Document Management · Enterprise AI
0 likes · 15 min read