Tagged articles

Large Language Model

737 articles · Page 6 of 8

Feb 10, 2025 · Artificial Intelligence

How Much Does It Really Cost to Run a Full‑Scale DeepSeek AI Locally?

This article breaks down the hardware and software expenses required to deploy a complete DeepSeek large‑language model on‑premises, revealing a total cost of roughly $110,000 and explaining why such an investment is prohibitive for most individual developers but may be justified for well‑funded research or corporate projects.

DeepSeekDeploymentGPU

0 likes · 4 min read

How Much Does It Really Cost to Run a Full‑Scale DeepSeek AI Locally?

Big Data Tech Team

Feb 9, 2025 · Artificial Intelligence

7 Proven Prompt Techniques to Unlock DeepSeek’s Full Potential

This guide presents seven practical prompt engineering tricks—ranging from precise requirement definition and contextual background provision to step‑by‑step decomposition, keyword tagging, iterative follow‑ups, tone/style adjustments, and model switching—that dramatically improve the relevance and quality of DeepSeek’s responses for work, learning, and creative tasks.

AI productivityArtificial IntelligenceDeepSeek

0 likes · 6 min read

7 Proven Prompt Techniques to Unlock DeepSeek’s Full Potential

AIWalker

Feb 8, 2025 · Artificial Intelligence

Introducing Ola: A Full‑Modal Language Model from Tsinghua & Tencent that Unifies Image, Video, and Audio Understanding

The article presents Ola, an open‑source full‑modal LLM that uses progressive modality alignment to jointly process text, images, video, and audio, and demonstrates competitive performance across image, video, and audio benchmarks, surpassing many specialized models.

BenchmarkLarge Language ModelOla

0 likes · 22 min read

Introducing Ola: A Full‑Modal Language Model from Tsinghua & Tencent that Unifies Image, Video, and Audio Understanding

IT Architects Alliance

Feb 8, 2025 · Artificial Intelligence

Inside DeepSeek: How Its Innovative Architecture Redefines AI Performance

This article examines DeepSeek's advanced Transformer‑based architecture, dynamic routing, MoE system, multi‑stage training, efficient inference, multimodal capabilities, real‑world applications, technical challenges, and future prospects, providing a comprehensive technical analysis of the model's strengths and limitations.

AI ArchitectureDeepSeekLarge Language Model

0 likes · 15 min read

Architect

Feb 7, 2025 · Industry Insights

Can DeepSeek’s Native Chinese LLM Transform Enterprise AI and Organizational Design?

The article evaluates DeepSeek‑R1’s strong reasoning, high performance, native Chinese training and low cost, then explores how such large language models can reshape B2C and B2B services, propose a new “intelligent data store” architecture, and outline comprehensive organizational and strategic changes enterprises must adopt to thrive in the AI era.

AI strategyDeepSeekEnterprise AI

0 likes · 16 min read

Can DeepSeek’s Native Chinese LLM Transform Enterprise AI and Organizational Design?

Alibaba Cloud Developer

Feb 7, 2025 · Artificial Intelligence

Why DeepSeek V3 Achieves Low Training Costs: Inside Its AI Innovations

This article provides a comprehensive analysis of DeepSeek's large‑language‑model technology, covering the company's background, model capabilities, remarkably low training and inference costs, and the core architectural and algorithmic innovations such as MoE, MLA attention, FP8 mixed‑precision, and the DualPipe pipeline that enable efficient large‑scale AI deployment.

AI ArchitectureDeepSeekFP8 training

0 likes · 19 min read

Why DeepSeek V3 Achieves Low Training Costs: Inside Its AI Innovations

Java One

Feb 6, 2025 · Artificial Intelligence

Deploy DeepSeek‑R1 Locally on Your Laptop in Just 3 Minutes

This step‑by‑step guide shows non‑technical users how to install Ollama, pull the desired DeepSeek‑R1 model version, run it from the terminal, and optionally connect the free Chatbox desktop client for a visual chat interface, all without external network dependencies.

AI ModelChatboxDeepSeek

0 likes · 6 min read

Deploy DeepSeek‑R1 Locally on Your Laptop in Just 3 Minutes

Cognitive Technology Team

Feb 6, 2025 · Artificial Intelligence

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

This article presents ten detailed techniques for effectively using DeepSeek's large language models—including mode selection, model comparisons, knowledge updates, prompt engineering, RAG, file uploads, API access, and open‑source resources—while offering concrete examples and code snippets for each feature.

AI APIDeepSeekLarge Language Model

0 likes · 12 min read

DeepSeek Model Guide: 10 Practical Tips and Usage Techniques

Tencent Cloud Developer

Feb 3, 2025 · Artificial Intelligence

DeepSeek's Emergence: Implications for AI, Enterprise Digital Transformation, and Future Software Development

DeepSeek’s debut marks a watershed for China’s AI, offering low‑cost, Chinese‑native reasoning that outperforms foreign models and prompting enterprises to restructure development around demand‑engineering, AI‑assisted low‑code, intelligent data stores, and a shift from “how to code” to “why to code” across a three‑phase transformation roadmap.

AI strategyDeepSeekEnterprise AI

0 likes · 15 min read

DeepSeek's Emergence: Implications for AI, Enterprise Digital Transformation, and Future Software Development

Software Engineering 3.0 Era

Feb 3, 2025 · Artificial Intelligence

How OpenAI’s New Deep Research Model Aims to Redefine Search and Outpace DeepSeek

OpenAI unveiled Deep Research, an end‑to‑end reinforcement‑learning model built on the o3 architecture that claims deeper problem decomposition, longer response times, modular information discovery, integration, reasoning and output capabilities, and benchmark scores that surpass DeepSeek and rival Google Gemini, while also acknowledging current accuracy and hallucination challenges.

Artificial IntelligenceBenchmarkDeep Research

0 likes · 12 min read

How OpenAI’s New Deep Research Model Aims to Redefine Search and Outpace DeepSeek

Software Engineering 3.0 Era

Feb 1, 2025 · Artificial Intelligence

DeepSeek Deep Dive: How Its Breakthroughs Could Usher in an Era of Universal AI

The article provides a detailed analysis of DeepSeek’s model performance across language, reasoning, and code generation benchmarks, its cost‑effective training methods, novel architecture innovations, the team’s expertise, and the broader impact these factors may have on accelerating AI innovation and reshaping industry competition.

AI benchmarksAI industry impactDeepSeek

0 likes · 18 min read

DeepSeek Deep Dive: How Its Breakthroughs Could Usher in an Era of Universal AI

21CTO

Jan 31, 2025 · Artificial Intelligence

How DeepSeek‑R1 Is Redefining Open‑Source AI and Challenging OpenAI’s O1

DeepSeek‑R1, an open‑source inference model released under the MIT license, matches or surpasses OpenAI’s O1 on math, coding, and reasoning benchmarks, offers multiple scaled versions, runs at lightning speed, and is rapidly adopted worldwide, signaling a shift toward more accessible, high‑performance AI.

BenchmarkDeepSeek-R1Large Language Model

0 likes · 9 min read

How DeepSeek‑R1 Is Redefining Open‑Source AI and Challenging OpenAI’s O1

DataFunTalk

Jan 29, 2025 · Artificial Intelligence

ChatBI: NetEase AI‑Powered Business Intelligence Platform – Architecture, Technology, and Real‑World Applications

This article introduces ChatBI, NetEase’s AI‑driven BI solution that combines large‑model capabilities with traditional data analytics, detailing its product features, AI‑enabled opportunities and challenges, the underlying NL2SQL model, technical architecture, performance optimizations such as materialized views, open APIs, and several enterprise deployment cases.

AIBIData Analytics

0 likes · 21 min read

ChatBI: NetEase AI‑Powered Business Intelligence Platform – Architecture, Technology, and Real‑World Applications

DataFunTalk

Jan 27, 2025 · Artificial Intelligence

Improving AI Agent Planning and Reasoning: Challenges and Practical Solutions

The article examines current limitations of AI agents in planning and complex reasoning, critiques existing methods like COT/TOT and ReAct, and proposes practical strategies—including combined COT‑Reflection approaches, structured memory algorithms, and white‑box interaction designs—to enhance agent performance within the DataFun knowledge map framework.

AI AgentCoTLarge Language Model

0 likes · 3 min read

Improving AI Agent Planning and Reasoning: Challenges and Practical Solutions

DataFunTalk

Jan 26, 2025 · Artificial Intelligence

58.com’s LingXi Large Language Model Platform: Development, Deployment, and Performance Optimizations

Since the launch of ChatGPT, 58.com has built a Model‑as‑a‑Service platform called LingXi that trains and serves domain‑specific large language models, supports over a hundred internal scenarios with daily inference exceeding ten million calls, and continuously improves performance through quantization, GPU optimization, model miniaturization, and advanced AI applications such as interview assistants, voice agents, and RAG‑enabled agents.

AI ApplicationsAI platformInference Optimization

0 likes · 9 min read

58.com’s LingXi Large Language Model Platform: Development, Deployment, and Performance Optimizations

AI Code to Success

Jan 26, 2025 · Industry Insights

How DeepSeek‑R1 Is Challenging OpenAI’s o1 and Shaping the AI Landscape

DeepSeek‑R1 achieved a 1357‑point Arena score, ranking third overall and tying OpenAI o1 for first in StyleCtrl, while its open‑source MIT‑licensed release—including distilled variants—and low‑cost API service aim to democratize advanced AI inference for developers worldwide.

AI competitionArena benchmarkDeepSeek

0 likes · 5 min read

How DeepSeek‑R1 Is Challenging OpenAI’s o1 and Shaping the AI Landscape

DataFunSummit

Jan 25, 2025 · Artificial Intelligence

AI-Driven Next-Generation Sales: Project Overview, Core Technologies, System Deployment, and Future Outlook

This article explores how AI transforms next‑generation sales by detailing project background and goals, core technologies such as efficient sample generation, model training and evaluation, system deployment impact, practical case studies, challenges, solutions, and future directions across multiple industries.

AIEvaluationLarge Language Model

0 likes · 25 min read

AI-Driven Next-Generation Sales: Project Overview, Core Technologies, System Deployment, and Future Outlook

Kuaishou Tech

Jan 24, 2025 · Artificial Intelligence

KwaiCoder-23BA4-v1: An Efficient Large Code Generation Model via Pruning, Knowledge Distillation, and Granular Upcycling

KwaiCoder-23BA4-v1 is a 23B wide MoE code‑completion model that achieves state‑of‑the‑art performance on HumanEval, BigCodeBench and Fill‑in‑Middle benchmarks by using high‑quality data, a cost‑effective training pipeline that combines model pruning, knowledge distillation and fine‑grained merging, and extensive ablation studies.

AIBenchmarkKnowledge Distillation

0 likes · 10 min read

KwaiCoder-23BA4-v1: An Efficient Large Code Generation Model via Pruning, Knowledge Distillation, and Granular Upcycling

Baobao Algorithm Notes

Jan 22, 2025 · Artificial Intelligence

Can RL‑Only Training Make LLMs Beat OpenAI‑o1? Inside DeepSeek‑R1’s Architecture and Results

DeepSeek‑R1’s open‑source series demonstrates that reinforcement‑learning‑only training can match top‑tier models like OpenAI‑o1, while a small amount of SFT further improves readability; the article dissects its technical report, training pipeline, reward design, distillation strategy, benchmark outcomes, and remaining challenges.

DeepSeekLarge Language ModelSupervised Fine‑Tuning

0 likes · 11 min read

Can RL‑Only Training Make LLMs Beat OpenAI‑o1? Inside DeepSeek‑R1’s Architecture and Results

Baidu Tech Salon

Jan 21, 2025 · Artificial Intelligence

How AI Is Transforming Legal Research: Inside the YuanDian WenDa Smart Q&A Engine

Faced with billions of legal documents and the shortcomings of keyword search, Chinese legal professionals are turning to the AI‑powered YuanDian WenDa engine, which leverages Baidu's Wenxin model, a structured legal database, and prompt‑engineering to deliver trustworthy, citation‑rich answers and rapid research reports.

AIInformation RetrievalLarge Language Model

0 likes · 10 min read

How AI Is Transforming Legal Research: Inside the YuanDian WenDa Smart Q&A Engine

AIWalker

Jan 18, 2025 · Artificial Intelligence

How InternLM 3.0 Achieves High Performance with Just 4 TB of Training Data

Shanghai AI Laboratory’s InternLM 3.0 upgrade demonstrates that a refined 4 TB token dataset can boost a large‑language model’s performance beyond that of open‑source peers trained on 18 TB, cutting training cost by over 75% while merging regular dialogue with deep reasoning capabilities.

AI evaluationData EfficiencyInternLM

0 likes · 9 min read

How InternLM 3.0 Achieves High Performance with Just 4 TB of Training Data

AIWalker

Jan 17, 2025 · Artificial Intelligence

InternLM 3.0: Boosting Model Performance with Only 4 TB of Training Data

Shanghai AI Laboratory’s InternLM 3.0 upgrade demonstrates that refining data quality—measured as intelligence‑per‑token—can replace massive datasets, achieving higher reasoning and dialogue capabilities with just 4 TB of tokens, cutting training cost by over 75 % while approaching GPT‑4‑level performance.

AI researchData EfficiencyInternLM

0 likes · 9 min read

InternLM 3.0: Boosting Model Performance with Only 4 TB of Training Data

AIWalker

Jan 16, 2025 · Artificial Intelligence

How InternLM 3.0 Achieves High Performance with Just 4 TB of Training Data

InternLM 3.0 (InternLM‑3) upgrades the Shusheng‑PuYu model by refining data to boost "thinking density", using only 4 TB of tokens to surpass peer open‑source models, cutting training cost by over 75% while merging ordinary dialogue with deep reasoning capabilities.

Data EfficiencyInternLMLarge Language Model

0 likes · 9 min read

Alibaba Cloud Native

Jan 13, 2025 · Cloud Native

Build a Serverless AI Summarization Assistant with Alibaba Cloud Function Compute and Baileian

This guide explains how to use Alibaba Cloud Function Compute together with the Baileian large‑model platform to create a highly available, cloud‑native AI summarization service that automatically extracts key information from massive documents.

AI-summarizationAlibaba CloudLarge Language Model

0 likes · 8 min read

Build a Serverless AI Summarization Assistant with Alibaba Cloud Function Compute and Baileian

Infra Learning Club

Jan 12, 2025 · Artificial Intelligence

How to Connect a XiaoAI Speaker to a Large Language Model

This guide walks through preparing a XiaoAI speaker, selecting a free LLM service, creating an API key, installing Docker, running the MiGPT server, and configuring the speaker to query the chosen large language model.

DockerLarge Language ModelMiGPT

0 likes · 6 min read

How to Connect a XiaoAI Speaker to a Large Language Model

JD Cloud Developers

Jan 9, 2025 · Artificial Intelligence

Boost Your Java Apps with LangChain4j: A Hands‑On RAG Guide

This article walks Java developers through the fundamentals of Retrieval‑Augmented Generation (RAG), explains the LangChain4j framework, compares large‑model development with traditional Java coding, and provides step‑by‑step code examples for environment setup, document splitting, embedding, vector‑store operations, and LLM interaction.

EmbeddingJavaLangChain4j

0 likes · 34 min read

Boost Your Java Apps with LangChain4j: A Hands‑On RAG Guide

Alibaba Cloud Developer

Jan 9, 2025 · Artificial Intelligence

Unlocking Large Model Power: From Semantic Vectors to Real‑World Business Applications

This article explores large‑model capabilities through semantic‑vector theory, outlines business‑scenario focus, presents practical case studies such as AI customer‑service bots, and details prompt‑engineering techniques and optimization workflows to help practitioners effectively apply foundation models in real‑world tasks.

Business ApplicationLarge Language Modelsemantic vectors

0 likes · 36 min read

Unlocking Large Model Power: From Semantic Vectors to Real‑World Business Applications

Baobao Algorithm Notes

Jan 7, 2025 · Artificial Intelligence

How Efficient Is DeepSeek V3? Calculating Its MFU Around 37%

This article derives DeepSeek V3's training Model FLOPs Utilization (MFU) using publicly available data, showing an MFU of roughly 37%—about a 60% improvement over V2—and provides detailed formulas, parameter settings, and a reproducible Python script.

AI performanceDeepSeekLarge Language Model

0 likes · 8 min read

How Efficient Is DeepSeek V3? Calculating Its MFU Around 37%

Alibaba Cloud Developer

Jan 2, 2025 · Operations

Mastering Error and Latency Diagnosis for Online Applications

This article presents a systematic root‑cause diagnosis framework for online applications, covering how to identify and resolve both error ("wrong") and performance ("slow") problems using trace links, associated data, high‑quality observability, and large‑language‑model‑driven intelligence.

Cloud MonitoringLarge Language ModelRoot Cause Analysis

0 likes · 12 min read

Mastering Error and Latency Diagnosis for Online Applications

DataFunTalk

Dec 27, 2024 · Artificial Intelligence

Designing Enterprise Business Analysis Agents with Large Language Models

This article explains how large‑model capabilities combined with metric and tag platforms can be used to build intelligent data‑analysis products for enterprises, covering challenges, solution routes such as NLP2SQL, NLP2API, NLP2Python, agent design, planning, and future outlooks.

AI AgentBusiness IntelligenceEnterprise Analytics

0 likes · 21 min read

Designing Enterprise Business Analysis Agents with Large Language Models

NewBeeNLP

Dec 23, 2024 · Artificial Intelligence

What’s New in Qwen2.5? A Deep Dive into the Latest LLM Advances

The Qwen2.5 Technical Report introduces a new series of large language models with up to 72 B parameters, expanded pre‑training data to 18 trillion tokens, advanced supervised fine‑tuning and reinforcement learning pipelines, and demonstrates strong performance across comprehension, reasoning, coding, and long‑context tasks.

LLMLarge Language ModelQwen2.5

0 likes · 5 min read

What’s New in Qwen2.5? A Deep Dive into the Latest LLM Advances

iQIYI Technical Product Team

Dec 19, 2024 · Artificial Intelligence

Project BaixiaoSheng: An AI‑Powered Project Management Assistant – iQIYI Case Study

Project BaixiaoSheng, iQIYI’s AI‑powered project management assistant unveiled at the 13th TOP 100 Global Software Case Study Summit, uses a Retrieval‑Augmented Generation framework with static knowledge Q&A, dynamic data consulting, and scenario‑assistant automation to cut context‑switching, streamline data flow, and boost cross‑system efficiency, while future plans target fine‑tuned LLMs, multi‑model fusion, and AI‑agent orchestration.

AIKnowledge BaseLarge Language Model

0 likes · 11 min read

Project BaixiaoSheng: An AI‑Powered Project Management Assistant – iQIYI Case Study

Alimama Tech

Dec 11, 2024 · Artificial Intelligence

Engineering Architecture of Alibaba's AI Digital Employee "AI XiaoWan"

Alibaba’s AI digital employee “AI XiaoWan” uses a native multi‑agent architecture where a Controller Agent interprets intent, plans tasks, and orchestrates execution while an Executable Agent performs domain‑specific operations, communicating via a standardized Agent Communication Protocol, leveraging a centralized Tool Center, a retrieval‑augmented knowledge base, and a data‑flywheel feedback loop to continuously improve and evolve toward memory‑based reasoning and self‑learning.

AIKnowledge BaseLarge Language Model

0 likes · 14 min read

Engineering Architecture of Alibaba's AI Digital Employee "AI XiaoWan"

DataFunTalk

Dec 10, 2024 · Artificial Intelligence

Tencent Large Language Model Applications: RAG, GraphRAG, and Agent Technologies

This article explores Tencent's large language model deployments across various business scenarios, detailing the principles and practical implementations of Retrieval‑Augmented Generation (RAG), GraphRAG for role‑playing, and Agent technologies, while also covering model fine‑tuning, knowledge‑base construction, and evaluation methods.

AI ApplicationsAgentGraphRAG

0 likes · 15 min read

Tencent Large Language Model Applications: RAG, GraphRAG, and Agent Technologies

AI Large Model Application Practice

Dec 9, 2024 · Artificial Intelligence

How GUI Agents Use Large Models to Automate Any Desktop Task

This article explains why GUI agents are needed, defines their multimodal capabilities, walks through a high‑level automation scenario, details the architecture of large‑model‑driven GUI agents, highlights recent open‑source projects, and compares them with traditional RPA solutions.

AI automationGUI AgentHuman-Computer Interaction

0 likes · 10 min read

How GUI Agents Use Large Models to Automate Any Desktop Task

DataFunSummit

Dec 7, 2024 · Artificial Intelligence

Technical Practices of Tencent's Intelligent BI System: Architecture, Model Fine‑Tuning, and Agent Design

This article details Tencent's shift from traditional BI to an AI‑driven intelligent BI platform, describing the challenges of architecture, large‑language‑model integration, and data integration, and presenting the OlaChat framework, unified orchestration, atomic agents, DSL conversion, monitoring, and future roadmap.

AIIntelligent BILarge Language Model

0 likes · 22 min read

Technical Practices of Tencent's Intelligent BI System: Architecture, Model Fine‑Tuning, and Agent Design

Tencent Cloud Developer

Dec 5, 2024 · Industry Insights

Why Most RAG Projects Fail and How Tencent’s LeXiang AI Assistant Overcomes Them

The article analyses the rapid growth of Retrieval‑Augmented Generation (RAG) in enterprises, explains why self‑built RAG solutions often collapse under cost and maintenance pressures, and demonstrates how Tencent LeXiang AI Assistant addresses these issues through a robust knowledge‑management core, extensive industry experience, scalable resources, and advanced multimodal capabilities.

AI assistantEnterprise AIKnowledge Management

0 likes · 16 min read

Why Most RAG Projects Fail and How Tencent’s LeXiang AI Assistant Overcomes Them

Baidu Tech Salon

Nov 29, 2024 · Artificial Intelligence

How AI‑Powered “WenZhi” Transforms Job Matching with Baidu’s ERNIE Model

Faced with overloaded job listings and low offer rates, a group of students built “WenZhi,” an AI‑driven job‑matching app that leverages Baidu’s ERNIE SDK, generative recommendation, and workflow orchestration to deliver personalized role suggestions and interview advice within minutes.

AICareer TechnologyERNIE SDK

0 likes · 7 min read

How AI‑Powered “WenZhi” Transforms Job Matching with Baidu’s ERNIE Model

Tencent Cloud Developer

Nov 27, 2024 · Artificial Intelligence

Tencent Cloud AI Code Assistant: Product Evolution, Architecture, and Technical Implementation

Tencent Cloud AI Code Assistant has evolved from token‑level IDE completions to LLM‑driven multi‑modal coding and chat features, employing a dual‑loop R&D system, Hunyuan‑based code models, and sophisticated trigger, prompt, stop, and display strategies to deliver context‑aware, secure, and efficient code generation within IDE and review environments.

AB testingAI code assistantAST analysis

0 likes · 15 min read

Tencent Cloud AI Code Assistant: Product Evolution, Architecture, and Technical Implementation

Meituan Technology Team

Nov 21, 2024 · Frontend Development

AutoConsis: Automated UI Consistency Detection for Mobile Apps Using Multimodal AI

AutoConsis is a research‑driven, AI‑powered workflow that automatically detects UI content inconsistencies across mobile app pages by combining target region recognition, OCR‑based extraction, and large language model reasoning, achieving low cost, high generalization, and high confidence as demonstrated on Meituan's large‑scale marketing scenarios.

CLIPICSE 2024Large Language Model

0 likes · 15 min read

AutoConsis: Automated UI Consistency Detection for Mobile Apps Using Multimodal AI

Rare Earth Juejin Tech Community

Nov 20, 2024 · Artificial Intelligence

Resolving 02_DocQA.py Errors and Using LangChain to Call Large Models Locally

This guide explains how to fix the ArkNotFoundError in the 02_DocQA.py script by configuring a Doubao‑embedding endpoint, setting up a Conda environment with the latest LangChain packages, and provides step‑by‑step code examples for invoking both Zhipu glm‑4 and Volcano large language models via LangChain.

EmbeddingEnvironment setupLangChain

0 likes · 9 min read

Resolving 02_DocQA.py Errors and Using LangChain to Call Large Models Locally

Baidu Tech Salon

Nov 19, 2024 · Artificial Intelligence

Baidu's Wenxin AI Agent Technology Wins Leading Science and Technology Award at 2024 World Internet Conference

At the 2024 World Internet Conference in Wuzhen, Baidu’s Wenxin AI Agent technology earned the Leading Science and Technology Award, marking its second consecutive win and highlighting the system’s brain‑inspired “System 2” architecture that enhances large‑model reasoning, accelerates diverse applications, and drives significant social and economic value.

AIAI AgentAward

0 likes · 6 min read

Baidu's Wenxin AI Agent Technology Wins Leading Science and Technology Award at 2024 World Internet Conference

Baidu Tech Salon

Nov 14, 2024 · Artificial Intelligence

How Baidu’s Wenxin Model Hit 430 Million Users and What Its New Tech Means for AI

At Baidu World 2024, CTO Wang Haifeng revealed that Wenxin Yiyan has reached 430 million users, detailed the model’s retrieval‑augmented and multimodal generation breakthroughs, showcased intelligent‑agent‑driven coding tools, and highlighted expanding AI applications across education, sports, and industry.

AIIntelligent agentsLarge Language Model

0 likes · 7 min read

How Baidu’s Wenxin Model Hit 430 Million Users and What Its New Tech Means for AI

Architects' Tech Alliance

Nov 12, 2024 · Artificial Intelligence

How Retrieval‑Augmented Generation Boosts Enterprise AI with Intel Optimizations

This article explains the fundamentals of Retrieval‑Augmented Generation (RAG), its four‑step workflow, architecture, and how Intel’s hardware and software optimizations—including vector search, quantized embeddings, and advanced inference extensions—enhance performance, security, and scalability for enterprise LLM applications.

AI inferenceEmbedding QuantizationIntel Optimization

0 likes · 14 min read

How Retrieval‑Augmented Generation Boosts Enterprise AI with Intel Optimizations

DataFunSummit

Nov 8, 2024 · Artificial Intelligence

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Retrieval‑Augmented Generation

ChatDBA, developed by Shanghai Aikesheng, is an AI-driven database operation assistant that leverages large language models and Retrieval‑Augmented Generation to provide fault diagnosis, knowledge learning, SQL generation and optimization, addressing challenges such as vague outputs, complex troubleshooting logic, and memory management through a structured architecture and multi‑modal retrieval strategies.

AIDatabaseFault diagnosis

0 likes · 10 min read

ChatDBA: An AI‑Powered Database Fault Diagnosis Assistant Using Retrieval‑Augmented Generation

Tencent Cloud Developer

Nov 6, 2024 · Artificial Intelligence

Overview of Tencent Hunyuan Large and 3D Generation Model Open‑Source Release

Tencent has open‑sourced its 389‑billion‑parameter Hunyuan Large Mixture‑of‑Experts model—featuring 52 B active parameters, 256 K token context, novel routing, KV‑cache compression, and advanced training optimizations that beat leading open‑source models—and its first text‑to‑3D/image‑to‑3D Hunyuan 3D Generation model, both downloadable via GitHub, Hugging Face, and Tencent Cloud.

3D generationAI researchLarge Language Model

0 likes · 9 min read

Overview of Tencent Hunyuan Large and 3D Generation Model Open‑Source Release

DataFunSummit

Oct 27, 2024 · Artificial Intelligence

How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”

This article describes Siemens' journey in applying generative AI and Retrieval‑Augmented Generation to create an internal knowledge chatbot, detailing the business challenges, technical architecture, data integration, multi‑modal capabilities, deployment outcomes, and strategic lessons for enterprise AI adoption.

AI ChatbotData IntegrationEnterprise Knowledge Management

0 likes · 21 min read

How Siemens Harnesses Generative AI to Build the Enterprise Knowledge Chatbot “XiaoYu”

DataFunSummit

Oct 24, 2024 · Big Data

Bilibili’s Large Language Model‑Based Intelligent Assistant for the Big Data Platform: Architecture, Principles, and Deployment

This article details Bilibili’s implementation of a large‑language‑model‑driven intelligent assistant for its massive big‑data platform, covering background, problem analysis, architectural design, knowledge‑base construction, precision and recall challenges, deployment across offline and real‑time Spark/Flink diagnostics, and future outlooks.

AgentBig DataFlink

0 likes · 23 min read

Bilibili’s Large Language Model‑Based Intelligent Assistant for the Big Data Platform: Architecture, Principles, and Deployment

DataFunSummit

Oct 21, 2024 · Artificial Intelligence

Retrieval‑Augmented Generation (RAG) for Office Applications: Architecture, Challenges, and Practical Practices

This article introduces Retrieval‑Augmented Generation (RAG) as a solution to the hallucination, freshness, and data‑privacy issues of large language models, details its modular architecture, explains the layered system design and hybrid retrieval pipeline, and shares the practical challenges and engineering tricks encountered when deploying RAG in enterprise office scenarios.

AIHybrid RetrievalLarge Language Model

0 likes · 19 min read

Retrieval‑Augmented Generation (RAG) for Office Applications: Architecture, Challenges, and Practical Practices

Baidu Tech Salon

Oct 17, 2024 · Artificial Intelligence

How to Deploy Yuan 2.0 LLM with PaddleNLP: A Step‑by‑Step Guide

This article explains how the open‑source Yuan 2.0 large language model is fully integrated with Baidu’s PaddleNLP, covering its capabilities, fine‑tuning optimizations, step‑by‑step deployment instructions, interaction examples, and training/finetuning results with loss‑curve visualizations.

AILarge Language ModelPaddleNLP

0 likes · 10 min read

How to Deploy Yuan 2.0 LLM with PaddleNLP: A Step‑by‑Step Guide

DataFunTalk

Oct 11, 2024 · Artificial Intelligence

ChatBI: Leveraging Large Language Models for Intelligent Business Intelligence at Ximalaya

This article details Ximalaya’s ChatBI project, describing how large language models are integrated into a BI platform to improve data accessibility, reduce development effort, and enhance query accuracy through prompt engineering, RAG, fine‑tuning, and multi‑agent architectures.

AIBusiness IntelligenceData Platform

0 likes · 10 min read

ChatBI: Leveraging Large Language Models for Intelligent Business Intelligence at Ximalaya

Java Tech Enthusiast

Oct 10, 2024 · Artificial Intelligence

Google Rehires AI Pioneer Noam Shazeer for Gemini Development

Google has signed a $2.7 billion agreement to rehire AI pioneer Noam Shazeer—co‑author of the seminal “Attention is All You Need” paper and creator of the Meena chatbot—bringing him back from his Character.AI venture to serve as vice president overseeing the Gemini generative‑AI project alongside DeepMind leaders, thereby bolstering Google’s competitive edge in the field.

AICharacter.AIGemini

0 likes · 8 min read

Google Rehires AI Pioneer Noam Shazeer for Gemini Development

Zhihu Tech Column

Oct 10, 2024 · Artificial Intelligence

Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model

This article presents a method for massive multi-label text classification on Zhihu content by combining a semantic retrieval model with a proprietary large AI model, detailing the challenges of large label spaces, model architecture, loss optimization, and experimental results showing significant accuracy gains.

BGELarge Language ModelSemantic Retrieval

0 likes · 16 min read

Massive Multi-Label Text Classification via Semantic Retrieval and Large AI Model

58 Tech

Sep 23, 2024 · Artificial Intelligence

Enhancing Commercial Search with Knowledge Graphs and Large‑Model Techniques

This article describes how a commercial search platform iteratively upgrades its system by structuring business knowledge into a knowledge graph, applying multi‑stage entity extraction (CRF, Electra‑CRF, GLM‑3, OCR), and leveraging large language models to improve relevance, user experience, and revenue.

AILarge Language ModelNLP

0 likes · 14 min read

Enhancing Commercial Search with Knowledge Graphs and Large‑Model Techniques

Data Thinking Notes

Sep 13, 2024 · Artificial Intelligence

How OpenAI’s o1 Series Redefines Complex Reasoning and AI Safety

OpenAI’s new o1 series, including o1‑preview and o1‑mini, leverages reinforcement‑learning‑based chain‑of‑thought reasoning to achieve superior performance on academic exams, coding contests, and safety benchmarks, offering faster, cost‑effective options while advancing AI alignment and human‑preference evaluation.

AI safetyBenchmarkLarge Language Model

0 likes · 15 min read

How OpenAI’s o1 Series Redefines Complex Reasoning and AI Safety

MaGe Linux Operations

Sep 13, 2024 · Artificial Intelligence

Can OpenAI’s New o1 Model Reach Human‑Level Reasoning?

OpenAI’s newly released o1 series introduces a reinforcement‑learning‑trained LLM that generates long chain‑of‑thought reasoning, achieving top‑50% scores on IOI contests, high rankings on Codeforces and AIME, and dramatically outperforming GPT‑4o across scientific and mathematical tasks.

AI reasoningArtificial IntelligenceLarge Language Model

0 likes · 8 min read

Can OpenAI’s New o1 Model Reach Human‑Level Reasoning?

Qunhe Technology Quality Tech

Sep 10, 2024 · Artificial Intelligence

Boost Test Case Creation with AI: How a Multi‑Model Platform Cuts Effort by 80%

An AI-driven test case generation platform at KuJiaLe leverages multiple large language models, offering three input methods, online editing, and dual export options, while addressing stability, length limits, and security challenges to improve testing efficiency and achieve over 80% success rate.

AI testingLarge Language Modelautomation

0 likes · 10 min read

Boost Test Case Creation with AI: How a Multi‑Model Platform Cuts Effort by 80%

Xiaohongshu Tech REDtech

Sep 2, 2024 · Artificial Intelligence

How AIGC Transforms Advertising Material Creation on Xiaohongshu

This article analyzes how large‑model AIGC reshapes the production, evaluation, and deployment of advertising creatives on Xiaohongshu, detailing the business motivations, technical pipeline, controllable generation, reward‑model filtering, and experimental results that balance commercial efficiency with community tone.

AIGCAdvertisingIndustry Case Study

0 likes · 14 min read

How AIGC Transforms Advertising Material Creation on Xiaohongshu

Volcano Engine Developer Services

Aug 29, 2024 · Artificial Intelligence

Building a Multi‑Model AI Bot: Design, Prompt Tricks, and Lessons Learned

This article details the creation of a multi‑model AI chatbot, covering its core features, workflow, prompt role configuration, parameter tuning, anti‑reverse‑engineering measures, competitive landscape, and reflective insights for developers building large‑model applications.

AI botLarge Language Modelbuild in public

0 likes · 12 min read

Building a Multi‑Model AI Bot: Design, Prompt Tricks, and Lessons Learned

Baobao Algorithm Notes

Aug 27, 2024 · Artificial Intelligence

Unlock Free GLM-4-Flash API: Step-by-Step Guide, Code Samples, and Logic Puzzle Test

This article explores the free GLM-4-Flash API from Zhipu AI, detailing its lightweight architecture, performance specs, a logic‑puzzle demonstration, and provides a comprehensive step‑by‑step tutorial—including data upload, model fine‑tuning, deployment commands and example code for building a LangChain‑based knowledge‑base retrieval system.

AI DeploymentFree APIGLM-4-Flash

0 likes · 11 min read

Unlock Free GLM-4-Flash API: Step-by-Step Guide, Code Samples, and Logic Puzzle Test

DataFunSummit

Aug 20, 2024 · Artificial Intelligence

Applying Large Language Models to Intelligent Telemarketing: Evolution, Architecture, and Future Outlook

This article reviews the evolution of telephone sales, introduces large model technologies, outlines their integration into intelligent telemarketing workflows, discusses practical implementation methods, challenges, and future trends, and shares insights from industry experts on optimizing AI‑driven sales automation.

AICustomer ExperienceData Security

0 likes · 17 min read

Applying Large Language Models to Intelligent Telemarketing: Evolution, Architecture, and Future Outlook

JD Tech Talk

Aug 19, 2024 · Artificial Intelligence

AI‑Driven Automated Question Generation for Aviation Maintenance Training

The article describes how JD Aviation’s maintenance department uses a vector‑based knowledge base and large‑language‑model services to automatically generate, evaluate, and maintain training exam questions, addressing the rapid growth of manuals, frequent updates, and the heavy manual workload of traditional test creation.

AIKnowledge BaseLarge Language Model

0 likes · 12 min read

AI‑Driven Automated Question Generation for Aviation Maintenance Training

21CTO

Aug 17, 2024 · Artificial Intelligence

Understanding Large Language Models: Training, Uses, and a Llama 3 Code Demo

This article explains what large language models (LLMs) are, how they are trained, their diverse applications across industries, the challenges they face, and provides a practical Python example using Replicate to run Meta's Llama 3‑70b‑instruct model.

AILLMLarge Language Model

0 likes · 11 min read

Understanding Large Language Models: Training, Uses, and a Llama 3 Code Demo

Meituan Technology Team

Aug 8, 2024 · Artificial Intelligence

BlackPearl Team Wins All Three Tracks of KDD 2024 OAG‑Challenge Cup with Large‑Model Solutions

The BlackPearl team from Meituan’s Dazhong Dianping division swept all three KDD 2024 OAG‑Challenge Cup tracks—WhoIsWho, PST, and AQA—by deploying innovative large‑model techniques such as iterative text clustering, graft‑learning‑enhanced BERT RAG pipelines, and a Boosting LLM‑for‑Vector search, and have released the code publicly on GitHub.

Academic DisambiguationKDD CupLarge Language Model

0 likes · 4 min read

BlackPearl Team Wins All Three Tracks of KDD 2024 OAG‑Challenge Cup with Large‑Model Solutions

58 Tech

Aug 7, 2024 · Artificial Intelligence

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

In this article, 58.com AI Lab senior director Zhan Kunlin explains how the company built a multi‑layer AI platform, created a vertical large‑language model called LingXi, and developed an AI Agent system with RAG capabilities to accelerate practical AI applications across various business scenarios.

AI AgentsAI platformLarge Language Model

0 likes · 10 min read

Bridging Compute and Applications: 58.com AI Lab’s Large‑Model Platform and AI Agent Solutions

NewBeeNLP

Aug 5, 2024 · Industry Insights

How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs

This article details Alibaba Cloud's end‑to‑end architecture for search and advertising recommendation, covering the data platform, AI services, feature‑store design, training and inference optimizations, and the integration of large language models for new recommendation scenarios.

AI platformAlibaba CloudBig Data

0 likes · 17 min read

How Alibaba Cloud Scales Search Recommendations with Big Data, AI, and LLMs

Java Tech Enthusiast

Aug 1, 2024 · Artificial Intelligence

Apple Intelligence: Inside the New Apple Foundation Model

Apple Intelligence, an on‑device AI suite debuting with iOS 18.1 beta, centers on the Apple Foundation Model—a 3‑billion‑parameter on‑device LLM (and a larger undisclosed cloud version) trained on TPUs with novel RL algorithms and mixed‑precision quantization, delivering Siri, writing assistance, photo search, and benchmark performance that surpasses GPT‑4, though currently limited to paid developers.

AIApple IntelligenceLarge Language Model

0 likes · 11 min read

Apple Intelligence: Inside the New Apple Foundation Model

DataFunTalk

Aug 1, 2024 · Artificial Intelligence

Ant Group's Time Series AI Practices: AntFlux Engine and Real‑World Applications

This article presents Ant Group's comprehensive time‑series AI solutions, detailing the AntFlux platform, the evolution from statistical to deep and large‑scale models—including Time‑LLM, iTransformer, and SLOTH—and illustrating how these technologies empower business insight, forecasting, decision‑making, and green computing across diverse scenarios.

AntFluxLarge Language Modelforecasting

0 likes · 17 min read

Ant Group's Time Series AI Practices: AntFlux Engine and Real‑World Applications

Baobao Algorithm Notes

Jul 31, 2024 · Artificial Intelligence

What Makes Mistral’s 7B, Mixtral, and Large 2 Models Stand Out? A Deep Technical Dive

This article compiles key technical details of the Mistral model family—including Mistral 7B, Mixtral 8×7B, Mixtral 8×22B, Mistral Nemo, and Mistral Large 2—covering their architectural innovations such as sliding‑window attention, grouped‑query attention, mixture‑of‑experts design, scaling parameters, performance benchmarks, quantization requirements, and practical deployment commands.

Grouped Query AttentionLarge Language ModelMistral

0 likes · 17 min read

What Makes Mistral’s 7B, Mixtral, and Large 2 Models Stand Out? A Deep Technical Dive

DataFunSummit

Jul 30, 2024 · Artificial Intelligence

Multimodal Mobile AI Agent (Mobile‑Agent): From V1 to V2 and Open‑Source Practice

This article introduces Alibaba Tongyi Lab's multimodal mobile AI agent, Mobile‑Agent, covering the background of large‑model agents, the design and capabilities of V1 and V2, the multi‑agent framework, evaluation results, open‑source resources, and future development directions.

AI planningLarge Language ModelMobile Agent

0 likes · 13 min read

Multimodal Mobile AI Agent (Mobile‑Agent): From V1 to V2 and Open‑Source Practice

DataFunTalk

Jul 26, 2024 · Artificial Intelligence

Llama 3: Open‑source Large Language Model Technical Report and Evaluation

This comprehensive technical report details the development, architecture, training methodology, extensive benchmark evaluations, safety measures, and inference optimizations of Meta's open‑source Llama 3 large language model series, covering models up to 405 billion parameters and supporting multilingual, multimodal, and tool‑use capabilities.

AILLaMALarge Language Model

0 likes · 115 min read

Llama 3: Open‑source Large Language Model Technical Report and Evaluation

Data Thinking Notes

Jul 25, 2024 · Information Security

How Large Language Models Transform Data Security Compliance Management

This article explains how a leading insurance technology group leverages large language models to streamline data security compliance, detailing the evolution of data management, key governance challenges, multimodal AI architecture, and practical workflows for policy enforcement, risk monitoring, and asset management.

AIData GovernanceData Security

0 likes · 10 min read

How Large Language Models Transform Data Security Compliance Management

NewBeeNLP

Jul 25, 2024 · Artificial Intelligence

Llama 3.1 Unveiled: How the New Open‑Source Giant Matches GPT‑4o and Claude 3.5

Meta has officially released Llama 3.1, a 405‑billion‑parameter open‑source model that matches or surpasses GPT‑4o and Claude 3.5 on over 150 benchmarks, expands context to 128 K tokens, supports eight languages, and is accompanied by a detailed 100‑page paper describing its data, training stack, architecture, quantization, safety measures, and ecosystem support.

AI safetyLarge Language ModelLlama 3.1

0 likes · 15 min read

Llama 3.1 Unveiled: How the New Open‑Source Giant Matches GPT‑4o and Claude 3.5

Kuaishou Tech

Jul 17, 2024 · Artificial Intelligence

Key Technical Innovations in Kuaishou’s “Kuaiyi” Large Model and Its Real-World Applications

The article details Kuaishou’s development of the 175B “Kuaiyi” multimodal large model, presenting eight novel technical innovations—from Temporal Scaling Law and MiLe Loss to MoE‑enhanced reward modeling—and describes how these advances enable high‑performance AI services such as the AI Xiao Kuai chatbot across diverse real‑world scenarios.

AI ApplicationsLarge Language ModelModel Optimization

0 likes · 12 min read

Key Technical Innovations in Kuaishou’s “Kuaiyi” Large Model and Its Real-World Applications

Alibaba Cloud Developer

Jul 17, 2024 · Artificial Intelligence

How Alibaba Cloud Built Service‑Domain AI Agents: Design, Practice, and Results

This article explains how Alibaba Cloud designed and deployed large‑language‑model agents for its service domain, covering background, ideal LLM deployment, the shift from explanation to problem solving, the agent framework, practical implementation, automation trade‑offs, training, evaluation, and real‑world impact.

AI AgentAlibaba CloudLLM

0 likes · 20 min read

How Alibaba Cloud Built Service‑Domain AI Agents: Design, Practice, and Results

DataFunSummit

Jul 16, 2024 · Artificial Intelligence

Knowledge Graph Construction, Reasoning, and QA for Intelligent Hypertension Diagnosis

This article presents a comprehensive exploration of knowledge‑graph‑based modeling, neural‑symbolic multi‑hop reasoning, and large‑model‑driven question answering applied to precise medication decision‑making in hypertension, detailing system architecture, experimental evaluations, real‑world deployments, and future research directions.

Large Language Modelhypertensionknowledge graph

0 likes · 26 min read

Knowledge Graph Construction, Reasoning, and QA for Intelligent Hypertension Diagnosis

NewBeeNLP

Jul 16, 2024 · Artificial Intelligence

Can Item Language Models Bridge LLMs and Collaborative Filtering for Conversational Recommendation?

This paper identifies three challenges of applying large language models to recommendation systems and proposes an Item Language Model that combines an item encoder with a frozen LLM, demonstrating through extensive experiments that language‑item alignment and interaction knowledge significantly improve conversational recommendation performance.

Large Language ModelQ-Formercollaborative filtering

0 likes · 10 min read

Can Item Language Models Bridge LLMs and Collaborative Filtering for Conversational Recommendation?

Baidu Geek Talk

Jul 15, 2024 · Industry Insights

How AI Is Revolutionizing Physical Network Fault Localization

This article explains how Baidu Cloud evolved from manual and integrated network fault detection to AI-driven localization using large language models, detailing structured prompting, multi‑agent workflows, and real‑world comparisons that demonstrate improved accuracy and faster mitigation.

AIFault LocalizationLarge Language Model

0 likes · 14 min read

How AI Is Revolutionizing Physical Network Fault Localization

Baidu Intelligent Cloud Tech Hub

Jul 10, 2024 · Artificial Intelligence

How AI Transforms Physical Network Fault Localization: From Manual to LLM‑Powered Precision

This article explains how Baidu Cloud evolved its physical network fault‑location workflow—from manual analysis and integrated multi‑signal algorithms to AI‑driven reasoning with large language models—highlighting structured prompting, multi‑agent collaboration, and measurable improvements in accuracy and automation.

AIFault LocalizationLarge Language Model

0 likes · 15 min read

How AI Transforms Physical Network Fault Localization: From Manual to LLM‑Powered Precision

Baidu Tech Salon

Jul 9, 2024 · Artificial Intelligence

AI-Powered Job Matching Application Using ERNIE SDK

The AI‑powered job‑matching application built with Baidu’s ERNIE SDK, created by PaddlePaddle expert Gao Fuzhi, intelligently parses a candidate’s resume, matches them to suitable positions, supplies detailed salary, location and benefit data, analyzes job requirements, and offers personalized skill and interview guidance, aiming to improve recruitment efficiency for both seekers and employers.

AIERNIE SDKLarge Language Model

0 likes · 8 min read

AI-Powered Job Matching Application Using ERNIE SDK

Alibaba Cloud Native

Jul 9, 2024 · Artificial Intelligence

Inside Alibaba Cloud’s Tongyi Lingma: How Its Code Model Earned the Top 4+ Rating

Alibaba Cloud’s Tongyi Lingma code model achieved the highest 4+ rating in the trusted AI code‑model evaluation, and in an interview its product lead explains the model’s capabilities, the rigorous assessment process, real‑world enterprise benefits, and future development plans.

AI code modelAlibaba CloudLarge Language Model

0 likes · 8 min read

Inside Alibaba Cloud’s Tongyi Lingma: How Its Code Model Earned the Top 4+ Rating

Architect's Alchemy Furnace

Jul 6, 2024 · Artificial Intelligence

ChatGLM Evolution: Deep Dive into GLM Architecture, Pretraining, and ChatGLM‑4

This article provides a comprehensive technical overview of the ChatGLM series—from the original ChatGLM‑6B model and its GLM‑based pre‑training framework to the enhancements in ChatGLM‑2, the architectural parity of ChatGLM‑3, and the advanced capabilities of the latest ChatGLM‑4, covering model structure, position encoding, attention mechanisms, multi‑task pretraining, and tool integration.

AIChatGLMGLM

0 likes · 25 min read

ChatGLM Evolution: Deep Dive into GLM Architecture, Pretraining, and ChatGLM‑4

Baidu Geek Talk

Jul 3, 2024 · Databases

How Vector Databases Power AI‑Driven Retrieval: Inside Baidu’s VectorDB

This article reviews the evolution of databases and large models, explains vector database fundamentals and RAG pipelines, and details Baidu's VectorDB architecture, performance advantages, and its role in AI‑enhanced database operations.

AI integrationDatabase operationsLarge Language Model

0 likes · 15 min read

How Vector Databases Power AI‑Driven Retrieval: Inside Baidu’s VectorDB

ByteDance SYS Tech

Jun 30, 2024 · Operations

How Large‑Model AI Is Transforming Intelligent Operations (AIOps)

This article explores the latest concepts, planning roadmap, and practical applications of large‑model AI in intelligent operations, detailing AIOps use cases, system‑level automation, multi‑agent architectures, and how a dedicated platform accelerates deployment and efficiency across data‑center environments.

AI AgentsAIOpsIntelligent Operations

0 likes · 18 min read

How Large‑Model AI Is Transforming Intelligent Operations (AIOps)

JD Cloud Developers

Jun 25, 2024 · Artificial Intelligence

Why Do Large Language Models Output Text Word‑by‑Word? Inside the Transformer Mechanics

This article explains the fundamental architecture of large language models, from the dual file nature of parameters and code, through neural network basics, perceptrons, and weight training, to the Transformer’s tokenization, positional encoding, self‑attention, and inference processes, illustrated with diagrams and examples.

Large Language ModelNeural NetworkSelf-Attention

0 likes · 22 min read

Why Do Large Language Models Output Text Word‑by‑Word? Inside the Transformer Mechanics

DataFunTalk

Jun 23, 2024 · Artificial Intelligence

OpenKG Seminar – Knowledge Graphs + Large Language Models Empowering General AI (Session 3)

On June 25, 2024, OpenKG hosted a hybrid academic salon at Alibaba Cloud Valley, featuring expert talks on knowledge graphs, large language models, and their joint impact on AI, with presentations from leading researchers and industry professionals across multiple sessions.

AILarge Language ModelMachine Learning

0 likes · 9 min read

OpenKG Seminar – Knowledge Graphs + Large Language Models Empowering General AI (Session 3)

JD Tech Talk

Jun 21, 2024 · Artificial Intelligence

Multilingual Support System Using Large Language Models: Architecture, Workflow, and Implementation Plan

This document outlines a comprehensive plan to enhance international logistics systems with real‑time multilingual support using large language models, detailing goals, architecture, automated translation, user‑driven term management, approval workflows, cloud deployment, and expected efficiency and quality improvements.

Large Language Modelmultilingualterm management

0 likes · 14 min read

Multilingual Support System Using Large Language Models: Architecture, Workflow, and Implementation Plan

Architecture Digest

Jun 21, 2024 · Artificial Intelligence

Getting Started with Spring Cloud Alibaba AI: Integrating Tongyi Large Models in Spring Boot

This article introduces Spring Cloud Alibaba AI, explains its relationship to Spring AI, and provides a step‑by‑step tutorial—including Maven setup, dependency configuration, code examples, and sample calls—to integrate Alibaba's Tongyi large‑model services for text QA, image generation, and speech synthesis in a Java Spring Boot application.

AI integrationAlibaba CloudJava

0 likes · 11 min read

Getting Started with Spring Cloud Alibaba AI: Integrating Tongyi Large Models in Spring Boot

AntTech

Jun 20, 2024 · Artificial Intelligence

Predicting Football Match Outcomes with Graph Neural Networks and Large Language Models: The “Smart Guess Football” Project

During the 2024 European Championship, TuGraph engineers built an interactive system called “Smart Guess Football” that combines graph computing, graph neural networks, transformers and large language models to model player relationships and predict match outcomes, achieving up to 71% accuracy on limited test matches.

AIGraph Neural NetworkLarge Language Model

0 likes · 7 min read

Predicting Football Match Outcomes with Graph Neural Networks and Large Language Models: The “Smart Guess Football” Project

NewBeeNLP

Jun 18, 2024 · Artificial Intelligence

How Shopee Builds an E‑Commerce Knowledge Graph and Leverages Large Models

This article presents Shopee's comprehensive approach to constructing an e‑commerce knowledge graph, detailing the challenges of heterogeneous data, multi‑language handling, entity disambiguation, and the integration of deep learning and large language models to improve product matching, recommendation, and operational efficiency.

AILarge Language Modele-commerce

0 likes · 22 min read

How Shopee Builds an E‑Commerce Knowledge Graph and Leverages Large Models

Bilibili Tech

Jun 14, 2024 · Artificial Intelligence

Technical Report on the Index-1.9B Series: Model Variants, Pre‑training Optimizations, and Alignment Experiments

The report presents the open‑source Index‑1.9B family—base, pure, chat, and character variants—detailing benchmark results, pre‑training optimizations such as a normalized LM‑Head and deeper‑slim architectures, the importance of modest instruction data, alignment via SFT/DPO, role‑play enhancements with RAG, and acknowledges remaining safety and factual limitations.

EvaluationInstruction TuningLLM

0 likes · 15 min read

Technical Report on the Index-1.9B Series: Model Variants, Pre‑training Optimizations, and Alignment Experiments

JD Tech Talk

Jun 6, 2024 · Artificial Intelligence

AI‑Powered Code Review Integrated into CI Pipelines for Faster, Higher‑Quality Development

This article analyses the drawbacks of manual code review, explains why they arise, and presents a practical solution that embeds a large‑language‑model‑based AI reviewer into a CI/CD pipeline, detailing configuration steps, script examples, and the resulting efficiency and quality gains.

AI code reviewCI/CDLarge Language Model

0 likes · 8 min read

AI‑Powered Code Review Integrated into CI Pipelines for Faster, Higher‑Quality Development

JD Cloud Developers

Jun 6, 2024 · Artificial Intelligence

Boost Code Review Efficiency with AI-Powered CI Integration

This guide explains how embedding a large‑language‑model AI into a CI pipeline can automate code reviews, cut review time, improve consistency and accuracy, and ultimately raise development efficiency and code quality while reducing manual effort and communication overhead.

AICI/CDJava

0 likes · 9 min read

Boost Code Review Efficiency with AI-Powered CI Integration

21CTO

Jun 2, 2024 · Artificial Intelligence

How Codestral Redefines AI‑Powered Code Generation: Features, Benchmarks, and Real‑World Use

The article introduces Codestral, Mistral AI's new code‑generation model, detailing its multilingual training, benchmark superiority, integration options, practical use cases, and current limitations, offering developers a comprehensive view of this AI‑driven coding assistant.

AI code generationBenchmarkCodestral

0 likes · 11 min read

How Codestral Redefines AI‑Powered Code Generation: Features, Benchmarks, and Real‑World Use

Baidu Tech Salon

May 30, 2024 · Artificial Intelligence

How AI Code Assistant Baidu Comate Boosted Medical Imaging Processing by 9×

A graduate student’s lab cut the time to process 150 GB of medical imaging data from one week for three people to two days for one person by using Baidu Comate’s AI‑driven code generation, annotation, and private‑knowledge enhancement features, achieving over nine‑fold productivity gains.

AI code assistantBaidu ComateLarge Language Model

0 likes · 8 min read

How AI Code Assistant Baidu Comate Boosted Medical Imaging Processing by 9×

Baidu Geek Talk

May 29, 2024 · Artificial Intelligence

How Baidu’s AI Code Assistant Boosted R&D Efficiency by Over 11% in Marketing Platforms

The article analyzes how Baidu's marketing service team leveraged the Wenxin large model and the Baidu Comate AI code assistant to accelerate product reconstruction, achieve AI‑native development, and quantify a daily engineering productivity gain of roughly 11.2% through reduced coding time and automated deployment workflows.

AI code assistantAI-native developmentBaidu Comate

0 likes · 13 min read

How Baidu’s AI Code Assistant Boosted R&D Efficiency by Over 11% in Marketing Platforms

21CTO

May 28, 2024 · Artificial Intelligence

When Google’s AI Overview Hallucinates: Surprising Misanswers and What They Reveal

Google’s AI Overview, unveiled at I/O 2024, replaces traditional search results with AI‑generated summaries, but real‑world usage shows bizarre hallucinations—from claiming the internet is 100% true to recommending eating stones—highlighting the lingering challenges of large language models.

AI OverviewAI hallucinationGoogle AI

0 likes · 7 min read

When Google’s AI Overview Hallucinates: Surprising Misanswers and What They Reveal

JD Retail Technology

May 27, 2024 · Artificial Intelligence

Automating Test Case Generation with Large Language Models and LangChain

This article describes how large language models and the LangChain framework can be combined with PDF parsing, text chunking, memory management, and a vector database to automatically generate software test cases, achieving significant efficiency gains while outlining implementation details, results, and future challenges.

AILangChainLarge Language Model

0 likes · 10 min read

Automating Test Case Generation with Large Language Models and LangChain

21CTO

May 23, 2024 · Artificial Intelligence

How xAI’s Grok 1.5V Adds Multimodal Image Input for Developers

xAI’s Grok 1.5V is set to support multimodal image input, allowing developers to upload pictures and receive text‑based answers via the Python SDK, marking a major upgrade that narrows the gap with leading models like GPT‑4 and signals a new frontier for AI chatbots.

AI chatbotsGrokLarge Language Model

0 likes · 4 min read

How xAI’s Grok 1.5V Adds Multimodal Image Input for Developers

Baidu Tech Salon

May 22, 2024 · Industry Insights

How Baidu’s AI‑Powered Code Assistant Boosts R&D Efficiency by Over 11 %

The article examines Baidu Marketing Service's AI‑native transformation using the Wenxin large model and Baidu Comate, detailing how real‑time code recommendations, open‑platform integration, and generative AI dramatically improve developer productivity, reduce coding time, and increase marketing ROI.

AIAI-native developmentBaidu Comate

0 likes · 11 min read

How Baidu’s AI‑Powered Code Assistant Boosts R&D Efficiency by Over 11 %