Tagged articles
649 articles
Wuming AI
Nov 30, 2025 · Artificial Intelligence

What Exactly Is a Large Language Model? A Simple Guide to AI, Transformers, and How They Work

This article explains the relationship between AI, machine learning, deep learning, and large language models, detailing their evolution, training stages, transformer architecture, attention mechanisms, inference APIs, and practical usage examples, while demystifying common misconceptions about LLM capabilities.

AI fundamentals · Deep Learning · RLHF
0 likes · 10 min read
Kuaishou Tech
Nov 28, 2025 · Artificial Intelligence

Keye-VL-671B-A37B Leads Vision, Video, and Math Benchmarks

Kwai has open‑sourced its new flagship multimodal model Keye‑VL‑671B‑A37B, which upgrades visual perception, cross‑modal alignment and complex reasoning, achieving top scores on image, video, and mathematical reasoning benchmarks while detailing its architecture, three‑stage pre‑training, post‑training strategies, and future multimodal agent plans.

Deep Learning · Multimodal · large language model
0 likes · 10 min read
AsiaInfo Technology: New Tech Exploration
Nov 28, 2025 · Artificial Intelligence

Boosting 5G Complaint Intent Detection with Large-Model-Enhanced Few-Shot Learning

This paper presents a collaborative framework where a large language model generates high‑quality synthetic samples to augment a lightweight model, dramatically improving few‑shot user‑complaint intent recognition in 5G networks, achieving a 21% boost for rare categories and a 9% overall accuracy gain.

Few‑Shot Learning · complaint intent detection · data augmentation
0 likes · 27 min read
Alibaba Cloud Developer
Nov 27, 2025 · Artificial Intelligence

How AI Powers Ethnic Product Categorization for Global E‑Commerce

This article presents an end‑to‑end AI solution that builds a cultural knowledge base and leverages large language models to automatically identify and match ethnic‑specific product categories on a cross‑border e‑commerce platform, reducing mis‑matches from 8.4% to 1.8% and cutting iteration time from days to under one day.

Knowledge Base · ai · ethnic categorization
0 likes · 19 min read
DataFunSummit
Nov 20, 2025 · Artificial Intelligence

How 1688 Reinvented E‑commerce Search with AI‑Powered Generative Retrieval

This article details how Alibaba’s 1688 platform shifted from traditional e‑commerce search to AI‑driven generative retrieval, covering the AI Deep Search 1.0 and 2.0 cascaded frameworks, multimodal capabilities, an end‑to‑end “model‑as‑search‑engine” approach, experimental results, challenges, and future directions.

E-commerce Search · Generative Retrieval · ai
0 likes · 18 min read
HyperAI Super Neural
Nov 20, 2025 · Artificial Intelligence

From 9,874 Papers to 15,000 Structures: MOF‑ChemUnity Rebuilds MOF Knowledge for Explainable AI

MOF‑ChemUnity constructs a scalable, extensible knowledge graph that links millions of MOF names and synonyms to over 15,000 crystal structures using LLM‑driven entity matching, enabling accurate, explainable AI‑assisted material discovery, water‑stability prediction, expert recommendation validation, and graph‑enhanced retrieval across diverse applications.

Graph RAG · MOF · Materials Discovery
0 likes · 17 min read
Alibaba Cloud Developer
Nov 19, 2025 · Artificial Intelligence

Building an AI-Powered Proofreading Agent for Media: Architecture, Prompt Engineering, and Evaluation

This article details a practical case study of designing, implementing, and evaluating an AI-driven proofreading agent for a media client, covering background challenges, a three‑layer architecture, prompt engineering techniques, RAG knowledge‑base construction, model selection, fine‑tuning, automated metrics, and lessons learned.

Model Evaluation · Proofreading · RAG
0 likes · 26 min read
Wuming AI
Nov 10, 2025 · Artificial Intelligence

What Exactly Is an AI Agent? A Clear, Practical Guide

This article explains the concept of AI agents, contrasting them with chatbots, detailing their capability and structural layers, summarizing academic surveys and whitepapers, and illustrating how agents plan, perceive, and act to autonomously accomplish user‑defined goals.

AI Agent · Agent Architecture · Autonomous Planning
0 likes · 9 min read
JD Tech Talk
Nov 10, 2025 · Artificial Intelligence

Designing an AI-Powered Experiment Analysis Agent: Architecture, Workflow, and Future Enhancements

This article outlines the motivation, design, architecture, engineering implementation, large‑model selection, and future improvement plans for an AI‑driven experiment analysis agent that integrates data aggregation, modular workflow orchestration, and interactive frontend features to streamline A/B‑test insights.

AI Agent · Model Selection · experiment analysis
0 likes · 14 min read
Efficient Ops
Nov 9, 2025 · Operations

How Tencent’s PCG Achieves Full‑Link Observability and AI‑Powered SRE

The talk details Tencent PCG’s end‑to‑end observability platform, its data‑standardization pipeline, client‑backend session linking, AI‑enhanced SRE Agent with large language models, and the roadmap toward a SaaS offering, illustrating how modern operations integrate AI for rapid fault localization.

SRE · ai · full‑link
0 likes · 17 min read
Bighead's Algorithm Notes
Nov 7, 2025 · Artificial Intelligence

Weekly AI Finance Paper Digest (Nov 1‑7 2025)

This digest summarizes three recent AI‑driven finance papers—DeltaLag’s dynamic lead‑lag detection, MS‑HGFN’s multi‑scale graph network for stock movement, and LiveTradeBench’s real‑time LLM trading benchmark—highlighting their methods, datasets, and performance gains.

Financial AI · Graph Neural Network · Stock Prediction
0 likes · 8 min read
Tencent Advertising Technology
Nov 6, 2025 · Artificial Intelligence

Boosting Web UI Test Efficiency with AIGC: From Manual Scripts to Intelligent Automation

This report examines the challenges of Web UI testing in Tencent's advertising platform, analyzes current inefficiencies, and presents an AIGC-driven solution that leverages large language models, semantic scripts, and automated pipelines to dramatically improve test case generation, execution accuracy, and CI/CD integration.

AIGC · Web UI testing · automation
0 likes · 27 min read
Amap Tech
Nov 4, 2025 · Artificial Intelligence

Spacetime‑GR: AI‑Powered Spatiotemporal Model Transforming POI Recommendations

This article introduces Spacetime‑GR, a large‑scale generative recommendation model that integrates hierarchical geographic POI indexing and spatiotemporal token encoding to enhance POI prediction for Amap, detailing its pre‑training pipeline, data cleaning, curriculum learning strategy, experimental results, scaling law observations, and the resulting improvements in hit rate and discovery rate.

Amap · POI recommendation · curriculum learning
0 likes · 14 min read
21CTO
Nov 4, 2025 · Artificial Intelligence

LongCat-Flash-Omni: How an Open-Source 560B Model Achieves Real-Time Multimodal Mastery

LongCat-Flash-Omni, an open‑source 560 billion‑parameter multimodal model, combines efficient Shortcut‑Connected MoE architecture with advanced perception and speech modules to deliver low‑latency real‑time audio‑video interaction and state‑of‑the‑art performance across text, image, video, and audio tasks.

Efficient Inference · Multimodal AI · audio-visual processing
0 likes · 10 min read
Meituan Technology Team
Nov 3, 2025 · Artificial Intelligence

LongCat-Flash-Omni: 560B Open‑Source Multimodal Model with Real‑Time Interaction

LongCat-Flash-Omni, the latest open‑source model from Meituan, combines a 560 billion‑parameter architecture, efficient multimodal perception and speech reconstruction modules, and a progressive training strategy to deliver real‑time audio‑video interaction and state‑of‑the‑art performance across text, image, audio, and video tasks.

Multimodal · ai · benchmark
0 likes · 9 min read
Huawei Cloud Developer Alliance
Nov 3, 2025 · Artificial Intelligence

How AI Agents Are Revolutionizing Technology: The New Engine of Innovation

This article explores the rise of AI agents—from their definition as intelligent digital assistants powered by large language models to their evolution through planning, memory, and tool use—highlighting real‑world applications, core technical mechanisms, code implementations, and future trends such as autonomy, multimodal fusion, standardization, and safety considerations.

AI Agent · Autonomous AI · Multimodal
0 likes · 24 min read
Data Party THU
Nov 2, 2025 · Artificial Intelligence

From RNN to LLM: How Transformers Power Modern Language Models

This article explains the evolution from RNNs through Encoder‑Decoder models to Transformers, detailing self‑attention, multi‑head attention, and masked attention, and then describes what Large Language Models are, their key components, capabilities, limitations, and common applications.

Deep Learning · LLM · Transformer
0 likes · 9 min read
DataFunSummit
Oct 31, 2025 · Artificial Intelligence

How OPPO’s AndesVL Is Revolutionizing On‑Device Multimodal AI

OPPO AI Center introduces AndesVL, an open‑source, fully‑adapted multimodal large model ranging from 0.6B to 4B parameters, designed for high‑performance, privacy‑preserving, low‑latency AI on mobile devices, with advanced architecture, training pipelines, on‑device optimizations, and state‑of‑the‑art benchmark results.

Mobile AI · large language model · model compression
0 likes · 21 min read
DataFunSummit
Oct 30, 2025 · Artificial Intelligence

How Multimodal Large Models Are Revolutionizing Document Processing and OCR

This article explores how the explosion of unstructured data exposes the limits of traditional OCR and shows how emerging multimodal large language models provide end‑to‑end document understanding, reduce pipeline complexity, cut training costs, enable hybrid retrieval‑augmented generation, and drive real‑world industry deployments.

Document Processing · Multimodal · OCR
0 likes · 28 min read
DataFunSummit
Oct 30, 2025 · Artificial Intelligence

Bilibili’s AI Assistant: Using Large Language Models to Tackle Big Data Ops

This article explains how Bilibili’s massive video platform built a five‑layer, storage‑compute separated big‑data infrastructure and employed a large language model‑driven intelligent assistant to automatically diagnose and resolve frequent offline task failures and slowdowns, addressing common user queries about task reliability and performance.

Intelligent Assistant · big data platform · large language model
0 likes · 4 min read
Zhuanzhuan Tech
Oct 29, 2025 · Artificial Intelligence

How Reinforcement Learning Boosts Stability and Speed in LLM QA Systems

This article examines how reinforcement‑learning techniques such as PPO, DPO, and GRPO are integrated into the Baixiaosheng QA system to improve answer stability, deepen domain knowledge understanding, and accelerate response generation, and it evaluates the impact of Reinforcement Fine‑Tuning (RFT) on real‑world performance.

DPO · GRPO · PPO
0 likes · 16 min read
AntTech
Oct 29, 2025 · Artificial Intelligence

Inside Ant’s Baoling: Balancing Efficiency and Reasoning in a 1‑Trillion‑Parameter Model

At the Ant Star Innovation Journey event, the Baoling team unveiled their roadmap for trillion‑parameter models, detailing the development of Ling‑1T, Ring‑1T and multimodal Ming series, the scaling‑law‑guided architecture, training innovations, evaluation methods, and open‑source releases that aim to advance efficient, high‑performance AI.

Efficient Inference · large language model · scaling law
0 likes · 24 min read
AntTech
Oct 28, 2025 · Artificial Intelligence

Ming-Flash-Omni-Preview: 103B Open-Source Multimodal Model Excelling in Image, Video, and Speech

Introducing Ming‑Flash‑Omni‑Preview, a 103‑billion‑parameter open‑source multimodal model built on a sparse MoE architecture that delivers state‑of‑the‑art performance in controllable image generation, streaming video understanding, and context‑aware speech recognition, surpassing prior models on GenEval and GEdit benchmarks.

Image Generation · Multimodal · large language model
0 likes · 8 min read
DataFunTalk
Oct 28, 2025 · Artificial Intelligence

How Bilibili Uses Large Language Models to Solve Big Data Platform Issues

This article explains Bilibili's massive data platform architecture, the common offline‑task failures and slowdowns users encounter, and how the company applies a large‑language‑model‑driven intelligent assistant to diagnose and resolve these engineering problems efficiently.

AI assistance · Bilibili · big data platform
0 likes · 4 min read
Amap Tech
Oct 27, 2025 · Artificial Intelligence

Turning Maps into a Living Map: Amap’s G-Where Generative AI Recommendation

Amap upgrades its homepage recommendation by integrating large‑model capabilities—G‑Where, G‑Action, and G‑Plan—through semantic ID generation, item tokenization, and multi‑stage LLM training, achieving significant offline and online performance gains while illustrating a scalable generative recommendation framework.

Generative Recommendation · Map Services · ai
0 likes · 21 min read
DataFunTalk
Oct 23, 2025 · Artificial Intelligence

How Tencent Leverages RAG and Agents to Supercharge Large Language Models

This article examines Tencent's large language model deployments across diverse business scenarios, detailing how Retrieval‑Augmented Generation, Supervised Fine‑Tuning, and autonomous agents boost model intelligence, reduce hallucinations, and enable sophisticated content creation, understanding, and interactive applications.

AI agents · RAG · Tencent
0 likes · 4 min read
Wu Shixiong's Large Model Academy
Oct 23, 2025 · Artificial Intelligence

Why the Transformer Core Structure Is the Key to AI Interview Success

This article explains the fundamental purpose, architecture, and variants of the Transformer model—including Encoder‑Decoder, Encoder‑only, and Decoder‑only designs—while detailing how attention mechanisms work and why modern large‑language models favor the Decoder‑only approach, providing a concise framework for answering interview questions.

AI Interview · Encoder-Decoder · Self-Attention
0 likes · 10 min read
Data Party THU
Oct 22, 2025 · Artificial Intelligence

Demystifying Large‑Model Reinforcement Learning: From MDP Basics to Bellman and Advantage Functions

This article provides a comprehensive introduction to reinforcement learning for large language models, covering the Markov Decision Process formulation, the four core elements of RL, state‑value and action‑value functions, Bellman equations, and the advantage function that underpins modern policy‑gradient algorithms.

AI fundamentals · Bellman equation · MDP
0 likes · 13 min read
IT Services Circle
Oct 20, 2025 · Artificial Intelligence

How NanoChat Lets Anyone Train a ChatGPT‑Like Model for $100

NanoChat, an open‑source full‑stack AI model solution created by Andrej Karpathy, enables users to train a functional chat model on a modest $100 cloud GPU rental, offering a low‑cost, hands‑on alternative to proprietary large‑language‑model services.

AI training · cost-effective · large language model
0 likes · 4 min read
AI2ML AI to Machine Learning
Oct 15, 2025 · Artificial Intelligence

NanoChat Source Code Deep Dive: Karpathy’s Full‑Stack LLM Pipeline Explained

This article dissects NanoChat’s end‑to‑end LLM pipeline—from a lightweight 561M‑parameter transformer and custom Rust BPE tokenizer to Chinchilla‑scaled training, multi‑task fine‑tuning, optional RL on GSM8K, KV‑cache inference optimizations, and benchmark results that slightly surpass GPT‑2 Large.

CORE benchmark · Chinchilla scaling · FastAPI
0 likes · 10 min read
AntTech
Oct 14, 2025 · Artificial Intelligence

How Ring-1T Achieves Trillion-Scale Deep Thinking and Competitive Benchmarks

The Ring-1T model, a trillion-parameter AI system released as open source, leverages advanced reinforcement learning techniques, extensive benchmark evaluations, and custom training frameworks to deliver balanced performance across math, code, reasoning, and creative tasks while highlighting current limitations and future development plans.

AI model · Reinforcement Learning · benchmark evaluation
0 likes · 8 min read
AI2ML AI to Machine Learning
Oct 13, 2025 · Artificial Intelligence

How Large‑and‑Small Language Model Collaboration Is Shaping the Future

The article argues that combining large, high‑capacity models with lightweight, fine‑tuned small models can cut costs, lower latency, enable specialized vertical tasks, and shift development from chasing ever‑bigger models toward optimal system architectures, outlining key techniques such as state‑space models, knowledge distillation, and staged fine‑tuning.

AI Architecture · Fine-tuning · efficiency
0 likes · 3 min read
DataFunTalk
Oct 13, 2025 · Artificial Intelligence

How Tencent Uses RAG, GraphRAG, and Agents to Power Large Language Model Applications

This article examines Tencent's large language model deployments across diverse business scenarios, detailing core use cases such as content generation, intelligent customer service, and role‑playing, while explaining the underlying technologies of Supervised Fine‑Tuning, Retrieval‑Augmented Generation, and Agent systems.

AI applications · RAG · Supervised Fine‑Tuning
0 likes · 4 min read
Bighead's Algorithm Notes
Oct 11, 2025 · Artificial Intelligence

Recent Advances in Multivariate Time Series Forecasting: Paper Summaries (Sep 27 – Oct 10 2025)

This article summarizes eight newly released AI papers on multivariate time‑series forecasting and anomaly detection, detailing each work's motivation, proposed methodology, key innovations such as CRIB, TS‑JEPA, DSAT‑HD, DIMIGNN, ASTGI, IndexNet, TsLLM, Moon, TimeSeriesScientist, MLG‑4TS, and Augur, and reports their experimental validation on real‑world datasets.

Deep Learning · Transformer · anomaly detection
0 likes · 23 min read
DataFunSummit
Oct 10, 2025 · Artificial Intelligence

How Ping An Life Built ChatBI: An AI‑Powered Intelligent BI Platform

This article details Ping An Life's self‑developed large‑model reporting product ChatBI, covering its background, goals, solution architecture, technical stack, real‑world use cases, deployment challenges, and future outlook, offering practical insights for enterprises adopting AI‑driven business intelligence.

Business Intelligence · Chatbot · Data Platform
0 likes · 17 min read
DataFunTalk
Oct 9, 2025 · Artificial Intelligence

How Bilibili Uses Large Language Models to Solve Big Data Task Failures

This article explains Bilibili's massive data platform architecture, the common reasons offline tasks fail or slow down, and how the company is exploring large‑language‑model‑driven assistants to automatically diagnose and resolve these engineering issues.

AI assistance · Bilibili · big data platform
0 likes · 4 min read
HyperAI Super Neural
Oct 8, 2025 · Artificial Intelligence

From WeChat’s AI Podcast Trial to Google, ByteDance and Xiaohongshu: Can AI Podcasts Capture the Emerging AIGC Blue Ocean?

The article examines how breakthroughs in large language models and high‑fidelity TTS are powering AI‑generated podcasts, analyzes the technical advances behind the "human‑like" sound, surveys major players such as Google, ByteDance, Xiaohongshu and startups, and evaluates the market potential of this rapidly expanding AIGC niche.

AI podcast · AIGC · ByteDance
0 likes · 9 min read
DataFunSummit
Oct 7, 2025 · Artificial Intelligence

Bilibili’s AI‑Powered Assistant: Solving Big Data Task Failures with LLMs

This article details Bilibili's implementation of a large‑language‑model‑driven intelligent assistant that helps engineers diagnose and resolve massive offline and real‑time data‑processing failures, describing the platform’s five‑layer architecture, common failure and slowdown causes, and the need for AI‑powered troubleshooting support.

Bilibili · Intelligent Assistant · big data platform
0 likes · 4 min read
DataFunSummit
Oct 6, 2025 · Artificial Intelligence

How Bilibili Leverages Large Language Models to Solve Big Data Platform Failures

This article explains Bilibili's massive video platform data architecture, the huge daily workload of offline and real‑time tasks, common user problems like task failures and slowdowns, their root causes, and how a large language model assistant is being used to automate troubleshooting.

AI assistance · Bilibili · large language model
0 likes · 4 min read
Fun with Large Models
Sep 30, 2025 · Artificial Intelligence

DeepSeek-V3.2 Architecture Breakthrough: A 5‑Minute Guide to Its Core Features

The article introduces DeepSeek-V3.2, highlighting its new DeepSeek Sparse Attention (DSA) that boosts training and inference efficiency by up to 50%, cuts model usage costs dramatically, explains the updated API endpoints, and details the four‑stage post‑training pipeline that underpins the model’s performance improvements.

AI Architecture · DSA · DeepSeek-V3.2
0 likes · 8 min read
HyperAI Super Neural
Sep 30, 2025 · Artificial Intelligence

SpikingBrain-1.0 Achieves 100× Faster Inference with Brain‑Inspired Spiking Architecture

SpikingBrain-1.0, the first domestically‑produced brain‑inspired spiking large model, links spiking neuron dynamics to linear attention, delivering over 100× faster first‑token latency on 4‑million‑token sequences, 23.4% FLOP utilization, 69% sparsity, and a one‑click deployment tutorial on HyperAI.

SpikingBrain-1.0 · brain-inspired AI · inference speedup
0 likes · 7 min read
Alipay Experience Technology
Sep 29, 2025 · Artificial Intelligence

How a Constraint-Aware Multi-Agent AI Won the IJCAI‑2025 Travel Planning Challenge

Alipay’s AI research team, together with Ant Group and East China Normal University, leveraged a self‑developed large‑model‑plus‑optimization framework to create a constraint‑aware multi‑agent system that won both the Original OS Track and DSL Track at the IJCAI‑2025 Autonomous Travel Itinerary Planning Competition.

Multi-Agent · Travel Planning · ai
0 likes · 8 min read
DataFunTalk
Sep 28, 2025 · Artificial Intelligence

How Bilibili Leverages Large Language Models to Automate Big Data Operations

This article explores Bilibili’s implementation of a large‑language‑model‑driven intelligent assistant that helps troubleshoot massive offline and real‑time data processing tasks, detailing the platform’s five‑layer architecture, common failure causes, and how AI can streamline issue resolution.

AI Operations · Intelligent Assistant · big data platform
0 likes · 4 min read
DataFunTalk
Sep 27, 2025 · Artificial Intelligence

Bilibili’s AI Assistant: Using Large Language Models to Tackle Massive Data Tasks

This article explains how Bilibili leverages a large‑language‑model‑based intelligent agent to diagnose and resolve failures and slowdowns in its massive big‑data platform, detailing the platform architecture, workload scale, common user issues, and the need for automated assistance.

AI Operations · Bilibili · Intelligent Assistant
0 likes · 5 min read
Data Party THU
Sep 26, 2025 · Artificial Intelligence

How Keye‑VL‑1.5 Redefines Video Understanding with Slow‑Fast Encoding

Keye‑VL‑1.5, an 8‑billion‑parameter multimodal large language model, introduces a Slow‑Fast video encoding strategy, a four‑stage progressive pre‑training pipeline with 128K context, and a sophisticated post‑training regime that together achieve state‑of‑the‑art performance on video and vision‑language benchmarks while maintaining strong general capabilities.

benchmark · large language model · multimodal LLM
0 likes · 21 min read
Alibaba Cloud Big Data AI Platform
Sep 25, 2025 · Artificial Intelligence

Unlocking Trillion‑Parameter MoE Models: Expert Parallelism and Alibaba Cloud PAI‑EAS Deployment Guide

This article explains the opportunities and challenges of Mixture of Experts (MoE) models, introduces expert parallelism as a solution to scaling and deployment bottlenecks, and provides a step‑by‑step guide for deploying MoE models with Alibaba Cloud PAI‑EAS, including configuration tips and code examples.

AI Model Deployment · Expert Parallelism · MoE
0 likes · 11 min read
DataFunSummit
Sep 25, 2025 · Artificial Intelligence

Can NL→MQL→SQL Bridge the Gap to End‑to‑End Intelligent BI?

Aloudata Agent introduces a novel NL→MQL→SQL framework that combines large language models with a custom metric query language, enabling business users to perform end‑to‑end intelligent data analysis, attribution, and reporting without technical expertise, while balancing accuracy, cost, and performance.

Intelligent BI · Metric Query Language · NL2SQL
0 likes · 18 min read
Fighter's World
Sep 24, 2025 · Artificial Intelligence

Aivis: Pioneering Autonomous Agents for Alibaba Cloud’s Next‑Gen Intelligent Services

The talk outlines how Alibaba Cloud’s Aivis autonomous service agent tackles the “impossible triangle” of ultra‑high experience, low cost, and complex services by evolving from tool‑based chatbots to teammate‑level agents, detailing a four‑layer architecture, domain‑model training, and actionable steps for enterprise AI service transformation.

AI Agent · Agent Architecture · Cloud Service
0 likes · 14 min read
AIWalker
Sep 23, 2025 · Artificial Intelligence

Manzano: A Small 3B Multimodal Model That Unifies Image Understanding and Generation with SOTA Performance

Manzano introduces a hybrid vision tokenizer and a three‑stage training recipe that let a 3‑billion‑parameter multimodal LLM achieve state‑of‑the‑art results on both image‑understanding benchmarks and text‑to‑image generation, while scaling smoothly to larger sizes and minimizing task conflict.

AI research · Manzano · Multimodal
0 likes · 25 min read
Meituan Technology Team
Sep 22, 2025 · Artificial Intelligence

LongCat-Flash-Thinking: The New SOTA Open-Source LLM for Deep Reasoning and Tool Use

Meituan’s LongCat team unveiled LongCat-Flash-Thinking, an open‑source large language model that combines deep logical reasoning with tool‑calling capabilities, achieving state‑of‑the‑art performance across logic, mathematics, code, and agentic tasks, and introducing novel training frameworks such as domain‑parallel RL and DORA.

Tool Use · ai · benchmark
0 likes · 7 min read
Data Party THU
Sep 20, 2025 · Artificial Intelligence

How DeepSeek Trained a $30M LLM for Just $29.4K – Inside the R1 Model

The article reports that DeepSeek’s R1 large language model, detailed in a peer‑reviewed Nature paper, was built with roughly $300 k in total cost—about $29.4 k for training—using Nvidia H800 chips and novel pure reinforcement‑learning techniques, achieving competitive performance while remaining open‑source.

DeepSeek · Nvidia H800 · Peer Review
0 likes · 9 min read
HyperAI Super Neural
Sep 18, 2025 · Artificial Intelligence

DeepSeek‑R1 Costs $294K to Train, Hits Nature Cover as First Peer‑Reviewed Large Model

DeepSeek‑R1, the first mainstream large language model to pass peer review in Nature, was trained for $294,000 using 648 H800 GPUs, and its RL‑enhanced version, DeepSeek‑R1‑Zero, achieved up to 86.7% pass@1 on AIME 2024, outperforming human averages across math, coding, and science tasks.

AI research · DeepSeek-R1 · Peer Review
0 likes · 10 min read
DataFunTalk
Sep 18, 2025 · Artificial Intelligence

How DeepSeek‑R1’s Reinforcement Learning Earned a Nature Cover

DeepSeek‑R1, the first peer‑reviewed large language model, leveraged a pure reinforcement‑learning framework and the novel GRPO algorithm to achieve breakthrough reasoning performance, low training cost, and widespread acclaim, culminating in a Nature magazine cover story.

AI reasoning · DeepSeek · GRPO
0 likes · 14 min read
DataFunSummit
Sep 17, 2025 · Artificial Intelligence

How Tencent’s Large Language Model Powers Real-World AI Applications

This article explores Tencent’s large language model across diverse business scenarios—content generation, intelligent customer service, role‑playing, and more—detailing the principles and practical uses of Retrieval‑Augmented Generation (RAG), GraphRAG, and Agent technologies, and how they enhance model intelligence and user experience.

RAG · Tencent · agent
0 likes · 4 min read
DataFunSummit
Sep 14, 2025 · Artificial Intelligence

How Tencent’s Large Language Models Boost Business with RAG, GraphRAG, and AI Agents

This article examines Tencent's large language model deployments across various business scenarios, detailing the use of Retrieval‑Augmented Generation, GraphRAG for role‑playing, and Agent technologies, while also outlining core application areas and the three main technical approaches—SFT, RAG, and Agents.

AI agents · AI applications · GraphRAG
0 likes · 4 min read
Data Party THU
Sep 13, 2025 · Artificial Intelligence

How a Multi‑Agent Large Model Transforms Ecological Big‑Data Analysis

This report details a university project that built a flexible, high‑performance multi‑agent large‑model framework for ecological environment big‑data analysis, covering system architecture, individual agents, memory mechanisms, report generation, a FastAPI‑LangGraph backend, a React frontend, testing methodology, and future directions.

Big Data · FastAPI · LangGraph
0 likes · 7 min read
Bighead's Algorithm Notes
Sep 11, 2025 · Artificial Intelligence

Fin-PRM: Alibaba’s Dianjin Team Introduces a Domain-Specific Process Reward Model for Financial Reasoning

Fin‑PRM, a domain‑specific process reward model for financial reasoning introduced by Alibaba’s Dianjin team, employs dual‑level step and trajectory rewards to provide fine‑grained supervision, achieving up to 12.9% accuracy gains in supervised fine‑tuning and 5.1% improvements in Best‑of‑N inference on benchmarks such as CFLUE and FinQA.

CFLUE · Fin-PRM · FinQA
0 likes · 11 min read
DataFunSummit
Sep 10, 2025 · Artificial Intelligence

Claude’s Exit from China: How Domestic AI Models Can Fill the Void

Anthropic’s new policy blocks Chinese‑controlled firms from using Claude and Claude Code, prompting a deep dive into the model’s strengths and exploring fast‑growing domestic AI alternatives—such as Qwen3‑Coder, GLM‑4.5, and others—to understand their capabilities, gaps, and future opportunities for Chinese developers.

Chinese AI · Claude · ai
0 likes · 11 min read
Eric Tech Circle
Sep 10, 2025 · Artificial Intelligence

Deploy High‑Performance Local LLMs with vLLM: A Step‑by‑Step Guide

This article walks through installing and configuring vLLM for local large language model inference, compares it with Ollama and LM Studio, details environment setup, model download, testing scripts, and shows how to expose an OpenAI‑compatible API for production use.

Inference Optimization · ModelScope · OpenAI API
0 likes · 11 min read
Wuming AI
Sep 6, 2025 · Artificial Intelligence

Can Qwen3-Max-Preview Outperform Claude? A Deep Dive into China’s New 1‑T LLM

The article reviews Alibaba's 1‑trillion‑parameter Qwen3‑Max‑Preview model, comparing its benchmark scores, hallucination rate, math and coding accuracy, and SVG generation quality against Claude, Kimi K2, and DeepSeek, while providing usage links and real‑world user impressions.

AI Benchmark · Qwen3 · SVG generation
0 likes · 4 min read
Kuaishou Tech
Sep 5, 2025 · Artificial Intelligence

How Keye‑VL‑1.5‑8B Sets New Benchmarks in Multimodal AI

Kuaishou (Kwai) has open‑sourced the 8‑billion‑parameter multimodal LLM Keye‑VL‑1.5, which introduces slow‑fast video frame encoding, a progressive four‑stage pre‑training pipeline, and an automated data construction workflow, achieving state‑of‑the‑art results on video and vision‑language benchmarks and surpassing many closed‑source models.

Multimodal AI · benchmark performance · large language model
0 likes · 12 min read
Efficient Ops
Sep 2, 2025 · Artificial Intelligence

How AI Is Revolutionizing Knowledge‑Base Building for Smarter Operations

At the 27th GOPS Global Operations Conference in Shanghai (Oct 17‑18, 2025), Professor Wang Peng of Fudan University will reveal how large language models can extract and structure heterogeneous operational data into high‑quality knowledge bases, and how RAG‑driven Q&A enhances fault diagnosis, SOP generation, and automated decision‑making.

Artificial Intelligence · Intelligent Operations · Knowledge Base
0 likes · 3 min read
Baobao Algorithm Notes
Sep 2, 2025 · Artificial Intelligence

How LongCat‑Flash Achieves Record Speed and Efficiency for a 560B MoE Model

LongCat‑Flash is a 560‑billion‑parameter Mixture‑of‑Experts LLM that combines a dynamic zero‑computation expert design, shortcut‑connected MoE communication, variance‑aligned scaling, and a three‑stage agent‑centric pre‑training pipeline, delivering over 100 tokens per second (TPS) on H800 GPUs at a cost of $0.70 per million output tokens.

Artificial Intelligence · Inference Optimization · LongCat-Flash
0 likes · 23 min read
Java Tech Enthusiast
Sep 1, 2025 · Artificial Intelligence

How Meituan’s LongCat‑Flash‑Chat Beats Top LLMs with Zero‑Computation Experts

LongCat‑Flash‑Chat, Meituan’s newly open‑sourced 560B MoE model, outperforms leading LLMs on agent tool use and instruction following benchmarks, introduces zero‑computation experts and shortcut‑connected MoE for higher throughput, and demonstrates strong programming and reasoning abilities across diverse evaluation tasks.

Meituan AI · Model architecture · Zero Computation Experts
0 likes · 12 min read
DataFunTalk
Aug 29, 2025 · Artificial Intelligence

Grok Code Fast 1: xAI’s New Coding Model 3× Faster, 6× Cheaper

Elon Musk’s xAI has launched Grok Code Fast 1, a new code‑generation model that claims to be three times faster and six times cheaper than GPT‑5, offering agentic programming capabilities, broad language support, free‑week trials on major IDE platforms, and competitive pricing with high cache hit rates.

AI code model · agentic programming · coding efficiency
0 likes · 6 min read
DataFunTalk
Aug 26, 2025 · Artificial Intelligence

Exploring Cutting-Edge AI & Knowledge Graph Applications: A Curated Resource Guide

This resource guide presents a curated list of cutting‑edge topics—including multimodal GraphRAG, knowledge‑graph‑driven large‑model applications in finance, traditional Chinese medicine, automotive manufacturing, and knowledge‑management trends—offering insights into AI‑powered knowledge services, and invites readers to scan the QR code to download the full e‑book.

Data Integration · Multimodal · ai
0 likes · 2 min read
Alibaba Cloud Native
Aug 25, 2025 · Artificial Intelligence

How 1688 AI App Redefines B2B E‑commerce with AI‑Powered Search and Multimodal Interfaces

The article examines the design shift from the traditional 1688 App to the AI‑native 1688 AI App, detailing how AI‑driven interfaces, system prompts, embedding‑based retrieval, multi‑agent routing, and AI gateways transform B2B product discovery, recommendation, and customization.

AI search · B2B e-commerce · Multimodal Retrieval
0 likes · 20 min read
Baidu Geek Talk
Aug 25, 2025 · Artificial Intelligence

How ERNIE‑4.5‑VL Redefines Multimodal AI with 100+ Language Support

The ERNIE‑4.5‑VL visual‑language model breaks single‑modality limits by delivering breakthrough image, video, and text understanding across more than 100 languages, offering lightweight yet competitive performance against models like Qwen2.5‑VL, supporting 128K context, dual “thinking” modes, and extensive deployment resources.

AI research · Ernie · Multimodal AI
0 likes · 4 min read
Data Party THU
Aug 24, 2025 · Artificial Intelligence

Can a ‘Centaur’ AI Model Truly Predict Human Decisions? A Deep Dive

This article reviews the Centaur foundation model, fine‑tuned from Llama 3.1 70B on the Psych‑101 dataset, and assesses its ability to predict human choices, brain activity, and decision rationales across diverse psychological experiments, while discussing generalization, over‑fitting, and the limits that remain for future research.

Centaur · Psychology · cognitive modeling
0 likes · 17 min read
Kuaishou Tech
Aug 23, 2025 · Artificial Intelligence

How Thyme Enables Models to Think Beyond Images with Code‑Driven Multimodal Reasoning

The Kwai Keye team presents Thyme, a novel multimodal reasoning framework that lets large language models generate and safely execute Python code for image manipulation and complex calculations, achieving significant performance gains over existing vision‑language models across perception, reasoning, and hallucination‑reduction benchmarks.

AI research · Multimodal · Reinforcement Learning
0 likes · 12 min read
Open Source Tech Hub
Aug 22, 2025 · Artificial Intelligence

Automate User Feedback Classification with a Large‑Model API in PHP

This guide shows how to use the Tongyi Qianwen large‑model API with PHP to automatically classify user feedback into predefined categories, eliminating manual analysis and complex NLP development while providing clear steps, code, and result interpretation for rapid business insights.

API · PHP · automation
0 likes · 7 min read
AI Algorithm Path
Aug 20, 2025 · Artificial Intelligence

DeepSeek V3.1 Open‑Source: Unlocking a New Era of Long‑Context AI

DeepSeek V3.1, a 685‑billion‑parameter open‑source model, supports up to 128,000 tokens, delivers mixed‑architecture capabilities, matches top‑tier closed systems in benchmarks, and its rapid community adoption signals a shift toward democratized AI development and new industry dynamics.

AI Performance · DeepSeek · large language model
0 likes · 6 min read
Instant Consumer Technology Team
Aug 15, 2025 · Artificial Intelligence

Master the iFLYTEK Prohibited Words Classification Challenge: Baselines & BERT

This article introduces the iFLYTEK AI Developer Competition on prohibited‑word classification, outlines the task, dataset, evaluation metric, and provides three baseline solutions—including a logistic‑regression model, a BERT fine‑tuning approach, and a large‑model prompt method—along with code snippets and performance notes.

BERT · NLP · competition
0 likes · 15 min read
Data Party THU
Aug 11, 2025 · Artificial Intelligence

What Makes GPT‑5 the Most Powerful AI Model Yet? A Deep Dive into Its Architecture and Benchmarks

The article analyzes GPT‑5’s unified system, advanced reasoning models, and impressive benchmark gains across programming, creative writing, and health domains, highlighting its new router, Verbosity API, and record‑setting performance on tasks such as Aider polyglot, AIME 2025, and HealthBench.

AI benchmarks · AI reasoning · GPT-5
0 likes · 7 min read
AI Algorithm Path
Aug 8, 2025 · Artificial Intelligence

GPT‑5 Is Here: In‑Depth Technical Walkthrough of Architecture, Features, and Benchmarks

OpenAI’s GPT‑5, released on August 7 2025, introduces a unified system with real‑time routing, up to 400 k token context windows, multiple model families, refined safety mechanisms, new API controls, and benchmark results that show it surpasses GPT‑4 across intelligence, coding, instruction following, function calling and multimodal tasks.

AI Architecture · API · GPT-5
0 likes · 9 min read
AntTech
Aug 6, 2025 · Artificial Intelligence

Ring-lite-2507: Boosted Deep Reasoning and Balanced General Capabilities

The AntBailing team releases Ring-lite-2507, which strengthens deep reasoning through a two‑stage RL pipeline while keeping the model's general capabilities balanced, shows notable gains on benchmarks such as ARC‑AGI‑v1, and is available as open source on major platforms.

RL training · Ring-lite · deep reasoning
0 likes · 5 min read
AI Info Trend
Aug 4, 2025 · Industry Insights

How AI Agents and Small Models Are Redefining Productivity in 2025 H1

The report analyzes first‑half‑2025 AI breakthroughs, covering the rise of general‑purpose agents, rapid inference improvements, small‑model proliferation, reinforcement‑learning compute dominance, evolving transformer architectures, and shifting industry dynamics, offering actionable insights for researchers, product leaders, and decision‑makers.

Multimodal · Reinforcement Learning · Trend
0 likes · 9 min read
Baobao Algorithm Notes
Aug 1, 2025 · Artificial Intelligence

Unlocking Qwen3-Coder-30B: Features, Fast Start, and Agentic Coding Guide

The article introduces Qwen3‑Coder‑30B‑A3B‑Instruct (aka Qwen3‑Coder‑Flash), detailing its architecture, 256K‑to‑1M token context, agentic coding capabilities, installation steps with Transformers, sample code for tool use, optimal sampling parameters, and deployment tips across various runtimes.

AI coding assistant · Agentic Coding · Deep Learning
0 likes · 6 min read
AI Algorithm Path
Jul 29, 2025 · Artificial Intelligence

Why GLM‑4.5 Sets a New Benchmark for Open‑Source Large Language Models

GLM‑4.5 and its lightweight Air variant, featuring a deep‑layered MoE design, grouped‑query attention, and dual inference modes, rank third overall on 12 demanding benchmarks, excel in web browsing and tool calling with a 90.6% success rate, and introduce novel training techniques such as the Muon optimizer and the Slime RL framework.

GLM-4.5 · MoE · ai
0 likes · 8 min read
AntTech
Jul 29, 2025 · Artificial Intelligence

How Ant Group’s Agentar‑Fin‑R1 Redefines Financial AI with Expert‑Level Reasoning

Ant Digital Technologies, an Ant Group company, released Agentar‑Fin‑R1, a finance‑focused large model that is said to offer expert‑level knowledge, efficient training, and continuous self‑evolution, outperforming open‑source rivals on benchmarks like FinEval1.0, FinanceIQ and Finova, while supporting industry standards through a collaborative AI alliance.

Agentar-Fin-R1 · Ant Group · Financial AI
0 likes · 5 min read
Model Perspective
Jul 27, 2025 · Artificial Intelligence

Build a Practical AI Agent from Scratch with Coze’s Low‑Code Platform

This guide walks you through creating a functional AI agent using the Coze low‑code platform, covering account setup, goal definition, visual workflow design with large‑model and image‑generation nodes, variable configuration, testing, and publishing the agent to multiple channels.

AI Agent · Coze · Prompt engineering
0 likes · 10 min read
Architecture and Beyond
Jul 27, 2025 · Artificial Intelligence

What Makes an AI Agent Tick? From Expert Systems to Modern Architectures

This article traces the evolution of AI agents from early expert systems to today’s multimodal, memory‑rich agents, explains their perception, reasoning, memory and action modules, discusses model selection, prompt engineering, RAG techniques, and highlights current limitations such as hallucinations, reliability, cost, and security.

AI Agent · Function Calling · Memory Architecture
0 likes · 28 min read
Zhihu Tech Column
Jul 25, 2025 · Artificial Intelligence

Boost Creative Writing with Zhi-Create-Qwen3-32B: Training, Eval & Deployment

This article introduces the open‑source Zhi‑Create‑Qwen3‑32B model, detailing its fine‑tuned training on creative‑writing data, the multi‑domain dataset strategy, curriculum‑learning based SFT, evaluation on WritingBench, and practical deployment options across various hardware and inference frameworks.

Deployment · creative writing · evaluation
0 likes · 11 min read
Fun with Large Models
Jul 24, 2025 · Artificial Intelligence

Qwen3‑Coder vs Claude 4: In‑Depth Performance Review and Usage Guide

This article evaluates the open‑source Qwen3‑Coder‑480B‑A35B model, comparing its programming and agentic capabilities to Claude 4 and other leading models, detailing its architecture, context length, post‑training reinforcement learning, ecosystem tools, and real‑world code‑generation case studies.

AI Coding · Agent RL · Qwen3-Coder
0 likes · 14 min read
DataFunTalk
Jul 23, 2025 · Artificial Intelligence

Qwen3‑Coder: Open‑Source AI Programming Agent That Beats the Competition

Alibaba’s Tongyi team unveiled the open‑source Qwen3‑Coder, a massive 480‑billion‑parameter programming model that outperforms leading closed‑source solutions, supports up to 1 M token context, offers a free CLI tool, and demonstrates impressive code generation capabilities across animations, games, and real‑world tasks.

AI programming · Qwen3-Coder · Reinforcement Learning
0 likes · 5 min read
Kuaishou Tech
Jul 21, 2025 · Artificial Intelligence

Can AI Models Think on Demand? Inside KAT‑V1 AutoThink’s Dynamic Reasoning

The article introduces KAT‑V1 AutoThink, a dual‑mode large language model that automatically switches between thinking and non‑thinking modes based on problem difficulty, details its novel training paradigm, reinforcement‑learning enhancements, performance benchmarks against leading open‑source models, and provides open‑source resources for further research.

Reinforcement Learning · auto-think · knowledge distillation
0 likes · 14 min read
Alibaba Cloud Developer
Jul 21, 2025 · Artificial Intelligence

How Browser‑Use Leverages AI Prompts for Seamless Browser Automation

This article explains how the open‑source browser‑use framework combines carefully designed SystemMessage prompts, structured HumanMessage inputs, and LangChain‑driven tool calls to enable large language models to automate complex web tasks such as shopping, CRM updates, résumé processing, and document generation, while providing concrete code examples and best‑practice tips.

AI automation · Browser Automation · LangChain
0 likes · 21 min read
Mingyi World Elasticsearch
Jul 18, 2025 · Artificial Intelligence

Video: Building an Intelligent Knowledge‑Base Q&A System with Large Models and Elasticsearch (RAG)

The video walks through the differences between traditional keyword search and vector search, explains the core concept of Retrieval‑Augmented Generation, and demonstrates how to construct a knowledge‑base Q&A system using a large language model integrated with Elasticsearch.

Elasticsearch · Knowledge Base · Q&A system
0 likes · 1 min read