Tagged articles
1025 articles
Page 6 of 11
Code Mala Tang
Code Mala Tang
Jun 5, 2025 · Artificial Intelligence

Mastering LLM Prompts: Proven Techniques to Get Precise Answers

By rethinking how we interact with large language models—using role‑play, task decomposition, chain‑of‑thought, ReAct, and other advanced prompting strategies—readers can transform generic ChatGPT answers into precise, context‑aware responses, leveraging pattern recognition and context windows for superior AI assistance.

AI reasoningChain-of-ThoughtLLM techniques
0 likes · 21 min read
Mastering LLM Prompts: Proven Techniques to Get Precise Answers
Kuaishou Large Model
Kuaishou Large Model
Jun 5, 2025 · Artificial Intelligence

7 Kuaishou Papers Accepted at ACL 2025 Reveal Cutting‑Edge AI Advances

Kuaishou's foundational large‑model team secured seven papers at the prestigious ACL 2025 conference, covering alignment bias during model training, safety in inference, decoding strategies, fine‑grained video‑temporal understanding, and new evaluation benchmarks that push the frontier of multimodal large language models.

ACL 2025Large Language ModelsMultimodal AI
0 likes · 16 min read
7 Kuaishou Papers Accepted at ACL 2025 Reveal Cutting‑Edge AI Advances
Fun with Large Models
Fun with Large Models
Jun 5, 2025 · Artificial Intelligence

EvalScope: The Ultimate Large‑Model Evaluation Framework You Control

This article introduces EvalScope, an open‑source framework for evaluating large language models, detailing its architecture, built‑in benchmarks, installation steps, and step‑by‑step guides for both performance stress testing and dataset‑based capability assessment, enabling users to independently verify model quality without relying on official documentation.

EvalScopeLarge Language ModelsModel Evaluation
0 likes · 12 min read
EvalScope: The Ultimate Large‑Model Evaluation Framework You Control
Kuaishou Tech
Kuaishou Tech
Jun 5, 2025 · Artificial Intelligence

7 Kuaishou AI Papers Accepted at ACL 2025: Video Understanding & Safe LLM Decoding

Kuaishou’s foundational large-model team has secured seven papers at ACL 2025, spanning alignment bias in training, safety defenses during inference, decoding strategies, fine-grained video-temporal understanding, reward fairness in RLHF, multimodal captioning benchmarks, and methods to curb hallucinations in vision-language models.

ACLAI SafetyLarge Language Models
0 likes · 13 min read
7 Kuaishou AI Papers Accepted at ACL 2025: Video Understanding & Safe LLM Decoding
AntTech
AntTech
Jun 4, 2025 · Artificial Intelligence

LLaDA and LLaDA‑V: Large Language Diffusion Models and Their Multimodal Extensions

This article presents the LLaDA series of diffusion‑based large language models, explains how their generative‑modeling principle yields language intelligence comparable to autoregressive models, and details the multimodal LLaDA‑V architecture, training methods, experimental results, and broader implications for AI research.

Diffusion ModelsGenerative ModelingLarge Language Models
0 likes · 10 min read
LLaDA and LLaDA‑V: Large Language Diffusion Models and Their Multimodal Extensions
DataFunTalk
DataFunTalk
Jun 3, 2025 · Artificial Intelligence

Meta‑Capability Alignment: Psychologically Inspired Training to Endow Large Language Models with Stable Reasoning

Researchers from NUS, Tsinghua and Salesforce AI Research introduce a meta‑capability alignment framework that integrates deductive, inductive and abductive reasoning via a psychology‑based triple, automatically generates and validates training data, and demonstrates over 10% accuracy gains on math, coding and scientific benchmarks for 7B and 32B models.

Large Language ModelsMeta‑Capability AlignmentModel Training
0 likes · 8 min read
Meta‑Capability Alignment: Psychologically Inspired Training to Endow Large Language Models with Stable Reasoning
Baobao Algorithm Notes
Baobao Algorithm Notes
Jun 3, 2025 · Artificial Intelligence

Can 1K Fine‑Tuning Replace 100K RL Steps? Insights from Re‑distillation Research

An extensive analysis shows that a 1K‑sample fine‑tuning stage can replicate the generalization gains of thousands of reinforcement‑learning steps, explains the compressibility of RL, introduces a sample‑effect theory, and demonstrates that re‑distillation and small‑scale SFT dramatically improve LLM performance.

Large Language ModelsRe-distillationReinforcement Learning
0 likes · 23 min read
Can 1K Fine‑Tuning Replace 100K RL Steps? Insights from Re‑distillation Research
Data Thinking Notes
Data Thinking Notes
Jun 2, 2025 · Artificial Intelligence

Why Pre‑Training Powers Modern AI: From Theory to Real‑World Applications

Pre‑training enables AI models to first acquire a universal knowledge map from massive unlabelled text, then quickly adapt to specific tasks with minimal labelled data, offering superior generalization, reduced annotation costs, and versatile applications across chatbots, content creation, retrieval, coding assistance, and more.

AI applicationsLarge Language ModelsTransformer
0 likes · 14 min read
Why Pre‑Training Powers Modern AI: From Theory to Real‑World Applications
AntTech
AntTech
May 31, 2025 · Artificial Intelligence

Machine Reasoning and Deep Thinking: Insights from Ant Financial’s NLP Lead Wu Wei

The article explores how DeepSeek R1 and long‑thinking chains have revived interest in machine reasoning, tracing the evolution of natural‑language models, defining reasoning as logical knowledge composition, and outlining future research directions in efficient reasoning architectures and deep‑thinking applications.

AI researchEfficient ReasoningLarge Language Models
0 likes · 8 min read
Machine Reasoning and Deep Thinking: Insights from Ant Financial’s NLP Lead Wu Wei
AntTech
AntTech
May 30, 2025 · Artificial Intelligence

Insights from Ant Group’s 10th Technical Open Day: Multimodal, Embodied, and Future Model Architectures for AGI

The Ant Group’s 10th Technical Open Day gathered leading AI experts who examined the current state and future directions of multimodal large models, embodied AI, world models, transformer architectures, and vertical applications, offering a comprehensive view of the challenges and opportunities on the path toward AGI.

AGIAI SafetyEmbodied AI
0 likes · 16 min read
Insights from Ant Group’s 10th Technical Open Day: Multimodal, Embodied, and Future Model Architectures for AGI
Model Perspective
Model Perspective
May 30, 2025 · Artificial Intelligence

Why Large Language Models Are Just Mathematical Functions: A Rational Perspective

The article argues that large language models are fundamentally mathematical functions that model human language, emphasizing their role as simplified representations, explaining their structural nature, sources of errors, the importance of prompts as boundary conditions, and the need for clear usage assumptions to avoid anthropomorphic misconceptions.

AI fundamentalsLarge Language ModelsPrompt engineering
0 likes · 11 min read
Why Large Language Models Are Just Mathematical Functions: A Rational Perspective
DevOps
DevOps
May 28, 2025 · Artificial Intelligence

Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems

Google researchers introduce a “sufficient context” framework that classifies retrieved passages as adequate or inadequate for answering a query, enabling large language models in enterprise RAG systems to decide when to answer, refuse, or request more information, thereby improving accuracy and reducing hallucinations.

AI reliabilityEnterprise AILarge Language Models
0 likes · 9 min read
Google Proposes a “Sufficient Context” Framework to Strengthen Enterprise Retrieval‑Augmented Generation Systems
JD Cloud Developers
JD Cloud Developers
May 27, 2025 · Artificial Intelligence

How JD’s Young AI Engineers Tackle Real-World Model Challenges

Young JD algorithm engineers share how they solve tough AI problems—from optimizing large‑model training and reward‑model design for ad image generation, to building LLM‑based query expansion, agent evaluation, and model pruning with FFT and RDP—illustrating practical breakthroughs and personal growth in cutting‑edge AI research.

AILarge Language ModelsModel Pruning
0 likes · 15 min read
How JD’s Young AI Engineers Tackle Real-World Model Challenges
Efficient Ops
Efficient Ops
May 26, 2025 · Artificial Intelligence

How AI Agents Are Revolutionizing AIOps: Boosting Automation and Efficiency

This article explains how AI agents enhance large‑model capabilities for AIOps, detailing single‑agent use cases like knowledge retrieval, tool guidance, and fault diagnosis, as well as multi‑agent collaborations, required skills, and future prospects for autonomous operations.

AILarge Language ModelsOperations
0 likes · 7 min read
How AI Agents Are Revolutionizing AIOps: Boosting Automation and Efficiency
JD Tech
JD Tech
May 26, 2025 · Artificial Intelligence

Solving Technical Challenges at JD Retail: Multi‑Reward Models, LLM‑Based Query Expansion, Model Pruning, and Reinforcement Learning

This article details how JD Retail's young algorithm engineers tackled a series of AI engineering problems—including advertising image quality assessment with multi‑reward models, large‑language‑model‑driven query expansion, FFT‑and‑RDP‑based model pruning, and agent‑centric reinforcement learning—while sharing practical growth insights and code snippets.

AIComputer VisionLarge Language Models
0 likes · 15 min read
Solving Technical Challenges at JD Retail: Multi‑Reward Models, LLM‑Based Query Expansion, Model Pruning, and Reinforcement Learning
AI Frontier Lectures
AI Frontier Lectures
May 25, 2025 · Artificial Intelligence

Can Alternating Generation‑Reduction Make LLMs Think Faster? Introducing PENCIL

The paper presents PENCIL, a novel alternating generation‑and‑erasure reasoning paradigm that achieves optimal space‑time complexity for chain‑of‑thought tasks, dramatically improves accuracy and efficiency on hard SAT, QBF, and Einstein puzzle benchmarks, and is provably Turing‑complete.

Benchmark resultsChain-of-ThoughtLarge Language Models
0 likes · 12 min read
Can Alternating Generation‑Reduction Make LLMs Think Faster? Introducing PENCIL
DataFunTalk
DataFunTalk
May 24, 2025 · Artificial Intelligence

Why Apple and WeChat’s AI Rollouts Are Slower Than Expected

The article analyses how privacy concerns, data‑security priorities and an application‑first strategy cause both Apple’s Apple Intelligence and WeChat’s AI features to lag behind hype, examining product decisions, technical constraints, and the potential future of AI agents within these ecosystems.

AI integrationAppleLarge Language Models
0 likes · 13 min read
Why Apple and WeChat’s AI Rollouts Are Slower Than Expected
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
May 19, 2025 · Artificial Intelligence

How WASP Generates High‑Quality DP Synthetic Data with Multi‑Model Collaboration

WASP is a privacy‑preserving framework that fuses multiple pretrained language models through a weighted Top‑Q voting scheme to synthesize differential‑private data, dramatically improving downstream task performance even when only a few private samples are available, and it scales to federated settings.

Federated LearningLarge Language ModelsMulti-Model Fusion
0 likes · 19 min read
How WASP Generates High‑Quality DP Synthetic Data with Multi‑Model Collaboration
21CTO
21CTO
May 17, 2025 · Artificial Intelligence

Are Large Language Models Killing Stack Overflow? Data Shows the Decline

Recent data confirms that large language models have dramatically reduced Stack Overflow’s monthly question volume, dropping to levels seen in 2009, with key milestones from 2014 to 2025 illustrating how policy changes, the pandemic surge, and the rise of ChatGPT accelerated the platform’s decline.

Large Language ModelsQuestion Volumedata analysis
0 likes · 5 min read
Are Large Language Models Killing Stack Overflow? Data Shows the Decline
Fighter's World
Fighter's World
May 17, 2025 · Industry Insights

Hidden Roadblocks That Sabotage B2B Large Model Products

The article dissects why many B2B GenAI projects fail to scale despite heavy investment, highlighting overlooked challenges in data preparation, model specialization, product integration, user experience, and organizational culture, and proposes concrete ways to bridge these gaps.

B2BGenAILarge Language Models
0 likes · 21 min read
Hidden Roadblocks That Sabotage B2B Large Model Products
Architects' Tech Alliance
Architects' Tech Alliance
May 16, 2025 · Industry Insights

Can DeepSeek Survive the AI Arms Race? A Deep Dive into Its Challenges and Competition

The article provides a comprehensive analysis of DeepSeek’s rise in the large‑model market, examining its technical merits, security and customization hurdles, slowing innovation, fierce competition from OpenAI, Google and Alibaba’s Qwen3, as well as the fragility of its open‑source ecosystem and data preparation, ultimately questioning its long‑term viability.

AI modelsDeepSeekIndustry Analysis
0 likes · 13 min read
Can DeepSeek Survive the AI Arms Race? A Deep Dive into Its Challenges and Competition
Architect's Guide
Architect's Guide
May 13, 2025 · Artificial Intelligence

DeepSeek Model Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges

This article provides a comprehensive overview of DeepSeek's model distillation technology, detailing its definition, key innovations, architecture, training methods, performance gains, and the remaining challenges such as the implicit performance ceiling and multimodal data distillation.

AI OptimizationDeepSeekKnowledge Transfer
0 likes · 14 min read
DeepSeek Model Distillation Technology: Overview, Innovations, Architecture, Training, Performance, and Challenges
Baidu Geek Talk
Baidu Geek Talk
May 12, 2025 · Artificial Intelligence

One‑Click Deployment of Baidu Qwen3 Large Models on Baidu Baige AI Platform

This guide explains how to use Baidu Baige's AI heterogeneous computing platform to deploy the eight‑model Qwen3 family—including dense and MoE variants—via a one‑click process, covering resource configuration, inference acceleration options, and post‑deployment service access.

AIBaidu BaigeCloud AI
0 likes · 4 min read
One‑Click Deployment of Baidu Qwen3 Large Models on Baidu Baige AI Platform
Youzan Coder
Youzan Coder
May 12, 2025 · Artificial Intelligence

How Large Language Models Empower Business Development Engineers: Data Analysis, Model Training, and Rapid Prototyping

This article demonstrates how large language models can augment business development engineers by providing data insight, automating algorithm training, and enabling low‑cost rapid product prototyping, thereby transforming traditional backend‑focused roles into full‑stack, AI‑enhanced innovators.

AILarge Language ModelsPython
0 likes · 10 min read
How Large Language Models Empower Business Development Engineers: Data Analysis, Model Training, and Rapid Prototyping
AI Frontier Lectures
AI Frontier Lectures
May 10, 2025 · Artificial Intelligence

Can the ‘Canon’ Layer Unlock New Limits in Large Language Models?

A new study introduces the lightweight “Canon” layer for large language models, showing how it improves information flow, inference depth, and scalability across Transformers, linear attention, and state‑space architectures, while offering a controlled synthetic pre‑training benchmark for deeper architectural analysis.

AI researchLarge Language ModelsMamba
0 likes · 11 min read
Can the ‘Canon’ Layer Unlock New Limits in Large Language Models?
JD Tech
JD Tech
May 8, 2025 · Artificial Intelligence

The Emerging Boom of Large Model Applications and Why 2025 Will Be the Turning Point

Amid the AI wave, large language models like DeepSeek R1 are poised to explode by 2025, driven by open-source, low-cost access and superior reasoning, with successful deployment requiring four key factors—domain expertise, knowledge bases, robust search, and engineered agent architectures—to unlock value beyond simple chat.

2025AI applicationsAgent Architecture
0 likes · 10 min read
The Emerging Boom of Large Model Applications and Why 2025 Will Be the Turning Point
Frontend AI Walk
Frontend AI Walk
May 7, 2025 · Artificial Intelligence

How Cursor AI Coding Tool Transforms Development Workflow

The article introduces Cursor, an AI‑powered coding assistant, outlines its supported large models, demonstrates practical front‑end use cases such as automatic layout creation, button logic, screenshot‑to‑code generation, error fixing and code cleanup, and reflects on prompt engineering and tool selection.

AI coding assistantCursorLarge Language Models
0 likes · 6 min read
How Cursor AI Coding Tool Transforms Development Workflow
Architect
Architect
May 5, 2025 · Artificial Intelligence

How Agentic RAG‑R1 Turns Retrieval‑Augmented Generation into an Autonomous AI Agent

Agentic RAG‑R1, an open‑source project from Peking University, combines Retrieval‑Augmented Generation with an agentic AI loop, introduces the GRPO reinforcement‑learning optimizer, supports LoRA‑based fine‑tuning, quantization and multimodal tool calls, and demonstrates significant accuracy gains on the MedQA benchmark across both Chinese and English test sets.

Agentic AILLM Tool UseLarge Language Models
0 likes · 8 min read
How Agentic RAG‑R1 Turns Retrieval‑Augmented Generation into an Autonomous AI Agent
AI Frontier Lectures
AI Frontier Lectures
May 5, 2025 · Industry Insights

What Will Large Language Models Look Like in the Next Five Years? A Deep Dive into Trends and Challenges

The article reviews five years of AI model evolution, analyzes current scaling and reinforcement‑learning trends, and forecasts architectural, mathematical, and infrastructure directions for large language models through 2030, highlighting potential breakthroughs and the risks of over‑reliance on benchmarks.

AI trendsIndustry AnalysisLarge Language Models
0 likes · 22 min read
What Will Large Language Models Look Like in the Next Five Years? A Deep Dive into Trends and Challenges
Code Mala Tang
Code Mala Tang
May 2, 2025 · Artificial Intelligence

Debunking Common Misconceptions About the Model Context Protocol (MCP)

This article clarifies three major misunderstandings about the Model Context Protocol (MCP), explaining that it does not require large‑model support, works even without function‑calling capabilities, and is not natively built into models, while outlining how MCP standardizes context augmentation through a black‑box server architecture.

AIFunction CallingLarge Language Models
0 likes · 5 min read
Debunking Common Misconceptions About the Model Context Protocol (MCP)
JD Tech
JD Tech
Apr 30, 2025 · Artificial Intelligence

TimeHF: A Billion‑Scale Time Series Forecasting Model Guided by Human Feedback

The JD Supply Chain algorithm team introduces TimeHF, a billion‑parameter time‑series large model that leverages RLHF to boost demand‑forecast accuracy by over 10%, detailing dataset construction, the PCTLM architecture, a custom RLHF framework (TPO), and extensive SOTA experimental results.

Big DataDeep LearningLarge Language Models
0 likes · 10 min read
TimeHF: A Billion‑Scale Time Series Forecasting Model Guided by Human Feedback
Cognitive Technology Team
Cognitive Technology Team
Apr 30, 2025 · Artificial Intelligence

AI Claims of Human-Level Intelligence Unveiled: Reliance on Massive Rules Over True Reasoning

The article critiques AI giants’ claims of nearing human-level intelligence, highlighting research that shows current models rely on massive rule memorization rather than genuine reasoning, leading to brittleness in navigation, mathematics, and adaptability, and emphasizing the need to understand these limitations for future progress.

AI limitationsArtificial IntelligenceLarge Language Models
0 likes · 8 min read
AI Claims of Human-Level Intelligence Unveiled: Reliance on Massive Rules Over True Reasoning
Data Thinking Notes
Data Thinking Notes
Apr 29, 2025 · Artificial Intelligence

From Transformers to DeepSeek‑R1: How LLMs Evolved to 2025

This article chronicles the evolution of large language models from the 2017 Transformer breakthrough through BERT, GPT series, multimodal models, and recent cost‑efficient innovations like DeepSeek‑R1, highlighting key architectures, training methods, alignment techniques, and their transformative impact on AI applications.

AI AlignmentLarge Language ModelsMultimodal
0 likes · 29 min read
From Transformers to DeepSeek‑R1: How LLMs Evolved to 2025
DevOps
DevOps
Apr 28, 2025 · Artificial Intelligence

Vibe Coding: An Introduction to AI‑Driven Natural‑Language Programming

This article introduces Vibe Coding, an AI‑driven programming approach proposed by Andrej Karpathy, explains its core concepts, workflow, advantages, tools, use cases, best practices, and future outlook, and provides a complete example of generating a simple weather app using natural‑language prompts.

AI programmingLarge Language ModelsVibe Coding
0 likes · 16 min read
Vibe Coding: An Introduction to AI‑Driven Natural‑Language Programming
ITPUB
ITPUB
Apr 28, 2025 · Artificial Intelligence

How Large Language Models are Transforming Automotive Operations and Optimization

In this interview, an automotive industry expert explains how large language models and advanced operations‑optimization techniques are reshaping vehicle design, production planning, logistics, and customer services, while also discussing implementation challenges, team requirements, and future AI‑driven opportunities.

AI adoptionAutomotive AILarge Language Models
0 likes · 15 min read
How Large Language Models are Transforming Automotive Operations and Optimization
ZhongAn Tech Team
ZhongAn Tech Team
Apr 28, 2025 · Artificial Intelligence

Weekly Tech Overview: Major AI Model Updates, Industry Funding, and Expert Perspectives on AI Agents and Consciousness

This weekly technology digest highlights significant advancements in artificial intelligence, including OpenAI's GPT-4o upgrades, Tencent's Hunyuan 3D v2.5 release, and major funding rounds for xAI and Manus, alongside expert discussions on the future evolution of AI agent networks and the theoretical possibility of machine consciousness.

AI agentsAI fundingArtificial Intelligence
0 likes · 7 min read
Weekly Tech Overview: Major AI Model Updates, Industry Funding, and Expert Perspectives on AI Agents and Consciousness
Volcano Engine Developer Services
Volcano Engine Developer Services
Apr 28, 2025 · Artificial Intelligence

How ByteBrain’s AI‑Powered Infra is Redefining Cloud and Database Performance

ByteDance’s ByteBrain team showcases how large‑model AI, operations research, and system‑level innovations have produced award‑winning papers and billions of yuan in cost savings while improving on‑call efficiency, database estimation, and cloud infrastructure reliability.

AIInfrastructure OptimizationLarge Language Models
0 likes · 9 min read
How ByteBrain’s AI‑Powered Infra is Redefining Cloud and Database Performance
AsiaInfo Technology: New Tech Exploration
AsiaInfo Technology: New Tech Exploration
Apr 25, 2025 · Artificial Intelligence

How Evidence Generation Boosts Document-Grounded Dialogue with LLMs

This study introduces DGDE, a document‑grounded dialogue framework that leverages large language model‑generated evidence, combining retrieval, reranking, fine‑tuning, and iterative question correction to markedly improve accuracy, comprehensiveness, coherence, and completeness on the Doc2dial benchmark.

Fine-tuningLarge Language Modelsdocument-grounded dialogue
0 likes · 21 min read
How Evidence Generation Boosts Document-Grounded Dialogue with LLMs
DataFunTalk
DataFunTalk
Apr 25, 2025 · Artificial Intelligence

Does Reinforcement Learning Really Expand Reasoning Capacity in Large Language Models? Insights from Recent Empirical Study

Recent empirical research by Tsinghua’s LeapLab and Shanghai Jiao Tong University reveals that reinforcement‑learning‑based fine‑tuning (RLVR) improves sampling efficiency but does not extend the fundamental reasoning abilities of large language models beyond their base capabilities, as demonstrated across mathematics, code, and visual reasoning benchmarks.

AI researchLarge Language ModelsRLVR
0 likes · 12 min read
Does Reinforcement Learning Really Expand Reasoning Capacity in Large Language Models? Insights from Recent Empirical Study
Didi Tech
Didi Tech
Apr 24, 2025 · Artificial Intelligence

Algorithmic Foundations and Evolution of Natural Language Processing

The article surveys the Algorithmic Foundations of Engineering R&D series, tracing NLP’s evolution from rule‑based systems to today’s multimodal large‑model era, reviewing core machine‑learning and deep‑learning techniques, transformer breakthroughs, representation learning, optimization methods, and emerging research such as retrieval‑augmented generation and AI agents.

AILarge Language ModelsNLP
0 likes · 43 min read
Algorithmic Foundations and Evolution of Natural Language Processing
Alimama Tech
Alimama Tech
Apr 23, 2025 · Artificial Intelligence

How AI Agents Outsmart Humans in the “Who Is Spy” Campus Challenge

The campus AI Agent competition showcased how large‑language‑model‑powered agents can reason, deceive, and collaborate in a social deduction game, revealing model performance trends, participant insights, and future directions for multi‑agent AI research.

AIAgent CompetitionLarge Language Models
0 likes · 6 min read
How AI Agents Outsmart Humans in the “Who Is Spy” Campus Challenge
DevOps
DevOps
Apr 22, 2025 · Artificial Intelligence

How to Think About Agent Frameworks: A Critical Review of Design Patterns, Challenges, and LangGraph

This article critically examines popular agent frameworks, compares OpenAI and Anthropic definitions, highlights the core difficulty of maintaining proper context for reliable agents, and presents LangGraph’s declarative and imperative features along with practical guidance for building production‑grade agent systems.

AI EngineeringAgent FrameworksAgent Systems
0 likes · 24 min read
How to Think About Agent Frameworks: A Critical Review of Design Patterns, Challenges, and LangGraph
Architects' Tech Alliance
Architects' Tech Alliance
Apr 22, 2025 · Artificial Intelligence

What Are AI Agents? Definitions, Types, and Cutting‑Edge Technologies Explained

This article provides a comprehensive overview of AI agents, covering their definition, classification into language‑based, vision‑based, and multimodal types, core capabilities such as understanding, perception, planning, and action, and recent breakthroughs like OpenAI ComputerUse, SpiritSight, and MobileFlow.

AI agentsComputerUseLarge Language Models
0 likes · 9 min read
What Are AI Agents? Definitions, Types, and Cutting‑Edge Technologies Explained
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Apr 22, 2025 · Artificial Intelligence

How DistilQwen2.5-DS3-0324 Achieves Fast, Accurate Reasoning via Quick‑Think Distillation

This article introduces DistilQwen2.5-DS3-0324, a distilled language model series that balances rapid inference with strong reasoning by applying a fast‑thinking chain‑of‑thought strategy, details its two‑stage distillation framework, evaluation on diverse benchmarks, and provides code for downloading and using the models.

Chain-of-ThoughtDeep LearningLarge Language Models
0 likes · 17 min read
How DistilQwen2.5-DS3-0324 Achieves Fast, Accurate Reasoning via Quick‑Think Distillation
Architect
Architect
Apr 21, 2025 · Artificial Intelligence

Microsoft Research Releases BitNet b1.58 2B4T: A 1‑Bit Native Large Language Model with Ultra‑Low Memory and Energy Consumption

Microsoft Research introduced BitNet b1.58 2B4T, a native 1‑bit large language model with 2 billion parameters trained on 4 trillion tokens, achieving only 0.4 GB non‑embedding memory, 0.028 J decoding energy, and 29 ms CPU latency while matching full‑precision performance.

1-bit LLMAI researchBitNet
0 likes · 7 min read
Microsoft Research Releases BitNet b1.58 2B4T: A 1‑Bit Native Large Language Model with Ultra‑Low Memory and Energy Consumption
AI Frontier Lectures
AI Frontier Lectures
Apr 19, 2025 · Artificial Intelligence

Why Recent AI Model Gains May Be Illusory: Benchmark Gaps and Real‑World Limits

The author argues that since August 2023 AI large‑model improvements have stalled in practical applications, with benchmark scores diverging from user experience, citing security‑scanning experiments, possible benchmark gaming, and alignment bottlenecks that undermine confidence in claimed progress.

AIBenchmarkingLarge Language Models
0 likes · 13 min read
Why Recent AI Model Gains May Be Illusory: Benchmark Gaps and Real‑World Limits
Baidu Tech Salon
Baidu Tech Salon
Apr 16, 2025 · Artificial Intelligence

Release of the 'Fangsheng' Large Model Benchmark Results (Q1 2025) and Overview of Baidu's Wenxin 4.5 and X1 Models

The China AI Industry Alliance unveiled its Q1 2025 Fangsheng benchmark, showing Baidu’s new multimodal models—Wenxin 4.5 leading basic abilities and Wenxin X1 excelling in reasoning—available for free on the Wenxin Yiyan platform, while Baidu pledges major 2025 investments in AI, data‑center and cloud infrastructure.

AIFactTestingLarge Language Models
0 likes · 4 min read
Release of the 'Fangsheng' Large Model Benchmark Results (Q1 2025) and Overview of Baidu's Wenxin 4.5 and X1 Models
Baidu Geek Talk
Baidu Geek Talk
Apr 16, 2025 · Industry Insights

What Do the Latest AIIA FactTesting Benchmarks Reveal About China’s Large Language Models?

At the AIIA’s 14th plenary meeting in Nanjing, the FactTesting benchmark released its Q1 2025 results, evaluating over 200 large models and highlighting Baidu’s Wenxin 4.5 and Wenxin X1 as leaders in basic and reasoning capabilities, while outlining the expanded multimodal and agent testing roadmap for the year.

AI BenchmarkChina AIFactTesting
0 likes · 5 min read
What Do the Latest AIIA FactTesting Benchmarks Reveal About China’s Large Language Models?
DaTaobao Tech
DaTaobao Tech
Apr 16, 2025 · Artificial Intelligence

Comparative Analysis of AI Development Tools (2024‑2025)

The 2024‑2025 comparative review evaluates cloud‑based AI development platforms, AI‑native code editors, IDE plugins, and their underlying large language models—detailing features, user experience, pricing, open‑source status, strengths and weaknesses, offering recommendations for UI prototyping, full‑stack projects, and forecasting future multimodal, collaborative AI‑assisted development trends.

AI Development ToolsLarge Language ModelsProduct Comparison
0 likes · 24 min read
Comparative Analysis of AI Development Tools (2024‑2025)
Data Thinking Notes
Data Thinking Notes
Apr 15, 2025 · Artificial Intelligence

Understanding AI Agents: From Reinforcement Learning to LLM-Powered Planning

Professor Li Hongyi’s lecture provides a comprehensive, step‑by‑step exploration of AI agents, covering their definitions, reinforcement‑learning roots, LLM integration, memory mechanisms, tool usage, planning strategies, benchmarks, and practical examples, offering a valuable resource for anyone studying modern artificial intelligence.

AI agentsLarge Language ModelsMemory
0 likes · 67 min read
Understanding AI Agents: From Reinforcement Learning to LLM-Powered Planning
Baidu Geek Talk
Baidu Geek Talk
Apr 14, 2025 · Artificial Intelligence

PaddlePaddle Framework 3.0: Five Core Breakthroughs Reshaping Large Model Development

PaddlePaddle Framework 3.0 delivers five breakthroughs—dynamic‑static unified automatic parallelism, integrated training‑inference pipelines, high‑order scientific differentiation, a neural‑network compiler with automatic operator fusion, and streamlined heterogeneous chip adaptation—drastically reducing development effort, boosting training speed, and expanding compatibility for large‑scale AI models.

AI InfrastructureDistributed TrainingLarge Language Models
0 likes · 23 min read
PaddlePaddle Framework 3.0: Five Core Breakthroughs Reshaping Large Model Development
AI Algorithm Path
AI Algorithm Path
Apr 13, 2025 · Artificial Intelligence

Understanding GRPO: Group Relative Policy Optimization for LLM Training

The article explains GRPO, a reinforcement‑learning algorithm that extends PPO with group sampling, no value network, dual penalties and KL regularisation, showing how it improves efficiency and stability when fine‑tuning large language models such as DeepSeek‑Math and DeepSeek‑R1.

DeepSeekGRPOLarge Language Models
0 likes · 6 min read
Understanding GRPO: Group Relative Policy Optimization for LLM Training
AntTech
AntTech
Apr 10, 2025 · Artificial Intelligence

Ant Group Presents Four AI Research Papers at ICLR 2025 Live Showcase

At the ICLR 2025 live session in Singapore, Ant Group showcased four cutting‑edge papers—CodePlan, Animate‑X, Group Position Embedding, and OmniKV—demonstrating advances in large‑language‑model reasoning, universal character animation, layout‑aware document understanding, and efficient long‑context inference.

AI researchLarge Language ModelsMultimodal
0 likes · 6 min read
Ant Group Presents Four AI Research Papers at ICLR 2025 Live Showcase
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 9, 2025 · Artificial Intelligence

Unlocking LLM Reasoning: A Deep Dive into Prompt Engineering Techniques

This article surveys classic prompt‑engineering methods such as Chain‑of‑Thought, Self‑Consistency, Least‑to‑Most, Boosting of Thoughts, Tree of Thoughts, and AutoGPT, summarizing their core ideas, advantages, limitations, and experimental results to help readers understand how to enhance large language model reasoning without model fine‑tuning.

AI reasoningChain-of-ThoughtLarge Language Models
0 likes · 22 min read
Unlocking LLM Reasoning: A Deep Dive into Prompt Engineering Techniques
Big Data Technology & Architecture
Big Data Technology & Architecture
Apr 9, 2025 · Artificial Intelligence

Overview of Data Agents: Definitions, Applications, and Recent Developments by Google, Alibaba Cloud, and ByteDance

This article introduces the concept of AI-powered Data Agents, outlines their key features and use cases across enterprise analytics, data governance, and intelligent customer service, and reviews recent implementations from Google, Alibaba Cloud, and ByteDance, highlighting their impact on modern data-driven workflows.

Artificial IntelligenceData AgentEnterprise AI
0 likes · 8 min read
Overview of Data Agents: Definitions, Applications, and Recent Developments by Google, Alibaba Cloud, and ByteDance
AIWalker
AIWalker
Apr 8, 2025 · Artificial Intelligence

AgenticIR: An Agentic System for Restoring Images with Complex Degradations

AgenticIR combines visual language models and large language models in a multi‑stage reasoning workflow—perception, planning, execution, reflection, and adjustment—to evaluate, plan, and iteratively apply specialized restoration tools, achieving superior results on complexly degraded images compared to baseline methods.

ICLR 2025Image RestorationLarge Language Models
0 likes · 15 min read
AgenticIR: An Agentic System for Restoring Images with Complex Degradations
Model Perspective
Model Perspective
Apr 8, 2025 · Artificial Intelligence

Why Learning Machine Learning Still Matters in the Age of Giant AI Models

The article argues that despite the rapid rise of powerful large language models, mastering machine learning remains essential because it underpins these models, offers customized solutions for specialized tasks, and cultivates the mathematical, programming, and analytical skills needed to effectively use and extend AI technologies.

AILarge Language Modelseducation
0 likes · 10 min read
Why Learning Machine Learning Still Matters in the Age of Giant AI Models
macrozheng
macrozheng
Apr 8, 2025 · Artificial Intelligence

Boost AI Prompt Quality with Prompt Optimizer: Features, Docker Setup & Real‑World Demo

This guide introduces Prompt Optimizer, a client‑side AI prompt‑enhancement tool with over 2k GitHub stars, outlines its key features, provides step‑by‑step Docker installation commands, showcases a real‑world SpringBoot‑Vue e‑commerce project, and demonstrates how to generate and compare optimized prompts for better LLM responses.

AI Prompt OptimizationDockerLarge Language Models
0 likes · 6 min read
Boost AI Prompt Quality with Prompt Optimizer: Features, Docker Setup & Real‑World Demo
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 8, 2025 · Artificial Intelligence

Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration

This article introduces the fundamental paradigms of large language models, explaining how simple prompts, messages, and tools like RAG and ReAct enable powerful applications, while providing practical code examples, translation strategies, and insights on prompt engineering, tool usage, and model fine‑tuning.

AILLM applicationsLarge Language Models
0 likes · 23 min read
Unlocking LLM Secrets: From Prompt Basics to RAG and Tool Integration
DataFunSummit
DataFunSummit
Apr 7, 2025 · Artificial Intelligence

Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents

This article examines how Retrieval‑Augmented Generation (RAG) and multi‑agent technologies narrow the gap between large language models and practical deployment, highlighting their roles in operations automation, financial risk control, intelligent data governance, database localization, edge inference, and future AI‑driven solutions.

Data GovernanceLarge Language ModelsOperations Automation
0 likes · 8 min read
Bridging the Gap Between Large Models and Real‑World Applications with RAG and Agents
Architecture and Beyond
Architecture and Beyond
Apr 5, 2025 · Artificial Intelligence

Why Defining Problem Boundaries Is Crucial for Effective AI Agents

The article discusses how defining clear problem boundaries is essential for AI agents, explains the challenges of vague tasks for large language models, and proposes multi‑stage decomposition, self‑reflection, and human‑in‑the‑loop strategies to improve AI performance on complex, dynamic tasks.

AIAgent ArchitectureLarge Language Models
0 likes · 13 min read
Why Defining Problem Boundaries Is Crucial for Effective AI Agents
Ops Development & AI Practice
Ops Development & AI Practice
Apr 4, 2025 · Industry Insights

Are Open‑Source LLMs Closing the Gap with Closed‑Source Giants?

A recent leaderboard analysis of top LLMs reveals that while closed‑source models like Gemini‑2.5‑Pro and ChatGPT‑4o still lead overall, open‑source models such as DeepSeek‑V3 and Llama are rapidly narrowing the performance gap, especially in specialized tasks like coding, driven by faster tech diffusion, public datasets, community collaboration, and reduced compute costs.

AI competitionLarge Language Modelsindustry trends
0 likes · 8 min read
Are Open‑Source LLMs Closing the Gap with Closed‑Source Giants?
Code Mala Tang
Code Mala Tang
Apr 3, 2025 · Artificial Intelligence

Intel Core Ultra 5 vs Apple M1: Which Wins for Large Language Model Inference?

This article compares the inference performance of a high‑end Intel Core Ultra 5 AI workstation with an Apple M1 MacBook Air using the IPEX‑LLM library, detailing installation steps, minimal code changes, resource usage, and benchmark results for small and large language models.

AI inferenceApple M1IPEX-LLM
0 likes · 9 min read
Intel Core Ultra 5 vs Apple M1: Which Wins for Large Language Model Inference?
JD Retail Technology
JD Retail Technology
Apr 2, 2025 · Artificial Intelligence

One4All: A Scalable Multi‑Task Generative Recommendation Framework for CPS Advertising

The paper introduces One4All, a scalable multi‑task generative recommendation framework for CPS advertising that combines few‑shot intent prompting, a Rewards‑in‑Context multi‑objective optimization, and an online model‑selection strategy, delivering 2‑3× offline HitRate/NDCG gains and notable online CTR, CVR, and commission improvements.

AdvertisingLLMLarge Language Models
0 likes · 14 min read
One4All: A Scalable Multi‑Task Generative Recommendation Framework for CPS Advertising
Architects' Tech Alliance
Architects' Tech Alliance
Apr 1, 2025 · Artificial Intelligence

What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked

This article reviews the latest updates from major LLM providers—DeepSeek V3’s parameter boost and longer context, Qwen2.5‑Omni’s open‑source multimodal 7B model, Google Gemini 2.5 Pro’s 1 M‑token window and multimodal prowess, and OpenAI GPT‑4o’s image generation and reduced pricing—highlighting technical specs, capabilities, and availability.

DeepSeekGPT-4oGemini
0 likes · 9 min read
What’s New in Large Language Models? DeepSeek V3, Qwen2.5‑Omni, Gemini 2.5 Pro, and GPT‑4o Unpacked
Architect
Architect
Apr 1, 2025 · Artificial Intelligence

When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG

The article explains why most projects should start with prompt engineering or simple agent workflows, outlines the scenarios where model fine‑tuning adds real value, compares fine‑tuning with Retrieval‑Augmented Generation, and offers practical criteria for deciding which approach to adopt.

AI deploymentLarge Language ModelsLoRA
0 likes · 9 min read
When to Fine‑Tune Large Language Models vs. Relying on Prompting and RAG
AntTech
AntTech
Mar 31, 2025 · Artificial Intelligence

Ant Group Papers Accepted at ICLR 2025: Summaries and Links

The article presents the abstracts, publication types, links, and research areas of seventeen Ant Group papers accepted at ICLR 2025, covering topics such as embodied robot co‑design, efficient distributed training for large language models, optimization via LLMs, character animation, interactive frame interpolation, KV‑cache management, and privacy‑preserving Transformers.

AI researchAnt GroupICLR2025
0 likes · 23 min read
Ant Group Papers Accepted at ICLR 2025: Summaries and Links
Architects' Tech Alliance
Architects' Tech Alliance
Mar 31, 2025 · Artificial Intelligence

A Comprehensive History of Large Language Models from the Transformer Era (2017) to DeepSeek‑R1 (2025)

This article reviews the evolution of large language models from the 2017 Transformer breakthrough through BERT, GPT series, alignment techniques, multimodal extensions, open‑weight releases, and the cost‑efficient DeepSeek‑R1 in 2025, highlighting key technical advances, scaling trends, and their societal impact.

AI AlignmentLLM evolutionLarge Language Models
0 likes · 26 min read
A Comprehensive History of Large Language Models from the Transformer Era (2017) to DeepSeek‑R1 (2025)
Data Thinking Notes
Data Thinking Notes
Mar 30, 2025 · Artificial Intelligence

How DeepSeek‑R1 and Kimi‑K1.5 Push the Boundaries of Strong Reasoning Models

This comprehensive analysis by the Peking University AI Alignment team dissects the technical innovations behind DeepSeek‑R1, DeepSeek‑R1 Zero, and Kimi‑K1.5, covering reinforcement‑learning‑based post‑training, rule‑based rewards, GRPO optimization, scaling laws, multimodal extensions, safety challenges, and future research directions.

AI AlignmentDeepSeekKimi
0 likes · 57 min read
How DeepSeek‑R1 and Kimi‑K1.5 Push the Boundaries of Strong Reasoning Models
Architect
Architect
Mar 30, 2025 · Artificial Intelligence

What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques

This article provides a comprehensive survey of Retrieval‑Augmented Generation (RAG), covering its basic principles, key components, seven technical variants, challenges, evaluation methods, and future research directions across multimodal, graph‑based, and agentic extensions.

AI SurveyKnowledge RetrievalLarge Language Models
0 likes · 9 min read
What Is Retrieval-Augmented Generation? A Deep Dive into RAG Techniques
AI Frontier Lectures
AI Frontier Lectures
Mar 30, 2025 · Artificial Intelligence

Do Large Language Models Mirror Human Brain Language Processing? Google’s Groundbreaking Findings

Google researchers discovered a linear relationship between brain activity recorded during natural conversation and the internal embeddings of a speech‑to‑text large language model, revealing that acoustic and lexical representations from the model can accurately predict neural responses in both language comprehension and production.

AI researchGoogleLarge Language Models
0 likes · 8 min read
Do Large Language Models Mirror Human Brain Language Processing? Google’s Groundbreaking Findings
Cognitive Technology Team
Cognitive Technology Team
Mar 30, 2025 · Artificial Intelligence

Why Prompt Engineering Is the “Mind‑Reading” Technique of AI: The Crucial Role of In‑Context Learning

Prompt engineering uses in‑context learning to turn large language models into precise, task‑aware assistants by providing well‑crafted prompts that guide the model’s probability distribution, reduce hallucinations, and unlock hidden knowledge without any parameter tuning.

Artificial IntelligenceIn-Context LearningLarge Language Models
0 likes · 6 min read
Why Prompt Engineering Is the “Mind‑Reading” Technique of AI: The Crucial Role of In‑Context Learning
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 29, 2025 · Artificial Intelligence

How DistilQwen2.5‑R1 Boosts Small‑Model Reasoning with Innovative Knowledge Distillation

The article introduces the DistilQwen2.5‑R1 series, which leverages a novel knowledge‑distillation pipeline—including CoT data evaluation, improvement, and validation—to transfer deep reasoning abilities from large models like DeepSeek‑R1 to compact models, achieving superior performance across math, code, and scientific benchmarks and providing open‑source checkpoints and deployment guides for practical use.

AI inferenceLarge Language Modelsbenchmark evaluation
0 likes · 17 min read
How DistilQwen2.5‑R1 Boosts Small‑Model Reasoning with Innovative Knowledge Distillation
Alimama Tech
Alimama Tech
Mar 28, 2025 · Artificial Intelligence

How Alibaba’s Taobao AI Models Revolutionize E‑Commerce Recommendations and Bidding

Alibaba’s Taobao Group unveiled its AIGX technology suite, including the RecGPT recommendation model, the AIGB generative bidding system, and a new AI‑generated video engine, detailing open‑source benchmarks, NeurIPS workshop participation, and measurable ROI improvements for e‑commerce advertising.

AIGenerative BiddingLarge Language Models
0 likes · 5 min read
How Alibaba’s Taobao AI Models Revolutionize E‑Commerce Recommendations and Bidding
Qborfy AI
Qborfy AI
Mar 28, 2025 · Artificial Intelligence

Master Prompt Engineering: From Basics to Advanced SQL Generation

This article walks readers through the fundamentals of prompt engineering—covering role, context, instruction, examples, and output formatting—then demonstrates a step‑by‑step construction of a sophisticated SQL‑generation prompt, complete with concrete code snippets, best‑practice tips, and reference resources.

AI Prompt DesignInstruction TuningLarge Language Models
0 likes · 21 min read
Master Prompt Engineering: From Basics to Advanced SQL Generation
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 26, 2025 · Artificial Intelligence

Why DeepSeek Is Shaking Up the LLM Landscape: Architecture, Performance, and Cost

DeepSeek, a Chinese AI startup, offers open‑source large language models—DeepSeek‑V3 for general tasks and DeepSeek‑R1 for intensive reasoning—featuring MoE, MLA, low‑cost training, and competitive performance against OpenAI’s GPT‑4o, while providing detailed usage guidance and cost analysis.

AI inferenceDeepSeekLarge Language Models
0 likes · 21 min read
Why DeepSeek Is Shaking Up the LLM Landscape: Architecture, Performance, and Cost
Architects' Tech Alliance
Architects' Tech Alliance
Mar 25, 2025 · Industry Insights

How Near‑Memory Computing Can Power Edge LLMs: A 2025 Storage Framework

The article analyzes the challenges of deploying large language models on cloud servers—such as latency, security, and constant connectivity—and explains how near‑memory computing architectures (PNM, PIM, CIM) can integrate storage and processing to enable efficient, high‑performance edge AI deployments, outlining the trade‑offs of each approach.

Artificial IntelligenceLarge Language ModelsNear-Memory Computing
0 likes · 5 min read
How Near‑Memory Computing Can Power Edge LLMs: A 2025 Storage Framework
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 25, 2025 · Artificial Intelligence

Boost Your AI Search Skills: Advanced Prompt & Query Tricks

This guide explains how to leverage AI tools with deep web‑search capabilities, covering site‑specific queries, wildcard operators, date ranges, Boolean logic, and effective prompt engineering techniques—including Socratic questioning and CRISPE framework—to improve information retrieval accuracy and efficiency across various domains.

AILarge Language ModelsSearch Operators
0 likes · 8 min read
Boost Your AI Search Skills: Advanced Prompt & Query Tricks
AI Frontier Lectures
AI Frontier Lectures
Mar 24, 2025 · Artificial Intelligence

What Can AI Agents Learn from the Latest AIR 2025 Research?

The article compiles insights from the AIR 2025 conference and related talks, covering the evolution of agents from reinforcement‑learning to LLM‑driven systems, novel agent architectures like AIDE, GUI agents, natural‑language reinforcement learning, and scaling advances in large language models such as Qwen, while highlighting key algorithms, benchmarks, and open research questions.

AI agentsAgent ArchitectureGUI agents
0 likes · 27 min read
What Can AI Agents Learn from the Latest AIR 2025 Research?
Model Perspective
Model Perspective
Mar 21, 2025 · Artificial Intelligence

How DeepSeek’s Tree‑Based Reasoning Transforms AI Interaction

DeepSeek’s R1 inference mode replaces linear chain‑of‑thought with a transparent, multi‑path tree reasoning system, offering layered analysis, intent understanding, memory management, emotion detection, and hallucination mitigation, illustrated through a practical example of buying authentic cigarettes and detailed technical breakdowns.

Artificial IntelligenceLarge Language ModelsMemory
0 likes · 16 min read
How DeepSeek’s Tree‑Based Reasoning Transforms AI Interaction
Continuous Delivery 2.0
Continuous Delivery 2.0
Mar 21, 2025 · Artificial Intelligence

AI-Driven Automated Unit Test Generation Framework: Architecture, Workflow, and Evaluation

This article presents an AI‑powered framework that automatically scans codebases, generates comprehensive unit tests using large language models, and includes self‑repair agents, detailing its workflow, core components, strategies for accuracy, practical benefits, and current limitations.

AI testingLarge Language Modelsself-repair
0 likes · 9 min read
AI-Driven Automated Unit Test Generation Framework: Architecture, Workflow, and Evaluation
AI Algorithm Path
AI Algorithm Path
Mar 20, 2025 · Artificial Intelligence

Understanding Multimodal Large Language Models: Recent Advances and Comparative Analysis

This article surveys the latest multimodal large language model research, dissecting the design, training strategies, and performance trade‑offs of models such as Llama 3.2, Molmo, NVLM, Qwen2‑VL, Pixtral, MM1.5, Emu3, and Janus, and highlights the challenges of fair cross‑model evaluation.

AI researchCross-AttentionLarge Language Models
0 likes · 16 min read
Understanding Multimodal Large Language Models: Recent Advances and Comparative Analysis
AI Frontier Lectures
AI Frontier Lectures
Mar 20, 2025 · Artificial Intelligence

Why Multimodal LLMs Still Struggle with Multi-Image Math Reasoning: Insights from MV‑MATH

This article introduces the MV‑MATH dataset, a large‑scale multi‑image math benchmark, and evaluates 24 open‑source and closed‑source multimodal large language models, revealing significant performance gaps, especially on complex visual dependencies and higher difficulty levels.

DatasetLarge Language ModelsModel Evaluation
0 likes · 8 min read
Why Multimodal LLMs Still Struggle with Multi-Image Math Reasoning: Insights from MV‑MATH
JavaScript
JavaScript
Mar 20, 2025 · Artificial Intelligence

How MiniMax’s Linear‑Attention Architecture Is Redefining Long‑Context AI Models

MiniMax’s rapid 2025 releases—including a video model, open‑source LLM, and high‑fidelity voice model—showcase its multimodal linear‑attention architecture that handles up to 4 million tokens, earns a16z recognition, and signals China’s growing influence in open‑source AI innovation.

Artificial IntelligenceLarge Language ModelsLinear Attention
0 likes · 8 min read
How MiniMax’s Linear‑Attention Architecture Is Redefining Long‑Context AI Models
AI Frontier Lectures
AI Frontier Lectures
Mar 17, 2025 · Artificial Intelligence

Can Diffusion Models Outrun Traditional LLMs? Mercury Coder’s Speed & Architecture

The article analyzes Mercury Coder, a diffusion‑based language model that generates text and code in parallel, compares its speed and quality against traditional autoregressive LLMs like GPT‑4o‑mini using a ball‑collision benchmark, and discusses the underlying score‑entropy training, current limitations, and future multimodal potential.

AI PerformanceDiffusion ModelsLarge Language Models
0 likes · 8 min read
Can Diffusion Models Outrun Traditional LLMs? Mercury Coder’s Speed & Architecture
ZhongAn Tech Team
ZhongAn Tech Team
Mar 17, 2025 · Artificial Intelligence

Weekly Tech Digest: AI Model Advancements, Strategic Infrastructure Deals, and Industry Insights on AI Agents

This weekly technology digest highlights significant advancements in artificial intelligence, including OpenAI's Python-enabled o1 model, Google's open-source Gemma 3, and Alibaba's AI-driven Quark application, alongside major industry partnerships, expert forecasts on AI agent proliferation, and emerging developments in robotics and wearable technology.

AI agentsArtificial IntelligenceLarge Language Models
0 likes · 7 min read
Weekly Tech Digest: AI Model Advancements, Strategic Infrastructure Deals, and Industry Insights on AI Agents
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 17, 2025 · Artificial Intelligence

23 Proven Prompt Engineering Techniques to Make AI Understand You Instantly

As large language models become increasingly adept at natural language, mastering prompt engineering remains essential; this article compiles 23 practical strategies—from concise commands and role‑playing to structured formatting and output guidance—that empower users to communicate clearly with AI and obtain high‑quality, targeted results.

AI interactionAI productivityLarge Language Models
0 likes · 18 min read
23 Proven Prompt Engineering Techniques to Make AI Understand You Instantly
Fighter's World
Fighter's World
Mar 14, 2025 · Industry Insights

Will the 10× Growth Promise of Vertical AI Crumble as Generalist LLMs Like Manus Dominate the Market?

The article examines whether the surge of general‑purpose large language models such as Manus, Claude Sonet, and Qwen undermines the Bessemer Venture Partners claim that Vertical AI will grow tenfold, by analysing market size, use‑case demand, technical challenges, emerging business models, and competitive moats.

AI MarketAI agentsBusiness Models
0 likes · 19 min read
Will the 10× Growth Promise of Vertical AI Crumble as Generalist LLMs Like Manus Dominate the Market?
Zhihu Tech Column
Zhihu Tech Column
Mar 14, 2025 · Artificial Intelligence

Insights from Zhihu’s ZhiLight Large Model Inference Framework: Architecture, Parallelism, and Performance Optimizations

The article summarizes Zhihu’s technical talk on the ZhiLight large‑model inference framework, detailing model execution mechanisms, GPU load analysis, multi‑GPU parallel strategies, open‑source engine comparisons, compute‑communication overlap, quantization techniques, benchmark results, and future directions for scalable LLM deployment.

GPU parallelismLarge Language ModelsSGLang
0 likes · 11 min read
Insights from Zhihu’s ZhiLight Large Model Inference Framework: Architecture, Parallelism, and Performance Optimizations
Alimama Tech
Alimama Tech
Mar 14, 2025 · Artificial Intelligence

Advances in Search Advertising Models with Large Language Models (2024)

In 2024 Alibaba Mama outlines how large‑language models transform search advertising through a three‑line scaling roadmap—explicit inductive‑bias design, implicit compute growth, and auxiliary CV/NLP advances—implemented via a pre‑train/post‑train/CTR paradigm and the LUM user‑behavior model, promising gains in relevance, recall, and real‑time serving while highlighting inference efficiency challenges.

CTR predictionLarge Language Modelsmultimodal embedding
0 likes · 25 min read
Advances in Search Advertising Models with Large Language Models (2024)
Baidu Tech Salon
Baidu Tech Salon
Mar 13, 2025 · Artificial Intelligence

How PaddlePaddle 3.0 Boosts Large‑Model Inference with 4‑Bit Quantization and MLA Optimizations

PaddlePaddle 3.0 introduces a full‑stack inference engine that supports FP8, INT8, and 4‑bit quantization for popular LLMs such as DeepSeek V3/R1, delivers up to 2× token throughput on a single H800 GPU, and provides detailed deployment scripts for single‑node and multi‑node setups, including MTP speculative decoding and SageAttention for long‑sequence acceleration.

DockerInference OptimizationLarge Language Models
0 likes · 13 min read
How PaddlePaddle 3.0 Boosts Large‑Model Inference with 4‑Bit Quantization and MLA Optimizations
Alibaba Cloud Big Data AI Platform
Alibaba Cloud Big Data AI Platform
Mar 13, 2025 · Artificial Intelligence

From Chain‑of‑Thought to Self‑Evolving Agents: Lessons from AI Agent Engineering

This article traces the evolution of large‑model agents from a simple chain‑of‑thought design through tool and agent instantiation, structured PEER patterns, and self‑evolving architectures, highlighting practical challenges, middleware solutions, and open‑source resources for building robust AI agents.

AI agentsAgent ArchitectureLarge Language Models
0 likes · 16 min read
From Chain‑of‑Thought to Self‑Evolving Agents: Lessons from AI Agent Engineering
Architects' Tech Alliance
Architects' Tech Alliance
Mar 11, 2025 · Artificial Intelligence

How DeepSeek’s Breakthrough AI Models Thrive on Huawei Ascend: A Deep Dive

An in‑depth analysis reveals how DeepSeek’s V3 and R1 large‑language models achieve superior inference performance and cost efficiency on Huawei’s Ascend AI platform, detailing architectural optimizations, KV‑Cache reductions, multimodal support, real‑world deployments across finance, government, manufacturing, and the projected impact on the AI industry.

AI OptimizationDeepSeekHuawei Ascend
0 likes · 4 min read
How DeepSeek’s Breakthrough AI Models Thrive on Huawei Ascend: A Deep Dive
AI Algorithm Path
AI Algorithm Path
Mar 11, 2025 · Artificial Intelligence

AI Agents Overview: Foundations, Core Components, and When to Use Them

This article provides a comprehensive overview of AI Agents, tracing their evolution from traditional chatbots to LLM‑driven agents, explaining core components such as perception, reasoning, action, knowledge bases, learning and communication interfaces, and discussing practical use cases, interaction cycles, and future prospects.

AI agentsLarge Language ModelsRetrieval Augmented Generation
0 likes · 15 min read
AI Agents Overview: Foundations, Core Components, and When to Use Them
58 Tech
58 Tech
Mar 11, 2025 · Artificial Intelligence

Applying Large Language Models to Real Estate Recommendation: Case Studies and Optimization Techniques

This article presents a comprehensive case study on how large language models are integrated into 58.com’s real‑estate recommendation platform, detailing challenges, data adaptation, prompt and parameter optimizations, embedding generation, conversational recommendation, and future directions for multimodal and generative recommendation systems.

AI OptimizationEmbeddingLarge Language Models
0 likes · 14 min read
Applying Large Language Models to Real Estate Recommendation: Case Studies and Optimization Techniques
Efficient Ops
Efficient Ops
Mar 9, 2025 · Artificial Intelligence

Essential LLMOps Tools: Build, Deploy, Monitor, and Manage Large Language Models

LLMOps, the end-to-end methodology for managing large language models, encompasses a curated set of development, deployment, monitoring, and local management tools—such as LangChain, vLLM, LangSmith, and Ollama—enabling practitioners to efficiently build, scale, and maintain AI applications.

AI DevelopmentLLMOpsLarge Language Models
0 likes · 6 min read
Essential LLMOps Tools: Build, Deploy, Monitor, and Manage Large Language Models