Tagged articles

2011 articles

Page 5 of 21

Mar 23, 2026 · Artificial Intelligence

How OpenClaw Turns AI Agents into Real‑World Automation Tools

OpenClaw is an AI Agent framework that bridges chat platforms and large language models, enabling automated tasks through context‑engineered prompts, tool usage, memory management, sub‑agents, and security controls, while illustrating practical examples, workflow steps, and mitigation strategies for potential shell‑command exploits.

AI AgentLLMOpenClaw

0 likes · 18 min read

How OpenClaw Turns AI Agents into Real‑World Automation Tools

Architect

Mar 22, 2026 · Artificial Intelligence

Can Frozen LLMs Keep Learning? Inside Memento‑Skills' Deployment‑Time Learning

The article analyses the Memento‑Skills paper and its open‑source implementation, showing how a frozen large language model can continuously improve by treating skills as external memory, using a five‑step Observe‑Read‑Act‑Feedback‑Write loop, advanced routing, and modular architecture to achieve significant gains on GAIA and HLE benchmarks.

AI ArchitectureAgentDeployment-Time Learning

0 likes · 21 min read

Can Frozen LLMs Keep Learning? Inside Memento‑Skills' Deployment‑Time Learning

Woodpecker Software Testing

Mar 22, 2026 · Artificial Intelligence

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

This article explains why traditional API, assertion, and UI testing fail for Retrieval‑Augmented Generation (RAG) systems, and presents a four‑step, evidence‑driven testing framework—including golden test sets, dual‑track validation, chaos engineering, and continuous trust dashboards—to ensure factual reliability and operational robustness in real‑world deployments.

Fact CheckingLLMOpenTelemetry

0 likes · 8 min read

How to Test Retrieval‑Augmented Generation Systems: Practical Strategies for 2024

Radish, Keep Going!

Mar 22, 2026 · Artificial Intelligence

Why LLMs Pick 7200–7500: The Hidden Bias Behind Their ‘Random’ Numbers

A Reddit experiment showed that major LLMs consistently choose numbers between 7200 and 7500 when asked to pick a random value from 1 to 10000, revealing deterministic patterns rooted in training data bias, human heuristics, and serious security implications.

AIBiasLLM

0 likes · 11 min read

Why LLMs Pick 7200–7500: The Hidden Bias Behind Their ‘Random’ Numbers

AI Product Manager Community

Mar 22, 2026 · Artificial Intelligence

How to Build a Systematic Solution for LLM Hallucinations in Enterprise AI

This article outlines a comprehensive, multi‑layered approach—including data anchoring, architectural guardrails, prompt engineering, and LLMOps—to mitigate hallucinations in large language models for enterprise applications.

AI guardrailsKnowledge GraphLLM

0 likes · 7 min read

How to Build a Systematic Solution for LLM Hallucinations in Enterprise AI

PaperAgent

Mar 22, 2026 · Artificial Intelligence

Can LLM Agents Self‑Evolve Without Retraining? Inside Memento‑Skills

The article analyzes the Memento‑Skills framework, which treats external memory as executable skills to enable deployment‑time continual learning for frozen LLM agents, detailing its read‑write reflective loop, skill‑as‑memory design, behavior‑trained skill router, experimental validation on GAIA and HLE benchmarks, and theoretical guarantees without gradient updates.

AIAgentLLM

0 likes · 9 min read

Can LLM Agents Self‑Evolve Without Retraining? Inside Memento‑Skills

Architect's Journey

Mar 22, 2026 · Artificial Intelligence

Cursor Builds Its Own Code Model, Cutting Costs 90% and Sparking a Price War

Cursor unveiled Composer 2, a self‑trained code‑only LLM that scores 61.3 on its CursorBench, outperforms Claude Opus 4.6 and rivals GPT‑5.4, while charging just $0.50 per million tokens—about one‑tenth the price of competitors—prompting a new AI pricing battle.

AI competitionCode GenerationComposer 2

0 likes · 5 min read

Cursor Builds Its Own Code Model, Cutting Costs 90% and Sparking a Price War

DataFunSummit

Mar 21, 2026 · Artificial Intelligence

How Slidebatching Revolutionizes LLM Inference Scheduling for Faster, More Efficient AI Services

The article examines the memory and latency challenges of 1750‑billion‑parameter LLM inference, introduces the xLLM framework’s Slidebatching and PD‑separation scheduling strategies, and details how these techniques achieve up to 35% system‑throughput gains and 52% SLO compliance improvements in real‑world multi‑priority workloads.

AI PerformanceLLMPD separation

0 likes · 15 min read

How Slidebatching Revolutionizes LLM Inference Scheduling for Faster, More Efficient AI Services

Data Party THU

Mar 21, 2026 · Artificial Intelligence

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

The article explains that expanding LLM context windows leads to attention dilution and retrieval collapse, degrading answer quality, and argues that Retrieval‑Augmented Generation remains essential because it preserves signal density through focused retrieval and selective prompting.

AI ArchitectureAttention DilutionLLM

0 likes · 8 min read

Why Bigger Context Windows Hurt LLMs and How RAG Still Wins

SuanNi

Mar 21, 2026 · Artificial Intelligence

Mastering Context Engineering: Six Pillars, Retrieval Strategies, and Structured Output

This article explains the six pillars of context engineering, focusing on structuring techniques, advanced retrieval methods, hybrid search, reranking, query transformation, and practical pipelines that turn raw data into reliable, LLM‑ready inputs for higher quality AI responses.

Hybrid SearchLLMRAG

0 likes · 14 min read

Mastering Context Engineering: Six Pillars, Retrieval Strategies, and Structured Output

Architect's Guide

Mar 21, 2026 · Artificial Intelligence

Turn PDFs, Word Docs, and Images into Instant Answers with WeKnora’s LLM‑Powered Search

WeKnora is a Tencent‑open‑source LLM‑based document understanding and semantic search framework that extracts structured content from PDFs, Word files and images, offers agent‑driven reasoning, multi‑modal retrieval, and a modular architecture, with step‑by‑step Docker deployment and a web UI for instant querying.

AILLMRAG

0 likes · 7 min read

Turn PDFs, Word Docs, and Images into Instant Answers with WeKnora’s LLM‑Powered Search

Machine Learning Algorithms & Natural Language Processing

Mar 20, 2026 · Artificial Intelligence

Cursor’s Composer 2 Beats Claude Opus 4.6 with ‘Ankle‑Cut’ Pricing via New Reinforcement‑Learning Method

Cursor’s newly released Composer 2 model surpasses Claude Opus 4.6 on benchmarks such as Terminal‑Bench 2.0, offers dramatically lower token pricing, and achieves these gains by introducing a novel self‑summary reinforcement‑learning technique that compresses long‑context tasks while preserving critical information.

BenchmarkComposer 2Cursor

0 likes · 9 min read

Cursor’s Composer 2 Beats Claude Opus 4.6 with ‘Ankle‑Cut’ Pricing via New Reinforcement‑Learning Method

Architect's Alchemy Furnace

Mar 20, 2026 · Artificial Intelligence

Why Vector‑Based RAG Falls Short and How PageIndex’s Reasoning‑Based Retrieval Solves It

This article analyzes the fundamental limitations of traditional vector‑based Retrieval‑Augmented Generation, introduces Vectify AI’s reasoning‑driven PageIndex framework, and explains how hierarchical, non‑vector indexing enables more accurate, context‑aware document retrieval for complex, domain‑specific texts.

AILLMPageIndex

0 likes · 15 min read

Why Vector‑Based RAG Falls Short and How PageIndex’s Reasoning‑Based Retrieval Solves It

AI Engineering

Mar 20, 2026 · Artificial Intelligence

Cursor Unveils Composer 2: A Code‑Focused Model Priced at a Fraction of GPT‑5

Cursor's Composer 2, a code‑only AI model, jumps from a 44.2 to 61.3 benchmark score, outperforms Claude Opus 4.6, nears GPT‑5.4, and costs just $0.50 per million tokens, reshaping its strategy after heavy reliance on external APIs.

AI modelBenchmarkCode Generation

0 likes · 4 min read

Cursor Unveils Composer 2: A Code‑Focused Model Priced at a Fraction of GPT‑5

Wu Shixiong's Large Model Academy

Mar 20, 2026 · Artificial Intelligence

Mastering MinerU: Overcoming Its Top 9 Limitations for Reliable Document Parsing

This article examines MinerU's strengths and nine critical shortcomings—such as layout order errors, cross‑page table splits, merged‑cell failures, OCR misrecognition, and licensing issues—and provides concrete improvement strategies, interview‑ready resume bullets, and practical response frameworks for engineers.

LLMLayout AnalysisMinerU

0 likes · 13 min read

Mastering MinerU: Overcoming Its Top 9 Limitations for Reliable Document Parsing

Shuge Unlimited

Mar 20, 2026 · Artificial Intelligence

How a Single Gateway Manages 30+ Messaging Platforms in OpenClaw

This article dissects OpenClaw’s low‑level architecture, showing how a long‑lived Gateway process coordinates over 30 messaging platforms, how the pi‑mono embedded agent runtime drives the thinking cycle, and how the Context Engine and Session Management ensure consistent state, persistence, and extensibility.

AI AssistantContext EngineLLM

0 likes · 17 min read

How a Single Gateway Manages 30+ Messaging Platforms in OpenClaw

Machine Learning Algorithms & Natural Language Processing

Mar 19, 2026 · Artificial Intelligence

Inside Xiaomi’s Hunter Alpha: 1‑Trillion‑Parameter LLM with 1M Context and Top Global Rankings

Xiaomi’s newly unveiled MiMo‑V2‑Pro, codenamed Hunter Alpha, is a trillion‑parameter LLM with a 1 million‑token context window that tops OpenRouter usage, achieves the second‑best domestic and eighth‑best global scores on Artificial Analysis, and delivers strong benchmark results across PinchBench, ClawEval, and SWE‑bench.

BenchmarkLLMMiMo-V2-Pro

0 likes · 9 min read

Inside Xiaomi’s Hunter Alpha: 1‑Trillion‑Parameter LLM with 1M Context and Top Global Rankings

DeepHub IMBA

Mar 19, 2026 · Artificial Intelligence

Understanding Agent Memory: From Stateless LLMs to Persistent Multi‑Layer Architecture

The article analyzes why large language models are inherently stateless, outlines a four‑layer memory architecture for AI agents—including working, situational, semantic, and procedural memory—and explains write, retrieval, and forgetting mechanisms along with current tooling such as Mem0 and Letta.

Agent MemoryContext WindowLLM

0 likes · 9 min read

Understanding Agent Memory: From Stateless LLMs to Persistent Multi‑Layer Architecture

JakartaEE China Community

Mar 19, 2026 · Artificial Intelligence

Quarkus Meets LangChain4j: A Guide to Integrating Large Language Models

This article explains how the new quarkus‑langchain4j 0.1 extension lets developers embed LLMs into Quarkus applications using a declarative @RegisterAiService API, with support for tools, document stores, RAG, and full code examples.

AI integrationJavaLLM

0 likes · 13 min read

Quarkus Meets LangChain4j: A Guide to Integrating Large Language Models

PaperAgent

Mar 19, 2026 · Artificial Intelligence

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

The article presents the MDER‑DR two‑stage framework that tackles semantic loss in knowledge‑graph triple indexing by generating context‑aware entity summaries and using an LLM‑driven decompose‑parse retrieval loop, achieving up to 66% performance gains on multi‑hop question answering benchmarks.

Entity SummarizationKG QAKnowledge Graph

0 likes · 5 min read

How MDER‑DR Boosts Multi‑Hop KG QA with Entity‑Centric Summaries

AgentGuide

Mar 19, 2026 · Artificial Intelligence

What Exactly Is an AI Agent? Complete Interview Guide

This article breaks down the concept of AI agents for interview preparation, covering their definition, core components like planning, memory, and tool use, differences from plain LLM chats, real‑world challenges, typical use cases, detailed component analysis, and a runnable pseudo‑code example.

AI AgentLLMMemory

0 likes · 9 min read

What Exactly Is an AI Agent? Complete Interview Guide

Architect's Ambition

Mar 18, 2026 · Artificial Intelligence

From Zero to a Real AI Agent: Master Its Core Essence, Not Just API Calls

The article explains why an AI Agent is more than a simple LLM API call, outlines its four essential modules—memory, planning, tool use, and feedback—shows how they differ from ordinary models, and offers practical steps and common pitfalls for building a production‑grade single‑agent system.

AI AgentFeedback LoopLLM

0 likes · 13 min read

From Zero to a Real AI Agent: Master Its Core Essence, Not Just API Calls

AI Explorer

Mar 18, 2026 · Artificial Intelligence

Run and Fine‑Tune Hundreds of Open‑Source LLMs Locally with Unsloth

Unsloth offers a unified web UI that accelerates fine‑tuning by up to 2×, cuts VRAM usage by 70% (80% for RL), supports hundreds of open‑source models, and provides simple installation steps for rapid local AI experimentation.

AI workstationGPU OptimizationLLM

0 likes · 6 min read

Run and Fine‑Tune Hundreds of Open‑Source LLMs Locally with Unsloth

AI Explorer

Mar 18, 2026 · Artificial Intelligence

Unlock Instant AI Agents with LangGraph‑Powered Deep Agents

Deep Agents, an open‑source framework built on LangGraph, bundles planning, file‑system tools, sub‑agent coordination and context management into a ready‑to‑run AI agent that can be launched with three lines of Python code and fully customized for diverse applications.

AI agentsAgent FrameworkDeep Agents

0 likes · 7 min read

Unlock Instant AI Agents with LangGraph‑Powered Deep Agents

DeepHub IMBA

Mar 18, 2026 · Artificial Intelligence

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

The article analyzes how most RAG pipelines blindly feed retrieved documents to LLMs, introduces CRAG's lightweight evaluator with confidence thresholds, describes its sentence‑level decomposition, filtering, and dual‑knowledge routing, and provides a full implementation walkthrough with a real insurance query example.

CRAGFAISSLLM

0 likes · 13 min read

CRAG Architecture Explained: Fixing Erroneous Retrieval Results Before the Generator

o-ai.tech

Mar 18, 2026 · Artificial Intelligence

How Anthropic Builds Effective AI Agents: Practical Patterns and Principles

This guide distills Anthropic’s frontline experience into a concise framework for building high‑performing AI agents, covering the workflow‑vs‑agent distinction, five composable architecture patterns, core design principles, tool‑centric optimization, and pragmatic advice on using or bypassing agent frameworks.

AI agentsAnthropicLLM

0 likes · 9 min read

How Anthropic Builds Effective AI Agents: Practical Patterns and Principles

Huolala Tech

Mar 18, 2026 · Artificial Intelligence

Boosting LLM Accuracy: From RAG to GraphRAG for Enterprise Metadata Retrieval

This article explains the fundamentals of Retrieval‑Augmented Generation (RAG), introduces GraphRAG as an advanced architecture using knowledge graphs, details implementation pipelines, evaluates performance improvements, analyzes common pitfalls, and outlines future enhancements for enterprise metadata search.

AIGraphRAGKnowledge Graph

0 likes · 17 min read

Boosting LLM Accuracy: From RAG to GraphRAG for Enterprise Metadata Retrieval

Data STUDIO

Mar 18, 2026 · Artificial Intelligence

Building a Smart Web AI Agent with FastAPI, LangGraph, and MCP

This article walks through the design and implementation of a production‑ready Web AI agent that uses FastAPI as the HTTP layer, LangGraph to orchestrate multi‑step reasoning, and MCP to expose external tools, showing how to manage state, integrate multiple LLM providers, and extend the system with persistence, rate‑limiting, and monitoring.

AI AgentFastAPILLM

0 likes · 20 min read

Building a Smart Web AI Agent with FastAPI, LangGraph, and MCP

SuanNi

Mar 18, 2026 · Artificial Intelligence

Explore the LLM Architecture Gallery: Visualizing Seven Years of Model Evolution

The LLM Architecture Gallery, created by Sebastian Raschka, offers an interactive visual compendium of open‑weight large language models from 2019 to 2026, detailing their core parameters, architectural innovations, and the broader trends shaping modern AI research.

AILLMModel architecture

0 likes · 8 min read

Explore the LLM Architecture Gallery: Visualizing Seven Years of Model Evolution

AI Tech Publishing

Mar 18, 2026 · Artificial Intelligence

How Context Engineering Turns AI Agents from ‘Usable’ to ‘Highly Effective’

The article explains how organizing the prompt, tool schemas, dialogue history, and retrieved documents—collectively the context window—affects an AI agent’s decisions, introduces the concepts of Lost‑in‑the‑Middle, Thinking Tokens, tool‑response caching, compaction versus SubAgent strategies, and shows a step‑by‑step evolution that raised accuracy from 60 % to over 95 %.

AI agentsContext EngineeringLLM

0 likes · 9 min read

How Context Engineering Turns AI Agents from ‘Usable’ to ‘Highly Effective’

Machine Learning Algorithms & Natural Language Processing

Mar 17, 2026 · Artificial Intelligence

From Lists to Decision Reports: The Deep Research Framework for Recommender Systems

The paper introduces Deep Research for Recommender Systems, a multi‑agent framework called RecPilot that replaces traditional list‑based recommendations with AI‑driven exploration, trajectory simulation, and structured decision‑support reports, and demonstrates its superiority on TMALL data through extensive trajectory and report‑generation evaluations.

Deep ResearchLLMMulti-Agent

0 likes · 10 min read

From Lists to Decision Reports: The Deep Research Framework for Recommender Systems

Woodpecker Software Testing

Mar 17, 2026 · Artificial Intelligence

Why Direct Prompting Beats LLM Knowledge‑Base Agents for Test‑Case Generation

The article explains that feeding requirements directly in the prompt yields far better test‑case designs than using an LLM‑powered knowledge‑base agent, because the model processes the full context without the loss and fragmentation introduced by retrieval‑augmented generation.

AI agentsKnowledge BaseLLM

0 likes · 6 min read

Why Direct Prompting Beats LLM Knowledge‑Base Agents for Test‑Case Generation

DeepHub IMBA

Mar 17, 2026 · Artificial Intelligence

Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition

The article examines how retrieval‑augmented generation suffers from poor query formulation and presents two advanced strategies—query translation, which generates multiple semantically similar variants, and query decomposition, which breaks complex questions into finer sub‑queries—detailing methods such as fan‑out retrieval, reciprocal rank fusion, HyDE, step‑back prompting, and chain‑of‑thought retrieval, and explains when to combine them.

Hybrid RetrievalLLMQuery Decomposition

0 likes · 9 min read

Advanced RAG Techniques: Boosting Retrieval with Query Translation and Decomposition

IT Services Circle

Mar 17, 2026 · Artificial Intelligence

How AI Workflows and Agents Transform Automation: From Rigid Rules to Intelligent Decision‑Making

This article explains the distinction and synergy between traditional AI workflows and modern agents, outlines their four‑step processes, showcases practical examples such as intelligent customer service and content generation, and recommends tools for beginners to quickly build AI‑driven applications.

AIAgentAutomation

0 likes · 11 min read

How AI Workflows and Agents Transform Automation: From Rigid Rules to Intelligent Decision‑Making

Data Party THU

Mar 17, 2026 · Artificial Intelligence

How OpenMAIC Is Redefining AI-Powered Learning: From Multi‑Agent Labs to Classroom Revolution

OpenMAIC, the world’s first multi‑agent generative learning framework released by Tsinghua University, transforms technical documents into zero‑barrier interactive courses, supports AI‑driven lesson planning, multi‑agent discussions, and plug‑in extensions, and is rapidly evolving through 2024‑2026 to reshape education and beyond.

AI educationLLMMulti-Agent AI

0 likes · 10 min read

How OpenMAIC Is Redefining AI-Powered Learning: From Multi‑Agent Labs to Classroom Revolution

PaperAgent

Mar 17, 2026 · Artificial Intelligence

Can Attention Replace Fixed Residuals? Inside the ‘Attention Residuals’ Breakthrough

This article analyzes the newly released Attention Residuals paper, explaining how learnable attention weighting replaces fixed residual addition to mitigate information dilution in deep LLMs, detailing the proposed Block AttnRes design, engineering trade‑offs, experimental results, and its significance for foundational model architecture.

Block AttentionDeep LearningLLM

0 likes · 9 min read

Can Attention Replace Fixed Residuals? Inside the ‘Attention Residuals’ Breakthrough

AI Step-by-Step

Mar 17, 2026 · Industry Insights

How OpenClaw Redefines Enterprise Software with AI‑Powered Business Integration

The article analyzes OpenClaw as an AI‑driven business‑connection layer that unifies chat entry, tool execution, and event‑driven automation, showing how enterprises can shorten system gaps, automate long‑tail workflows, and adopt a new agent‑based service model without replacing existing ERP or CRM solutions.

AI agentsLLMOpenClaw

0 likes · 8 min read

How OpenClaw Redefines Enterprise Software with AI‑Powered Business Integration

Wu Shixiong's Large Model Academy

Mar 17, 2026 · Artificial Intelligence

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

Chunk splitting, a critical yet often overlooked step in RAG pipelines, dramatically impacts retrieval recall and LLM output quality; this guide walks through three evolution stages—from naive fixed‑length splits to sentence‑aware overlaps and finally semantic, structure‑driven segmentation—complete with code, experiments, and practical pitfalls.

LLMRAGchunking

0 likes · 15 min read

Mastering Chunk Splitting for RAG: From Fixed Length to Semantic Segmentation

Tencent Cloud Developer

Mar 17, 2026 · Artificial Intelligence

Why Anthropic Skips Function Calling: Inside the 5 Skill Execution Modes

This article dissects Anthropic's Skill framework, revealing how it drives AI agents through five distinct execution modes—pure prompt injection, script execution, library calls, progressive document loading, and workflow orchestration—while avoiding function‑calling registration and optimizing token usage.

AIAgentFunction Calling

0 likes · 32 min read

Why Anthropic Skips Function Calling: Inside the 5 Skill Execution Modes

Model Perspective

Mar 16, 2026 · Artificial Intelligence

Can AI‑Generated “Silicon Samples” Replace Real Survey Respondents?

The article explains how large language models can simulate virtual respondents—called silicon samples—to generate synthetic survey data, outlines the four fidelity criteria for evaluating their credibility, and demonstrates practical workflows with the open‑source EDSL Python library.

EDSLLLMPython

0 likes · 14 min read

Can AI‑Generated “Silicon Samples” Replace Real Survey Respondents?

Architect's Ambition

Mar 16, 2026 · Artificial Intelligence

Understanding AI Agents: From Chatting to Getting Things Done

The article explains the four essential components of AI Agents—brain, memory, tool, and planning layers—illustrates their implementation with Python code, compares planning strategies, shares a real-world OOM fault‑diagnosis case, and lists common pitfalls to help newcomers build functional agents.

AI AgentLLMMemory Management

0 likes · 17 min read

Understanding AI Agents: From Chatting to Getting Things Done

AI Engineer Programming

Mar 16, 2026 · Artificial Intelligence

Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents

The article explains that the term “Agent development” hides a fundamental split between Agent Frameworks, which give developers building blocks to assemble their own agents, and Agent Harnesses, which provide ready‑to‑run agents, and shows how this distinction affects decisions, maintenance, and troubleshooting.

AI EngineeringAgentClaude Code

0 likes · 10 min read

Why “Agent Development” Misleads: Framework vs. Harness in LLM Agents

macrozheng

Mar 16, 2026 · Artificial Intelligence

How LLMFit Automates Hardware Compatibility Checks for Local Large‑Model Deployment

LLMFit, a Rust‑based terminal tool, automatically detects system hardware, recommends optimal quantization levels, and scores models across multiple dimensions, enabling developers to quickly identify and run large language models that suit their machines without trial‑and‑error.

CLI toolLLMModel Quantization

0 likes · 5 min read

How LLMFit Automates Hardware Compatibility Checks for Local Large‑Model Deployment

Wu Shixiong's Large Model Academy

Mar 16, 2026 · Artificial Intelligence

Designing a Complete RAG System from Zero: A Step‑by‑Step Interview Guide

This article outlines a full‑stack RAG architecture—offline parsing, query understanding, online retrieval, and context generation—explains six critical module interactions, and provides a concise interview framework for presenting the design from start to finish.

Interview PreparationLLMRAG

0 likes · 14 min read

Designing a Complete RAG System from Zero: A Step‑by‑Step Interview Guide

Code Mala Tang

Mar 16, 2026 · Artificial Intelligence

Turn Opencode into a Persistent Research Assistant with Simple Agent Setup

This guide shows how to extend the open‑source Opencode LLM coding tool with a custom agent and folder structure so it can store topic summaries, retain memory across sessions, and serve as a powerful terminal‑based research assistant.

AgentKnowledge BaseLLM

0 likes · 8 min read

Turn Opencode into a Persistent Research Assistant with Simple Agent Setup

Bighead's Algorithm Notes

Mar 15, 2026 · Artificial Intelligence

Paper Reading: TiMi – An Inference‑Driven Multi‑Agent System for Quantitative Trading

TiMi is a reasoning‑driven multi‑agent framework that decouples strategy development from minute‑level deployment, leverages LLMs for semantic analysis, code generation and mathematical reasoning, and achieves stable profits, high execution efficiency and strong risk control across more than 200 stock and crypto trading pairs.

Financial AILLMMulti-Agent System

0 likes · 17 min read

Paper Reading: TiMi – An Inference‑Driven Multi‑Agent System for Quantitative Trading

Machine Learning Algorithms & Natural Language Processing

Mar 15, 2026 · Artificial Intelligence

Is RL Dead in LLM Post-Training? MIT’s RandOpt Challenges Traditional Methods

The MIT‑CSAIL paper introduces RandOpt, a single‑step, gradient‑free, fully parallel post‑training algorithm that adds Gaussian noise to pretrained LLM weights and ensembles the results, achieving or surpassing PPO/GRPO performance by exploiting dense "neural thickets" that emerge as model scale grows.

LLMRandOptensemble

0 likes · 12 min read

Is RL Dead in LLM Post-Training? MIT’s RandOpt Challenges Traditional Methods

Woodpecker Software Testing

Mar 15, 2026 · Artificial Intelligence

AI‑Powered Intelligent Regression Testing: Turning Tests into Precise, Real‑Time Defense (2026)

In 2026, intelligent regression testing leverages fine‑tuned LLMs, runtime dependency graphs, and business‑impact weighting to shrink test suites from thousands to dozens, cut execution time by over 90 %, and shift quality from static coverage to real‑time, AI‑driven risk mitigation, while demanding new organizational practices.

AI-driven TestingCIIntelligent Regression Testing

0 likes · 8 min read

AI‑Powered Intelligent Regression Testing: Turning Tests into Precise, Real‑Time Defense (2026)

Fun with Large Models

Mar 15, 2026 · Artificial Intelligence

A Complete Guide to 2026’s Hottest Tech Concept: Agent Engineering

The article explains Agent Engineering—a systematic approach that turns nondeterministic large‑language‑model agents into reliable production‑grade applications through an iterative build‑test‑deploy‑observe‑improve loop, combining product, engineering, and data‑science thinking to address unpredictability and achieve continuous growth.

AI AgentData‑Driven OptimizationIterative Development

0 likes · 12 min read

A Complete Guide to 2026’s Hottest Tech Concept: Agent Engineering

Radish, Keep Going!

Mar 15, 2026 · Artificial Intelligence

How Chrome’s WebMCP Lets LLMs Control Browsers Without APIs or Bots

The article examines Chrome 146’s WebMCP standard, showing how declarative and imperative APIs let large language models interact with real browser sessions directly, outperforming prior screenshot‑or‑Playwright tricks in success rate, token cost, speed, and robustness while exposing new challenges for anti‑bot systems.

AI agentsBrowser AutomationChrome

0 likes · 9 min read

How Chrome’s WebMCP Lets LLMs Control Browsers Without APIs or Bots

PaperAgent

Mar 15, 2026 · Artificial Intelligence

Why LLM Tool‑Calling Benchmarks Miss Real Users: Introducing WildToolBench

WildToolBench reveals that existing LLM tool‑calling benchmarks overlook real‑world user behavior, and a comprehensive evaluation of 58 models shows even the strongest agents achieve less than 15% session accuracy, highlighting a huge gap between reported performance and practical usability.

Agentic AIBenchmarkLLM

0 likes · 10 min read

Why LLM Tool‑Calling Benchmarks Miss Real Users: Introducing WildToolBench

Machine Learning Algorithms & Natural Language Processing

Mar 13, 2026 · Artificial Intelligence

Why MCP Is Dead and CLI Is Rising: Perplexity’s Shift Sparks Community Support

Although the Model Context Protocol (MCP) was launched by Anthropic in late 2024 and initially praised, users now report severe context‑window costs, instability, and cumbersome authentication, leading Perplexity and others to abandon it in favor of traditional CLI tools that remain more composable and reliable.

AI toolingAgentAnthropic

0 likes · 8 min read

Why MCP Is Dead and CLI Is Rising: Perplexity’s Shift Sparks Community Support

AI Waka

Mar 13, 2026 · Artificial Intelligence

Rethinking LLM Agents: Stream Tool Outputs Directly to the Client

The article critiques the conventional LLM‑agent loop that forces every tool output back through the model, proposes a dual‑output architecture where tools stream multimedia events directly to the client while still returning a compact semantic result to the model, and demonstrates the design with Python code examples.

AgentLLMPython

0 likes · 14 min read

Rethinking LLM Agents: Stream Tool Outputs Directly to the Client

DeepHub IMBA

Mar 13, 2026 · Artificial Intelligence

Why Bigger Context Windows Make RAG Essential, Not Redundant

Although expanding LLM context windows seems to eliminate the need for Retrieval‑Augmented Generation, in practice larger windows dilute attention and cause retrieval failures, so RAG remains crucial for filtering high‑signal content and maintaining answer quality.

AI ArchitectureAttention DilutionLLM

0 likes · 7 min read

Why Bigger Context Windows Make RAG Essential, Not Redundant

AI Waka

Mar 13, 2026 · Artificial Intelligence

How to Map Enterprise Workflows to Agentic AI Execution Graphs

This article explores the evolution of Agentic AI, outlines a full lifecycle for designing, deploying, and governing AI agents, presents a reference architecture, and demonstrates a practical case study of automating a customer service desk using agentified workflows.

AI ArchitectureAgentic AIEnterprise Automation

0 likes · 15 min read

How to Map Enterprise Workflows to Agentic AI Execution Graphs

AI Engineer Programming

Mar 13, 2026 · Artificial Intelligence

Big Model vs. Big Harness: Who Really Powers AI Agents?

The article examines whether the success of AI agents stems from ever‑stronger large language models or from the surrounding harness—context management, tool orchestration, and reliability engineering—by comparing viewpoints, empirical evaluations, and practical guidance for developers.

AI AgentHarness EngineeringLLM

0 likes · 11 min read

Big Model vs. Big Harness: Who Really Powers AI Agents?

Wu Shixiong's Large Model Academy

Mar 13, 2026 · Artificial Intelligence

Why Every RAG System Needs Smart Query Understanding and Routing

The article explains how diverse user queries in a RAG‑based insurance system require intent classification, entity extraction, and multi‑path routing to choose between vector search, calculation, database lookup, or chit‑chat, and outlines practical rule‑ML‑LLM hybrid solutions with safety safeguards.

LLMQuery UnderstandingRAG

0 likes · 11 min read

Why Every RAG System Needs Smart Query Understanding and Routing

Aikesheng Open Source Community

Mar 12, 2026 · Artificial Intelligence

Which LLM Generates the Best SQL? A 19‑Model Benchmark on a 200M‑Row GitHub Dataset

This article presents a comprehensive benchmark of 19 large language models (plus a human baseline) on generating analytical SQL queries over a 200 million‑row GitHub events dataset, detailing the methodology, metrics, results, and practical guidance for using LLMs in data analysis.

AIBenchmarkLLM

0 likes · 18 min read

Which LLM Generates the Best SQL? A 19‑Model Benchmark on a 200M‑Row GitHub Dataset

Wu Shixiong's Large Model Academy

Mar 12, 2026 · Artificial Intelligence

How to Build Cross-Session Memory for RAG Chatbots: Short‑Term vs Long‑Term Strategies

This article explains the role of memory modules in Retrieval‑Augmented Generation systems, compares short‑term and long‑term memory techniques, outlines storage and retrieval methods, discusses management strategies like forgetting and deduplication, and compares LangChain and LlamaIndex implementations for practical deployment.

LLMLangChainLlamaIndex

0 likes · 11 min read

How to Build Cross-Session Memory for RAG Chatbots: Short‑Term vs Long‑Term Strategies

Java Backend Technology

Mar 12, 2026 · Artificial Intelligence

Why a Decade‑Old Java Library Is Jumping Into the AI Race with TOON

The article introduces TOON, a token‑oriented data format that cuts JSON token usage by 30‑60%, and explains how the veteran Java serialization library json‑io has added full TOON support, offering zero‑config, cyclic‑reference handling, and seamless Spring Boot integration for cost‑effective LLM applications.

LLMSpring BootTOON

0 likes · 7 min read

Why a Decade‑Old Java Library Is Jumping Into the AI Race with TOON

Machine Learning Algorithms & Natural Language Processing

Mar 11, 2026 · Artificial Intelligence

Why LLMs Overthink: ICLR2026 Study Reveals the Key Bottleneck in Inference Efficiency

The ICLR2026 paper identifies reasoning miscalibration—overthinking easy steps and underthinking critical ones—as the root cause of runaway LLM inference costs, and proposes the Budget Allocation Model (BAM) and a training‑free Plan‑and‑Budget framework that smartly distributes compute, achieving up to 70% higher accuracy while cutting token usage by 39% and boosting the new E³ efficiency metric by 193.8%.

Budget Allocation ModelE3 MetricEpistemic Uncertainty

0 likes · 12 min read

Why LLMs Overthink: ICLR2026 Study Reveals the Key Bottleneck in Inference Efficiency

Bighead's Algorithm Notes

Mar 11, 2026 · Artificial Intelligence

Paper Review: AlphaBench – Benchmarking LLMs for Formalized Alpha‑Factor Mining

The article reviews AlphaBench, the first benchmark suite for assessing large language models in formalized alpha‑factor mining (FAFM), detailing its three core tasks—factor generation, evaluation, and search—along with experiments on various commercial and open‑source LLMs that reveal strong potential but challenges in robustness, efficiency, and practical usability.

AlphaBenchBenchmarkFAFM

0 likes · 14 min read

Paper Review: AlphaBench – Benchmarking LLMs for Formalized Alpha‑Factor Mining

AI Waka

Mar 11, 2026 · Artificial Intelligence

Why Context Engineering Is the Secret to Smarter AI Agents

The article explains how context engineering—designing the entire information environment for large language models—overcomes prompt engineering limits, mitigates context decay, and improves speed, accuracy, and cost by strategically selecting, compressing, ordering, isolating, and formatting context for production‑grade AI agents.

AI agentsAWS BedrockContext Engineering

0 likes · 24 min read

Why Context Engineering Is the Secret to Smarter AI Agents

macrozheng

Mar 11, 2026 · Backend Development

Why json-io’s New TOON Support Could Cut LLM Token Costs by Up to 60%

The article introduces json-io’s recent addition of full TOON format support—a token‑oriented data notation that removes JSON’s syntactic noise, saving 30‑60% of tokens for LLM APIs, and shows how to integrate it with Java, Maven, and Spring Boot.

LLMSpring BootTOON

0 likes · 7 min read

Why json-io’s New TOON Support Could Cut LLM Token Costs by Up to 60%

Java Backend Technology

Mar 11, 2026 · Artificial Intelligence

Explore The Agency: 55 AI Agent Roles Organized into 9 Departments

The article introduces The Agency, an open‑source collection of 55 specialized AI role definitions grouped into nine functional departments, explains how each Markdown file describes an agent’s identity, mission, workflow and deliverables, and shows two ways to use the agents with Claude Code or as generic prompt templates.

AIAgent RolesClaude

0 likes · 6 min read

Explore The Agency: 55 AI Agent Roles Organized into 9 Departments

Machine Learning Algorithms & Natural Language Processing

Mar 10, 2026 · Artificial Intelligence

Install OpenClaw in 1 Minute and Deploy It to Feishu with One Click

AutoClaw repackages OpenClaw into a one‑click desktop app that eliminates the multi‑step setup, automatically integrates with Feishu, bundles free GLM models and the new Pony‑Alpha‑2 LLM, and offers a plug‑and‑play skill system for AI‑driven tasks.

AI agentsAutoClawFeishu

0 likes · 8 min read

Install OpenClaw in 1 Minute and Deploy It to Feishu with One Click

Machine Learning Algorithms & Natural Language Processing

Mar 10, 2026 · Artificial Intelligence

Why the First Token Becomes a Value Garbage Bin – LeCun Team Dissects Spike and Attention Sink Mechanics

The paper by Yann LeCun’s team reveals that massive activation spikes and attention sinks in Transformers are not inherently coupled; spikes arise from position‑0 token interactions and specific feed‑forward dynamics, while attention sinks emerge from Pre‑norm normalization and head dimension, offering practical insights for model quantization and long‑context inference.

Attention SinkLLMMassive Activations

0 likes · 9 min read

Why the First Token Becomes a Value Garbage Bin – LeCun Team Dissects Spike and Attention Sink Mechanics

AI Cyberspace

Mar 10, 2026 · Artificial Intelligence

Mastering Prompt Engineering: Techniques to Guide LLMs Effectively

This article explains the fundamentals of prompt engineering for large language models, covering LLM output configuration, length and sampling controls, various prompt types, chain‑of‑thought and tree‑of‑thought reasoning methods, and practical best‑practice guidelines for creating high‑quality prompts.

AI Prompt DesignFew‑Shot LearningLLM

0 likes · 18 min read

Mastering Prompt Engineering: Techniques to Guide LLMs Effectively

PaperAgent

Mar 10, 2026 · Artificial Intelligence

How MemSifter Delivers High‑Precision, Low‑Cost Long‑Term Memory for LLMs

MemSifter introduces a lightweight agent that outsources memory retrieval for large language models, using a Think‑and‑Rank pipeline and a task‑result‑oriented reinforcement‑learning training paradigm to achieve superior retrieval accuracy and efficiency across eight benchmark tasks while keeping inference overhead minimal.

AgentBenchmarkLLM

0 likes · 13 min read

How MemSifter Delivers High‑Precision, Low‑Cost Long‑Term Memory for LLMs

HyperAI Super Neural

Mar 10, 2026 · Artificial Intelligence

Deploy Popular Open‑Source LLMs on Free CPU in Minutes – Qwen3.5, DeepSeek‑R1, Gemma 3, Llama 3.2 and More

This guide shows how to use HyperAI’s free CPU quota to quickly deploy popular open‑source LLMs such as Qwen3.5, DeepSeek‑R1, Gemma 3 and Llama 3.2, walking through environment setup, model download, and inference execution without needing local GPU hardware.

CPUHyperAILLM

0 likes · 6 min read

Deploy Popular Open‑Source LLMs on Free CPU in Minutes – Qwen3.5, DeepSeek‑R1, Gemma 3, Llama 3.2 and More

AI Tech Publishing

Mar 10, 2026 · Artificial Intelligence

Agent Frameworks vs. Agent Harness: Understanding the Key Differences

The article explains how Agent Frameworks and Agent Harness occupy different points on an opinionated spectrum, detailing their abstractions, built‑in components, trade‑offs, and when to choose each, with examples like OpenClaw, LangChain, and Deep Agents.

Agent FrameworkAgent HarnessLLM

0 likes · 5 min read

Agent Frameworks vs. Agent Harness: Understanding the Key Differences

Su San Talks Tech

Mar 10, 2026 · Artificial Intelligence

Inside Nanobot: A Deep Dive into a Lightweight AI Assistant Framework

This article provides a comprehensive walkthrough of the open‑source Nanobot project, detailing its architecture, core configuration, message bus, tool system, LLM provider, context builder, session management, agent loop, channel integration, cron and heartbeat services, and CLI commands, while illustrating each component with code snippets and diagrams.

AI AssistantLLMNanobot

0 likes · 29 min read

Inside Nanobot: A Deep Dive into a Lightweight AI Assistant Framework

PaperAgent

Mar 9, 2026 · Artificial Intelligence

How SkillNet Turns AI Agent Experience into Reusable Skills

SkillNet proposes a three‑layer infrastructure that extracts, evaluates, and connects over 200,000 AI‑agent skills into a structured graph, dramatically improving performance across benchmark environments while turning transient agent experience into durable, reusable assets.

AI agentsLLMSkillNet

0 likes · 6 min read

How SkillNet Turns AI Agent Experience into Reusable Skills

Tencent Technical Engineering

Mar 9, 2026 · Artificial Intelligence

How Does OpenClaw Power Multi‑Agent AI? A Deep Dive into Architecture, Deployment, and Risks

This article explains OpenClaw’s core framework, multi‑agent communication mechanisms, deployment options on cloud or local machines, hardware recommendations, IM tool selection, session and memory management, skill handling, version control, and practical use cases while highlighting important security considerations.

DeploymentLLMMulti-Agent

0 likes · 26 min read

How Does OpenClaw Power Multi‑Agent AI? A Deep Dive into Architecture, Deployment, and Risks

Java Tech Enthusiast

Mar 9, 2026 · Artificial Intelligence

PinchBench: Open‑Source Benchmark for Evaluating LLM‑Powered AI Agents like OpenClaw

PinchBench is an open‑source benchmark that measures the success rate, speed, and cost of large language models when used as the core of AI agents such as OpenClaw across 23 real‑world tasks, providing concrete rankings, usage instructions, and a GitHub repository for developers.

AI AgentLLMOpenClaw

0 likes · 5 min read

PinchBench: Open‑Source Benchmark for Evaluating LLM‑Powered AI Agents like OpenClaw

Shi's AI Notebook

Mar 9, 2026 · Artificial Intelligence

Unpacking the Hype: A Clear Map of LLM, RAG, Agent and Agent Platforms

The article explains why the buzz around AI agents can mislead learners, breaks down overlapping concepts such as LLM, RAG, Tool Use, Agent, Code Agent, and Agent Platform into distinct layers, and outlines a step‑by‑step learning plan to build a solid conceptual map.

AI conceptsAgentAgent Platform

0 likes · 9 min read

Unpacking the Hype: A Clear Map of LLM, RAG, Agent and Agent Platforms

Data STUDIO

Mar 9, 2026 · Artificial Intelligence

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

This article dissects why naive Retrieval‑Augmented Generation (RAG) often yields only 60% accuracy, then presents eleven concrete ingestion, query, and hybrid techniques—complete with code samples, performance trade‑offs, and real‑world case studies—that together can raise RAG accuracy to 94% while outlining practical implementation roadmaps and common pitfalls.

EmbeddingKnowledge GraphLLM

0 likes · 31 min read

Boost RAG Accuracy from 60% to 94% with 11 Proven Strategies

SuanNi

Mar 9, 2026 · Artificial Intelligence

How Hypernetworks Turn Documents into Instant LLM Skills

This article analyzes the memory and adaptation limits of large language models and presents a hypernetwork‑based approach that instantly converts documents or task descriptions into low‑rank LoRA modules, enabling cheap, on‑demand model updates and cross‑modal knowledge transfer.

AILLMLoRA

0 likes · 9 min read

How Hypernetworks Turn Documents into Instant LLM Skills

Java Architecture Diary

Mar 9, 2026 · Artificial Intelligence

Mastering Agent Skills in LangChain4j: Build a Java Code Analysis Assistant

This tutorial introduces LangChain4j's new Agent Skills feature, explains its core concepts, and walks through a complete example of creating a reusable code‑analysis skill in Java, covering dependency setup, skill definition, integration, and expected runtime behavior.

Agent SkillsLLMLangChain4j

0 likes · 8 min read

Mastering Agent Skills in LangChain4j: Build a Java Code Analysis Assistant

High Availability Architecture

Mar 9, 2026 · Artificial Intelligence

Build a Minimal AI CLI with Claude in 80 Lines of Node.js

This tutorial walks you through creating a lightweight, single‑file AI command‑line interface powered by Anthropic's Claude API, covering prerequisites, repository setup, core components, streaming responses, built‑in commands, and the underlying stateless architecture.

AICLIClaude

0 likes · 9 min read

Build a Minimal AI CLI with Claude in 80 Lines of Node.js

DeepHub IMBA

Mar 8, 2026 · Artificial Intelligence

MIT Study: How Self‑Generated History Pollutes LLM Context and Degrades Multi‑Turn Chats

An MIT paper reveals that storing a language model’s own prior replies—known as context pollution—significantly lengthens the dialogue context while offering little quality benefit, with up to a ten‑fold reduction in tokens and comparable responses for about 70% of turns, especially in open‑source models.

AI agentsLLMMIT study

0 likes · 11 min read

MIT Study: How Self‑Generated History Pollutes LLM Context and Degrades Multi‑Turn Chats

IT Services Circle

Mar 8, 2026 · Artificial Intelligence

Mastering LLM Skills: Modular Prompt Engineering for Scalable AI Workflows

The article explains how to replace monolithic prompts with reusable, lazy‑loaded Skill files, compares Skills with Prompt, MCP and Function Calling, shows concrete Skill structures and examples, and demonstrates a Spring Boot AI interview platform with open‑source repositories.

AI workflowFunction CallingLLM

0 likes · 12 min read

Mastering LLM Skills: Modular Prompt Engineering for Scalable AI Workflows

Data Party THU

Mar 8, 2026 · Artificial Intelligence

6 Practical Context‑Engineering Techniques to Tame RAG Hallucinations

This article explains why retrieval‑augmented generation (RAG) models often hallucinate, introduces the concept of context engineering, and details six practical techniques—including selective retrieval, context compression, hierarchical layout, dynamic query rewriting, memory management, and tool‑aware context—along with their trade‑offs and real‑world impact.

AIContext EngineeringLLM

0 likes · 23 min read

6 Practical Context‑Engineering Techniques to Tame RAG Hallucinations

Fun with Large Models

Mar 8, 2026 · Artificial Intelligence

EasyDataset: End-to-End Guide for Generating QA Datasets for LLM Fine‑Tuning

This article walks through the complete workflow of using EasyDataset to create high‑quality question‑answer pairs for supervised fine‑tuning, covering question generation (single and batch), three generation algorithms, answer generation (including chain‑of‑thought and multi‑turn dialogue), a hybrid quality‑assessment pipeline, and export to Alpaca or ShareGPT formats.

Alpaca formatData QualityEasyDataset

0 likes · 18 min read

EasyDataset: End-to-End Guide for Generating QA Datasets for LLM Fine‑Tuning

AI Explorer

Mar 8, 2026 · Artificial Intelligence

AutoClip: One‑Click AI Video Highlight Extraction and Editing

AutoClip is an open‑source, locally‑run tool that uses Alibaba's Qwen large language model and OpenAI Whisper to automatically download, transcribe, analyze, and cut high‑light segments from YouTube or Bilibili videos, offering real‑time task monitoring, smart collections, preview, Docker deployment, and a roadmap of future AI‑driven features.

AI video editingDockerFastAPI

0 likes · 7 min read

AutoClip: One‑Click AI Video Highlight Extraction and Editing

Architect

Mar 7, 2026 · Databases

Why an LLM‑Rewritten SQLite Is 20,000× Slower: Hidden Path Errors and Lessons

A Rust rewrite of SQLite generated largely by an LLM runs a simple primary‑key lookup 20,171 times slower than native SQLite, exposing how seemingly correct code can miss critical system constraints, and illustrating the need for explicit acceptance criteria, benchmark baselines, and governance when using AI‑generated software.

BenchmarkDatabase designLLM

0 likes · 19 min read

Why an LLM‑Rewritten SQLite Is 20,000× Slower: Hidden Path Errors and Lessons

AI Engineer Programming

Mar 7, 2026 · Artificial Intelligence

Prompt Caching, Tool Design, and Agent Architecture: Insights from Claude Code

The article explains LLM inference stages, how KV‑cache and vLLM's Paged Attention enable cross‑request prompt caching, and shares practical guidelines for prompt ordering, immutable caching, and robust tool design that together shape efficient and reliable AI agent architectures.

Agent ArchitectureLLMPaged Attention

0 likes · 18 min read

Prompt Caching, Tool Design, and Agent Architecture: Insights from Claude Code

AI Large-Model Wave and Transformation Guide

Mar 7, 2026 · Artificial Intelligence

Master Prompt Engineering: Craft Precise Prompts to Unlock LLM Power

This guide breaks down prompt engineering for large language models, explaining why clear, detailed prompts matter, how to define types, avoid ambiguity, use constraints, examples, role‑playing, long‑context techniques, chain‑of‑thought reasoning, and provides ready‑to‑use templates for various scenarios.

AIChatGPTLLM

0 likes · 88 min read

Master Prompt Engineering: Craft Precise Prompts to Unlock LLM Power

AI Tech Publishing

Mar 7, 2026 · Artificial Intelligence

A Practical Guide to Evaluating Agent Skills

This article explains why many Agent Skills are released without testing, defines measurable success criteria, and presents a lightweight evaluation framework—including prompt set creation, deterministic checks, optional LLM‑based qualitative checks, and best‑practice recommendations—demonstrated by improving a Gemini Interactions API skill from 66.7% to 100% pass rate.

AI agentsAgent SkillsGemini

0 likes · 13 min read

A Practical Guide to Evaluating Agent Skills

Architecture and Beyond

Mar 7, 2026 · Artificial Intelligence

Effective Context Transfer in Multi‑Agent Systems: Strategies and Pitfalls

Choosing how to pass context between agents determines system stability, token cost, and debugging difficulty; the article defines context, categorizes four context types, and evaluates four main strategies—shared state, message passing, context compression, and hierarchical routing—detailing mechanisms, use‑cases, implementation pitfalls, and cost‑effectiveness trade‑offs.

LLMSystem Designagent-routing

0 likes · 20 min read

Effective Context Transfer in Multi‑Agent Systems: Strategies and Pitfalls

DaTaobao Tech

Mar 6, 2026 · Artificial Intelligence

How We Built an LLM‑Powered User Feedback Sentiment Monitoring System

The transaction terminal team created an AI‑driven workflow that automatically collects, cleans, classifies, alerts, distributes, attributes, and reviews user feedback, using a four‑step LLM model to ensure controllable, consistent, and explainable sentiment analysis while boosting efficiency and trust.

AI workflowAutomationLLM

0 likes · 12 min read

How We Built an LLM‑Powered User Feedback Sentiment Monitoring System

Woodpecker Software Testing

Mar 6, 2026 · Artificial Intelligence

How RAG Testing Teams Can Successfully Transform in 2024

With RAG becoming the backbone of enterprise AI, traditional API‑UI testing misses critical semantic errors, leading to high hallucination rates; this article outlines why conventional methods fail and presents a three‑pillar transformation—skill rebuilding, process reengineering, and advanced tooling—backed by real‑world case studies.

AI testingLLMMLOps

0 likes · 9 min read

How RAG Testing Teams Can Successfully Transform in 2024

AI Tech Publishing

Mar 6, 2026 · Artificial Intelligence

How Codex CLI Compresses Context: Inside the compact() API

The article dissects Codex CLI's two compression paths—local LLM summarization for non‑Codex models and an encrypted compact() API for Codex models—by injecting prompts, extracting system, handoff, and compression prompts, and comparing them with open‑source references to reveal the underlying mechanism.

API analysisCodex CLILLM

0 likes · 5 min read

How Codex CLI Compresses Context: Inside the compact() API

Machine Learning Algorithms & Natural Language Processing

Mar 5, 2026 · Artificial Intelligence

How a Broken CEO Built an 8‑Agent AI Team in 14 Days and Launched a Site in 24 Hours

After breaking his hip, Cheetah Mobile CEO Fu Sheng used voice commands to train an OpenClaw‑based AI agent called "Sanwan" into an eight‑member team that generated 100k+ reads, millions of views, and a fully functional website in 24 hours, illustrating the power of skill‑driven AI agents over traditional SaaS.

AutomationEasyClawLLM

0 likes · 14 min read

How a Broken CEO Built an 8‑Agent AI Team in 14 Days and Launched a Site in 24 Hours

Mingyi World Elasticsearch

Mar 5, 2026 · Artificial Intelligence

Build a Natural‑Language Easysearch Assistant with LLM‑Powered Tool Use (No DSL Required)

This article shows how to create an Easysearch intelligent assistant that lets users manage indexes, write data, search and aggregate documents using Chinese natural language, by combining the DeepSeek large‑language model with OpenAI‑compatible function calling (Tool Use) and a lightweight Node.js executor.

DeepSeekEasysearchLLM

0 likes · 12 min read

Build a Natural‑Language Easysearch Assistant with LLM‑Powered Tool Use (No DSL Required)

Su San Talks Tech

Mar 5, 2026 · Artificial Intelligence

Choosing Between LangChain4j and Spring AI for Java Projects: A Practical Comparison

This article compares LangChain4j and Spring AI, outlining their design philosophies, core features, ecosystem integration, community maturity, and provides concrete code examples and selection guidelines to help Java developers decide which framework best fits their AI application needs.

AI frameworksComparisonJava

0 likes · 13 min read

Choosing Between LangChain4j and Spring AI for Java Projects: A Practical Comparison

Tencent Cloud Developer

Mar 5, 2026 · Artificial Intelligence

20 Cutting‑Edge RAG Optimization Techniques: From Semantic Chunking to Self‑RAG

This article systematically presents twenty practical RAG (Retrieval‑Augmented Generation) optimization methods—covering semantic chunking, chunk‑size evaluation, context‑enhanced retrieval, query transformation, re‑ranking, feedback loops, multimodal and graph RAG, hierarchical retrieval, HyDE, Self‑RAG and reinforcement‑learning‑enhanced RAG—each with clear Python code examples, advantages, limitations and ideal use‑cases.

AILLMRAG

0 likes · 57 min read

20 Cutting‑Edge RAG Optimization Techniques: From Semantic Chunking to Self‑RAG

Kuaishou Tech

Mar 4, 2026 · Artificial Intelligence

How LLMs Are Revolutionizing Reinforcement Learning for Recommendation Systems

This survey examines the emerging LLM‑RL collaborative recommendation paradigm, outlining its research background, five main collaboration patterns, standardized evaluation protocols, and the key challenges and future directions for building smarter, more robust recommender systems.

LLMRecommendation Systemsartificial intelligence

0 likes · 14 min read

How LLMs Are Revolutionizing Reinforcement Learning for Recommendation Systems

Woodpecker Software Testing

Mar 4, 2026 · Artificial Intelligence

Practical Testing of AI Agents: From ChatOps Assistants to Autonomous Driving Bots

The article examines the 2024 shift to dynamic AI agents, outlines why traditional testing falls short, and presents three real‑world case studies—ChatOps IT assistant, multi‑agent e‑commerce risk platform, and embodied inspection robot—detailing novel testing frameworks and measurable improvements.

AI agentsChatOpsHybrid Testing

0 likes · 8 min read

Practical Testing of AI Agents: From ChatOps Assistants to Autonomous Driving Bots

Woodpecker Software Testing

Mar 4, 2026 · Artificial Intelligence

Optimizing Prompt Performance: A Must‑Read Guide for Test Engineers

In the era of LLM‑driven intelligent testing, prompts act as test cases whose latency, token usage, retry rate, context retention, and determinism must be measured and optimized, and this article provides a concrete five‑metric framework and a four‑step practical method backed by real‑world data.

AI testingLLMPerformance Testing

0 likes · 8 min read

Optimizing Prompt Performance: A Must‑Read Guide for Test Engineers