All Articles

Jun 29, 2026 · Databases

Inside Milvus’ Index Engine: 3‑Layer Parameter Filling, Compile‑time Hardware Split, and a 16× Memory Trade‑off

The article dissects Milvus’ index engine, revealing that AUTOINDEX relies on a three‑stage default‑parameter pipeline, that CPU/GPU index selection is fixed at compile time via Go build tags, that the C++ Knowhere engine executes the algorithms, and that version aggregation, scalar V3 format, and the new AISAQ index embody deliberate memory‑vs‑IO trade‑offs.

AISAQ · AUTOINDEX · CPU/GPU build tags

0 likes · 26 min read

Geek Labs

Jun 29, 2026 · Artificial Intelligence

DeepSpec Boosts Large-Model Inference Speed by 2–5× with Speculative Decoding

DeepSpec, an open‑source framework from DeepSeek, accelerates large‑language‑model inference by 2–5× through speculative decoding, where a lightweight draft model generates candidate tokens that the target model validates in parallel, reducing the serial bottleneck of autoregressive decoding and offering a full‑stack pipeline from data preparation to evaluation.

DeepSpec · GPU · Python

0 likes · 6 min read

AI Engineering

Jun 29, 2026 · Artificial Intelligence

How Coinbase Halved AI Costs While Token Usage Continued to Surge

In June, Coinbase CEO Brian Armstrong revealed an internal AI cost-optimization program that cut the company's AI dollar spend by almost 50% while token consumption kept growing exponentially. The savings came from five concrete measures: model defaults, intelligent routing, cache reuse, context trimming, and transparent usage monitoring.

AI cost optimization · Coinbase · LLM routing

0 likes · 9 min read

AI Engineer Programming

Jun 29, 2026 · Artificial Intelligence

Managing LLM Hallucinations: Strategies, Metrics, and Layered Controls

The article examines why large language models hallucinate, categorizes factual, faithfulness, and reasoning hallucinations, critiques existing benchmarks, and proposes a layered governance framework—including training‑time RLHF/DPO, retrieval‑augmented generation, post‑generation verification, uncertainty quantification, and compliance considerations—to mitigate risks in production systems.

Evaluation · Hallucination · LLM

0 likes · 13 min read

AI Architecture Path

Jun 29, 2026 · Frontend Development

Terax: A 7 MB Open‑Source Tauri2+Rust IDE that Unifies Terminal, Editor, Git, and AI Agent Offline

The article analyzes the fragmented, heavyweight, and privacy-risky workflow of juggling separate tools (VS Code, iTerm2, Git clients, and AI chat tools), then introduces Terax, a 7 MB open-source IDE built on Tauri2 and Rust. Terax integrates a native terminal, a CodeMirror6 editor, a visual Git panel, web preview, and a controllable AI Agent with offline model support. The article also details its architecture, feature set, installation steps, and current limitations.

AI Agent · CodeMirror6 · Git

0 likes · 12 min read

Linyb Geek Road

Jun 29, 2026 · Artificial Intelligence

Understanding Loop Engineering: Concepts, Insights, and Practical Applications

The article explains Loop Engineering by distinguishing it from basic Agent Loops, outlines its six core components, showcases a text‑classification example, and discusses when the approach boosts efficiency versus when traditional Human‑in‑the‑Loop remains preferable.

AI Agents · Automation · Skill Evolution

0 likes · 21 min read

Linyb Geek Road

Jun 29, 2026 · Artificial Intelligence

Deep Dive into Loop Engineering: From Prompt Engineering to System Design

Loop Engineering replaces manual prompting with system-designed loops that let AI agents iterate autonomously. The article covers its definition, origins, five core modules plus memory, a full-stack example, experimental results, limitations, and a comparison between Claude Code and Codex.

AI Agents · Automation · Connector

0 likes · 16 min read

TonyBai

Jun 29, 2026 · Fundamentals

Why I Keep Returning to Go After Trying Every Language

The article argues that Go’s batteries‑included standard library eliminates dependency fatigue, its built‑in diagnostics let engineers locate production issues in hours instead of weeks, its uncolored concurrency model avoids async/await pitfalls, and its minimalism reduces cognitive load and boosts team efficiency.

Concurrency · Go · dependency fatigue

0 likes · 10 min read

Wuming AI

Jun 28, 2026 · Artificial Intelligence

How a Dedicated Mailbox Lets AI Agents Receive Tasks Autonomously

The article walks through assigning a unique @agent.qq.com mailbox to an AI Agent, explains why a separate email address is essential for identity, permission, and audit in enterprise workflows, and demonstrates the setup, testing, and automation possibilities with practical examples.

AI Agent · Agent Management · Automation

0 likes · 10 min read

Java Architect Essentials

Jun 28, 2026 · Artificial Intelligence

Claude Code Repo Hits 54K Stars in 60 Days, Supercharging Front‑End Development

Within two months, the open-source Claude Code best-practice repository amassed over 54,000 GitHub stars by systematically cataloguing community-validated concepts, features, workflows, and 83 practical tips. It offers concrete guidance, such as context compression thresholds, staged planning, and disciplined hook usage, to improve front-end and back-end coding efficiency.

AI coding assistant · Claude Code · GitHub

0 likes · 8 min read

Code Mala Tang

Jun 28, 2026 · Artificial Intelligence

7 Essential Things to Know About MCP AI (Multi‑Context Prompting)

MCP AI, a multi‑context prompting approach, replaces linear chat interactions by maintaining several active contexts that the model can switch between, solving context‑window limits, improving coherence, and enabling system‑level workflows, while requiring proper role definition, rules, and feedback loops.

AI Architecture · Claude · CrewAI

0 likes · 7 min read

21CTO

Jun 28, 2026 · Artificial Intelligence

Apple Demonstrates Building a Local AI App on Mac in 13 Minutes with MLX and Xcode

Apple’s WWDC26 video shows how developers can run Agentic AI locally on a Mac using the MLX framework, integrate it directly into Xcode, and achieve high‑performance, privacy‑preserving AI‑assisted app creation without relying on cloud APIs.

Agentic AI · Apple · M5 chip

0 likes · 7 min read

LuTiao Programming

Jun 28, 2026 · Backend Development

Over 10% of Users Run Three Codex Agents Simultaneously—Java Development Enters a Multithreaded Era

OpenAI’s recent study shows that more than 10% of Java developers now manage three or more Codex agents at once, prompting a shift from single‑agent coding to a multithreaded, task‑splitting workflow that demands clear boundaries, isolated workspaces, and disciplined coordination.

AI coding · Codex Agent · Git worktree

0 likes · 21 min read

Model Perspective

Jun 28, 2026 · Industry Insights

DeepSeek’s Hiring Surge: Can It Shift From Model Base to Platform Leader?

DeepSeek’s recent staff doubling is examined through ecological niche theory and a Lotka‑Volterra competition model, showing its current API‑centric niche, potential move into enterprise agent tools, and the strategic need to define new standards rather than merely replicating existing Harness products.

AI competition · Agent platforms · DeepSeek

0 likes · 10 min read

dbaplus Community

Jun 28, 2026 · Operations

Why Tencent Music Rejects AI Hype: Building an OpenClaw‑Powered Intelligent Ops Ecosystem

The article details Tencent Music's step‑by‑step evolution from manual alert handling to a three‑layer cloud‑native AIOps platform, describing data pipelines, dynamic 3‑sigma alerts, full‑link observability, and the OpenClaw sandbox with multi‑agent architecture that prioritises scenario‑driven, safe AI integration.

AI · AIOps · Data Engineering

0 likes · 17 min read

Coder Trainee

Jun 28, 2026 · Fundamentals

Java Performance Tuning Part 5: Hands‑On GC Optimization from G1 to ZGC

This article walks through Java GC tuning by defining low‑latency and high‑throughput goals, comparing major collectors, presenting G1 and ZGC configuration examples, and demonstrating a real‑world payment system case where pause times were reduced from 150‑200 ms to under 50 ms.

GC · JVM · Java

0 likes · 8 min read

Machine Learning Algorithms & Natural Language Processing

Jun 28, 2026 · Artificial Intelligence

DSpark Explained in 10 Essential Concepts: System‑Level Engineering Insights

DSpark, DeepSeek’s new LLM inference framework, combines batch processing, speculative decoding, Eagle‑style draft models and DFlash‑style parallel generation with a lightweight sequential head and hardware‑aware scheduling, delivering 60‑85% speedups while preserving model quality.

Batch Processing · DeepSeek · GPU Optimization

0 likes · 12 min read

Machine Learning Algorithms & Natural Language Processing

Jun 28, 2026 · Artificial Intelligence

Why the Log‑Ratio Reward in OPD Is Fundamentally Flawed and Should Be Replaced

The paper reveals that the unbounded log‑ratio reward used in vanilla On‑Policy Distillation causes extreme gradient variance, early‑stage instability, and poor final performance, and demonstrates that replacing the log with a bounded Box‑Cox power transform (PowerOPD) resolves these issues while improving accuracy, efficiency, and memory usage.

Box-Cox · OPD · Reinforcement Learning

0 likes · 16 min read

Machine Learning Algorithms & Natural Language Processing

Jun 28, 2026 · Artificial Intelligence

Evaluating Research Ideas with InnoEval and SciAtlas: Leveraging 43M Papers and 3B Triples

As large language models accelerate idea generation and the volume of scientific papers soars, InnoEval formalizes multi‑perspective, knowledge‑grounded evaluation of research ideas, while SciAtlas provides a massive cross‑disciplinary knowledge graph that powers evidence‑rich assessments and agent‑driven workflows.

AI Agents · InnoEval · LLM

0 likes · 13 min read

ITPUB

Jun 28, 2026 · Industry Insights

What’s the Longest‑Running Server Ever? Real‑World Uptime Stories

The article compiles dozens of real-world examples of computers and servers that have stayed online for years or even decades: a Red Hat Linux box that has run for 14 years in a Chinese provincial telecom data center, a 20-year-old base station, a DOS server running for more than 20 years, two Linux boxes up since 2007, and the computer aboard NASA's Voyager 2 spacecraft, which has been operating for over 43 years.

DOS · Red Hat Linux · Voyager 2

0 likes · 8 min read
