All Articles

140314 articles · Page 17 of 7016
Shuge Unlimited
Shuge Unlimited
Jun 29, 2026 · Databases

Inside Milvus’ Index Engine: 3‑Layer Parameter Filling, Compile‑time Hardware Split, and a 16× Memory Trade‑off

The article dissects Milvus’ index engine, revealing that AUTOINDEX relies on a three‑stage default‑parameter pipeline, that CPU/GPU index selection is fixed at compile time via Go build tags, that the C++ Knowhere engine executes the algorithms, and that version aggregation, scalar V3 format, and the new AISAQ index embody deliberate memory‑vs‑IO trade‑offs.

AISAQAUTOINDEXCPU/GPU build tags
0 likes · 26 min read
Inside Milvus’ Index Engine: 3‑Layer Parameter Filling, Compile‑time Hardware Split, and a 16× Memory Trade‑off
Geek Labs
Geek Labs
Jun 29, 2026 · Artificial Intelligence

DeepSpec Boosts Large-Model Inference Speed by 2–5× with Speculative Decoding

DeepSpec, an open‑source framework from DeepSeek, accelerates large‑language‑model inference by 2–5× through speculative decoding, where a lightweight draft model generates candidate tokens that the target model validates in parallel, reducing the serial bottleneck of autoregressive decoding and offering a full‑stack pipeline from data preparation to evaluation.

DeepSpecGPUPython
0 likes · 6 min read
DeepSpec Boosts Large-Model Inference Speed by 2–5× with Speculative Decoding
AI Engineering
AI Engineering
Jun 29, 2026 · Artificial Intelligence

How Coinbase Halved AI Costs While Token Usage Continued to Surge

In June, Coinbase CEO Brian Armstrong revealed an internal AI cost‑optimization program that cut the company's AI dollar spend by almost 50% while token consumption kept growing exponentially, achieved through five concrete measures involving model defaults, intelligent routing, cache reuse, context trimming, and transparent usage monitoring.

AI cost optimizationCoinbaseLLM routing
0 likes · 9 min read
How Coinbase Halved AI Costs While Token Usage Continued to Surge
AI Engineer Programming
AI Engineer Programming
Jun 29, 2026 · Artificial Intelligence

Managing LLM Hallucinations: Strategies, Metrics, and Layered Controls

The article examines why large language models hallucinate, categorizes factual, faithfulness, and reasoning hallucinations, critiques existing benchmarks, and proposes a layered governance framework—including training‑time RLHF/DPO, retrieval‑augmented generation, post‑generation verification, uncertainty quantification, and compliance considerations—to mitigate risks in production systems.

EvaluationHallucinationLLM
0 likes · 13 min read
Managing LLM Hallucinations: Strategies, Metrics, and Layered Controls
AI Architecture Path
AI Architecture Path
Jun 29, 2026 · Frontend Development

Terax: A 7 MB Open‑Source Tauri2+Rust IDE that Unifies Terminal, Editor, Git, and AI Agent Offline

The article analyzes the fragmented, heavyweight, and privacy‑risk‑laden workflow of using separate VSCode, iTerm2, Git clients, and AI chat tools, then introduces Terax—a 7 MB Tauri2+Rust‑based open‑source IDE that integrates a native terminal, CodeMirror6 editor, Git visual panel, web preview, and a controllable AI Agent with offline model support, while detailing its architecture, feature set, installation steps, and current limitations.

AI AgentCodeMirror6Git
0 likes · 12 min read
Terax: A 7 MB Open‑Source Tauri2+Rust IDE that Unifies Terminal, Editor, Git, and AI Agent Offline
Linyb Geek Road
Linyb Geek Road
Jun 29, 2026 · Artificial Intelligence

Understanding Loop Engineering: Concepts, Insights, and Practical Applications

The article explains Loop Engineering by distinguishing it from basic Agent Loops, outlines its six core components, showcases a text‑classification example, and discusses when the approach boosts efficiency versus when traditional Human‑in‑the‑Loop remains preferable.

AI AgentsAutomationSkill Evolution
0 likes · 21 min read
Understanding Loop Engineering: Concepts, Insights, and Practical Applications
Linyb Geek Road
Linyb Geek Road
Jun 29, 2026 · Artificial Intelligence

Deep Dive into Loop Engineering: From Prompt Engineering to System Design

Loop Engineering replaces manual prompting with system‑designed loops that let AI agents iterate autonomously, covering its definition, origins, five core modules plus memory, a full‑stack example, experimental results, limitations, and a comparison between Claude Code and Codex.

AI AgentsAutomationConnector
0 likes · 16 min read
Deep Dive into Loop Engineering: From Prompt Engineering to System Design
TonyBai
TonyBai
Jun 29, 2026 · Fundamentals

Why I Keep Returning to Go After Trying Every Language

The article argues that Go’s batteries‑included standard library eliminates dependency fatigue, its built‑in diagnostics let engineers locate production issues in hours instead of weeks, its uncolored concurrency model avoids async/await pitfalls, and its minimalism reduces cognitive load and boosts team efficiency.

ConcurrencyGodependency fatigue
0 likes · 10 min read
Why I Keep Returning to Go After Trying Every Language
Wuming AI
Wuming AI
Jun 28, 2026 · Artificial Intelligence

How a Dedicated Mailbox Lets AI Agents Receive Tasks Autonomously

The article walks through assigning a unique @agent.qq.com mailbox to an AI Agent, explains why a separate email address is essential for identity, permission, and audit in enterprise workflows, and demonstrates the setup, testing, and automation possibilities with practical examples.

AI AgentAgent ManagementAutomation
0 likes · 10 min read
How a Dedicated Mailbox Lets AI Agents Receive Tasks Autonomously
Java Architect Essentials
Java Architect Essentials
Jun 28, 2026 · Artificial Intelligence

Claude Code Repo Hits 54K Stars in 60 Days, Supercharging Front‑End Development

Within two months the open‑source Claude Code best‑practice repository amassed over 54 000 GitHub stars by systematically cataloguing community‑validated concepts, features, workflows and 83 practical tips, offering concrete guidance—such as context compression thresholds, staged planning, and disciplined hook usage—to dramatically improve front‑end and back‑end coding efficiency.

AI coding assistantClaude CodeGitHub
0 likes · 8 min read
Claude Code Repo Hits 54K Stars in 60 Days, Supercharging Front‑End Development
Code Mala Tang
Code Mala Tang
Jun 28, 2026 · Artificial Intelligence

7 Essential Things to Know About MCP AI (Multi‑Context Prompting)

MCP AI, a multi‑context prompting approach, replaces linear chat interactions by maintaining several active contexts that the model can switch between, solving context‑window limits, improving coherence, and enabling system‑level workflows, while requiring proper role definition, rules, and feedback loops.

AI ArchitectureClaudeCrewAI
0 likes · 7 min read
7 Essential Things to Know About MCP AI (Multi‑Context Prompting)
Model Perspective
Model Perspective
Jun 28, 2026 · Industry Insights

DeepSeek’s Hiring Surge: Can It Shift From Model Base to Platform Leader?

DeepSeek’s recent staff doubling is examined through ecological niche theory and a Lotka‑Volterra competition model, showing its current API‑centric niche, potential move into enterprise agent tools, and the strategic need to define new standards rather than merely replicating existing Harness products.

AI competitionAgent platformsDeepSeek
0 likes · 10 min read
DeepSeek’s Hiring Surge: Can It Shift From Model Base to Platform Leader?
dbaplus Community
dbaplus Community
Jun 28, 2026 · Operations

Why Tencent Music Rejects AI Hype: Building an OpenClaw‑Powered Intelligent Ops Ecosystem

The article details Tencent Music's step‑by‑step evolution from manual alert handling to a three‑layer cloud‑native AIOps platform, describing data pipelines, dynamic 3‑sigma alerts, full‑link observability, and the OpenClaw sandbox with multi‑agent architecture that prioritises scenario‑driven, safe AI integration.

AIAIOpsData Engineering
0 likes · 17 min read
Why Tencent Music Rejects AI Hype: Building an OpenClaw‑Powered Intelligent Ops Ecosystem
Coder Trainee
Coder Trainee
Jun 28, 2026 · Fundamentals

Java Performance Tuning Part 5: Hands‑On GC Optimization from G1 to ZGC

This article walks through Java GC tuning by defining low‑latency and high‑throughput goals, comparing major collectors, presenting G1 and ZGC configuration examples, and demonstrating a real‑world payment system case where pause times were reduced from 150‑200 ms to under 50 ms.

GCJVMJava
0 likes · 8 min read
Java Performance Tuning Part 5: Hands‑On GC Optimization from G1 to ZGC
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Jun 28, 2026 · Artificial Intelligence

DSpark Explained in 10 Essential Concepts: System‑Level Engineering Insights

DSpark, DeepSeek’s new LLM inference framework, combines batch processing, speculative decoding, Eagle‑style draft models and DFlash‑style parallel generation with a lightweight sequential head and hardware‑aware scheduling, delivering 60‑85% speedups while preserving model quality.

Batch ProcessingDeepSeekGPU Optimization
0 likes · 12 min read
DSpark Explained in 10 Essential Concepts: System‑Level Engineering Insights
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Jun 28, 2026 · Artificial Intelligence

Why the Log‑Ratio Reward in OPD Is Fundamentally Flawed and Should Be Replaced

The paper reveals that the unbounded log‑ratio reward used in vanilla On‑Policy Distillation causes extreme gradient variance, early‑stage instability, and poor final performance, and demonstrates that replacing the log with a bounded Box‑Cox power transform (PowerOPD) resolves these issues while improving accuracy, efficiency, and memory usage.

Box-CoxOPDReinforcement Learning
0 likes · 16 min read
Why the Log‑Ratio Reward in OPD Is Fundamentally Flawed and Should Be Replaced
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Jun 28, 2026 · Artificial Intelligence

Evaluating Research Ideas with InnoEval and SciAtlas: Leveraging 43M Papers and 3B Triples

As large language models accelerate idea generation and the volume of scientific papers soars, InnoEval formalizes multi‑perspective, knowledge‑grounded evaluation of research ideas, while SciAtlas provides a massive cross‑disciplinary knowledge graph that powers evidence‑rich assessments and agent‑driven workflows.

AI AgentsInnoEvalLLM
0 likes · 13 min read
Evaluating Research Ideas with InnoEval and SciAtlas: Leveraging 43M Papers and 3B Triples
ITPUB
ITPUB
Jun 28, 2026 · Industry Insights

What’s the Longest‑Running Server Ever? Real‑World Uptime Stories

The article compiles dozens of real‑world examples of computers and servers that have stayed online for years or even decades, from a Chinese provincial telecom data‑center Red Hat Linux box running 14 years, to a 20‑year‑old base‑station, a 20‑plus‑year DOS server, two Linux boxes up since 2007, and NASA’s Voyager 2 spacecraft computer that has been operating for over 43 years.

DOSRed Hat LinuxVoyager 2
0 likes · 8 min read
What’s the Longest‑Running Server Ever? Real‑World Uptime Stories