All Articles

Jun 27, 2026 · Artificial Intelligence

vLLM Quantized Inference: Loading AWQ/GPTQ Models and Optimizing GPU Memory

This article provides a step‑by‑step guide on using vLLM to load AWQ and GPTQ quantized large language models, covering environment setup, calibration data preparation, model quantization, deployment scripts, performance benchmarking, accuracy checks, best‑practice recommendations, and troubleshooting tips for GPU memory optimization.

AWQ · GPTQ · GPU memory optimization

0 likes · 45 min read

Raymond Ops

Jun 27, 2026 · Operations

Hands‑On DNS Ops: Deploy BIND and CoreDNS with Full Troubleshooting Guide

This comprehensive guide walks you through DNS fundamentals, compares BIND, CoreDNS, PowerDNS and Unbound, provides step‑by‑step deployment scripts for BIND 9.20 and CoreDNS 1.12, explains DNSSEC configuration, caching optimizations, security hardening, high‑availability designs, monitoring, backup and recovery procedures, and advanced troubleshooting techniques.

BIND · CoreDNS · DNS

0 likes · 43 min read

ITPUB

Jun 27, 2026 · Databases

Why Does Database Master‑Slave Replication Lag Every Day at 5‑7 AM?

The article investigates why master‑slave replication delay spikes each morning between 05:00 and 07:00, tracing it to the inventory‑snapshot worker that floods binlog traffic, evaluates five mitigation strategies, implements a big‑data extraction pipeline to Elasticsearch, and reports that the nightly delay disappeared and disk utilization improved.

BDP · Elasticsearch · big data extraction

0 likes · 9 min read

IT Learning Made Simple

Jun 27, 2026 · Fundamentals

What’s the Real Difference Between Clusters, Distributed Systems, and Microservices?

Clusters, distributed systems, and microservices are often confused, but this article clarifies each concept with simple analogies, outlines the problems they solve, shows their hierarchical relationship, and provides practical guidelines for choosing the right approach in real-world projects.

High Availability · cluster · distributed system

0 likes · 3 min read

High Availability Architecture

Jun 27, 2026 · Artificial Intelligence

How Should Tech Organizations Restructure for the Deepening AI‑Native Era?

The GIAC 2026 conference in Shenzhen showcased AI‑native transformation across leading tech firms, presenting the DRIVE model for organizational redesign, Google Cloud's Agentic AI strategy, Kuaishou's three‑layer AI overhaul, MoonBit's AI‑friendly programming language, and Kuaidi100's CLI‑native Agent ecosystem, highlighting practical challenges and future directions.

AI-native · Agentic AI · Cloud Computing

0 likes · 13 min read

AI Engineering

Jun 27, 2026 · Artificial Intelligence

Open Tag: The Open‑Source Counterpart That Quickly Matches Claude Tag’s AI Agent Features

Anthropic’s Claude Tag brought AI‑agent assistance to Slack, and within days the open‑source Open Tag appeared, supporting any large model, multiple chat platforms, and full customisation, while the community debates its deployment effort versus the flexibility it offers.

AI agents · Claude Tag · CopilotKit

0 likes · 4 min read

DataFunSummit

Jun 27, 2026 · Artificial Intelligence

How We Turned AI Coding for Data Warehouses into an End‑to‑End Pipeline with Harness

The article analyzes why AI‑generated SQL alone cannot meet production data‑warehouse requirements, outlines four critical pain points, and presents a seven‑layer Harness framework that adds deterministic engineering controls, state persistence, skill registration, anti‑pattern libraries, and evidence‑based checks, achieving up to 94% time reduction and near‑zero side‑effects.

AI · Automation · Data Warehouse

0 likes · 34 min read

DataFunSummit

Jun 27, 2026 · Industry Insights

Why Palantir’s Ontology Outperforms Traditional Data Platforms for Decision‑Making

The article examines costly data‑platform failures, contrasts traditional data‑middle‑platforms with Palantir’s ontology‑driven decision system, showcases real‑world ROI examples, and breaks down the three‑layer semantic‑dynamics‑decision architecture that turns data into actionable business outcomes.

Business Intelligence · Data Platform · Decision System

0 likes · 4 min read

Java Companion

Jun 27, 2026 · Industry Insights

Why D2’s 24k‑Star Open‑Source Diagram Language Beats PlantUML and Mermaid

The article reviews D2, a declarative diagram scripting language with indentation‑based hierarchy, multiple layout engines, built‑in themes, precise error messages, and Go library support, comparing it against PlantUML, Mermaid and Graphviz while also noting its current limitations such as lack of native GitHub rendering and a limited icon set.

CLI · D2 · Go

0 likes · 11 min read

21CTO

Jun 27, 2026 · Artificial Intelligence

Large vs Small Language Models: An Apple‑Centric Technical Comparison

The article analyses how deployment targets, inference economics, and training budgets drive divergent design choices for large (LLM) and small (SLM) Transformer‑based language models, covering architecture tweaks, data‑centric training methods, quantisation, KV‑cache management, and hybrid routing strategies for production systems.

Inference Optimization · Quantization · Transformer architecture

0 likes · 16 min read

IT Services Circle

Jun 27, 2026 · Industry Insights

Claude Fable 5 Returns in Staged Rollout, GPT‑5.6 Follows in Seconds

Claude Fable 5 reappeared in the Claude Code mobile app, allowing interactive SVG generation and git operations, but users reported poor output quality. The sighting prompted widespread community verification and speculation about a gray (staged) test, AWS Bedrock access, and Anthropic’s internal negotiations led by Tom Brown, along with predictions that the upcoming GPT‑5.6 will be released in stages before July 31.

AI model rollout · AWS · Anthropic

0 likes · 7 min read

IT Services Circle

Jun 27, 2026 · Frontend Development

Why TypeScript 7.0’s Go‑rewritten Compiler Boosts Type‑Checking Speed Tenfold

Microsoft’s TypeScript 7.0 RC rewrites the compiler in Go, delivering roughly ten times faster type checking through native execution and parallelism, introduces new CLI flags for concurrency, replaces the file‑watcher with a Go implementation, adds many breaking changes, and ships a multithreaded language server with richer editor features.

Breaking Changes · Go · Language Server

0 likes · 9 min read

Java Tech Enthusiast

Jun 27, 2026 · R&D Management

When Technical Mastery Becomes a Liability: My Unfair Dismissal Story

A senior backend engineer was promoted to team lead, but his obsession with coding, low emotional intelligence, and failure to delegate led to strained relationships, missed deadlines, and ultimately a forced resignation, illustrating the Peter Principle and offering hard‑won lessons for technical leaders.

Career Advice · Peter principle · management pitfalls

0 likes · 8 min read

Java Tech Enthusiast

Jun 27, 2026 · Industry Insights

Why AI-Generated Patches Drove Linux to Drop the 40‑Year‑Old AppleTalk Protocol

Linux maintainer Jakub Kicinski removed roughly 4,000 lines of AppleTalk code—a networking protocol that survived 40 years—citing a flood of unreviewed AI‑generated patches that made maintenance untenable, highlighting how AI is reshaping the fate of legacy modules in open‑source projects.

AI-generated patches · AppleTalk · Linux kernel

0 likes · 8 min read

Data Party THU

Jun 27, 2026 · Artificial Intelligence

AI and Chemists Co-Develop TYR Inhibitors via Dual-Track Optimization

The study presents a dual-track strategy that combines deep reinforcement‑learning‑driven de novo molecular generation with expert‑guided medicinal chemistry to discover and optimize TYR inhibitors, demonstrating how AI expands chemical space while chemists ensure synthetic feasibility, leading to potent candidates such as AI10‑m15 with strong anti‑melanogenesis activity.

AI-driven drug discovery · TYR inhibitor · chemical space exploration

0 likes · 8 min read

Data Party THU

Jun 27, 2026 · Artificial Intelligence

Defining a Good Answer in the Agent Era: A Rubrics Survey

This survey examines how rubrics—structured, multi‑dimensional evaluation criteria—are defined, constructed, and applied to train and evaluate large language models, especially for open‑ended, high‑risk and agentic tasks, while highlighting current challenges such as reward hacking and bias.

AI safety · Agent · Evaluation

0 likes · 15 min read

James' Growth Diary

Jun 27, 2026 · Artificial Intelligence

Why the Top‑Tier GPT‑5.6 Model Is Still Unavailable

GPT‑5.6 has been announced but, because of U.S. government intervention, its highest‑performance Sol ultra version remains inaccessible, even though benchmark tests show it already outperforms the previous Mythos model in coding and cybersecurity tasks.

AI model · GPT-5.6 · Mythos

0 likes · 4 min read

Golang Shines

Jun 27, 2026 · Backend Development

Master Nginx Quickly: A Comprehensive Guide Loved by Thousands

This article explains why Nginx outperforms Apache as a high‑performance web and load‑balancing server, details its simple installation, core and advanced configurations—including virtual hosts, access control, HTTPS and reverse proxy—while showcasing real‑world usage and deployment diagrams.

Backend Development · Configuration · HTTPS

0 likes · 7 min read

Machine Heart

Jun 27, 2026 · Artificial Intelligence

DSpark in DeepSeek V4 Cuts LLM Inference Latency by Up to 85%

DeepSeek V4’s DSpark adds a speculative decoding framework that combines a lightweight draft model, semi‑autoregressive generation, and confidence‑scheduled verification, delivering 60‑85% faster inference for Qwen3 and Gemma models while providing an open‑source DeepSpec toolkit for training and evaluation.

Confidence-Scheduled Verification · DSpark · DeepSeek

0 likes · 7 min read

Java Architect Handbook

Jun 27, 2026 · Backend Development

How a Simple Queue Prevents Server Crashes When Multiple Users Export Excel Simultaneously

The article explains why concurrent Excel exports can overload a Java backend, describes a FIFO queue with a fixed capacity to serialize export tasks, shows the full implementation with Spring, EasyExcel and synchronized wait/notify, and discusses test results and remaining limitations.

Backend · EasyExcel · Export

0 likes · 12 min read
