All Articles

140340 articles · Page 22 of 7017
Raymond Ops
Raymond Ops
Jun 27, 2026 · Artificial Intelligence

vLLM Quantized Inference: Loading AWQ/GPTQ Models and Optimizing GPU Memory

This article provides a step‑by‑step guide on using vLLM to load AWQ and GPTQ quantized large language models, covering environment setup, calibration data preparation, model quantization, deployment scripts, performance benchmarking, accuracy checks, best‑practice recommendations, and troubleshooting tips for GPU memory optimization.

AWQGPTQGPU memory optimization
0 likes · 45 min read
vLLM Quantized Inference: Loading AWQ/GPTQ Models and Optimizing GPU Memory
Raymond Ops
Raymond Ops
Jun 27, 2026 · Operations

Hands‑On DNS Ops: Deploy BIND and CoreDNS with Full Troubleshooting Guide

This comprehensive guide walks you through DNS fundamentals, compares BIND, CoreDNS, PowerDNS and Unbound, provides step‑by‑step deployment scripts for BIND 9.20 and CoreDNS 1.12, explains DNSSEC configuration, caching optimizations, security hardening, high‑availability designs, monitoring, backup and recovery procedures, and advanced troubleshooting techniques.

BINDCoreDNSDNS
0 likes · 43 min read
Hands‑On DNS Ops: Deploy BIND and CoreDNS with Full Troubleshooting Guide
ITPUB
ITPUB
Jun 27, 2026 · Databases

Why Does Database Master‑Slave Replication Lag Every Day at 5‑7 AM?

The article investigates why master‑slave replication delay spikes each morning between 05:00 and 07:00, tracing it to the inventory‑snapshot worker that floods binlog traffic, evaluates five mitigation strategies, implements a big‑data extraction pipeline to Elasticsearch, and reports that the nightly delay disappeared and disk utilization improved.

BDPElasticsearchbig data extraction
0 likes · 9 min read
Why Does Database Master‑Slave Replication Lag Every Day at 5‑7 AM?
High Availability Architecture
High Availability Architecture
Jun 27, 2026 · Artificial Intelligence

How Should Tech Organizations Restructure for the Deepening AI‑Native Era?

The GIAC 2026 conference in Shenzhen showcased AI‑native transformation across leading tech firms, presenting the DRIVE model for organizational redesign, Google Cloud's Agentic AI strategy, Kuaishou's three‑layer AI overhaul, MoonBit's AI‑friendly programming language, and Kuaidi100's CLI‑native Agent ecosystem, highlighting practical challenges and future directions.

AI-nativeAgentic AICloud Computing
0 likes · 13 min read
How Should Tech Organizations Restructure for the Deepening AI‑Native Era?
DataFunSummit
DataFunSummit
Jun 27, 2026 · Artificial Intelligence

How We Turned AI Coding for Data Warehouses into an End‑to‑End Pipeline with Harness

The article analyzes why AI‑generated SQL alone cannot meet production data‑warehouse requirements, outlines four critical pain points, and presents a seven‑layer Harness framework that adds deterministic engineering controls, state persistence, skill registration, anti‑pattern libraries, and evidence‑based checks, achieving up to 94% time reduction and near‑zero side‑effects.

AIAutomationData Warehouse
0 likes · 34 min read
How We Turned AI Coding for Data Warehouses into an End‑to‑End Pipeline with Harness
DataFunSummit
DataFunSummit
Jun 27, 2026 · Industry Insights

Why Palantir’s Ontology Outperforms Traditional Data Platforms for Decision‑Making

The article examines costly data‑platform failures, contrasts traditional data‑middle‑platforms with Palantir’s ontology‑driven decision system, showcases real‑world ROI examples, and breaks down the three‑layer semantic‑dynamics‑decision architecture that turns data into actionable business outcomes.

Business IntelligenceData PlatformDecision System
0 likes · 4 min read
Why Palantir’s Ontology Outperforms Traditional Data Platforms for Decision‑Making
Java Companion
Java Companion
Jun 27, 2026 · Industry Insights

Why D2’s 24k‑Star Open‑Source Diagram Language Beats PlantUML and Mermaid

The article reviews D2, a declarative diagram scripting language with indentation‑based hierarchy, multiple layout engines, built‑in themes, precise error messages, and Go library support, comparing it against PlantUML, Mermaid and Graphviz while also noting its current limitations such as lack of native GitHub rendering and a limited icon set.

CLID2Go
0 likes · 11 min read
Why D2’s 24k‑Star Open‑Source Diagram Language Beats PlantUML and Mermaid
21CTO
21CTO
Jun 27, 2026 · Artificial Intelligence

Large vs Small Language Models: An Apple‑Centric Technical Comparison

The article analyses how deployment targets, inference economics, and training budgets drive divergent design choices for large (LLM) and small (SLM) Transformer‑based language models, covering architecture tweaks, data‑centric training methods, quantisation, KV‑cache management, and hybrid routing strategies for production systems.

Inference OptimizationQuantizationTransformer architecture
0 likes · 16 min read
Large vs Small Language Models: An Apple‑Centric Technical Comparison
IT Services Circle
IT Services Circle
Jun 27, 2026 · Industry Insights

Claude Fable 5 Returns in Staged Rollout, GPT‑5.6 Follows in Seconds

Claude Fable 5 reappeared in the Claude Code mobile app, allowing interactive SVG generation and git operations, but users reported poor output quality; the sighting sparked widespread community verification, speculation about a gray‑test, AWS Bedrock access, Anthropic’s internal negotiations led by Tom Brown, and predictions that the upcoming GPT‑5.6 will be released in stages before July 31.

AI model rolloutAWSAnthropic
0 likes · 7 min read
Claude Fable 5 Returns in Staged Rollout, GPT‑5.6 Follows in Seconds
IT Services Circle
IT Services Circle
Jun 27, 2026 · Frontend Development

Why TypeScript 7.0’s Go‑rewritten Compiler Boosts Type‑Checking Speed Tenfold

Microsoft’s TypeScript 7.0 RC rewrites the compiler in Go, delivering roughly ten times faster type checking through native execution and parallelism, introduces new CLI flags for concurrency, replaces the file‑watcher with a Go implementation, adds many breaking changes, and ships a multithreaded language server with richer editor features.

Breaking ChangesGoLanguage Server
0 likes · 9 min read
Why TypeScript 7.0’s Go‑rewritten Compiler Boosts Type‑Checking Speed Tenfold
Java Tech Enthusiast
Java Tech Enthusiast
Jun 27, 2026 · R&D Management

When Technical Mastery Becomes a Liability: My Unfair Dismissal Story

A senior backend engineer was promoted to team lead, but his obsession with coding, low emotional intelligence, and failure to delegate led to strained relationships, missed deadlines, and ultimately a forced resignation, illustrating the Peter Principle and offering hard‑won lessons for technical leaders.

Career AdvicePeter principlemanagement pitfalls
0 likes · 8 min read
When Technical Mastery Becomes a Liability: My Unfair Dismissal Story
Data Party THU
Data Party THU
Jun 27, 2026 · Artificial Intelligence

AI and Chemists Co-Develop TYR Inhibitors via Dual-Track Optimization

The study presents a dual-track strategy that combines deep reinforcement‑learning‑driven de novo molecular generation with expert‑guided medicinal chemistry to discover and optimize TYR inhibitors, demonstrating how AI expands chemical space while chemists ensure synthetic feasibility, leading to potent candidates such as AI10‑m15 with strong anti‑melanogenesis activity.

AI-driven drug discoveryTYR inhibitorchemical space exploration
0 likes · 8 min read
AI and Chemists Co-Develop TYR Inhibitors via Dual-Track Optimization
Data Party THU
Data Party THU
Jun 27, 2026 · Artificial Intelligence

Defining a Good Answer in the Agent Era: A Rubrics Survey

This survey examines how rubrics—structured, multi‑dimensional evaluation criteria—are defined, constructed, and applied to train and evaluate large language models, especially for open‑ended, high‑risk and agentic tasks, while highlighting current challenges such as reward hacking and bias.

AI safetyAgentEvaluation
0 likes · 15 min read
Defining a Good Answer in the Agent Era: A Rubrics Survey
James' Growth Diary
James' Growth Diary
Jun 27, 2026 · Artificial Intelligence

Why the Top‑Tier GPT‑5.6 Model Is Still Unavailable

GPT‑5.6 has been announced but, because of U.S. government intervention, its highest‑performance Sol ultra version remains inaccessible, even though benchmark tests show it already outperforms the previous Mythos model in coding and cybersecurity tasks.

AI modelGPT-5.6Mythos
0 likes · 4 min read
Why the Top‑Tier GPT‑5.6 Model Is Still Unavailable
Golang Shines
Golang Shines
Jun 27, 2026 · Backend Development

Master Nginx Quickly: A Comprehensive Guide Loved by Thousands

This article explains why Nginx outperforms Apache as a high‑performance web and load‑balancing server, details its simple installation, core and advanced configurations—including virtual hosts, access control, HTTPS and reverse proxy—while showcasing real‑world usage and deployment diagrams.

Backend DevelopmentConfigurationHTTPS
0 likes · 7 min read
Master Nginx Quickly: A Comprehensive Guide Loved by Thousands
Machine Heart
Machine Heart
Jun 27, 2026 · Artificial Intelligence

DSpark in DeepSeek V4 Cuts LLM Inference Latency by Up to 85%

DeepSeek V4’s DSpark adds a speculative decoding framework that combines a lightweight draft model, semi‑autoregressive generation, and confidence‑scheduled verification, delivering 60‑85% faster inference for Qwen3 and Gemma models while providing an open‑source DeepSpec toolkit for training and evaluation.

Confidence-Scheduled VerificationDSparkDeepSeek
0 likes · 7 min read
DSpark in DeepSeek V4 Cuts LLM Inference Latency by Up to 85%