AI Engineering
Author

AI Engineering

Focused on cutting‑edge product and technology information and practical experience sharing in the AI field (large models, MLOps/LLMOps, AI application development, AI infrastructure).

127
Articles
0
Likes
0
Views
0
Comments
Recent Articles

Latest from AI Engineering

100 recent articles max
AI Engineering
AI Engineering
Feb 21, 2026 · Artificial Intelligence

Why Pi-mono Powers OpenClaw: A Minimalist AI Coding Assistant

Pi-mono is a four‑tool, four‑layer AI coding assistant built by Mario Zechner that replaces bloated agents with a minimalist design, supports dozens of LLM providers, offers a terminal UI, extensible TypeScript plugins, and demonstrates superior benchmark performance in Terminal‑Bench.

AI Coding AssistantLLM integrationagent framework
0 likes · 13 min read
Why Pi-mono Powers OpenClaw: A Minimalist AI Coding Assistant
AI Engineering
AI Engineering
Feb 20, 2026 · Artificial Intelligence

Gemini 3.1 Pro Doubles Reasoning Power and Outperforms Claude Opus 4.6

Google's Gemini 3.1 Pro achieves a 77.1% ARC‑AGI‑2 score—more than double its predecessor—leads in multiple benchmark categories, cuts inference cost by half compared to top rivals, and demonstrates advanced multimodal and programming capabilities through real‑world demos.

AI benchmarksARC-AGI-2Claude Opus 4.6
0 likes · 9 min read
Gemini 3.1 Pro Doubles Reasoning Power and Outperforms Claude Opus 4.6
AI Engineering
AI Engineering
Feb 17, 2026 · Artificial Intelligence

Claude Sonnet 4.6: Million‑Token Context, Human‑Level Computer Skills, Near‑Opus Performance

Claude Sonnet 4.6, Anthropic’s latest model, introduces a beta‑stage million‑token window and markedly better coding, computer‑use and long‑context reasoning, scoring 72.5% on OSWorld versus 14.9% for Sonnet 3.5, while offering Excel connectors, dynamic search filtering, stronger prompt‑injection resistance, and a pricing tier that makes it a strong alternative to Opus for many workloads.

AI codingAPIClaude
0 likes · 4 min read
Claude Sonnet 4.6: Million‑Token Context, Human‑Level Computer Skills, Near‑Opus Performance
AI Engineering
AI Engineering
Feb 16, 2026 · Artificial Intelligence

Qwen3.5-397B: 397B‑Parameter Multimodal LLM Boosts Inference Speed 8‑19×

Alibaba’s Qwen3.5-397B-A17B, a 397‑billion‑parameter open‑source multimodal LLM, combines mixed linear attention with a sparse MoE architecture to achieve 8.6‑19× higher decoding throughput than Qwen3‑Max, supports 201 languages, and can be deployed via vLLM, Docker, Transformers, or SGLang with various optimization presets.

Large Language ModelQwen3.5inference optimization
0 likes · 8 min read
Qwen3.5-397B: 397B‑Parameter Multimodal LLM Boosts Inference Speed 8‑19×
AI Engineering
AI Engineering
Feb 15, 2026 · Industry Insights

OpenClaw Joins OpenAI: Sam Altman Moves Faster Than Zuckerberg

Peter Steinberger announced his move to OpenAI and the conversion of OpenClaw into an independent foundation, sparking community debate over OpenAI's open‑source strategy, the future of AI agents, and the strategic implications of this partnership.

AI agentsOpenAIOpenClaw
0 likes · 4 min read
OpenClaw Joins OpenAI: Sam Altman Moves Faster Than Zuckerberg
AI Engineering
AI Engineering
Feb 15, 2026 · Artificial Intelligence

Qwen3‑ASR Runs Natively on Apple Silicon via MLX for Full‑Speed Speech Recognition

A developer has re‑implemented the state‑of‑the‑art Qwen3‑ASR model in MLX, enabling native execution on Apple M1‑M4 chips with real‑time factors as low as 0.08, 4‑bit quantization speedups of 4.7×, multilingual support for 52 languages, and features such as word‑level timestamps and streaming transcription.

Apple SiliconMLXQuantization
0 likes · 5 min read
Qwen3‑ASR Runs Natively on Apple Silicon via MLX for Full‑Speed Speech Recognition
AI Engineering
AI Engineering
Feb 14, 2026 · Industry Insights

How Cloudflare’s Markdown for Agents Redefines AI Web Scraping

Cloudflare’s new Markdown for Agents feature lets AI systems request web pages as Markdown via content negotiation, cutting token usage by up to 80%, simplifying scraping pipelines, and signaling a broader shift in how AI consumes web content.

AI web scrapingCloudflareContent negotiation
0 likes · 6 min read
How Cloudflare’s Markdown for Agents Redefines AI Web Scraping
AI Engineering
AI Engineering
Feb 14, 2026 · Artificial Intelligence

ByteDance’s Seed 2.0 Pro Beats GPT‑5.2 High in Math Benchmarks

ByteDance’s newly released Seed 2.0 series, especially the Pro model, outperforms GPT‑5.2 High and Claude Opus on MathVista and MathVision tests, offers competitive coding scores, multimodal capabilities, and a pricing model up to four times cheaper, while still lagging behind in some programming and factual‑accuracy benchmarks.

ByteDanceCodeforcesGPT-5.2
0 likes · 4 min read
ByteDance’s Seed 2.0 Pro Beats GPT‑5.2 High in Math Benchmarks