AI Engineering
Author

Focused on cutting‑edge product and technology information and practical experience sharing in the AI field (large models, MLOps/LLMOps, AI application development, AI infrastructure).

127 articles
Recent Articles

Feb 14, 2026 · Artificial Intelligence

DeepSeek‑V4‑Lite‑285B Hits 100% Recall in 256K Token Tests – A Needle‑in‑a‑Haystack Benchmark

Community testing of DeepSeek's rumored V4‑Lite‑285B model on the OpenAI MRCR 8‑needle benchmark shows perfect 1.0000 scores on several 128K‑token samples and on a 256K‑token sample, meaning 100% recall within the native 256K context. Recall drops to about 60% at longer contexts, and the article notes that "needle‑in‑a‑haystack" tests may be exploitable by DSA mechanisms.

DeepSeek, LLM, long context
3 min read

Feb 13, 2026 · Artificial Intelligence

ByteDance’s Open‑Source 12B‑Parameter Video Model “Alive” Runs on a Single RTX 3090/4090

ByteDance has open‑sourced the 12‑billion‑parameter video generation model Alive, which supports text‑to‑video/audio, image‑to‑video/audio, pure text‑to‑video and text‑to‑audio modes, runs on a 24 GB GPU, outperforms competitors in cross‑modal synchronization, and includes novel TA‑CrossAttn and UniTemp‑RoPE techniques.

Alive Model, ByteDance, Cross‑Modal Synchronization
5 min read

Feb 12, 2026 · Artificial Intelligence

MiniMax M2.5: 230B‑Parameter Model Activates 10B, Near Claude Opus for One‑Tenth the Cost

MiniMax’s new open‑source M2.5 model uses a 230‑billion‑parameter mixture‑of‑experts architecture that activates only 10 billion parameters at a time. It delivers performance comparable to Claude Opus 4.6 across benchmarks at roughly one‑tenth the cost, and it already handles a large share of the company’s internal tasks.

AI agents, Claude Opus, MiniMax M2.5
6 min read

Feb 12, 2026 · Artificial Intelligence

GLM-5 Unveiled: 744B‑Parameter Model Takes on Claude in Complex Tasks

GLM-5, the new 744‑billion‑parameter open‑source LLM, builds on GLM‑4.5 with the GlmMoeDsa architecture. It scores higher than Claude Opus 4.5 on the HLE benchmark, demonstrates strong long‑context and agent capabilities, supports vLLM and SGLang, runs on various Chinese chips, and can directly generate Office documents.

AI benchmarks, Chinese chips, Claude
5 min read

Feb 11, 2026 · Artificial Intelligence

Harrison Chase Explains Two Sandbox Architectures for AI Agents

The article analyzes why AI agents need isolated sandboxes, outlines two architectural patterns—running the agent inside a sandbox or using the sandbox as an external tool—compares their advantages and challenges, and provides concrete implementation examples and community insights.

AI agents, API, Docker
11 min read

Feb 10, 2026 · Industry Insights

Why the Internet Is Dead and Agents Are the Future

The article argues that traditional internet metrics like DAU and SaaS models are obsolete because AI agents have become the new software users, shifting value from attention to token consumption, and outlines six strategic "cuts" that illustrate how businesses must pivot to serve agents rather than humans.

AI agents, AI era, business model shift
12 min read

Feb 9, 2026 · Artificial Intelligence

Three Unconventional Ways to Run OpenClaw on Edge Devices

The article showcases three low‑cost edge deployments of OpenClaw—a $25 Moto phone using Termux, a $5 ESP32‑S3 board running a pure‑C MimiClaw, and the BotDrop Android app that turns an old phone into an AI agent host—detailing setup steps, challenges, and security considerations.

AI agents, Android, ESP32
10 min read

Feb 8, 2026 · Artificial Intelligence

Sora Now Allows Real‑Person Photo Uploads for Video Generation – How It Works and What’s Changed

OpenAI’s Sora has lifted its ban on real‑person photos, letting eligible users upload images to create videos, but requires consent confirmation, applies strict stylization and watermarking, enforces tighter checks for children, and reflects a market‑driven compromise between competition and safety compliance.

AI video generation, Market Competition, OpenAI
4 min read

Feb 5, 2026 · Artificial Intelligence

Claude Code Introduces Agents Teams for Autonomous Multi‑Agent Programming

Claude Code’s preview adds an Agents Teams mode in which a lead Claude instance delegates tasks to multiple independent agents. The agents code in parallel and communicate with each other directly, with two display options available. The mode consumes more tokens, and the article walks through several practical collaboration scenarios.

AI programming, Agents Teams, Claude Code
6 min read