Tagged articles
33 articles
Page 1 of 1
Lao Guo's Learning Space
Lao Guo's Learning Space
May 12, 2026 · Artificial Intelligence

Which Inference Framework Maximizes Your GPU Performance in 2026?

This article compares six popular LLM inference frameworks—vLLM, TensorRT‑LLM, llama.cpp, ds4.c, Ollama, and Omlx—across performance, ease of use, and hardware compatibility, then provides a practical matrix to help users select the best fit for their GPU.

Apple SiliconGPU performanceLLM inference
0 likes · 10 min read
Which Inference Framework Maximizes Your GPU Performance in 2026?
Old Zhang's AI Learning
Old Zhang's AI Learning
May 6, 2026 · Artificial Intelligence

Google Boosts Gemma 4 Inference Speed Up to 3× with MTP Drafter and Day‑0 vLLM Support

Google’s new Multi‑Token Prediction (MTP) drafter for Gemma 4 delivers up to three‑fold inference speedups across hardware and frameworks—validated by official benchmarks and independent DGX Spark tests—while preserving identical output quality, and is immediately usable via Hugging Face, vLLM, MLX, Ollama and edge‑device runtimes.

Apple SiliconGemma 4LLM inference
0 likes · 9 min read
Google Boosts Gemma 4 Inference Speed Up to 3× with MTP Drafter and Day‑0 vLLM Support
Machine Heart
Machine Heart
May 6, 2026 · Artificial Intelligence

Turning Your Mac into a Private AI Workstation with Cider and Mano‑P

The article analyzes how Ollama's shift to Apple’s MLX framework unlocks major speed gains on M5‑class Macs, then introduces the open‑source Cider inference accelerator and Mano‑P visual agent, detailing their quantization modes, benchmark results, hardware constraints, and how together they enable fast, offline private AI on macOS.

Apple SiliconCiderMLX
0 likes · 15 min read
Turning Your Mac into a Private AI Workstation with Cider and Mano‑P
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
May 3, 2026 · Artificial Intelligence

Running a 400B Mixture‑of‑Experts LLM on iPhone 17 Pro: Inside Flash‑MoE

The article details how the open‑source Flash‑MoE engine streams a 400‑billion‑parameter Mixture‑of‑Experts language model on an iPhone 17 Pro, achieving interactive‑level token throughput by eliminating Python dependencies, crafting a custom Metal pipeline, and streaming weights directly from SSD.

Apple SiliconFlash-MoEGCD
0 likes · 7 min read
Running a 400B Mixture‑of‑Experts LLM on iPhone 17 Pro: Inside Flash‑MoE
Geek Labs
Geek Labs
Apr 14, 2026 · Artificial Intelligence

Device‑Side Real‑Time Multimodal AI: Deep Dive into Two Open‑Source Projects

This article examines two open‑source projects—Parlor for on‑device multimodal inference and Gemma Tuner Multimodal for Apple Silicon fine‑tuning—detailing their architectures, privacy and cost benefits, performance on Apple M3 Pro, hands‑free VAD, streaming TTS, multilingual support, setup steps, and current limitations.

Apple SiliconGemma TunerParlor
0 likes · 8 min read
Device‑Side Real‑Time Multimodal AI: Deep Dive into Two Open‑Source Projects
Old Zhang's AI Learning
Old Zhang's AI Learning
Apr 13, 2026 · Artificial Intelligence

Fine‑Tune Any Large Model on Apple Silicon with mlx‑tune

The article introduces mlx‑tune, a community project that wraps the MLX library with Unsloth's API to enable local fine‑tuning of large language, vision, TTS, STT, OCR, and embedding models on Apple Silicon Macs, outlines its workflow from prototype to cloud, provides installation steps, code examples, and discusses its capabilities and limitations.

Apple SiliconUnsloth APIlarge language models
0 likes · 9 min read
Fine‑Tune Any Large Model on Apple Silicon with mlx‑tune
Old Zhang's AI Learning
Old Zhang's AI Learning
Apr 1, 2026 · Artificial Intelligence

Running Large Models Locally on Mac: The Most Powerful Current Solution

This article reviews the JANG quantization format, the vMLX inference engine with a five‑layer cache stack, and the MLX Studio GUI, showing how their combination enables 397B‑parameter models to fit on 128 GB Apple Silicon Macs, achieve up to 224× faster first‑token latency for 100K context, and provide a full‑featured local AI experience.

Apple SiliconJANGMLX Studio
0 likes · 8 min read
Running Large Models Locally on Mac: The Most Powerful Current Solution
Old Zhang's AI Learning
Old Zhang's AI Learning
Mar 19, 2026 · Artificial Intelligence

Testing the Hot oMLX on Mac: Claude‑Opus‑4.6 Distilled and Qwen3.5‑9B Performance Review

The article evaluates oMLX, a Mac‑only LLM runtime built on Apple Silicon and MLX, by walking through installation, UI features, memory usage, single‑request speed, benchmark results for Claude‑Opus‑4.6 and Qwen3.5‑9B, continuous batch processing gains, Claude Code optimizations, multi‑model support, and the failure to run a 27B model.

Apple SiliconBenchmarkClaude Opus
0 likes · 9 min read
Testing the Hot oMLX on Mac: Claude‑Opus‑4.6 Distilled and Qwen3.5‑9B Performance Review
AI Engineering
AI Engineering
Feb 27, 2026 · Artificial Intelligence

jina-grep: Adding Semantic Search Capabilities to Grep on Apple Silicon

Jina-grep is an open-source CLI that adds fast, MLX-powered semantic search to grep on Apple Silicon, offering three modes, sub-millisecond latency, high token throughput, and easy installation, making local code and log searching more accurate than keyword matching.

Apple SiliconCLIMLX
0 likes · 4 min read
jina-grep: Adding Semantic Search Capabilities to Grep on Apple Silicon
AI Engineering
AI Engineering
Feb 15, 2026 · Artificial Intelligence

Qwen3‑ASR Runs Natively on Apple Silicon via MLX for Full‑Speed Speech Recognition

A developer has re‑implemented the state‑of‑the‑art Qwen3‑ASR model in MLX, enabling native execution on Apple M1‑M4 chips with real‑time factors as low as 0.08, 4‑bit quantization speedups of 4.7×, multilingual support for 52 languages, and features such as word‑level timestamps and streaming transcription.

Apple SiliconMLXQwen3-ASR
0 likes · 5 min read
Qwen3‑ASR Runs Natively on Apple Silicon via MLX for Full‑Speed Speech Recognition
AI Engineering
AI Engineering
Jan 7, 2026 · Artificial Intelligence

Unsloth-MLX: Fine‑Tune LLMs on Mac and Seamlessly Move Code to Cloud GPUs

Unsloth‑MLX leverages Apple’s MLX framework to let Mac users with Apple Silicon fine‑tune large language models locally with a single import change, offering zero‑cost migration to cloud GPUs, supporting SFT, DPO, ORPO, GRPO training, and export to HuggingFace or GGUF formats.

Apple SiliconGPU cloudLLM fine-tuning
0 likes · 4 min read
Unsloth-MLX: Fine‑Tune LLMs on Mac and Seamlessly Move Code to Cloud GPUs
FunTester
FunTester
Feb 27, 2025 · Operations

Lume: A Lightweight CLI Tool for Managing macOS and Linux Virtual Machines on Apple Silicon

Lume is an open‑source, lightweight command‑line and API tool designed for Apple Silicon that simplifies creation, control, and automation of macOS and Linux virtual machines using macOS’s Virtualization.framework, offering installation methods, core commands, architecture details, advantages, limitations, and typical use cases.

Apple SiliconCLIVM management
0 likes · 8 min read
Lume: A Lightweight CLI Tool for Managing macOS and Linux Virtual Machines on Apple Silicon
21CTO
21CTO
Apr 28, 2024 · Artificial Intelligence

Run Meta Llama 3 Locally on Apple Silicon Macs with Ollama & OpenWebUI

This step‑by‑step guide shows how to install Ollama on Apple Silicon Macs, choose between Meta Llama 3 8B or 70B models, run them via Docker‑powered OpenWebUI, and interact with the LLM through a ChatGPT‑like interface, all while keeping data private.

Apple SiliconDockerMeta Llama 3
0 likes · 5 min read
Run Meta Llama 3 Locally on Apple Silicon Macs with Ollama & OpenWebUI
21CTO
21CTO
Mar 20, 2024 · Backend Development

Why Oracle Urges Java Developers to Skip macOS Sonoma 14.4 Update

Oracle warns Java developers that upgrading Apple Silicon macOS Sonoma to version 14.4 can crash JetBrains IDEs by sending a SIGKILL signal, and advises postponing the update until a fix is available, highlighting the impact on multiple JDK versions.

Apple SiliconIDEJava
0 likes · 5 min read
Why Oracle Urges Java Developers to Skip macOS Sonoma 14.4 Update
Programmer DD
Programmer DD
Jun 9, 2023 · Game Development

Can Macs Become Gaming Machines? Inside Apple’s New Game Porting Toolkit

Apple is reshaping macOS with a new Game Porting Toolkit and a dedicated Game Mode in macOS Sonoma, enabling easier Windows game migration to Apple Silicon, leveraging DirectX‑to‑Metal translation, while also positioning Macs as future gaming platforms alongside initiatives like Apple Vision Pro.

AppleApple SiliconDirectX
0 likes · 10 min read
Can Macs Become Gaming Machines? Inside Apple’s New Game Porting Toolkit
Open Source Linux
Open Source Linux
Aug 3, 2022 · Operations

How Asahi Linux Got Linux Running on Apple M2 Macs in Just One Month

After successfully porting Linux to M1 Macs, the Asahi Linux team swiftly extended support to M2 devices, delivering a new release that enables USB, NVMe, battery management, Wi‑Fi, and basic keyboard/trackpad functionality on M2 MacBook Pro, with experimental support for M2 MacBook Air and M1 Ultra Mac Studio, while outlining current limitations and future GPU driver goals.

Apple SiliconAsahi LinuxLinux
0 likes · 10 min read
How Asahi Linux Got Linux Running on Apple M2 Macs in Just One Month
Programmer DD
Programmer DD
May 28, 2022 · Backend Development

Visual Studio 2022 for Mac: Faster Native UI and .NET 6 Power

Microsoft's Visual Studio 2022 for Mac v17.0 GA introduces a fully native macOS UI, runs on .NET 6, offers Apple Silicon optimization, and delivers up to 50% performance gains while adding early support for .NET 7 and .NET MAUI cross‑platform development.

.NET 6Apple SiliconIDE
0 likes · 5 min read
Visual Studio 2022 for Mac: Faster Native UI and .NET 6 Power
Programmer DD
Programmer DD
Apr 20, 2022 · Backend Development

Why IntelliJ IDEA Feels Slow on M1 Macs and How to Fix It

After switching to an M1‑based MacBook Pro, many developers experience sluggish IntelliJ IDEA performance, but selecting the proper Apple‑silicon IDEA build and an ARM‑compatible JDK resolves the issue.

Apple SiliconIDE performanceIntelliJ IDEA
0 likes · 2 min read
Why IntelliJ IDEA Feels Slow on M1 Macs and How to Fix It
JavaScript
JavaScript
Apr 21, 2021 · Backend Development

What’s New in Node.js 16.0.0? Key Updates and Features

Node.js 16.0.0, built on the V8 engine, introduces a stabilized Timers Promises API, Apple Silicon pre‑built binaries, V8 9.0 upgrades, global btoa/atob functions, and npm 7.10.0, marking a significant step forward for JavaScript runtime performance and compatibility.

Apple SiliconBackendV8
0 likes · 1 min read
What’s New in Node.js 16.0.0? Key Updates and Features
Java Architect Essentials
Java Architect Essentials
Mar 15, 2021 · Industry Insights

Can Apple M1 Macs Mine Ethereum Effectively? A Hands‑On Test

This article documents a technical experiment that runs Ethereum mining software on an M1‑based MacBook Air, detailing the required code patches, build process, performance logs, and the resulting profit of roughly one Chinese yuan per day, while comparing the M1’s capabilities to traditional GPU miners.

Apple SiliconEthereumM1
0 likes · 9 min read
Can Apple M1 Macs Mine Ethereum Effectively? A Hands‑On Test
Laravel Tech Community
Laravel Tech Community
Dec 21, 2020 · Fundamentals

CMake 3.19.2 Release: Apple Silicon Support and Various Improvements

CMake 3.19.2 introduces Apple ARM support, new variables for host processor selection, enhanced error handling, updated FindHDF5, Java test controls, CI job updates, and numerous bug fixes across compilers, PCH, and platform detection, making the cross‑platform build system more robust and flexible.

Apple SiliconBuild SystemCMake
0 likes · 3 min read
CMake 3.19.2 Release: Apple Silicon Support and Various Improvements
Programmer DD
Programmer DD
Nov 23, 2020 · Fundamentals

Why Apple’s M1 Chip Struggles with Docker and Legacy Software – What You Need to Know

Apple’s M1 chip, based on ARM architecture, faces compatibility challenges with Docker and many mainstream x86 applications, prompting solutions like Rosetta 2 while highlighting cost advantages and future software support hurdles, as developers work to adapt tools such as Node, Python, Go, and browsers for Apple Silicon.

ARM architectureApple SiliconDocker Compatibility
0 likes · 6 min read
Why Apple’s M1 Chip Struggles with Docker and Legacy Software – What You Need to Know
Programmer DD
Programmer DD
Nov 20, 2020 · Backend Development

Can Docker Run Natively on Apple M1 Macs? What You Need to Know

Docker confirms plans to support Apple Silicon Macs but faces several technical hurdles, requiring pipeline changes, reliance on Go and Electron, and coordination with other virtualization tools like VMware and Parallels to deliver a seamless Docker Desktop experience on M1 devices.

Apple SiliconBackend DevelopmentDocker
0 likes · 4 min read
Can Docker Run Natively on Apple M1 Macs? What You Need to Know
JD Cloud Developers
JD Cloud Developers
Nov 2, 2020 · Artificial Intelligence

This Week’s Tech Highlights: AI Research Breakthroughs, 5G Surge, Multi‑Cloud DB & More

The newsletter recaps recent tech developments, including JD's four AI papers at Interspeech 2020, Shenzhen's supercomputing boost, T‑Mobile's mid‑band 5G expansion, Apple's upcoming A14T iMac processor, MongoDB Atlas multi‑cloud support, Wikimedia's migration to GitLab, and advances in graph neural network pre‑training and deep clustering.

5G expansionAI researchApple Silicon
0 likes · 9 min read
This Week’s Tech Highlights: AI Research Breakthroughs, 5G Surge, Multi‑Cloud DB & More