Tagged articles

Llama 4

8 articles · Page 1 of 1

Apr 8, 2025 · Artificial Intelligence

Meta AI VP Responds to Llama 4 Controversies and Allegations of Benchmark Manipulation

Meta AI Vice President Ahmad Al‑Dahle addressed recent criticisms of the newly released Llama 4 model, denying claims of test‑set cheating, explaining quality variations as post‑release optimization, and acknowledging internal concerns that led to staff resignations and calls for transparency.

BenchmarkingLlama 4Meta AI

0 likes · 5 min read

Meta AI VP Responds to Llama 4 Controversies and Allegations of Benchmark Manipulation

DevOps

Apr 7, 2025 · Artificial Intelligence

Meta Llama 4 Scout, Maverick, and Behemoth: Architecture, NoPE Innovation, and Training Advances

The article introduces Meta's newly open‑sourced Llama 4 series—including Scout with a 1 billion‑token context window, Maverick with 400 billion parameters, and the upcoming Behemoth teacher model—detailing their expert‑mix architecture, the NoPE positional‑encoding removal, training pipelines, performance benchmarks, and infrastructure improvements for large‑scale AI research.

AI researchLarge Language ModelLlama 4

0 likes · 8 min read

Meta Llama 4 Scout, Maverick, and Behemoth: Architecture, NoPE Innovation, and Training Advances

DataFunTalk

Apr 7, 2025 · Artificial Intelligence

Llama 4 Open‑Source Release Marred by Performance Failures and Alleged Training‑Data Cheating

Meta's newly released Llama 4 quickly became a controversy as internal leaks reveal training‑data cheating, benchmark over‑optimization, and disappointing code‑generation performance that fails to match even older models, prompting resignations and widespread criticism from the AI community.

AI model performanceLlama 4Meta AI

0 likes · 7 min read

Llama 4 Open‑Source Release Marred by Performance Failures and Alleged Training‑Data Cheating

21CTO

Apr 7, 2025 · Artificial Intelligence

Llama 4 Unveiled: Breakthrough Multimodal Models Redefine AI Capabilities

Meta's Llama 4 series introduces the Scout, Maverick, and Behemoth models—featuring Mixture‑of‑Experts architectures, unprecedented 10‑million‑token context windows, and state‑of‑the‑art performance across vision, language, and multimodal benchmarks—while emphasizing efficient training, open‑source availability, and robust safety safeguards.

AI safetyLarge Language ModelLlama 4

0 likes · 14 min read

Llama 4 Unveiled: Breakthrough Multimodal Models Redefine AI Capabilities

AI Algorithm Path

Apr 6, 2025 · Artificial Intelligence

Meta’s Open-Source Llama 4: 2‑Trillion‑Parameter Behemoth Redefines AI

Meta’s newly released Llama 4 models—Maverick with 4 020 billion total parameters and Scout with 1 090 billion—feature a 128‑expert MoE, 10 million‑token context, native multimodal fusion, and FP8 training, delivering benchmark‑leading performance that outpaces GPT‑4o, Gemini 2.0 Flash and DeepSeek v3, while being openly available on Hugging Face and GitHub.

FP8 trainingLlama 4Meta AI

0 likes · 8 min read

Meta’s Open-Source Llama 4: 2‑Trillion‑Parameter Behemoth Redefines AI

Top Architect

Apr 6, 2025 · Artificial Intelligence

GPT-5 Delayed but Will Be Free, OpenAI Plans Open‑Source Model; Meta’s Llama 4 Continues to Be Postponed

OpenAI announced that GPT‑5 will be delayed yet offered for free, with upcoming releases of o3 and o4‑mini, while also promising an open‑source inference model, whereas Meta’s Llama 4 faces repeated postponements amid performance concerns and a massive AI infrastructure investment.

AI modelsGPT-5Llama 4

0 likes · 8 min read

GPT-5 Delayed but Will Be Free, OpenAI Plans Open‑Source Model; Meta’s Llama 4 Continues to Be Postponed

DataFunTalk

Apr 6, 2025 · Artificial Intelligence

Meta Unveils Llama 4: New Multimodal AI Models with Mixture‑of‑Experts Architecture and 10 Million‑Token Context

Meta announced the Llama 4 series—Scout, Maverick and Behemoth—featuring multimodal capabilities, Mixture‑of‑Experts design, up to 10 million‑token context windows, and state‑of‑the‑art performance on STEM, multilingual and image benchmarks, with models now downloadable from llama.com and Hugging Face.

Llama 4Long ContextMixture of Experts

0 likes · 14 min read

Meta Unveils Llama 4: New Multimodal AI Models with Mixture‑of‑Experts Architecture and 10 Million‑Token Context

Baobao Algorithm Notes

Apr 6, 2025 · Artificial Intelligence

Inside Llama 4: How Meta’s New Multimodal MoE Models Achieve 10M‑Token Contexts

Meta unveils Llama 4 Scout, Maverick, and the upcoming Behemoth, detailing their Mixture‑of‑Experts architecture, massive 10‑million‑token context windows, efficient FP8 training, safety mechanisms, and competitive benchmark results that surpass leading multimodal models.

AI safetyLlama 4Mixture of Experts

0 likes · 16 min read

Inside Llama 4: How Meta’s New Multimodal MoE Models Achieve 10M‑Token Contexts