Tagged articles
10 articles
Page 1 of 1
DataFunTalk
DataFunTalk
Jul 20, 2025 · Artificial Intelligence

Why Meta’s AI Pioneer Yang Li‑kun Is Being Marginalized: Power Struggles Behind the Scenes

The article examines how Meta’s CEO Mark Zuckerberg’s aggressive talent‑buying and commercial focus have sidelined Turing‑award winner Yang Li‑kun, detailing the restructuring of Meta’s AI labs, the clash over research directions, and the broader dilemma of balancing academic innovation with business imperatives in the AI industry.

AI industryAI researchJEPA
0 likes · 14 min read
Why Meta’s AI Pioneer Yang Li‑kun Is Being Marginalized: Power Struggles Behind the Scenes
DataFunTalk
DataFunTalk
Jul 19, 2025 · Artificial Intelligence

Meta’s Quest for Personal Superintelligence: Zuckerberg’s AI Vision & Massive Compute Push

In a candid interview, Mark Zuckerberg outlines Meta’s ambition to deliver a personal superintelligence through AI‑powered glasses, predicts a two‑to‑three‑year timeline for superintelligent models, and details the company’s multibillion‑dollar investment in gigawatt‑scale data centers, talent recruitment, and internal AI enhancements while acknowledging monetisation challenges.

AI strategyMeta AISuperintelligence
0 likes · 19 min read
Meta’s Quest for Personal Superintelligence: Zuckerberg’s AI Vision & Massive Compute Push
DataFunTalk
DataFunTalk
Apr 7, 2025 · Artificial Intelligence

Llama 4 Open‑Source Release Marred by Performance Failures and Alleged Training‑Data Cheating

Meta's newly released Llama 4 quickly became a controversy as internal leaks reveal training‑data cheating, benchmark over‑optimization, and disappointing code‑generation performance that fails to match even older models, prompting resignations and widespread criticism from the AI community.

AI model performanceCode GenerationLlama 4
0 likes · 7 min read
Llama 4 Open‑Source Release Marred by Performance Failures and Alleged Training‑Data Cheating
AI Algorithm Path
AI Algorithm Path
Apr 6, 2025 · Artificial Intelligence

Meta’s Open-Source Llama 4: 2‑Trillion‑Parameter Behemoth Redefines AI

Meta’s newly released Llama 4 models—Maverick with 4 020 billion total parameters and Scout with 1 090 billion—feature a 128‑expert MoE, 10 million‑token context, native multimodal fusion, and FP8 training, delivering benchmark‑leading performance that outpaces GPT‑4o, Gemini 2.0 Flash and DeepSeek v3, while being openly available on Hugging Face and GitHub.

BenchmarkFP8 trainingLlama 4
0 likes · 8 min read
Meta’s Open-Source Llama 4: 2‑Trillion‑Parameter Behemoth Redefines AI
Programmer DD
Programmer DD
Jul 25, 2024 · Artificial Intelligence

How to Run Meta’s New Llama 3.1 Model Locally with Ollama

Meta’s latest open‑source Llama 3.1 model, available in 8B, 70B, and 405B sizes, is evaluated against top competitors and can be easily run locally on the 8B version using Ollama with a simple step‑by‑step guide.

Llama 3.1Meta AIOllama
0 likes · 4 min read
How to Run Meta’s New Llama 3.1 Model Locally with Ollama
NewBeeNLP
NewBeeNLP
Apr 19, 2024 · Artificial Intelligence

Llama 3 Unveiled: 8B & 70B Models Set New SOTA Across Benchmarks

Meta announced the open‑source Llama 3 series (8B and 70B parameters), detailing its decoder‑only Transformer architecture, 15 T‑token multilingual training data, superior benchmark scores over competitors, a limited 8K context window, and upcoming cloud and web‑based deployments.

BenchmarkLlama 3Meta AI
0 likes · 7 min read
Llama 3 Unveiled: 8B & 70B Models Set New SOTA Across Benchmarks
21CTO
21CTO
Aug 28, 2023 · Artificial Intelligence

What Is Code Llama? Meta’s Open-Source LLM for Generating Code

Code Llama, Meta’s specialized extension of Llama 2, is a large language model fine‑tuned on code data that can generate, complete, and debug software across multiple languages, supports up to 100 k tokens, and is freely available for research and commercial use.

Code GenerationCode LlamaLLM
0 likes · 5 min read
What Is Code Llama? Meta’s Open-Source LLM for Generating Code
21CTO
21CTO
Jul 23, 2023 · Artificial Intelligence

What Nathan Lambert Reveals About Meta’s Llama 2: Key Insights and Technical Deep‑Dive

This article translates and analyzes Nathan Lambert’s commentary on Meta’s Llama 2 paper, detailing the model’s architecture, training data, RLHF pipeline, reward models, evaluation methods, safety improvements, licensing terms, and the broader implications for open‑source large language models.

Llama-2Meta AIModel Evaluation
0 likes · 22 min read
What Nathan Lambert Reveals About Meta’s Llama 2: Key Insights and Technical Deep‑Dive