Tagged articles
12 articles
Page 1 of 1
AI Engineer Programming
AI Engineer Programming
Apr 18, 2026 · Artificial Intelligence

How AI Fortune‑Telling Works—and Why It Can’t Truly Predict Love, Wealth, or Feng Shui

The article explains that predictive AI combines statistical analysis with machine learning, shows how recommendation systems and large language models generate seemingly personal fortune‑telling results, and outlines five fundamental reasons—data limits, hidden variables, randomness, cumulative small effects, and self‑fulfilling predictions—that prevent reliable forecasts of personal destiny.

AI predictiondata limitationsemergent abilities
0 likes · 13 min read
How AI Fortune‑Telling Works—and Why It Can’t Truly Predict Love, Wealth, or Feng Shui
JD Tech
JD Tech
Jan 13, 2026 · Artificial Intelligence

Mastering Large Language Models: Transformers, Scaling Laws, and MoE Explained

This extensive guide walks readers through the fundamentals of large language models, covering transformer architecture, pre‑training and fine‑tuning techniques, scaling laws, emergent abilities, mixture‑of‑experts designs, and practical comparisons, providing clear explanations, code snippets, and visual illustrations for deep learning practitioners.

Fine-tuningMixture of Expertsemergent abilities
0 likes · 47 min read
Mastering Large Language Models: Transformers, Scaling Laws, and MoE Explained
AIWalker
AIWalker
Aug 6, 2025 · Artificial Intelligence

Why ByteDance’s 7B BAGEL Model Rivals GPT‑4o in Unified Multimodal Understanding and Generation

The article provides an in‑depth technical analysis of ByteDance’s 7‑billion‑parameter BAGEL model, detailing its MoT architecture, high‑quality interleaved multimodal pre‑training data, multi‑stage training strategy, emergent capabilities, and extensive benchmark results that show BAGEL matching or surpassing GPT‑4o on vision‑language tasks.

BAGELGPT-4o comparisonMultimodal AI
0 likes · 24 min read
Why ByteDance’s 7B BAGEL Model Rivals GPT‑4o in Unified Multimodal Understanding and Generation
DataFunTalk
DataFunTalk
Jul 16, 2025 · Artificial Intelligence

How Jason Wei’s Breakthroughs Are Shaping the Future of Large Language Models

Jason Wei, a former Google Brain and OpenAI researcher now at Meta, has driven key advances in large language models—including chain‑of‑thought prompting, instruction tuning, emergent abilities, zero‑shot learning, and data augmentation—shaping both AI research paradigms and real‑world applications.

Chain-of-ThoughtInstruction Tuningemergent abilities
0 likes · 7 min read
How Jason Wei’s Breakthroughs Are Shaping the Future of Large Language Models
Architects' Tech Alliance
Architects' Tech Alliance
Sep 4, 2024 · Fundamentals

Why Bigger Transformers Win: Scaling Laws and Parallel Computing Essentials

The article explains OpenAI's 2020 Scaling Laws that show larger transformer models, more data, and greater compute consistently improve performance, introduces the concept of emergent abilities at critical size thresholds, and outlines the core principles of parallel computing such as multi‑processor usage, task decomposition, concurrent execution, and inter‑processor communication.

communicationconcurrencyemergent abilities
0 likes · 6 min read
Why Bigger Transformers Win: Scaling Laws and Parallel Computing Essentials
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 11, 2023 · Artificial Intelligence

Comprehensive Technical Overview of GPT Series, Transformers, and Emerging Capabilities in Large Language Models

This article provides a detailed technical review of the evolution of GPT models, the Transformer architecture, large language model training methods, emergent abilities such as in‑context learning and chain‑of‑thought, multimodal extensions, and the challenges of data, scaling, and alignment, offering a holistic view for researchers and practitioners.

AIGPTInstructGPT
0 likes · 28 min read
Comprehensive Technical Overview of GPT Series, Transformers, and Emerging Capabilities in Large Language Models
Architecture Digest
Architecture Digest
Feb 17, 2023 · Artificial Intelligence

Analyzing the Emergent Abilities of ChatGPT and the Technical Roadmap of GPT‑3.5

This article dissects how ChatGPT acquired its surprising capabilities by tracing the evolution from the original GPT‑3 model through instruction tuning, code‑based pre‑training, and reinforcement learning from human feedback, ultimately presenting a comprehensive technical roadmap for reproducing GPT‑3.5‑scale models.

ChatGPTGPT-3.5Instruction Tuning
0 likes · 26 min read
Analyzing the Emergent Abilities of ChatGPT and the Technical Roadmap of GPT‑3.5
Architect
Architect
Feb 9, 2023 · Artificial Intelligence

Emergent Abilities of Large Language Models: Complex Reasoning, Knowledge Reasoning, and Out‑of‑Distribution Robustness

This article reviews recent research on the emergent abilities of large language models—such as chain‑of‑thought reasoning, knowledge retrieval without external sources, and robustness to distribution shifts—examining scaling laws, model size thresholds, and the open questions surrounding a potential paradigm shift from fine‑tuning to in‑context learning.

AI researchchain-of-thought promptingemergent abilities
0 likes · 23 min read
Emergent Abilities of Large Language Models: Complex Reasoning, Knowledge Reasoning, and Out‑of‑Distribution Robustness
IT Architects Alliance
IT Architects Alliance
Feb 9, 2023 · Artificial Intelligence

Analyzing the Evolution and Emergent Abilities of GPT‑3.5 Models

This article examines how OpenAI's GPT‑3.5 series evolved from the original GPT‑3 through large‑scale pre‑training, instruction tuning, code training, and RLHF, detailing the origins of language generation, world knowledge, in‑context learning, code understanding, complex reasoning, and the trade‑offs introduced by alignment.

Code TrainingGPT-3.5RLHF
0 likes · 25 min read
Analyzing the Evolution and Emergent Abilities of GPT‑3.5 Models
Top Architect
Top Architect
Feb 8, 2023 · Artificial Intelligence

A Technical Roadmap of GPT‑3.5: From Pre‑training to RLHF and Emerging Capabilities

This article analyses how ChatGPT and the GPT‑3.5 series evolved from the original GPT‑3 through large‑scale pre‑training, code‑based training, instruction tuning, and reinforcement learning from human feedback, identifying the origins of their language generation, in‑context learning, world knowledge, code understanding, chain‑of‑thought reasoning, and alignment capabilities while also outlining current limitations.

ChatGPTGPT-3.5Instruction Tuning
0 likes · 27 min read
A Technical Roadmap of GPT‑3.5: From Pre‑training to RLHF and Emerging Capabilities
21CTO
21CTO
Dec 29, 2022 · Artificial Intelligence

Uncovering ChatGPT’s Emergent Abilities: A Technical Roadmap from GPT‑3 to GPT‑3.5

This article analyses how OpenAI’s ChatGPT evolved from the original GPT‑3 model, tracing the emergence of language generation, world knowledge, in‑context learning, code training, instruction tuning, and reinforcement learning from human feedback, and highlights both its strengths and current limitations.

ChatGPTGPT-3.5Instruction Tuning
0 likes · 27 min read
Uncovering ChatGPT’s Emergent Abilities: A Technical Roadmap from GPT‑3 to GPT‑3.5