Tagged articles

12 articles

Page 1 of 1

Apr 18, 2026 · Artificial Intelligence

How AI Fortune‑Telling Works—and Why It Can’t Truly Predict Love, Wealth, or Feng Shui

The article explains that predictive AI combines statistical analysis with machine learning, shows how recommendation systems and large language models generate seemingly personal fortune‑telling results, and outlines five fundamental reasons—data limits, hidden variables, randomness, cumulative small effects, and self‑fulfilling predictions—that prevent reliable forecasts of personal destiny.

AI predictiondata limitationsemergent abilities

0 likes · 13 min read

How AI Fortune‑Telling Works—and Why It Can’t Truly Predict Love, Wealth, or Feng Shui

JD Tech

Jan 13, 2026 · Artificial Intelligence

Mastering Large Language Models: Transformers, Scaling Laws, and MoE Explained

This extensive guide walks readers through the fundamentals of large language models, covering transformer architecture, pre‑training and fine‑tuning techniques, scaling laws, emergent abilities, mixture‑of‑experts designs, and practical comparisons, providing clear explanations, code snippets, and visual illustrations for deep learning practitioners.

Fine-tuningMixture of Expertsemergent abilities

0 likes · 47 min read

Mastering Large Language Models: Transformers, Scaling Laws, and MoE Explained

AIWalker

Aug 6, 2025 · Artificial Intelligence

Why ByteDance’s 7B BAGEL Model Rivals GPT‑4o in Unified Multimodal Understanding and Generation

The article provides an in‑depth technical analysis of ByteDance’s 7‑billion‑parameter BAGEL model, detailing its MoT architecture, high‑quality interleaved multimodal pre‑training data, multi‑stage training strategy, emergent capabilities, and extensive benchmark results that show BAGEL matching or surpassing GPT‑4o on vision‑language tasks.

BAGELGPT-4o comparisonMultimodal AI

0 likes · 24 min read

Why ByteDance’s 7B BAGEL Model Rivals GPT‑4o in Unified Multimodal Understanding and Generation

DataFunTalk

Jul 16, 2025 · Artificial Intelligence

How Jason Wei’s Breakthroughs Are Shaping the Future of Large Language Models

Jason Wei, a former Google Brain and OpenAI researcher now at Meta, has driven key advances in large language models—including chain‑of‑thought prompting, instruction tuning, emergent abilities, zero‑shot learning, and data augmentation—shaping both AI research paradigms and real‑world applications.

Chain-of-ThoughtInstruction Tuningemergent abilities

0 likes · 7 min read

How Jason Wei’s Breakthroughs Are Shaping the Future of Large Language Models

Architects' Tech Alliance

Sep 4, 2024 · Fundamentals

Why Bigger Transformers Win: Scaling Laws and Parallel Computing Essentials

The article explains OpenAI's 2020 Scaling Laws that show larger transformer models, more data, and greater compute consistently improve performance, introduces the concept of emergent abilities at critical size thresholds, and outlines the core principles of parallel computing such as multi‑processor usage, task decomposition, concurrent execution, and inter‑processor communication.

communicationconcurrencyemergent abilities

0 likes · 6 min read

Why Bigger Transformers Win: Scaling Laws and Parallel Computing Essentials

Rare Earth Juejin Tech Community

Jun 11, 2023 · Artificial Intelligence

Comprehensive Technical Overview of GPT Series, Transformers, and Emerging Capabilities in Large Language Models

This article provides a detailed technical review of the evolution of GPT models, the Transformer architecture, large language model training methods, emergent abilities such as in‑context learning and chain‑of‑thought, multimodal extensions, and the challenges of data, scaling, and alignment, offering a holistic view for researchers and practitioners.

AIGPTInstructGPT

0 likes · 28 min read

Comprehensive Technical Overview of GPT Series, Transformers, and Emerging Capabilities in Large Language Models

Architecture Digest

Feb 17, 2023 · Artificial Intelligence

Analyzing the Emergent Abilities of ChatGPT and the Technical Roadmap of GPT‑3.5

This article dissects how ChatGPT acquired its surprising capabilities by tracing the evolution from the original GPT‑3 model through instruction tuning, code‑based pre‑training, and reinforcement learning from human feedback, ultimately presenting a comprehensive technical roadmap for reproducing GPT‑3.5‑scale models.

ChatGPTGPT-3.5Instruction Tuning

0 likes · 26 min read

Analyzing the Emergent Abilities of ChatGPT and the Technical Roadmap of GPT‑3.5

DataFunTalk

Feb 10, 2023 · Artificial Intelligence

ChatGPT: A Revolutionary Breakthrough, Its Core Capabilities, and Impact on Investment Research

This article analyzes why ChatGPT represents a revolutionary advance in AI, explores its emergent abilities and code‑training advantages, evaluates its practical value for investment research through real‑world comparisons with experts, and discusses future trends and challenges for large language models.

AIChatGPTCode Training

0 likes · 16 min read

ChatGPT: A Revolutionary Breakthrough, Its Core Capabilities, and Impact on Investment Research

Architect

Feb 9, 2023 · Artificial Intelligence

Emergent Abilities of Large Language Models: Complex Reasoning, Knowledge Reasoning, and Out‑of‑Distribution Robustness

This article reviews recent research on the emergent abilities of large language models—such as chain‑of‑thought reasoning, knowledge retrieval without external sources, and robustness to distribution shifts—examining scaling laws, model size thresholds, and the open questions surrounding a potential paradigm shift from fine‑tuning to in‑context learning.

AI researchchain-of-thought promptingemergent abilities

0 likes · 23 min read

Emergent Abilities of Large Language Models: Complex Reasoning, Knowledge Reasoning, and Out‑of‑Distribution Robustness

IT Architects Alliance

Feb 9, 2023 · Artificial Intelligence

Analyzing the Evolution and Emergent Abilities of GPT‑3.5 Models

This article examines how OpenAI's GPT‑3.5 series evolved from the original GPT‑3 through large‑scale pre‑training, instruction tuning, code training, and RLHF, detailing the origins of language generation, world knowledge, in‑context learning, code understanding, complex reasoning, and the trade‑offs introduced by alignment.

Code TrainingGPT-3.5RLHF

0 likes · 25 min read

Analyzing the Evolution and Emergent Abilities of GPT‑3.5 Models

Top Architect

Feb 8, 2023 · Artificial Intelligence

A Technical Roadmap of GPT‑3.5: From Pre‑training to RLHF and Emerging Capabilities

This article analyses how ChatGPT and the GPT‑3.5 series evolved from the original GPT‑3 through large‑scale pre‑training, code‑based training, instruction tuning, and reinforcement learning from human feedback, identifying the origins of their language generation, in‑context learning, world knowledge, code understanding, chain‑of‑thought reasoning, and alignment capabilities while also outlining current limitations.

ChatGPTGPT-3.5Instruction Tuning

0 likes · 27 min read

A Technical Roadmap of GPT‑3.5: From Pre‑training to RLHF and Emerging Capabilities

21CTO

Dec 29, 2022 · Artificial Intelligence

Uncovering ChatGPT’s Emergent Abilities: A Technical Roadmap from GPT‑3 to GPT‑3.5

This article analyses how OpenAI’s ChatGPT evolved from the original GPT‑3 model, tracing the emergence of language generation, world knowledge, in‑context learning, code training, instruction tuning, and reinforcement learning from human feedback, and highlights both its strengths and current limitations.

ChatGPTGPT-3.5Instruction Tuning

0 likes · 27 min read

Uncovering ChatGPT’s Emergent Abilities: A Technical Roadmap from GPT‑3 to GPT‑3.5