Tagged articles
76 articles
Page 1 of 1
Geek Labs
Geek Labs
May 6, 2026 · Artificial Intelligence

Build a GPT from Scratch and Decode AI Coding Jargon with Two Top GitHub Projects

The article introduces two practical GitHub repositories—how-to-train-your-gpt, a step‑by‑step guide that builds a LLaMA‑style GPT model across 12 chapters, and dictionary-of-ai-coding, a plain‑language glossary of AI‑coding terms—showing how they together provide a complete understanding of modern LLM fundamentals and terminology.

AIGPTGitHub
0 likes · 9 min read
Build a GPT from Scratch and Decode AI Coding Jargon with Two Top GitHub Projects
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Apr 11, 2026 · Artificial Intelligence

From Claude Code to Codex: Migrating Anthropic’s Harness Design

The author reproduces Anthropic’s long‑running harness architecture on a Codex + GPT stack, separates planner, generator, and evaluator roles, persists state to concrete artifacts, adds strict execution constraints, and demonstrates that the approach improves task success despite higher costs, while highlighting practical pitfalls and cost‑control strategies.

Agent HarnessAnthropicClaude Code
0 likes · 12 min read
From Claude Code to Codex: Migrating Anthropic’s Harness Design
Data STUDIO
Data STUDIO
Feb 25, 2026 · Artificial Intelligence

Build a Large Language Model from Scratch with PyTorch—No Libraries, No Shortcuts

This guide walks you through building, training, and fine‑tuning a Transformer‑based large language model entirely from scratch using PyTorch, covering tokenization, self‑attention, multi‑head attention, positional encoding, model architecture, data preparation, training loops, and fine‑tuning on custom lyrics.

Fine-tuningGPTLLM
0 likes · 43 min read
Build a Large Language Model from Scratch with PyTorch—No Libraries, No Shortcuts
Shuge Unlimited
Shuge Unlimited
Feb 20, 2026 · Artificial Intelligence

Gemini 3.1 Pro Boosts Reasoning Ability by 148% – What’s New?

Google’s Gemini 3.1 Pro jumps to a 77.1% ARC‑AGI‑2 score—a 148% gain over its predecessor—offering stronger reasoning, agentic workflows, SVG generation and multimodal support, while the article compares its performance with Claude, GPT and outlines preview‑stage caveats.

AI reasoningARC-AGI-2Benchmark
0 likes · 15 min read
Gemini 3.1 Pro Boosts Reasoning Ability by 148% – What’s New?
AI Cyberspace
AI Cyberspace
Feb 15, 2026 · Artificial Intelligence

From GPT-1 to GPT-4o: A Deep Dive into the Evolution of Large Language Models

This article chronicles the rapid progression of GPT models from the 2018 GPT‑1 pre‑training breakthrough through GPT‑2’s multitask learning, GPT‑3’s scaling laws and few‑shot abilities, to GPT‑4’s multimodal capabilities and the 2024 GPT‑4 Turbo, Sora, and GPT‑4o releases, while also explaining core LLM abilities and the decoder‑only architecture of GPT‑2.

AI evolutionFew‑Shot LearningGPT
0 likes · 20 min read
From GPT-1 to GPT-4o: A Deep Dive into the Evolution of Large Language Models
AI Cyberspace
AI Cyberspace
Jan 13, 2026 · Artificial Intelligence

From Symbolic AI to LLMs: A Complete NLP History and Model Guide

This article provides a comprehensive overview of natural language processing, tracing its evolution from early symbolic and statistical stages through deep learning breakthroughs, detailing sequence models, key NLP tasks, text representation methods, and the development of modern architectures like RNN, LSTM, GRU, Transformer, and GPT series.

Deep LearningGPTLSTM
0 likes · 60 min read
From Symbolic AI to LLMs: A Complete NLP History and Model Guide
Practical DevOps Architecture
Practical DevOps Architecture
Nov 10, 2025 · Operations

Convert MBR to GPT and Resize Linux Partitions Without Data Loss

This guide walks you through checking the current partition layout, converting an MBR‑styled disk to GPT using fdisk and GNU Parted, recreating the partition, formatting it with ext4, mounting it, and finally expanding the filesystem to utilize the full cloud‑disk capacity, all while preserving existing data.

GPTLinuxPartition
0 likes · 7 min read
Convert MBR to GPT and Resize Linux Partitions Without Data Loss
MoonWebTeam
MoonWebTeam
Oct 1, 2025 · Artificial Intelligence

Unlocking ChatGPT: A Deep Dive into Transformers, Tokenization, and Self‑Attention

This tutorial walks through the fundamentals of ChatGPT by explaining language modeling, character‑level tokenization, data preprocessing pipelines, the evolution from simple bigram models to scaled dot‑product self‑attention, multi‑head mechanisms, full Transformer blocks, and how to train and generate Shakespeare‑style text with a GPT model.

ChatGPTGPTLanguage Modeling
0 likes · 50 min read
Unlocking ChatGPT: A Deep Dive into Transformers, Tokenization, and Self‑Attention
Raymond Ops
Raymond Ops
Aug 24, 2025 · Operations

Mastering Parted: From MBR/GPT Basics to Automated Disk Partitioning

This guide explains the purpose and advantages of the Parted tool, compares MBR and GPT partition tables, demonstrates interactive and scripted usage with detailed command examples for creating, deleting, formatting, and mounting partitions on large disks, and provides a complete end‑to‑end workflow.

GPTLinuxMBR
0 likes · 8 min read
Mastering Parted: From MBR/GPT Basics to Automated Disk Partitioning
21CTO
21CTO
Aug 15, 2025 · Artificial Intelligence

Build a Working AI Agent in 5 Days: Guide with GPT, n8n, CrewAI & Streamlit

Learn how to create a functional AI agent from scratch in just five days by leveraging GPT for the brain, n8n for automation, CrewAI for multi‑agent orchestration, Cursor for code generation, and Streamlit for a user interface, complete with step‑by‑step instructions and ready‑to‑run Python code.

AI agentsCrewAIGPT
0 likes · 15 min read
Build a Working AI Agent in 5 Days: Guide with GPT, n8n, CrewAI & Streamlit
Architects' Tech Alliance
Architects' Tech Alliance
Jun 11, 2025 · Artificial Intelligence

From Transformers to DeepSeek‑R1: The 2017‑2025 Evolution of Large Language Models

This article chronicles the rapid development of large language models from the 2017 Transformer breakthrough through the rise of BERT, GPT‑3, ChatGPT, multimodal systems like GPT‑4V/o, and the recent cost‑efficient DeepSeek‑R1, highlighting key architectural innovations, scaling trends, alignment techniques, and their transformative impact on AI research and industry.

AI AlignmentBERTCost‑Efficient Inference
0 likes · 26 min read
From Transformers to DeepSeek‑R1: The 2017‑2025 Evolution of Large Language Models
Cognitive Technology Team
Cognitive Technology Team
Feb 9, 2025 · Artificial Intelligence

A Beginner’s Guide to the History and Key Concepts of Deep Learning

From the perceptron’s inception in 1958 to modern Transformer-based models like GPT, this article traces the evolution of deep learning, explaining foundational architectures such as DNNs, CNNs, RNNs, LSTMs, attention mechanisms, and recent innovations like DeepSeek’s MLA, highlighting their principles and impact.

Deep LearningGPTMLA
0 likes · 19 min read
A Beginner’s Guide to the History and Key Concepts of Deep Learning
DevOps
DevOps
Dec 5, 2024 · Artificial Intelligence

A Brief History of Artificial Intelligence: From McCulloch‑Pitts Neurons to GPT‑4

This article traces the evolution of artificial intelligence from the 1943 McCulloch‑Pitts neuron model through key milestones such as Turing's test, the Dartmouth conference, the rise of neural networks, deep learning breakthroughs, and recent large language models like GPT‑4, illustrating the field's rapid progress.

GPTNeural Networksartificial intelligence
0 likes · 7 min read
A Brief History of Artificial Intelligence: From McCulloch‑Pitts Neurons to GPT‑4
Tencent Cloud Developer
Tencent Cloud Developer
Oct 30, 2024 · Artificial Intelligence

Comprehensive Survey of AIGC Research: Papers, Resources, and Technical Overview

This survey acts as a comprehensive portal that organizes AIGC research across seven domains—text, image, and audio generation, cross‑modal association, text‑guided image and audio synthesis, and supporting resources—detailing seminal models such as GPT, Diffusion, CLIP, DALL·E, Stable Diffusion, MusicLM, and key papers that shaped each field.

AIGCCLIPComputer Vision
0 likes · 19 min read
Comprehensive Survey of AIGC Research: Papers, Resources, and Technical Overview
Open Source Linux
Open Source Linux
Aug 6, 2024 · Artificial Intelligence

What Is AI? A Beginner’s Guide to Definitions, Types, and Real‑World Impact

This article explains what artificial intelligence (AI) is, how it differs from traditional programming, outlines its main categories, introduces machine learning, deep learning, neural network models such as CNN, RNN, and Transformer, describes large models and GPT, and discusses AI’s wide‑range applications and societal implications.

AIAI applicationsDeep Learning
0 likes · 16 min read
What Is AI? A Beginner’s Guide to Definitions, Types, and Real‑World Impact
Architects' Tech Alliance
Architects' Tech Alliance
Jul 10, 2024 · Industry Insights

Why AI Large Models Are Driving the Next Industrial Revolution

The article analyzes the rapid evolution of AI large models—from their role in advancing AGI through massive pre‑training and fine‑tuning, to current market dynamics led by GPT and domestic Chinese players, and finally to future multimodal applications, content‑factory capabilities, and emerging AIGC revenue models projected to reach trillion‑yuan scales by 2030.

AIAIGCGPT
0 likes · 7 min read
Why AI Large Models Are Driving the Next Industrial Revolution
JD Tech Talk
JD Tech Talk
Jun 7, 2024 · Artificial Intelligence

AI‑Powered JUnit Rule for Automatic Error Reporting to GPT

This tutorial shows Java engineers how to build a JUnit Rule that captures test failures, extracts the exception stack and source file, and automatically sends the information to OpenAI's GPT for analysis and code‑fix suggestions, complete with reusable data‑model classes and utility methods.

AIAutomationGPT
0 likes · 13 min read
AI‑Powered JUnit Rule for Automatic Error Reporting to GPT
DataFunTalk
DataFunTalk
Apr 26, 2024 · Artificial Intelligence

Large Language Models in the Automotive Industry: Overview, Impact, and Practical Exploration

This article examines how large language models such as GPT and Transformer‑based architectures are reshaping the automotive sector by enhancing in‑vehicle intelligence, streamlining product development, improving customer service, and redefining data analyst roles, while also presenting practical experiments, deployment challenges, and future directions.

Automotive AIGPTLLM applications
0 likes · 18 min read
Large Language Models in the Automotive Industry: Overview, Impact, and Practical Exploration
NewBeeNLP
NewBeeNLP
Apr 11, 2024 · Artificial Intelligence

How Karpathy Built a 1,000‑Line C LLM Trainer Without Any Deep‑Learning Framework

Andrej Karpathy released LLM.C, a pure C/CUDA implementation that trains GPT‑2‑style models in about 1,000 lines of code, detailing manual forward/backward passes, memory allocation tricks, SIMD CPU acceleration, CUDA porting, and migration tutorials, while comparing it to PyTorch and discussing broader LLM OS implications.

C programmingCUDAGPT
0 likes · 6 min read
How Karpathy Built a 1,000‑Line C LLM Trainer Without Any Deep‑Learning Framework
DataFunTalk
DataFunTalk
Dec 26, 2023 · Artificial Intelligence

The Evolution of AI and Its Challenges in the Data Industry

This article reviews the historical development of artificial intelligence, explains how AI technologies such as large language models are reshaping data processing and analysis, and discusses the practical challenges, trust issues, and governance requirements when applying AI to the data industry.

Data GovernanceData IndustryGPT
0 likes · 10 min read
The Evolution of AI and Its Challenges in the Data Industry
DataFunSummit
DataFunSummit
Dec 21, 2023 · Artificial Intelligence

The Evolution of AI and Its Challenges and Opportunities in the Data Industry

This article reviews the historical development of artificial intelligence, examines how AI—especially large language models like GPT—can transform data analysis and governance, and outlines the practical challenges, reliability concerns, and future opportunities of integrating AI into the data industry.

Data AnalyticsGPTartificial intelligence
0 likes · 8 min read
The Evolution of AI and Its Challenges and Opportunities in the Data Industry
Ximalaya Technology Team
Ximalaya Technology Team
Nov 13, 2023 · Artificial Intelligence

Step-by-Step Guide to Creating Custom GPTs with Knowledge and Action Plugins

This step‑by‑step tutorial shows how to build a custom nutrition‑analysis GPT—named “Nutri Buddy”—by logging into ChatGPT, creating a GPT, uploading knowledge files, configuring an Action plugin that calls the Boohee food API for calorie, protein and carbohydrate data, and then publishing the assistant with private, link‑only, or public visibility.

Custom AIGPTKnowledge Base
0 likes · 8 min read
Step-by-Step Guide to Creating Custom GPTs with Knowledge and Action Plugins
Architect
Architect
Oct 12, 2023 · Artificial Intelligence

Evolution of Language Models: From Statistical N‑grams to GPT‑4

This article provides a comprehensive overview of natural language processing and language‑model research, tracing the historical development from early rule‑based and statistical N‑gram models through neural network approaches such as RNN, LSTM, ELMo, and Transformer, and detailing the architectures, strengths, and limitations of the GPT series up to GPT‑4, while also discussing evaluation metrics, practical applications, and future challenges.

GPTNLPartificial intelligence
0 likes · 34 min read
Evolution of Language Models: From Statistical N‑grams to GPT‑4
Zhuanzhuan Tech
Zhuanzhuan Tech
Sep 28, 2023 · Artificial Intelligence

Evolution of Language Models and an Overview of the GPT Series

This article surveys the development of natural language processing from early rule‑based systems through statistical n‑gram models, neural language models, RNNs, LSTMs, ELMo, Transformers and BERT, and then details the architecture, training methods, advantages and limitations of the GPT‑1, GPT‑2, GPT‑3, ChatGPT and GPT‑4 models, concluding with a discussion of future challenges and references.

Deep LearningGPTNLP
0 likes · 30 min read
Evolution of Language Models and an Overview of the GPT Series
JD Tech
JD Tech
Sep 19, 2023 · Artificial Intelligence

Leveraging GPT for Code Writing, Script Generation, and Business Integration

This article demonstrates how to use GPT to accelerate code development, automate script creation, and integrate AI-driven suggestions into real‑world business scenarios such as e‑commerce promotion, providing practical tips, example prompts, and ready‑to‑run code snippets.

AI DevelopmentCode GenerationFlutter
0 likes · 10 min read
Leveraging GPT for Code Writing, Script Generation, and Business Integration
Sohu Tech Products
Sohu Tech Products
Aug 23, 2023 · Artificial Intelligence

Engineering GPT Applications: Capabilities, Limitations, and Solutions

The guide explains GPT’s core capabilities—natural language mastery, domain reasoning, and code generation—while detailing its limits such as prompt sensitivity, token caps, and lack of memory, then offers engineering workarounds like systematic prompting, chain‑of‑thought, external memory, tool integration, safety checks, and a six‑layer architecture for building robust commercial AI applications.

AI Application ArchitectureGPTPrompt engineering
0 likes · 20 min read
Engineering GPT Applications: Capabilities, Limitations, and Solutions
dbaplus Community
dbaplus Community
Aug 22, 2023 · Operations

Designing a Multi‑Cloud Intelligent Monitoring Platform at Huolala: Architecture, Practices, and Future Directions

This article details Huolala's one‑stop monitoring platform called Monitor, covering its multi‑cloud architecture, data collection pipelines, real‑time business monitoring, unified alarm handling, and future AI‑driven enhancements, while sharing concrete metrics, incident case studies, and practical implementation steps for large‑scale observability.

GPTObservabilityOperations
0 likes · 19 min read
Designing a Multi‑Cloud Intelligent Monitoring Platform at Huolala: Architecture, Practices, and Future Directions
Model Perspective
Model Perspective
Jul 31, 2023 · Artificial Intelligence

From RNN to ChatGPT: How AIGC Evolved with Transformers and Large Models

This article traces the evolution of AI‑generated content (AIGC) from early RNN‑based Seq2Seq models through the transformative impact of the Transformer architecture, covering key milestones such as UniLM, T5, BART, the GPT series, InstructGPT, and the emergence of ChatGPT.

AI content generationAIGCGPT
0 likes · 9 min read
From RNN to ChatGPT: How AIGC Evolved with Transformers and Large Models
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 30, 2023 · Artificial Intelligence

ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)

This article reviews the evolution of the GPT family from GPT‑1 to GPT‑3, comparing encoder‑decoder architectures, explaining the shift from supervised fine‑tuning to zero‑shot and few‑shot learning, and highlighting the architectural and training innovations that enabled large‑scale language models.

Fine-tuningGPTLLM
0 likes · 13 min read
ChatGPT Technical Analysis Series – Part 2: GPT1, GPT2, and GPT3 (Encoder vs Decoder, Zero‑Shot, and Scaling)
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
Jul 29, 2023 · Artificial Intelligence

Getting Started with GPT: How Generative Pre‑Training and Discriminative Fine‑Tuning Work

This article explains GPT's two‑stage learning—unsupervised generative pre‑training on large raw corpora followed by discriminative fine‑tuning on labeled tasks—detailing the underlying Transformer decoder architecture, loss functions, and task‑specific input transformations.

Fine-tuningGPTGenerative Pre‑Training
0 likes · 5 min read
Getting Started with GPT: How Generative Pre‑Training and Discriminative Fine‑Tuning Work
ByteFE
ByteFE
Jun 28, 2023 · Frontend Development

How GPT is Transforming Frontend Development and UI Interaction

The article examines the rapid rise of GPT models, their technical capabilities and limitations, and how their integration is reshaping software interaction from command‑line to GUI‑plus‑Language UI, offering frontend engineers new opportunities, practical examples, and guidance on leveraging large‑model AI in product design.

AI integrationGPTLLM
0 likes · 21 min read
How GPT is Transforming Frontend Development and UI Interaction
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 11, 2023 · Artificial Intelligence

Comprehensive Technical Overview of GPT Series, Transformers, and Emerging Capabilities in Large Language Models

This article provides a detailed technical review of the evolution of GPT models, the Transformer architecture, large language model training methods, emergent abilities such as in‑context learning and chain‑of‑thought, multimodal extensions, and the challenges of data, scaling, and alignment, offering a holistic view for researchers and practitioners.

AIGPTInstructGPT
0 likes · 28 min read
Comprehensive Technical Overview of GPT Series, Transformers, and Emerging Capabilities in Large Language Models
Tencent Cloud Developer
Tencent Cloud Developer
Jun 7, 2023 · Artificial Intelligence

Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain

The article explains prompt‑engineering fundamentals—definitions, instruction, context, and output formatting—and showcases tricks like few‑shot, chain‑of‑thought, and ReAct, then demonstrates testing with OpenAI APIs, token management, LangChain integration, and low‑code applications such as AI‑generated SQL, API gateways, DSL‑driven UI, chart creation, and vector‑based semantic search.

AIGPTKnowledge Generation
0 likes · 30 min read
Prompt Engineering Techniques and Their Application in Low‑Code Development with GPT and LangChain
DataFunTalk
DataFunTalk
May 31, 2023 · Artificial Intelligence

Why GPT Can Exhibit Intelligence Through Next‑Token Prediction: A Comprehensive Exploration of Compression, Knowledge Circuits, and Model Scaling

This article examines the debate over whether large language models truly possess intelligence, arguing that next‑token prediction functions as a form of lossless data compression whose efficiency reflects intelligence, and it surveys research on knowledge extraction, neuron semantics, circuit competition, scaling effects, and the broader philosophical implications of GPT as a mirror of the world’s parameters.

GPTModel ScalingNext Token Prediction
0 likes · 59 min read
Why GPT Can Exhibit Intelligence Through Next‑Token Prediction: A Comprehensive Exploration of Compression, Knowledge Circuits, and Model Scaling
DataFunSummit
DataFunSummit
May 26, 2023 · Artificial Intelligence

Assessing GPT's Potential to Replace Chinese Jobs: A Data‑Driven Study

Using eight years of Chinese recruitment data and GPT‑based labeling, the study quantifies how likely various occupations are to be automated, revealing that high‑growth, knowledge‑intensive roles such as translation, design, and programming face the greatest replacement risk while blue‑collar manufacturing jobs are least vulnerable.

AIGPTjob automation
0 likes · 12 min read
Assessing GPT's Potential to Replace Chinese Jobs: A Data‑Driven Study
Programmer DD
Programmer DD
May 16, 2023 · Artificial Intelligence

Inside OpenAI: How the Platform Democratizes Generative AI

Since its 2015 founding, OpenAI has built a suite of generative AI models—including GPT, DALL‑E, and Whisper—exposed via simple REST APIs, enabling developers to integrate advanced language, vision, and speech capabilities without deep ML expertise, while offering fine‑tuning, SDKs, and Azure integration.

APIDALL·EGPT
0 likes · 9 min read
Inside OpenAI: How the Platform Democratizes Generative AI
21CTO
21CTO
Apr 28, 2023 · Artificial Intelligence

Which Chinese Jobs Are Most Vulnerable to AI? A Data‑Driven Ranking

Using eight years of Chinese recruitment data mapped to O*net, the study breaks down occupations into thousands of tasks, lets GPT score each for automation potential, and reveals the top and bottom jobs by AI replacement risk along with the economic factors that drive these trends.

AIChinaGPT
0 likes · 13 min read
Which Chinese Jobs Are Most Vulnerable to AI? A Data‑Driven Ranking
Architect
Architect
Apr 27, 2023 · Artificial Intelligence

Survey of Large Language Model Research: From GPT‑1 to ChatGPT and Open‑Source Alternatives

This article provides a comprehensive overview of the development of large language models, reviewing classic papers from GPT‑1 through GPT‑4, discussing open‑source implementations such as LLaMA, Alpaca, GLM, and ChatGLM, and analyzing training methods, datasets, and future research directions.

AI researchGPTlarge language models
0 likes · 36 min read
Survey of Large Language Model Research: From GPT‑1 to ChatGPT and Open‑Source Alternatives
Nightwalker Tech
Nightwalker Tech
Apr 26, 2023 · Artificial Intelligence

Understanding GPT: Meaning, Evolution, and Training Process

This article explains what GPT (Generative Pre‑trained Transformer) is, traces its development from early neural networks to the latest GPT‑4 models, and details the three‑stage training pipeline of unsupervised learning, supervised fine‑tuning, and reinforcement learning with human feedback.

GPTTransformer
0 likes · 15 min read
Understanding GPT: Meaning, Evolution, and Training Process
21CTO
21CTO
Apr 15, 2023 · Artificial Intelligence

Will GPT Replace Programmers? Strategies to Thrive in the AI Era

Despite sensational claims that GPT will eliminate programmers, this analysis argues that AI will reshape rather than eradicate the profession, emphasizing the need for developers to adopt full‑stack and low‑code skills, diversify into business and management roles, and adapt to evolving market demands.

AIGPTcareer
0 likes · 18 min read
Will GPT Replace Programmers? Strategies to Thrive in the AI Era
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 14, 2023 · Artificial Intelligence

Why Large Models Are Revolutionizing AI: From Foundations to AIGC

This article explores the concept and evolution of large foundation models, their transformative impact on AI-generated content, the underlying technologies such as transformers, diffusion, and CLIP, and discusses the challenges, emerging abilities, and future prospects of these models across multiple modalities.

AIGCGPTdiffusion
0 likes · 32 min read
Why Large Models Are Revolutionizing AI: From Foundations to AIGC
Top Architect
Top Architect
Apr 12, 2023 · Artificial Intelligence

Data‑Centric AI Perspective on GPT Models: Training, Inference, and Maintenance

This article examines how large language models such as GPT‑1 through GPT‑4 succeed largely due to high‑quality, large‑scale training data, and explains the Data‑centric AI framework—training data development, inference data development, and data maintenance—while discussing prompt engineering, data‑driven improvements, and future trends in AI.

AIData‑Centric AIGPT
0 likes · 19 min read
Data‑Centric AI Perspective on GPT Models: Training, Inference, and Maintenance
21CTO
21CTO
Mar 31, 2023 · Artificial Intelligence

From Student to AI Pioneer: Ilya Sutskever’s Journey Behind ChatGPT

This article chronicles Ilya Sutskever’s two‑decade rise from a young researcher to a leading figure in artificial intelligence, highlighting his early mentorship, breakthroughs in image recognition, language translation, the founding of OpenAI, and the development of GPT and DALL‑E models.

AI researchDeep LearningGPT
0 likes · 13 min read
From Student to AI Pioneer: Ilya Sutskever’s Journey Behind ChatGPT
Top Architect
Top Architect
Mar 10, 2023 · Artificial Intelligence

Understanding InstructGPT and ChatGPT: Architecture, Training Pipeline, and Performance Analysis

This article provides a comprehensive overview of the GPT series, explains the differences between prompt learning and instruction learning, details the three‑stage training pipeline of InstructGPT/ChatGPT—including supervised fine‑tuning, reward‑model training, and PPO‑based reinforcement learning—examines their strengths, weaknesses, and future research directions, and discusses the broader impact of these models on AI development.

AIChatGPTGPT
0 likes · 22 min read
Understanding InstructGPT and ChatGPT: Architecture, Training Pipeline, and Performance Analysis
IT Architects Alliance
IT Architects Alliance
Feb 23, 2023 · Artificial Intelligence

Training a Positive Review Generator with RLHF and PPO

This article demonstrates how to use Reinforcement Learning from Human Feedback (RLHF) with a PPO algorithm and a sentiment‑analysis model to train a language model that generates positive product reviews, covering task definition, data sampling, reward evaluation, model optimization, and experimental results.

GPTLanguage ModelPPO
0 likes · 11 min read
Training a Positive Review Generator with RLHF and PPO
DataFunSummit
DataFunSummit
Feb 8, 2023 · Artificial Intelligence

Technical Architecture and Training Process of ChatGPT

ChatGPT, a dialogue-focused language model, builds on the GPT family and employs techniques such as Reinforcement Learning from Human Feedback (RLHF), the TAMER framework, and a three-stage training pipeline (supervised fine‑tuning, reward modeling, and PPO reinforcement learning) to achieve advanced conversational capabilities.

ChatGPTGPTLanguage Model
0 likes · 7 min read
Technical Architecture and Training Process of ChatGPT
DataFunSummit
DataFunSummit
Jan 15, 2023 · Artificial Intelligence

Intelligent Writing: AIGC Technologies, Models, Evaluation Metrics, and Real‑World Applications

This article surveys the evolution of AI‑generated content for intelligent writing, covering its definition, key technologies from RNN Seq2Seq to Transformer‑based models such as UniLM, T5, BART and GPT series, evaluation datasets and metrics, product deployments by Datagrand, and the remaining challenges and future directions.

AI writingAIGCGPT
0 likes · 25 min read
Intelligent Writing: AIGC Technologies, Models, Evaluation Metrics, and Real‑World Applications
DataFunSummit
DataFunSummit
Dec 28, 2022 · Artificial Intelligence

OpenAI’s Language Model Evolution Toward AGI

This article traces OpenAI’s progression from GPT‑1 through GPT‑3, Codex, InstructGPT, and ChatGPT, highlighting how increasing model scale, prompt‑based task integration, and human‑feedback alignment have driven the evolution toward more capable, generalizable language intelligence aimed at achieving artificial general intelligence.

AGIAIChatGPT
0 likes · 15 min read
OpenAI’s Language Model Evolution Toward AGI
Liangxu Linux
Liangxu Linux
Jan 10, 2022 · Fundamentals

Master Linux Disk Partitioning with fdisk and parted

This guide explains the differences between MBR and GPT, shows how to list, create, modify, and delete partitions using fdisk for smaller disks and parted for larger than 2 TB, and provides practical command examples and safety tips.

GPTMBRcommand-line
0 likes · 5 min read
Master Linux Disk Partitioning with fdisk and parted
Open Source Linux
Open Source Linux
Mar 8, 2020 · Fundamentals

Why UEFI Is Replacing BIOS: Speed, Security, and Flexibility Explained

This article explains how UEFI, the Unified Extensible Firmware Interface, supersedes traditional BIOS by offering faster boot times, enhanced security through separate boot partitions and Secure Boot, graphical interfaces, greater storage capacity, and more flexible boot options for modern computers.

BIOSGPTSecure Boot
0 likes · 6 min read
Why UEFI Is Replacing BIOS: Speed, Security, and Flexibility Explained
Tencent Cloud Developer
Tencent Cloud Developer
Jan 20, 2020 · Cloud Computing

Linux Cloud Server Storage Expansion: Methods and Step‑by‑Step Guide

The guide shows how to enlarge storage on a Tencent Cloud Linux VM by adding and expanding CBS cloud disks using three approaches—direct bare‑device formatting, MBR partitioning for disks under 2 TB, and GPT partitioning for larger disks—detailing step‑by‑step commands, online vs offline resizing, and best‑practice recommendations.

CBSDisk ExpansionFilesystem
0 likes · 32 min read
Linux Cloud Server Storage Expansion: Methods and Step‑by‑Step Guide
Alibaba Cloud Developer
Alibaba Cloud Developer
Jun 5, 2019 · Artificial Intelligence

Tracing the Evolution of Language Models: From N‑grams to GPT‑2

This article reviews the historical development of natural language processing language models, covering expert rule‑based systems, statistical n‑grams, smoothing techniques, neural network models such as NNLM, RNN, word2vec, GloVe, ELMo, and the transformer‑based breakthroughs of GPT, BERT and GPT‑2, and summarizes their impact on modern NLP tasks.

BERTDeep LearningGPT
0 likes · 25 min read
Tracing the Evolution of Language Models: From N‑grams to GPT‑2
Hulu Beijing
Hulu Beijing
Apr 4, 2019 · Artificial Intelligence

How BERT, GPT, and ELMo Revolutionize Language Feature Representation

Natural language processing, a cornerstone of AI, relies on language models to capture linguistic features; this article reviews classic pre‑training models—ELMo, GPT, and BERT—explaining their architectures, training objectives, and how they boost downstream NLP tasks despite data‑scarcity challenges.

BERTDeep LearningELMo
0 likes · 10 min read
How BERT, GPT, and ELMo Revolutionize Language Feature Representation