Tagged articles
11 articles
AI Explorer
Apr 4, 2026 · Artificial Intelligence

Can GPT-3-Powered Robots Achieve 99% Success? Inside Sia’s GEN-1 Breakthrough

Sia’s GEN-1 robot, powered by a GPT-3-style large language model, reportedly raises its task-success rate from 64% to 99%, signaling a shift from simple perception-execution to cognitive decision-making. The article scrutinizes how success is defined, along with cost, safety, and industry impact.

AI integration · GPT-3 · Reliability
6 min read
Infra Learning Club
Oct 30, 2024 · Artificial Intelligence

How GPT-3 Evolved: From Transformer Roots to Massive Language Models

The article traces the development of the GPT series, from the 2017 Transformer breakthrough through GPT‑1, GPT‑2, and GPT‑3’s 175 billion parameters to later models such as Codex and ChatGPT, highlighting key papers, architectural choices, and the surprising role of OpenAI’s decoder‑only approach.

GPT-3 · Google · Language Model
4 min read
DataFunTalk
Aug 23, 2023 · Artificial Intelligence

Evaluating Large Language Model Item Encoders for Textual Collaborative Filtering in Recommendation Systems

This article investigates whether replacing traditional ID-based item encoders with massive LLMs such as GPT‑3 improves recommendation performance. It reports extensive experiments on three real‑world datasets, analyzes performance limits and the generality of item representations, and compares the approach against ID‑based and prompt‑based methods.

GPT-3 · LLM · Textual Collaborative Filtering
15 min read
Network Intelligence Research Center (NIRC)
May 22, 2023 · Artificial Intelligence

How Microsoft Leverages LLMs to Auto‑Generate Cloud Incident Root Causes and Fixes

Microsoft researchers fine‑tuned GPT‑3.x models with LoRA on over 40,000 cloud incident records, evaluated them with six NLP metrics and human interviews, and found that LLMs can generate root‑cause analyses and mitigation steps comparable to BERT models, especially for machine‑detected failures.

AI for operations · GPT-3 · LLM
8 min read
Full-Stack Trendsetter
May 15, 2023 · Artificial Intelligence

Do You Really Understand ChatGPT, the Era‑Defining AI?

This article explains what ChatGPT is and how it builds on natural language processing and the Transformer-based GPT series, details its model-size growth, architectural enhancements, and multilingual support, and walks through the tokenization-to-generation pipeline that enables coherent AI-driven conversations.

ChatGPT · Deep Learning · GPT-3
8 min read
Architect
Apr 14, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

The article surveys major large language models—including GPT‑3, T5, LaMDA, Jurassic‑1, MT‑NLG, Gopher, Chinchilla, PaLM, U‑PaLM, OPT, LLaMA, BLOOM, GLM‑130B, and ERNIE 3.0 Titan—explains their architectures, scaling trade‑offs, and then details instruction‑fine‑tuned variants such as T0, FLAN, GPT‑3.5, ChatGPT, GPT‑4, Alpaca and ChatGLM, providing references for further study.

ChatGPT · GPT-3 · Instruction Tuning
27 min read
360 Tech Engineering
Mar 17, 2023 · Artificial Intelligence

Understanding ChatGPT: OpenAI’s Development, Model Evolution, and Training Techniques

This article provides an overview of ChatGPT’s rapid rise, OpenAI’s founding, the evolution of GPT models up to GPT‑3, the data‑driven training process, model capabilities and limitations, and practical guidance for users, highlighting the interplay between open‑source research and commercial deployment.

Artificial Intelligence · ChatGPT · GPT-3
14 min read
DataFunSummit
Feb 19, 2023 · Artificial Intelligence

Understanding In-Context Learning in Large Language Models: Experiments, Analysis, and Theoretical Insights

This article explains the concept of in‑context learning in large language models, presents experimental evaluations such as copy‑output, date‑formatting, and label‑remapping tasks, and discusses a recent theoretical analysis that links attention layers to implicit gradient‑based fine‑tuning, highlighting why model scale and data volume matter.

Attention Mechanism · Few‑Shot Learning · GPT-3
15 min read
JD Cloud Developers
Feb 5, 2021 · Artificial Intelligence

2020 NLP Milestones & Future Trends: Insights from JD’s AI Scientist

In an InfoQ interview, JD Technology senior algorithm scientist Wu Youzheng reviews the rapid advances of natural language processing in 2020—including GPT‑3, multimodal dialogue, knowledge‑enhanced pre‑training, and knowledge graphs—while outlining the most promising research directions and practical challenges for the coming year.

AI applications · GPT-3 · Multimodal
18 min read
ITPUB
Feb 3, 2021 · Artificial Intelligence

Can GPT‑3 Write Complex SQL Queries? A Hands‑On Exploration

This article demonstrates how GPT‑3 can translate plain‑English questions into SQL statements, starting with simple counts and progressing to multi‑table joins, while highlighting prompting techniques, limitations, and the impact of model variants such as GPT‑3 Instruct.

AI-generated code · GPT-3 · Prompt engineering
7 min read