Tag: GPT-3


DataFunTalk
Aug 23, 2023 · Artificial Intelligence

Evaluating Large Language Model Item Encoders for Textual Collaborative Filtering in Recommendation Systems

This article investigates whether replacing traditional ID-based item encoders with massive LLMs such as GPT‑3 improves recommendation performance. It conducts extensive experiments on three real‑world datasets, analyzes the performance ceiling and the generality of the learned item representations, and compares against ID‑based and prompt‑based methods.

AI · GPT-3 · LLM
0 likes · 15 min read
Architect
Apr 14, 2023 · Artificial Intelligence

Overview of Prominent Large Language Models and Instruction Fine‑Tuning Techniques

The article surveys major large language models—including GPT‑3, T5, LaMDA, Jurassic‑1, MT‑NLG, Gopher, Chinchilla, PaLM, U‑PaLM, OPT, LLaMA, BLOOM, GLM‑130B, and ERNIE 3.0 Titan—explaining their architectures and scaling trade‑offs, then details instruction‑fine‑tuned variants such as T0, FLAN, GPT‑3.5, ChatGPT, GPT‑4, Alpaca, and ChatGLM, providing references for further study.

AI · ChatGPT · GPT-3
0 likes · 27 min read
360 Tech Engineering
Mar 17, 2023 · Artificial Intelligence

Understanding ChatGPT: OpenAI’s Development, Model Evolution, and Training Techniques

This article provides an overview of ChatGPT’s rapid rise, OpenAI’s founding, the evolution of GPT models up to GPT‑3, the data‑driven training process, model capabilities and limitations, and practical guidance for users, highlighting the interplay between open‑source research and commercial deployment.

ChatGPT · GPT-3 · Machine Learning
0 likes · 14 min read
DataFunSummit
Feb 19, 2023 · Artificial Intelligence

Understanding In-Context Learning in Large Language Models: Experiments, Analysis, and Theoretical Insights

This article explains the concept of in‑context learning in large language models, presents experimental evaluations on copy‑output, date‑formatting, and label‑remapping tasks, and discusses a recent theoretical analysis that links attention layers to implicit gradient‑based fine‑tuning, explaining why model scale and data volume matter.

GPT-3 · In-Context Learning · Machine Learning
0 likes · 15 min read