Tagged articles
648 articles
NetEase Cloud Music Tech Team
Jun 2, 2023 · Frontend Development

How NetEase Cloud Music’s Front‑End Team Built an AI‑Powered Low‑Code Copilot

NetEase Cloud Music’s front‑end team integrated large language models into their internal low‑code platform, creating an AI Copilot that supports smart page creation, editing, component configuration, code snippet generation, and Q&A, while detailing the underlying architecture, prompt engineering, and mixed‑mode development workflow.

AI Copilot · Mixed Development · Prompt engineering
11 min read
Baidu Tech Salon
May 29, 2023 · Artificial Intelligence

Baidu CTO Wang Haifeng Highlights Wenxin Yiyan Large Language Model at Zhongguancun Forum

At the Zhongguancun Forum, Baidu CTO Wang Haifeng showcased the self‑developed Wenxin Yiyan large language model—demonstrating its knowledge‑enhanced Q&A, writing, poetry, video generation and reasoning abilities, its integration as an intelligent office assistant, and its role in driving a model‑as‑a‑service ecosystem that fuels China’s AI‑led industrial transformation.

AI policy · Baidu · Wenxin Yiyan
7 min read
JD Tech
May 23, 2023 · Artificial Intelligence

Understanding ChatGPT: Principles, Limitations, and a Five‑Layer Application Guide

This article explains the fundamentals of GPT models, contrasts large models with traditional AI, details ChatGPT's architecture and token processing, outlines its limitations, and presents a five‑layer framework for applying ChatGPT across chat, language, text, reasoning, and private model use cases.

AI · ChatGPT · application
21 min read
JD Retail Technology
May 18, 2023 · Artificial Intelligence

Local Deployment, Inference, and Fine‑tuning of the Vicuna‑7B Large Language Model

This article details the step‑by‑step process of preparing the environment, merging weights, installing dependencies, running inference, evaluating Vicuna‑7B against other models, and attempting fine‑tuning, while highlighting performance results, encountered issues, and future work for large language model deployment.

Fine-tuning · GPU · Inference
11 min read
Full-Stack Trendsetter
May 18, 2023 · Artificial Intelligence

How 360 and ChatGLM Are Building China’s “Microsoft + OpenAI” Large‑Model Duo

On May 16, 360 and Zhipu AI announced a strategic partnership to co‑develop the trillion‑parameter models 360GLM and 360GPT, positioning them as China’s answer to Microsoft‑OpenAI by combining large‑scale pre‑training, bilingual capabilities, and integration with 360’s search and browser ecosystem.

360 · AI Collaboration · ChatGLM
7 min read
Baidu Geek Talk
May 10, 2023 · Artificial Intelligence

Baidu's AI Infrastructure for Large-Scale LLM Training: Architecture, Challenges, and Optimization

Baidu’s AI infrastructure combines a massive InfiniBand‑linked GPU cluster, Kunlun chips, the PaddlePaddle framework, and the Wenxin model suite with 4D hybrid parallelism, elastic fault tolerance, and a two‑stage training pipeline to overcome computation, memory, and communication walls, delivering world‑leading MLPerf performance for large‑scale LLMs.

GPU cluster · InfiniBand · Model Training Optimization
15 min read
Baidu Tech Salon
May 9, 2023 · Artificial Intelligence

How Baidu’s Award‑Winning Dialogue Tech Powers China’s AI Surge

The article examines Baidu’s groundbreaking knowledge‑deep learning dialogue system that earned the 2022 Wu Wenjun AI Science and Technology Award, detailing its technical breakthroughs, patent portfolio, large‑scale deployments, and how it underpins China’s rapid advancement in large language models and AI industry integration.

AI · Baidu · Dialogue Technology
12 min read
Rare Earth Juejin Tech Community
May 8, 2023 · Artificial Intelligence

Understanding the Principles Behind ChatGPT: NLP, Transformers, and Reinforcement Learning

This article explains how ChatGPT works by covering the fundamentals of natural language processing, generative language models, deep learning, the Transformer architecture, attention mechanisms, few‑shot learning, and the reinforcement‑learning techniques that align its outputs with human preferences.

AI · ChatGPT · NLP
24 min read
Rare Earth Juejin Tech Community
Apr 28, 2023 · Artificial Intelligence

Exploring Alibaba’s Tongyi Qianwen AI Model, SWOT, Recipe Demo, and Code Samples for Spark Same‑Period Analysis and Java Bubble Sort

The article reviews Alibaba’s Tongyi Qianwen large‑language model, shares a cooking recipe generated by the AI, presents a SWOT analysis, and provides code examples—including a Spark Scala script for same‑period month‑over‑month calculations and a Java bubble‑sort implementation.

AI · Java · SWOT
12 min read
21CTO
Apr 24, 2023 · Artificial Intelligence

Inside MOSS 003: Fudan University's Open-Source Large Language Model

This article details the evolution of Fudan University's open‑source MOSS series—from the early OpenChat 001 prototype to the current MOSS 003—covering data collection, multilingual capabilities, plugin architecture, model releases on HuggingFace, and how developers can start using the models.

AI · Chinese NLP · MOSS
10 min read
Architect
Apr 24, 2023 · Artificial Intelligence

MOSS 003: Open‑Source Large Language Model Development, Training Data, and Plugin‑Enabled Deployment

The article details the evolution of the open‑source MOSS series—from OpenChat 001 to MOSS 003—covering data collection, fine‑tuning procedures, multilingual capabilities, plugin architecture, example code for inference, and upcoming releases, providing a comprehensive technical overview for AI practitioners.

AI · MOSS · Plugins
11 min read
Programmer DD
Apr 18, 2023 · Artificial Intelligence

Can OpenAssistant Rival ChatGPT? Inside the Largest Open‑Source AI Assistant

This article examines OpenAssistant, the world’s largest open‑source ChatGPT replica, detailing its dataset of over 160 k annotated conversations, the fine‑tuned LLaMA and Pythia models, evaluation results against GPT‑3.5‑turbo, practical usage examples, and the project's current limitations and future directions.

AI dataset · ChatGPT alternative · OpenAssistant
11 min read
Python Programming Learning Circle
Apr 3, 2023 · Artificial Intelligence

Key Highlights of GPT‑4: Multimodal Capabilities, Benchmark Performance, and Future Implications

GPT‑4, the new multimodal AI model, can process images and text, generate code and natural language, achieve human‑level scores on standardized exams, handle up to 32 K tokens, and demonstrate advanced reasoning, while OpenAI emphasizes its safety improvements and current limitations as a still‑emerging technology.

AI Safety · GPT-4 · Multimodal AI
6 min read
ITPUB
Mar 22, 2023 · Artificial Intelligence

What Can GPT‑4 Do? Vision, Long Memory, Safer AI and More

OpenAI’s GPT‑4 arrives with multimodal vision, a dramatically longer context window, higher exam scores, Socratic prompting, improved safety, and new partnerships, while still in research mode and subject to bias and code‑trust limitations.

AI Safety · GPT-4 · large language model
7 min read
Programmer DD
Mar 22, 2023 · Artificial Intelligence

How Baidu’s Ernie Bot Stacks Up Against GPT‑4: A Deep Dive

The article reviews Baidu’s newly launched Ernie Bot, a multimodal large language model, comparing its literary, business, mathematical, Chinese comprehension, and multimodal abilities with GPT‑4, while detailing the underlying technologies, knowledge‑enhancement techniques, and deployment strategy behind the model.

AI comparison · Baidu · Ernie Bot
10 min read
21CTO
Mar 17, 2023 · Artificial Intelligence

Exploring OpenChatKit: The Open-Source Alternative to ChatGPT

OpenChatKit, released by Together Computer in March 2023, is an open‑source ChatGPT‑like large language model built on GPT‑NeoX‑20B, offering developers fine‑tuned conversational abilities, a modular architecture, retrieval system, and content‑filtering, while also outlining its current limitations and future potential.

AI · ChatGPT alternative · OpenChatKit
7 min read
Architecture Digest
Mar 17, 2023 · Artificial Intelligence

Baidu’s Ernie Bot (Wenxin Yiyan) vs GPT‑4: Capabilities, Technical Foundations, and Market Reaction

The article reviews Baidu's launch of the multimodal large language model Wenxin Yiyan, compares its literary, business, mathematical, Chinese‑understanding and multimodal abilities with GPT‑4, explains the six underlying core technologies and hardware stack, and reports the mixed market and netizen response.

AI · Baidu · Ernie Bot
11 min read
21CTO
Mar 15, 2023 · Artificial Intelligence

What Makes OpenAI’s New GPT‑4 a Game‑Changer for Multimodal AI?

OpenAI’s GPT‑4, a multimodal large language model that accepts text and image inputs, powers ChatGPT and Bing, offers improved creativity and problem‑solving while still facing hallucination risks, and is now available via ChatGPT Plus and an open API for developers.

AI Safety · GPT-4 · Multimodal AI
5 min read
DataFunSummit
Mar 15, 2023 · Artificial Intelligence

Key Features and Capabilities of OpenAI's GPT‑4

OpenAI's GPT‑4, a large multimodal language model, expands token limits, adds image understanding, demonstrates strong reasoning on professional exams, supports many languages, and is already integrated into Microsoft Bing, while offering various access options and improved safety compared to its predecessor.

AI · GPT-4 · Microsoft Bing
9 min read
Tencent Advertising Technology
Mar 2, 2023 · Artificial Intelligence

Tencent's HunYuan‑NLP 1T Large‑Scale AI Model: Training Techniques, Optimization, and Real‑World Applications

This article details Tencent's development of the 1‑trillion‑parameter HunYuan‑NLP model, covering its MoE architecture, cost‑effective pre‑training strategies, distributed training framework, model compression toolkit, and successful deployment across advertising, gaming, and other Tencent services.

AI Infrastructure · Mixture of Experts · large language model
17 min read
DataFunSummit
Feb 26, 2023 · Artificial Intelligence

Fudan University's MOSS: China's First Conversational Large Language Model

Fudan University's Natural Language Processing Lab introduced MOSS, the country's first conversational large language model capable of dialogue generation, programming, factual QA and ethical reasoning, with plans for open‑source release despite current limitations in Chinese language proficiency.

AI · Fudan University · MOSS
3 min read
DataFunSummit
Feb 24, 2023 · Artificial Intelligence

Baidu PLATO Open‑Domain Dialogue Model: Technology, Challenges, and Applications

The article presents Baidu's PLATO open‑domain dialogue system, detailing its evolution from expert‑rule to retrieval‑based and large‑scale generative models, describing its hidden‑variable architecture and major research challenges such as persona stability, long‑term memory, and knowledge accuracy, and showcasing real‑world applications and Q&A from a DataFunSummit2022 livestream.

AI · Knowledge Retrieval · Long-term Memory
25 min read
Programmer DD
Feb 21, 2023 · Artificial Intelligence

Meet MOSS: China’s Homegrown ChatGPT Rival and Its Capabilities

MOSS, a Chinese large‑language model released by Fudan University, offers ChatGPT‑like functions such as text generation, summarization, translation, and code writing, while being open‑source and free during preview, yet it still lags behind due to limited data, compute, and model size.

AI · ChatGPT · Fudan University
11 min read
DataFunTalk
Feb 20, 2023 · Artificial Intelligence

ChatGPT Technology, Localization Efforts, and Open‑Source Large Models – Overview and Practices

This article presents an overview of ChatGPT technology, its evolution, current challenges, a three‑stage learning process, data organization and evaluation, details of domestic localization efforts, practical solutions, and the release of a Chinese open‑source large model with training guidance.

ChatGPT · Model Localization · data annotation
12 min read
DataFunTalk
Feb 19, 2023 · Artificial Intelligence

How ChatGPT Works: An In‑Depth Explanation by Stephen Wolfram

This article provides a comprehensive, step‑by‑step explanation of how ChatGPT generates text, covering token probabilities, n‑gram models, embeddings, attention mechanisms, and the Transformer architecture, while illustrating concepts with Wolfram‑language examples and visualizations.

AI · ChatGPT · Neural Network
20 min read
Tencent Cloud Developer
Feb 14, 2023 · Artificial Intelligence

ChatGPT: Technology, Impact, and Future Perspectives

Since its November 2022 launch, OpenAI’s ChatGPT—built on Transformer‑based generative AI—has surged to over 100 million users, demonstrated capabilities from MBA exams to software‑engineer interviews, sparked a multibillion‑dollar market with paid subscriptions and Microsoft investment, spurred rival models like Claude, and is reshaping human‑computer interaction while raising ethical concerns and promising multimodal, industry‑specific future applications.

ChatGPT · Transformer · generative AI
15 min read
21CTO
Feb 7, 2023 · Artificial Intelligence

Google’s Bard vs ChatGPT: Inside the New AI Chatbot and Its LaMDA Roots

Google unveiled its new conversational AI, Bard, built on the LaMDA model, positioning it as a direct competitor to ChatGPT; the article details its public testing, technical foundations, feature set, and key differences such as real‑time web integration and cost‑free access.

AI chatbot · ChatGPT · Google AI
11 min read
Architect
Dec 20, 2022 · Artificial Intelligence

Understanding ChatGPT: Architecture, Training Process, Features, and Applications

An in‑depth overview of ChatGPT covering its conversational model nature, core technologies such as InstructGPT, large language model capabilities, RLHF training pipeline, strengths, limitations, safety mechanisms, and potential applications across content creation, search, and multimodal integration.

Applications · ChatGPT · RLHF
19 min read
Architecture Digest
Dec 15, 2022 · Artificial Intelligence

Technical Overview of ChatGPT: Training Pipeline, RLHF, and Its Potential to Replace Search Engines

This article explains ChatGPT's underlying technology—including its three‑stage training pipeline with supervised fine‑tuning, reward‑model learning, and reinforcement learning from human feedback—while analyzing whether the model can realistically replace traditional search engines such as Google or Baidu.

AI · ChatGPT · RLHF
15 min read
IT Architects Alliance
Dec 13, 2022 · Artificial Intelligence

Technical Principles and Training Process of ChatGPT

The article explains ChatGPT’s underlying technology, detailing its three-stage training pipeline—supervised fine‑tuning, reward‑model learning, and reinforcement learning with PPO—while discussing its strengths, limitations, and potential integration with traditional search engines.

AI · ChatGPT · LLM
14 min read
Tencent Cloud Developer
Dec 9, 2022 · Artificial Intelligence

An Overview of ChatGPT: Technology, Training Process, and Applications

The article outlines ChatGPT’s conversational capabilities, its InstructGPT‑based architecture, a three‑stage RLHF training pipeline involving supervised fine‑tuning, human‑ranked response generation, and PPO optimization, and discusses its strengths, limitations, diverse applications, and future directions for multimodal, up‑to‑date assistants.

AI applications · ChatGPT · PPO
18 min read
Architect's Guide
Dec 9, 2022 · Artificial Intelligence

Technical Principles and Training Process of ChatGPT

The article explains how ChatGPT builds on the GPT‑3.5 large language model, using human‑annotated data and Reinforcement Learning from Human Feedback (RLHF) across three training stages to improve instruction understanding, answer quality, and continual model enhancement, while also discussing its potential to complement or replace traditional search engines.

AI · ChatGPT · Instruction Tuning
15 min read
IT Architects Alliance
Dec 8, 2022 · Artificial Intelligence

Technical Principles and Training Process of ChatGPT

This article explains the technical foundations of ChatGPT, detailing its three-stage training pipeline—supervised fine‑tuning with human‑annotated data, reward model training via pairwise ranking, and reinforcement learning from human feedback—while also discussing its limitations compared to traditional search engines and potential future enhancements.

AI · ChatGPT · RLHF
14 min read
Top Architect
Dec 7, 2022 · Artificial Intelligence

Technical Principles of ChatGPT and Its Prospects for Replacing Traditional Search Engines

The article explains how ChatGPT builds on GPT‑3.5 with supervised fine‑tuning, reward‑model training and reinforcement learning from human feedback, analyzes why it cannot yet replace search engines due to hallucinations, knowledge freshness and cost, and proposes a hybrid architecture that combines LLM generation with traditional retrieval to overcome these limitations.

AI · ChatGPT · RLHF
16 min read
DataFunTalk
Oct 7, 2022 · Artificial Intelligence

Overview of Baidu's PLATO Open‑Domain Dialogue Technology, Challenges, and Applications

This article introduces Baidu's PLATO open‑domain dialogue technology, explains the evolution from rule‑based to retrieval‑based and large‑scale generative models, discusses major challenges such as persona stability, long‑term memory, knowledge accuracy, and proactive conversation, and showcases real‑world applications and Q&A insights.

AI Challenges · Chatbot Applications · Open-domain Dialogue
23 min read
DataFunTalk
Jun 30, 2022 · Artificial Intelligence

OBERT: A Billion‑Parameter Pretrained Language Model for Large‑Scale NLP Applications

The OPPO XiaoBu team introduced OBERT, a series of 100M‑, 300M‑, and 1B‑parameter pretrained language models that leverage massive TB‑scale corpora, multi‑granular masking, retrieval‑augmented training, and distributed acceleration to achieve state‑of‑the‑art results on CLUE and KgCLUE benchmarks while enabling efficient industrial deployment.

Fine-tuning · Knowledge augmentation · NLP
12 min read
21CTO
Jun 14, 2022 · Artificial Intelligence

Does Google’s LaMDA Really Possess Sentience? A Deep Dive into the Debate

The article examines the controversy surrounding Google’s LaMDA chatbot, detailing engineer Blake Lemoine’s claims of sentience, his suspension, the model’s technical specs, contrasting expert opinions from figures like Andrej Karpathy and Gary Marcus, and ultimately argues that LaMDA’s apparent emotions are a projection rather than true consciousness.

AI ethics · Google · LaMDA
8 min read
Tencent Tech
Apr 29, 2022 · Artificial Intelligence

Tencent’s Hunyuan AI Model Tops CLUE Leaderboard with Record Score

Tencent’s Hunyuan AI large model shattered records by scoring 80.888 to claim first place on the CLUE benchmark, showcasing its advanced natural language processing, multimodal abilities, curriculum‑learning training approach, and real‑world deployments in WeChat Search and advertising.

AI · CLUE · Hunyuan
3 min read
DataFunTalk
Sep 22, 2021 · Artificial Intelligence

Baidu Unveils PLATO-XL: An 11‑Billion‑Parameter Bilingual Dialogue Generation Model

Baidu's newly released PLATO‑XL, an 11‑billion‑parameter bilingual pre‑trained dialogue model, surpasses previous large‑scale models, introduces multi‑role awareness for consistent multi‑turn conversations, and demonstrates state‑of‑the‑art performance across open‑domain, knowledge‑grounded, and task‑oriented dialogue tasks.

PLATO-XL · bilingual AI · dialogue generation
9 min read
DataFunTalk
Jul 8, 2021 · Artificial Intelligence

Baidu ERNIE 3.0: Knowledge‑Enhanced 10B‑Parameter Model Sets New Chinese NLP Benchmarks and Tops SuperGLUE

Baidu's ERNIE 3.0 introduces a 10‑billion‑parameter, knowledge‑graph‑augmented language model that sets new records on 54 Chinese NLP benchmarks, surpasses human performance on SuperGLUE, and demonstrates strong generation and zero‑shot capabilities, now available for public demo and research.

Baidu · ERNIE 3.0 · Knowledge Graph
7 min read