Tagged articles

Large Language Model

737 articles · Page 7 of 8

May 20, 2024 · Artificial Intelligence

How RecGPT Leverages ChatGPT‑Style Prompt Tuning for Better Sequential Recommendation

RecGPT applies a ChatGPT‑like pre‑training and personalized prompt‑tuning paradigm to sequential recommendation, introducing a two‑stage recall mechanism that improves offline HR/NDCG metrics and yields modest online interaction gains in a real‑world short‑video platform.

Large Language ModelPrompt TuningRecGPT

0 likes · 8 min read

How RecGPT Leverages ChatGPT‑Style Prompt Tuning for Better Sequential Recommendation

360 Tech Engineering

May 17, 2024 · Artificial Intelligence

360VL: An Open‑Source Multimodal Large Language Model Based on Llama‑3‑70B

The article introduces 360VL, an open‑source multimodal large language model built on Llama‑3‑70B, describes its novel C‑abs bridge architecture for high‑resolution visual understanding, outlines the two‑stage training with bilingual data, and presents benchmark results showing superior performance over prior LMMs.

AI researchLarge Language ModelLlama3

0 likes · 8 min read

360VL: An Open‑Source Multimodal Large Language Model Based on Llama‑3‑70B

Rare Earth Juejin Tech Community

May 15, 2024 · Artificial Intelligence

OpenAI Unveils GPT‑4o: An Omni‑Capable Multimodal Model Offered Free to All Users

OpenAI introduced GPT‑4o, a free, omni‑capable multimodal model that processes text, audio, and images together, delivers near‑human response latency, showcases impressive live demos, and will soon be available via a discounted API, marking a significant step forward in end‑to‑end AI research.

AI researchGPT-4oLarge Language Model

0 likes · 7 min read

OpenAI Unveils GPT‑4o: An Omni‑Capable Multimodal Model Offered Free to All Users

CSS Magic

May 13, 2024 · Artificial Intelligence

DeepSeek: China’s New LLM Dark Horse – First Impressions and Shockingly Low Prices

The article evaluates DeepSeek v2, a 100‑billion‑parameter MoE model, highlighting its near‑GPT‑4 benchmark performance, OpenAI‑compatible API, 32k‑token context, exceptionally low pricing, a custom token‑utilization metric, and the practical drawbacks observed during hands‑on testing.

API compatibilityBenchmarkDeepSeek

0 likes · 9 min read

DeepSeek: China’s New LLM Dark Horse – First Impressions and Shockingly Low Prices

Baobao Algorithm Notes

May 9, 2024 · Artificial Intelligence

Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance

This article provides an in‑depth technical analysis of Deepseek‑V2, covering its 236B parameter size, Multi‑Head Latent Attention optimization that reduces KV‑cache memory, architectural details, training pipelines, infrastructure choices, and performance results on benchmarks such as MMLU and instruction following.

AI ArchitectureDeepSeekLarge Language Model

0 likes · 17 min read

Inside Deepseek‑V2: How Multi‑Head Latent Attention Cuts KV‑Cache and Boosts Performance

Baidu Tech Salon

May 8, 2024 · Artificial Intelligence

Sugar BI: AI‑Driven Next‑Generation Business Intelligence Platform

Sugar BI, Baidu’s AI‑driven next‑generation business intelligence platform, evolves from the 2016 ShowX system into a zero‑code, multi‑source analytics suite that integrates over 30 data connectors, advanced semantic modeling, and the Wenxin‑powered Sugar Bot, which transforms natural‑language queries into optimized visualizations via intelligent chart recommendation, positioning it as a leading AI‑augmented BI solution.

AIData VisualizationIntelligent Chart Recommendation

0 likes · 19 min read

Sugar BI: AI‑Driven Next‑Generation Business Intelligence Platform

Baidu Geek Talk

May 8, 2024 · Artificial Intelligence

Sugar BI: AI‑Driven Next‑Generation Business Intelligence Platform

Sugar BI, evolving from the internal ShowX platform to versions 2.0‑4.0, now offers a zero‑code, drag‑and‑drop visual editor, support for over 30 data sources, AI‑powered automatic analysis and the Sugar Bot Q&A module that transforms multi‑day data tasks into minutes, delivering containerized SaaS BI with intelligent chart recommendation and rapid, code‑free decision‑making for enterprises.

AIAnalyticsBI

0 likes · 19 min read

Baidu Intelligent Cloud Tech Hub

May 8, 2024 · Artificial Intelligence

How AI Powers the Next‑Gen Sugar BI Platform for Smarter Decision‑Making

This article details the evolution of Baidu's Sugar BI platform, highlighting its AI‑driven analytics, extensive data source support, zero‑code visual design, smart chart recommendation, and the conversational Sugar Bot that transforms natural‑language queries into actionable visual insights.

AIAnalyticsBI

0 likes · 18 min read

How AI Powers the Next‑Gen Sugar BI Platform for Smarter Decision‑Making

JD Cloud Developers

May 8, 2024 · Artificial Intelligence

Deploy Meta’s LLaMA 3 on JD Cloud: A Complete Step‑by‑Step Tutorial

Meta’s newly released LLaMA 3 models (8B and 70B) boast record‑breaking performance, and this guide walks you through the community buzz, technical specs, and a detailed JD Cloud workflow—from provisioning a GPU instance to running the model in a Jupyter environment.

AI DeploymentJD CloudLarge Language Model

0 likes · 6 min read

Deploy Meta’s LLaMA 3 on JD Cloud: A Complete Step‑by‑Step Tutorial

Baobao Algorithm Notes

May 6, 2024 · Artificial Intelligence

DeepSeek-V2: 236B MoE LLM Delivers Higher Performance While Cutting Training Cost by 42%

DeepSeek‑V2 is a 236‑billion‑parameter mixture‑of‑experts language model that reduces training cost by 42.5 %, cuts KV‑cache usage by 93.3 %, and boosts generation throughput 5.76×, while achieving state‑of‑the‑art scores on benchmarks such as MMLU, C‑Eval, BBH, HumanEval, and GSM8K for both base and chat variants.

AIDeepSeek-V2Large Language Model

0 likes · 11 min read

DeepSeek-V2: 236B MoE LLM Delivers Higher Performance While Cutting Training Cost by 42%

IT Services Circle

May 1, 2024 · Artificial Intelligence

Summary of Andrew Ng’s AI Agent Talk: Models, Workflows, and Design Patterns

The article summarizes Andrew Ng’s presentation on AI agents, contrasting traditional single‑prompt large‑model usage with iterative agent‑based workflows, reporting experimental accuracy gains, and outlining four agent design patterns—reflection, tool use, planning, and multi‑agent collaboration—while discussing practical trade‑offs such as latency and token speed.

AI AgentDesign PatternsLarge Language Model

0 likes · 7 min read

Summary of Andrew Ng’s AI Agent Talk: Models, Workflows, and Design Patterns

Baidu Geek Talk

Apr 22, 2024 · Artificial Intelligence

Designing Effective Prompts for Large Language Models: Structure, Code Examples, and Regex Extraction

The article presents a systematic prompt template—comprising Instruction, Input Data, Context, and Output Indicator—demonstrates code examples for single‑ and multi‑task formatting, shows how clear markers enable regex extraction, and introduces Baidu’s PaddlePaddle Star River Community to simplify building reliable LLM‑driven applications.

AILarge Language ModelPython

0 likes · 13 min read

Designing Effective Prompts for Large Language Models: Structure, Code Examples, and Regex Extraction

DataFunTalk

Apr 21, 2024 · Artificial Intelligence

Guidelines for Building Domain-Specific Large Models: Dataset Construction, Training Methods, Evaluation, and Hardware Benchmarking

This article presents a comprehensive guide on constructing domain-specific large language models, covering the differences from general models, how to build high‑quality domain datasets, selecting appropriate training methods, designing validation sets, evaluating model capabilities, and benchmarking domestic hardware performance.

AIDataset ConstructionLarge Language Model

0 likes · 20 min read

Guidelines for Building Domain-Specific Large Models: Dataset Construction, Training Methods, Evaluation, and Hardware Benchmarking

21CTO

Apr 20, 2024 · Artificial Intelligence

What Developers Need to Know About Meta’s New Open‑Source Llama 3 Model

Meta’s newly open‑source Llama 3 model pushes the frontier of large language models with a larger context window, Mixture‑of‑Experts architecture, multilingual support, and multimodal capabilities, while facing challenges in transparency, bias, and computational resources, and offering diverse applications from NLU to code generation.

AIBenchmarkLarge Language Model

0 likes · 10 min read

What Developers Need to Know About Meta’s New Open‑Source Llama 3 Model

New Oriental Technology

Apr 19, 2024 · Artificial Intelligence

Effective Prompt Engineering for Large Language Models

This article explains how large language models work, why well‑crafted prompts are essential, and presents practical strategies—such as clarity, conciseness, focus, role‑setting, delimiters, few‑shot examples, and step‑by‑step instructions—to help users obtain accurate and relevant responses from AI systems.

AILLM strategiesLarge Language Model

0 likes · 12 min read

Effective Prompt Engineering for Large Language Models

AntTech

Apr 19, 2024 · Artificial Intelligence

AgentUniverse: An Enterprise‑Grade Multi‑Agent Framework for Complex Financial Analysis

The article introduces AgentUniverse, a large‑model multi‑agent framework that orchestrates specialized agents through a PEER collaboration pattern to overcome LLM limitations in complex financial tasks, demonstrates its architecture, workflow, experimental superiority on benchmarks, and provides open‑source installation details.

AIAgent frameworkFinancial Analysis

0 likes · 10 min read

AgentUniverse: An Enterprise‑Grade Multi‑Agent Framework for Complex Financial Analysis

NewBeeNLP

Apr 19, 2024 · Artificial Intelligence

Llama 3 Unveiled: 8B & 70B Models Set New SOTA Across Benchmarks

Meta announced the open‑source Llama 3 series (8B and 70B parameters), detailing its decoder‑only Transformer architecture, 15 T‑token multilingual training data, superior benchmark scores over competitors, a limited 8K context window, and upcoming cloud and web‑based deployments.

BenchmarkLarge Language ModelLlama 3

0 likes · 7 min read

Llama 3 Unveiled: 8B & 70B Models Set New SOTA Across Benchmarks

DataFunSummit

Apr 16, 2024 · Artificial Intelligence

Intelligent Risk Control: Definitions, Expert Systems, Algorithmic Systems, and Emerging AI Techniques

This article explains intelligent risk control as a synergy of expert experience and algorithmic decision‑making, outlines its definition, expert human systems, digital algorithmic systems, and explores advanced AI methods such as reinforcement learning, large language models with knowledge graphs, adversarial learning, graph neural networks, and a practical supply‑chain case study.

Graph Neural NetworkLarge Language Modeladversarial learning

0 likes · 11 min read

Intelligent Risk Control: Definitions, Expert Systems, Algorithmic Systems, and Emerging AI Techniques

CSS Magic

Apr 12, 2024 · Artificial Intelligence

Answering Common Kimi API Questions and Exploring AI App Development

This article addresses frequent Kimi API queries, explains the API's purpose, available endpoints, model specifications, token‑based pricing, differences from the web assistant, response variability, JSON output workarounds, and shares upcoming roadmap items for developers building AI applications.

Chat CompletionJSON outputKimi API

0 likes · 10 min read

Answering Common Kimi API Questions and Exploring AI App Development

21CTO

Apr 11, 2024 · Artificial Intelligence

Google Unveils CodeGemma: New AI Models for Code Generation & Reasoning

Google has introduced the CodeGemma series, expanding its Gemma AI models with new variants optimized for code generation and reasoning, featuring 2B‑7B parameter models trained on 500 billion tokens, delivering full‑code block generation, strong benchmark results, and availability on Kaggle, Hugging Face, and Vertex AI.

AIGoogleLarge Language Model

0 likes · 4 min read

Google Unveils CodeGemma: New AI Models for Code Generation & Reasoning

DataFunSummit

Apr 10, 2024 · Artificial Intelligence

Large Language Model Inference Overview and Performance Optimizations

This article presents a comprehensive overview of large language model inference, describing the prefill and decoding stages, key performance metrics such as throughput, latency and QPS, and detailing a series of system-level optimizations—including pipeline parallelism, dynamic batching, KV‑cache quantization, and hardware considerations—to significantly improve inference efficiency on modern GPUs.

GPUInferenceLarge Language Model

0 likes · 23 min read

Large Language Model Inference Overview and Performance Optimizations

21CTO

Apr 8, 2024 · Artificial Intelligence

How Naver’s HyperCLOVA X Advances Multilingual AI for Asian Languages

Naver’s newly unveiled HyperCLOVA X large‑language model, detailed in an arXiv technical report, claims superior cross‑lingual reasoning for Asian languages, especially Korean, by pre‑training on a data mix of Korean, multilingual text and code, achieving state‑of‑the‑art translation and multilingual capabilities.

AI researchHyperCLOVA XKorean NLP

0 likes · 4 min read

How Naver’s HyperCLOVA X Advances Multilingual AI for Asian Languages

21CTO

Mar 29, 2024 · Artificial Intelligence

Why Databricks’ Open‑Source DBRX LLM Is Outpacing GPT‑3.5 and Llama 2

Databricks unveiled the open‑source DBRX large language model, which leverages a mixed‑expert architecture to deliver faster, more cost‑effective inference and beats leading open‑source and proprietary models like Llama 2, Mixtral‑8x7B, and GPT‑3.5 on multiple benchmarks.

AIDBRXDatabricks

0 likes · 7 min read

Why Databricks’ Open‑Source DBRX LLM Is Outpacing GPT‑3.5 and Llama 2

Baobao Algorithm Notes

Mar 28, 2024 · Artificial Intelligence

How Qwen1.5‑MoE‑A2.7B Matches 70B LLM Performance with Only 2.7B Activated Parameters

Qwen1.5‑MoE‑A2.7B is a 2.7 billion‑parameter Mixture‑of‑Experts model that delivers performance comparable to leading 7 billion‑parameter LLMs while cutting training cost by 75% and boosting inference speed by 1.74×, and the article details its architecture, benchmarks, efficiency analysis, and deployment steps.

Large Language ModelMoEModel Benchmark

0 likes · 13 min read

How Qwen1.5‑MoE‑A2.7B Matches 70B LLM Performance with Only 2.7B Activated Parameters

OPPO Kernel Craftsman

Mar 22, 2024 · Artificial Intelligence

InternLM Model Fine-Tuning Tutorial with XTuner: Chat Format and Practical Implementation Guide

This tutorial walks through fine‑tuning Shanghai AI Lab’s open‑source InternLM models with XTuner, explaining chat‑format conventions, loading and inference (including multimodal InternLM‑XComposer), dataset preparation, configuration sections, DeepSpeed acceleration, and memory‑efficient QLoRA details for 7‑B‑parameter chat models.

Chat FormatDeepSpeedHuggingFace

0 likes · 22 min read

InternLM Model Fine-Tuning Tutorial with XTuner: Chat Format and Practical Implementation Guide

Rare Earth Juejin Tech Community

Mar 20, 2024 · Artificial Intelligence

Elon Musk’s xAI Open‑Sources Grok‑1: A 314‑Billion‑Parameter MoE Large Language Model

Elon Musk’s xAI has open‑sourced Grok‑1, a 314‑billion‑parameter mixture‑of‑experts language model built with Rust and JAX, released under an Apache‑2.0 license, and the announcement includes detailed architecture specs, hardware requirements, and the broader context of Musk’s rivalry with OpenAI.

AIGrok-1Large Language Model

0 likes · 6 min read

Elon Musk’s xAI Open‑Sources Grok‑1: A 314‑Billion‑Parameter MoE Large Language Model

Open Source Tech Hub

Mar 17, 2024 · Artificial Intelligence

What Is Grok? Inside Elon Musk’s New Open‑Source LLM and the ‘Grokking’ Phenomenon

Elon Musk announced the open‑source release of Grok, xAI’s new large‑language‑model chatbot, while recalling his lawsuit against OpenAI; the article explains Grok’s rapid development, links to the GitHub repository, summarizes the seminal “Grokking” research paper that describes a sudden generalization breakthrough in neural networks, and provides reference links.

AI researchGrokGrokking

0 likes · 3 min read

What Is Grok? Inside Elon Musk’s New Open‑Source LLM and the ‘Grokking’ Phenomenon

CSS Magic

Mar 13, 2024 · Artificial Intelligence

How Moonshot’s Kimi Model Beats Big‑Tech LLMs with 200k‑Token Context

The author tests Moonshot’s Kimi API, revealing its 200 k‑character context window, superior token‑to‑character ratio compared with GPT‑3.5 and Gemini, and performance that, while slower than GPT‑3.5 Turbo, rivals GPT‑4 Turbo, all while offering OpenAI‑compatible endpoints and free credit for developers.

API compatibilityKimiLarge Language Model

0 likes · 8 min read

How Moonshot’s Kimi Model Beats Big‑Tech LLMs with 200k‑Token Context

DataFunSummit

Mar 11, 2024 · Artificial Intelligence

The Synergy of Large Language Models and Knowledge Graphs: Current Status and Future Directions

This article examines how large language models enhance human‑machine interaction and can be combined with knowledge graphs to improve factual Q&A, task‑oriented services, and structured decision‑making, while highlighting ongoing challenges and the enduring role of knowledge graphs in structured domains.

AILarge Language Modeldialogue system

0 likes · 4 min read

The Synergy of Large Language Models and Knowledge Graphs: Current Status and Future Directions

DataFunTalk

Mar 7, 2024 · Artificial Intelligence

Integrating Large Language Models with Knowledge Graphs: Current Status and Future Directions

Large language models enhance human‑machine interaction and natural language understanding, but knowledge graphs remain essential for structured, low‑cost decision making, factual retrieval, and domains like finance; combining both can improve conversational systems, while ongoing challenges in knowledge graph construction persist, as highlighted for the upcoming DataFunSummit2024.

Conversational AILarge Language ModelStructured Data

0 likes · 5 min read

Integrating Large Language Models with Knowledge Graphs: Current Status and Future Directions

DevOps

Mar 5, 2024 · Artificial Intelligence

Understanding GPT‑4, ChatGPT, and the Foundations of Large Language Models

This article explains the fundamentals of AI, machine learning, deep learning, and natural language processing, describes how Transformer architectures and attention mechanisms power large language models such as GPT‑4 and ChatGPT, and walks through tokenization, prediction, and practical development with Python.

Artificial IntelligenceChatGPTGPT-4

0 likes · 16 min read

Understanding GPT‑4, ChatGPT, and the Foundations of Large Language Models

Java Tech Enthusiast

Mar 5, 2024 · Artificial Intelligence

Claude 3 vs GPT‑4: A Deep Dive into the New AI Giant’s Multimodal Edge

Claude 3 has arrived, outperforming GPT‑4 across benchmark scores, offering free Sonnet and paid Opus tiers, and showcasing unprecedented multimodal, long‑context, and code‑generation abilities that reshape competitive dynamics in large‑language‑model research.

AnthropicClaude 3GPT-4 comparison

0 likes · 12 min read

Claude 3 vs GPT‑4: A Deep Dive into the New AI Giant’s Multimodal Edge

Smart Era Software Development

Feb 28, 2024 · Artificial Intelligence

Google Unleashes Gemma: Open‑Source LLM That Beats Llama 2 and Challenges OpenAI

Google has released the open‑source Gemma large language model in 2 B and 7 B parameter versions, claiming superior performance to Llama 2 and Mistral across 18 benchmarks, especially in math and code, while running on laptops, desktops, IoT and cloud devices.

AIBenchmarkGemma

0 likes · 10 min read

Google Unleashes Gemma: Open‑Source LLM That Beats Llama 2 and Challenges OpenAI

21CTO

Feb 27, 2024 · Artificial Intelligence

Mistral Large: The Open‑Source LLM Challenging GPT‑4 on Azure

Mistral AI, a Paris‑based startup, unveiled Mistral Large—an open‑source, multilingual LLM rivaling GPT‑4 with a 32k token context window, advanced code and math abilities, and native Azure AI integration, marking a major milestone in European AI development.

Azure AILarge Language ModelMistral AI

0 likes · 6 min read

Mistral Large: The Open‑Source LLM Challenging GPT‑4 on Azure

Architects' Tech Alliance

Feb 25, 2024 · Artificial Intelligence

How Sora Redefined Video Generation: Breakthroughs and Industry Impact

The article provides an in‑depth technical analysis of OpenAI's Sora, highlighting its 60‑second 1080p video generation capability, the novel patches‑vectorization and transformer training pipeline that leverages GPT‑generated prompts for multimodal alignment, and its potential to become a universal video‑generation base model that could reshape the AI industry.

AGILarge Language ModelSora

0 likes · 6 min read

How Sora Redefined Video Generation: Breakthroughs and Industry Impact

Programmer DD

Feb 22, 2024 · Artificial Intelligence

Google Unveils Gemma: Open‑Source LLM Matching Gemini’s Power

Google has launched Gemma, an open‑source large language model available in 2B and 7B parameter versions, built on the same technology as Gemini, outperforming many existing models and capable of running on ordinary laptops, with a detailed technical report and quick‑start guide provided online.

AIGemmaGoogle

0 likes · 3 min read

Google Unveils Gemma: Open‑Source LLM Matching Gemini’s Power

DataFunSummit

Feb 21, 2024 · Artificial Intelligence

Applying Knowledge Graphs to E‑commerce AIGC: From Domain to General KG and Large Language Models

This article presents a comprehensive overview of how knowledge graphs are integrated into e‑commerce AIGC pipelines, covering domain‑specific and generic KG‑driven text generation, model architecture, controllable generation techniques, experimental results, and future directions for large language models in commercial settings.

AIAIGCLarge Language Model

0 likes · 23 min read

Applying Knowledge Graphs to E‑commerce AIGC: From Domain to General KG and Large Language Models

Rare Earth Juejin Tech Community

Feb 18, 2024 · Artificial Intelligence

Llama 2: Open Foundation and Fine‑Tuned Chat Models – Overview and Technical Details

The article provides a comprehensive overview of Meta’s Llama 2 series, detailing model sizes, pre‑training data, architectural enhancements, supervised fine‑tuning, RLHF procedures, safety evaluations, reward‑model training, and iterative improvements, highlighting its open‑source release and comparative performance.

AI safetyLarge Language ModelLlama2

0 likes · 27 min read

Llama 2: Open Foundation and Fine‑Tuned Chat Models – Overview and Technical Details

DataFunSummit

Feb 12, 2024 · Artificial Intelligence

Ant Group's Time Series AI Practices and the AntFlux Intelligent Engine

This article presents Ant Group's comprehensive time‑series AI solutions, covering the business value of temporal data, the evolution of statistical and deep learning models, large‑scale time‑series platforms such as AntFlux, and real‑world applications ranging from financial forecasting to green computing.

AIAntFluxLarge Language Model

0 likes · 17 min read

Ant Group's Time Series AI Practices and the AntFlux Intelligent Engine

Baidu Geek Talk

Feb 7, 2024 · Artificial Intelligence

Design and Implementation of a Knowledge-Base Intelligent Q&A System for Database Operations Using Large Models

The paper details Baidu Intelligent Cloud’s design and deployment of a domain‑specific knowledge‑base Q&A system for database operations, combining prompt‑engineered LLMs with hybrid vector‑search using LangChain, BES vector store, and custom ingestion, addressing recall, token limits, and hallucination challenges across dashboard and IM bot interfaces.

AIDatabase operationsKnowledge Base

0 likes · 16 min read

Design and Implementation of a Knowledge-Base Intelligent Q&A System for Database Operations Using Large Models

DataFunTalk

Feb 6, 2024 · Artificial Intelligence

Overview of Vivo BlueLM Large Model: Evolution, Training Challenges, and Product Deployment

This article presents a comprehensive overview of Vivo's BlueLM large language model, covering its historical evolution, the massive data and algorithmic challenges faced during training, safety and performance optimizations, and how the model has been integrated into various consumer and enterprise products.

AIAI safetyBlueLM

0 likes · 16 min read

Overview of Vivo BlueLM Large Model: Evolution, Training Challenges, and Product Deployment

Architect

Jan 27, 2024 · Industry Insights

How We Built a Scalable Smart Customer Service System for an Activity Platform

This article details the end‑to‑end design, implementation, and operational results of a smart customer‑service platform that automates FAQ capture, leverages both Elasticsearch and LLM‑based models, and provides a low‑code, multi‑team backend for rapid issue resolution.

ElasticsearchLarge Language ModelOperations

0 likes · 13 min read

How We Built a Scalable Smart Customer Service System for an Activity Platform

Baidu Geek Talk

Jan 24, 2024 · Artificial Intelligence

Building AI‑Native Applications with Baidu Cloud AppBuilder

Sun Ke’s keynote at the 2023 Baidu Cloud Intelligence Conference explains how AI‑native development has shifted from model selection to building practical applications, and introduces Baidu Cloud AppBuilder—a three‑layer, low‑code‑and‑code platform that provides multimodal, LLM, and infrastructure services, enabling rapid prototyping of solutions such as automated resume screening and interview preparation.

AIAppBuilderLarge Language Model

0 likes · 12 min read

Building AI‑Native Applications with Baidu Cloud AppBuilder

JD Tech

Jan 24, 2024 · Artificial Intelligence

JD Retail Technology 2023 Highlights: AI‑Driven Supply Chain, Large Language Models, Edge AI, Data Security, and 3D Modeling Innovations

In 2023 JD Retail’s technology team delivered a suite of AI‑powered innovations—including end‑to‑end inventory management, explainable AI for supply chain, privacy‑preserving advertising models, a ReAct‑SFT‑RAG large language model framework, edge AI inference, secure data‑safe‑house infrastructure, and high‑quality 3D modeling pipelines—demonstrating broad academic and industrial impact across multiple domains.

3D modelingAIGCArtificial Intelligence

0 likes · 19 min read

JD Retail Technology 2023 Highlights: AI‑Driven Supply Chain, Large Language Models, Edge AI, Data Security, and 3D Modeling Innovations

360 Quality & Efficiency

Jan 19, 2024 · Artificial Intelligence

Using Large Language Models to Rapidly Build Simple Frontend and Backend Test Tools

This article explains how to quickly create simple web‑based and backend test tools for internal use by leveraging a large language model to generate annotated HTML, CSS, JavaScript and minimal Flask code, outlining prompt design, tool requirements, and deployment tips to boost testing efficiency.

AI code generationBackend DevelopmentLarge Language Model

0 likes · 8 min read

Using Large Language Models to Rapidly Build Simple Frontend and Backend Test Tools

DataFunTalk

Jan 16, 2024 · Artificial Intelligence

Applying Knowledge Graphs to E‑commerce AIGC: From Domain‑Specific to General Knowledge Graphs and LLM Integration

This article presents a comprehensive overview of how knowledge graphs are leveraged in e‑commerce AIGC pipelines, detailing domain‑specific and general graph‑based text generation, model architecture, controllable generation techniques, experimental results, and future directions for large language model integration.

AIGCDomain AdaptationLarge Language Model

0 likes · 22 min read

Applying Knowledge Graphs to E‑commerce AIGC: From Domain‑Specific to General Knowledge Graphs and LLM Integration

DataFunSummit

Jan 10, 2024 · Artificial Intelligence

Baidu Commercial Multimodal Understanding and AIGC Innovation Practices

This article presents Baidu's commercial multimodal understanding and AIGC innovations, detailing rich‑media multimodal perception, a unified large‑scale representation framework, scenario‑specific fine‑tuning, and practical applications such as marketing copy, digital‑human video, and poster generation.

AIGCAdvertisingBaidu

0 likes · 12 min read

Baidu Commercial Multimodal Understanding and AIGC Innovation Practices

DataFunSummit

Jan 8, 2024 · Artificial Intelligence

Enterprise Knowledge Recommendation System at Alibaba: Architecture, Challenges, and Large Model Applications

This article presents Alibaba's enterprise knowledge recommendation system, detailing its role in digital transformation, the challenges of long‑document recommendation, the multi‑layer architecture spanning feature, engine, ranking, and functional layers, various recall strategies, progressive ranking models, and the integration and evaluation of large language models for improved recommendation performance.

AIAlibabaLarge Language Model

0 likes · 23 min read

Enterprise Knowledge Recommendation System at Alibaba: Architecture, Challenges, and Large Model Applications

Architecture & Thinking

Jan 8, 2024 · Artificial Intelligence

How Baidu Comate Supercharges Coding: A Practical AI Assistant Guide

This article introduces Baidu Comate, an AI-powered coding assistant built on the Wenxin model, explains how to install it, demonstrates its real-time code completion, comment generation, test creation, and optimization features across multiple languages and IDEs, and highlights its benefits for developers.

AI coding assistantLarge Language ModelVS Code

0 likes · 10 min read

How Baidu Comate Supercharges Coding: A Practical AI Assistant Guide

21CTO

Dec 31, 2023 · Artificial Intelligence

2023’s Leading Open-Source LLMs: LLaMA, Pythia, MPT, Falcon, BLOOM, Mistral

Since ChatGPT’s debut, interest in large language models has surged, prompting the AI community to explore open‑source alternatives such as LLaMA, Pythia, MPT, Falcon, BLOOM, and Mistral, which together illustrate the rapid diversification and growing competitiveness of open‑source LLMs in 2023.

2023AILarge Language Model

0 likes · 9 min read

2023’s Leading Open-Source LLMs: LLaMA, Pythia, MPT, Falcon, BLOOM, Mistral

DataFunTalk

Dec 29, 2023 · Artificial Intelligence

Enterprise Knowledge Assistant: Leveraging Vector Databases and Large Language Models

This article explores the emerging enterprise knowledge assistant paradigm in the era of large models, detailing traditional knowledge management challenges, solution architecture using vector databases and LLMs, core technologies such as ETL pipelines, reranking, secure fine‑tuning, and future prospects for intelligent enterprise applications.

Enterprise AIKnowledge ManagementLLM fine-tuning

0 likes · 11 min read

Enterprise Knowledge Assistant: Leveraging Vector Databases and Large Language Models

AI Large Model Application Practice

Dec 28, 2023 · Artificial Intelligence

How AI Agents Can Transform Enterprise Operations and Architecture

The article examines the rise of AI Agents as a bridge to AGI, analyzes their value, application domains, and user groups in enterprises, and proposes a layered architecture with model, data, ops, and agent components to guide practical implementation and integration.

AI AgentAI ArchitectureData Management

0 likes · 14 min read

How AI Agents Can Transform Enterprise Operations and Architecture

21CTO

Dec 18, 2023 · Artificial Intelligence

Why Did Google’s Gemini‑Pro Claim to Be Baidu’s Model in Chinese Chats?

A recent test on Google Vertex AI showed Gemini‑Pro introducing itself as Baidu’s Wenxin model during Chinese conversations, sparking debate about model attribution, pricing, developer tools, and the broader competition among major AI platforms.

AI PlatformsGemini ProGoogle AI

0 likes · 5 min read

Why Did Google’s Gemini‑Pro Claim to Be Baidu’s Model in Chinese Chats?

CSS Magic

Dec 15, 2023 · Artificial Intelligence

Google Gemini Free API Launch: A Deep Dive for Developers

Google has opened its Gemini Pro large‑language model via a completely free API with a 60‑calls‑per‑minute limit, offering an online playground, straightforward key registration, efficient token usage, and streaming output, while noting it remains a technical preview rather than a consumer‑ready service.

AIAPI usageFree API

0 likes · 3 min read

Google Gemini Free API Launch: A Deep Dive for Developers

Rare Earth Juejin Tech Community

Dec 9, 2023 · Artificial Intelligence

Google Unveils Gemini: A New Multimodal Large Model Family (Ultra, Pro, Nano)

Google announced Gemini, a suite of multimodal large language models—including Ultra, Pro, and Nano—that achieve state‑of‑the‑art results on dozens of benchmarks, support native multimodal pre‑training, and are being integrated across Google products such as Bard, Search, and upcoming Pixel devices.

Artificial IntelligenceBenchmarkGemini

0 likes · 7 min read

Google Unveils Gemini: A New Multimodal Large Model Family (Ultra, Pro, Nano)

Tencent Cloud Developer

Dec 7, 2023 · Artificial Intelligence

Student Score Ranking and Distribution Analysis Using Python and Tencent Hunyuan Model

Using Tencent's Hunyuan model, the tutorial walks through a Python workflow that scrapes a student‑score table from a web page, saves it as CSV and Excel, cleans missing values, computes total and average scores, and visualizes their distributions with matplotlib, illustrating how LLMs can accelerate data‑analysis coding while still needing human verification.

Data VisualizationLarge Language ModelMatplotlib

0 likes · 8 min read

Student Score Ranking and Distribution Analysis Using Python and Tencent Hunyuan Model

AntTech

Dec 2, 2023 · Artificial Intelligence

TechTalk AI Sharing Season: OpenKG Enters Ant Group – Knowledge Graphs and Large Language Models Empower General AI

The TechTalk AI Sharing Season event on November 28 brought together nearly thirty experts from academia and industry to discuss how knowledge graphs and large language models can be integrated to enhance Ant Group's AI strategy across diverse business scenarios, highlighting collaborations, research labs, and future development directions.

AI strategyAnt GroupIndustry-Academia Collaboration

0 likes · 7 min read

TechTalk AI Sharing Season: OpenKG Enters Ant Group – Knowledge Graphs and Large Language Models Empower General AI

HomeTech

Dec 1, 2023 · Artificial Intelligence

Building a Private Knowledge Base and Large‑Model Platform for Enterprise AI Assistants

This article describes how an enterprise leveraged GPT‑3.5 and other large language models to create a private knowledge base, design prompt engineering, implement plugin extensions, and build a secure, scalable backend and front‑end integration platform that enables AI‑driven customer‑service assistants across multiple business lines.

AILarge Language ModelPrivate Knowledge Base

0 likes · 19 min read

Building a Private Knowledge Base and Large‑Model Platform for Enterprise AI Assistants

Baidu Geek Talk

Nov 27, 2023 · Industry Insights

Inside Baidu’s Lingjing Platform: How AI Developer Ecosystems Are Built

This article examines Baidu’s Lingjing developer platform, exploring its origins, design choices, integration of plugins and agents, ecosystem advantages, commercial‑monetization loops, and future roadmap, while providing insights from an interview with platform head Zhang Ruixing on the challenges and opportunities of building AI‑native developer platforms.

AIAgentDeveloper Platform

0 likes · 16 min read

Inside Baidu’s Lingjing Platform: How AI Developer Ecosystems Are Built

NetEase Smart Enterprise Tech+

Nov 22, 2023 · Artificial Intelligence

How Large Language Models Can Boost Smart Chatbot Resolution Rates

This article explains how large language models can automatically analyze the factors affecting smart chatbot resolution rates, identify why customers are transferred to human agents, and provide data‑driven solutions, illustrated by a case study with a major automotive client.

AIKnowledge BaseLarge Language Model

0 likes · 13 min read

How Large Language Models Can Boost Smart Chatbot Resolution Rates

Ant R&D Efficiency

Nov 21, 2023 · Artificial Intelligence

Can AI Code Completion Transform Java Development? One Engineer’s Journey

Java engineer Wu Ming shares his experience with CodeFuse, an AI-powered code completion tool, describing how large language models enhance coding efficiency, the challenges of early versions, practical tips for integrating AI assistants into workflows, and his vision for AI’s expanding role across the entire software development lifecycle.

AI code assistantAI workflowCodeFuse

0 likes · 12 min read

Can AI Code Completion Transform Java Development? One Engineer’s Journey

DataFunSummit

Nov 11, 2023 · Artificial Intelligence

RWKV: Next‑Generation Heterogeneous Large Model – Design, Evolution, Performance, and Training Strategies

This article presents a comprehensive overview of the RWKV large language model, covering its origin, attention‑free RNN architecture, performance benchmarks, evolution through v4 and v5, training pipelines, diverse application cases, open‑source ecosystem, and a detailed Q&A session.

AILarge Language ModelModel Training

0 likes · 18 min read

RWKV: Next‑Generation Heterogeneous Large Model – Design, Evolution, Performance, and Training Strategies

Programmer DD

Nov 7, 2023 · Artificial Intelligence

Inside xAI’s Grok: How a 330‑B Model Beats ChatGPT and Redefines AI Development

The article details xAI’s newly launched Grok AI assistant, its multi‑session UI, real‑time Twitter integration, benchmark performance surpassing ChatGPT‑3.5, the underlying 330‑billion‑parameter Grok‑1 model, Rust‑based infrastructure, current limitations, and the research directions xAI is pursuing to advance reliable, scalable artificial intelligence.

AI benchmarkingGrokLarge Language Model

0 likes · 12 min read

Inside xAI’s Grok: How a 330‑B Model Beats ChatGPT and Redefines AI Development

DataFunTalk

Nov 1, 2023 · Artificial Intelligence

Data‑Centric LLM and Knowledge Graph Integration: Fabarta’s AI‑Era Data Infrastructure

The article presents Fabarta’s AI‑era data infrastructure that combines large language models with knowledge graphs and a multimodal database, detailing its data‑centric architecture, HTAP capabilities, cloud‑native design, and real‑world demos that illustrate how graph‑plus‑vector techniques improve model reliability and answer precision.

AIHTAPLarge Language Model

0 likes · 19 min read

Data‑Centric LLM and Knowledge Graph Integration: Fabarta’s AI‑Era Data Infrastructure

DataFunTalk

Oct 30, 2023 · Databases

Engineering Practices and Evolution of Douyin’s Cloud‑Native Vector Database

This article outlines Douyin’s step‑by‑step engineering evolution of its cloud‑native vector database, covering the background of vector search, core concepts, algorithmic optimizations, storage‑compute separation, streaming updates, multi‑tenant orchestration, and future applications such as large language model integration.

ANNDouyinLarge Language Model

0 likes · 17 min read

Engineering Practices and Evolution of Douyin’s Cloud‑Native Vector Database

Ant R&D Efficiency

Oct 26, 2023 · Artificial Intelligence

TestAgent: Open-Source 7B LLM That Supercharges Automated Test Generation

TestAgent is an open-source 7B test-domain LLM that delivers multi-language test-case generation, automatic assert completion, and a rapid deployment framework, offering industry-leading pass@1 scores, a ChatBot UI, and detailed setup instructions for diverse hardware environments.

AI testingLarge Language ModelModel Deployment

0 likes · 8 min read

TestAgent: Open-Source 7B LLM That Supercharges Automated Test Generation

Huawei Cloud Developer Alliance

Oct 25, 2023 · Artificial Intelligence

Unlocking GLM & ChatGLM: Deep Dive into MindSpore Large‑Model Techniques

The MindSpore Season 2 open class offers a comprehensive overview of GLM to ChatGLM architectures, positional‑embedding strategies, stable training optimizations, and step‑by‑step instructions for deploying large language models with Ascend, ModelArts, and MindSpore Transformers, while previewing upcoming multimodal remote‑sensing sessions.

Artificial IntelligenceChatGLMGLM

0 likes · 6 min read

Unlocking GLM & ChatGLM: Deep Dive into MindSpore Large‑Model Techniques

Bilibili Tech

Oct 13, 2023 · Artificial Intelligence

Multimodal Video High‑Energy Segment Extraction for Dynamic Video Covers

The authors present a multimodal system that automatically extracts high‑energy video segments for dynamic covers by analyzing subtitles, audio, visual frames, and danmu, employing LLM prompt‑tuning, scene‑cut detection, and aesthetic scoring to reduce manual effort and boost click‑through rates.

ASRLarge Language ModelOCR

0 likes · 14 min read

Multimodal Video High‑Energy Segment Extraction for Dynamic Video Covers

Sohu Tech Products

Oct 11, 2023 · Artificial Intelligence

EcomGPT: Training an E-commerce Domain Large Language Model via Instruction Tuning

EcomGPT, an Alibaba‑trained e‑commerce large language model, uses a 1.5 million‑sample instruction dataset (EcomInstruct) to demonstrate that domain‑specific instruction tuning dramatically outperforms general‑purpose models on e‑commerce tasks, reducing hallucinations and improving task accuracy, with performance scaling as data diversity increases.

Alibaba NLPDomain-Specific AIEcomGPT

0 likes · 7 min read

EcomGPT: Training an E-commerce Domain Large Language Model via Instruction Tuning

Ant R&D Efficiency

Sep 28, 2023 · Artificial Intelligence

CodeFuse: Open‑Source Large Code Model with Multi‑Task Fine‑Tuning and 4‑Bit Quantization

Ant Group’s open‑source CodeFuse is a large‑scale code‑generation model featuring multi‑task fine‑tuning and 4‑bit quantization, achieving a 74.4% HumanEval score that outperforms GPT‑4, supporting tasks from code synthesis to bug fixing, and can be deployed on a single high‑end GPU.

AICodeFuseLarge Language Model

0 likes · 9 min read

CodeFuse: Open‑Source Large Code Model with Multi‑Task Fine‑Tuning and 4‑Bit Quantization

Baidu Tech Salon

Sep 13, 2023 · Artificial Intelligence

How Baidu’s Wenxin Yiyan Powers AI Text Creation in Mobile Apps

This article explains the technical workflow behind Baidu App’s AI‑assisted text generation, covering large‑language‑model fundamentals, three‑layer system architecture, prompt design, risk‑control measures, SSE streaming, and a custom Android TextView implementation for smooth, gradient‑styled output.

AI text generationAndroid UILarge Language Model

0 likes · 13 min read

How Baidu’s Wenxin Yiyan Powers AI Text Creation in Mobile Apps

Alipay Experience Technology

Sep 12, 2023 · Artificial Intelligence

Demystifying ChatGPT: From Transformer Basics to Business Applications

This article offers a non‑algorithmic engineer’s clear overview of large language models, explaining ChatGPT’s generative‑pre‑training‑transformer foundation, core mechanisms like attention, practical prompt‑engineering tips, and how enterprises can integrate LLMs into data analysis, smart‑customer service, and other business workflows while noting associated risks.

AI ApplicationsChatGPTLarge Language Model

0 likes · 28 min read

Demystifying ChatGPT: From Transformer Basics to Business Applications

Tencent Cloud Developer

Sep 7, 2023 · Artificial Intelligence

What Is Tencent’s Hunyuan LLM and How Is It Shaping AI Applications?

At the 2023 Global Digital Ecology Conference, Tencent unveiled its self‑developed, open‑access large language model Hunyuan, detailing its massive scale, full‑stack proprietary technology, performance advantages, diverse business integrations, and the company’s broader strategy to embrace and commercialize advanced AI across its ecosystem.

AI ApplicationsArtificial IntelligenceIndustry insight

0 likes · 8 min read

What Is Tencent’s Hunyuan LLM and How Is It Shaping AI Applications?

Sohu Tech Products

Sep 6, 2023 · Artificial Intelligence

Can Domain-Specific LLMs Outperform General Models? Insights from EcomGPT

This article presents the development and evaluation of EcomGPT, a domain‑specific large language model for e‑commerce, detailing dataset construction, instruction‑tuning methods, experimental results, and the impact of atomic tasks on model performance.

Domain AdaptationEcomGPTInstruction Tuning

0 likes · 9 min read

Can Domain-Specific LLMs Outperform General Models? Insights from EcomGPT

DataFunTalk

Sep 6, 2023 · Databases

Large Model + OLAP: Enabling a New Data Service Platform

This article details how Tencent Music combines large language models with an Apache Doris‑based OLAP engine, introduces a semantic layer, manual‑experience routing, schema mapping and plugin integration, and outlines the evolution of its data architecture through four versions to achieve real‑time, cost‑effective, and scalable intelligent data services.

Apache DorisData WarehouseLarge Language Model

0 likes · 24 min read

Large Model + OLAP: Enabling a New Data Service Platform

AI Large Model Application Practice

Sep 6, 2023 · Artificial Intelligence

Prompt Engineering vs Fine‑Tuning: How to Choose the Best Strategy for Reliable LLM Outputs

This article compares Prompt Engineering and Supervised Fine‑Tuning for large language models, explains their principles, showcases common prompt patterns such as Chain‑of‑Thought, ReAct and Self‑Ask, outlines fine‑tuning stages and trade‑offs, and provides practical guidance on selecting the most suitable approach for specific enterprise AI Agent scenarios.

AI AgentLLMLarge Language Model

0 likes · 17 min read

Prompt Engineering vs Fine‑Tuning: How to Choose the Best Strategy for Reliable LLM Outputs

JD Retail Technology

Aug 29, 2023 · Artificial Intelligence

ChatGPT 0720 Update: Custom Instructions, System Messages, and Implementation Guide

The article introduces the ChatGPT 0720 update, explains the new Custom Instructions feature, shows how to enable and configure it, compares responses with and without the feature, and provides detailed code examples for implementing similar functionality via system messages in developer tools.

ChatGPTCustom InstructionsDeveloper Tools

0 likes · 8 min read

ChatGPT 0720 Update: Custom Instructions, System Messages, and Implementation Guide

Baidu Tech Salon

Aug 29, 2023 · Artificial Intelligence

Insights into Baidu's Wenxin Yiyan Large Language Model and Its Role in AI-Driven Industrial Production

Baidu’s Wenxin Yiyan, a knowledge‑enhanced large language model now at version 3.5, outperforms ChatGPT, showcases rapid training and inference, and is being deployed across transportation, agriculture, energy and other sectors, illustrating AI’s transition to industrial mass production and its strategic boost for China’s high‑quality economic growth.

AI strategyArtificial IntelligenceIndustrial Applications

0 likes · 9 min read

Insights into Baidu's Wenxin Yiyan Large Language Model and Its Role in AI-Driven Industrial Production

php Courses

Aug 25, 2023 · Artificial Intelligence

Meta Launches Code Llama: An Advanced AI Coding Model

Meta introduced Code Llama, a Llama 2‑based AI coding model available in base, Python‑specific, and instruction‑tuned versions across 7B, 13B, and 34B sizes, claiming superior benchmark performance and free community licensing for research and commercial use.

AI codingBenchmarkCode Llama

0 likes · 5 min read

Meta Launches Code Llama: An Advanced AI Coding Model

Sohu Tech Products

Aug 23, 2023 · Artificial Intelligence

Engineering GPT Applications: Capabilities, Limitations, and Solutions

The guide explains GPT’s core capabilities—natural language mastery, domain reasoning, and code generation—while detailing its limits such as prompt sensitivity, token caps, and lack of memory, then offers engineering workarounds like systematic prompting, chain‑of‑thought, external memory, tool integration, safety checks, and a six‑layer architecture for building robust commercial AI applications.

AI Application ArchitectureGPTLarge Language Model

0 likes · 20 min read

Engineering GPT Applications: Capabilities, Limitations, and Solutions

DataFunSummit

Aug 3, 2023 · Artificial Intelligence

Integrating Vector Databases with Large Language Models for Enterprise AI Applications

The article explains how combining vector databases with large language models can help governments and enterprises leverage massive private data for AI, covering semantic search, approximate nearest neighbor techniques, alignment challenges across modalities, and future directions for fine‑grained data integration.

AILarge Language ModelVector Database

0 likes · 7 min read

Integrating Vector Databases with Large Language Models for Enterprise AI Applications

Model Perspective

Jul 31, 2023 · Artificial Intelligence

From RNN to ChatGPT: How AIGC Evolved with Transformers and Large Models

This article traces the evolution of AI‑generated content (AIGC) from early RNN‑based Seq2Seq models through the transformative impact of the Transformer architecture, covering key milestones such as UniLM, T5, BART, the GPT series, InstructGPT, and the emergence of ChatGPT.

AI content generationAIGCGPT

0 likes · 9 min read

From RNN to ChatGPT: How AIGC Evolved with Transformers and Large Models

JD Tech

Jul 31, 2023 · Artificial Intelligence

Local Deployment, Fine‑tuning, and Inference of the Open‑source Alpaca‑LoRA Model on GPU Servers

This article details the step‑by‑step process of installing GPU drivers, setting up a Python environment, deploying the open‑source Alpaca‑LoRA large language model, fine‑tuning it with Chinese data on a multi‑GPU server, and running inference, while discussing practical challenges and performance observations.

AlpacaGPUInference

0 likes · 14 min read

Local Deployment, Fine‑tuning, and Inference of the Open‑source Alpaca‑LoRA Model on GPU Servers

HomeTech

Jul 26, 2023 · Artificial Intelligence

Practical Implementation of ChatGPT Technology Products: Architecture, Prompt Engineering, and Future Challenges

This article explores the practical deployment of ChatGPT‑based products, detailing the model fundamentals, technical architecture, engineering‑focused prompt design, real‑world application scenarios, and the challenges of model generalization, resource consumption, data privacy, interpretability, and ethical considerations.

AI ArchitectureChatGPTJava

0 likes · 15 min read

Practical Implementation of ChatGPT Technology Products: Architecture, Prompt Engineering, and Future Challenges

21CTO

Jul 8, 2023 · Artificial Intelligence

What Developers Need to Know About GPT‑4’s New 8K Context and Multimodal Capabilities

OpenAI has opened GPT‑4’s API to all paid users, offering an 8K‑token context window (up to 32K), multimodal image input, enhanced creativity, longer text handling, and upcoming fine‑tuning options, while also outlining phased deprecation of older models and current limitations.

AI safetyAPIGPT-4

0 likes · 10 min read

What Developers Need to Know About GPT‑4’s New 8K Context and Multimodal Capabilities

Cloud Native Technology Community

Jun 28, 2023 · Artificial Intelligence

Building and Deploying Custom Large Language Models with Alauda Cloud‑Native MLOps

This article explains how enterprises can use the Alauda MLOps platform to quickly set up, fine‑tune, and deploy private large language models on cloud‑native infrastructure, covering notebook preparation, GPU allocation, model download, inference service creation, distributed training pipelines, and Docker image building.

AILarge Language ModelMLOps

0 likes · 9 min read

Building and Deploying Custom Large Language Models with Alauda Cloud‑Native MLOps

HelloTech

Jun 21, 2023 · Artificial Intelligence

Overview of Haro Intelligent Customer Service: Algorithms, Challenges, and AI Solutions

Haro’s intelligent customer service combines a smart FAQ recommender and a conversational chatbot that leverages matching‑based intent recognition, large‑scale domain pre‑training, metric‑learning for new intents, and fine‑tuned generative LLMs, achieving 82 % top‑1 accuracy while reducing human workload and outlining future API‑orchestrated, multimodal AI enhancements.

AILarge Language ModelNLP

0 likes · 10 min read

Overview of Haro Intelligent Customer Service: Algorithms, Challenges, and AI Solutions

Rare Earth Juejin Tech Community

Jun 11, 2023 · Artificial Intelligence

Comprehensive Technical Overview of GPT Series, Transformers, and Emerging Capabilities in Large Language Models

This article provides a detailed technical review of the evolution of GPT models, the Transformer architecture, large language model training methods, emergent abilities such as in‑context learning and chain‑of‑thought, multimodal extensions, and the challenges of data, scaling, and alignment, offering a holistic view for researchers and practitioners.

AIGPTInstructGPT

0 likes · 28 min read

Comprehensive Technical Overview of GPT Series, Transformers, and Emerging Capabilities in Large Language Models

NetEase Cloud Music Tech Team

Jun 2, 2023 · Frontend Development

How NetEase Cloud Music’s Front‑End Team Built an AI‑Powered Low‑Code Copilot

NetEase Cloud Music’s front‑end team integrated large language models into their internal low‑code platform, creating an AI Copilot that supports smart page creation, editing, component configuration, code snippet generation, and Q&A, while detailing the underlying architecture, prompt engineering, and mixed‑mode development workflow.

AI CopilotLarge Language ModelMixed Development

0 likes · 11 min read

How NetEase Cloud Music’s Front‑End Team Built an AI‑Powered Low‑Code Copilot

Baidu Tech Salon

May 29, 2023 · Artificial Intelligence

Baidu CTO Wang Haifeng Highlights Wenxin Yiyan Large Language Model at Zhongguancun Forum

At the Zhongguancun Forum, Baidu CTO Wang Haifeng showcased the self‑developed Wenxin Yiyan large language model—demonstrating its knowledge‑enhanced Q&A, writing, poetry, video generation and reasoning abilities, its integration as an intelligent office assistant, and its role in driving a model‑as‑a‑service ecosystem that fuels China’s AI‑led industrial transformation.

AI policyArtificial IntelligenceBaidu

0 likes · 7 min read

Baidu CTO Wang Haifeng Highlights Wenxin Yiyan Large Language Model at Zhongguancun Forum

JD Tech

May 23, 2023 · Artificial Intelligence

Understanding ChatGPT: Principles, Limitations, and a Five‑Layer Application Guide

This article explains the fundamentals of GPT models, contrasts large models with traditional AI, details ChatGPT's architecture and token processing, outlines its limitations, and presents a five‑layer framework for applying ChatGPT across chat, language, text, reasoning, and private model use cases.

AIChatGPTLarge Language Model

0 likes · 21 min read

Understanding ChatGPT: Principles, Limitations, and a Five‑Layer Application Guide

JD Retail Technology

May 18, 2023 · Artificial Intelligence

Local Deployment, Inference, and Fine‑tuning of the Vicuna‑7B Large Language Model

This article details the step‑by‑step process of preparing the environment, merging weights, installing dependencies, running inference, evaluating Vicuna‑7B against other models, and attempting fine‑tuning, while highlighting performance results, encountered issues, and future work for large language model deployment.

GPUInferenceLarge Language Model

0 likes · 11 min read

Local Deployment, Inference, and Fine‑tuning of the Vicuna‑7B Large Language Model

Full-Stack Trendsetter

May 18, 2023 · Artificial Intelligence

How 360 and ChatGLM Are Building China’s “Microsoft + OpenAI” Large‑Model Duo

On May 16, 360 and Zhipu AI announced a strategic partnership to co‑develop the trillion‑parameter models 360GLM and 360GPT, positioning them as China’s answer to Microsoft‑OpenAI by combining large‑scale pre‑training, bilingual capabilities, and integration with 360’s search and browser ecosystem.

360AI collaborationChatGLM

0 likes · 7 min read

How 360 and ChatGLM Are Building China’s “Microsoft + OpenAI” Large‑Model Duo

Baidu Geek Talk

May 10, 2023 · Artificial Intelligence

Baidu's AI Infrastructure for Large-Scale LLM Training: Architecture, Challenges, and Optimization

Baidu’s AI infrastructure combines a massive InfiniBand‑linked GPU cluster, Kunlun chips, the PaddlePaddle framework, and the Wenxin model suite with 4D hybrid parallelism, elastic fault tolerance, and a two‑stage training pipeline to overcome computation, memory, and communication walls, delivering world‑leading MLPerf performance for large‑scale LLMs.

GPU ClusterInfiniBandLarge Language Model

0 likes · 15 min read

Baidu's AI Infrastructure for Large-Scale LLM Training: Architecture, Challenges, and Optimization

Baidu Tech Salon

May 9, 2023 · Artificial Intelligence

How Baidu’s Award‑Winning Dialogue Tech Powers China’s AI Surge

The article examines Baidu’s groundbreaking knowledge‑deep learning dialogue system that earned the 2022 Wu Wenjun AI Science and Technology Award, detailing its technical breakthroughs, patent portfolio, large‑scale deployments, and how it underpins China’s rapid advancement in large language models and AI industry integration.

AIArtificial IntelligenceBaidu

0 likes · 12 min read

How Baidu’s Award‑Winning Dialogue Tech Powers China’s AI Surge

Rare Earth Juejin Tech Community

May 8, 2023 · Artificial Intelligence

Review of Alibaba's Tongyi Qianwen AI Model with Sample Code, Recipe, and SWOT Analysis

This article reviews Alibaba's Tongyi Qianwen large language model, shares personal impressions, provides a fish‑flavored pork recipe, conducts a SWOT analysis, and includes Scala Spark and Java code examples illustrating its capabilities and usage scenarios.

JavaLarge Language ModelSWOT analysis

0 likes · 12 min read

Review of Alibaba's Tongyi Qianwen AI Model with Sample Code, Recipe, and SWOT Analysis

Rare Earth Juejin Tech Community

May 8, 2023 · Artificial Intelligence

Understanding the Principles Behind ChatGPT: NLP, Transformers, and Reinforcement Learning

This article explains how ChatGPT works by covering the fundamentals of natural language processing, generative language models, deep learning, the Transformer architecture, attention mechanisms, few‑shot learning, and the reinforcement‑learning techniques that align its outputs with human preferences.

AIChatGPTLarge Language Model

0 likes · 24 min read

Understanding the Principles Behind ChatGPT: NLP, Transformers, and Reinforcement Learning

Rare Earth Juejin Tech Community

Apr 28, 2023 · Artificial Intelligence

Exploring Alibaba’s Tongyi Qianwen AI Model, SWOT, Recipe Demo, and Code Samples for Spark Same‑Period Analysis and Java Bubble Sort

The article reviews Alibaba’s Tongyi Qianwen large‑language model, shares a cooking recipe generated by the AI, presents a SWOT analysis, and provides code examples—including a Spark Scala script for same‑period month‑over‑month calculations and a Java bubble‑sort implementation.

AIJavaLarge Language Model

0 likes · 12 min read

Exploring Alibaba’s Tongyi Qianwen AI Model, SWOT, Recipe Demo, and Code Samples for Spark Same‑Period Analysis and Java Bubble Sort

21CTO

Apr 24, 2023 · Artificial Intelligence

Inside MOSS 003: Fudan University's Open-Source Large Language Model

This article details the evolution of Fudan University's open‑source MOSS series—from the early OpenChat 001 prototype to the current MOSS 003—covering data collection, multilingual capabilities, plugin architecture, model releases on HuggingFace, and how developers can start using the models.

AIChinese NLPLarge Language Model

0 likes · 10 min read

Inside MOSS 003: Fudan University's Open-Source Large Language Model

Architect

Apr 24, 2023 · Artificial Intelligence

MOSS 003: Open‑Source Large Language Model Development, Training Data, and Plugin‑Enabled Deployment

The article details the evolution of the open‑source MOSS series—from OpenChat 001 to MOSS 003—covering data collection, fine‑tuning procedures, multilingual capabilities, plugin architecture, example code for inference, and upcoming releases, providing a comprehensive technical overview for AI practitioners.

AILarge Language ModelMOSS

0 likes · 11 min read

MOSS 003: Open‑Source Large Language Model Development, Training Data, and Plugin‑Enabled Deployment