Tagged articles

2015 articles

Page 19 of 21

Apr 6, 2024 · Artificial Intelligence

Exploring Large Language Models for Recommendation Systems: Experiments and Insights

This article investigates how large language models can be applied to recommendation tasks, describing two usage strategies, various ranking approaches, experimental evaluations on multiple datasets, comparisons with traditional models, and analyses of prompt design, cost, and cold‑start capabilities.

LLMPrompt engineeringranking

0 likes · 13 min read

Exploring Large Language Models for Recommendation Systems: Experiments and Insights

AI Large Model Application Practice

Apr 5, 2024 · Artificial Intelligence

Hands‑On Comparison of Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze LLM Platforms

This article provides a practical, side‑by‑side review of three major large‑model application development platforms—Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze—detailing their creation workflows, configuration options, SDK capabilities, plugin ecosystems, workflow orchestration, and overall strengths and limitations for building AI agents.

AI PlatformAppBuilderComparison

0 likes · 18 min read

Hands‑On Comparison of Baidu AppBuilder, Alibaba Bailei, and ByteDance Coze LLM Platforms

dbaplus Community

Apr 4, 2024 · Artificial Intelligence

10 Guiding Principles for Building LLM‑Powered Software Applications

This article outlines ten practical principles for designing applications with large language models, emphasizing a model‑first mindset, precision through interactive disambiguation, clear division of code and model responsibilities, data quality, handling uncertainty, and recognizing the limits of LLMs to build robust, maintainable software.

AI designData QualityLLM

0 likes · 13 min read

10 Guiding Principles for Building LLM‑Powered Software Applications

NewBeeNLP

Apr 2, 2024 · Artificial Intelligence

Jamba: How AI21 Labs Merged Mamba and Transformer for 3× Faster 128k Contexts

Jamba, a hybrid Mamba‑Transformer model from AI21 Labs, combines state‑space and attention layers with Mixture‑of‑Experts to deliver up to three times the throughput of comparable 52‑billion‑parameter LLMs on 128k context windows while maintaining high output quality and low memory usage.

JambaLLMMamba

0 likes · 6 min read

Jamba: How AI21 Labs Merged Mamba and Transformer for 3× Faster 128k Contexts

Rare Earth Juejin Tech Community

Mar 30, 2024 · Artificial Intelligence

Comprehensive Guide to Coze: AI Bot Development, Prompt Engineering, and Workflow Design

This article provides an in‑depth overview of the Coze low‑code AI bot platform, covering its core features, product comparisons, step‑by‑step bot creation, RAG implementation, plugin usage, memory mechanisms, cron jobs, agent design, advanced workflow techniques, quality management, and future prospects.

AI botCozeLLM

0 likes · 25 min read

Comprehensive Guide to Coze: AI Bot Development, Prompt Engineering, and Workflow Design

Baobao Algorithm Notes

Mar 29, 2024 · Artificial Intelligence

Can Data Mixing Laws Predict LLM Performance? A Deep Dive into Scaling Laws

This article reviews the paper “Data Mixing Laws: Optimizing Data Mixture by Predicting Language Modeling Performance”, explaining how the authors quantify the impact of data mixture ratios on LLM loss, propose a simple predictive model, validate it on RedPajama and multi‑domain mixes, and outline a scaling‑law procedure for continual pre‑training.

Data MixingData SchedulingLLM

0 likes · 9 min read

Can Data Mixing Laws Predict LLM Performance? A Deep Dive into Scaling Laws

AI Large Model Application Practice

Mar 29, 2024 · Artificial Intelligence

How RAG Architecture Evolves: From Simple Chains to Flexible RAG Flows

This article examines the evolution of Retrieval‑Augmented Generation (RAG) architectures for large language models, outlines the challenges they face, introduces the modular RAG Flow concept with four workflow paradigms, and provides a step‑by‑step implementation using LangChain and LlamaIndex with code examples.

LLMLangChainRAG

0 likes · 15 min read

How RAG Architecture Evolves: From Simple Chains to Flexible RAG Flows

DataFunSummit

Mar 29, 2024 · Artificial Intelligence

Large Language Model (LLM) Revolution in Recommendation Systems: Overview, Techniques, and Future Directions

This article reviews how the rapid rise of large language models, exemplified by ChatGPT, is transforming recommendation systems by addressing traditional ID‑centric limitations, introducing prompt‑based and ID‑free representations, discussing recent research advances, practical challenges, and future research directions.

AILLMlarge models

0 likes · 18 min read

Large Language Model (LLM) Revolution in Recommendation Systems: Overview, Techniques, and Future Directions

Bilibili Tech

Mar 26, 2024 · Frontend Development

Design and Implementation of the AutoMotion UI Automation Testing Platform

The AutoMotion platform streamlines UI automation by recording user actions through a Chrome extension, converting them into Cypress scripts, isolating test data in a sandbox, and employing LLM‑driven self‑healing selectors, while offering open‑API integration and scalable containerized execution for reliable, low‑maintenance testing.

CypressData SandboxLLM

0 likes · 27 min read

Design and Implementation of the AutoMotion UI Automation Testing Platform

Eric Tech Circle

Mar 24, 2024 · Artificial Intelligence

Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison

This guide compares Ollama and Hugging Face for running large language models locally, detailing API and local execution methods, installation steps, model selection, resource requirements, integration with AnythingLLM, container deployment, embedding and vector store setup, and practical observations on performance and limitations.

AnythingLLMDockerEmbedding

0 likes · 15 min read

Running Local LLMs: Ollama vs Hugging Face – A Hands‑On Comparison

AI Large Model Application Practice

Mar 22, 2024 · Artificial Intelligence

How to Build a Real‑Time AI‑Powered 3D Digital Human with Unreal Engine

This guide explains the architecture of an interactive digital‑human system, walks through 3D avatar creation with Unreal Engine, details the AI controller that combines ASR, LLM and TTS, and provides step‑by‑step instructions for deploying the open‑source Fay project.

AI AvatarASRDigital Human

0 likes · 14 min read

How to Build a Real‑Time AI‑Powered 3D Digital Human with Unreal Engine

Sohu Tech Products

Mar 20, 2024 · Artificial Intelligence

Comparison of Base LLM and Instruction Tuned LLM

The diagram contrasts a Base LLM, which merely predicts the next word from training data and can continue stories or answer simple facts but may generate unsafe text, with an Instruction‑Tuned LLM that is fine‑tuned via RLHF to understand and follow commands, delivering more accurate, useful, and safe responses.

AIAI applicationsBASE model

0 likes · 7 min read

Comparison of Base LLM and Instruction Tuned LLM

DeWu Technology

Mar 18, 2024 · Frontend Development

QCon Shanghai 2023: LLM-Powered Frontend Debugging, WebNN, AI-Native Development, and HarmonyOS Insights

QCon Shanghai 2023 highlighted LLM‑driven frontend debugging, the emerging WebNN API for accelerated browser inference, AI‑native UI patterns with evaluation‑driven development, LLM‑enhanced developer bots using RAG and fine‑tuning, and a HarmonyOS round‑table exploring ArkUI’s declarative framework and opportunities for frontend engineers.

AIHarmonyOSLLM

0 likes · 18 min read

QCon Shanghai 2023: LLM-Powered Frontend Debugging, WebNN, AI-Native Development, and HarmonyOS Insights

NewBeeNLP

Mar 18, 2024 · Artificial Intelligence

Mastering RAG and LLM Techniques: From Retrieval to Fine‑Tuning

This article provides a comprehensive technical guide on Retrieval‑Augmented Generation (RAG), open‑source large language models such as LLaMA, fine‑tuning methods, evaluation metrics, memory‑optimization tricks, and attention‑related optimizations for modern AI systems.

LLMLangChainMemory Optimization

0 likes · 19 min read

Mastering RAG and LLM Techniques: From Retrieval to Fine‑Tuning

Baobao Algorithm Notes

Mar 18, 2024 · Industry Insights

Inside the 2024 KDD Cup ShopBench Challenge: Tasks, Data, and Evaluation Metrics

The 2024 KDD Cup introduces the ShopBench benchmark, a large‑scale LLM competition that simulates real‑world online shopping with 57 tasks, over 20,000 questions, and multiple tracks covering concept understanding, knowledge reasoning, user‑behavior alignment, multilingual ability, and an all‑round track, all evaluated with task‑specific metrics and a hidden test set.

DatasetEvaluation MetricsKDD Cup

0 likes · 11 min read

Inside the 2024 KDD Cup ShopBench Challenge: Tasks, Data, and Evaluation Metrics

Baobao Algorithm Notes

Mar 17, 2024 · Artificial Intelligence

Why Role‑Playing LLMs Need More Than Assistant Fine‑Tuning

The article explains that current large language models lack true self‑awareness and act as assistants, so achieving convincing role‑playing behavior requires dedicated system prompts, specialized data, careful balance of continue pre‑training and general SFT, and evaluation methods to detect dissonance and preserve base capabilities.

AILLMPrompt engineering

0 likes · 19 min read

Why Role‑Playing LLMs Need More Than Assistant Fine‑Tuning

Bilibili Tech

Mar 15, 2024 · Artificial Intelligence

Hardware Resource Estimation and Bottleneck Analysis for Large Language Models (LLMs)

The article analyzes the compute, memory, and communication resources required to train and run large language models, quantifies bottlenecks such as the massive FLOP demand, terabyte‑scale GPU memory, and high‑bandwidth interconnect needs, and evaluates parallelism strategies and bandwidth estimates to guide hardware and software design for scaling LLMs.

AI InfrastructureHardwareLLM

0 likes · 53 min read

Hardware Resource Estimation and Bottleneck Analysis for Large Language Models (LLMs)

Sohu Tech Products

Mar 13, 2024 · Artificial Intelligence

Build a Minimal Retrieval‑Augmented Generation (Tiny‑RAG) from Scratch

This step‑by‑step guide explains how to implement a lightweight Retrieval‑Augmented Generation system—Tiny‑RAG—by creating embedding classes, loading and chunking documents, building a simple vector store, performing similarity search, and integrating a large language model for answer generation, complete with runnable Python code.

EmbeddingLLMPython

0 likes · 14 min read

Build a Minimal Retrieval‑Augmented Generation (Tiny‑RAG) from Scratch

Alipay Experience Technology

Mar 13, 2024 · Artificial Intelligence

Unlock LangChain: Build Powerful LLM Apps Like LEGO with Agents and Tools

This article explains how LangChain turns large language models into modular LEGO‑like building blocks, covering its core concepts, practical travel‑assistant and face‑recognition troubleshooting examples, and explores the rapid evolution of AI agents such as Gorilla, ToolLLaMa, MetaGPT and ChatDev.

LLMLangChaintool integration

0 likes · 43 min read

Unlock LangChain: Build Powerful LLM Apps Like LEGO with Agents and Tools

Efficient Ops

Mar 13, 2024 · Operations

Why Traditional Ops Stalls and How AI‑Driven Solutions Can Revitalize It

The article examines common operational pain points such as cumbersome release processes, lack of standardization, and weak security controls, then explores how AI‑powered SRE tools and automation can address these challenges and guide teams toward more efficient, standardized, and resilient operations.

AILLMSRE

0 likes · 9 min read

Why Traditional Ops Stalls and How AI‑Driven Solutions Can Revitalize It

AI Large Model Application Practice

Mar 12, 2024 · Artificial Intelligence

How to Build a Corrective RAG Agent with LangGraph: A Step‑by‑Step Guide

This article explains how to use LangGraph—a graph‑based extension of LangChain—to implement a corrective RAG (C‑RAG) pipeline that evaluates retrieved documents, rewrites queries when needed, performs web search, and generates accurate answers, complete with code snippets and a runnable example.

Corrective RAGLLMLangChain

0 likes · 14 min read

How to Build a Corrective RAG Agent with LangGraph: A Step‑by‑Step Guide

AntTech

Mar 11, 2024 · Artificial Intelligence

Can Small Language Models be Good Reasoners in Recommender Systems?

This article presents SLIM, a knowledge‑distillation framework that transfers the reasoning abilities of large language models to compact models for sequential recommendation, enhancing item representation, user profiling, and bias mitigation while achieving comparable performance with far lower computational resources.

AILLMefficiency

0 likes · 12 min read

Can Small Language Models be Good Reasoners in Recommender Systems?

Aikesheng Open Source Community

Mar 11, 2024 · Databases

Google Adds Vector Search to MySQL Service, Advancing LLM Capabilities

Google has introduced preview vector search to its Cloud SQL for MySQL service, positioning it ahead of Oracle, while industry analysts note the growing importance of vector capabilities for generative AI applications across major database platforms.

Artificial IntelligenceCloud SQLDatabase Trends

0 likes · 7 min read

Google Adds Vector Search to MySQL Service, Advancing LLM Capabilities

NewBeeNLP

Mar 10, 2024 · Industry Insights

What WWW'24 Papers Reveal About LLMs in Search & Recommendation

This overview summarizes six WWW 2024 industry papers that apply large language models to e‑commerce search, personalized query suggestion, article recommendation, collaborative filtering, and lifelong sequential behavior understanding, highlighting their methods, experimental results, deployment status, and emerging trends in LLM‑driven search and recommendation.

LLMWWW2024industry applications

0 likes · 16 min read

What WWW'24 Papers Reveal About LLMs in Search & Recommendation

Alibaba Cloud Native

Mar 9, 2024 · Cloud Computing

Deploy Google Gemma LLM on Alibaba Cloud Function Compute GPU with Low‑Cost Idle Mode

This guide shows how to quickly and cheaply deploy the open‑source Google Gemma large language model on Alibaba Cloud Function Compute GPU using the new idle‑billing mode, covering prerequisites, Docker image creation, function setup, idle reservation, testing, monitoring, and cost estimation.

Function ComputeGPUGemma

0 likes · 10 min read

Deploy Google Gemma LLM on Alibaba Cloud Function Compute GPU with Low‑Cost Idle Mode

NewBeeNLP

Mar 8, 2024 · Industry Insights

Why Building LLMs Is Like Buying a Hardware Lottery – Lessons from a Startup

The article recounts Yi Tay’s experience founding Reka and building large language models from scratch, highlighting the unpredictable quality of GPU clusters, the challenges of multi‑cluster orchestration, code‑base choices, and how startups must rely on fast, intuition‑driven experimentation to succeed.

Cluster ManagementGPUHardware

0 likes · 12 min read

Why Building LLMs Is Like Buying a Hardware Lottery – Lessons from a Startup

Sohu Tech Products

Mar 6, 2024 · Mobile Development

On‑Device Deployment of Large Language Models Using Sohu’s Hybrid AI Engine and GPT‑2

The article outlines how Sohu’s Hybrid AI Engine enables on‑device deployment of a distilled GPT‑2 model by converting it to TensorFlow Lite, detailing the setup, customization with Keras, inference workflow, and core SDK calls, and argues that this approach offers fast, private, and cost‑effective AI for mobile devices despite typical LLM constraints.

GPT-2Hybrid AIKeras

0 likes · 9 min read

On‑Device Deployment of Large Language Models Using Sohu’s Hybrid AI Engine and GPT‑2

Alibaba Cloud Developer

Mar 6, 2024 · Artificial Intelligence

Unlocking LangChain: Build Powerful LLM Apps Like LEGO with Real-World Examples

This article explains how LangChain simplifies building and integrating large language model applications by providing modular components such as models, prompts, indexes, tools, memory, chains, and agents, illustrated with practical use cases like travel assistants, face‑recognition troubleshooting, and multi‑agent workflows.

AI agentsLLMLangChain

0 likes · 44 min read

Unlocking LangChain: Build Powerful LLM Apps Like LEGO with Real-World Examples

21CTO

Mar 5, 2024 · Artificial Intelligence

Can Generative AI Replace Human Programmers? LLM Insights & Future of Coding

The article examines why large language models (LLMs) cannot fully replace human programmers, compares major models like Gemma, Code Llama, GPT‑4 and Claude, discusses trust and copyright concerns, and explores how smaller, specialized LLMs may shape the future of software development.

AI ethicsLLMcode-generation

0 likes · 7 min read

Can Generative AI Replace Human Programmers? LLM Insights & Future of Coding

JD Retail Technology

Mar 4, 2024 · Artificial Intelligence

How JD Retail Integrates LLMs with SFT, RAG, and AI Agents for Real-World Impact

This article examines JD Retail's end‑to‑end large language model framework that combines supervised fine‑tuning, retrieval‑augmented generation, and ReAct‑based AI agents to overcome retail‑specific challenges, improve model accuracy, reduce hallucinations, and enable autonomous multi‑step business workflows.

AI AgentArtificial IntelligenceLLM

0 likes · 20 min read

How JD Retail Integrates LLMs with SFT, RAG, and AI Agents for Real-World Impact

21CTO

Feb 29, 2024 · Artificial Intelligence

StarCoder2 Unveiled: Open-Source LLM That Outperforms Its Predecessor with Fewer Parameters

StarCoder2, the latest open-source large language model from ServiceNow, Hugging Face, and NVIDIA, offers three sizes—30B, 70B, and 150B parameters—delivering performance comparable to the original 150B StarCoder while being more efficient and freely accessible under the BigCode Open RAIL‑M license.

Artificial IntelligenceLLMStarCoder2

0 likes · 4 min read

StarCoder2 Unveiled: Open-Source LLM That Outperforms Its Predecessor with Fewer Parameters

Baobao Algorithm Notes

Feb 28, 2024 · Artificial Intelligence

Building a LLM‑Powered Theme‑Park Queue Planner with Baidu Qianfan AppBuilder

This article walks through using large language model agents to create a theme‑park queue planning assistant for Baidu's Qianfan competition, covering the problem definition, dynamic‑programming‑style solution, prompt engineering, Python code generation, and step‑by‑step deployment in AppBuilder.

AI AgentAppBuilderLLM

0 likes · 17 min read

Building a LLM‑Powered Theme‑Park Queue Planner with Baidu Qianfan AppBuilder

NewBeeNLP

Feb 27, 2024 · Artificial Intelligence

Boosting E‑Commerce AIGC with Knowledge Graphs: From Multimodal Inputs to Controlled LLMs

The article details how JD.com leverages domain‑specific and generic knowledge graphs to enhance multimodal product information, improve controlled text generation, and boost LLM performance for e‑commerce copywriting, covering model architecture, copy‑only mechanisms, token‑type encoding, experimental results, and practical deployment scenarios.

AIGCLLMMultimodal

0 likes · 23 min read

Boosting E‑Commerce AIGC with Knowledge Graphs: From Multimodal Inputs to Controlled LLMs

Alibaba Cloud Big Data AI Platform

Feb 27, 2024 · Artificial Intelligence

Build a Knowledge‑Enhanced LLM Chatbot with Alibaba Cloud PAI: A Step‑by‑Step RAG Guide

This comprehensive guide walks AI developers through building a Retrieval‑Augmented Generation (RAG) chatbot on Alibaba Cloud PAI, covering architecture, vector store setup, model deployment, knowledge ingestion, multi‑modal retrieval, fusion, re‑ranking, prompt design, and end‑to‑end configuration with code examples.

Alibaba CloudChatbotLLM

0 likes · 26 min read

Build a Knowledge‑Enhanced LLM Chatbot with Alibaba Cloud PAI: A Step‑by‑Step RAG Guide

DataFunTalk

Feb 26, 2024 · Artificial Intelligence

Large Language Model Empowered Recommendation Systems: Overview, Techniques, and Future Directions

With the rapid rise of ChatGPT and large language models, recommendation systems are undergoing a transformative shift, moving beyond traditional behavior‑based methods to leverage LLMs for improved generalization, representation, and prompt‑based learning, while addressing challenges such as scalability, interpretability, bias, and deployment costs.

AIGeneralizationLLM

0 likes · 19 min read

Large Language Model Empowered Recommendation Systems: Overview, Techniques, and Future Directions

AI Large Model Application Practice

Feb 23, 2024 · Artificial Intelligence

How to Build a Text‑to‑SQL Chatbot with Vanna’s Open‑Source RAG Framework

This guide explains Vanna, an open‑source Python RAG framework for Text2SQL, covering its core concepts, RAG‑based architecture, step‑by‑step model training, code examples for customization, and how to deploy a conversational database chatbot with a Flask web UI.

ChatbotLLMPython

0 likes · 11 min read

How to Build a Text‑to‑SQL Chatbot with Vanna’s Open‑Source RAG Framework

Ops Development & AI Practice

Feb 22, 2024 · Artificial Intelligence

Exploring GPT4All: Open-Source LLMs You Can Run Locally on Any Device

GPT4All, an open‑source LLM ecosystem from Nomic AI, lets users run and customize large language models locally on CPUs or GPUs, offering features like GGUF support, multi‑platform installers, API access, and community contribution guidelines, making it a versatile tool for AI enthusiasts and developers.

AIGPT4AllLLM

0 likes · 4 min read

Exploring GPT4All: Open-Source LLMs You Can Run Locally on Any Device

DaTaobao Tech

Feb 21, 2024 · Artificial Intelligence

An Overview of LangChain: Core Concepts and Practical Implementations

The article introduces LangChain as a framework that unifies LLM providers through model I/O, connects external data via retrievers, composes workflows with chains, maintains context with memory, and enables tool use through agents, and demonstrates Java examples for TongYi embeddings, a ChatGLM‑6B RetrievalQA chain, and discusses agent registration and micro‑service‑based agent factories.

EmbeddingLLMLangChain

0 likes · 9 min read

An Overview of LangChain: Core Concepts and Practical Implementations

Rare Earth Juejin Tech Community

Feb 21, 2024 · Artificial Intelligence

Building a Personal Blog Knowledge Base with Coze AI Bot Platform

This guide explains how to use the Coze no‑code AI Bot platform to create a personal blog knowledge base by configuring plugins, data sources, persistence, workflow nodes, prompt engineering, and publishing the bot for seamless AI‑assisted content retrieval.

AI botCozeKnowledge Base

0 likes · 12 min read

Building a Personal Blog Knowledge Base with Coze AI Bot Platform

21CTO

Feb 20, 2024 · Artificial Intelligence

Which LLM Dominates Coding? GPT‑4 vs CodeLlama vs Mixtral vs Gemini

This article presents a head‑to‑head evaluation of four leading large language models—GPT‑4, CodeLlama 70B, CodeLlama 7B, and Mixtral 8x7B—across eight coding‑related tasks, revealing GPT‑4 as the overall winner while highlighting the trade‑offs of smaller models and emerging competitors like Google Gemini.

AI EvaluationCodeLlamaCoding Assistant

0 likes · 9 min read

Which LLM Dominates Coding? GPT‑4 vs CodeLlama vs Mixtral vs Gemini

Alibaba Cloud Developer

Feb 20, 2024 · Artificial Intelligence

Boost LLM Inference Speed with KV‑Cache Reuse and Speculative Sampling

This article explains two production‑grade optimization techniques for large language model inference—KV‑cache reuse across multi‑turn dialogues and speculative sampling with a small draft model—detailing their design, implementation, and performance impact.

AIInference OptimizationKV cache

0 likes · 14 min read

Boost LLM Inference Speed with KV‑Cache Reuse and Speculative Sampling

DaTaobao Tech

Feb 19, 2024 · Artificial Intelligence

AI/ML Technology Articles Collection

This collection compiles technical articles that explore diverse AI/ML applications, from deploying large language models on MacBooks and building e‑commerce recommendation engines, to leveraging the LangChain framework, creating AIGC‑driven fashion solutions, and implementing Stable Diffusion for image generation.

AIAIGCDeployment

0 likes · 1 min read

DataFunTalk

Feb 19, 2024 · Artificial Intelligence

Large Language Model Inference Overview and Performance Optimizations

This article presents a comprehensive overview of large language model inference, detailing the prefill and decoding stages, key performance metrics such as throughput, latency and QPS, and a series of system-level optimizations—including pipeline parallelism, dynamic batching, specialized attention kernels, virtual memory allocation, KV‑cache quantization, and mixed‑precision strategies—to improve GPU utilization and overall inference efficiency.

GPULLMLatency

0 likes · 24 min read

Large Language Model Inference Overview and Performance Optimizations

Java Tech Enthusiast

Feb 16, 2024 · Artificial Intelligence

Google's Gemini 1.5: Breakthrough in Long-Context Understanding and Multimodal Capabilities

Google’s Gemini 1.5, a new multimodal Mixture‑of‑Experts model, supports up to a million‑token context (10 million internally), can understand text, video, audio and code, learns a new language from a single prompt, and is already being used by Samsung, Jasper and Quora, positioning it as a direct challenger to OpenAI’s flagship models.

Gemini 1.5Google AILLM

0 likes · 7 min read

Google's Gemini 1.5: Breakthrough in Long-Context Understanding and Multimodal Capabilities

AI Large Model Application Practice

Feb 15, 2024 · Artificial Intelligence

How Generative AI is Transforming RPA: Three Powerful Integration Scenarios

This article explores three key ways large language models and multimodal generative AI can enhance robotic process automation, from cognition‑boosted RPA and AI‑Agent collaboration to visual‑intelligent navigation, illustrating practical examples and future prospects for smarter digital workers.

AI AgentLLMRPA

0 likes · 12 min read

How Generative AI is Transforming RPA: Three Powerful Integration Scenarios

NewBeeNLP

Feb 11, 2024 · Industry Insights

What 2023 Taught Us About LLMs and AI‑Guided Optimization

The author reviews a year of rapid progress in large language models, highlighting breakthrough papers such as Positional Interpolation, StreamingLLM, Deja Vu, and RLCD, and discusses how AI‑guided optimization techniques like SurCo, LANCER, and GenCo are reshaping research and industry applications.

AI OptimizationLLMTransformers

0 likes · 13 min read

What 2023 Taught Us About LLMs and AI‑Guided Optimization

Rare Earth Juejin Tech Community

Feb 7, 2024 · Artificial Intelligence

Step-by-Step Guide to Building Multi‑Agent Applications with LangChain LangGraph in Google Colab

This tutorial walks through installing LangChain, LangGraph and related packages in Google Colab, configuring environment variables, defining search and Twitter‑writer tools, constructing a StateGraph workflow with supervisor logic, and executing a multi‑agent LLM pipeline using LangChain’s new multi‑agent capabilities.

AIGoogle ColabLLM

0 likes · 11 min read

Step-by-Step Guide to Building Multi‑Agent Applications with LangChain LangGraph in Google Colab

21CTO

Feb 4, 2024 · Artificial Intelligence

Running Large Language Models on Raspberry Pi with Ollama: A Step‑by‑Step Guide

This tutorial walks you through installing Ollama on a Raspberry Pi, exploring TinyLlama, Phi, and LLaVA models, and demonstrates how to run and interact with these LLMs locally, including hardware requirements and practical command examples.

AILLMOllama

0 likes · 5 min read

Running Large Language Models on Raspberry Pi with Ollama: A Step‑by‑Step Guide

DataFunSummit

Feb 3, 2024 · Artificial Intelligence

Practical Application of Large Language Models in MaShang Consumer Finance: From Model Building to Deployment

This article details how MaShang Consumer Finance leverages large language models for sales, collection, and customer service, covering company background, AI research achievements, model training infrastructure, data‑quality and compliance challenges, prompt engineering, inference acceleration, evaluation methods, and lessons learned from real‑world deployment.

Data QualityLLMModel Deployment

0 likes · 21 min read

Practical Application of Large Language Models in MaShang Consumer Finance: From Model Building to Deployment

NewBeeNLP

Feb 2, 2024 · Artificial Intelligence

ControlRec: Aligning LLMs with IDs to Boost Personalized Recommendations

ControlRec introduces heterogeneous feature matching and instruction contrastive learning to bridge the semantic gap between language models and discrete user/item IDs, enabling more effective personalized recommendation across multiple tasks such as rating prediction, sequential recommendation, and explanation generation.

ControlRecHeterogeneous Feature MatchingInstruction Contrast Learning

0 likes · 10 min read

ControlRec: Aligning LLMs with IDs to Boost Personalized Recommendations

Rare Earth Juejin Tech Community

Jan 28, 2024 · Artificial Intelligence

Building a Weibo Influencer Finder with LangChain and LLM

This article demonstrates how to use LangChain, LLMs, and SerpAPI to create a Weibo influencer‑search tool that extracts UID numbers, scrapes profile data, filters Chinese content, and prepares the information for automated marketing outreach.

AgentLLMLangChain

0 likes · 9 min read

Building a Weibo Influencer Finder with LangChain and LLM

Ctrip Technology

Jan 26, 2024 · Artificial Intelligence

Implementing Plugin Functionality for a Large Language Model Chatbot Using Function Calling and Asynchronous Execution

This article explains how Ctrip's security R&D team built a web‑based LLM chatbot with version‑2.0 features such as plugin support, function calling, synchronous and asynchronous execution, WebSocket/Socket.IO communication, and provides full Python code examples for defining and invoking plugins.

AIBackendFunction Calling

0 likes · 15 min read

Implementing Plugin Functionality for a Large Language Model Chatbot Using Function Calling and Asynchronous Execution

Rare Earth Juejin Tech Community

Jan 22, 2024 · Artificial Intelligence

Prompt Engineering and CAMEL: Role‑Playing AI Agents for Automated Prompt Generation

This article explains how Prompt Engineering combined with the CAMEL framework enables role‑playing AI agents to automatically generate and manage prompts, illustrates the concept with a stock‑trading example, and provides Python code using LangChain to build a marketing‑automation agent for a small business.

AI agentsCAMELInception Prompting

0 likes · 11 min read

Prompt Engineering and CAMEL: Role‑Playing AI Agents for Automated Prompt Generation

Rare Earth Juejin Tech Community

Jan 21, 2024 · Artificial Intelligence

Understanding Pretraining and Fine‑Tuning of Large Language Models: Methods, Resources, and Practical Applications

This article explains the concepts of pretraining and fine‑tuning for large language models, compares full‑parameter, LoRA and QLoRA approaches, discusses resource consumption, introduces the ModelScope SWIFT framework with code examples, and shows how fine‑tuning can improve data‑visualisation tasks while reducing token usage.

Data visualizationLLMLoRA

0 likes · 24 min read

Understanding Pretraining and Fine‑Tuning of Large Language Models: Methods, Resources, and Practical Applications

Rare Earth Juejin Tech Community

Jan 20, 2024 · Artificial Intelligence

Understanding LangChain Callback Mechanism, Custom Async Handlers, and Token Cost Management in Python

This article introduces LangChain's callback mechanism, demonstrates how to implement custom synchronous and asynchronous callbacks in Python, compares them with JavaScript async patterns, and shows how to monitor token usage and control costs using OpenAI callbacks.

LLMLangChainPython

0 likes · 10 min read

Understanding LangChain Callback Mechanism, Custom Async Handlers, and Token Cost Management in Python

AI Large Model Application Practice

Jan 18, 2024 · Operations

How to Build an RPA Bot with Robot Framework and Compare It to AI Agents

This article explains the fundamentals of Robotic Process Automation (RPA), compares RPA with BPA and AI Agents, and provides a step‑by‑step tutorial for building and running an RPA robot using Robot Framework and the open‑source RPA Framework, including full code examples.

AI AgentLLMRPA

0 likes · 14 min read

How to Build an RPA Bot with Robot Framework and Compare It to AI Agents

Bitu Technology

Jan 17, 2024 · Artificial Intelligence

Rosetta Stone: Scalable ID Mapping System for Tubi's Content Library Using LLMs and Embeddings

This article describes how Tubi built the Rosetta Stone system—a flexible ID mapping workflow that leverages large language models, embedding similarity ranking, and K‑nearest‑neighbors to unify and enrich metadata across a 200,000‑title library, improve content recommendation, and streamline operations.

Big DataLLMcontent ID mapping

0 likes · 10 min read

Rosetta Stone: Scalable ID Mapping System for Tubi's Content Library Using LLMs and Embeddings

Tencent Cloud Developer

Jan 16, 2024 · Frontend Development

Frontend Technology Review 2023 and Outlook 2024

The 2023 frontend review highlights TypeScript’s size and speed gains, ECMAScript 2023 features, evolving frameworks like React, Vue, Svelte, Angular and emerging Qwik, while Rust tooling, Bun, browser changes, AI‑driven low‑code, and WASM progress set the stage for 2024’s LLM‑powered, Rust‑centric, cross‑platform development.

BunD2CHarmonyOS

0 likes · 49 min read

Frontend Technology Review 2023 and Outlook 2024

Xiaohongshu Tech REDtech

Jan 12, 2024 · Artificial Intelligence

Negative Sample Assisted Distillation for Large Language Models

The AAAI‑2024 paper introduces a Negative Sample Assisted Distillation framework—comprising Negative Assistance Training, Negative Calibration Enhancement, and Adaptive Self‑Consistency—that leverages both correct and incorrect reasoning examples to train a compact LLaMA‑7B student, achieving up to 75.75 % accuracy gains over fine‑tuning on MATH and improving out‑of‑domain benchmarks.

LLMchain-of-thoughtknowledge distillation

0 likes · 13 min read

Negative Sample Assisted Distillation for Large Language Models

Data Thinking Notes

Jan 7, 2024 · Artificial Intelligence

Boost Text2SQL Accuracy with Retrieval‑Augmented Generation and LangChain

This article explains how Retrieval‑Augmented Generation (RAG) can improve LLM‑based Text2SQL conversion, covering RAG fundamentals, LangChain implementation steps, practical enhancements for SQL agents, and future directions for integrating domain knowledge.

AI agentsLLMLangChain

0 likes · 16 min read

Boost Text2SQL Accuracy with Retrieval‑Augmented Generation and LangChain

Rare Earth Juejin Tech Community

Jan 7, 2024 · Artificial Intelligence

A Comprehensive Guide to Generative AI Tools, Prompts, and Learning Resources

This article provides an extensive overview of generative AI concepts such as AIGC and AGI, evaluates various coding assistants and chat models, offers prompt engineering tips, and lists numerous free and paid AI tools and learning resources for developers and everyday users.

AIAIGCLLM

0 likes · 15 min read

A Comprehensive Guide to Generative AI Tools, Prompts, and Learning Resources

Baobao Algorithm Notes

Jan 6, 2024 · Artificial Intelligence

How to Pick the Best Fine‑Tuning Data for LLMs with the Nuggets Method

This article explains the Nuggets approach for selecting a high‑quality subset of annotated instructions to fine‑tune large language models, describing its three inputs, the gold‑score computation based on perplexity improvement, empirical results on Alpaca, and practical considerations such as task‑set design.

LLMNuggetsdata selection

0 likes · 7 min read

How to Pick the Best Fine‑Tuning Data for LLMs with the Nuggets Method

DaTaobao Tech

Jan 5, 2024 · Mobile Development

Edge Deployment and Performance Optimization of Large Language Models with MNN

The upgraded mnn‑llm framework adds a unified llm‑export pipeline, cross‑platform inference with tokenizers and disk‑embedding, and ARM‑focused linear‑layer optimizations—including SIMD, hand‑written assembly and 4‑bit quantization—that dramatically speed up prefilling and achieve real‑time LLM conversation on mobile devices within a 2 GB memory budget, outperforming llama.cpp, fastllm and mlc‑llm.

ARM CPULLMMNN

0 likes · 17 min read

Edge Deployment and Performance Optimization of Large Language Models with MNN

DataFunSummit

Jan 4, 2024 · Big Data

YY Live Business Metric Governance Practice

This presentation details YY Live’s data product team’s end‑to‑end business metric governance practice, covering problem background, analysis, governance objectives, multi‑team collaboration, implementation steps, achieved efficiencies, and future directions leveraging large language models.

Big DataData PlatformLLM

0 likes · 16 min read

YY Live Business Metric Governance Practice

Baobao Algorithm Notes

Jan 2, 2024 · Artificial Intelligence

Uncovering Mixtral‑8x7B: How MoE Experts Shape Performance and Training

This article analyses the Mixtral‑8x7B Mixture‑of‑Experts LLM, explains its gate‑driven 8‑expert architecture, presents a simplified PyTorch implementation, and reports a series of experiments that probe top‑2 gating during training, individual expert contributions, task‑specific pre‑training, the impact of expert count, and similarity with Mistral‑7B, ultimately offering hypotheses about its training pipeline.

LLMMixtralMixture of Experts

0 likes · 14 min read

Uncovering Mixtral‑8x7B: How MoE Experts Shape Performance and Training

Rare Earth Juejin Tech Community

Dec 29, 2023 · Artificial Intelligence

Overview of Major Benchmark Datasets for Evaluating Large Language Models

This article provides a comprehensive overview of major benchmark datasets—including CMMLU, MMLU, C‑Eval, GSM8K, Gaokao‑Bench, AGIEval, MATH, BBH, HumanEval, and MBPP—used to evaluate large language models' knowledge, reasoning, and coding abilities, and summarizes related leaderboards and evaluation tools.

Artificial IntelligenceDatasetLLM

0 likes · 14 min read

Overview of Major Benchmark Datasets for Evaluating Large Language Models

Huolala Tech

Dec 28, 2023 · Artificial Intelligence

How Huolala Built a Low‑Code LLM Platform to Accelerate AI Agent Deployment

Huolala created a visual, drag‑and‑drop LLM application platform that streamlines AI integration, reduces development costs, and enables rapid deployment of agents across marketing, invitation, advertising, and modeling scenarios, boosting efficiency by over 98% while cutting integration time from hours to minutes.

AIAgentLLM

0 likes · 13 min read

How Huolala Built a Low‑Code LLM Platform to Accelerate AI Agent Deployment

Alibaba Cloud Big Data AI Platform

Dec 28, 2023 · Big Data

How LLMs Can Revolutionize Data Warehouse ETL: From Push‑Pull to Stable Queries

This article explores the challenges of traditional data‑warehouse ETL, compares push and pull models, and presents an LLM‑driven architecture that generates both on‑demand SQL queries and streaming ETL code with automatic error‑feedback loops, dramatically improving cost, accuracy, and maintainability.

Big DataETLFlink

0 likes · 16 min read

How LLMs Can Revolutionize Data Warehouse ETL: From Push‑Pull to Stable Queries

Alibaba Cloud Native

Dec 27, 2023 · Cloud Computing

One‑Click Deployment of LLMs to Alibaba Cloud Function Compute with SwingDeploy

This guide explains how to quickly select a ModelScope open‑source LLM, deploy it to Alibaba Cloud Function Compute using the SwingDeploy one‑click feature, enable reserved idle billing, and evaluate the cost savings compared with traditional GPU provisioning.

Cost OptimizationFunction ComputeGPU

0 likes · 11 min read

One‑Click Deployment of LLMs to Alibaba Cloud Function Compute with SwingDeploy

DaTaobao Tech

Dec 27, 2023 · Artificial Intelligence

Deploying a Private LLM Knowledge Base on a MacBook

The guide walks through installing and quantizing the open‑source ChatGLM3‑6B model and the m3e‑base embedder on a MacBook, wrapping them with a FastAPI OpenAI‑compatible service, routing requests through a One‑API gateway, storing metadata in MongoDB and vectors in PostgreSQL pgvector, deploying FastGPT for RAG, ingesting data, and demonstrating 5‑7 second response times, while outlining future improvements.

ChatGLM3DeploymentFastAPI

0 likes · 23 min read

Deploying a Private LLM Knowledge Base on a MacBook

Rare Earth Juejin Tech Community

Dec 27, 2023 · Artificial Intelligence

Comprehensive Overview of Large Language Models: Capabilities, Limitations, Deployment, and Future Trends

This article provides a detailed examination of large language models, covering their underlying technologies, capabilities and constraints, model families, training processes, cloud and edge deployment challenges, agent architectures, and emerging trends, offering practical insights for developers, product managers, and researchers.

Artificial IntelligenceEdge ComputingLLM

0 likes · 43 min read

Comprehensive Overview of Large Language Models: Capabilities, Limitations, Deployment, and Future Trends

21CTO

Dec 15, 2023 · Artificial Intelligence

Why 2024 Will Be the Year of AI Engineers and LLM‑Driven Apps

The article outlines five major AI engineering trends for 2024—including the rise of AI engineers, evolving LLM tech stacks, open‑source large models, vector databases, and AI agents—highlighting how these shifts will reshape application development and industry competition.

2024 trendsAI EngineeringAI agents

0 likes · 9 min read

Why 2024 Will Be the Year of AI Engineers and LLM‑Driven Apps

DataFunSummit

Dec 15, 2023 · Artificial Intelligence

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

This article explores how large language models can be incorporated into recommender systems, discussing background challenges, specific integration points across the recommendation pipeline, practical implementation methods, experimental results, and future research directions, while highlighting industrial considerations and potential improvements.

Industrial ApplicationsLLMModel Fusion

0 likes · 20 min read

Integrating Large Language Models into Recommender Systems: Opportunities, Methods, and Challenges

Data Thinking Notes

Dec 12, 2023 · Artificial Intelligence

Boosting Text‑to‑SQL Accuracy with Prompt Engineering and LLMs

This article examines the challenges of LLM‑based Text‑to‑SQL such as hallucinations, data‑security risks, and user input errors, and presents prompt‑engineering strategies, fine‑tuning comparisons, prompt types, code examples, and experimental results to improve reliability and cost‑effectiveness.

Artificial IntelligenceLLMLangChain

0 likes · 15 min read

Boosting Text‑to‑SQL Accuracy with Prompt Engineering and LLMs

NetEase Cloud Music Tech Team

Dec 12, 2023 · Artificial Intelligence

How LangChain Powers AI Agents: Principles, Debugging, and Real‑World Optimizations

This article explains the concept of AI Agents in the large‑language‑model era, details LangChain's implementation mechanics, shares practical challenges and optimizations encountered by NetEase Cloud Music, and provides step‑by‑step code examples and performance insights for building robust AI Agents.

AI AgentLLMLangChain

0 likes · 20 min read

How LangChain Powers AI Agents: Principles, Debugging, and Real‑World Optimizations

AI Large Model Application Practice

Dec 12, 2023 · Artificial Intelligence

Boost Enterprise LLM Performance: Solving Common RAG Challenges

This article explains Retrieval‑Augmented Generation for enterprise LLMs, outlines four production‑grade problems, and presents practical solutions such as parent‑child chunking, multi‑vector and multi‑query retrieval, and context‑aware question refinement with concrete prompts and workflow diagrams.

LLMRAG

0 likes · 13 min read

Boost Enterprise LLM Performance: Solving Common RAG Challenges

Rare Earth Juejin Tech Community

Dec 8, 2023 · Artificial Intelligence

Simplifying Transformer Blocks: Removing Residual Connections, LayerNorm, and Other Components without Losing Performance

A recent ETH Zurich paper shows that standard Transformer blocks can be drastically simplified by removing residual connections, LayerNorm, projection and value parameters, and even MLP sub‑block components, achieving up to 16% fewer parameters and comparable training speed and downstream performance on both GPT‑style decoders and BERT models.

AIDeep LearningLLM

0 likes · 11 min read

Simplifying Transformer Blocks: Removing Residual Connections, LayerNorm, and Other Components without Losing Performance

Sohu Tech Products

Dec 6, 2023 · Databases

GPTuner: LLM-Driven PostgreSQL Knob Tuning

GPTuner, an LLM‑driven system for PostgreSQL knob tuning developed by researchers at Sichuan University, demonstrates that knowledge processing, parameter selection, search‑range optimization, and a two‑stage Bayesian framework each significantly improve performance, while costing roughly 880 000 GPT‑4 tokens (≈ $30) with reusable knowledge.

Ablation StudyDatabase TuningGPTuner

0 likes · 9 min read

GPTuner: LLM-Driven PostgreSQL Knob Tuning

DataFunTalk

Dec 6, 2023 · Artificial Intelligence

Distributed Training Techniques and Quantitative Analysis for Large Language Models (GPT‑175B)

This article presents a comprehensive overview of state‑of‑the‑art distributed training methods for large language models, using GPT‑175B as a case study to analyze memory, communication, and compute overheads, and to recommend practical optimization strategies such as tensor, pipeline, and sequence parallelism, ZeRO‑1 optimizer, and selective activation checkpointing.

Distributed TrainingGPU memory optimizationLLM

0 likes · 22 min read

Distributed Training Techniques and Quantitative Analysis for Large Language Models (GPT‑175B)

Rare Earth Juejin Tech Community

Dec 6, 2023 · Artificial Intelligence

Multi-Agent Research Overview, Open-Source Implementations, and Design Considerations

This article reviews the background of multi‑agent systems, compares major open‑source frameworks such as AutoGen, MetaGPT, AgentVerse, and XAgent, discusses design principles, collaboration strategies, and offers conclusions on LLM‑driven versus SOP‑driven approaches for building multi‑agent applications.

AIAgent FrameworkAutoGen

0 likes · 15 min read

Multi-Agent Research Overview, Open-Source Implementations, and Design Considerations

Alibaba Cloud Big Data AI Platform

Dec 5, 2023 · Artificial Intelligence

How to Efficiently Fine‑Tune Qwen LLMs on Alibaba Cloud PAI Lingjun

This guide walks you through setting up Alibaba Cloud PAI Lingjun resources, preparing Qwen‑7B/14B/72B models, preprocessing large‑scale WuDao data, configuring distributed training with Megatron‑LM, performing continued pre‑training and supervised fine‑tuning, and finally deploying the model as an online service via PAI‑EAS.

Alibaba CloudLLMMegatron-LM

0 likes · 27 min read

How to Efficiently Fine‑Tune Qwen LLMs on Alibaba Cloud PAI Lingjun

Huawei Cloud Developer Alliance

Nov 30, 2023 · Artificial Intelligence

Mastering LLM Text Generation: Decoding Methods Explained

This review of the recent MindSpore NLP public class walks through the fundamentals of large language model text generation, detailing deterministic decoding such as greedy and beam search, stochastic sampling techniques like temperature, top‑k and top‑p, and advanced methods including constrained beam, contrastive, and assisted search, with illustrative examples.

Beam SearchGreedy SearchLLM

0 likes · 5 min read

Mastering LLM Text Generation: Decoding Methods Explained

Rare Earth Juejin Tech Community

Nov 29, 2023 · Artificial Intelligence

Building a Private LLM‑Powered Knowledge Base with LangChain and ChatGLM3

This article explains how to migrate personal notes into a private knowledge base by combining a large language model with an external vector store, detailing the concepts of tokenization, embedding, vector databases, and step‑by‑step deployment using LangChain‑Chatchat and the open‑source ChatGLM3 model.

ChatGLM3EmbeddingKnowledge Base

0 likes · 10 min read

Building a Private LLM‑Powered Knowledge Base with LangChain and ChatGLM3

Data Thinking Notes

Nov 28, 2023 · Artificial Intelligence

Build a Text‑to‑SQL App with LangChain and OpenAI: Step‑by‑Step Guide

This article explains how to build a Text‑to‑SQL application using LangChain, OpenAI LLMs, and SQLDatabaseChain, covering the fundamentals of Text2SQL, LangChain components, code examples, and a practical SQLite case that transforms natural‑language questions into executable SQL queries.

LLMLangChainPython

0 likes · 12 min read

Build a Text‑to‑SQL App with LangChain and OpenAI: Step‑by‑Step Guide

Open Source Tech Hub

Nov 25, 2023 · Artificial Intelligence

How to Deploy FastGPT Locally with Docker Compose: A Step‑by‑Step Guide

This guide walks you through installing Docker, configuring Docker‑Compose, setting up FastGPT’s config files, launching the containers, and creating a private knowledge base to enable AI‑driven question answering on your own server.

AI deploymentDocker ComposeFastGPT

0 likes · 10 min read

How to Deploy FastGPT Locally with Docker Compose: A Step‑by‑Step Guide

DataFunSummit

Nov 20, 2023 · Artificial Intelligence

ModelScope Agents: Open‑Source LLM Agent Framework and Practical Guide

This article introduces ModelScope Agents, an open‑source LLM‑based agent framework that addresses limitations of GPT Store, outlines its features, provides installation and usage instructions, showcases a RPG game example, and invites the community to contribute to its roadmap.

AIAgent FrameworkLLM

0 likes · 7 min read

ModelScope Agents: Open‑Source LLM Agent Framework and Practical Guide

Alibaba Cloud Big Data AI Platform

Nov 16, 2023 · Artificial Intelligence

How Alibaba Cloud’s AI-Powered OpenSearch Boosts Search Accuracy and Cuts Costs

Alibaba Cloud unveiled AI-driven upgrades to its OpenSearch and Elasticsearch services, highlighting LLM‑based conversational search, three‑fold vector retrieval speed gains, and up to 70% cost reductions through serverless architectures and extensive performance optimizations.

ElasticsearchLLMOpenSearch

0 likes · 6 min read

How Alibaba Cloud’s AI-Powered OpenSearch Boosts Search Accuracy and Cuts Costs

JD Retail Technology

Nov 14, 2023 · Artificial Intelligence

An Overview of LangChain: Core Concepts, Components, and Practical Applications

This article introduces LangChain—a Python framework for building LLM‑driven applications—explains its core components such as models, indexes, chains, memory, and agents, and provides practical code examples for document summarization, retrieval‑augmented QA, and future development directions.

LLMLangChainPromptTemplate

0 likes · 19 min read

An Overview of LangChain: Core Concepts, Components, and Practical Applications

Baobao Algorithm Notes

Nov 13, 2023 · Artificial Intelligence

Mastering LLM Fundamentals: Tokenizers, Layer Norm, and PEFT Explained

This article provides a comprehensive technical guide on large language model fundamentals, covering tokenizer construction methods such as BPE, WordPiece, and SentencePiece, detailed explanations of Layer Normalization variants, Deep Norm concepts with code, and an overview of parameter‑efficient fine‑tuning techniques like LoRA and PEFT.

Artificial IntelligenceLLMLayer Normalization

0 likes · 36 min read

Mastering LLM Fundamentals: Tokenizers, Layer Norm, and PEFT Explained

Data Thinking Notes

Nov 12, 2023 · Artificial Intelligence

Unlocking LLM Power: Semantic Search, Private Knowledge Bases, and Text‑to‑SQL for Data Teams

This article explores how large language models can boost data workflows by using embeddings for semantic retrieval, building domain‑specific knowledge bases for private Q&A, generating SQL code from natural language, and automating exploratory data analysis, offering practical steps and visual examples.

EmbeddingKnowledge BaseLLM

0 likes · 7 min read

Unlocking LLM Power: Semantic Search, Private Knowledge Bases, and Text‑to‑SQL for Data Teams

Baobao Algorithm Notes

Nov 7, 2023 · Artificial Intelligence

A Complete Technical Guide to LLM Foundations, Advanced Topics, Fine‑Tuning, and LangChain Applications

This article provides an in‑depth technical overview of large language models (LLMs), covering core model families, architectural differences, emergent abilities, common challenges such as repetition and token limits, detailed fine‑tuning strategies including PEFT, practical guidance for training custom models, and a thorough introduction to the LangChain framework with code examples, core concepts, and troubleshooting tips for building LLM‑powered applications.

Fine-tuningLLMLangChain

0 likes · 97 min read

A Complete Technical Guide to LLM Foundations, Advanced Topics, Fine‑Tuning, and LangChain Applications

Huawei Cloud Developer Alliance

Nov 3, 2023 · Artificial Intelligence

Can LLMs Master Lifelong Learning? Exploring MoE and Continuous Adaptation

This article explains how large language models can achieve continual lifelong learning, outlines the key properties required, reviews mixture‑of‑experts (MoE) techniques—including sparse MoE, GShard, Switch Transformer, GLaM and PanGu‑Sigma—and discusses the remaining challenges such as model complexity, expert balancing and distributed communication overhead.

Artificial IntelligenceLLMLifelong Learning

0 likes · 9 min read

Can LLMs Master Lifelong Learning? Exploring MoE and Continuous Adaptation

JD Tech

Nov 2, 2023 · Artificial Intelligence

An Introduction to LangChain: Core Components, Usage Patterns, and Practical Code Examples

This article explains what LangChain is, outlines its core components such as Models, Indexes, Chains, Memory and Agents, and demonstrates how to build LLM‑driven applications with detailed Python code snippets, visual diagrams, and future development suggestions.

AI FrameworkLLMLangChain

0 likes · 20 min read

An Introduction to LangChain: Core Components, Usage Patterns, and Practical Code Examples

DataFunSummit

Nov 1, 2023 · Artificial Intelligence

Exploring Large Language Models for Recommendation Systems: Experiments and Insights

This article investigates how large language models can be applied to recommendation tasks, presenting two usage strategies, experimental evaluations on multiple datasets, comparisons with traditional baselines, and analyses of prompting methods, cost, and cold‑start performance.

Artificial IntelligenceLLMPrompt engineering

0 likes · 13 min read

DataFunSummit

Nov 1, 2023 · Artificial Intelligence

DataFunCon2023 Shenzhen: Program Overview and Session Highlights

DataFunCon2023 Shenzhen showcases a comprehensive program featuring expert talks on building Data+LLM applications, large-scale storage, cloud‑native architectures, metric systems, data governance, AB testing, and industry‑specific large language model use cases across finance, gaming, advertising, and more, providing valuable insights for practitioners and researchers alike.

@DataAIGCArtificial Intelligence

0 likes · 50 min read

DataFunCon2023 Shenzhen: Program Overview and Session Highlights

Software Development Quality

Oct 27, 2023 · Artificial Intelligence

TestAgent: Open-Source 7B LLM for Multi-Language Test Generation

TestAgent introduces an open-source 7B large language model tailored for software testing, offering multi‑language test case generation, automatic assert completion, and a lightweight engineering framework with quick‑start scripts, performance benchmarks, and deployment options for various hardware accelerators.

AI modelLLMMulti-language Generation

0 likes · 10 min read

TestAgent: Open-Source 7B LLM for Multi-Language Test Generation

JD Retail Technology

Oct 26, 2023 · Artificial Intelligence

Leveraging Large Language Models for Text-to-SQL: Prompt Design and End-to-End Pipeline

This article explains how large language models can be used to convert natural language queries into SQL statements, describes two main approaches—direct generation and fine‑tuned open‑source models—details prompt engineering techniques, and outlines an end‑to‑end pipeline that executes the generated SQL and summarizes results.

ChatGLMLLMPrompt engineering

0 likes · 7 min read

Leveraging Large Language Models for Text-to-SQL: Prompt Design and End-to-End Pipeline

Alibaba Cloud Native

Oct 24, 2023 · Cloud Native

Deploy a Qwen‑Powered AI Assistant on Alibaba Cloud Function Compute in 5 Minutes

This tutorial walks you through quickly setting up a Qwen‑based AI assistant on Alibaba Cloud Function Compute, covering prerequisite API‑key acquisition, deployment steps, password protection, and how to access the running service.

AICloud NativeFunction Compute

0 likes · 4 min read

Deploy a Qwen‑Powered AI Assistant on Alibaba Cloud Function Compute in 5 Minutes

Open Source Tech Hub

Oct 22, 2023 · Artificial Intelligence

How to Integrate Xunfei Starfire Cognitive Model into PHP Projects – Step-by-Step Guide

This guide walks you through the background of Xunfei's Starfire large language model, its 2.0 features, account setup, obtaining API credentials, cloning the example repository, installing dependencies, configuring keys, and troubleshooting common errors for PHP integration.

AIAPIIntegration

0 likes · 7 min read

How to Integrate Xunfei Starfire Cognitive Model into PHP Projects – Step-by-Step Guide

phodal

Oct 19, 2023 · Operations

Can LLMs Revolutionize Code Review? Inside AutoDev’s AI‑Powered Approach

The article examines how rising code volume and AI‑generated snippets challenge traditional code review, proposes an LLM‑assisted workflow using AutoDev and DevOpsGenius, details prompt design, commit filtering, and implementation steps, and discusses the benefits and limitations for different team roles.

AI automationCode reviewDevOps

0 likes · 9 min read

Can LLMs Revolutionize Code Review? Inside AutoDev’s AI‑Powered Approach