Tagged articles
55 articles
Page 1 of 1
Architect's Guide
Architect's Guide
May 7, 2026 · Artificial Intelligence

Spring AI 2.0 vs LangChain4j: Which Should You Choose?

The article provides a side‑by‑side analysis of Spring AI 2.0 and LangChain4j, comparing their goals, version alignment, programming models, RAG and agent capabilities, ecosystem integration, learning curve, and operational considerations to help Java teams decide which library best fits their project constraints.

AI agentsLLM integrationLangChain4j
0 likes · 11 min read
Spring AI 2.0 vs LangChain4j: Which Should You Choose?
java1234
java1234
Apr 24, 2026 · Artificial Intelligence

Choosing Between Spring AI 2.0 and LangChain4j for Java AI Development

This article compares Spring AI 2.0 and LangChain4j, examining their positioning, version alignment, architecture, programming model, RAG support, observability, learning curve, and ecosystem integration to help Java teams decide which library best fits their AI project constraints.

AI librariesLLM integrationLangChain4j
0 likes · 13 min read
Choosing Between Spring AI 2.0 and LangChain4j for Java AI Development
Geek Labs
Geek Labs
Apr 13, 2026 · Artificial Intelligence

Hermes Agent: The Fast‑Rising AI Framework You Should Learn Now

Hermes Agent, an open‑source AI framework released by Nous Research, introduces a built‑in self‑evolution loop, a three‑layer memory system, and multi‑platform support; the article reviews its core features, compares it with Claude Code/OpenClaw, and highlights two companion projects—the Orange Book guide and a Web UI monitoring dashboard.

AI AgentHermes AgentLLM integration
0 likes · 15 min read
Hermes Agent: The Fast‑Rising AI Framework You Should Learn Now
AI Architect Hub
AI Architect Hub
Apr 11, 2026 · Artificial Intelligence

10 Open‑Source AI Tools Every Developer Should Add to Their Toolkit

This article curates ten free, open‑source AI‑focused projects—from web‑crawlers and browser automation to audio transcription, video downloading, and persistent memory—explaining their core capabilities, typical use cases, and how they can be integrated into developer workflows to boost productivity.

AI toolsLLM integrationdeveloper productivity
0 likes · 11 min read
10 Open‑Source AI Tools Every Developer Should Add to Their Toolkit
AI Explorer
AI Explorer
Apr 6, 2026 · Artificial Intelligence

Goose: An Open‑Source AI Agent That Automates Full‑Stack Development Tasks

Goose is an open‑source, on‑machine AI agent written in Rust that can autonomously handle end‑to‑end development workflows—from project scaffolding and API coding to testing and debugging—by orchestrating multiple LLMs via a Model Context Protocol, offering a full‑stack engineering partner for developers and teams.

AI AgentFull-Stack DevelopmentLLM integration
0 likes · 8 min read
Goose: An Open‑Source AI Agent That Automates Full‑Stack Development Tasks
PaperAgent
PaperAgent
Apr 4, 2026 · Artificial Intelligence

Accelerate Research 10× with Academic-Search: Open‑Source AI Literature Retrieval

Academic‑Search is an open‑source AI‑powered literature retrieval skill that unifies multi‑platform search, deduplication, citation tracking, BibTeX export, PDF download, and code completion, dramatically accelerating research workflows by up to ten times while integrating smoothly with agents like AutoGPT and LangChain.

AI literature searchLLM integrationPython
0 likes · 10 min read
Accelerate Research 10× with Academic-Search: Open‑Source AI Literature Retrieval
大转转FE
大转转FE
Mar 30, 2026 · Industry Insights

5 Cutting‑Edge AI Agent & AICoding Analyses Shaping Enterprise Development

This newsletter curates five in‑depth industry analyses covering Claude‑driven AICoding engineering, large‑model integration in e‑commerce data warehouses, AI agent identity‑permission governance, a step‑by‑step AI agent construction guide, and Tair‑based short‑term memory architecture for millisecond‑level response.

AI CodingAI agentsEnterprise AI
0 likes · 6 min read
5 Cutting‑Edge AI Agent & AICoding Analyses Shaping Enterprise Development
AI Explorer
AI Explorer
Mar 26, 2026 · Artificial Intelligence

LiteLLM: A Unified Gateway for Over 100 LLM APIs

LiteLLM provides a single Python SDK and proxy server that let developers call more than 100 large‑language‑model APIs with a uniform OpenAI‑style interface, handling cost tracking, load balancing, rate limiting, and detailed logging to simplify multi‑model experimentation and production deployments.

AI gatewayLLM integrationLiteLLM
0 likes · 6 min read
LiteLLM: A Unified Gateway for Over 100 LLM APIs
AI Waka
AI Waka
Mar 26, 2026 · Artificial Intelligence

When to Use MCP vs. Skills with Claude: A Practical Decision Framework

This article analyzes the trade‑offs between Model Context Protocol (MCP) and Skills for Claude‑based AI workflows, offering a three‑question test, real‑world examples, and architectural diagrams to help engineers choose the most efficient approach for their tasks.

AI workflowClaudeLLM integration
0 likes · 10 min read
When to Use MCP vs. Skills with Claude: A Practical Decision Framework
AI Waka
AI Waka
Mar 25, 2026 · Artificial Intelligence

How OpenClaw Turns Your Machine into an Autonomous AI Agent Runtime

OpenClaw is an open‑source, OS‑level autonomous agent runtime that combines dynamic system prompts, powerful tool access, file‑based memory, and sub‑agent generation, offering a secure, extensible architecture that runs on a single Node.js process and integrates with any LLM provider.

Autonomous AgentsLLM integrationMemory Management
0 likes · 19 min read
How OpenClaw Turns Your Machine into an Autonomous AI Agent Runtime
JD Cloud Developers
JD Cloud Developers
Mar 23, 2026 · Artificial Intelligence

OpenClaw Deep Dive: Turning LLMs into Actionable AI Agents

This article provides a comprehensive technical analysis of OpenClaw, an open‑source autonomous‑agent framework that integrates large language models with local system operations through a four‑layer architecture, detailed message‑processing steps, ReAct reasoning loops, security mechanisms, performance optimizations, and real‑world application scenarios.

AI agentsAutonomous AgentsLLM integration
0 likes · 13 min read
OpenClaw Deep Dive: Turning LLMs into Actionable AI Agents
AI Engineering
AI Engineering
Feb 21, 2026 · Artificial Intelligence

Why Pi-mono Powers OpenClaw: A Minimalist AI Coding Assistant

Pi-mono is a four‑tool, four‑layer AI coding assistant built by Mario Zechner that replaces bloated agents with a minimalist design, supports dozens of LLM providers, offers a terminal UI, extensible TypeScript plugins, and demonstrates superior benchmark performance in Terminal‑Bench.

AI coding assistantAgent FrameworkLLM integration
0 likes · 13 min read
Why Pi-mono Powers OpenClaw: A Minimalist AI Coding Assistant
Code Mala Tang
Code Mala Tang
Feb 20, 2026 · Artificial Intelligence

How to Integrate Claude Code with Ollama for Local and Cloud LLM Workflows

This guide walks you through installing Claude Code and Ollama, pulling and configuring various open‑source models, setting environment variables, and running Claude Code with both local and cloud‑hosted models, while covering context length, performance considerations, and tool‑calling examples.

Claude CodeEnvironment VariablesLLM integration
0 likes · 14 min read
How to Integrate Claude Code with Ollama for Local and Cloud LLM Workflows
Old Zhang's AI Learning
Old Zhang's AI Learning
Feb 15, 2026 · Artificial Intelligence

PicoClaw: Ultra‑Light OpenClaw Fork Runs a Full AI Agent on $10 Hardware in <1 s

PicoClaw is a Go‑rewritten, 99% smaller version of OpenClaw that fits under 10 MB, boots in under a second on a 0.6 GHz CPU, runs on $10‑$100 single‑board computers, offers full‑stack engineer mode, task scheduling, sandbox security, and multi‑platform chat integration, with detailed installation guides and a balanced list of strengths and current limitations.

AI AgentGoLLM integration
0 likes · 11 min read
PicoClaw: Ultra‑Light OpenClaw Fork Runs a Full AI Agent on $10 Hardware in <1 s
Fun with Large Models
Fun with Large Models
Feb 3, 2026 · Artificial Intelligence

OpenClaw Core Features and Architecture Explained (with LLM & Chat Integration)

OpenClaw, a 2026 open‑source AI agent, combines a protocol‑agnostic gateway, a four‑layer memory system, and an extensible Agent Skills framework to enable real‑time cross‑platform interaction, long‑term learning, and seamless integration with Chinese large language models and popular chat applications.

AI AgentAgent SkillsChat Tool Integration
0 likes · 11 min read
OpenClaw Core Features and Architecture Explained (with LLM & Chat Integration)
AI Waka
AI Waka
Jan 24, 2026 · Artificial Intelligence

2026 Agentic AI Roadmap: How to Build Autonomous AI Agents

This comprehensive 2026 roadmap outlines the essential programming foundations, core agent architectures, LLM and API integrations, tool usage, memory management, RAG systems, deployment strategies, monitoring, and security practices needed to design, develop, and operate autonomous AI agents.

AI roadmapAgentic AIAutonomous Agents
0 likes · 10 min read
2026 Agentic AI Roadmap: How to Build Autonomous AI Agents
Tech Minimalism
Tech Minimalism
Jan 17, 2026 · Artificial Intelligence

Master OpenCode and Oh My OpenCode: The Ultimate Open‑Source AI Coding Assistant Guide

This guide walks you through installing and configuring OpenCode—a free, open‑source AI programming assistant supporting 75+ LLM providers—and its Oh My OpenCode plugin suite that adds specialized parallel agents, automatic model switching, and rich CLI commands, showing how they transform real‑world development workflows.

AI coding assistantCLI toolsLLM integration
0 likes · 15 min read
Master OpenCode and Oh My OpenCode: The Ultimate Open‑Source AI Coding Assistant Guide
IT Services Circle
IT Services Circle
Dec 6, 2025 · Artificial Intelligence

Chat‑Powered Diagram Creation: Exploring the next‑ai‑draw‑io Tool

This article introduces next‑ai‑draw‑io, an open‑source AI extension for Draw.io that lets developers generate and edit architecture diagrams through natural‑language prompts, supports image‑to‑diagram conversion, multiple LLM providers, and offers easy installation via Next.js or Vercel.

AI diagram generationDraw.ioLLM integration
0 likes · 6 min read
Chat‑Powered Diagram Creation: Exploring the next‑ai‑draw‑io Tool
Baidu Geek Talk
Baidu Geek Talk
Sep 22, 2025 · Databases

How BaikalDB’s MCP Server Turns LLMs into Zero‑Code Database Analysts

This article explains how BaikalDB leverages the Model Context Protocol to let large language models query and analyze its distributed database without writing any code, detailing the protocol’s principles, server components, transport methods, tool capabilities, use cases, configuration, and future challenges.

BaikalDBLLM integrationMCP
0 likes · 11 min read
How BaikalDB’s MCP Server Turns LLMs into Zero‑Code Database Analysts
Senior Tony
Senior Tony
Sep 16, 2025 · Artificial Intelligence

What Is MCP? Exploring the AI‑LLM Interaction Protocol

MCP, a protocol from Anbhropic, standardizes how large language models communicate with external tools, databases, and APIs through a client‑server architecture, offering three communication modes (Stdio, HTTP with SSE, Streamable HTTP) and enabling use cases such as intelligent analytics, knowledge hubs, AI chatbots, BPM, API integration, automated testing, and programming assistance.

AI protocolLLM integrationMCP
0 likes · 9 min read
What Is MCP? Exploring the AI‑LLM Interaction Protocol
IT Services Circle
IT Services Circle
Sep 1, 2025 · Artificial Intelligence

Unlocking Gemini CLI: Extending Google’s AI Agent for Any LLM

This article introduces the rapidly popular Gemini CLI, compares it with Claude Code, explains its core features, demonstrates coding, multimodal, and MCP use cases, and details the author’s Easy LLM CLI fork that enables custom model integration, flexible configuration, and direct code embedding for developers.

AI AgentGemini CLILLM integration
0 likes · 15 min read
Unlocking Gemini CLI: Extending Google’s AI Agent for Any LLM
DaTaobao Tech
DaTaobao Tech
Aug 20, 2025 · Artificial Intelligence

How Cursor’s Coding Agent Works: Deep Dive into Its Architecture and Real‑World Experiments

This article examines the Cursor coding‑assistant by dissecting its backend architecture, running three practical experiments (a Go hello‑world program, a CUDA flash‑attention code search, and a single‑page to‑do web app), and analyzing why the tool succeeds or fails in real development scenarios.

Coding AgentCursorLLM integration
0 likes · 9 min read
How Cursor’s Coding Agent Works: Deep Dive into Its Architecture and Real‑World Experiments
AI Large Model Application Practice
AI Large Model Application Practice
Jul 16, 2025 · Artificial Intelligence

Unlocking LLM Integration: A Deep Dive into MCP, A2A, and AG‑UI Protocols

This article introduces three emerging standards—MCP, A2A, and AG‑UI—that simplify connecting large language models to external tools, other agents, and user interfaces, explaining their origins, architectures, development workflows, key features, and how they complement each other in AI application development.

A2AAG-UIAI protocols
0 likes · 14 min read
Unlocking LLM Integration: A Deep Dive into MCP, A2A, and AG‑UI Protocols
Network Intelligence Research Center (NIRC)
Network Intelligence Research Center (NIRC)
Jun 25, 2025 · Artificial Intelligence

Hands‑On Review of OpenManus: An Open‑Source AI Agent Framework

The article provides a detailed walkthrough of OpenManus, an open‑source AI agent framework built with Python 3.12, covering its modular architecture, agent and tool hierarchies, a step‑by‑step usage example for generating a Beijing travel plan, and a deployment guide on Alibaba Cloud.

AI agentsAlibaba Cloud deploymentLLM integration
0 likes · 10 min read
Hands‑On Review of OpenManus: An Open‑Source AI Agent Framework
JD Cloud Developers
JD Cloud Developers
Jun 19, 2025 · Artificial Intelligence

Quickly Master MCP: Build a Python SSE Server and Client in Minutes

This guide introduces the Model Context Protocol (MCP), explains its purpose as a standardized USB‑C‑like interface for LLMs, and provides step‑by‑step Python code to set up SSE‑based MCP servers and a client, plus essential installation details and execution commands.

AI protocolLLM integrationMCP
0 likes · 11 min read
Quickly Master MCP: Build a Python SSE Server and Client in Minutes
Architect's Alchemy Furnace
Architect's Alchemy Furnace
May 18, 2025 · Artificial Intelligence

A2A vs MCP: Are Google’s Agent2Agent and Anthropic’s Protocol Complementary?

Google’s newly released Agent2Agent (A2A) protocol and Anthropic’s Model Context Protocol (MCP) are examined side‑by‑side, outlining their purposes, complementary features, potential competition, and how they together shape the future of multi‑agent systems, security, task management, and integration with legacy data sources.

A2AAI protocolsLLM integration
0 likes · 17 min read
A2A vs MCP: Are Google’s Agent2Agent and Anthropic’s Protocol Complementary?
Mingyi World Elasticsearch
Mingyi World Elasticsearch
May 10, 2025 · Artificial Intelligence

Understanding MCP: From Its Origins to AI’s Universal Interface

MCP (Model Context Protocol) is an emerging open‑standard that lets large language models interact with tools, data sources, and services through a unified JSON‑RPC or gRPC interface, addressing fragmentation, data silos, and low development efficiency while offering a flexible, community‑driven ecosystem with clear advantages and current limitations.

AI tool callingJSON-RPCLLM integration
0 likes · 14 min read
Understanding MCP: From Its Origins to AI’s Universal Interface
37 Interactive Technology Team
37 Interactive Technology Team
May 9, 2025 · Backend Development

Engineering an MCP (Model Context Protocol) Tool for Cursor AI: Step-by-Step Guide

This guide walks developers through creating a reusable MCP (Model Context Protocol) tool for Cursor AI—encapsulating the SDK, configuring files, implementing PSD‑slicing logic, defining JSON schemas, adding utilities and build scripts, and deploying it as a command‑type server while avoiding common integration pitfalls.

Cursor AILLM integrationMCP
0 likes · 7 min read
Engineering an MCP (Model Context Protocol) Tool for Cursor AI: Step-by-Step Guide
DaTaobao Tech
DaTaobao Tech
Apr 30, 2025 · Artificial Intelligence

Model Context Protocol (MCP): A USB‑like Standard for Connecting Large Language Models to External Resources

The Model Context Protocol (MCP) is an open, USB‑like standard that lets large language models securely connect to external data sources, tools, and services through a client‑server architecture, enabling developers to integrate diverse resources with standardized SDKs, fostering rapid, scalable AI‑enhanced applications across many domains.

AI StandardsLLM integrationMCP
0 likes · 18 min read
Model Context Protocol (MCP): A USB‑like Standard for Connecting Large Language Models to External Resources
Alibaba Cloud Developer
Alibaba Cloud Developer
Apr 24, 2025 · Artificial Intelligence

How agents.json Empowers AI Agents to Seamlessly Call APIs

This article explains the agents.json specification, its OpenAPI foundation, how it differs from MCP and Google A2A, and demonstrates how AI agents can load, interpret, and execute multi‑step API flows using code examples and schema illustrations.

AI agentsAPI flowsLLM integration
0 likes · 16 min read
How agents.json Empowers AI Agents to Seamlessly Call APIs
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Apr 4, 2025 · Backend Development

Implementing a Model Context Protocol (MCP) Service with APISIX and Node.js

This article explains what Model Context Protocol (MCP) is, demonstrates a simple e‑commerce example, and provides a step‑by‑step guide to create, configure, build, and debug an MCP service using APISIX Admin API, Node.js, TypeScript, and Zod, enabling natural‑language interaction with backend APIs.

API automationAPISIXLLM integration
0 likes · 15 min read
Implementing a Model Context Protocol (MCP) Service with APISIX and Node.js
Architect
Architect
Apr 2, 2025 · Artificial Intelligence

Connecting LLMs to External Tools with Anthropic’s Model Context Protocol (MCP)

This article explains the open‑source Model Context Protocol (MCP) created by Anthropic, describes its client‑server architecture for safely linking LLMs with external data sources and tools, and provides a complete step‑by‑step Python tutorial—including environment setup, server and client code—to demonstrate MCP in action.

AI agentsLLM integrationLangChain
0 likes · 9 min read
Connecting LLMs to External Tools with Anthropic’s Model Context Protocol (MCP)
Architect
Architect
Mar 27, 2025 · Artificial Intelligence

How to Use Anthropic’s Model Context Protocol for Seamless LLM Integration

This article explains Anthropic’s open‑source Model Context Protocol (MCP), its client‑server architecture, resource and tool definitions, sampling workflow, and provides step‑by‑step Python examples for building a PoE2 hot‑fix fetcher and a simple chatbot that leverages MCP to connect large language models with external data sources and functions.

AI toolsLLM integrationMCP
0 likes · 14 min read
How to Use Anthropic’s Model Context Protocol for Seamless LLM Integration
Tencent Cloud Developer
Tencent Cloud Developer
Mar 26, 2025 · Artificial Intelligence

Introduction to Anthropic's Model Context Protocol (MCP) with Example Implementations

The article presents Anthropic’s open‑source Model Context Protocol (MCP) – a client‑server framework that standardizes how large language models securely access resources, prompts, and tools (the “HTTP of AI”) – and demonstrates its use through a hot‑fix scraper and a dynamic chatbot that discovers and invokes tools via JSON‑formatted calls.

AI protocolsLLM integrationMCP
0 likes · 15 min read
Introduction to Anthropic's Model Context Protocol (MCP) with Example Implementations
Sohu Tech Products
Sohu Tech Products
Mar 19, 2025 · Artificial Intelligence

Debugging MCP Service Registration Issues in Cursor IDE

The article details how the author discovered a bug in Cursor’s closed‑source IDE where correctly registered MCP services, such as a weather query tool, are omitted from the LLM’s tool list, preventing calls, and explains the debugging steps that compared Cursor with the open‑source Cline implementation to pinpoint the registration flaw.

AI debuggingCursorLLM integration
0 likes · 12 min read
Debugging MCP Service Registration Issues in Cursor IDE
AI Algorithm Path
AI Algorithm Path
Mar 19, 2025 · Artificial Intelligence

What Is the Rapidly Growing Model Context Protocol (MCP)?

The article explains how the Model Context Protocol (MCP) addresses the difficulty of connecting large language models to external data, tools, and APIs by providing an open, standardized interface that enables AI agents to access real‑time information, act autonomously, and do so securely and modularly.

AI agentsAI tool interoperabilityLLM integration
0 likes · 7 min read
What Is the Rapidly Growing Model Context Protocol (MCP)?
Ma Wei Says
Ma Wei Says
Mar 15, 2025 · Artificial Intelligence

Understanding Model Context Protocol (MCP) vs. Function Calling

The Model Context Protocol (MCP), announced by Anthropic, standardizes how AI applications provide context to LLMs, offering a client‑server architecture that simplifies data and tool integration, and is compared with function calling, highlighting its benefits, workflow, controversies, and future prospects.

AI ArchitectureAnthropicFunction Calling
0 likes · 9 min read
Understanding Model Context Protocol (MCP) vs. Function Calling
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 10, 2025 · Artificial Intelligence

Seamlessly Switch Between DeepSeek‑R1 and QwQ‑32B with Higress AI Gateway

Learn how to deploy the new QwQ‑32B inference model alongside DeepSeek‑R1 using the Higress AI gateway, covering environment setup, model configuration, routing, token‑level rate limiting, content safety, semantic caching, and advanced features like automatic fallback and internet‑search integration.

DeepSeekHigressLLM integration
0 likes · 16 min read
Seamlessly Switch Between DeepSeek‑R1 and QwQ‑32B with Higress AI Gateway
Alibaba Cloud Developer
Alibaba Cloud Developer
Mar 7, 2025 · Artificial Intelligence

Build an MCP Client & Server: Step‑by‑Step Guide to Anthropic’s Model Context Protocol

This tutorial explains how to use Anthropic's open‑source Model Context Protocol (MCP) to connect large language models with external data sources by building both MCP servers and clients in Node.js/TypeScript, covering architecture, transport options, SDK usage, and practical code examples.

Function CallingLLM integrationMCP
0 likes · 16 min read
Build an MCP Client & Server: Step‑by‑Step Guide to Anthropic’s Model Context Protocol
Java Web Project
Java Web Project
Feb 12, 2025 · Backend Development

How to Connect DeepSeek LLM to a WeChat Public Account in 7 Steps

This step‑by‑step guide shows how to obtain a DeepSeek API key, set up an Alibaba Cloud ECS instance, configure the WeChat public platform, clone the open‑source COW project, edit its JSON configuration, and run the service so that a WeChat public account can interact with the DeepSeek large language model.

API keyBackend tutorialDeepSeek
0 likes · 12 min read
How to Connect DeepSeek LLM to a WeChat Public Account in 7 Steps
Youzan Coder
Youzan Coder
Nov 13, 2024 · Big Data

How a Unified Metric Service Transforms Data Queries with Headless BI

Facing inconsistent metrics and low reuse in siloed data services, the team built a unified metric service using a headless BI semantic layer and virtual data models, enabling consistent metric definitions, reusable data models, AI-friendly queries, and faster, scalable reporting across the organization.

Big DataHeadless BILLM integration
0 likes · 17 min read
How a Unified Metric Service Transforms Data Queries with Headless BI
Ant R&D Efficiency
Ant R&D Efficiency
Sep 20, 2024 · Artificial Intelligence

Building a Cursor‑like AI Native IDE with OpenSumi and CodeFuse – A Step‑by‑Step Guide

This guide shows how to create a Cursor‑style AI‑native IDE by forking the open‑source CodeFuse project built on the extensible OpenSumi framework, configuring a large‑language model, and packaging an Electron app, while explaining Cursor’s advantages over plugin‑based tools and the strategic benefits of a fully integrated AI development environment.

AI IDECodeFuseCursor
0 likes · 17 min read
Building a Cursor‑like AI Native IDE with OpenSumi and CodeFuse – A Step‑by‑Step Guide
Tencent Cloud Developer
Tencent Cloud Developer
Jun 14, 2024 · Artificial Intelligence

GPT-4o Speech Multimodal Technology: Speech Tokenization, LLM Integration, and Zero-shot TTS

GPT‑4o’s speech multimodal system discretizes audio into semantic and acoustic tokens, integrates these tokens with large language models through multi‑stage instruction tuning, and employs hierarchical zero‑shot text‑to‑speech decoding, enabling low‑latency, streaming, and prompt‑driven voice synthesis for applications like gaming.

AudioLMGPT-4oLLM integration
0 likes · 33 min read
GPT-4o Speech Multimodal Technology: Speech Tokenization, LLM Integration, and Zero-shot TTS
DeWu Technology
DeWu Technology
Jan 22, 2024 · Artificial Intelligence

How to Integrate Business Systems with LLMs: Prompt, RAG, and Fine‑Tuning Strategies

This article outlines three practical approaches—direct prompting, retrieval‑augmented generation (RAG), and fine‑tuning—to connect enterprise applications to large language models, explains key prompt‑engineering techniques, details RAG workflow and vector‑database integration, and provides step‑by‑step guidance for fine‑tuning on the KubeAI platform.

AI for businessFine-tuningKubeAI
0 likes · 20 min read
How to Integrate Business Systems with LLMs: Prompt, RAG, and Fine‑Tuning Strategies
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jan 12, 2024 · Artificial Intelligence

Understanding Vector Databases, ANN Algorithms, and Their Integration with Large Language Models

This article explains the fundamentals of vector databases, how high‑dimensional vector data is generated and stored, reviews common ANN search algorithms such as Flat, k‑means and LSH, discusses benchmarking and product selection, and demonstrates practical integration of vector stores with LLMs using LangChain and Python code.

ANNLLM integrationPython
0 likes · 17 min read
Understanding Vector Databases, ANN Algorithms, and Their Integration with Large Language Models
JD Cloud Developers
JD Cloud Developers
Jan 10, 2024 · Artificial Intelligence

Boosting Elasticsearch with Generative AI: Relevance Engine & Vector Search

This article explores the rise of generative AI, outlines popular models like ChatGPT, DALL‑E, and Google Bard, examines their limitations, and then delves into Elasticsearch’s Relevance Engine and vector capabilities, demonstrating how to store, index, and query dense embeddings with practical code examples.

ElasticsearchLLM integrationgenerative AI
0 likes · 17 min read
Boosting Elasticsearch with Generative AI: Relevance Engine & Vector Search
Sohu Tech Products
Sohu Tech Products
Nov 1, 2023 · Databases

Engineering Practices of Douyin's Vector Database: From Retrieval Challenges to Cloud‑Native Solutions

Douyin tackled vector‑retrieval challenges by optimizing HNSW and creating a high‑performance IVF algorithm, implementing custom scalar quantization, SIMD acceleration, and a DSL‑driven engine that merges filtering with search, then built a cloud‑native, storage‑compute‑separated vector database (VikingDB) delivering sub‑10 ms latency, real‑time updates, multi‑tenant support, and secure, scalable retrieval for LLM‑driven applications.

ANNLLM integrationStorage Compute Separation
0 likes · 18 min read
Engineering Practices of Douyin's Vector Database: From Retrieval Challenges to Cloud‑Native Solutions
phodal
phodal
Jul 23, 2023 · Artificial Intelligence

How AutoDev Boosts Coding Efficiency with AI-Powered IDE Integration

AutoDev is an AI‑assisted JetBrains IDE plugin that connects to issue trackers, automates prompt generation, enforces code standards, supports multiple languages and LLM providers, and offers smart prompt strategies, aiming to reduce developer interaction costs while improving generated code quality.

AI CodingAutoDevLLM integration
0 likes · 9 min read
How AutoDev Boosts Coding Efficiency with AI-Powered IDE Integration
21CTO
21CTO
Jul 11, 2023 · Artificial Intelligence

What Is Microsoft’s Semantic Kernel and How It Powers AI‑First Applications

Microsoft’s newly released Semantic Kernel (SK) is an open‑source SDK that blends large language models with traditional programming languages, offering features like prompt chaining, recursive reasoning, memory management, and planner integration, while outlining a roadmap that includes vector database support and Azure Cognitive Search.

AI SDKLLM integrationSemantic Kernel
0 likes · 5 min read
What Is Microsoft’s Semantic Kernel and How It Powers AI‑First Applications
ByteFE
ByteFE
Apr 3, 2023 · Artificial Intelligence

Mastering Prompt Engineering: Building a CLI Command Translator with the ChatGPT API

This comprehensive tutorial demonstrates how to leverage the official ChatGPT API and advanced prompt engineering techniques to develop a command-line tool that translates natural language instructions into precise shell commands, covering iterative prompt design, constraint implementation, and Node.js integration.

AI toolsCLI DevelopmentChatGPT API
0 likes · 15 min read
Mastering Prompt Engineering: Building a CLI Command Translator with the ChatGPT API