Tagged articles
32 articles
Page 1 of 1
Geek Labs
Geek Labs
May 7, 2026 · Backend Development

DS2API: Turning DeepSeek into an OpenAI‑Compatible API

DS2API is an open‑source Go‑based service that converts DeepSeek’s web interface into OpenAI, Claude, and Gemini compatible APIs, offering multi‑API support, account pool management, long‑history handling, PoW verification, and a React admin UI, with simple Docker deployment.

API compatibilityDS2APIDeepSeek
0 likes · 4 min read
DS2API: Turning DeepSeek into an OpenAI‑Compatible API
CodeTrend
CodeTrend
Apr 29, 2026 · Artificial Intelligence

qwen2API: Turning Qwen Web Chat into OpenAI, Claude, and Gemini Compatible APIs

The qwen2API project offers a FastAPI backend and React+Vite frontend that expose the Qwen web chat as OpenAI Chat Completions, Anthropic Messages, and Gemini GenerateContent interfaces, featuring tool calling, image generation, account pool management, multiple deployment options, and various execution engines.

AnthropicFastAPIGemini
0 likes · 6 min read
qwen2API: Turning Qwen Web Chat into OpenAI, Claude, and Gemini Compatible APIs
SuanNi
SuanNi
Apr 13, 2026 · Artificial Intelligence

Deploy Qwen3 8B Model with vLLM: Step‑by‑Step Guide for Remote Inference

This guide walks you through deploying Alibaba’s open‑source Qwen‑3 8B model on the SumW platform using vLLM, covering environment activation, server launch with OpenAI‑compatible parameters, SSH tunneling for remote access, and Python client calls, while highlighting key configuration tips and common pitfalls.

Model DeploymentOpenAI APIPython SDK
0 likes · 6 min read
Deploy Qwen3 8B Model with vLLM: Step‑by‑Step Guide for Remote Inference
AI Cyberspace
AI Cyberspace
Mar 10, 2026 · Artificial Intelligence

Mastering Prompt Engineering: Techniques to Guide LLMs Effectively

This article explains the fundamentals of prompt engineering for large language models, covering LLM output configuration, length and sampling controls, various prompt types, chain‑of‑thought and tree‑of‑thought reasoning methods, and practical best‑practice guidelines for creating high‑quality prompts.

AI Prompt DesignFew‑Shot LearningLLM
0 likes · 18 min read
Mastering Prompt Engineering: Techniques to Guide LLMs Effectively
Fun with Large Models
Fun with Large Models
Jan 18, 2026 · Artificial Intelligence

Step‑by‑Step Guide to Deploying Large Language Models Locally with VLLM and Ollama

This article walks through two mainstream local deployment solutions—high‑performance VLLM for production Linux servers and lightweight Ollama for personal Windows machines—covering environment setup, model download, server launch, API testing, key configuration parameters, and the quantization technique that makes Ollama models compact.

GPU OptimizationModel QuantizationOllama
0 likes · 18 min read
Step‑by‑Step Guide to Deploying Large Language Models Locally with VLLM and Ollama
Ops Community
Ops Community
Dec 28, 2025 · Artificial Intelligence

Boost LLM Inference Speed: Build a High‑Concurrency vLLM Service with Best‑Practice Ops

This guide walks through the complete process of deploying a high‑throughput large language model inference service using vLLM, covering environment preparation, installation, configuration tuning, performance testing, real‑world case studies, monitoring, troubleshooting, and backup strategies for production‑grade deployments.

DeploymentGPU OptimizationLLM inference
0 likes · 44 min read
Boost LLM Inference Speed: Build a High‑Concurrency vLLM Service with Best‑Practice Ops
Frontend AI Walk
Frontend AI Walk
Dec 14, 2025 · Artificial Intelligence

Advanced AI Agent Skills: Behind the Scenes and a Developer’s Guide

This article explains the origins of AI Agent Skills, walks through the standard tool‑use loop, provides step‑by‑step code for defining and invoking Skills with the OpenAI API, compares Skills with the emerging Model Context Protocol (MCP), and offers practical guidance for developers and architects building AI‑enabled systems.

AI Agent SkillsFunction CallingMCP
0 likes · 11 min read
Advanced AI Agent Skills: Behind the Scenes and a Developer’s Guide
Instant Consumer Technology Team
Instant Consumer Technology Team
Nov 5, 2025 · Artificial Intelligence

Why AI Agents Fail: 70% Failure Rate & How Interleaved Thinking Improves Reliability

Recent CMU and Salesforce studies reveal that top‑tier AI agents like Gemini 2.5 Pro, Claude 3.7 Sonnet and GPT‑4o fail in 69‑70% of multi‑step tasks, but MiniMax‑M2’s Interleaved Thinking reduces failure dramatically, highlighting that execution mechanisms, not model size, are key to reliable AI agents.

BenchmarkOpen-source modelsOpenAI API
0 likes · 17 min read
Why AI Agents Fail: 70% Failure Rate & How Interleaved Thinking Improves Reliability
Raymond Ops
Raymond Ops
Nov 4, 2025 · Artificial Intelligence

How to Deploy GPUStack with Docker for Scalable AI Model Serving

This guide walks you through installing NVIDIA drivers and Docker, configuring the NVIDIA Container Toolkit, and deploying GPUStack in Docker to manage heterogeneous GPU resources, run large language, multimodal, diffusion, and embedding models, and scale from a single node to a multi‑node GPU cluster.

AI Model DeploymentDockerGPU cluster
0 likes · 15 min read
How to Deploy GPUStack with Docker for Scalable AI Model Serving
Eric Tech Circle
Eric Tech Circle
Sep 10, 2025 · Artificial Intelligence

Deploy High‑Performance Local LLMs with vLLM: A Step‑by‑Step Guide

This article walks through installing and configuring vLLM for local large language model inference, compares it with Ollama and LM Studio, details environment setup, model download, testing scripts, and shows how to expose an OpenAI‑compatible API for production use.

Inference OptimizationModelScopeOpenAI API
0 likes · 11 min read
Deploy High‑Performance Local LLMs with vLLM: A Step‑by‑Step Guide
MaGe Linux Operations
MaGe Linux Operations
Jun 3, 2025 · Artificial Intelligence

How to Deploy GPUStack with Docker for Scalable AI Model Serving

This guide walks you through installing NVIDIA drivers, Docker, and the NVIDIA Container Toolkit, then shows step‑by‑step how to run GPUStack in Docker, expand a GPU cluster, and serve large language, multimodal, diffusion, and embedding models with OpenAI‑compatible APIs.

AI Model DeploymentDockerGPU cluster
0 likes · 15 min read
How to Deploy GPUStack with Docker for Scalable AI Model Serving
21CTO
21CTO
May 8, 2025 · Artificial Intelligence

Build a Simple ChatGPT Bot with Python and OpenAI API in Minutes

This tutorial walks you through registering on OpenAI, installing required Python packages, setting up API keys, and writing a Jupyter Notebook that defines a chat function and an interactive loop to create a functional GPT‑3.5 chatbot.

ChatGPTChatbotGPT-3.5
0 likes · 7 min read
Build a Simple ChatGPT Bot with Python and OpenAI API in Minutes
DevOps
DevOps
Jan 6, 2025 · Artificial Intelligence

Ten Popular Large Language Model Deployment Engines and Tools: Features, Advantages, and Limitations

This article reviews ten mainstream LLM deployment solutions—including WebLLM, LM Studio, Ollama, vLLM, LightLLM, OpenLLM, HuggingFace TGI, GPT4ALL, llama.cpp, and Triton Inference Server—detailing their technical characteristics, strengths, drawbacks, and example deployment workflows for both personal and enterprise environments.

AI inferenceGPU AccelerationLLM
0 likes · 16 min read
Ten Popular Large Language Model Deployment Engines and Tools: Features, Advantages, and Limitations
CSS Magic
CSS Magic
Nov 8, 2024 · Artificial Intelligence

LLM Application Development Tips (3): Exploring LLM API Inputs and Outputs

This article explains how to configure key OpenAI chat completion parameters—such as temperature, top_p, streaming, response format, and tool selection—and walks through the structure of the API's JSON response, highlighting fields like id, model, choices, finish_reason, and usage for better control and cost estimation.

AI agentsAPI parametersJSON response
0 likes · 8 min read
LLM Application Development Tips (3): Exploring LLM API Inputs and Outputs
CSS Magic
CSS Magic
Nov 1, 2024 · Artificial Intelligence

Refining System Prompts for LLMs: Practical Tips for Batch Automation

This article explains how to automate batch document processing with LLM APIs by mastering the messages parameter, defining system, user, and assistant roles, and iteratively polishing system prompts through scripts or OpenAI's GPTs editor and Playground interfaces.

AutomationChatGPTLLM
0 likes · 7 min read
Refining System Prompts for LLMs: Practical Tips for Batch Automation
Java High-Performance Architecture
Java High-Performance Architecture
Mar 11, 2024 · Backend Development

Build a Spring Boot App that Calls OpenAI’s ChatGPT API

This tutorial walks through creating a Spring Boot application that integrates the OpenAI ChatGPT API, covering prompt concepts, API endpoint details, required request parameters, project setup, necessary dependencies, DTO definitions, RestTemplate configuration, and a REST controller to generate AI‑driven responses.

Backend DevelopmentChatGPTJava
0 likes · 10 min read
Build a Spring Boot App that Calls OpenAI’s ChatGPT API
php Courses
php Courses
Oct 24, 2023 · Artificial Intelligence

How to Build a ChatGPT-Powered Smart Chatbot Using PHP

This article explains step-by-step how to develop an intelligent ChatGPT-powered chatbot with PHP, covering environment setup, installing the PHP ChatGPT library, writing code to send messages and receive responses via the OpenAI API, and testing the bot with example scripts.

AI chatbotChatGPTOpenAI API
0 likes · 4 min read
How to Build a ChatGPT-Powered Smart Chatbot Using PHP
21CTO
21CTO
Aug 1, 2023 · Artificial Intelligence

How to Set Up and Master CodeGPT in VS Code for AI-Powered Coding

This guide walks you through installing the CodeGPT extension in Visual Studio Code, configuring your OpenAI API key, and using its features—code generation, explanation, refactoring, and unit test creation—to boost productivity with AI-assisted programming.

AI CodingCode GenerationCodeGPT
0 likes · 8 min read
How to Set Up and Master CodeGPT in VS Code for AI-Powered Coding
21CTO
21CTO
Jun 2, 2023 · Artificial Intelligence

Build a ChatGPT-Powered Python Chatbot: Step-by-Step Guide

This tutorial walks developers through setting up an OpenAI account, obtaining an API key, configuring the Python environment, and writing a complete Python script that uses the ChatGPT API to create an interactive chatbot, complete with code examples and execution tips.

ChatGPTChatbotOpenAI API
0 likes · 11 min read
Build a ChatGPT-Powered Python Chatbot: Step-by-Step Guide
Continuous Delivery 2.0
Continuous Delivery 2.0
May 16, 2023 · Artificial Intelligence

How to Write Effective Prompts and Use the OpenAI Python API

This article explains practical principles and techniques for crafting clear, specific prompts for ChatGPT, demonstrates how to structure prompts with separators and output formats, and provides a step‑by‑step guide—including code examples—for installing the OpenAI Python library, obtaining an API key, and writing a reusable get_completion function to interact with the model.

AIChatGPTCode Examples
0 likes · 9 min read
How to Write Effective Prompts and Use the OpenAI Python API
Java Architect Essentials
Java Architect Essentials
May 8, 2023 · Frontend Development

Eliminate ChatGPT “Something Went Wrong” Errors with the KeepChatGPT Chrome Extension

ChatGPT’s web interface has been increasingly showing “Something went wrong” errors after idle periods, and this guide explains two remedies: using an OpenAI API key for more stable access, and installing the KeepChatGPT Chrome/Tampermonkey extension, which suppresses the error and adds several usability enhancements.

ChatGPTChrome ExtensionError Fix
0 likes · 6 min read
Eliminate ChatGPT “Something Went Wrong” Errors with the KeepChatGPT Chrome Extension
Java High-Performance Architecture
Java High-Performance Architecture
Mar 23, 2023 · Artificial Intelligence

Build Your Own ChatGPT-Powered Chatbot with JavaScript and Spring Boot

This guide walks you through creating a personal ChatGPT chatbot by first setting up an OpenAI account and API key, then demonstrating how to call the completions endpoint using plain JavaScript and HTML, followed by integrating the same API into a Spring Boot application with the official Java client library, including all necessary code snippets and configuration steps.

ChatGPTJavaScriptOpenAI API
0 likes · 9 min read
Build Your Own ChatGPT-Powered Chatbot with JavaScript and Spring Boot