Collection size: 100 articles
Tencent Technical Engineering
Jun 16, 2025 · Artificial Intelligence

Mastering RAG and AI Agents: Practical Tips, Code Samples, and Evaluation Strategies

This comprehensive guide walks you through the fundamentals of Retrieval‑Augmented Generation (RAG) and AI agents, explains their inner workings, shares optimization tricks, provides ready‑to‑run code snippets, and demonstrates how to evaluate performance with metrics such as recall, faithfulness, and answer relevance.

AI agents · Evaluation · LLM
36 min read
Sohu Tech Products
Apr 1, 2026 · Artificial Intelligence

Build a Code‑Repository Q&A Agent Skill for OpenCode: From Installation to Custom Prompt Design

This guide walks you through creating an Agent Skill that turns OpenCode into a code‑repository expert, covering OpenCode installation, skill‑creator setup, DeepWiki integration, SKILL.md design, disambiguation, hallucination safeguards, and practical examples for querying Ascend inference ecosystem repositories.

AI · Agent · DeepWiki
26 min read
SuanNi
Mar 11, 2026 · Artificial Intelligence

How Gemini Embedding 2 Gives AI True Five‑Senses Perception

Google's Gemini Embedding 2 unifies text, image, video, audio, and document processing into a single multimodal embedding space, offering massive token capacity, multilingual support, and interleaved input, which dramatically improves retrieval speed, recall, and the quality of AI‑generated content across diverse applications.

Gemini Embedding 2 · Multimodal AI · Unified Embedding Space
9 min read
Old Zhang's AI Learning
Feb 16, 2026 · Artificial Intelligence

Qwen3.5 Deep Dive: Multimodal Architecture, Benchmarks, and Deployment Guide

This article provides a detailed analysis of Qwen3.5, covering its multimodal MoE design, massive inference speedups, extensive benchmark results against GPT‑5.2, Claude 4.5 Opus and Gemini‑3 Pro, RL scaling strategies, training infrastructure innovations, and practical usage via API and local deployment.

FP8 training · Large Language Model · Multimodal AI
13 min read
DeWu Technology
Jan 22, 2024 · Artificial Intelligence

How to Integrate Business Systems with LLMs: Prompt, RAG, and Fine‑Tuning Strategies

This article outlines three practical approaches—direct prompting, retrieval‑augmented generation (RAG), and fine‑tuning—to connect enterprise applications to large language models, explains key prompt‑engineering techniques, details RAG workflow and vector‑database integration, and provides step‑by‑step guidance for fine‑tuning on the KubeAI platform.

AI for business · Fine-tuning · KubeAI
20 min read
Python Programming Learning Circle
Apr 3, 2023 · Artificial Intelligence

Key Highlights of GPT‑4: Multimodal Capabilities, Benchmark Performance, and Future Implications

GPT‑4, the new multimodal AI model, can process images and text, generate code and natural language, achieve human‑level scores on standardized exams, handle up to 32 K tokens, and demonstrate advanced reasoning, while OpenAI emphasizes its safety improvements and current limitations as a still‑emerging technology.

AI safety · GPT-4 · Large Language Model
6 min read
DataFunTalk
Sep 10, 2025 · Artificial Intelligence

Why RAG is Evolving: From Retrieval to Integrated Reasoning, Memory, and Multimodal AI

This article explores how Retrieval‑Augmented Generation (RAG) is transitioning from basic retrieve‑and‑generate pipelines to a unified architecture that incorporates reasoning chains, agent layers, knowledge graphs, Monte‑Carlo Tree Search, reinforcement learning, sophisticated memory management, and multimodal tensor‑based retrieval, while addressing engineering challenges such as storage expansion, re‑ranking, and index dimensionality.

AI reasoning · RAG · Retrieval-Augmented Generation
19 min read
Sohu Tech Products
Mar 19, 2026 · Artificial Intelligence

Testing GLM‑5 Turbo: From AutoClaw Integration to a Browser‑Based War3 Clone

This article walks through a hands‑on evaluation of the GLM‑5 Turbo model, detailing its integration with AutoClaw for rapid Feishu bot deployment, comparing its performance against a baseline model on OpenClaw data‑dashboard tasks, and showcasing a fully client‑side War3‑style RTS built in a single HTML file.

AI evaluation · Agent Engine · AutoClaw
23 min read
Qborfy AI
Apr 14, 2026 · Artificial Intelligence

How to Give AI a Map, Not an Encyclopedia: Mastering Context Engineering

This article explains why AI can only act on information that appears in its context window, outlines the twin problems of missing data and overload, and presents a practical methodology—including AGENTS.md maps and LangChain's LocalContextMiddleware implementation—to engineer concise, searchable context for reliable agent behavior.

AI · Context Engineering · LangChain
29 min read
Alibaba Cloud Native
Mar 19, 2025 · Artificial Intelligence

Mastering Retrieval‑Augmented Generation with Spring AI: A Complete Guide

This article explains the Retrieval‑Augmented Generation (RAG) paradigm, walks through its four core steps, and provides a detailed Spring AI implementation—including configuration, vector storage, REST controller, multi‑query expansion, query rewriting, document joining, and error handling—plus best‑practice recommendations for production deployments.

AI · Java · RAG
23 min read
Instant Consumer Technology Team
Dec 5, 2025 · Artificial Intelligence

Transform Complex Prompts into Reusable AI Skills and Hook DeepSeek into Claude Code

This article explains how to replace cumbersome, city‑specific prompt strings with modular AI Skills, demonstrates the food‑diorama‑skill that generates 3D gourmet dioramas, and provides a step‑by‑step guide for connecting the DeepSeek V3.2 model to Claude Code using environment variables or the CC Switch GUI.

AI · Claude · DeepSeek
8 min read
Machine Learning Algorithms & Natural Language Processing
Apr 17, 2026 · Artificial Intelligence

Claude Opus 4.7’s Visual and Long‑Context Leap: Near‑Full Vision and 1M‑Token Tasks Redefine Knowledge Work

Claude Opus 4.7, announced as Anthropic’s most capable publicly available model, dramatically improves visual reasoning, long‑context task handling and instruction following, delivering up to a 2.4‑fold boost on benchmarks such as XBOW, SWE‑bench and structural biology, while also introducing new security guardrails and token‑usage costs.

AI benchmarks · Anthropic · Claude Opus 4.7
11 min read
Sohu Tech Products
May 15, 2024 · Artificial Intelligence

OpenAI Assistants API Quickstart Project for Next.js

OpenAI’s open‑source openai‑assistants‑quickstart project shows how to integrate the Assistants API into a Next.js app, offering streaming chat, code‑interpreter, file‑search, and function‑calling tools, and provides step‑by‑step setup instructions so developers can quickly build and customize AI assistants.

AI Assistant · Assistants API · Function Calling
4 min read
Java Architect Essentials
Mar 7, 2025 · Artificial Intelligence

Introducing DeepSeek4j 1.4: A Java Spring Boot Integration for DeepSeek AI with Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java Spring Boot library that overcomes existing framework limitations by preserving DeepSeek's chain‑of‑thought capabilities, adding full reactive streaming, and providing a simple one‑line API along with quick‑start instructions and code examples.

AI integration · DeepSeek · Java
5 min read
Alibaba Cloud Infrastructure
Apr 13, 2026 · Artificial Intelligence

How to Speed Up Bulk Vector Searches with CLI and SDK Concurrency

This guide explains how to dramatically reduce latency for batch semantic search, RAG multi‑path retrieval, and multimodal vector queries by running multiple OSS Vectors embed requests in parallel, using CLI‑level techniques (xargs and shell background jobs), Python asyncio, and SDK‑level concurrency.

CLI · Go · OSS
21 min read
HyperAI Super Neural
Oct 27, 2025 · Artificial Intelligence

Weekly AI Paper Digest: New OCR Model, Multimodal LLM, Next‑Gen DNA Sequencing

This week’s AI roundup highlights five recent papers: DeepSeek‑OCR’s context‑compression model for large‑scale data generation, Rex‑Omni’s 3‑billion‑parameter multimodal LLM achieving state‑of‑the‑art object perception, Alpha‑Service’s proactive AI‑glass framework, a bias‑variance approach to narrowing cross‑lingual gaps, and GATK’s MapReduce‑based toolkit for next‑generation DNA sequencing.

AI Glasses · Cross-lingual NLP · DNA Sequencing
6 min read
AI Algorithm Path
Mar 3, 2025 · Artificial Intelligence

DeepSeek‑R1 Model Performance: Comparing 32B, 70B, and R1

This article evaluates DeepSeek‑R1’s 32B and 70B distilled models alongside the original R1 on a range of reasoning and coding tasks, detailing hardware setup, test methodology, per‑task results, and a comparative analysis of their strengths and weaknesses.

32B · 70B · DeepSeek
6 min read
Old Meng AI Explorer
Apr 23, 2026 · Artificial Intelligence

GLM-5.1 vs Qwen3.6 Plus vs MiniMax M2.7: In‑Depth 2026 Review of China’s Top AI Models

This article provides a detailed, data‑driven comparison of three 2026 Chinese flagship large language models—GLM-5.1, Qwen3.6 Plus, and MiniMax M2.7—covering knowledge, math, code, long‑task, multimodal performance, pricing, open‑source status, ecosystem support, and scenario‑based recommendations.

GLM-5.1 · Large Language Model · MiniMax M2.7
12 min read
HelloTech
Apr 10, 2024 · Artificial Intelligence

An Overview of LangChain: Architecture, Core Components, and Code Examples

LangChain is an open‑source framework that provides Python and JavaScript SDKs, templates, and services such as LangServe and LangSmith to compose models, embeddings, prompts, indexes, memory, chains, and agents via a concise expression language, enabling rapid prototyping, debugging, and deployment of LLM‑driven applications.

AI engineering · JavaScript · LLM
19 min read