Collection size

96 articles

Page 4 of 5

May 25, 2026 · Artificial Intelligence

Practical Agent Performance Tuning: Slash Latency 75%, Cut Token Costs 71%, Boost Throughput 217%

The article walks through a systematic performance map of LangChain agents and demonstrates concrete latency, token‑usage, and concurrency optimizations—streaming responses, Redis caching, model routing, prompt trimming, context summarisation, dynamic tool selection, parallel graph nodes and batch processing—showing real‑world gains of up to 75% lower latency, 71% fewer tokens and a 217% throughput increase.

Agent Optimization · Concurrency · LangChain

0 likes · 30 min read

Java Web Project

Mar 11, 2025 · Artificial Intelligence

Claude 3.7 Sonnet: How the Hybrid Reasoning Model Redefines AI‑Assisted Coding

Claude 3.7 Sonnet, billed as the world’s first hybrid‑reasoning model, dramatically boosts code generation, supports fast‑response and extended‑thinking modes, and demonstrates real‑world UI reconstruction, game creation, and physics simulation, while its companion Claude Code tool automates complex engineering tasks and large‑codebase integration.

AI code generation · Automation · Claude 3.7

0 likes · 6 min read

James' Growth Diary

Apr 26, 2026 · Databases

Vector Database Fundamentals: Embedding, Similarity Search, and Index Structures Explained in One Go

This article walks through the complete workflow of turning split text into high‑dimensional vectors, choosing the right embedding model, selecting an appropriate similarity metric, comparing index structures such as Flat, IVF, HNSW and PQ, and finally picking a vector database and integrating it with LangChain.js for production‑grade RAG pipelines.

LangChain · RAG · embeddings

0 likes · 25 min read

Design Hub

Jan 15, 2026 · Artificial Intelligence

Hands‑On Guide to Mastering Claude Agentic Skills + Full Resource List

This article walks designers through creating and installing Claude Agentic Skills, explains the required YAML metadata and Markdown commands, shows step‑by‑step commands for building a code‑explaining skill, and provides a curated list of useful Skill repositories.

AI assistants · Claude · GitHub

0 likes · 9 min read

AI Architecture Path

Jun 5, 2026 · Artificial Intelligence

Supermemory Tops Three Authority Benchmarks, Solving AI Forgetting

Supermemory, the open‑source AI memory engine, eliminates repeated forgetting by offering a zero‑configuration, multi‑modal memory layer that tops the LongMemEval, LoCoMo and ConvoMem benchmarks and integrates automatic learning, mixed RAG‑Memory search, built‑in connectors, privacy tags, and multiple deployment options ranging from a no‑code web app to local offline versions.

AI memory · RAG · Supermemory

0 likes · 14 min read

Old Meng AI Explorer

Apr 19, 2026 · Artificial Intelligence

How to Access Alibaba’s Free Qwen3.6 Plus LLM and Compare It to Global Rivals

Qwen3.6 Plus, Alibaba’s new multimodal LLM, offers a million‑token context window, top‑tier coding scores and free access via OpenRouter, Alibaba Cloud Bailian, or Qiniu, with step‑by‑step setup, code examples, and a performance comparison against Claude Opus, GPT‑5 and other leading models.

AI coding · Free API · LLM

0 likes · 11 min read

Su San Talks Tech

Apr 20, 2026 · Artificial Intelligence

Master Spring AI: From Hello World to Advanced RAG, Tool Calling, and Agent Development

This step‑by‑step guide shows Java developers how to set up Spring AI, configure various model providers, build basic and streaming chat APIs, enable multi‑turn memory, implement RAG with vector stores, add tool‑calling and multimodal capabilities, integrate MCP, and create sophisticated agents, while comparing ChatModel and ChatClient and outlining strengths, weaknesses, and ideal use cases.

AI integration · ChatClient · Java

0 likes · 17 min read

Rare Earth Juejin Tech Community

Mar 24, 2025 · Artificial Intelligence

AI SDK 4.2 Release: New Reasoning, MCP Client, useChat Message Components, Image Generation, URL Sources, and Provider Updates

The AI SDK 4.2 release introduces powerful new features such as step‑by‑step reasoning support, a Model Context Protocol (MCP) client for tool integration, useChat message components, multimodal image generation, standardized URL sources, OpenAI Responses API support, Svelte 5 compatibility, and numerous middleware and provider enhancements, all illustrated with practical JavaScript/TypeScript examples.

AI SDK · JavaScript · MCP

0 likes · 19 min read

Baobao Algorithm Notes

Dec 24, 2023 · Artificial Intelligence

Must‑Read AI Agent and LLM Research Papers for Deep Understanding

This curated reading list compiles essential papers on AI agents, task planning, hallucination mitigation, multimodal models, image/video generation, foundational LLM research, open‑source large models, fine‑tuning techniques, and performance optimization, providing a comprehensive roadmap for anyone aiming to master modern generative AI.

AI agents · Multimodal Learning · Performance Optimization

0 likes · 23 min read

AI Explorer

Feb 28, 2026 · Artificial Intelligence

Explore the Awesome LLM Apps Repository: Hands‑On RAG and AI Agent Examples

The article presents the “Awesome LLM Apps” GitHub repository—over 98 000 stars and hundreds of open‑source LLM projects that showcase Retrieval‑Augmented Generation, AI agents, and multi‑agent collaborations across diverse use‑cases, and offers step‑by‑step guidance on browsing, cloning, configuring, and running these examples for developers, product managers, students, and AI enthusiasts.

AI agents · GitHub · LLM

0 likes · 6 min read

AI Open-Source Efficiency Guide

Apr 14, 2026 · Artificial Intelligence

Awesome Claude Plugins – Curated Extensions for Commands, Agents, Hooks, and MCP

This guide presents a curated list of Claude Code plugins maintained by Composio, detailing their structure, quick‑start steps, and categories ranging from integrations and frontend design to DevOps, security, and AI‑agent productivity, enabling Claude to perform real‑world actions.

AI agents · Claude · Composio

0 likes · 12 min read

DeepHub IMBA

Mar 20, 2026 · Artificial Intelligence

Claude Code Command System Explained: 3 Types, 7 Categories, 50+ Commands

This article provides a comprehensive guide to Claude Code’s command system, covering all slash commands, CLI flags, keyboard shortcuts, hidden features, and practical workflows, showing how to initialize projects, manage context, switch models, control costs, and automate development tasks efficiently.

AI coding assistant · Automation · CLI

0 likes · 29 min read

Code Mala Tang

Jun 2, 2026 · Artificial Intelligence

Demystifying Model Evaluation: 8 Key Terms You Must Know

The article breaks down eight technical terms—frontier coding, 1M‑long context, native multimodal, open‑source levels, benchmark layers, CUDA operators, autonomous iteration, and verifiable engineering strength—to help readers understand what modern AI model release notes actually mean.

CUDA operators · Model Evaluation · benchmark

0 likes · 11 min read

PaperAgent

Dec 14, 2025 · Artificial Intelligence

GPT‑5.2 vs Gemini 3 Pro: Coding Tests, NeurIPS 2025 Paper Insights, and RAG Refactor

The article evaluates GPT‑5.2 and Gemini 3 Pro on real‑world coding tasks, analyzes trends from the 6000 papers presented at NeurIPS 2025, and demonstrates how to extract and refactor the tree‑building component of the open‑source RAPTOR RAG system into an independent module.

AI model evaluation · Code Refactoring · GPT-5.2

0 likes · 5 min read

SuanNi

Apr 2, 2026 · Artificial Intelligence

How Alibaba’s New Qwen3.5‑Omni, Wan2.7‑Image, and Qwen3.6‑Plus Redefine Multimodal AI

Alibaba unveiled three cutting‑edge models—Qwen3.5‑Omni with native multimodal interaction, Wan2.7‑Image for high‑precision image generation and editing, and Qwen3.6‑Plus boosting coding agent performance—each achieving dozens of SOTA benchmarks, massive context windows, and novel capabilities such as Audio‑Visual Vibe Coding and transparent layer separation.

AI · Coding Agent · Large Language Model

0 likes · 7 min read

Lao Guo's Learning Space

May 2, 2026 · Industry Insights

AI News Flash: DeepSeek Multimodal Breakthrough, Codex Major Update, Grok 4.3 Launch (May 1‑2)

The AI roundup covers OpenAI's Codex upgrade with Workspace Agents and 40% token efficiency, xAI's Grok 4.3 API offering 128K context and 60% lower pricing, Ant Group's open‑source Ling 2.6‑1T model, DeepSeek's multimodal Visual Primitives framework and its sudden removal, plus the ongoing GPT‑Plus account bans and their mitigation.

AI model benchmarks · Codex · DeepSeek

0 likes · 11 min read

Java Architecture Diary

Jan 22, 2026 · Artificial Intelligence

Unlock Java Power with Claude Agent SDK: From One‑Shot to Reactive APIs

This article explains how Claude Code, a super‑intelligent AI agent, differs from traditional code‑completion tools, outlines the limitations of its official SDK, and provides a comprehensive guide to the community‑driven Claude Agent SDK for Java, covering its one‑shot, blocking, and reactive APIs and a practical RAG‑based Q&A example.

AI Agent · Claude Code · Java SDK

0 likes · 10 min read

Design Hub

Dec 9, 2025 · Artificial Intelligence

AI Frontiers: GLM‑4.6V, AutoGLM 2.0 & RealGen for Designers & Developers

The article reviews three recent AI breakthroughs: GLM‑4.6V, a multimodal large model with 128K context and native function calling; AutoGLM 2.0, an open‑source AI agent that operates mobile phones; and RealGen, a detector‑rewarded image generator that achieves a 50.15% realism win rate. It highlights how they expand the toolkits available to designers and developers.

AI agents · AutoGLM · GLM-4.6V

0 likes · 11 min read

AI Engineering

Mar 31, 2026 · Artificial Intelligence

Qwen3.5-Omni Introduces Audio‑Visual Vibe Coding: Code by Speaking and Gesturing

Alibaba's newly released Qwen3.5-Omni multimodal model adds an Audio‑Visual Vibe Coding feature that lets users describe a website or game with speech and gestures to generate code, while offering advanced audio comprehension, long‑duration media support, multilingual capabilities, fine‑grained voice control, and voice cloning, though its weights remain closed‑source.

AI · Alibaba · Audio-Visual Vibe Coding

0 likes · 3 min read

MaGe Linux Operations

May 8, 2023 · Artificial Intelligence

20 Creative Ways to Harness ChatGPT: From Code Fixes to Storytelling

This article compiles a wide range of practical ChatGPT applications—including grammar correction, translation, code explanation, error fixing, creative generation, and more—illustrated with descriptions, code snippets, and images to help users explore the model's versatile capabilities.

ChatGPT · Prompt Engineering · code debugging

0 likes · 17 min read
