Collection size: 96 articles
James' Growth Diary
May 25, 2026 · Artificial Intelligence

Practical Agent Performance Tuning: Slash Latency 75%, Cut Token Costs 71%, Boost Throughput 217%

The article walks through a systematic performance map of LangChain agents and demonstrates concrete latency, token‑usage, and concurrency optimizations: streaming responses, Redis caching, model routing, prompt trimming, context summarisation, dynamic tool selection, parallel graph nodes, and batch processing. It reports real‑world gains of up to 75% lower latency, 71% fewer tokens, and a 217% throughput increase.

Agent Optimization · Concurrency · LangChain
0 likes · 30 min read
Java Web Project
Mar 11, 2025 · Artificial Intelligence

Claude 3.7 Sonnet: How the Hybrid Reasoning Model Redefines AI‑Assisted Coding

Claude 3.7 Sonnet, billed as the world’s first hybrid‑reasoning model, dramatically boosts code generation, supports fast‑response and extended‑thinking modes, and demonstrates real‑world UI reconstruction, game creation, and physics simulation, while its companion Claude Code tool automates complex engineering tasks and large‑codebase integration.

AI code generation · Automation · Claude 3.7
0 likes · 6 min read
James' Growth Diary
Apr 26, 2026 · Databases

Vector Database Fundamentals: Embedding, Similarity Search, and Index Structures Explained in One Go

This article walks through the complete workflow of turning split text into high‑dimensional vectors, choosing the right embedding model, selecting an appropriate similarity metric, comparing index structures such as Flat, IVF, HNSW and PQ, and finally picking a vector database and integrating it with LangChain.js for production‑grade RAG pipelines.

LangChain · RAG · embeddings
0 likes · 25 min read
Design Hub
Jan 15, 2026 · Artificial Intelligence

Hands‑On Guide to Mastering Claude Agentic Skills + Full Resource List

This article walks designers through creating and installing Claude Agentic Skills, explains the required YAML metadata and Markdown commands, shows step‑by‑step commands for building a code‑explaining skill, and provides a curated list of useful Skill repositories.

AI assistants · Claude · GitHub
0 likes · 9 min read
AI Architecture Path
Jun 5, 2026 · Artificial Intelligence

Supermemory Tops Three Authority Benchmarks, Solving AI Forgetting

Supermemory, the open‑source AI memory engine, eliminates repeated forgetting by offering a zero‑configuration, multi‑modal memory layer that tops LongMemEval, LoCoMo and ConvoMo benchmarks, integrates automatic learning, mixed RAG‑Memory search, built‑in connectors, privacy tags, and multiple deployment options from no‑code web to local offline versions.

AI memory · RAG · Supermemory
0 likes · 14 min read
Su San Talks Tech
Apr 20, 2026 · Artificial Intelligence

Master Spring AI: From Hello World to Advanced RAG, Tool Calling, and Agent Development

This step‑by‑step guide shows Java developers how to set up Spring AI, configure various model providers, build basic and streaming chat APIs, enable multi‑turn memory, implement RAG with vector stores, add tool‑calling and multimodal capabilities, integrate MCP, and create sophisticated agents, while comparing ChatModel and ChatClient and outlining strengths, weaknesses, and ideal use cases.

AI integration · ChatClient · Java
0 likes · 17 min read
Rare Earth Juejin Tech Community
Mar 24, 2025 · Artificial Intelligence

AI SDK 4.2 Release: New Reasoning, MCP Client, useChat Message Components, Image Generation, URL Sources, and Provider Updates

The AI SDK 4.2 release introduces powerful new features such as step‑by‑step reasoning support, a Model Context Protocol (MCP) client for tool integration, useChat message components, multimodal image generation, standardized URL sources, OpenAI Responses API support, Svelte 5 compatibility, and numerous middleware and provider enhancements, all illustrated with practical JavaScript/TypeScript examples.

AI SDK · JavaScript · MCP
0 likes · 19 min read
Baobao Algorithm Notes
Dec 24, 2023 · Artificial Intelligence

Must‑Read AI Agent and LLM Research Papers for Deep Understanding

This curated reading list compiles essential papers on AI agents, task planning, hallucination mitigation, multimodal models, image/video generation, foundational LLM research, open‑source large models, fine‑tuning techniques, and performance optimization, providing a comprehensive roadmap for anyone aiming to master modern generative AI.

AI agents · Multimodal Learning · Performance Optimization
0 likes · 23 min read
AI Explorer
Feb 28, 2026 · Artificial Intelligence

Explore the Awesome LLM Apps Repository: Hands‑On RAG and AI Agent Examples

The article presents the “Awesome LLM Apps” GitHub repository, which has over 98,000 stars and collects hundreds of open‑source LLM projects showcasing Retrieval‑Augmented Generation, AI agents, and multi‑agent collaboration across diverse use cases. It also offers step‑by‑step guidance on browsing, cloning, configuring, and running these examples for developers, product managers, students, and AI enthusiasts.

AI agents · GitHub · LLM
0 likes · 6 min read
DeepHub IMBA
Mar 20, 2026 · Artificial Intelligence

Claude Code Command System Explained: 3 Types, 7 Categories, 50+ Commands

This article provides a comprehensive guide to Claude Code’s command system, covering all slash commands, CLI flags, keyboard shortcuts, hidden features, and practical workflows, showing how to initialize projects, manage context, switch models, control costs, and automate development tasks efficiently.

AI coding assistant · Automation · CLI
0 likes · 29 min read
Code Mala Tang
Jun 2, 2026 · Artificial Intelligence

Demystifying Model Evaluation: 8 Key Terms You Must Know

The article breaks down eight technical terms—frontier coding, 1M‑long context, native multimodal, open‑source levels, benchmark layers, CUDA operators, autonomous iteration, and verifiable engineering strength—to help readers understand what modern AI model release notes actually mean.

CUDA operators · Model Evaluation · benchmark
0 likes · 11 min read
PaperAgent
Dec 14, 2025 · Artificial Intelligence

GPT‑5.2 vs Gemini 3 Pro: Coding Tests, NeurIPS 2025 Paper Insights, and RAG Refactor

The article evaluates GPT‑5.2 and Gemini 3 Pro on real‑world coding tasks, analyzes trends from the 6000 papers presented at NeurIPS 2025, and demonstrates how to extract and refactor the tree‑building component of the open‑source RAPTOR RAG system into an independent module.

AI model evaluation · Code Refactoring · GPT-5.2
0 likes · 5 min read
SuanNi
Apr 2, 2026 · Artificial Intelligence

How Alibaba’s New Qwen3.5‑Omni, Wan2.7‑Image, and Qwen3.6‑Plus Redefine Multimodal AI

Alibaba unveiled three cutting‑edge models—Qwen3.5‑Omni with native multimodal interaction, Wan2.7‑Image for high‑precision image generation and editing, and Qwen3.6‑Plus boosting coding agent performance—each achieving dozens of SOTA benchmarks, massive context windows, and novel capabilities such as Audio‑Visual Vibe Coding and transparent layer separation.

AI · Coding Agent · Large Language Model
0 likes · 7 min read
Lao Guo's Learning Space
May 2, 2026 · Industry Insights

AI News Flash: DeepSeek Multimodal Breakthrough, Codex Major Update, Grok 4.3 Launch (May 1‑2)

The AI roundup covers OpenAI's Codex upgrade with Workspace Agents and 40% token efficiency, xAI's Grok 4.3 API offering 128K context and 60% lower pricing, Ant Group's open‑source Ling 2.6‑1T model, DeepSeek's multimodal Visual Primitives framework and its sudden removal, plus the ongoing GPT‑Plus account bans and their mitigation.

AI model benchmarks · Codex · DeepSeek
0 likes · 11 min read
Java Architecture Diary
Jan 22, 2026 · Artificial Intelligence

Unlock Java Power with Claude Agent SDK: From One‑Shot to Reactive APIs

This article explains how Claude Code, an AI agent, differs from traditional code‑completion tools, outlines the limitations of its official SDK, and provides a comprehensive guide to the community‑driven Claude Agent SDK for Java, including its one‑shot, blocking, and reactive APIs and a practical RAG‑based Q&A example.

AI Agent · Claude Code · Java SDK
0 likes · 10 min read
Design Hub
Dec 9, 2025 · Artificial Intelligence

AI Frontiers: GLM‑4.6V, AutoGLM 2.0 & RealGen for Designers & Developers

The article reviews three recent AI breakthroughs: GLM‑4.6V, a multimodal large model with 128K context and native function calling; AutoGLM 2.0, an open‑source AI agent that operates mobile devices; and RealGen, a detector‑rewarded image generator that achieves a 50.15% realism win rate. It highlights how they expand the toolkits of designers and developers.

AI agents · AutoGLM · GLM-4.6V
0 likes · 11 min read
AI Engineering
Mar 31, 2026 · Artificial Intelligence

Qwen3.5-Omni Introduces Audio‑Visual Vibe Coding: Code by Speaking and Gesturing

Alibaba's newly released Qwen3.5-Omni multimodal model adds an Audio‑Visual Vibe Coding feature that lets users describe a website or game with speech and gestures to generate code, while offering advanced audio comprehension, long‑duration media support, multilingual capabilities, fine‑grained voice control, and voice cloning, though its weights remain closed‑source.

AI · Alibaba · Audio-Visual Vibe Coding
0 likes · 3 min read
MaGe Linux Operations
May 8, 2023 · Artificial Intelligence

20 Creative Ways to Harness ChatGPT: From Code Fixes to Storytelling

This article compiles a wide range of practical ChatGPT applications—including grammar correction, translation, code explanation, error fixing, creative generation, and more—illustrated with descriptions, code snippets, and images to help users explore the model's versatile capabilities.

ChatGPT · Prompt Engineering · code debugging
0 likes · 17 min read