Author

JavaEdge

Frontline development experience at several leading tech firms; now a software architect at a Shanghai state‑owned enterprise and founder of Programming Yanxuan. Nearly 300k followers online; expertise in distributed system design, AIGC application development, and quantitative investing.

372

Articles

Likes

885

Views

Comments

Latest from JavaEdge


JavaEdge

Jun 27, 2025 · Artificial Intelligence

Why Inference Engines Are Essential for Deploying Large Language Models in Production

The article explains what inference engines are, why they are needed beyond raw Python scripts, and outlines best practices such as model quantization, batching, and parallelism, while comparing popular open‑source and commercial options for production AI workloads.

AI deployment · LLM · Parallelism

0 likes · 14 min read


JavaEdge

Jun 11, 2025 · Backend Development

Java Weekly Update: JDK 25 Rampdown, JDK 26 Roadmap, Jakarta EE, Spring Cloud, Hibernate and More

This weekly roundup covers the latest Java ecosystem news, including JDK 25 entering ramp‑down, the formation of the JDK 26 expert group, Jakarta EE 11/12 milestones, new releases of Eclipse JNoSQL, Spring Cloud, Hibernate Search, Helidon, Open Liberty, Grails, JBang, and a preview of Oracle Labs' Project Crema.

Helidon · Hibernate · JDK

0 likes · 10 min read


JavaEdge

Jun 6, 2025 · Artificial Intelligence

Why Qwen3 Embedding Models Are Setting New Benchmarks in Text Representation

The article introduces the Qwen3 Embedding series, detailing its model variants, architecture, training methodology, multilingual support, performance metrics across several benchmarks, and future development plans, highlighting its superior generalization and flexibility for diverse AI applications.

AI · Embedding · Multilingual

0 likes · 9 min read


JavaEdge

Jun 5, 2025 · Artificial Intelligence

How Amazon’s Strands Agents SDK Simplifies Building AI Agents

Amazon’s newly open‑sourced Strands Agents SDK lets developers create AI agents with minimal code by defining prompts, tools, and models. It offers a lightweight, production‑ready framework that supports multiple model providers, observability, multi‑agent collaboration, and extensible tooling via dedicated packages.

AI Agents · Amazon · LLM

0 likes · 7 min read


JavaEdge

May 30, 2025 · Artificial Intelligence

How to Build a Deep Research Workflow in Dify Using AI Agents

This guide explains how to construct a deep research workflow in Dify that leverages AI agents, loop variables, and structured outputs to automatically explore complex topics, gather sources, and synthesize comprehensive reports with proper citations.

AI Workflow · Agent · Automation

0 likes · 9 min read


JavaEdge

May 27, 2025 · Artificial Intelligence

Boost LLM App Performance: Master Parallel Workflows in Dify v0.8.0

Version 0.8.0 of Dify introduces parallel workflow capabilities that let multiple branches run concurrently, reducing latency for complex LLM tasks. The guide explains how to create simple, nested, iterative, and conditional parallel branches, with step‑by‑step instructions and visual examples.

Dify · LLM · parallel processing

0 likes · 8 min read


JavaEdge

May 24, 2025 · Databases

Redis 8 Goes Open Source Again: AGPLv3 License, Speed Gains, New Features

Redis 8 has been released under the OSI‑approved AGPLv3 license, reversing its 2024 SSPL shift, and brings up to 87% faster command execution, double the throughput, and integrated modules like JSON, TimeSeries, and the new Vector Sets data type, sparking community debate and competition with Valkey.

AGPLv3 · Valkey · database

0 likes · 6 min read


JavaEdge

May 7, 2025 · Artificial Intelligence

Why AI Agents Pose New Security Risks and How to Safeguard Them

The article explains what AI agents are, highlights their emerging security risks such as data leakage and lack of accountability, and offers practical strategies—including risk analysis, threat modeling, and engineering best practices—to mitigate these challenges for enterprises.

AI Agents · AI safety · Security Risks

0 likes · 9 min read


JavaEdge

May 2, 2025 · Artificial Intelligence

Exploring Qwen3: Open‑Source LLM Features, Benchmarks, and Deployment Guides

This article introduces the Qwen3 family of open‑source large language models, details their architecture, parameter counts, multilingual support, and benchmark performance, and provides step‑by‑step instructions for deploying them with frameworks like SGLang and vLLM, and with local runtimes such as Ollama and LM Studio.

AI · Agent · Qwen3

0 likes · 22 min read


JavaEdge

Apr 26, 2025 · Artificial Intelligence

Turn LM Studio into a Local OpenAI‑Compatible API Server

This guide shows how to select a model in LM Studio, expose a local port, start the HTTP server, and interact with it via curl commands, covering quick model listing, chat requests, and the difference between streaming and full‑response modes.

AI · API · LM‑Studio

0 likes · 5 min read
