Java Tech Enthusiast
Feb 16, 2024 · Artificial Intelligence

Google's Gemini 1.5: Breakthrough in Long-Context Understanding and Multimodal Capabilities

Google’s Gemini 1.5, a new multimodal Mixture‑of‑Experts model, supports a context window of up to 1 million tokens (10 million in internal testing), understands text, video, audio and code, can learn a new language from a single prompt, and is already being used by Samsung, Jasper and Quora, positioning it as a direct challenger to OpenAI’s flagship models.

Gemini 1.5 · Google AI · LLM
7 min read
Baobao Algorithm Notes
Jul 29, 2025 · Artificial Intelligence

Qwen3‑30B‑A3B‑Instruct‑2507: New Instruction Model with Boosted General and Multilingual Skills

The Qwen3‑30B‑A3B‑Instruct‑2507 model, an updated non‑thinking version of Qwen3‑30B‑A3B, delivers significant gains in instruction following, reasoning, and multilingual knowledge coverage, and supports a 256K context length. Its performance is benchmarked against leading LLMs across a wide range of tasks.

Instruction Tuning · Mixture‑of‑Experts · Qwen3
6 min read
JD Tech Talk
Dec 1, 2025 · Artificial Intelligence

How JoyAgent Enables Multimodal RAG for Enterprise Knowledge Management

JoyAgent, JD's open‑source intelligent‑agent platform, now adds multimodal Retrieval‑Augmented Generation (RAG) capabilities, combining graph‑based knowledge, hierarchical chunking, and vision‑language models to handle text, images, tables, and API data for enterprise knowledge processing and evaluation.

Enterprise AI · Knowledge Graph · Multimodal RAG
11 min read
Baobao Algorithm Notes
Mar 2, 2026 · Artificial Intelligence

How “Skills” Turn LLM Prompts into Portable, Engineered Workflows

This article dissects the evolution of LLM prompts into structured, version‑controlled skill packages, explains the AgentSkills specification, details OpenClaw’s implementation, compares prompts, memory, MCP and skills, and provides end‑to‑end examples with code, flowcharts and best‑practice recommendations.

Agent Skills · Automation · LLM
40 min read
PaperAgent
Feb 11, 2026 · Industry Insights

Is DeepSeek’s New V4 Model Redefining the AI Landscape?

DeepSeek has quietly released a new large‑language model—likely V4—featuring a May 2025 knowledge cutoff, a 1 million‑token context window, and pure‑text capabilities, while industry trends in 2026 shift focus toward agentic AI systems that coordinate multiple specialized models.

AI Models · Agentic AI · DeepSeek
3 min read
AI Algorithm Path
Apr 15, 2026 · Artificial Intelligence

8 Must-Collect Agent Skills Repositories for Claude and AI Agents

This article explains what Agent Skills are, why a curated skill library is valuable, and reviews eight actively maintained GitHub repositories—detailing their structure, core capabilities, integration points, and practical usage examples for building production‑grade AI agents.

AI agents · AI tools · Agent Skills
11 min read
Wuming AI
Sep 20, 2025 · Artificial Intelligence

How to Use Cherry Studio for Simultaneous Multi‑Model AI Calls

This guide shows how to install Cherry Studio, configure multiple AI model providers, and send a single prompt that triggers several models at once, with step‑by‑step screenshots, layout tips, and cost considerations for free and paid services.

AI tools · Artificial Intelligence · Cherry Studio
5 min read
Amap Tech
Apr 21, 2025 · Artificial Intelligence

Lenna: Language‑Enhanced Reasoning Detection Assistant and a Chain‑of‑Thought Image Editing Framework Using Multimodal Large Language Models

Amap (Gaode) had two papers accepted at ICASSP 2025. The first presents Lenna, a language‑enhanced reasoning detection assistant that adds a DET token to multimodal LLMs and achieves state‑of‑the‑art accuracy on RefCOCO benchmarks. The second presents a chain‑of‑thought image‑editing framework that converts complex prompts into segmented masks and repair prompts for diffusion‑based inpainting, surpassing existing methods.

AI · ICASSP · chain of thought
10 min read
Alibaba Cloud Big Data AI Platform
Oct 10, 2023 · Artificial Intelligence

Create a Custom Enterprise Conversational Search with Alibaba Cloud OpenSearch Vector & LLM

This guide walks you through setting up Alibaba Cloud OpenSearch Vector Search and LLM Intelligent Q&A editions, covering environment preparation, instance creation, data source configuration, field and index setup, document ingestion, query processing, and a complete Java SDK demo for building a flexible enterprise conversational search system.

Alibaba Cloud · Conversational AI · Java SDK
20 min read
AI Cyberspace
Sep 15, 2025 · Artificial Intelligence

What Is Agentic AI? From LLM Limits to Autonomous AI Agents

Agentic AI transforms static large language models into autonomous agents by adding perception, goal orientation, planning, action, interaction, and iterative loops, tracing its evolution from early chatbots through Prompt Engineering, ReAct, AutoGPT, OpenAI Function Calling, to modern multi‑agent frameworks, while addressing challenges like memory, hallucinations, and scalability.

Agentic AI · Multi-agent · RAG
38 min read
Fun with Large Models
Oct 10, 2025 · Artificial Intelligence

Coze Low-Code Agent Platform: In‑Depth Look at Its Six Core Features

This article provides a comprehensive overview of the Coze low‑code AI agent platform, detailing its free, multi‑model capabilities and six core functions—plugins, knowledge base, database, image flow, workflow, and multi‑agent collaboration—while illustrating how each feature lowers development barriers and enables sophisticated agent applications.

Agent Platform · Coze · Knowledge Base
13 min read
Baidu Intelligent Cloud Tech Hub
Jan 27, 2026 · Artificial Intelligence

Deploying Qwen3 on Kunlun P800: Full‑Parameter DPO Training and Inference Guide

This guide walks through setting up a Kunlun P800 XPU host, preparing Docker containers, deploying Qwen3‑8B/‑32B/‑VL models with vLLM‑Kunlun, benchmarking performance, and running full‑parameter DPO training using LLaMA‑Factory, providing scripts, configuration files, and troubleshooting tips for AI engineers.

DPO · Inference · Kunlun P800
32 min read
PaperAgent
Jan 30, 2026 · Artificial Intelligence

How LLM‑in‑Sandbox Turns Large Models into General‑Purpose Agents Without Extra Training

The LLM‑in‑Sandbox framework places large language models inside a virtual machine that provides external tool access, persistent storage, and code execution, yielding up to a 24.2% performance boost across six benchmark tasks without additional training, and it scales from zero‑shot to reinforcement‑learning‑enhanced agents while remaining cost‑effective.

Agentic AI · LLM · efficiency
6 min read
AI Frontier Lectures
Jul 11, 2025 · Artificial Intelligence

Can LLMs ‘Squint’ to Recognize Hidden Faces? A Comparative Test

The article evaluates several large language models—including ChatGPT, Gemini, Grok, Qwen, and o3‑Pro—on a visual illusion that requires squinting to identify the Mona Lisa, revealing varied success rates, reasoning differences, and insights into model capabilities and limitations.

LLM · model comparison · prompt engineering
6 min read
JD Retail Technology
Feb 18, 2025 · Artificial Intelligence

Engineering Practices of JD Advertising Agent: JDZunTong Intelligent Assistant

JD’s advertising R&D team created the JDZunTong Intelligent Assistant by engineering a modular Agent platform that combines advanced Retrieval‑Augmented Generation (RAG 1.0 → 2.0) and Function‑Call capabilities, a visual designer, custom tool registration, and a native Python workflow engine to deliver intelligent customer service, data queries, and ad creation for merchants.

AI · Agent · Function Call
18 min read
Kuaishou Tech
Jun 4, 2025 · Artificial Intelligence

KwaiCoder-AutoThink-preview: An Automatic‑Thinking Large Model Enhanced with Step‑SRPO Reinforcement Learning

The KwaiPilot team released the KwaiCoder‑AutoThink‑preview model, which introduces a novel automatic‑thinking training paradigm and a process‑supervised reinforcement‑learning method called Step‑SRPO, enabling the model to dynamically switch between thinking and non‑thinking modes, reduce inference cost, and achieve up to 20‑point gains on code and math benchmarks while handling large‑scale codebases.

AI research · Large Language Model · Model Optimization
12 min read