Tag

AI model

0 views collected around this technical thread.

Java Architecture Diary
Java Architecture Diary
Mar 19, 2025 · Artificial Intelligence

Unlocking Google’s Gemma 3: Multimodal Power, 128k Context & Local Deployment Guide

This article introduces Google’s open‑source Gemma 3 model, highlighting its multimodal capabilities, massive 128k token context window, multilingual support, and provides step‑by‑step instructions for installing Ollama, pulling the model, and running local tests with code examples.

AI modelGemma 3Ollama
0 likes · 7 min read
Unlocking Google’s Gemma 3: Multimodal Power, 128k Context & Local Deployment Guide
Code Mala Tang
Code Mala Tang
Mar 15, 2025 · Artificial Intelligence

What Makes Google’s New Gemma 3 Model a Game‑Changer for AI Developers?

Google’s Gemma 3, a lightweight open‑source model with up to 27 billion parameters, offers multimodal input, 128K token context, and broad language support, outperforming leading rivals on single‑GPU benchmarks and providing flexible deployment options for developers and researchers alike.

AI modelGemma 3Google AI
0 likes · 9 min read
What Makes Google’s New Gemma 3 Model a Game‑Changer for AI Developers?
DataFunTalk
DataFunTalk
Feb 26, 2025 · Artificial Intelligence

Alibaba Cloud's Wanxiang 2.1: Open‑Source Dual‑Version Visual Generation Model with Full‑Scale Capabilities

Wanxiang 2.1, an open‑source visual generation model released by Alibaba Cloud, offers a 140‑billion‑parameter professional version and a 13‑billion‑parameter consumer‑grade version, delivering SOTA performance across multiple benchmarks, supporting diverse video generation tasks, and employing advanced DiT‑based architecture, 3D VAE, and efficient distributed training strategies.

AI modeldeep learningmultimodal
0 likes · 11 min read
Alibaba Cloud's Wanxiang 2.1: Open‑Source Dual‑Version Visual Generation Model with Full‑Scale Capabilities
DevOps
DevOps
Feb 25, 2025 · Artificial Intelligence

Claude 3.7 Sonnet: First Hybrid Reasoning Model with Enhanced Coding Tool and Strong Benchmark Performance

Claude 3.7 Sonnet, Anthropic's new hybrid reasoning model, introduces dual thinking modes, token‑based thinking budget control, unchanged pricing, and the Claude Code tool that automates lengthy coding tasks, while achieving record GPQA scores, superior video‑game testing results, and reduced unnecessary refusals on harmful requests.

AI modelClaudeCoding tool
0 likes · 7 min read
Claude 3.7 Sonnet: First Hybrid Reasoning Model with Enhanced Coding Tool and Strong Benchmark Performance
DataFunSummit
DataFunSummit
Feb 25, 2025 · Artificial Intelligence

Tiny‑R1‑32B‑Preview: A 5% Parameter Model Matching Deepseek‑R1‑671B Performance

On February 24, 2025, 360 and Peking University unveiled Tiny‑R1‑32B‑Preview, a medium‑scale inference model that uses only 5% of the parameters yet achieves performance comparable to the 671‑billion‑parameter Deepseek‑R1, with leading results on math, programming, and scientific benchmarks.

AI modelBenchmarkingModel Distillation
0 likes · 7 min read
Tiny‑R1‑32B‑Preview: A 5% Parameter Model Matching Deepseek‑R1‑671B Performance
Efficient Ops
Efficient Ops
Feb 25, 2025 · Artificial Intelligence

How to Deploy DeepSeek R1 Locally: A Step‑by‑Step Guide for AI Enthusiasts

This guide explains what DeepSeek R1 is, compares its full and distilled versions, details hardware requirements for Linux, Windows, and macOS, and provides step‑by‑step instructions for local deployment using Ollama, LM Studio, Docker, and visual interfaces like Open‑WebUI and Dify.

AI modelDeepSeekDify
0 likes · 9 min read
How to Deploy DeepSeek R1 Locally: A Step‑by‑Step Guide for AI Enthusiasts
Java Tech Enthusiast
Java Tech Enthusiast
Feb 15, 2025 · Artificial Intelligence

DeepSeek-R1: High-Performance AI Inference Model

DeepSeek‑R1 is a high‑performance AI inference model that leverages reinforcement‑learning techniques to boost reasoning on complex tasks, has become a Chinese‑New‑Year sensation, and requires substantial hardware resources for local deployment, especially the full‑scale 671‑billion‑parameter version.

AI deploymentAI inferenceAI model
0 likes · 4 min read
DeepSeek-R1: High-Performance AI Inference Model
Top Architect
Top Architect
Feb 1, 2025 · Artificial Intelligence

OpenAI Launches o3-mini: A Fast, Cost‑Effective AI Model Optimized for STEM Reasoning

OpenAI unveiled the o3-mini family—low, medium, and high variants—offering a cheaper, faster, and secure inference model that matches or exceeds the performance of its predecessor o1 across STEM, coding, and general knowledge benchmarks while introducing search integration and enhanced safety features.

AI modelAI safetyO3-mini
0 likes · 8 min read
OpenAI Launches o3-mini: A Fast, Cost‑Effective AI Model Optimized for STEM Reasoning
Code Mala Tang
Code Mala Tang
Jan 30, 2025 · Artificial Intelligence

Is Janus-Pro the Open‑Source Rival to DALL·E 3? A Deep Dive Review

This article reviews DeepSeek's Janus‑Pro image model, explains its multimodal architecture, benchmarks it against DALL·E 3 and Stable Diffusion, provides usage instructions and inference code, and offers a critical assessment of its image quality and practical limitations.

AI modelImage GenerationJanus-Pro
0 likes · 12 min read
Is Janus-Pro the Open‑Source Rival to DALL·E 3? A Deep Dive Review
Kuaishou Large Model
Kuaishou Large Model
Nov 29, 2024 · Artificial Intelligence

How OASIS Achieves State‑of‑the‑Art Code Search with Just 5M Tokens

Fast.ai's Kwaipilot team unveiled OASIS, a 1.3B‑parameter code‑embedding model that, using only 5 million tokens, outperforms larger OpenAI embeddings across CodeSearchNet, CoSQA and AdvTest benchmarks, thanks to repository‑level program analysis, synthetic data generation, and a fused loss function.

AI modelbenchmarkcode embedding
0 likes · 8 min read
How OASIS Achieves State‑of‑the‑Art Code Search with Just 5M Tokens
360 Tech Engineering
360 Tech Engineering
Jul 3, 2024 · Artificial Intelligence

360LayoutAnalysis: Open‑Source Lightweight Document Layout Analysis Models for Multiple Scenarios

The 360LayoutAnalysis project from 360 AI Lab releases lightweight, yolov8‑based layout analysis models covering Chinese and English papers, Chinese research reports, and a general document scenario, providing fast inference, paragraph‑level detection, and open‑source code and weights for flexible document‑understanding pipelines.

AI modelDocument UnderstandingYOLOv8
0 likes · 9 min read
360LayoutAnalysis: Open‑Source Lightweight Document Layout Analysis Models for Multiple Scenarios
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 22, 2024 · Artificial Intelligence

Claude 3.5 Sonnet: Performance Review and Real‑World Tests

Claude 3.5 Sonnet, Anthropic’s latest large language model, is evaluated across a range of Chinese‑language tasks, visual reasoning, coding, and game creation, showing faster, cheaper, and often superior results compared to GPT‑4o, while also revealing occasional failures in simple games and math problems.

AI modelAnthropicClaude 3.5
0 likes · 8 min read
Claude 3.5 Sonnet: Performance Review and Real‑World Tests
Java Architecture Diary
Java Architecture Diary
Jun 9, 2024 · Artificial Intelligence

How to Enable Gemini Nano AI in Chrome Canary – Step‑by‑Step Guide

This guide explains how to download Chrome Canary, enable the Gemini Nano AI model via chrome://flags, verify the model download, and test it using a JavaScript console snippet, providing all necessary steps and resources for developers.

AI modelArtificial IntelligenceChrome Canary
0 likes · 3 min read
How to Enable Gemini Nano AI in Chrome Canary – Step‑by‑Step Guide
php中文网 Courses
php中文网 Courses
Aug 11, 2023 · Artificial Intelligence

Anthropic Releases Claude Instant 1.2 with Improved Performance on Coding and Math Benchmarks

Anthropic announced the Claude Instant 1.2 model, an upgraded, cheaper version of its AI assistant that leverages Claude 2.0’s capabilities, achieving higher scores on Codex (58.7% vs 52.8%) and GSM8k (86.7% vs 80.9%) benchmarks, with better safety and reduced hallucinations.

AI modelAnthropicArtificial Intelligence
0 likes · 3 min read
Anthropic Releases Claude Instant 1.2 with Improved Performance on Coding and Math Benchmarks
Tencent Cloud Developer
Tencent Cloud Developer
Dec 12, 2022 · Artificial Intelligence

Performance Optimization of Tencent Cloud OCR Service: Reducing Latency and Improving Throughput

Tencent Cloud’s OCR team cut average response time from 1.8 seconds to under one second and boosted throughput by over 50 % by redesigning the model with self‑attention, accelerating inference with a Tensor‑Network accelerator, shrinking RPC payloads, enabling asynchronous logging, and optimizing multi‑region GPU memory utilization.

AI modelCloud ServicesInference Acceleration
0 likes · 13 min read
Performance Optimization of Tencent Cloud OCR Service: Reducing Latency and Improving Throughput