Tagged articles

5000 articles

Page 23 of 50

Mar 5, 2025 · Industry Insights

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

The article analyzes DeepSeek's recent releases—V3 dialogue model and R1 inference model—detailing their launch dates, rapid popularity surge, R1's reinforcement‑learning‑based design for code and math tasks, and provides links to related Beijing University technical reports while stripping promotional sales content.

AIDeepSeekIndustry Analysis

0 likes · 3 min read

DeepSeek R1 & Kimi 1.5: Inside the Development of Near‑Strong Reasoning Models

Model Perspective

Mar 5, 2025 · Artificial Intelligence

Can AI Really Crack NP‑Hard Problems? Inside the DeepSeek‑R1 Breakthrough

Researchers from Nanjing University of Aeronautics, Nanjing University of Technology and Oxford show that high‑instruction prompts dramatically boost large language models' mathematical reasoning, enabling DeepSeek‑R1 and Qwen2.5 to solve complex polynomial tasks and even produce a new counterexample to Hilbert's 17th problem.

AIDeepSeekMathematical Reasoning

0 likes · 6 min read

Can AI Really Crack NP‑Hard Problems? Inside the DeepSeek‑R1 Breakthrough

Cognitive Technology Team

Mar 5, 2025 · Artificial Intelligence

Comparative Analysis of Java AI Frameworks: LangChain4j, Spring AI, and Agent-Flex

This article examines three leading Java AI frameworks—LangChain4j, Spring AI, and Agent-Flex—by comparing their architectures, core capabilities, and ideal use‑cases, helping developers choose the most suitable solution for enterprise, domestic, or rapid‑prototype projects.

AIAgent-FlexLLM

0 likes · 5 min read

Comparative Analysis of Java AI Frameworks: LangChain4j, Spring AI, and Agent-Flex

Tencent Cloud Developer

Mar 5, 2025 · Artificial Intelligence

DeepSeek Series Overview: Core Technologies, Model Innovations, and Product Highlights

The article delivers a PPT‑style deep dive into the DeepSeek series—from the original LLM through DeepSeek‑MoE, Math, V2, V3 and R1—highlighting core innovations such as Multi‑Head Latent Attention, fine‑grained MoE, GRPO reinforcement learning, Multi‑Token Prediction, DualPipe parallelism and FP8 training that together achieve high performance at a fraction of traditional costs, and notes their integration into Tencent’s OlaChat intelligent assistant.

AIDeepSeekFP8 training

0 likes · 21 min read

DeepSeek Series Overview: Core Technologies, Model Innovations, and Product Highlights

21CTO

Mar 4, 2025 · Artificial Intelligence

Will AI Replace Developers? Emerging Roles for Software Engineers

The article examines how generative AI will automate many coding tasks yet create new opportunities for software engineers, emphasizing the need for human supervision, ethical oversight, and advanced roles such as AI integration, system architecture, and cybersecurity in the evolving tech landscape.

AIdeveloper rolesfuture of work

0 likes · 6 min read

Will AI Replace Developers? Emerging Roles for Software Engineers

JD Retail Technology

Mar 4, 2025 · Artificial Intelligence

JD Retail End-to-End AI Engine Compatible with GPU and Domestic NPU: Architecture, Optimization, and Applications

JD Retail’s Nine‑Number Algorithm Platform delivers an end‑to‑end AI engine that unifies GPU and domestic NPU resources across a thousand‑card cluster, offering zero‑cost model migration, optimized training and inference pipelines, support for over 40 LLM and multimodal models, and proven business‑level performance that reduces dependence on overseas chips.

AIDistributed TrainingGPU

0 likes · 19 min read

JD Retail End-to-End AI Engine Compatible with GPU and Domestic NPU: Architecture, Optimization, and Applications

Alibaba Cloud Developer

Mar 4, 2025 · Artificial Intelligence

Build a Smart Knowledge Base with DeepSeek R1 and Alibaba Cloud Low‑Code

This tutorial guides you through creating an AI‑powered, customizable knowledge space by integrating DeepSeek R1 via Alibaba Cloud Bailei's Model‑as‑a‑Service with the low‑code Mobinext platform, covering setup, configuration, deployment, and future expansion for multi‑tenant use.

AIAlibaba CloudDeepSeek

0 likes · 12 min read

Build a Smart Knowledge Base with DeepSeek R1 and Alibaba Cloud Low‑Code

58UXD

Mar 4, 2025 · Artificial Intelligence

Can DeepSeek AI Turn User Complaints into Actionable Design Solutions?

This article explores how DeepSeek AI was fed real negative user feedback from a 58.com B‑side posting page, compares its design recommendations with those of a professional designer, and evaluates the strengths and limitations of AI‑generated UX suggestions.

AIUX designcase study

0 likes · 4 min read

Can DeepSeek AI Turn User Complaints into Actionable Design Solutions?

DataFunTalk

Mar 4, 2025 · Artificial Intelligence

Roundtable: How AI Is Changing Enterprise – Insights from Box CEO Aaron Levie and Panel

In this roundtable, moderator Garry and guests including Box CEO Aaron Levie discuss the current AI revolution, the role of model companies, opportunities for startups, pricing models, enterprise adoption, security concerns, and the broader economic impact of AI on businesses and society.

AIAI adoptionBusiness Models

0 likes · 9 min read

Roundtable: How AI Is Changing Enterprise – Insights from Box CEO Aaron Levie and Panel

Huawei Cloud Developer Alliance

Mar 4, 2025 · Artificial Intelligence

Build a RAG Vector Database with DeepSeek on a Cloud Host – Step‑by‑Step Guide

This tutorial explains how to deploy the DeepSeek‑r1:1.5b model on a cloud server using Ollama, create a retrieval‑augmented generation (RAG) vector database with the mxbai‑embed‑large embedding model, and build an interactive AI application that answers questions from uploaded PDFs.

AIDeepSeekOllama

0 likes · 6 min read

Build a RAG Vector Database with DeepSeek on a Cloud Host – Step‑by‑Step Guide

AIWalker

Mar 3, 2025 · Artificial Intelligence

ByteDance’s Diffusion Restoration Adapter Achieves State‑of‑the‑Art Real‑World Image Recovery

This paper introduces a lightweight Diffusion Restoration Adapter that integrates into pre‑trained diffusion priors such as StableDiffusion XL and StableDiffusion 3, dramatically reduces parameter overhead compared with ControNet, and delivers superior quantitative and visual results on real‑world image restoration benchmarks through a novel sampling strategy.

AIAdapterDiffusion Models

0 likes · 17 min read

ByteDance’s Diffusion Restoration Adapter Achieves State‑of‑the‑Art Real‑World Image Recovery

Code Mala Tang

Mar 3, 2025 · Artificial Intelligence

Unlock AI’s Full Potential with Structured Prompt Decorators

Prompt Decorators are structured prefixes that standardize and enhance AI responses, addressing common challenges like vague prompts, inconsistent answers, and lack of reasoning by guiding the model to produce clear, logical, and well‑organized outputs across various use cases.

AILLMPrompt engineering

0 likes · 23 min read

Unlock AI’s Full Potential with Structured Prompt Decorators

Baidu Intelligent Cloud Tech Hub

Mar 3, 2025 · Cloud Computing

How Baidu Cloud Optimizes GPU Servers for AI Workloads

This article explains the design and implementation of GPU cloud servers, covering data processing pipelines, hardware selection, topology, interconnect technologies, virtualization, multi‑GPU communication methods, and Baidu's practical solutions for both virtualized and bare‑metal instances to boost AI inference and training performance.

AIGPUNVLink

0 likes · 29 min read

How Baidu Cloud Optimizes GPU Servers for AI Workloads

Alibaba Terminal Technology

Mar 3, 2025 · Artificial Intelligence

Boost Frontend Testing with Midscene.js: AI‑Powered UI Automation Made Easy

Midscene.js, an open‑source UI automation tool enhanced with multimodal AI, lets developers create, run, and assert frontend tests without writing code via a Chrome extension, and also integrates into Playwright, Puppeteer, or YAML scripts, offering AI‑driven actions, queries, and assertions.

AIMidscene.jsPlaywright

0 likes · 6 min read

Boost Frontend Testing with Midscene.js: AI‑Powered UI Automation Made Easy

IT Services Circle

Mar 3, 2025 · Fundamentals

AMD RX 9070 and RX 9070 XT: Specifications, Performance Benchmarks, AI Capabilities, and Pricing

The article reviews AMD's newly announced RX 9070 and RX 9070 XT graphics cards, detailing their 4 nm RDNA 4 architecture, core specifications, gaming performance gains over the RX 7900 GRE, AI workload improvements, FSR 4 enhancements, and launch pricing compared with NVIDIA's RTX 50 series.

AIAMDFSR4

0 likes · 6 min read

AMD RX 9070 and RX 9070 XT: Specifications, Performance Benchmarks, AI Capabilities, and Pricing

JD Tech Talk

Mar 3, 2025 · Artificial Intelligence

AI Engine Technology Based on Domestic Chips for JD Retail

This article describes JD Retail's AI engine built on domestic NPU chips, covering challenges, heterogeneous GPU‑NPU scheduling, high‑performance training and inference engines, extensive model support, real‑world deployment cases, and future plans for large‑scale chip clusters and ecosystem development.

AIDistributed TrainingGPU

0 likes · 20 min read

AI Engine Technology Based on Domestic Chips for JD Retail

JD Cloud Developers

Mar 3, 2025 · Artificial Intelligence

How JD.com Leverages Domestic NPU Chips to Power Large‑Scale AI Models

This article details JD.com's challenges and solutions for deploying domestic NPU chips across heterogeneous GPU‑NPU clusters, covering architecture, scheduling, high‑performance training and inference engines, real‑world case studies, and future plans to scale AI workloads securely and efficiently.

AIDomestic ChipsInference

0 likes · 19 min read

How JD.com Leverages Domestic NPU Chips to Power Large‑Scale AI Models

DataFunTalk

Mar 3, 2025 · Artificial Intelligence

FlightVGM: FPGA-Accelerated Inference for Video Generation Models Wins Best Paper at FPGA 2025

The FlightVGM paper, awarded Best Paper at FPGA 2025, details a novel FPGA-based inference IP for video generation models that leverages time‑space activation sparsity, mixed‑precision DSP58 extensions, and adaptive scheduling to achieve up to 1.30× performance and 4.49× energy‑efficiency gains over a NVIDIA 3090 GPU while preserving model accuracy.

AIFPGAHardware acceleration

0 likes · 11 min read

FlightVGM: FPGA-Accelerated Inference for Video Generation Models Wins Best Paper at FPGA 2025

大转转FE

Mar 3, 2025 · Frontend Development

Zhuanzhuan Frontend Weekly – Curated Technical Articles

This issue of Zhuanzhuan Frontend Weekly curates five insightful technical articles covering React UI paradigm shifts, a Rust beginner’s journey to production, performance improvements in a mini‑program simulator, integration of the Qwen‑2.5‑VL model with Midscene.js, and Didi’s experience in managing technical debt for internationalization.

AIReactRust

0 likes · 5 min read

Zhuanzhuan Frontend Weekly – Curated Technical Articles

Java Architecture Diary

Mar 3, 2025 · Frontend Development

Boost Real-Time AI Streams in the Browser with fetch-event-source

This article explains how Server‑Sent Events (SSE) work, outlines the limitations of the native EventSource API, and demonstrates how the fetch‑event‑source library enhances SSE with POST support, custom headers, retry strategies, and visibility handling, enabling efficient real‑time AI data streams in modern web front‑ends.

AIJavaScriptReal-time Streaming

0 likes · 6 min read

Boost Real-Time AI Streams in the Browser with fetch-event-source

Big Data Technology & Architecture

Mar 3, 2025 · Big Data

The Turning Point for Data Development: From Traditional Data Engineering to AI Data Engineering

The article analyzes how the rapid rise of open‑source large‑model AI in 2025 is reshaping the data development profession, urging developers to transition from specialized data‑engineer roles to full‑stack AI data engineering skills such as distributed computing, lake‑house architectures, and model tuning.

AIBig DataFlink

0 likes · 7 min read

The Turning Point for Data Development: From Traditional Data Engineering to AI Data Engineering

Java Architect Essentials

Mar 2, 2025 · Artificial Intelligence

Zero‑Code Local Deployment of DeepSeek LLM on Consumer GPUs Using Ollama

This guide explains why DeepSeek is a compelling GPT‑4‑level alternative, provides hardware recommendations for various model sizes, and walks through a three‑step Windows deployment using Ollama, including installation, environment configuration, model download, performance tuning, and common troubleshooting tips.

AIDeepSeekGPU

0 likes · 8 min read

Zero‑Code Local Deployment of DeepSeek LLM on Consumer GPUs Using Ollama

Data Thinking Notes

Mar 2, 2025 · Artificial Intelligence

How DeepSeek’s Open‑Source Week Accelerates AI with Cutting‑Edge GPU and Storage Innovations

During DeepSeek’s Open‑Source Week (Feb 24‑28), five production‑tested projects were released, spanning GPU‑optimized MLA kernels, MoE communication libraries, high‑performance FP8 GEMM, dual‑pipeline parallelism, and a AI‑focused distributed file system, each delivering significant performance and efficiency gains for large‑scale AI workloads.

AIDistributed TrainingGPU Optimization

0 likes · 13 min read

How DeepSeek’s Open‑Source Week Accelerates AI with Cutting‑Edge GPU and Storage Innovations

AI Algorithm Path

Mar 2, 2025 · Artificial Intelligence

Exploring Flux Labs AI’s New Virtual Try‑On Feature

The article reviews Flux Labs AI’s newly added virtual try‑on tool, explaining how AI, machine‑learning and computer‑vision enable seamless clothing overlays, outlining its main applications, providing a step‑by‑step usage guide, detailing pricing plans, and sharing the author’s positive performance impressions.

AIFlux LabsImage Generation

0 likes · 5 min read

Exploring Flux Labs AI’s New Virtual Try‑On Feature

DataFunTalk

Mar 2, 2025 · Artificial Intelligence

Implementing GRPO from Scratch with Distributed Reinforcement Learning on Qwen2.5-1.5B-Instruct

This tutorial explains how to build a distributed reinforcement‑learning pipeline using the GRPO algorithm, covering data preparation, evaluation and reward functions, multi‑GPU DataParallel implementation, and full fine‑tuning of the Qwen2.5‑1.5B‑Instruct model with PyTorch, FlashAttention2 and Weights & Biases.

AIDistributed TrainingGRPO

0 likes · 10 min read

Implementing GRPO from Scratch with Distributed Reinforcement Learning on Qwen2.5-1.5B-Instruct

DataFunTalk

Mar 2, 2025 · Artificial Intelligence

Top 10 AI Research Papers of 2024: Summaries, Contributions, and Practical Uses

This article presents a curated selection of ten groundbreaking 2024 AI research papers, detailing each model’s abstract, key contributions, and practical application scenarios across computer vision, multimodal learning, NLP, and efficient inference, offering readers inspiration and actionable insights for real‑world projects.

2024 researchAINLP

0 likes · 18 min read

Top 10 AI Research Papers of 2024: Summaries, Contributions, and Practical Uses

JD Retail Technology

Mar 1, 2025 · Industry Insights

How JD Retail’s AI Assistant Uses Multimodal LLMs to Boost E‑Commerce

JD Retail’s AI assistant combines a Master‑Sub agent framework, ReAct paradigm, multimodal integration and MoE architecture to improve sales forecasting, pricing, and recommendation accuracy, while the team’s collaborative culture and open talent pathways illustrate how cutting‑edge AI is applied in real‑world e‑commerce.

AIJD RetailLLM

0 likes · 8 min read

How JD Retail’s AI Assistant Uses Multimodal LLMs to Boost E‑Commerce

ITPUB

Mar 1, 2025 · Artificial Intelligence

Can DeepSeek AI Replace Your DBA? Real-World Database Scenarios Tested

This article examines DeepSeek, a Chinese AGI‑focused AI model, explains prompt‑engineering techniques, and evaluates its performance across database architecture, development, and operations tasks through concrete Q&A examples, SQL plan analysis, and shell‑script generation, while also discussing its broader impact on professionals, vendors and enterprises.

AIDeepSeekPrompt engineering

0 likes · 10 min read

Can DeepSeek AI Replace Your DBA? Real-World Database Scenarios Tested

IT Architects Alliance

Feb 28, 2025 · Industry Insights

How AIGC Is Redefining Full‑Stack Development in 2025

In 2025, AIGC technology is transforming every stage of full‑stack development—from precise AI‑driven requirement analysis and automated UI design to code generation and intelligent testing—while also raising technical, ethical, and talent challenges that developers must address.

AIAIGCFull-Stack Development

0 likes · 22 min read

How AIGC Is Redefining Full‑Stack Development in 2025

Code Mala Tang

Feb 28, 2025 · Fundamentals

Why AI Code Generation Needs Test‑Driven Development: Avoid Hidden Bugs

This article explains how AI‑generated code can be fast but unreliable, and demonstrates how applying Test‑Driven Development (TDD) with concrete Python examples catches errors like stack overflows, edge‑case failures, and security issues, ensuring robust, maintainable software.

AIPythonSoftware Testing

0 likes · 13 min read

Why AI Code Generation Needs Test‑Driven Development: Avoid Hidden Bugs

DataFunSummit

Feb 28, 2025 · Big Data

Apache Gravitino: Open‑Source Data Asset Management for AI and Multi‑Cloud Environments

This article introduces Apache Gravitino, an open‑source metadata and data‑asset management platform designed to address AI‑driven data demands and multi‑cloud challenges, detailing its architecture, core components, typical use cases, real‑world success stories, and a Q&A session on its capabilities.

AIApache GravitinoBig Data

0 likes · 18 min read

Apache Gravitino: Open‑Source Data Asset Management for AI and Multi‑Cloud Environments

AI Product Manager Community

Feb 28, 2025 · Artificial Intelligence

What’s Inside DeepSeek’s Open‑Source Week? DualPipe, EPLB, 3FS and More Explained

DeepSeek’s recent Open‑Source Week unveiled a suite of AI‑focused tools—including the DualPipe pipeline parallelism algorithm, the EPLB expert load balancer, detailed training‑inference framework data, the high‑performance 3FS parallel file system, and the Smallpond data‑processing framework—each with GitHub links and performance highlights.

AIDistributed Trainingfile system

0 likes · 7 min read

What’s Inside DeepSeek’s Open‑Source Week? DualPipe, EPLB, 3FS and More Explained

Model Perspective

Feb 28, 2025 · Artificial Intelligence

Boost Your Workflow: Generate Complex Flowcharts Fast with DeepSeek & Draw.io

This guide shows how to combine DeepSeek and Draw.io to quickly generate detailed flowcharts, explains a generic AI‑driven workflow for turning prompts into executable code, and provides a comprehensive table of 20 code formats with their typical applications and example snippets.

AIDraw.ioMermaid

0 likes · 11 min read

Boost Your Workflow: Generate Complex Flowcharts Fast with DeepSeek & Draw.io

AI Large Model Application Practice

Feb 28, 2025 · Artificial Intelligence

How Self-Attention Powers LLMs: A Step‑by‑Step Deep Dive

This article explains the self‑attention mechanism behind large language models, detailing why static word importance fails, how queries, keys, and values are generated, how attention scores are computed, scaled, softmaxed, and used to produce context‑aware word vectors, while noting computational costs.

AILLMSelf-Attention

0 likes · 9 min read

How Self-Attention Powers LLMs: A Step‑by‑Step Deep Dive

Java Tech Enthusiast

Feb 27, 2025 · Artificial Intelligence

Navigating the AI Era: Insights for Senior Engineers and R&D Leaders

A senior technical leader, reflecting on twelve years at a large tech firm, warns that while AI can triple a junior’s output in tasks like refactoring, it cannot replace deep business insight, strategic decision‑making, or mentorship, and urges engineers to treat AI as a helper, focus on high‑level architecture, and expand horizontally into business domains to stay indispensable.

AICareer DevelopmentSoftware Architecture

0 likes · 5 min read

Navigating the AI Era: Insights for Senior Engineers and R&D Leaders

JavaEdge

Feb 27, 2025 · Artificial Intelligence

How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud

This guide walks through deploying the full‑feature DeepSeek V3+R1 model on Tencent Cloud, configuring a smart knowledge‑base application, importing documentation, enabling internet search, tuning retrieval parameters, and publishing the app for public use, all without writing code.

AIDeepSeekKnowledge Base

0 likes · 6 min read

How to Quickly Build a DeepSeek‑Powered Knowledge Base on Tencent Cloud

Python Programming Learning Circle

Feb 26, 2025 · Artificial Intelligence

Key Python 3.13 Features Boosting Machine Learning and AI Performance

Python 3.13 introduces experimental free‑threading, a JIT compiler, enhanced type‑system utilities, asyncio improvements, and standard‑library updates that together aim to reduce the Global Interpreter Lock bottleneck, accelerate compute‑intensive workloads, and simplify deployment of AI and ML applications across diverse platforms.

AIJITML

0 likes · 25 min read

Key Python 3.13 Features Boosting Machine Learning and AI Performance

58UXD

Feb 26, 2025 · Artificial Intelligence

How AI Tools Like Deepseek Transform Design Workflow

This article shows designers how to combine AI services such as Deepseek, JiMeng, Tripo, Tongyi and Jianying to accelerate 3D modeling, PPT creation and short‑video production, turning lengthy manual tasks into fast, creative processes.

3D modelingAIDeepSeek

0 likes · 5 min read

How AI Tools Like Deepseek Transform Design Workflow

Architecture Digest

Feb 26, 2025 · Artificial Intelligence

DeepSeek4j 1.4: A Java Integration Framework for DeepSeek AI Models

DeepSeek4j 1.4 introduces a Java‑native framework that fully preserves DeepSeek's chain‑of‑thought and billing features, adds reactive streaming support, and provides a Spring Boot starter for effortless integration, accompanied by quick‑start code, configuration examples, and a built‑in debugging UI.

AIAPIDeepSeek

0 likes · 5 min read

DeepSeek4j 1.4: A Java Integration Framework for DeepSeek AI Models

macrozheng

Feb 26, 2025 · Databases

Boost Your SQL Workflow with Chat2DB’s AI‑Powered Database Management

This article introduces Chat2DB, an AI‑enhanced SQL client and reporting tool, walks through its key features, Docker‑based installation, practical usage with a SpringBoot‑Vue e‑commerce project, and demonstrates how its built‑in AI can generate SQL queries automatically.

AIChat2DBDatabase Management

0 likes · 4 min read

Boost Your SQL Workflow with Chat2DB’s AI‑Powered Database Management

Java Architecture Diary

Feb 26, 2025 · Databases

Build a Private LLM Knowledge Base with Redis and DeepSeek4J in 10 Minutes

This tutorial shows how to harness Redis's dual role as a high‑performance cache and a vector database, guiding you through Docker setup, vector storage methods, and Java Lettuce integration to build a private large‑language‑model knowledge base with DeepSeek4J.

AIDeepSeekLettuce

0 likes · 6 min read

Build a Private LLM Knowledge Base with Redis and DeepSeek4J in 10 Minutes

Model Perspective

Feb 26, 2025 · Artificial Intelligence

How Do Large Language Models Compress Massive Data? Limits and Techniques

This article explains how large language models act like a super‑library by compressing vast amounts of text using information‑theoretic concepts, probability‑based coding, autoregressive neural networks, and arithmetic coding, while discussing accuracy, compression ratios, and theoretical limits.

AIarithmetic codingautoregressive networks

0 likes · 8 min read

How Do Large Language Models Compress Massive Data? Limits and Techniques

21CTO

Feb 25, 2025 · Artificial Intelligence

How Alibaba’s Qwen 2.5‑Max Challenges GPT‑4o and Redefines China’s AI Race

Chinese tech giants Huawei and Alibaba respond to President Xi’s call for stronger innovation, with Huawei showcasing its HarmonyOS and server‑grade Arm processor while Alibaba unveils the Qwen 2.5‑Max large language model that outperforms leading Western AI systems on multiple benchmarks, highlighting China’s accelerating AI ambitions.

AIAlibabaChina

0 likes · 5 min read

How Alibaba’s Qwen 2.5‑Max Challenges GPT‑4o and Redefines China’s AI Race

Alibaba Cloud Native

Feb 25, 2025 · Cloud Native

Build an AI‑Powered English Speaking Coach with Alibaba Cloud Function Compute

This guide walks you through creating an AI English‑speaking companion by deploying a web app with Function Compute, integrating Alibaba's AI model platform, Intelligent Media Service, and real‑time audio (ARTC), covering architecture, workflow setup, service configuration, deployment steps, and validation.

AIEnglish learningFunction Compute

0 likes · 12 min read

Build an AI‑Powered English Speaking Coach with Alibaba Cloud Function Compute

DataFunSummit

Feb 25, 2025 · Artificial Intelligence

Collecting High-Quality LLM Training Data and Custom Model Training Guide

This article explains what constitutes high‑quality LLM training data, why large datasets are essential, outlines the step‑by‑step process for collecting, preprocessing, and fine‑tuning models, and highlights the best data sources—including web content, books, code repositories, and news—while noting available free datasets.

AILLMWeb Scraping

0 likes · 9 min read

Collecting High-Quality LLM Training Data and Custom Model Training Guide

JD Cloud Developers

Feb 25, 2025 · Artificial Intelligence

How to Access Free DeepSeek AI Models on China’s Supercomputing Center

This guide explains how to obtain free API keys for DeepSeek‑R1:7B, 14B, and 32B models from the National Supercomputing Center, walks through the purchase steps, and provides a Python example for calling the models via the provided endpoint.

AIAPIDeepSeek

0 likes · 3 min read

How to Access Free DeepSeek AI Models on China’s Supercomputing Center

AntTech

Feb 25, 2025 · Artificial Intelligence

Call for Papers: ISSTA 2025 Workshop on Reliable and Trustworthy Software Systems (Ant Group)

The Ant Group invites submissions to its inaugural ISSTA 2025 workshop on reliable, secure, and trustworthy software systems, covering topics such as software reliability, AI-driven testing, model interpretability, LLM verification, runtime analysis, and visualization, with deadlines from March to June 2025.

AIISSTAReliability

0 likes · 5 min read

Call for Papers: ISSTA 2025 Workshop on Reliable and Trustworthy Software Systems (Ant Group)

Baobao Algorithm Notes

Feb 25, 2025 · Artificial Intelligence

FlashMLA vs FlashInfer: DeepSeek Inference Performance Benchmarks Revealed

The author benchmarks DeepSeek's FlashMLA against FlashInfer and several Triton-based implementations, detailing setup challenges, decode‑only bandwidth results, and observations that the official DeepSeek version leads while Triton optimizations show mixed performance across different head sizes.

AIDeepSeekFlashMLA

0 likes · 6 min read

FlashMLA vs FlashInfer: DeepSeek Inference Performance Benchmarks Revealed

AsiaInfo Technology: New Tech Exploration

Feb 24, 2025 · Artificial Intelligence

Can Multi‑Teacher Distillation Overcome Catastrophic Forgetting in Continual Learning?

This paper proposes a multi‑teacher distillation framework for continual learning that combines active data rehearsal with feature‑decoupled distillation, demonstrating superior performance on PASCAL VOC and COCO benchmarks while mitigating catastrophic forgetting and balancing stability‑plasticity trade‑offs.

AICatastrophic Forgettingactive rehearsal

0 likes · 12 min read

Can Multi‑Teacher Distillation Overcome Catastrophic Forgetting in Continual Learning?

大转转FE

Feb 24, 2025 · Industry Insights

What Frontend Trends Matter in 2024‑25? AI, Vision Platform, Electron, ESM & React Animation

This newsletter curates five must‑read frontend articles covering AI‑driven development trends, Kuaishou's Vision platform for motion asset quality, common misconceptions about Electron, the rapid rise of pure ESM modules, and React's first native animation support, offering concise insights and data for developers.

AIESMElectron

0 likes · 4 min read

What Frontend Trends Matter in 2024‑25? AI, Vision Platform, Electron, ESM & React Animation

Java Architecture Diary

Feb 24, 2025 · Artificial Intelligence

Run Large Language Models Directly in Java with Jlama – Quick Start Guide

This article introduces Jlama, an open‑source Java LLM inference engine, outlines its key features, provides step‑by‑step CLI and Maven integration instructions, shows code examples, run logs, and special setup notes for using large language models efficiently within Java applications.

AIInferenceJlama

0 likes · 6 min read

Run Large Language Models Directly in Java with Jlama – Quick Start Guide

Open Source Tech Hub

Feb 23, 2025 · Backend Development

How to Integrate Grok AI into PHP Apps with the Grok‑PHP Client

This guide explains how to install, configure, and use the Grok‑PHP client to call Grok AI models from PHP, covering basic usage, advanced options, available model enums, and streaming responses with code examples.

AIAPIClient Library

0 likes · 5 min read

How to Integrate Grok AI into PHP Apps with the Grok‑PHP Client

Java Web Project

Feb 23, 2025 · Artificial Intelligence

Build Your First AI Chatbot with Spring Boot and DeepSeek LLM

This guide walks you through creating a Spring Boot project, configuring DeepSeek's large language model via SiliconFlow, setting up OpenAI‑compatible parameters, and implementing a REST controller that returns weather forecasts using the model, complete with step‑by‑step code snippets, configuration files, and deployment instructions.

AIChatbotDeepSeek

0 likes · 7 min read

Build Your First AI Chatbot with Spring Boot and DeepSeek LLM

DataFunTalk

Feb 23, 2025 · Artificial Intelligence

Insights from Snowflake CEO Sridhar Ramaswamy on AI Competition, Business Strategy, and Leadership

In this extensive interview, Snowflake CEO Sridhar Ramaswamy shares his perspectives on the AI arms race, the sustainable value of data platforms, competition with rivals like Databricks and DeepSeek, the challenges of scaling a public company, and personal leadership lessons drawn from his career and family life.

AIArtificial IntelligenceBusiness strategy

0 likes · 35 min read

Insights from Snowflake CEO Sridhar Ramaswamy on AI Competition, Business Strategy, and Leadership

ZhongAn Tech Team

Feb 22, 2025 · Artificial Intelligence

How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave

This issue reviews China's first open‑source short‑film model SkyReels‑V1, DeepSeek's Native Sparse Attention breakthrough, xAI's massive Grok‑3 deployment on 200k H100 GPUs, and a knowledge‑graph‑guided RAG framework, highlighting their performance gains, architectural innovations, and industry impact.

AIRAGindustry trends

0 likes · 15 min read

How SkyReels, DeepSeek NSA, Grok‑3, and KG²RAG Are Shaping the Next AI Wave

Java Tech Enthusiast

Feb 22, 2025 · Artificial Intelligence

Grok‑3 Evaluation Controversy and Community Reactions

Three days after Grok‑3’s launch, OpenAI was accused of inflating its benchmark scores by using a “cons@64” method that aggregates 64 answers, a practice critics say unfairly skews comparisons with single‑shot models like o3‑mini, while developers have already begun experimenting with the model in simple games.

AIGrok-3Model Evaluation

0 likes · 5 min read

Grok‑3 Evaluation Controversy and Community Reactions

21CTO

Feb 22, 2025 · Artificial Intelligence

Are AI Coding Assistants Undermining Deep Learning for Developers?

The article argues that while AI tools like Copilot and GPT speed up simple coding tasks, they risk eroding developers' fundamental understanding and critical thinking, citing research that frequent AI use correlates with weaker cognitive skills and urging a balanced, verification‑first approach.

AISoftware Developmentcoding assistants

0 likes · 6 min read

Are AI Coding Assistants Undermining Deep Learning for Developers?

Architecture and Beyond

Feb 22, 2025 · Artificial Intelligence

Understanding Retrieval‑Augmented Generation (RAG) and Its Role in Enhancing Large Language Models

The article explains how the inherent knowledge‑staleness, hallucination, lack of private data, non‑traceable output, limited long‑text handling, and data‑security concerns of large language models can be mitigated by Retrieval‑Augmented Generation, which combines external retrieval, augmentation, and generation to provide up‑to‑date, reliable, and secure AI responses.

AIKnowledge augmentationLLM

0 likes · 15 min read

Understanding Retrieval‑Augmented Generation (RAG) and Its Role in Enhancing Large Language Models

Infra Learning Club

Feb 21, 2025 · Artificial Intelligence

5 Must‑Try Open‑Source AI Projects You Can Start Using Today

This article introduces five open‑source AI tools—a PPT generator, an LLM app development platform, a cloud‑agnostic AI runner, a curated collection of LLM applications, and a one‑click HD video creator—detailing their key features, usage links, and sample configurations.

AIDifyLLM

0 likes · 8 min read

5 Must‑Try Open‑Source AI Projects You Can Start Using Today

Top Architect

Feb 21, 2025 · Artificial Intelligence

DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support

The article introduces DeepSeek4j 1.4, a Java‑based framework that overcomes Spring AI’s limitations by fully preserving DeepSeek’s chain‑of‑thought and billing features, adding reactive streaming, providing Spring Boot starter integration, and offering quick‑start code samples and configuration guidance.

AIChain-of-ThoughtDeepSeek

0 likes · 8 min read

DeepSeek4j 1.4: Java Integration Framework for DeepSeek with Full Chain‑of‑Thought and Streaming Support

Ma Wei Says

Feb 21, 2025 · Artificial Intelligence

How PIKE‑RAG Boosts Retrieval‑Augmented Generation for Industrial AI

PIKE‑RAG, a Retrieval‑Augmented Generation framework from Microsoft Research, tackles knowledge source diversity, one‑size‑fits‑all limitations, and LLMs' lack of domain expertise by building multi‑layer heterogeneous graphs, task‑driven modular pipelines, and a staged L0‑L4 system for more accurate industrial AI responses.

AIKnowledgeGraphLLM

0 likes · 5 min read

How PIKE‑RAG Boosts Retrieval‑Augmented Generation for Industrial AI

Cognitive Technology Team

Feb 20, 2025 · Artificial Intelligence

When Programmers Lose Their Skills: The Hidden Cost of AI Dependency

The article reflects on how reliance on AI tools is eroding developers' fundamental debugging and problem‑solving abilities, proposes a "no‑AI day" regimen to restore deep understanding, and outlines practical rules to balance AI assistance with independent coding practice.

AIproductivityprogramming

0 likes · 6 min read

When Programmers Lose Their Skills: The Hidden Cost of AI Dependency

AI Algorithm Path

Feb 20, 2025 · Artificial Intelligence

What Is Perplexity in Large Language Models?

The article explains perplexity as a metric for evaluating large language models, walks through a step‑by‑step probability calculation for a sample sentence, shows how to normalize by sentence length using the geometric mean, and demonstrates that lower perplexity indicates a more accurate and less uncertain model.

AIEvaluationLanguage Model

0 likes · 6 min read

What Is Perplexity in Large Language Models?

Top Architect

Feb 20, 2025 · Artificial Intelligence

Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization

This guide explains how to download, quantize, and run the full‑size 671‑billion‑parameter DeepSeek R1 model on local hardware using Ollama, covering model selection, hardware requirements, step‑by‑step deployment commands, optional web UI setup, performance observations, and practical recommendations.

AIDeepSeekDynamic Quantization

0 likes · 16 min read

Deploying DeepSeek R1 671B Model Locally with Ollama and Dynamic Quantization

Alibaba Cloud Infrastructure

Feb 20, 2025 · Artificial Intelligence

Deploying DeepSeek‑R1 Large Language Model on Knative with GPU A10

This guide explains how to deploy the DeepSeek‑R1 large language model on a Knative platform using an A10 GPU, covering preparation, service creation with appropriate annotations, YAML configuration, verification via curl, custom domain setup, and optional personal AI assistant deployment.

AIDeepSeekDeployment

0 likes · 8 min read

Deploying DeepSeek‑R1 Large Language Model on Knative with GPU A10

Practical DevOps Architecture

Feb 20, 2025 · Artificial Intelligence

Training MiniDeepSeek V3+R1 from Scratch: Full-Scale Large Model Technical Practice for 2025

This tutorial series provides a step‑by‑step technical guide to training, deploying, and fine‑tuning the MiniDeepSeek V3+R1 large language model, covering model performance, open‑source details, API usage, parameter explanation, multi‑turn chatbot construction, function calling, integration with Open WebUI, GraphRAG, Swarm, and various deployment and optimization techniques.

AIMiniDeepSeekTraining

0 likes · 4 min read

Training MiniDeepSeek V3+R1 from Scratch: Full-Scale Large Model Technical Practice for 2025

Architecture Breakthrough

Feb 20, 2025 · Artificial Intelligence

Can AI Really Replace You? Deepseek vs ChatGPT and How to Stay Ahead

The article analyzes Deepseek’s rapid rise, compares its strengths and limitations to ChatGPT, examines AI’s fundamental weaknesses, and offers practical strategies for individuals to build a “professional + AI” skill set that keeps them indispensable in the evolving AI landscape.

AIArtificial IntelligenceCareer Development

0 likes · 8 min read

Can AI Really Replace You? Deepseek vs ChatGPT and How to Stay Ahead

Data Thinking Notes

Feb 19, 2025 · Artificial Intelligence

DeepSeek Evolution: Key Technical Highlights from V1 to R1

This article examines DeepSeek’s various versions, detailing their core modules, underlying principles, architecture diagrams, and performance metrics, while illustrating the internal logic and advantages of each model to guide enthusiasts, professionals, and practitioners toward deeper AI innovation insights.

AIDeepSeekModel architecture

0 likes · 4 min read

DeepSeek Evolution: Key Technical Highlights from V1 to R1

Java Tech Enthusiast

Feb 19, 2025 · Artificial Intelligence

xAI's Grok 3 Model: Benchmarks, Reasoning, and Industry Reactions

Elon Musk’s xAI introduced the Grok 3 family—trained on roughly 200,000 GPUs and offered in standard, mini and Reasoning versions—that claims top‑slot performance on math, science and coding benchmarks, outpacing Google Gemini, DeepSeek V3, Claude and OpenAI GPT‑4o, while pricing starts at $30 per month and drawing both praise for its speed and criticism for lingering hallucinations and ethical sensitivities.

AIDeepSearchGrok3

0 likes · 16 min read

xAI's Grok 3 Model: Benchmarks, Reasoning, and Industry Reactions

Architect

Feb 18, 2025 · Artificial Intelligence

DeepSeek‑R1: Training Innovations and Architecture for High‑Performance Reasoning LLMs

The article explains how DeepSeek‑R1 advances large language model reasoning by releasing a lightweight distilled version, sharing a complete training pipeline—including pre‑training, supervised fine‑tuning, and reinforcement learning—introducing long‑chain reasoning data, a transitional inference model, and a comprehensive RL optimization that together yield strong mathematical and logical capabilities.

AIDeepSeekModel Training

0 likes · 10 min read

DeepSeek‑R1: Training Innovations and Architecture for High‑Performance Reasoning LLMs

IT Services Circle

Feb 18, 2025 · Fundamentals

Understanding the New‑Quality Internet (Net5.5G): Concepts, Scenarios, and Technologies

The article explains the emerging “new‑quality internet” (Net5.5G) concept, its AI‑driven motivations, the four‑link scenarios, four‑new technologies, and recent industry progress, illustrating how this next‑generation network architecture will support the AI era.

AIDigital InfrastructureIPv6

0 likes · 11 min read

Understanding the New‑Quality Internet (Net5.5G): Concepts, Scenarios, and Technologies

DevOps Cloud Academy

Feb 18, 2025 · Operations

How AI Is Transforming DevOps: 10 Key Benefits

AI is reshaping DevOps by enhancing automation, enabling predictive analytics, optimizing CI/CD pipelines, managing resources intelligently, strengthening security, accelerating incident response, driving data-driven decisions, scaling infrastructure, fostering collaboration, and promoting continuous learning, thereby boosting flexibility, scalability, and reliability of software delivery.

AIDevOpsResource Management

0 likes · 8 min read

How AI Is Transforming DevOps: 10 Key Benefits

Java Tech Enthusiast

Feb 18, 2025 · Industry Insights

Are AI Tools Eroding Our Programming Skills? A Developer’s Self‑Experiment

A seasoned developer reflects on how reliance on AI for documentation, debugging, and code generation has quietly degraded core programming abilities, proposes a weekly "no‑AI" day to reclaim skills, and warns that our dependence on AI may be ten times greater than any productivity boost.

AIIndustry Insightsdeveloper productivity

0 likes · 4 min read

Are AI Tools Eroding Our Programming Skills? A Developer’s Self‑Experiment

Mingyi World Elasticsearch

Feb 18, 2025 · Artificial Intelligence

Master Prompt Engineering for DeepSeek and ChatGPT‑4o: Essential Techniques

This guide explains the fundamentals of prompt engineering for large language models such as DeepSeek and ChatGPT‑4o, illustrating clear‑prompt design, giving models time to think, chaining prompts, iterative refinement, and advanced tricks with concrete good and bad examples.

AIChatGPT-4oDeepSeek

0 likes · 12 min read

Master Prompt Engineering for DeepSeek and ChatGPT‑4o: Essential Techniques

JD Cloud Developers

Feb 18, 2025 · Artificial Intelligence

How JD Advertising Leverages AI Agents to Boost Ad Operations

This article details JD Advertising's practical use of large‑model agents—including RAG, function calls, and workflow automation—to improve service efficiency, monitoring, and developer productivity across its advertising platform.

AIAdvertisingAgent

0 likes · 26 min read

How JD Advertising Leverages AI Agents to Boost Ad Operations

Architects' Tech Alliance

Feb 18, 2025 · Industry Insights

How DeepSeek V3 Is Driving a New Wave of Communication‑Hardware Demand

DeepSeek V3 cuts training to 2.788 M H800 GPU‑hours with FP8 mixed‑precision and a fully optimized framework, slashes token costs by 96% versus ChatGPT O1, and its efficient inference and model‑compression techniques are reshaping AI‑agent development, spurring demand for low‑latency, high‑bandwidth optical modules and edge‑computing infrastructure.

AICommunication IndustryDeepSeek

0 likes · 5 min read

How DeepSeek V3 Is Driving a New Wave of Communication‑Hardware Demand

Bilibili Tech

Feb 18, 2025 · Artificial Intelligence

Algorithmic Empowerment of Bilibili Streaming: VOD Transcoding Decision, Resource Estimation, and Live Comment Semantic Analysis

The article details how Bilibili leverages AI algorithms—including XGBoost, statistical rules, XDeepFM, and fine‑tuned SBERT—to optimize VOD transcoding decisions, estimate compute resources and processing time, and analyze live comments, thereby boosting streaming efficiency, utilization, and user experience.

AITranscoding OptimizationXGBoost

0 likes · 19 min read

Algorithmic Empowerment of Bilibili Streaming: VOD Transcoding Decision, Resource Estimation, and Live Comment Semantic Analysis

Full-Stack DevOps & Kubernetes

Feb 18, 2025 · Cloud Native

Deploy Massive LLMs on Kubernetes: Step‑by‑Step Guide for Ollama and DeepSeek‑R1

This guide explains how to deploy large‑scale AI models such as Ollama and DeepSeek‑R1 on a Kubernetes 1.30 cluster, covering hardware requirements, PVC and deployment manifests, service exposure, image pulling, verification steps, API access, and monitoring with Prometheus and Grafana.

AIDeepSeekKubernetes

0 likes · 12 min read

Deploy Massive LLMs on Kubernetes: Step‑by‑Step Guide for Ollama and DeepSeek‑R1

JD Retail Technology

Feb 18, 2025 · Artificial Intelligence

Engineering Practices of JD Advertising Agent: JDZunTong Intelligent Assistant

JD’s advertising R&D team created the JDZunTong Intelligent Assistant by engineering a modular Agent platform that combines advanced Retrieval‑Augmented Generation (RAG 1.0 → 2.0) and Function‑Call capabilities, a visual designer, custom tool registration, and a native Python workflow engine to deliver intelligent customer service, data queries, and ad creation for merchants.

AIAgentJD Advertising

0 likes · 18 min read

Engineering Practices of JD Advertising Agent: JDZunTong Intelligent Assistant

Architecture & Thinking

Feb 18, 2025 · Artificial Intelligence

Why Is DeepSeek Server Overloaded? Causes and Practical Workarounds

The article investigates why DeepSeek frequently returns a “server busy” message, analyzing factors such as sudden traffic spikes, compute and bandwidth limitations, security attacks, and maintenance policies, and then offers actionable solutions including query optimization, off‑peak usage, third‑party cloud platforms, and local deployment.

AIDeepSeekModel Deployment

0 likes · 10 min read

Why Is DeepSeek Server Overloaded? Causes and Practical Workarounds

Efficient Ops

Feb 17, 2025 · Operations

From Bronze to AI‑Powered Ops: Mastering the Operations Career Ladder

This article explores the hierarchy of operations roles, outlines five career stages from entry‑level to AI‑driven expert, and offers practical advice on building foundations, automation, high‑availability design, and embracing emerging technologies.

AICareer DevelopmentDevOps

0 likes · 6 min read

From Bronze to AI‑Powered Ops: Mastering the Operations Career Ladder

DevOps Cloud Academy

Feb 17, 2025 · Operations

Top 10 AI Tools Transforming DevOps Engineering

This article reviews ten AI‑powered tools—including Jenkins, Ansible, Puppet, Dynatrace, Splunk, GitHub Copilot, New Relic, Azure DevOps, Prometheus, and Chef—that enhance DevOps workflows through predictive analytics, automated rollback, intelligent monitoring, and code assistance, helping teams achieve faster, more reliable software delivery.

AIDevOpsTooling

0 likes · 14 min read

Top 10 AI Tools Transforming DevOps Engineering

DeWu Technology

Feb 17, 2025 · Artificial Intelligence

Optimizing Large Model Inference: High‑Performance Frameworks and Techniques

The article reviews high‑performance inference strategies for large language models such as Deepseek‑R1, detailing CPU‑GPU process separation, Paged and Radix Attention, Chunked Prefill, output‑length reduction, tensor‑parallel multi‑GPU scaling, and speculative decoding, each shown to markedly boost throughput and cut latency in real deployments.

AIDistributed inferenceGPU Acceleration

0 likes · 22 min read

Optimizing Large Model Inference: High‑Performance Frameworks and Techniques

Tencent Technical Engineering

Feb 17, 2025 · Artificial Intelligence

Prompt Engineering: Definitions, Frameworks, Principles, and Advanced Techniques

The guide defines prompts as structured queries that unlock large‑language‑model abilities, outlines five core frameworks (RTF, Chain‑of‑Thought, RISEN, RODES, Density‑Chain), presents two key principles—clear, delimited instructions and explicit reasoning steps—to reduce hallucinations, and surveys advanced techniques such as zero‑shot, few‑shot, RAG, Tree‑of‑Thought and automatic prompt engineering.

AIChain-of-ThoughtRetrieval Augmented Generation

0 likes · 29 min read

Prompt Engineering: Definitions, Frameworks, Principles, and Advanced Techniques

macrozheng

Feb 17, 2025 · Artificial Intelligence

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

This guide explains why DeepSeek4j is needed, its core features, and provides step‑by‑step instructions—including dependency setup, configuration, code examples, and a complete RAG pipeline using Milvus—to help developers quickly create a private AI knowledge base with Spring Boot.

AIDeepSeek4jMilvus

0 likes · 12 min read

Unlock DeepSeek4j 1.4: Build a Private AI Knowledge Base with Spring Boot

AI Product Manager Community

Feb 17, 2025 · Product Management

How AI Can Transform Your Product Roadmap into a Real‑Time Strategic Tool

In today’s fast‑changing market, traditional product planning falls short, so this article explains how AI‑powered data integration, predictive analytics, and dynamic feedback loops can create a real‑time, data‑driven product roadmap, detailing three implementation phases—data unification, intelligent analysis, and continuous adjustment—with practical steps for product managers.

AIData IntegrationRoadmap

0 likes · 8 min read

How AI Can Transform Your Product Roadmap into a Real‑Time Strategic Tool

Java Architecture Stack

Feb 17, 2025 · Artificial Intelligence

How to Deploy the Full-Feature DeepSeek LLM Locally and on Alibaba Cloud

This guide walks you through preparing the environment, installing Docker, cloning the DeepSeek repository, running the model with Docker or Ollama for quick start, using the enterprise API, and deploying the same model on Alibaba Cloud's free Bailei service within minutes.

AIAlibaba CloudDeepSeek

0 likes · 6 min read

How to Deploy the Full-Feature DeepSeek LLM Locally and on Alibaba Cloud

Ops Development & AI Practice

Feb 15, 2025 · Artificial Intelligence

How to Efficiently Fine‑Tune Llama 3 on a Free Colab T4 GPU with Unsloth

This article provides a step‑by‑step, code‑rich tutorial for fine‑tuning the open‑source Llama 3 1B and 3B models on Google Colab using the Unsloth library and LoRA, covering environment setup, model loading, adapter insertion, dataset preparation, training configuration, inference, and model saving, all while keeping GPU memory usage low.

AIColabFine-tuning

0 likes · 13 min read

How to Efficiently Fine‑Tune Llama 3 on a Free Colab T4 GPU with Unsloth

Cognitive Technology Team

Feb 15, 2025 · Artificial Intelligence

The Risks of Over‑reliance on AI: Diminishing Human Cognitive, Creative, and Practical Skills

Over‑reliance on AI can erode a wide range of human abilities—from memory and spatial awareness to critical thinking, creativity, and practical skills—by creating dependency, homogenizing thought, and reducing opportunities for active learning and independent problem‑solving.

AIHuman Skillscognitive decline

0 likes · 4 min read

The Risks of Over‑reliance on AI: Diminishing Human Cognitive, Creative, and Practical Skills

dbaplus Community

Feb 14, 2025 · Databases

How AI Tools Are Transforming the Role of Database Administrators

The article argues that, despite common fears, AI and modern management tools like OEM and DB Console empower DBAs to work more efficiently, improve performance, and stay relevant, while highlighting real-world stories of tool adoption and the challenges of AI hallucinations.

AIDBAperformance optimization

0 likes · 7 min read

How AI Tools Are Transforming the Role of Database Administrators

Architect's Alchemy Furnace

Feb 14, 2025 · Artificial Intelligence

AI-Driven Power Trading: Key Technologies, Architecture, and Future Trends

This article examines how artificial intelligence transforms power trading platforms by addressing challenges of renewable integration, introducing advanced forecasting, autonomous decision engines, market clearing optimization, and innovative architectures, while also analyzing international case studies, regulatory considerations, and future trends such as quantum machine learning and digital twins.

AIDigital TwinMarket Optimization

0 likes · 18 min read

AI-Driven Power Trading: Key Technologies, Architecture, and Future Trends

Tencent Technical Engineering

Feb 14, 2025 · Artificial Intelligence

Technical Overview of DeepSeek Series Models and Innovations

The DeepSeek series introduces a refined Mixture‑of‑Experts architecture with fine‑grained expert partitioning, shared experts, and learnable load‑balancing, alongside innovations such as Group Relative Policy Optimization, Multi‑Head Latent Attention, Multi‑Token Prediction, mixed‑precision FP8 training, and the R1/R1‑Zero models that use Long‑CoT reasoning, reinforcement‑learning pipelines, and distillation to achieve OpenAI‑comparable performance at lower cost.

AIDeepSeekMixture of Experts

0 likes · 25 min read

Technical Overview of DeepSeek Series Models and Innovations

Java Tech Enthusiast

Feb 14, 2025 · Artificial Intelligence

Apple Partners with Alibaba to Develop AI Features for iPhone Users

Apple’s new Apple Intelligence platform, unveiled at WWDC24, will incorporate Alibaba’s Qwen 2.5 Max model to create China‑specific AI features for iPhone users, with a custom dataset and regulatory submission, marking a shift from overseas ChatGPT reliance to a domestic partnership.

AIAlibabaApple

0 likes · 3 min read

Apple Partners with Alibaba to Develop AI Features for iPhone Users

Huolala Tech

Feb 14, 2025 · Artificial Intelligence

How AI‑Driven Loss Prevention Transforms Risk Management Across the Software Lifecycle

This article explains a comprehensive AI‑powered loss‑prevention framework that automatically identifies financial‑risk scenarios in both existing and new code, integrates model‑based detection into product, development, testing, and release stages, and continuously refines coverage through intelligent monitoring and rule enforcement.

AIModel Trainingloss prevention

0 likes · 11 min read

How AI‑Driven Loss Prevention Transforms Risk Management Across the Software Lifecycle

Code Ape Tech Column

Feb 14, 2025 · Artificial Intelligence

Integrating DeepSeek Large Model with Spring AI: A Step‑by‑Step Guide

This article explains how to integrate DeepSeek's large language models—both the chat‑oriented deepseek‑chat and the reasoning‑focused deepseek‑reasoner—into a Spring AI application, covering API key setup, base‑URL configuration, model selection, and providing full code examples for dependency, configuration, and a simple chat controller.

AIChatbotDeepSeek

0 likes · 6 min read

Integrating DeepSeek Large Model with Spring AI: A Step‑by‑Step Guide

Code Mala Tang

Feb 14, 2025 · Artificial Intelligence

Can AI-Generated Love Letters Really Win Hearts? Surprising Stories and Insights

This article explores how AI is being used to write love letters, shares real user experiences ranging from success to failure, and examines the psychological reasons why AI-crafted romance often falls short, offering guidance on balancing technology with genuine emotion.

AIPsychologyauthenticity

0 likes · 9 min read

Can AI-Generated Love Letters Really Win Hearts? Surprising Stories and Insights

Architect

Feb 13, 2025 · Artificial Intelligence

How to Build a Mini ChatGPT on a Single GPU with MiniMind

This article provides a comprehensive, step‑by‑step guide to training and fine‑tuning a miniature large‑language model called MiniMind, covering lightweight model design, open‑source training pipelines, required datasets, tokenizer options, and deployment via a web UI, all using PyTorch on modest hardware.

AILLMMiniMind

0 likes · 11 min read

How to Build a Mini ChatGPT on a Single GPU with MiniMind

AIWalker

Feb 13, 2025 · Artificial Intelligence

How FlashVideo Turns Low‑Res Clips into 4K Video with Minimal Compute

FlashVideo introduces a two‑stage framework that first generates low‑resolution videos with strong prompt fidelity and then uses flow‑matching ODE trajectories to upscale to 4K quality in just four function evaluations, achieving top VBench‑Long scores while cutting generation time by up to five‑fold.

AIFlashVideoVideo Generation

0 likes · 26 min read

How FlashVideo Turns Low‑Res Clips into 4K Video with Minimal Compute

ByteDance Cloud Native

Feb 13, 2025 · Cloud Computing

Deploy the Full‑Size DeepSeek‑R1 Model on Volcengine Cloud with Terraform and Kubernetes

This guide walks you through two practical solutions for deploying the massive DeepSeek‑R1 model on Volcengine Cloud—one using Terraform for a quick two‑node GPU setup and another leveraging cloud‑native multi‑node distributed inference with Kubernetes, covering resource sizing, environment preparation, model download, monitoring, autoscaling, and storage acceleration.

AIKubernetesModel Deployment

0 likes · 22 min read

Deploy the Full‑Size DeepSeek‑R1 Model on Volcengine Cloud with Terraform and Kubernetes

Radish, Keep Going!

Feb 13, 2025 · Cloud Native

How Wise Is Building a Scalable 2025 Tech Stack with Kubernetes and AI

Wise’s 2025 tech stack overhaul details how its 850‑engineer team leverages cloud‑native tools like Kubernetes, Terraform, and AWS, modernizes frontend with Next.js and Storybook, accelerates mobile builds via Swift Package Manager and Gradle, and integrates AI, data pipelines, and observability to support 12.8 million active customers worldwide.

AIDevOpsMobile Development

0 likes · 20 min read

How Wise Is Building a Scalable 2025 Tech Stack with Kubernetes and AI