Tagged articles
Baidu Intelligent Cloud Tech Hub
Sep 5, 2024 · Databases

How Vector Databases Power AI and RAG: Insights from Baidu’s DTCC 2024

This article reviews the 70‑year evolution of databases, explains how vector databases and Retrieval‑Augmented Generation (RAG) are reshaping AI applications, and details Baidu Intelligent Cloud's VectorDB architecture, performance advantages, real‑world use cases, and future trends in data engineering.

AI · Database Architecture · Distributed Systems
0 likes · 16 min read
Baidu MEUX
Sep 4, 2024 · Artificial Intelligence

How Baidu Reimagined Its App Personal Center with AI: Design Strategies and Results

This article examines Baidu's AI‑driven overhaul of its app personal center, detailing the problems of the legacy design, the innovative container and recommendation framework, card‑based content structures, dialogue integration, experimental outcomes, and future design insights for AI‑enhanced user experiences.

AI · Baidu · Personal Center
0 likes · 12 min read
DataFunSummit
Sep 4, 2024 · Artificial Intelligence

How Elasticsearch Powers Retrieval‑Augmented Generation (RAG) Applications

This article explains how Elasticsearch’s advanced search capabilities—including vector and semantic search, hardware acceleration, hybrid retrieval, model re‑ranking, multi‑vector support, and integrated security—enable robust RAG implementations and outlines future directions such as a new compute engine, stronger vector engines, and cloud‑native serverless deployment.

AI · Elasticsearch · Hybrid Search
0 likes · 9 min read
DataFunTalk
Sep 4, 2024 · Artificial Intelligence

Data+AI Data Lake Technologies: Challenges, Apache Iceberg Overview, and Vector Table Implementations with PyIceberg

This article explores the evolution of data lakes for AI, discusses the challenges of AI-era data management, introduces Apache Iceberg and its architecture, demonstrates PyIceberg-based AI training and inference pipelines, and presents vector table designs with LSH indexing and performance optimizations.

AI · Apache Iceberg · Big Data
0 likes · 22 min read
JD Retail Technology
Sep 4, 2024 · Artificial Intelligence

Multimodal Recommendation Algorithms and System Architecture at JD.com

This article presents JD.com’s multimodal recommendation system architecture, covering content understanding, multimodal ranking and recall models, practical deployment pipelines, and future research directions such as large‑model integration and supply‑side generation, all illustrated with detailed diagrams and Q&A.

AI · JD.com · Multimodal
0 likes · 14 min read
Architects' Tech Alliance
Sep 3, 2024 · Industry Insights

How NVIDIA Grace Hopper Superchip Redefines HPC and AI Performance

The article provides an in‑depth technical overview of NVIDIA's Grace Hopper superchip, detailing its heterogeneous CPU‑GPU architecture, high‑bandwidth NVLink‑C2C interconnect, unified memory model, programming support, and system‑level scaling features that together deliver unprecedented performance for high‑performance computing and large‑scale AI workloads.

AI · Grace Hopper · HPC
0 likes · 20 min read
AntTech
Sep 3, 2024 · Artificial Intelligence

2024 Inclusion Bund Conference AI Innovation Competition and Deepfake Challenge Results

The 2024 Inclusion Bund Conference in Shanghai announced the winners of its newly added AI Innovation Competition, including the AFAC Financial Intelligence Contest and the Global Deepfake Attack‑Defense Challenge, highlighting participation from over 7,000 teams across more than 20 countries and showcasing cutting‑edge deepfake detection achievements.

AI · Computer Vision · Dataset
0 likes · 7 min read
NewBeeNLP
Sep 3, 2024 · Industry Insights

Why Pre‑training Teams Boost New Engineers’ Skills Faster Than SFT Teams

The answer explains that joining a pre‑training team accelerates a newcomer’s engineering abilities through hands‑on work with large‑scale data pipelines, distributed training code, and debugging, while SFT teams focus mainly on data labeling, making pre‑training the more effective path for rapid skill growth.

AI · Engineering Skills · SFT
0 likes · 6 min read
IT Services Circle
Sep 2, 2024 · Artificial Intelligence

Using Gemini Nano Prompt API in Chrome Canary for In‑Browser AI

This article explains how Google’s Gemini Nano can run directly in the browser via the Prompt API, guides you through enabling the feature in Chrome Canary, checking model readiness, and provides JavaScript code examples for creating text sessions, streaming responses, and building a simple translation demo.

AI · Chrome Canary · Gemini Nano
0 likes · 8 min read
NewBeeNLP
Sep 2, 2024 · Artificial Intelligence

Boosting Large Language Model Math Reasoning: Mixed Instructions, Synthetic Data, and Training Optimizations

This article presents a comprehensive technical walkthrough on enhancing large language model mathematical reasoning by reviewing model architectures, introducing mixed CoT‑PoT instructions, generating and filtering synthetic data, and applying multi‑stage training optimizations such as RFT, PPO, and DPO, with detailed experimental results and Q&A insights.

AI · Reward model · Training Optimization
0 likes · 17 min read
Alibaba Cloud Big Data AI Platform
Sep 2, 2024 · Artificial Intelligence

Turning PDFs and Word Docs into Searchable Knowledge for RAG Systems

This article explains why generic large language models struggle with domain‑specific data, introduces Retrieval‑Augmented Generation (RAG) as a solution, compares Word and PDF formats, outlines document‑parsing pipelines, reviews open‑source PDF tools, and presents Alibaba Cloud's rule‑based parsing architecture with performance results.

AI · Document Parsing · LLM
0 likes · 13 min read
Test Development Learning Exchange
Sep 1, 2024 · Fundamentals

Python Utility Scripts for Data Cleaning, Translation, File Sync, Cloud Backup, and More

This article presents a collection of Python utility scripts that demonstrate how to clean CSV data, translate text files, synchronize folders, upload files to S3, count directory contents, classify files by type, perform OCR on images, convert video to audio, extract images from webpages, and generate text summaries using modern libraries.

AI · cloud storage · data-cleaning
0 likes · 6 min read
DataFunSummit
Aug 29, 2024 · Artificial Intelligence

Intelligent NPC Practices in Tencent Games: Multi‑Modal LLM Solutions and System Optimizations

This article details Tencent Games' end‑to‑end approach to building intelligent NPCs, covering the opportunities brought by AI, the practical implementation of multimodal LLM‑driven dialogue, knowledge‑augmented retrieval, long‑context handling, safety measures, multimodal expression (voice and facial animation), and system‑level performance optimizations for real‑time deployment.

AI · LLM · Multimodal
0 likes · 18 min read
JD Tech Talk
Aug 29, 2024 · Artificial Intelligence

Content Compliance Domain Overview and Technical Solutions for Price Governance

The article outlines the role of the content compliance domain in e‑commerce, detailing user‑facing issues, business responsibilities, challenges in detection and mitigation, and technical solutions such as comparable‑price models, large‑scale price prediction, and merchant outreach, while also offering personal growth advice for compliance engineers.

AI · NLP · content compliance
0 likes · 9 min read
JD Cloud Developers
Aug 29, 2024 · Artificial Intelligence

How AI Powers E‑Commerce Content Compliance and Price Governance

This article explains how e‑commerce platforms use AI‑driven content compliance to detect malicious products, price manipulation, and counterfeit goods, outlining the technical challenges, core business metrics, model‑based solutions for detecting overpricing, and personal growth advice for compliance engineers.

AI · Computer Vision · NLP
0 likes · 9 min read
Efficient Ops
Aug 28, 2024 · Artificial Intelligence

How Large Language Models Are Revolutionizing Banking Regulatory Interpretation

This article explores how AI-powered large language models enable Chinese commercial banks to automate, accurately match, and predict regulatory requirements, detailing new use‑cases, a prompt‑engineering framework, and the resulting efficiency and risk‑reduction benefits for the financial sector.

AI · Banking · Prompt engineering
0 likes · 7 min read
Qunar Tech Salon
Aug 28, 2024 · Databases

Why Vector Databases Are Needed, PgVector Installation, Usage, and Operational Practices in PostgreSQL

This article explains the necessity of vector databases for AI workloads, reviews the PostgreSQL ecosystem, compares vector database options, provides detailed PgVector installation and usage steps, shares operational best‑practices, performance tuning tips, and real‑world deployment cases at Qunar and Tujia.

AI · RAG · performance tuning
0 likes · 24 min read
21CTO
Aug 28, 2024 · Fundamentals

Linus Torvalds on Linux Kernel’s Future: Versions, Rust, AI, and Security

At the 2024 OpenSource Summit China in Hong Kong, Linus Torvalds and Dirk Hohndel discussed the Linux kernel’s development roadmap, release cadence, security practices, the slow adoption of Rust, and the potential role of AI, while sharing candid views on cloud, Kubernetes, and legacy kernel support.

AI · Linux · OpenSource Summit
0 likes · 12 min read
Baidu Tech Salon
Aug 27, 2024 · Artificial Intelligence

How PaddleX Enables Early Detection of Malignant Skin Tumors with AI Segmentation

This article examines the urgent need for early skin cancer detection in China, outlines the challenges of dermatological imaging, and details a low‑code PaddleX solution that leverages PP‑LiteSeg‑T for data preparation, model training, optimization, and deployment to improve diagnostic accuracy and efficiency.

AI · Deep Learning · PaddleX
0 likes · 10 min read
DevOps
Aug 26, 2024 · Operations

The Evolution of Operations: From Manual Ops to AIOps and ChatOps

This article explores the progression of IT operations—from manual processes through automated DevOps, to AI‑driven AIOps and chat‑based ChatOps—examining concepts, advantages, tools, and future possibilities, while also reflecting on how these trends reshape the role of operations engineers.

AI · ChatOps · Collaboration
0 likes · 12 min read
DataFunTalk
Aug 26, 2024 · Artificial Intelligence

EasyRec Recommendation Algorithm Training and Inference Optimization

This article presents a comprehensive overview of EasyRec's recommendation system architecture, detailing training and inference optimizations, distributed deployment strategies, operator fusion techniques, online learning pipelines, and network-level improvements to enhance performance and scalability.

AI · Inference Optimization · Training Optimization
0 likes · 15 min read
JD Tech Talk
Aug 26, 2024 · Artificial Intelligence

Preference-oriented Diversity Model Based on Mutual Information for E-commerce Re-ranking (SIGIR 2024)

The paper proposes PODM‑MI, a mutual‑information‑driven, preference‑oriented diversity model that jointly optimizes accuracy and diversity in e‑commerce search re‑ranking by modeling user preferences with multivariate Gaussian distributions and adapting rankings via a learnable utility matrix, showing significant gains in JD's main search experiments.

AI · Diversity · E-commerce Search
0 likes · 12 min read
DataFunSummit
Aug 25, 2024 · Artificial Intelligence

Applying Large AI Models to Financial Data Governance and Innovative Use Cases

This article presents a comprehensive technical overview of how large AI models are reshaping financial data production, governance, multimodal document understanding, lakehouse storage, private‑domain model deployment, data‑centric engineering methods, and multi‑agent intelligent advisory within the finance sector.

AI · Multi-Agent · Multimodal
0 likes · 21 min read
Java High-Performance Architecture
Aug 25, 2024 · Artificial Intelligence

Can AI Ace the Gaokao Math Test? Surprising Results from Six Top LLMs

A recent evaluation had six leading large‑language‑model products (GPT‑4o, GLM‑4, Wenxin 4.0, Doubao, Baichuan 4, and Qwen‑2.5) answer the first 14 objective questions of the new Gaokao mathematics I paper, revealing that only GLM‑4 surpassed the 60% passing threshold while the others performed far below expectations.

AI · GLM-4 · Gaokao
0 likes · 7 min read
DataFunTalk
Aug 24, 2024 · Artificial Intelligence

Improving the Mathematical Reasoning Ability of Large Language Models: Overview, Mixed Instructions, Synthetic Data, and Training Optimization

This article presents a comprehensive approach to enhancing large language models' mathematical reasoning by reviewing model architectures, introducing mixed CoT‑PoT instructions, generating and filtering synthetic data, and applying multi‑stage training optimizations such as RFT, PPO, and DPO, with detailed experimental results and Q&A.

AI · Reward model · large language models
0 likes · 16 min read
Xiaohongshu Tech REDtech
Aug 23, 2024 · Artificial Intelligence

Xiaohongshu REDtech Live: Presentation of Recent Top‑Conference Papers (Recruitment Session)

On August 24, 2024, Xiaohongshu’s technical team will livestream a four‑hour REDtech session across WeChat Channels, its recruitment account, and Bilibili, showcasing recent top‑conference papers—from ACL and CVPR to ICLR and AAAI—covering innovations such as KV‑cache compression, zero‑shot image generation, early‑stopping self‑consistency, negative‑sample‑aware distillation, and real‑time nearest‑neighbor search, while allowing live interaction and offering surprise merchandise.

AI · Conference Papers · Xiaohongshu
0 likes · 18 min read
JD Tech
Aug 23, 2024 · Artificial Intelligence

AI-Powered Automated Exam Generation for Aviation Maintenance Training

This article describes an AI-driven solution that uses vector databases and large language models to automatically generate, evaluate, and maintain training exam questions for aviation maintenance personnel, addressing high document volume, frequent updates, and low training effectiveness.

AI · exam generation · training automation
0 likes · 14 min read
Open Source Tech Hub
Aug 22, 2024 · Artificial Intelligence

Unlock AI Power in PHP: A Hands‑On Guide to TransformersPHP

TransformersPHP brings Hugging Face’s Transformer models to PHP, enabling developers to run thousands of pre‑trained NLP models locally for tasks like text generation, summarisation, and translation, with simple installation, ONNX‑based execution, and a Python‑like pipeline API.

AI · NLP · ONNX
0 likes · 8 min read
DataFunSummit
Aug 22, 2024 · Artificial Intelligence

Multimodal Algorithms for Content Understanding and Distribution in JD E‑commerce

This article presents JD's multimodal content‑understanding framework, detailing its five‑M business characteristics, the architecture of multimodal recall and ranking models, the GMF and MIN modules for semantic alignment and personalization, and future research directions involving large language models and end‑to‑end multimodal encoding.

AI · content understanding · e‑commerce
0 likes · 16 min read
21CTO
Aug 22, 2024 · Artificial Intelligence

Best Programming Languages for AI: Python, R, Java, LISP & More

This article surveys the most suitable programming languages for artificial intelligence, detailing why Python, R, Java, LISP, Prolog, C++, Haskell, JavaScript, and Julia each excel in AI development, and provides practical FAQs for developers choosing the right language.

AI · Python · R
0 likes · 15 min read
AI Large Model Application Practice
Aug 22, 2024 · Artificial Intelligence

Building a Multi‑Agent AI Research Assistant with LangGraph and GPT‑Researcher

This article explains how to construct a multi‑agent AI research assistant using LangGraph and the open‑source GPT‑Researcher project, detailing the system architecture, agent roles, state design, workflow creation, parallel sub‑processes, and code examples for autonomous online research and report generation.

AI · GPT-Researcher · LangGraph
0 likes · 13 min read
Huolala Tech
Aug 22, 2024 · Artificial Intelligence

How Large Language Models Automate Order Cancellation Responsibility at HuoLala

This article explains how HuoLala leverages large language models, multimodal feature integration, and retrieval‑augmented generation to automatically determine responsibility for order cancellations, improving accuracy, explainability, and driver‑user experience.

AI · Multimodal Retrieval · Order Cancellation
0 likes · 10 min read
Data Thinking Notes
Aug 20, 2024 · Artificial Intelligence

How Large AI Models Transform Data Governance: Strategies and Challenges

This article explores how the rise of massive AI models reshapes data governance, detailing model fundamentals, architectural types, emerging challenges, a five‑domain governance framework, and practical AI‑driven applications for data standards, metadata, quality, and security, while also looking ahead to future trends.

AI · Data Governance · Data Quality
0 likes · 14 min read
Python Programming Learning Circle
Aug 20, 2024 · Game Development

Python Turtle Pong Game with AI Paddle

This tutorial walks you through building a fully functional Pong game in Python using the turtle library, featuring an AI paddle that automatically tracks the ball, complete with step‑by‑step code snippets and a complete source listing for easy copy‑paste implementation.

AI · Game Development · Pong
0 likes · 8 min read
DataFunSummit
Aug 20, 2024 · Artificial Intelligence

Applying Large Language Models to Intelligent Telemarketing: Evolution, Architecture, and Future Outlook

This article reviews the evolution of telephone sales, introduces large model technologies, outlines their integration into intelligent telemarketing workflows, discusses practical implementation methods, challenges, and future trends, and shares insights from industry experts on optimizing AI‑driven sales automation.

AI · Telemarketing · customer experience
0 likes · 17 min read
Volcano Engine Developer Services
Aug 20, 2024 · Databases

How Vector Databases Power RAG: Scaling, Algorithms, and Real‑World Trade‑offs

RAG technology leverages vector databases to provide context‑aware answers without updating model parameters, and this article explores how cloud search teams integrate multiple vector algorithms, balance cost, stability and latency, and adopt open‑source solutions like OpenSearch to build scalable, enterprise‑grade retrieval systems.

AI · DiskANN · OpenSearch
0 likes · 21 min read
DaTaobao Tech
Aug 19, 2024 · Frontend Development

Challenges and Solutions in AI-Powered Front-End Code Generation for B2C Platforms

The article details how Taobao’s AI team automated repetitive UI tasks for B2C front‑end development, achieving a 15% efficiency gain across five projects, and outlines key challenges—prompt cost, low OCR accuracy, hallucinations, excess nodes, and customization variance—along with practical solutions such as a dedicated evaluation platform, OCR translation, model upgrades, prompt segmentation, output simplification, and a reusable component library.

AI · Prompt engineering · UI automation
0 likes · 9 min read
AntTech
Aug 19, 2024 · Information Security

From General Computing to Intelligent Computing to Secure Computing: Key Insights from the 2024 China Cryptography Society Cryptographic Chip Conference

At the 2024 China Cryptography Society Cryptographic Chip Academic Conference in Chengdu, Ant Group’s Vice President Wei Tao highlighted the evolution from general to intelligent to secure computing, emphasizing the pivotal role of cryptographic chips in data protection, AI development, and cross‑industry applications while calling for deeper industry‑academia collaboration.

AI · Privacy Computing · cryptographic chips
0 likes · 8 min read
JD Cloud Developers
Aug 19, 2024 · Artificial Intelligence

How AI Can Automate Aviation Maintenance Training Exams

This article details how a large‑model AI combined with a vector‑based knowledge base can automatically generate, update, and validate exam questions for airline maintenance training, addressing the challenges of massive documentation, frequent updates, and manual question creation.

AI · aviation maintenance · question generation
0 likes · 12 min read
JD Tech Talk
Aug 19, 2024 · Artificial Intelligence

AI‑Driven Automated Question Generation for Aviation Maintenance Training

The article describes how JD Aviation’s maintenance department uses a vector‑based knowledge base and large‑language‑model services to automatically generate, evaluate, and maintain training exam questions, addressing the rapid growth of manuals, frequent updates, and the heavy manual workload of traditional test creation.

AI · Knowledge Base · large language model
0 likes · 12 min read
Model Perspective
Aug 19, 2024 · R&D Management

How the Technology Flywheel Predicts the Next Big Tech Breakthroughs

Understanding the technology flywheel—a model inspired by physics that links scientific breakthroughs, market demand, and funding—helps predict which emerging technologies, from AI to quantum computing, will accelerate rapidly, while highlighting the need for scientific thinking, market insight, and capital awareness.

AI · Future Tech · innovation management
0 likes · 6 min read
21CTO
Aug 18, 2024 · Artificial Intelligence

How AI and Blockchain Are Combating Telecom Scams – Plus the Latest Tech Releases

This roundup covers India's TRAI leveraging AI and blockchain to curb telecom scams, Microsoft's new Windows 11 power‑mode for better battery life, the removal of the Aria download tool after abuse, the launch of Flutter 3.24 with Dart 3.5, and KDE Frameworks 6.5.0 updates.

AI · Blockchain · Flutter
0 likes · 6 min read
Code Mala Tang
Aug 17, 2024 · Artificial Intelligence

What Eric Schmidt Says About AI’s Future, Competition, and Industry Shifts

In a candid Stanford talk, former Google CEO Eric Schmidt warned that AI breakthroughs demand massive investment, larger context windows, and aggressive talent strategies, while highlighting CUDA's dominance, the rise of AI agents, geopolitical energy concerns, and the need for organizational innovation to unlock AI's full potential.

AI · Future · competition
0 likes · 37 min read
21CTO
Aug 17, 2024 · Artificial Intelligence

Understanding Large Language Models: Training, Uses, and a Llama 3 Code Demo

This article explains what large language models (LLMs) are, how they are trained, their diverse applications across industries, the challenges they face, and provides a practical Python example using Replicate to run Meta's Llama 3‑70b‑instruct model.

AI · LLM · Prompt engineering
0 likes · 11 min read
DaTaobao Tech
Aug 16, 2024 · Artificial Intelligence

Effective Prompt Design for Large Language Models

Effective prompt design for large language models requires clear goals, relevant context, explicit input/output formats, evaluation criteria, and illustrative examples, combined with specific language, step‑by‑step instructions, edge‑case handling, ethical considerations, and proper tokenization, encoding, decoding, and post‑processing to produce accurate, concise, low‑hallucination responses.

AI · Prompt Design · large language models
0 likes · 33 min read
DataFunSummit
Aug 16, 2024 · Artificial Intelligence

Educational Large Language Model Research and Product Applications for Youth Programming

The presentation outlines the challenges of sparse data and delayed learning effects in youth programming education, introduces three technical breakthroughs—dual‑data model training, hierarchical knowledge‑graph prompting, and reinforcement‑based cognitive recommendation—and showcases product implementations such as the Frog Programming Platform, AI learning machine, and digital‑human recorded courses.

AI · education · large language models
0 likes · 19 min read
Qunhe Technology Quality Tech
Aug 16, 2024 · Artificial Intelligence

How FastGPT Transforms Ticket Handling and Boosts Efficiency by 90%

This article examines the pain points of a custom ticket system, introduces FastGPT’s knowledge‑base and query capabilities, outlines integration architecture and concrete features, and shows how the combined solution reduces ticket resolution time dramatically while improving overall operational efficiency.

AI · FastGPT · Operations
0 likes · 10 min read
JD Retail Technology
Aug 16, 2024 · Artificial Intelligence

Interview with JD Retail AI Director Zhai Zhouwei on the Evolution and Future of E‑commerce Search Powered by Large Models

In this interview, JD Retail’s AI director Zhai Zhouwei outlines the four historical stages of e‑commerce search, explains how large‑model AI is reshaping user interaction, retrieval and content generation, discusses practical challenges and solutions, and shares his vision and advice for enterprises adopting these technologies.

AI · JD.com · NLP
0 likes · 9 min read
Baidu Geek Talk
Aug 14, 2024 · Artificial Intelligence

Sparse Tensor Basics in PaddlePaddle

The article explains how to use PaddlePaddle’s sparse computing features—including basic sparse tensor formats, creation and manipulation of sparse tensors, and building and training sparse neural networks such as a sparse ResNet—to improve memory efficiency and accelerate training on large, zero‑rich datasets.

AI · COO Format · CSR Format
0 likes · 22 min read
DaTaobao Tech
Aug 12, 2024 · Artificial Intelligence

Challenges and Optimization Techniques for Retrieval‑Augmented Generation (RAG)

Deploying large language models faces domain gaps, hallucinations, and high barriers, so Retrieval‑Augmented Generation (RAG) combines retrieval with generation, and advanced optimizations—such as RAPTOR’s hierarchical clustering, Self‑RAG’s self‑reflective retrieval, CRAG’s corrective evaluator, proposition‑level Dense X Retrieval, sophisticated chunking, query rewriting, and hybrid sparse‑dense methods—are essential for improving accuracy, reducing hallucinations, and achieving efficient, scalable performance.

AI · RAG · Retrieval Augmented Generation
0 likes · 22 min read
DataFunSummit
Aug 12, 2024 · Artificial Intelligence

Design and Application of Xiaohongshu Heterogeneous Training and Inference Engine

This article presents a comprehensive overview of Xiaohongshu's heterogeneous training and inference engine, covering the challenges of model engineering, the design of elastic heterogeneous engines, future HPC training frameworks, AI compilation techniques, and a forward‑looking outlook on scalability and performance.

AI · AI Compilation · HPC
0 likes · 19 min read
21CTO
Aug 12, 2024 · Fundamentals

12 Must‑Have Open‑Source Tools Every Developer Should Use

Discover twelve carefully curated open‑source utilities—from IDEs and API testers to AI‑powered terminals and container orchestrators—that can dramatically boost developer productivity and streamline everyday coding workflows.

AI · API · IDE
0 likes · 10 min read
21CTO
Aug 11, 2024 · Artificial Intelligence

Demystifying LLMs: How Tokens, Training, and Transformers Power Generative AI

This article explains the fundamentals of large language models, covering tokenization, probability prediction, Markov chain basics, training data limitations, context windows, and the transition to neural network architectures like Transformers, while providing Python examples and insights into model scaling and the illusion of intelligence.

AI · LLM · Neural Networks
0 likes · 18 min read
DataFunTalk
Aug 11, 2024 · Artificial Intelligence

AI‑Driven Security Operations (AISECOPS): Architecture, Practices, and Evaluation

This article presents a comprehensive overview of AI‑enabled security operations, detailing the industry pain points, the AISECOPS workflow, model selection between OpenAI embeddings and ST5, classification methods, performance and cost evaluations, and future directions for integrating agents and secure AI pipelines.

AI · Cost Evaluation · Ops Automation
0 likes · 22 min read
KooFE Frontend Team
Aug 10, 2024 · Artificial Intelligence

Why AI Developer Tools Miss the Mark and How to Fix Them

This article examines the hype versus reality of AI-powered developer tools, outlines their current limitations such as lack of context awareness and reliability, categorizes the various tool types, and proposes ways to integrate AI more effectively into the software development workflow.

AI · Software Development · developer tools
0 likes · 14 min read
JavaEdge
Aug 9, 2024 · Artificial Intelligence

Build a Graph‑Based LLM Agent with LangGraph: Step‑by‑Step Tutorial

This article introduces LangGraph, a Python library for creating stateful, multi‑agent LLM workflows, explains its loop, persistence, and human‑in‑the‑loop features, shows how to install it, and provides a complete code example that builds, runs, and reuses a searchable AI agent with thread‑level state saving.

AI · LLM · LangChain
0 likes · 10 min read
AI Large Model Application Practice
Aug 9, 2024 · Artificial Intelligence

How to Build and Index Microsoft GraphRAG with Neo4j: A Step‑by‑Step Guide

This article explains the fundamentals of Microsoft GraphRAG, details its indexing pipeline—including text chunking, entity‑relationship extraction, community detection, and description generation—shows how to set up the graphrag library, create adaptive prompts, build the index, and import the resulting graph into Neo4j for visualization and analysis.

AI · GraphRAG · Neo4j
0 likes · 13 min read
Baobao Algorithm Notes
Aug 9, 2024 · Artificial Intelligence

Testing 1M‑Token LLMs with a Novel Medal‑Insertion Benchmark

The article presents a practical method for evaluating 1‑million‑token LLMs by inserting structured medal data into a classic Chinese novel, provides a full Python script for the test, shares results on GLM‑4‑long, and discusses training techniques and open‑source resources for long‑context models.

AI · LLM · Python
0 likes · 10 min read
DataFunSummit
Aug 8, 2024 · Artificial Intelligence

GPU Throughput and Low‑Latency Optimization Practices in JD Advertising

This article presents JD Advertising's technical practices for improving GPU throughput and reducing latency in large‑scale recommendation scenarios, covering system challenges, storage and compute optimizations for training, low‑latency inference techniques, and compiler extensions to handle massive sparse models.

AI · Advertising · Low latency
0 likes · 13 min read
DataFunSummit
Aug 8, 2024 · Artificial Intelligence

Exploring Training and Alignment Techniques for Financial Large Models

The announcement details a DataFun Summit 2024 session where Du Xiaoman AI researcher Huo Liangyu will present on the challenges, development, and alignment methods of the Xuan Yuan financial large language model, highlighting RLHF techniques, data collection, and real‑world deployment insights for the finance sector.

AI · Financial AI · Model Alignment
0 likes · 6 min read
DaTaobao Tech
Aug 7, 2024 · Artificial Intelligence

Overview of Large Model Development, AIGC Practices, and Prompt Engineering

The article surveys the rapid emergence of large AI models and AIGC, explains core concepts like AI, AGI, and LLMs, details prompt‑engineering techniques such as chain‑of‑thought, outlines a seven‑layer AIGC stack, discusses technical and ethical challenges, and highlights future multimodal and industry‑specific applications.

AI · AIGC · LLM
0 likes · 25 min read
DataFunTalk
Aug 7, 2024 · Artificial Intelligence

Multi-Scenario Modeling for NetEase Cloud Music Recommendation: Architecture, Challenges, and Results

This article presents NetEase Cloud Music's multi‑scenario recommendation modeling work, detailing background, overall system architecture, key modules, modeling goals, technical difficulties, performance improvements, future outlook, and a comprehensive Q&A session that addresses practical deployment challenges.

AB testing · AI · Model architecture
0 likes · 14 min read
AntTech
Aug 6, 2024 · Artificial Intelligence

Self‑Supervised Video Copy Localization with Regional Token Representation

The article presents a self‑supervised framework that uses a regional token structure within a Vision Transformer to accurately locate video plagiarism segments, dramatically reducing annotation costs and achieving state‑of‑the‑art performance without manual labeling, while also highlighting its real‑world deployment for copyright protection.

AI · copyright protection · self-supervised learning
0 likes · 5 min read
Volcano Engine Developer Services
Aug 6, 2024 · Artificial Intelligence

How an AI-Powered Bot Turns Excel Files into Interactive Reports

This article introduces an AI‑driven Smart Report Assistant Bot that automatically converts uploaded Excel files into recommended charts, allows users to customize reports, and details the underlying workflow—including Excel parsing, LLM‑generated SQL, dynamic table creation, chart rendering with ECharts, and image‑merging plugins.

AI · Bot · ECharts
0 likes · 8 min read
Open Source Linux
Aug 6, 2024 · Artificial Intelligence

What Is AI? A Beginner’s Guide to Definitions, Types, and Real‑World Impact

This article explains what artificial intelligence (AI) is, how it differs from traditional programming, outlines its main categories, introduces machine learning, deep learning, neural network models such as CNN, RNN, and Transformer, describes large models and GPT, and discusses AI’s wide‑ranging applications and societal implications.

AI · AI applications · Artificial Intelligence
0 likes · 16 min read
DeWu Technology
Aug 5, 2024 · Frontend Development

Large Model Innovations Redefining Frontend Development – Key Takeaways

The July 14 DeWu tech salon showcased how large language models are reshaping frontend development, featuring insights from NetEase, Alibaba, and DeWu experts on AI‑driven low‑code platforms, intelligent coding assistants, and practical implementation strategies, with over 20,000 online viewers.

AI · event recap · frontend
0 likes · 8 min read
Alibaba Cloud Native
Aug 2, 2024 · Cloud Native

How to Build an AI‑Native API Gateway with Higress: ChatGPT‑Next‑Web, RAG, Token Limits & More

This guide walks through creating a full‑featured AI‑native API gateway using Higress, covering architecture setup, AI agent integration, observability, content security, token rate limiting, caching, retrieval‑augmented generation, prompt templates, and intelligent request/response transformation with concrete configuration examples.

AI · LLM · Token Limiting
0 likes · 11 min read
Model Perspective
Aug 2, 2024 · Artificial Intelligence

Unlocking Problem Solving: 7 Powerful Heuristic Algorithms Explained

This article introduces heuristic algorithms—strategies that use experience and trial to quickly find approximate solutions for complex, often NP‑hard problems—detailing seven popular methods such as Greedy, Tabu Search, Simulated Annealing, Ant Colony, Genetic, Particle Swarm, and Artificial Bee Colony, and highlighting their principles, steps, and real‑world insights.

AI · Metaheuristics · heuristic algorithms
0 likes · 10 min read
BirdNest Tech Talk
Aug 2, 2024 · Industry Insights

What’s Next for Go? Inside the Oscar Contributor Agent Project

The article traces the lineage of Go’s technical leadership, explains Russ Cox’s shift to AI, and details the Oscar open‑source contributor‑agent architecture that uses large language models to automate maintenance tasks while preserving deterministic code execution.

AI · Contributor Agent · Industry Insights
0 likes · 10 min read
Java Tech Enthusiast
Aug 1, 2024 · Artificial Intelligence

Apple Intelligence: Inside the New Apple Foundation Model

Apple Intelligence, an on‑device AI suite debuting with iOS 18.1 beta, centers on the Apple Foundation Model—a 3‑billion‑parameter on‑device LLM (and a larger undisclosed cloud version) trained on TPUs with novel RL algorithms and mixed‑precision quantization, delivering Siri, writing assistance, photo search, and benchmark performance that surpasses GPT‑4, though currently limited to paid developers.

AI · Apple Intelligence · Model Training
0 likes · 11 min read
Kuaishou Tech
Jul 31, 2024 · Artificial Intelligence

Kuaishou Showcases AI‑Driven Multimedia Innovations at China Multimedia 2024

At the China Multimedia 2024 conference in Yinchuan, Kuaishou presented its latest AI‑driven large‑model technologies—including text‑to‑image, text‑to‑video, and audio models—alongside advances in intelligent video coding, a new research‑fund initiative, and recent industry awards.

AI · Kuaishou · Multimedia
0 likes · 5 min read
Baidu Geek Talk
Jul 31, 2024 · Artificial Intelligence

Quantitative Analysis of Transformer Architecture and Llama Model Performance

This engineering‑focused document reviews transformer fundamentals, derives precise FLOP and memory formulas for attention and feed‑forward layers, defines the MFU performance metric, analyzes memory components and parallelism strategies, examines recent architecture variants such as MQA, GQA, sliding‑window attention and MoE, and provides practice problems applying these calculations.

AI · GPU computing · Transformer
0 likes · 30 min read
FunTester
Jul 30, 2024 · Operations

Mastering True Observability: Models, Practices, and AI‑Driven Automation

This article explains why true observability is essential for modern software, outlines its five core pillars, details a four‑stage maturity model with benefits and drawbacks, and provides practical steps—including data collection, team organization, and AI automation—to advance from basic monitoring to predictive, self‑healing systems.

AI · Maturity Model · Observability
0 likes · 13 min read
DeWu Technology
Jul 29, 2024 · Artificial Intelligence

AI-Driven Loss Prevention: A Comprehensive Field-Level Risk Control System

The paper introduces an AI‑driven loss‑prevention platform that augments manual risk analysis with automated field recognition to map database and code models, generate loss‑related methods and interfaces, and deliver pre‑emptive avoidance, real‑time detection, and post‑incident response, achieving over 1,200% growth in identified loss methods and near‑full field coverage.

AI · Business Intelligence · database analysis
0 likes · 8 min read
DataFunSummit
Jul 29, 2024 · Artificial Intelligence

Large Language Models for Recommendation Systems: Current Progress, Challenges, and Future Directions

This article reviews the state‑of‑the‑art applications of large language models in recommendation systems, summarizing background knowledge, recent advances such as LLM4Rec, various tuning strategies, agent‑based approaches, open research problems, and future directions for generative recommendation.

AI · In-Context Learning · LLM
0 likes · 24 min read
php Courses
Jul 29, 2024 · Artificial Intelligence

Building Reinforcement Learning Algorithms with PHP

This article explains the fundamentals of reinforcement learning, demonstrates how PHP can be used with neural‑network libraries such as Keras or TensorFlow to implement a simple reinforcement‑learning agent, provides a complete PHP code example, and discusses its potential applications.

AI · Code Example · Reinforcement Learning
0 likes · 5 min read
Full-Stack DevOps & Kubernetes
Jul 29, 2024 · Artificial Intelligence

How to Run Real‑Time Voice Cloning with Python: A Step‑by‑Step Guide

This guide introduces the open‑source Realtime Voice Cloning project, explains its key features, and provides detailed installation and usage instructions—including environment setup, dependency installation, cloning the repository, and running the demo tool—to enable real‑time voice transformation with Python.

AI · Python · Real-time Audio
0 likes · 5 min read
DataFunSummit
Jul 28, 2024 · Artificial Intelligence

Leveraging Large Language Models for Graph Learning: Opportunities, Current Progress, and Future Directions

This article reviews why large language models can be applied to graph learning, outlines their capabilities and graph data characteristics, surveys current research across different graph types and LLM roles, and proposes future research directions for unified cross‑domain graph learning.

AI · Multimodal · Research Directions
0 likes · 19 min read
Python Programming Learning Circle
Jul 27, 2024 · Artificial Intelligence

Numpy‑ML: A Pure NumPy Implementation of Machine Learning Algorithms

The Numpy‑ML project, created by UC Berkeley’s David Bourgin, provides a comprehensive pure‑NumPy implementation of over 30 machine‑learning algorithms—including probabilistic models, neural‑network layers, optimizers, and reinforcement‑learning agents—along with extensive data‑preprocessing utilities, all in a single open‑source repository.

AI · Algorithms · NumPy
0 likes · 6 min read
DataFunTalk
Jul 26, 2024 · Artificial Intelligence

Llama 3: Open‑source Large Language Model Technical Report and Evaluation

This comprehensive technical report details the development, architecture, training methodology, extensive benchmark evaluations, safety measures, and inference optimizations of Meta's open‑source Llama 3 large language model series, covering models up to 405 billion parameters and supporting multilingual, multimodal, and tool‑use capabilities.

AI · LLaMA · Training
0 likes · 115 min read
Data Thinking Notes
Jul 25, 2024 · Information Security

How Large Language Models Transform Data Security Compliance Management

This article explains how a leading insurance technology group leverages large language models to streamline data security compliance, detailing the evolution of data management, key governance challenges, multimodal AI architecture, and practical workflows for policy enforcement, risk monitoring, and asset management.

AI · Data Governance · compliance
0 likes · 10 min read
Architect's Alchemy Furnace
Jul 25, 2024 · Artificial Intelligence

Designing Autonomous LLM Agents: Architecture, Memory, Planning, and Learning Strategies

This article surveys the design of autonomous large‑language‑model agents, detailing their modular architecture—including profiling, memory, planning, and execution—while also reviewing common profiling methods, memory structures, planning techniques, action strategies, and various learning approaches such as exemplar, human‑in‑the‑loop, and environment‑feedback training.

AI · Agent Architecture · Autonomous Agents
0 likes · 36 min read