Tagged articles
34 articles
Page 1 of 1
Geek Labs
Geek Labs
Apr 30, 2026 · Artificial Intelligence

Why the 14-Year-Old ccv Library Remains a Top Choice for Modern Computer Vision

The ccv library, created in 2010 and still actively maintained, offers a highly portable C‑based computer‑vision toolkit with minimal dependencies, a built‑in cache for preprocessing, a full libnnc neural‑network runtime, and easy builds via Bazel, Make, or Swift Package Manager.

C libraryComputer VisionNeural Network
0 likes · 5 min read
Why the 14-Year-Old ccv Library Remains a Top Choice for Modern Computer Vision
Data STUDIO
Data STUDIO
Dec 9, 2025 · Artificial Intelligence

20 Core PyTorch Concepts to Accelerate Your AI Projects

This article walks through twenty essential PyTorch concepts—from basic Tensor creation and manipulation, through autograd and neural‑network construction, to data loading, GPU acceleration, model saving, and practical training tricks—providing concrete code examples and clear explanations for developers eager to build and deploy AI models.

AutogradDataLoaderDeep Learning
0 likes · 16 min read
20 Core PyTorch Concepts to Accelerate Your AI Projects
Tencent Technical Engineering
Tencent Technical Engineering
Dec 3, 2025 · Artificial Intelligence

Why Transformers Power Modern LLMs: A Deep Dive into Architecture and Mechanics

This article provides a comprehensive, step‑by‑step explanation of the Transformer architecture that underpins large language models, covering tokenization, embeddings, positional encoding, attention mechanisms, feed‑forward networks, layer stacking, a detailed translation example, visualized attention weights, and a survey of recent open‑source LLM designs such as DeepSeek V3, OLMo 2, and Gemma 3.

EmbeddingLLMNeural Network
0 likes · 38 min read
Why Transformers Power Modern LLMs: A Deep Dive into Architecture and Mechanics
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 23, 2025 · Artificial Intelligence

Build a Handwritten Digit Recognizer with TensorFlow: Step‑by‑Step MNIST Tutorial

Learn the fundamentals of deep learning by building, training, and evaluating a TensorFlow model that recognizes handwritten digits from the MNIST dataset, covering data preparation, network architecture, activation functions, optimizer choices, model compilation, training loops, evaluation metrics, and visualization of predictions.

Image ClassificationKerasMNIST
0 likes · 20 min read
Build a Handwritten Digit Recognizer with TensorFlow: Step‑by‑Step MNIST Tutorial
JavaEdge
JavaEdge
Feb 23, 2025 · Artificial Intelligence

How Java Developers Can Build Neural Networks with PyTorch: A Step‑by‑Step Guide

This tutorial walks Java developers through the complete workflow of building, training, and evaluating a neural network in PyTorch, covering network definition, data iteration, forward and backward passes, loss calculation, and parameter updates with detailed code examples and Java‑centric analogies.

BackpropagationDeep LearningJava
0 likes · 12 min read
How Java Developers Can Build Neural Networks with PyTorch: A Step‑by‑Step Guide
Test Development Learning Exchange
Test Development Learning Exchange
Nov 28, 2024 · Artificial Intelligence

Introduction to Deep Learning with Keras: Building and Training a Simple Neural Network

This tutorial introduces the fundamentals of deep learning, covering neural network basics, Keras fundamentals, and provides a step‑by‑step Python example that loads the Iris dataset, preprocesses data, builds, compiles, trains, evaluates, visualizes, and predicts with a simple neural network model.

AIDeep LearningKeras
0 likes · 7 min read
Introduction to Deep Learning with Keras: Building and Training a Simple Neural Network
Ctrip Technology
Ctrip Technology
Sep 29, 2024 · Artificial Intelligence

Structured Components-based Neural Network (SCNN) for Multivariate Time Series Forecasting: Theory, Implementation, and Business Application

This article presents the SCNN model for multivariate time series forecasting, explains its decomposition into long‑term, seasonal, short‑term, and co‑evolving components, details the neural‑network‑based fusion and loss design, provides Python code snippets, and demonstrates its practical deployment for business volume prediction at Ctrip.

Neural NetworkPredictionPython
0 likes · 30 min read
Structured Components-based Neural Network (SCNN) for Multivariate Time Series Forecasting: Theory, Implementation, and Business Application
Python Programming Learning Circle
Python Programming Learning Circle
Jul 12, 2024 · Artificial Intelligence

Building a Simple Neural Network from Scratch in Python

This article walks through constructing a basic neural network using only Python and NumPy, explains the underlying concepts such as neurons, training cycles, sigmoid activation, and weight‑adjustment formulas, and provides complete, runnable code with sample inputs and outputs.

Neural NetworkNumPyPython
0 likes · 9 min read
Building a Simple Neural Network from Scratch in Python
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jun 30, 2024 · Artificial Intelligence

Spatial Attention Mechanism and Its PyTorch Implementation

This article explains the principle of spatial attention in convolutional neural networks, details the underlying algorithmic steps, and provides a complete PyTorch implementation including the attention module, full network architecture, and practical considerations for integrating spatial attention into deep learning models.

CNNDeep LearningNeural Network
0 likes · 10 min read
Spatial Attention Mechanism and Its PyTorch Implementation
JD Cloud Developers
JD Cloud Developers
Jun 25, 2024 · Artificial Intelligence

Why Do Large Language Models Output Text Word‑by‑Word? Inside the Transformer Mechanics

This article explains the fundamental architecture of large language models, from the dual file nature of parameters and code, through neural network basics, perceptrons, and weight training, to the Transformer’s tokenization, positional encoding, self‑attention, and inference processes, illustrated with diagrams and examples.

Neural NetworkSelf-AttentionTransformer
0 likes · 22 min read
Why Do Large Language Models Output Text Word‑by‑Word? Inside the Transformer Mechanics
Liangxu Linux
Liangxu Linux
Mar 23, 2024 · Artificial Intelligence

Understanding AI Neurons: A Storytelling Guide to Basics of Neural Networks

This article uses a narrative of an AI neuron to explain fundamental concepts of neural networks, including neuron structure, weighted sums, activation functions, loss functions, gradient descent, and learning rate, making complex AI topics accessible to beginners.

AI basicsNeural Networkactivation function
0 likes · 9 min read
Understanding AI Neurons: A Storytelling Guide to Basics of Neural Networks
Model Perspective
Model Perspective
Aug 30, 2023 · Artificial Intelligence

How Gradient Descent Trains Neural Networks: A Blind Hiker’s Journey

This article uses a blindfolded mountain‑climbing analogy to explain how gradient descent trains neural networks, covering cost functions, learning rates, iterative updates, and provides a Python implementation for a simple three‑layer network example.

AIBackpropagationNeural Network
0 likes · 10 min read
How Gradient Descent Trains Neural Networks: A Blind Hiker’s Journey
Airbnb Technology Team
Airbnb Technology Team
Aug 3, 2023 · Artificial Intelligence

Improving Airbnb Search Ranking Diversity with Neural Networks

Airbnb upgraded its neural‑network ranking system by adding a similarity network that penalizes duplicate‑like listings, enabling the algorithm to present a more diverse set of options, which boosted booking rates, value, and five‑star ratings, demonstrating that reduced result similarity improves overall search quality.

AirbnbDiversityNeural Network
0 likes · 8 min read
Improving Airbnb Search Ranking Diversity with Neural Networks
Rare Earth Juejin Tech Community
Rare Earth Juejin Tech Community
Jul 26, 2023 · Artificial Intelligence

Building and Training a Fully Connected Neural Network for Fashion-MNIST Classification with PyTorch

This tutorial demonstrates how to download the Fashion‑MNIST dataset, build a four‑layer fully connected neural network with PyTorch, and train it using loss functions, Adam optimizer, learning‑rate strategies, and Dropout to achieve high‑accuracy multi‑class image classification.

AdamDeep LearningDropout
0 likes · 17 min read
Building and Training a Fully Connected Neural Network for Fashion-MNIST Classification with PyTorch
Open Source Linux
Open Source Linux
Jul 13, 2023 · Artificial Intelligence

How ChatGPT Works: Inside the Neural Network and Language Model

This article explains the inner workings of ChatGPT, covering its probabilistic token generation, transformer architecture, attention mechanisms, embeddings, training process, and the mathematical principles that enable a massive neural network to produce coherent, human‑like text.

ChatGPTNeural Network
0 likes · 79 min read
How ChatGPT Works: Inside the Neural Network and Language Model
AntTech
AntTech
Mar 31, 2023 · Artificial Intelligence

Web Photo Source Identification Based on Neural Enhanced Camera Fingerprint

This paper presents a neural‑enhanced camera fingerprint framework combined with zero‑knowledge proof and digital signature schemes to reliably trace the originating device of photos, offering high‑accuracy identification, privacy preservation, and resistance to forgery across various application scenarios.

Neural NetworkZero Knowledge Proofcamera fingerprint
0 likes · 8 min read
Web Photo Source Identification Based on Neural Enhanced Camera Fingerprint
DataFunTalk
DataFunTalk
Feb 19, 2023 · Artificial Intelligence

How ChatGPT Works: An In‑Depth Explanation by Stephen Wolfram

This article provides a comprehensive, step‑by‑step explanation of how ChatGPT generates text, covering token probabilities, n‑gram models, embeddings, attention mechanisms, and the Transformer architecture, while illustrating concepts with Wolfram‑language examples and visualizations.

AIChatGPTNeural Network
0 likes · 20 min read
How ChatGPT Works: An In‑Depth Explanation by Stephen Wolfram
Sohu Tech Products
Sohu Tech Products
Feb 1, 2023 · Artificial Intelligence

ChatGPT Writes AI: Building an MNIST Classifier with Keras Using ChatGPT

This article demonstrates how a machine‑learning enthusiast used ChatGPT to generate, modify, and refine Keras code for training, evaluating, visualizing, and deploying a neural‑network model that classifies handwritten digits from the classic MNIST dataset, showcasing the full development workflow.

ChatGPTKerasMNIST
0 likes · 4 min read
ChatGPT Writes AI: Building an MNIST Classifier with Keras Using ChatGPT
vivo Internet Technology
vivo Internet Technology
Aug 24, 2022 · Frontend Development

Applying Self-Attention Based Machine Learning Model to Design-to-Code Layout Prediction

Vivo’s frontend team built a self‑attention‑based machine‑learning model that predicts web‑page layout types (column, row, or absolute) from node dimensions and positions, solving parent‑child and sibling relationships for design‑to‑code conversion, achieving 99.4% accuracy using over 20 k labeled, crawled, and generated samples, while outlining further enhancements.

D2CNeural NetworkSelf-Attention
0 likes · 11 min read
Applying Self-Attention Based Machine Learning Model to Design-to-Code Layout Prediction
Code DAO
Code DAO
May 16, 2022 · Artificial Intelligence

How to Build a Simple Neural Network from Scratch with NumPy

This article walks through implementing a basic multi‑layer neural network using only NumPy, covering terminology, network architecture, forward and backward propagation, activation functions, loss calculation, parameter updates with SGD, and compares the custom model with a Keras implementation.

BackpropagationNeural NetworkNumPy
0 likes · 17 min read
How to Build a Simple Neural Network from Scratch with NumPy
Code DAO
Code DAO
May 12, 2022 · Artificial Intelligence

How Activation Functions Work in Deep Learning

This article explains the role of activation functions in deep learning, covering their definition, why they are needed, the main categories—including linear, binary step, and various non‑linear functions such as Sigmoid, TanH, ReLU, Leaky ReLU, ELU, Softmax and Swish—along with each function's mathematical form, advantages, disadvantages, and practical usage recommendations.

Deep LearningNeural NetworkReLU
0 likes · 13 min read
How Activation Functions Work in Deep Learning
Python Programming Learning Circle
Python Programming Learning Circle
Mar 19, 2022 · Artificial Intelligence

Building a Simple Digital Twin for Lithium‑Ion Batteries Using Python and Neural Networks

The article demonstrates how to build a digital twin for lithium‑ion batteries in Python by constructing a physics‑based model, augmenting it with experimental data using a simple Keras neural network, and visualizing predictions, illustrating the hybrid approach’s improved accuracy over purely empirical methods.

Digital TwinKerasNeural Network
0 likes · 9 min read
Building a Simple Digital Twin for Lithium‑Ion Batteries Using Python and Neural Networks
NetEase Smart Enterprise Tech+
NetEase Smart Enterprise Tech+
Aug 23, 2021 · Artificial Intelligence

How a Lightweight Neural Network Cuts Transient Noise in Real‑Time Audio

NetEase Cloud Communication’s Audio Lab presents a low‑complexity neural‑network denoising algorithm that effectively suppresses both stationary and transient noises while preserving speech quality, detailing its mathematical model, feature design, loss function, GRU‑based architecture, real‑time performance, and comparative evaluation against state‑of‑the‑art methods.

Neural NetworkReal-time Processingaudio denoising
0 likes · 13 min read
How a Lightweight Neural Network Cuts Transient Noise in Real‑Time Audio
ByteDance Dali Intelligent Technology Team
ByteDance Dali Intelligent Technology Team
May 31, 2021 · Frontend Development

Optimized Chinese Character Stroke‑Order Animation: Glyph Extraction, Stroke Segmentation, Median Generation, and Order Determination

This article describes how to integrate and extend the open‑source HanziWriter library for Android, iOS and web, extracting TrueType glyph data, segmenting strokes with corner detection and neural‑network scoring, generating stroke medians via Voronoi diagrams, and determining correct stroke order using a Hungarian‑algorithm match.

HanziWriterJavaScriptNeural Network
0 likes · 16 min read
Optimized Chinese Character Stroke‑Order Animation: Glyph Extraction, Stroke Segmentation, Median Generation, and Order Determination
Taobao Frontend Technology
Taobao Frontend Technology
Oct 27, 2020 · Artificial Intelligence

Mastering Tensors in TensorFlow.js: From Scalars to Neural Networks

This guide explains the fundamentals of tensors in TensorFlow.js—including scalars, vectors, and higher‑dimensional tensors—demonstrates how to convert real‑world data such as the Titanic dataset into tensors, and shows how to build, compile, and train a simple neural network model using appropriate layers, loss functions, and optimizers.

JavaScriptNeural NetworkTensorFlow.js
0 likes · 7 min read
Mastering Tensors in TensorFlow.js: From Scalars to Neural Networks
Xianyu Technology
Xianyu Technology
Feb 27, 2020 · Artificial Intelligence

Data-Driven Simulation for User Activity Retention Prediction

By extracting hour‑level activity logs and training supervised models—including CART, GBDT, and neural networks—on user tags, the team simulated short‑term metrics for new reward campaigns, enabling earlier prediction of next‑day retention and shortening experiment cycles despite delayed T+1 data.

AB testingCARTGBDT
0 likes · 9 min read
Data-Driven Simulation for User Activity Retention Prediction
360 Tech Engineering
360 Tech Engineering
Sep 16, 2019 · Artificial Intelligence

Backpropagation Algorithm for Fully Connected Neural Networks with Python Implementation

This article explains the backpropagation training algorithm for fully connected artificial neural networks, detailing its gradient‑descent basis, mathematical derivation, matrix formulation, and provides a complete Python implementation with mini‑batch stochastic gradient descent, momentum, learning‑rate decay, and experimental results.

BackpropagationMini-BatchNeural Network
0 likes · 14 min read
Backpropagation Algorithm for Fully Connected Neural Networks with Python Implementation
Tencent Advertising Technology
Tencent Advertising Technology
Jun 13, 2019 · Artificial Intelligence

Competition Solution Overview: Data Analysis, Rule‑Based and Neural Network Models for Advertising Prediction

The article details a contestant's end‑to‑end approach for an advertising competition, covering data analysis, rule‑based preprocessing, a three‑layer neural network architecture, model‑rule ensemble weighting, self‑correction strategies for the B phase, and final model‑only solutions that achieved top scores.

AdvertisingNeural Networkcompetition
0 likes · 8 min read
Competition Solution Overview: Data Analysis, Rule‑Based and Neural Network Models for Advertising Prediction
NetEase Media Technology Team
NetEase Media Technology Team
Apr 26, 2019 · Artificial Intelligence

Intelligent Cover Image Selection System for News Articles: Image Quality Assessment and Smart Cropping

The article describes an intelligent cover‑image selection system for NetEase News that automatically filters unsuitable illustrations, assesses image quality with a pairwise‑trained deep model across clarity, color and composition, and smartly crops images using aspect‑ratio‑aware object detection, dramatically cutting manual editing and enabling confidence‑based automatic publishing.

Computer VisionImage CroppingNeural Network
0 likes · 11 min read
Intelligent Cover Image Selection System for News Articles: Image Quality Assessment and Smart Cropping
MaGe Linux Operations
MaGe Linux Operations
Apr 15, 2019 · Artificial Intelligence

How to Build a Breast Cancer Prediction Neural Network from Scratch in Python

This article walks through creating a Python‑based neural network to predict breast cancer using the Wisconsin dataset, covering network architecture, weight and bias initialization, back‑propagation, gradient descent, and the role of activation functions such as sigmoid, tanh, ReLU and Leaky ReLU.

Deep LearningNeural NetworkPython
0 likes · 13 min read
How to Build a Breast Cancer Prediction Neural Network from Scratch in Python
Tencent Cloud Developer
Tencent Cloud Developer
Jan 17, 2019 · Artificial Intelligence

Deep Learning for Big Data Recommendation Systems: Tencent's Industrial Practice

Tencent’s industrial practice shows how a large‑scale offline‑nearline‑online “Shield” recommendation architecture, powered by the DeepR framework built on RCaffe, uses deep semantic embeddings, massive neural networks and reinforcement‑learning decisions to handle billions of daily requests, demonstrating that data richness and engineering capability, not model depth alone, drive performance in big‑data recommendation systems.

Big DataDeep LearningNeural Network
0 likes · 13 min read
Deep Learning for Big Data Recommendation Systems: Tencent's Industrial Practice
Tencent Cloud Developer
Tencent Cloud Developer
Oct 15, 2018 · Artificial Intelligence

Neural Network Fundamentals: Building Your Own Neural Network from Scratch in Python

This tutorial explains neural network fundamentals by defining layers, weights, biases, and sigmoid activation, then walks through building a Python class that implements forward propagation, a sum‑of‑squared‑error loss, and backpropagation using the chain rule and gradient descent to train a simple two‑layer network.

BackpropagationNeural NetworkPython
0 likes · 8 min read
Neural Network Fundamentals: Building Your Own Neural Network from Scratch in Python