Tagged articles

Neural Network

34 articles · Page 1 of 1

Apr 30, 2026 · Artificial Intelligence

Why the 14-Year-Old ccv Library Remains a Top Choice for Modern Computer Vision

The ccv library, created in 2010 and still actively maintained, offers a highly portable C‑based computer‑vision toolkit with minimal dependencies, a built‑in cache for preprocessing, a full libnnc neural‑network runtime, and easy builds via Bazel, Make, or Swift Package Manager.

C libraryCross-PlatformEmbedded

0 likes · 5 min read

Why the 14-Year-Old ccv Library Remains a Top Choice for Modern Computer Vision

Data STUDIO

Dec 9, 2025 · Artificial Intelligence

20 Core PyTorch Concepts to Accelerate Your AI Projects

This article walks through twenty essential PyTorch concepts—from basic Tensor creation and manipulation, through autograd and neural‑network construction, to data loading, GPU acceleration, model saving, and practical training tricks—providing concrete code examples and clear explanations for developers eager to build and deploy AI models.

AutogradDataLoaderDeep Learning

0 likes · 16 min read

20 Core PyTorch Concepts to Accelerate Your AI Projects

Tencent Technical Engineering

Dec 3, 2025 · Artificial Intelligence

Why Transformers Power Modern LLMs: A Deep Dive into Architecture and Mechanics

This article provides a comprehensive, step‑by‑step explanation of the Transformer architecture that underpins large language models, covering tokenization, embeddings, positional encoding, attention mechanisms, feed‑forward networks, layer stacking, a detailed translation example, visualized attention weights, and a survey of recent open‑source LLM designs such as DeepSeek V3, OLMo 2, and Gemma 3.

EmbeddingLLMNeural Network

0 likes · 38 min read

Why Transformers Power Modern LLMs: A Deep Dive into Architecture and Mechanics

Alibaba Cloud Developer

Jul 23, 2025 · Artificial Intelligence

Build a Handwritten Digit Recognizer with TensorFlow: Step‑by‑Step MNIST Tutorial

Learn the fundamentals of deep learning by building, training, and evaluating a TensorFlow model that recognizes handwritten digits from the MNIST dataset, covering data preparation, network architecture, activation functions, optimizer choices, model compilation, training loops, evaluation metrics, and visualization of predictions.

KerasMNISTNeural Network

0 likes · 20 min read

Build a Handwritten Digit Recognizer with TensorFlow: Step‑by‑Step MNIST Tutorial

Code Mala Tang

Jun 17, 2025 · Artificial Intelligence

Build a Handwritten Digit Classifier with PyTorch: Step‑by‑Step Guide

This tutorial walks you through building a digit classifier using PyTorch and the MNIST dataset, covering environment setup, data loading, model construction, training, evaluation, and model persistence while explaining core deep‑learning concepts.

Deep LearningMNISTNeural Network

0 likes · 15 min read

Build a Handwritten Digit Classifier with PyTorch: Step‑by‑Step Guide

JavaEdge

Feb 23, 2025 · Artificial Intelligence

How Java Developers Can Build Neural Networks with PyTorch: A Step‑by‑Step Guide

This tutorial walks Java developers through the complete workflow of building, training, and evaluating a neural network in PyTorch, covering network definition, data iteration, forward and backward passes, loss calculation, and parameter updates with detailed code examples and Java‑centric analogies.

BackpropagationDeep LearningJava

0 likes · 12 min read

How Java Developers Can Build Neural Networks with PyTorch: A Step‑by‑Step Guide

Test Development Learning Exchange

Nov 28, 2024 · Artificial Intelligence

Introduction to Deep Learning with Keras: Building and Training a Simple Neural Network

This tutorial introduces the fundamentals of deep learning, covering neural network basics, Keras fundamentals, and provides a step‑by‑step Python example that loads the Iris dataset, preprocesses data, builds, compiles, trains, evaluates, visualizes, and predicts with a simple neural network model.

AIDeep LearningKeras

0 likes · 7 min read

Introduction to Deep Learning with Keras: Building and Training a Simple Neural Network

Ctrip Technology

Sep 29, 2024 · Artificial Intelligence

Structured Components-based Neural Network (SCNN) for Multivariate Time Series Forecasting: Theory, Implementation, and Business Application

This article presents the SCNN model for multivariate time series forecasting, explains its decomposition into long‑term, seasonal, short‑term, and co‑evolving components, details the neural‑network‑based fusion and loss design, provides Python code snippets, and demonstrates its practical deployment for business volume prediction at Ctrip.

Neural NetworkPythonSCNN

0 likes · 30 min read

Structured Components-based Neural Network (SCNN) for Multivariate Time Series Forecasting: Theory, Implementation, and Business Application

Python Programming Learning Circle

Jul 12, 2024 · Artificial Intelligence

Building a Simple Neural Network from Scratch in Python

This article walks through constructing a basic neural network using only Python and NumPy, explains the underlying concepts such as neurons, training cycles, sigmoid activation, and weight‑adjustment formulas, and provides complete, runnable code with sample inputs and outputs.

Artificial IntelligenceNeural NetworkNumPy

0 likes · 9 min read

Building a Simple Neural Network from Scratch in Python

Rare Earth Juejin Tech Community

Jun 30, 2024 · Artificial Intelligence

Spatial Attention Mechanism and Its PyTorch Implementation

This article explains the principle of spatial attention in convolutional neural networks, details the underlying algorithmic steps, and provides a complete PyTorch implementation including the attention module, full network architecture, and practical considerations for integrating spatial attention into deep learning models.

CNNDeep LearningNeural Network

0 likes · 10 min read

Spatial Attention Mechanism and Its PyTorch Implementation

JD Cloud Developers

Jun 25, 2024 · Artificial Intelligence

Why Do Large Language Models Output Text Word‑by‑Word? Inside the Transformer Mechanics

This article explains the fundamental architecture of large language models, from the dual file nature of parameters and code, through neural network basics, perceptrons, and weight training, to the Transformer’s tokenization, positional encoding, self‑attention, and inference processes, illustrated with diagrams and examples.

Large Language ModelNeural NetworkSelf-Attention

0 likes · 22 min read

Why Do Large Language Models Output Text Word‑by‑Word? Inside the Transformer Mechanics

Liangxu Linux

Mar 23, 2024 · Artificial Intelligence

Understanding AI Neurons: A Storytelling Guide to Basics of Neural Networks

This article uses a narrative of an AI neuron to explain fundamental concepts of neural networks, including neuron structure, weighted sums, activation functions, loss functions, gradient descent, and learning rate, making complex AI topics accessible to beginners.

AI basicsNeural Networkactivation function

0 likes · 9 min read

Understanding AI Neurons: A Storytelling Guide to Basics of Neural Networks

Model Perspective

Aug 30, 2023 · Artificial Intelligence

How Gradient Descent Trains Neural Networks: A Blind Hiker’s Journey

This article uses a blindfolded mountain‑climbing analogy to explain how gradient descent trains neural networks, covering cost functions, learning rates, iterative updates, and provides a Python implementation for a simple three‑layer network example.

AIBackpropagationNeural Network

0 likes · 10 min read

How Gradient Descent Trains Neural Networks: A Blind Hiker’s Journey

Airbnb Technology Team

Aug 3, 2023 · Artificial Intelligence

Improving Airbnb Search Ranking Diversity with Neural Networks

Airbnb upgraded its neural‑network ranking system by adding a similarity network that penalizes duplicate‑like listings, enabling the algorithm to present a more diverse set of options, which boosted booking rates, value, and five‑star ratings, demonstrating that reduced result similarity improves overall search quality.

AirbnbDiversityNeural Network

0 likes · 8 min read

Improving Airbnb Search Ranking Diversity with Neural Networks

Rare Earth Juejin Tech Community

Jul 26, 2023 · Artificial Intelligence

Building and Training a Fully Connected Neural Network for Fashion-MNIST Classification with PyTorch

This tutorial demonstrates how to download the Fashion‑MNIST dataset, build a four‑layer fully connected neural network with PyTorch, and train it using loss functions, Adam optimizer, learning‑rate strategies, and Dropout to achieve high‑accuracy multi‑class image classification.

AdamDeep LearningDropout

0 likes · 17 min read

Building and Training a Fully Connected Neural Network for Fashion-MNIST Classification with PyTorch

Open Source Linux

Jul 13, 2023 · Artificial Intelligence

How ChatGPT Works: Inside the Neural Network and Language Model

This article explains the inner workings of ChatGPT, covering its probabilistic token generation, transformer architecture, attention mechanisms, embeddings, training process, and the mathematical principles that enable a massive neural network to produce coherent, human‑like text.

ChatGPTNeural Network

0 likes · 79 min read

How ChatGPT Works: Inside the Neural Network and Language Model

AntTech

Mar 31, 2023 · Artificial Intelligence

Web Photo Source Identification Based on Neural Enhanced Camera Fingerprint

This paper presents a neural‑enhanced camera fingerprint framework combined with zero‑knowledge proof and digital signature schemes to reliably trace the originating device of photos, offering high‑accuracy identification, privacy preservation, and resistance to forgery across various application scenarios.

Neural NetworkZero Knowledge Proofcamera fingerprint

0 likes · 8 min read

Web Photo Source Identification Based on Neural Enhanced Camera Fingerprint

DataFunTalk

Feb 19, 2023 · Artificial Intelligence

How ChatGPT Works: An In‑Depth Explanation by Stephen Wolfram

This article provides a comprehensive, step‑by‑step explanation of how ChatGPT generates text, covering token probabilities, n‑gram models, embeddings, attention mechanisms, and the Transformer architecture, while illustrating concepts with Wolfram‑language examples and visualizations.

AIChatGPTLarge Language Model

0 likes · 20 min read

How ChatGPT Works: An In‑Depth Explanation by Stephen Wolfram

Sohu Tech Products

Feb 1, 2023 · Artificial Intelligence

ChatGPT Writes AI: Building an MNIST Classifier with Keras Using ChatGPT

This article demonstrates how a machine‑learning enthusiast used ChatGPT to generate, modify, and refine Keras code for training, evaluating, visualizing, and deploying a neural‑network model that classifies handwritten digits from the classic MNIST dataset, showcasing the full development workflow.

ChatGPTKerasMNIST

0 likes · 4 min read

ChatGPT Writes AI: Building an MNIST Classifier with Keras Using ChatGPT

vivo Internet Technology

Aug 24, 2022 · Frontend Development

Applying Self-Attention Based Machine Learning Model to Design-to-Code Layout Prediction

Vivo’s frontend team built a self‑attention‑based machine‑learning model that predicts web‑page layout types (column, row, or absolute) from node dimensions and positions, solving parent‑child and sibling relationships for design‑to‑code conversion, achieving 99.4% accuracy using over 20 k labeled, crawled, and generated samples, while outlining further enhancements.

D2CNeural NetworkSelf-Attention

0 likes · 11 min read

Applying Self-Attention Based Machine Learning Model to Design-to-Code Layout Prediction

Code DAO

May 16, 2022 · Artificial Intelligence

How to Build a Simple Neural Network from Scratch with NumPy

This article walks through implementing a basic multi‑layer neural network using only NumPy, covering terminology, network architecture, forward and backward propagation, activation functions, loss calculation, parameter updates with SGD, and compares the custom model with a Keras implementation.

BackpropagationNeural NetworkNumPy

0 likes · 17 min read

How to Build a Simple Neural Network from Scratch with NumPy

Code DAO

May 12, 2022 · Artificial Intelligence

How Activation Functions Work in Deep Learning

This article explains the role of activation functions in deep learning, covering their definition, why they are needed, the main categories—including linear, binary step, and various non‑linear functions such as Sigmoid, TanH, ReLU, Leaky ReLU, ELU, Softmax and Swish—along with each function's mathematical form, advantages, disadvantages, and practical usage recommendations.

Deep LearningNeural NetworkReLU

0 likes · 13 min read

How Activation Functions Work in Deep Learning

Python Programming Learning Circle

Mar 19, 2022 · Artificial Intelligence

Building a Simple Digital Twin for Lithium‑Ion Batteries Using Python and Neural Networks

The article demonstrates how to build a digital twin for lithium‑ion batteries in Python by constructing a physics‑based model, augmenting it with experimental data using a simple Keras neural network, and visualizing predictions, illustrating the hybrid approach’s improved accuracy over purely empirical methods.

Digital TwinKerasNeural Network

0 likes · 9 min read

Building a Simple Digital Twin for Lithium‑Ion Batteries Using Python and Neural Networks

360 Quality & Efficiency

Aug 27, 2021 · Artificial Intelligence

Training an Audio Quality Detection Model Using Synthetic Noise and PESQ Scoring

This article explains how to generate low‑quality audio samples from clean speech by randomly inserting noise at various SNR levels, compute objective PESQ scores as ground‑truth, and use these paired data to train a neural‑network model for reference‑free audio quality assessment.

Neural NetworkPESQPython

0 likes · 7 min read

Training an Audio Quality Detection Model Using Synthetic Noise and PESQ Scoring

NetEase Smart Enterprise Tech+

Aug 23, 2021 · Artificial Intelligence

How a Lightweight Neural Network Cuts Transient Noise in Real‑Time Audio

NetEase Cloud Communication’s Audio Lab presents a low‑complexity neural‑network denoising algorithm that effectively suppresses both stationary and transient noises while preserving speech quality, detailing its mathematical model, feature design, loss function, GRU‑based architecture, real‑time performance, and comparative evaluation against state‑of‑the‑art methods.

Neural NetworkReal-time Processingaudio denoising

0 likes · 13 min read

How a Lightweight Neural Network Cuts Transient Noise in Real‑Time Audio

ByteDance Dali Intelligent Technology Team

May 31, 2021 · Frontend Development

Optimized Chinese Character Stroke‑Order Animation: Glyph Extraction, Stroke Segmentation, Median Generation, and Order Determination

This article describes how to integrate and extend the open‑source HanziWriter library for Android, iOS and web, extracting TrueType glyph data, segmenting strokes with corner detection and neural‑network scoring, generating stroke medians via Voronoi diagrams, and determining correct stroke order using a Hungarian‑algorithm match.

HanziWriterJavaScriptNeural Network

0 likes · 16 min read

Optimized Chinese Character Stroke‑Order Animation: Glyph Extraction, Stroke Segmentation, Median Generation, and Order Determination

Taobao Frontend Technology

Oct 27, 2020 · Artificial Intelligence

Mastering Tensors in TensorFlow.js: From Scalars to Neural Networks

This guide explains the fundamentals of tensors in TensorFlow.js—including scalars, vectors, and higher‑dimensional tensors—demonstrates how to convert real‑world data such as the Titanic dataset into tensors, and shows how to build, compile, and train a simple neural network model using appropriate layers, loss functions, and optimizers.

Data preprocessingJavaScriptNeural Network

0 likes · 7 min read

Mastering Tensors in TensorFlow.js: From Scalars to Neural Networks

Xianyu Technology

Feb 27, 2020 · Artificial Intelligence

Data-Driven Simulation for User Activity Retention Prediction

By extracting hour‑level activity logs and training supervised models—including CART, GBDT, and neural networks—on user tags, the team simulated short‑term metrics for new reward campaigns, enabling earlier prediction of next‑day retention and shortening experiment cycles despite delayed T+1 data.

AB testingCARTGBDT

0 likes · 9 min read

Data-Driven Simulation for User Activity Retention Prediction

360 Tech Engineering

Sep 16, 2019 · Artificial Intelligence

Backpropagation Algorithm for Fully Connected Neural Networks with Python Implementation

This article explains the backpropagation training algorithm for fully connected artificial neural networks, detailing its gradient‑descent basis, mathematical derivation, matrix formulation, and provides a complete Python implementation with mini‑batch stochastic gradient descent, momentum, learning‑rate decay, and experimental results.

BackpropagationMini-BatchNeural Network

0 likes · 14 min read

Backpropagation Algorithm for Fully Connected Neural Networks with Python Implementation

Tencent Advertising Technology

Jun 13, 2019 · Artificial Intelligence

Competition Solution Overview: Data Analysis, Rule‑Based and Neural Network Models for Advertising Prediction

The article details a contestant's end‑to‑end approach for an advertising competition, covering data analysis, rule‑based preprocessing, a three‑layer neural network architecture, model‑rule ensemble weighting, self‑correction strategies for the B phase, and final model‑only solutions that achieved top scores.

AdvertisingNeural Networkcompetition

0 likes · 8 min read

Competition Solution Overview: Data Analysis, Rule‑Based and Neural Network Models for Advertising Prediction

NetEase Media Technology Team

Apr 26, 2019 · Artificial Intelligence

Intelligent Cover Image Selection System for News Articles: Image Quality Assessment and Smart Cropping

The article describes an intelligent cover‑image selection system for NetEase News that automatically filters unsuitable illustrations, assesses image quality with a pairwise‑trained deep model across clarity, color and composition, and smartly crops images using aspect‑ratio‑aware object detection, dramatically cutting manual editing and enabling confidence‑based automatic publishing.

Image CroppingNeural Networkcomputer vision

0 likes · 11 min read

Intelligent Cover Image Selection System for News Articles: Image Quality Assessment and Smart Cropping

MaGe Linux Operations

Apr 15, 2019 · Artificial Intelligence

How to Build a Breast Cancer Prediction Neural Network from Scratch in Python

This article walks through creating a Python‑based neural network to predict breast cancer using the Wisconsin dataset, covering network architecture, weight and bias initialization, back‑propagation, gradient descent, and the role of activation functions such as sigmoid, tanh, ReLU and Leaky ReLU.

Deep LearningNeural NetworkPython

0 likes · 13 min read

How to Build a Breast Cancer Prediction Neural Network from Scratch in Python

Tencent Cloud Developer

Jan 17, 2019 · Artificial Intelligence

Deep Learning for Big Data Recommendation Systems: Tencent's Industrial Practice

Tencent’s industrial practice shows how a large‑scale offline‑nearline‑online “Shield” recommendation architecture, powered by the DeepR framework built on RCaffe, uses deep semantic embeddings, massive neural networks and reinforcement‑learning decisions to handle billions of daily requests, demonstrating that data richness and engineering capability, not model depth alone, drive performance in big‑data recommendation systems.

Big DataDeep LearningNeural Network

0 likes · 13 min read

Deep Learning for Big Data Recommendation Systems: Tencent's Industrial Practice

Tencent Cloud Developer

Oct 15, 2018 · Artificial Intelligence

Neural Network Fundamentals: Building Your Own Neural Network from Scratch in Python

This tutorial explains neural network fundamentals by defining layers, weights, biases, and sigmoid activation, then walks through building a Python class that implements forward propagation, a sum‑of‑squared‑error loss, and backpropagation using the chain rule and gradient descent to train a simple two‑layer network.

BackpropagationNeural NetworkPython

0 likes · 8 min read

Neural Network Fundamentals: Building Your Own Neural Network from Scratch in Python