Tagged articles
1236 articles
Page 8 of 13
MaGe Linux Operations
MaGe Linux Operations
Mar 29, 2021 · Artificial Intelligence

Mastering PyTorch Visualization: TensorBoard and Visdom Guide

This tutorial explains how to install, launch, and use TensorBoard and Visdom with PyTorch, providing step‑by‑step commands, code examples for logging training metrics, and visualizing images and plots to monitor deep‑learning experiments.

Deep LearningPyTorchPython
0 likes · 6 min read
Mastering PyTorch Visualization: TensorBoard and Visdom Guide
iQIYI Technical Product Team
iQIYI Technical Product Team
Mar 26, 2021 · Artificial Intelligence

Insights into OCR Technology at iQIYI: Development, Challenges, and Applications

iQIYI’s OCR journey, explained by researcher Harlon, covers the evolution from separate detection and recognition pipelines to end‑to‑end models, key algorithms like CTPN, DB and CRNN, large‑scale simulated training, diverse video‑text applications, and future goals such as mobile deployment and tighter NLP integration.

AIComputer VisionDeep Learning
0 likes · 21 min read
Insights into OCR Technology at iQIYI: Development, Challenges, and Applications
21CTO
21CTO
Mar 23, 2021 · Artificial Intelligence

How AI is Revolutionizing Monkey Identification: The Tri‑AI System

Researchers at Northwestern University have developed the Tri‑AI system, a deep‑learning facial recognition platform that accurately identifies individual golden snub‑nosed monkeys in the wild, achieving 94% precision and enabling non‑invasive monitoring, data collection, and broader applications across multiple animal species.

AIDeep Learninganimal monitoring
0 likes · 9 min read
How AI is Revolutionizing Monkey Identification: The Tri‑AI System
Kuaishou Tech
Kuaishou Tech
Mar 22, 2021 · Artificial Intelligence

Unified Model Compression Framework (UMEC) for Efficient Recommendation Systems

The paper introduces UMEC, a unified model compression framework that jointly optimizes feature embedding and prediction modules under resource constraints, achieving up to three‑fold compression of recommendation models without sacrificing accuracy, and demonstrates superior performance on multiple benchmark datasets.

AIDeep LearningUMEC
0 likes · 9 min read
Unified Model Compression Framework (UMEC) for Efficient Recommendation Systems
Amap Tech
Amap Tech
Mar 22, 2021 · Artificial Intelligence

Visual Technology for Automated POI Name Generation: STR, Text Detection, and Naming Practices

Amap’s visual‑technology pipeline automatically generates and updates POI names by crowdsourcing street‑level images, applying deep‑learning scene‑text recognition, dual‑branch classification of text attributes, and a BERT‑plus‑graph‑attention model that selects and orders recognized text, achieving about 95 % naming accuracy.

Computer VisionDeep LearningName Generation
0 likes · 14 min read
Visual Technology for Automated POI Name Generation: STR, Text Detection, and Naming Practices
DataFunTalk
DataFunTalk
Mar 17, 2021 · Artificial Intelligence

Deep Ranking Model Evolution and Applications in Taobao Live: DBMTL, DMR, and RUI Ranking

This article presents a comprehensive overview of Taobao Live's deep ranking system evolution, detailing the DBMTL multi‑task learning framework, the two‑tower DMR matching‑ranking architecture, and the RUI Ranking refer‑item model, together with their offline formulas, online deployment scenarios, and measured performance gains across click‑through, watch‑time, and conversion metrics.

AIDeep LearningModel Optimization
0 likes · 27 min read
Deep Ranking Model Evolution and Applications in Taobao Live: DBMTL, DMR, and RUI Ranking
DataFunTalk
DataFunTalk
Mar 14, 2021 · Artificial Intelligence

A Review of Medical Domain Sentiment Analysis: Interpretability, Contextual Aspect‑Sentiment Relations, Noisy Labels, and Domain Lexicon Construction

This article reviews recent research on medical sentiment analysis, covering interpretability of neural models, contextual aspect‑sentiment interactions, strategies for handling noisy labels, and methods for building domain‑specific sentiment lexicons, highlighting challenges and proposed solutions.

Deep LearningInterpretabilitySentiment Analysis
0 likes · 19 min read
A Review of Medical Domain Sentiment Analysis: Interpretability, Contextual Aspect‑Sentiment Relations, Noisy Labels, and Domain Lexicon Construction
DeWu Technology
DeWu Technology
Mar 12, 2021 · Industry Insights

How Do Recommendation Systems Rank Items? A Deep Dive into Models and Strategies

This article explains the architecture and ranking process of modern recommendation systems, covering the two-stage pipeline of candidate generation and ranking, the evolution from rule‑based methods to logistic regression, GBDT, wide‑and‑deep, and deep learning models, and discusses challenges such as feature non‑linearity, multi‑objective optimization, and the need for post‑ranking interventions.

Deep LearningGBDTIndustry Insights
0 likes · 15 min read
How Do Recommendation Systems Rank Items? A Deep Dive into Models and Strategies
MaGe Linux Operations
MaGe Linux Operations
Mar 11, 2021 · Artificial Intelligence

What’s New in PyTorch 1.8? Key Features, APIs, and Performance Boosts

PyTorch 1.8, released by the PyTorch team, bundles over 3,000 commits since 1.7, introducing AMD ROCm support, enhanced Python function conversion, stable FFT and linear‑algebra APIs, complex‑tensor autograd, distributed‑training improvements, new mobile tutorials, performance tools, and several prototype features.

Deep LearningGPUMobile
0 likes · 6 min read
What’s New in PyTorch 1.8? Key Features, APIs, and Performance Boosts
Alibaba Cloud Native
Alibaba Cloud Native
Mar 5, 2021 · Artificial Intelligence

How Alluxio Supercharges Cloud Deep Learning: Benchmarks, Architecture, and Tuning

This article examines why accelerating cloud‑based deep learning is essential, presents benchmark results comparing GPU generations and distributed training, introduces Alluxio as a distributed memory‑level cache, details its architecture on Kubernetes, and offers concrete tuning strategies to overcome I/O bottlenecks and boost training performance.

AIAlluxioDeep Learning
0 likes · 16 min read
How Alluxio Supercharges Cloud Deep Learning: Benchmarks, Architecture, and Tuning
Tencent Cloud Developer
Tencent Cloud Developer
Mar 4, 2021 · Artificial Intelligence

WeChat OCR: Implementation of Image Text Extraction Feature

WeChat’s 8.0 update introduced an OCR pipeline that first quickly detects text in images, classifies the image type, applies a lightweight multi‑language detection network and a MobileNetV3‑based DBNet recognizer with a multi‑task CTC/Attention model, then merges results via a rule‑based layout analyzer to deliver accurate, well‑formatted extracted text across diverse languages and document types.

Computer VisionDBNetDeep Learning
0 likes · 13 min read
WeChat OCR: Implementation of Image Text Extraction Feature
DataFunTalk
DataFunTalk
Mar 2, 2021 · Artificial Intelligence

Multi-Objective Optimization with MMoE for Taobao "Lying Flat" Channel

This article presents the design and implementation of a multi‑objective optimization framework using Multi‑gate Mixture‑of‑Experts (MMoE) to improve click‑through, conversion, and purchase behaviors in Taobao's "Lying Flat" home‑goods recommendation channel, detailing model variants, feature engineering, loss weighting, and online A/B test results.

CTRCVRDeep Learning
0 likes · 10 min read
Multi-Objective Optimization with MMoE for Taobao "Lying Flat" Channel
DataFunTalk
DataFunTalk
Feb 26, 2021 · Artificial Intelligence

Fine‑Grained Sentiment Analysis and Opinion Quadruple Extraction: Methods, Tasks, and Applications

This article introduces the concepts, tasks, and recent advances in text sentiment analysis, focusing on attribute‑level sentiment (TG‑ABSA) and opinion‑quadruple extraction, describing unsupervised, reading‑comprehension, and multi‑task deep‑learning approaches, their implementation on Huawei Cloud, experimental results, and future research directions.

Deep LearningNLPSentiment Analysis
0 likes · 20 min read
Fine‑Grained Sentiment Analysis and Opinion Quadruple Extraction: Methods, Tasks, and Applications
ITPUB
ITPUB
Feb 25, 2021 · Artificial Intelligence

How 58.com Scales Voice Quality Inspection with AI-Powered Architecture

This article details the AI-driven intelligent voice quality inspection system built by 58.com, covering its background, multi‑layer architecture, speech recognition, role and tag identification, backend services, and the resulting efficiency gains for large‑scale call‑center operations.

AIDeep Learningcall center automation
0 likes · 15 min read
How 58.com Scales Voice Quality Inspection with AI-Powered Architecture
Kuaishou Large Model
Kuaishou Large Model
Feb 25, 2021 · Artificial Intelligence

How Kuaishou’s AI‑Powered Beauty Engine Transforms Real‑Time Video

This article details Kuaishou Y‑tech’s Gorgeous beauty platform, covering traditional smoothing, advanced skin‑tone effects, AI‑driven blemish removal, clarity enhancement, local facial tuning, and the UNet‑based GorgeousGAN that delivers one‑click high‑definition beauty for live‑stream and short‑video applications.

AI beautyComputer VisionDeep Learning
0 likes · 13 min read
How Kuaishou’s AI‑Powered Beauty Engine Transforms Real‑Time Video
DataFunTalk
DataFunTalk
Feb 15, 2021 · Artificial Intelligence

Deep Tree Matching (TDM): Evolution and Practice in Large-Scale Retrieval at Alibaba

This article explains Alibaba's Deep Tree Matching (TDM) technology, covering the challenges of large‑scale match retrieval, the progression from classic two‑stage recall to tree‑based indexing, max‑heap tree modeling, beam‑search retrieval, and the joint model‑index learning across TDM 1.0, 2.0, and 3.0, highlighting significant offline and online performance gains and future research directions.

AlibabaBeam SearchDeep Learning
0 likes · 15 min read
Deep Tree Matching (TDM): Evolution and Practice in Large-Scale Retrieval at Alibaba
DataFunTalk
DataFunTalk
Feb 13, 2021 · Artificial Intelligence

Multi-Channel Deep Interest Modeling for 58.com Home Page Recommendations

This article details how 58.com tackled the challenges of multi‑business recommendation on its home page by developing a dual‑channel deep interest model, introducing customized feature‑crossing, optimizing training and online performance, and exploring multi‑channel extensions for broader scenario adaptation.

AIDeep Learningfeature engineering
0 likes · 20 min read
Multi-Channel Deep Interest Modeling for 58.com Home Page Recommendations
DataFunTalk
DataFunTalk
Feb 10, 2021 · Artificial Intelligence

Deep Learning Based Search Ranking Optimization for 58.com Rental Services

This article describes how 58.com’s rental platform leverages deep learning models such as Wide&Deep, DeepFM, DCN, DIN, and DIEN to improve search ranking, detailing data pipelines, feature engineering, model iteration, multi‑task training, prediction optimizations, and resulting online performance gains.

Deep LearningModel Optimizationfeature engineering
0 likes · 27 min read
Deep Learning Based Search Ranking Optimization for 58.com Rental Services
DataFunTalk
DataFunTalk
Feb 4, 2021 · Artificial Intelligence

Cross‑Session Aware Temporal Convolutional Network (CA‑TCN) for Session‑Based Recommendation

The article introduces the CA‑TCN model, which combines cross‑session item graphs, a temporal convolutional network, and a session‑context graph to capture both item‑level and session‑level cross‑session influences, achieving state‑of‑the‑art performance on benchmark session‑based recommendation datasets.

Deep LearningGraph Neural NetworkTemporal Convolutional Network
0 likes · 17 min read
Cross‑Session Aware Temporal Convolutional Network (CA‑TCN) for Session‑Based Recommendation
DataFunTalk
DataFunTalk
Feb 3, 2021 · Artificial Intelligence

Towards Best Possible Deep Learning Acceleration on the Edge – A Compression-Compilation Co-Design Framework

The lecture presented by Assistant Professor Yanzhi Wang introduces a compression‑compilation co‑design framework (CoCoPIE) that achieves real‑time deep‑learning inference on edge devices through novel pruning and quantization techniques, delivering up to 180× speedup without accuracy loss.

AIDeep LearningEdge Computing
0 likes · 5 min read
Towards Best Possible Deep Learning Acceleration on the Edge – A Compression-Compilation Co-Design Framework
Amap Tech
Amap Tech
Feb 1, 2021 · Artificial Intelligence

AMAP-TECH Algorithm Competition: Dynamic Road Condition Analysis Using In-Vehicle Video

The AMAP‑TECH competition challenged participants to infer real‑time road conditions from in‑vehicle video, prompting the authors to combine lane‑wise vehicle detection with LightGBM and later an end‑to‑end DenseNet‑GRU model, augment data, ensemble five networks, and achieve a 0.7237 F1 score while outlining future deployment and research directions.

Computer VisionDeep LearningModel Deployment
0 likes · 15 min read
AMAP-TECH Algorithm Competition: Dynamic Road Condition Analysis Using In-Vehicle Video
JD Tech Talk
JD Tech Talk
Jan 28, 2021 · Artificial Intelligence

Spatial‑Temporal Graph Diffusion Network for City Traffic Flow Forecasting

This article introduces a hierarchical graph neural network model that jointly captures multi‑scale temporal patterns and global spatial context for urban traffic flow prediction, demonstrates its superiority over existing methods on multiple public datasets, and validates each component through extensive ablation studies.

Deep LearningGraph Neural Networkattention
0 likes · 8 min read
Spatial‑Temporal Graph Diffusion Network for City Traffic Flow Forecasting
DataFunTalk
DataFunTalk
Jan 25, 2021 · Artificial Intelligence

Evolution of Zhihu Search Ranking Models: From GBDT to DNN, Multi‑Goal and Context‑Aware LTR

This article reviews the development of Zhihu's search system, describing the transition from early GBDT ranking to deep neural networks, the introduction of multi‑objective and position‑bias‑aware learning‑to‑rank methods, context‑aware techniques, end‑to‑end training, personalization, and future research directions.

DNNDeep LearningGBDT
0 likes · 17 min read
Evolution of Zhihu Search Ranking Models: From GBDT to DNN, Multi‑Goal and Context‑Aware LTR
DataFunTalk
DataFunTalk
Jan 20, 2021 · Artificial Intelligence

Techniques for Reducing the Computational Complexity of Large-Scale Graph Neural Networks

This article presents an overview of graph neural networks, explains their computational framework, analyzes space and time complexities, and proposes ten practical strategies—including edge avoidance, dimensionality reduction, selective iteration, memory baking, distillation, partitioning, sparse computation, routing, and cross-sample feature sharing—to significantly lower the cost of large‑scale GNN processing.

Computational ComplexityDeep Learninglarge scale
0 likes · 14 min read
Techniques for Reducing the Computational Complexity of Large-Scale Graph Neural Networks
DeWu Technology
DeWu Technology
Jan 18, 2021 · Artificial Intelligence

Recall Stage in Recommendation Systems: From Intuition to Deep Learning

The recall stage, the first filtering step after candidate generation, transforms intuitive attribute‑based shortcuts into sophisticated matrix‑factorization and embedding methods—such as dual‑tower and tree‑based models—enabling fast, personalized, diverse candidate selection for real‑time recommendation pipelines.

Deep LearningEmbeddingcollaborative filtering
0 likes · 13 min read
Recall Stage in Recommendation Systems: From Intuition to Deep Learning
DataFunTalk
DataFunTalk
Jan 7, 2021 · Artificial Intelligence

User Preference Mining and Modeling Practices at Beike

This article introduces the concept of user preference mining, discusses challenges such as accurate expression, interpretability, and high-dimensional preferences, reviews statistical and model-based approaches including weighting, decay, XGBoost, DNN, LSTM, Seq4Rec, and Deep Interest Network, and describes their practical implementation at Beike.

BeikeDeep LearningEmbedding
0 likes · 19 min read
User Preference Mining and Modeling Practices at Beike
Sohu Tech Products
Sohu Tech Products
Jan 6, 2021 · Artificial Intelligence

Overview of Main Model Compression and Acceleration Techniques: Structural Optimization, Pruning, Quantization, and Knowledge Distillation

This article reviews four mainstream model compression and acceleration methods—structural optimization, pruning, quantization, and knowledge distillation—explaining their principles, implementations, and performance, and presents practical examples such as DistillBERT, TinyBERT, and FastBERT with comparative results.

AIDeep LearningKnowledge Distillation
0 likes · 14 min read
Overview of Main Model Compression and Acceleration Techniques: Structural Optimization, Pruning, Quantization, and Knowledge Distillation
DataFunTalk
DataFunTalk
Jan 4, 2021 · Artificial Intelligence

Personalized Computing‑Power Allocation for Alibaba Display Advertising: Transformers Engine and DCAF Algorithm

The article presents Alibaba's display‑advertising team’s three‑stage computing‑power efficiency evolution, introduces the DCAF personalized power‑allocation algorithm with its Lagrangian formulation, and describes the AllSpark dynamic‑control framework that together enable a flexible, resource‑aware Transformers engine achieving significant business gains during high‑traffic events.

Deep LearningSystem optimizationalgorithmic co-design
0 likes · 21 min read
Personalized Computing‑Power Allocation for Alibaba Display Advertising: Transformers Engine and DCAF Algorithm
Didi Tech
Didi Tech
Dec 29, 2020 · Artificial Intelligence

Evolution and Challenges of Perception in L4 Autonomous Driving

The article traces L4 autonomous-driving perception from early rule-based point-cloud methods through data-driven deep-learning models to emerging self-learning, multi-task systems, and highlights four key hurdles—model generalization and explainability, robust multi-sensor fusion, real-time compute limits, and proper uncertainty handling—calling for integrated AI, engineering, and data solutions.

AIComputer VisionDeep Learning
0 likes · 12 min read
Evolution and Challenges of Perception in L4 Autonomous Driving
JD Tech Talk
JD Tech Talk
Dec 29, 2020 · Artificial Intelligence

Robust Spatio-Temporal Purchase Prediction via Deep Meta Learning

The paper proposes a deep meta‑learning framework that generates spatio‑temporal representations for retail sales forecasting, especially during large shopping festivals, by combining amortization networks, shared statistical structures, and alternating spatial‑temporal training to achieve robust and accurate predictions despite scarce historical data.

Deep LearningMeta LearningRetail analytics
0 likes · 9 min read
Robust Spatio-Temporal Purchase Prediction via Deep Meta Learning
21CTO
21CTO
Dec 22, 2020 · Artificial Intelligence

Explore tinygrad: A Minimalist Deep Learning Framework Under 1000 Lines

tinygrad, an open‑source autograd tensor library by George Hotz, offers a compact PyTorch‑like experience in fewer than 1000 lines, with easy installation, GPU support via PyOpenCL, full EfficientNet inference, and extensible optimizers for rapid neural‑network prototyping.

AIAutogradDeep Learning
0 likes · 6 min read
Explore tinygrad: A Minimalist Deep Learning Framework Under 1000 Lines
Python Crawling & Data Mining
Python Crawling & Data Mining
Dec 22, 2020 · Artificial Intelligence

Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model

Learn how to generate cinematic ghosting effects in videos by leveraging PaddlePaddle’s PaddleHub deep learning library and the pretrained deeplabv3p_xception65 model for semantic segmentation, with step‑by‑step code, environment setup, and practical testing on classic martial‑arts footage.

Deep LearningGhost EffectPaddlePaddle
0 likes · 7 min read
Create Stunning Video Ghosting Effects with PaddlePaddle’s DeepLabV3p Model
DataFunTalk
DataFunTalk
Dec 17, 2020 · Artificial Intelligence

Context‑Aware Re‑ranking in Industrial Recommendation Systems: Design and Practice of a List Retrieval System

The article presents a comprehensive study of re‑ranking in large‑scale industrial recommendation pipelines, identifies four key challenges—context awareness, permutation specificity, computational complexity, and business constraints—and proposes a two‑stage List Retrieval System that combines fast sequence search and a generative re‑ranking network with a deep context‑wise model, achieving significant online gains across multiple Taobao feed scenarios.

Context-AwareDeep LearningIndustrial AI
0 likes · 28 min read
Context‑Aware Re‑ranking in Industrial Recommendation Systems: Design and Practice of a List Retrieval System
DataFunSummit
DataFunSummit
Dec 14, 2020 · Artificial Intelligence

LightSeq: High‑Performance Open‑Source Inference Engine for Transformers, GPT and Other NLP Models

This article introduces LightSeq, an open‑source, GPU‑accelerated inference engine that dramatically speeds up Transformer‑based models such as BERT and GPT by up to 14× over TensorFlow, supports multiple decoding strategies, integrates seamlessly with major deep‑learning frameworks, and provides detailed performance benchmarks and technical optimizations.

Deep LearningGPUInference
0 likes · 15 min read
LightSeq: High‑Performance Open‑Source Inference Engine for Transformers, GPT and Other NLP Models
Tencent Cloud Developer
Tencent Cloud Developer
Dec 14, 2020 · Artificial Intelligence

Game AI SDK: Overview, Architecture, and Usage

Tencent’s open‑source Game AI SDK provides a versatile automation testing platform—supporting a wide range of game genres and mobile/PC apps—by integrating environment simulation, configurable tools, image‑recognition modules, and deep‑learning algorithms (DQN and IM) into a unified, user‑friendly workflow for training and executing AI agents.

Automation testingDeep LearningSDK
0 likes · 17 min read
Game AI SDK: Overview, Architecture, and Usage
Meituan Technology Team
Meituan Technology Team
Dec 10, 2020 · Artificial Intelligence

Cross‑Session Aware Temporal Convolutional Network (CA‑TCN) for Session‑Based Recommendation

The Cross‑Session Aware Temporal Convolutional Network (CA‑TCN) combines a cross‑session item graph, a dilated temporal convolutional network, and a session‑context graph to capture both global cross‑session signals and positional order, achieving state‑of‑the‑art recommendation performance on benchmarks and slated for deployment in Meituan’s e‑commerce platforms.

Deep LearningGraph Neural NetworkTemporal Convolutional Network
0 likes · 17 min read
Cross‑Session Aware Temporal Convolutional Network (CA‑TCN) for Session‑Based Recommendation
Programmer DD
Programmer DD
Dec 5, 2020 · Artificial Intelligence

Revive Vintage Photos with AI: Guide to Bringing-Old-Photos-Back-to-Life

This article introduces the AI‑powered "Bringing-Old-Photos-Back-to-Life" project, explains its requirements, provides step‑by‑step commands for full‑pipeline restoration, scratch detection, global restoration, and face enhancement, and shares the Colab demo and GitHub repository for hands‑on experimentation.

AI image restorationColab demoDeep Learning
0 likes · 4 min read
Revive Vintage Photos with AI: Guide to Bringing-Old-Photos-Back-to-Life
DataFunSummit
DataFunSummit
Dec 3, 2020 · Artificial Intelligence

GAN Fundamentals, Variants, and Practical Applications in Image Style Transfer and Handwriting Font Generation

This article provides a comprehensive overview of Generative Adversarial Networks, covering their original formulation, training dynamics, loss functions, major variants such as DCGAN and WGAN, and practical implementations for image‑to‑image translation, style transfer, and handwriting font synthesis at Laiye Technology.

Computer VisionDeep LearningGAN
0 likes · 28 min read
GAN Fundamentals, Variants, and Practical Applications in Image Style Transfer and Handwriting Font Generation
Kuaishou Large Model
Kuaishou Large Model
Dec 3, 2020 · Artificial Intelligence

Kuaishou Y‑Tech’s Real‑Time, High‑Precision Facial & Body Keypoint Detection Explained

Y‑Tech’s in‑house keypoint detection system powers Kuaishou’s beauty and effect filters across live streaming, video creation, and editing by leveraging lightweight deep‑learning models, extensive multi‑scenario data collection, and specialized handling of occlusion, enabling real‑time, robust facial and body landmark tracking on diverse mobile devices.

Computer VisionDeep LearningMobile AI
0 likes · 10 min read
Kuaishou Y‑Tech’s Real‑Time, High‑Precision Facial & Body Keypoint Detection Explained
360 Quality & Efficiency
360 Quality & Efficiency
Nov 27, 2020 · Artificial Intelligence

Image Similarity Detection Methods: Hashing, Histograms, Feature Matching, BOW+K‑Means, and CNN‑Based Approaches

This article reviews common image similarity detection techniques—including hash-based methods (aHash, pHash, dHash), histogram comparison, feature matching with ORB and SIFT/SURF, bag‑of‑words with K‑Means, and CNN‑based VGG16 features—detailing their algorithms, Python implementations, performance characteristics, and practical considerations.

Computer VisionDeep LearningHashing
0 likes · 15 min read
Image Similarity Detection Methods: Hashing, Histograms, Feature Matching, BOW+K‑Means, and CNN‑Based Approaches
New Oriental Technology
New Oriental Technology
Nov 23, 2020 · Artificial Intelligence

A Seq2Seq Deep Learning Approach for Recognizing Mathematical Formulas in Images

This article presents a deep‑learning Seq2Seq model that converts images of mathematical formulas—including matrices, equations, fractions, and radicals—into LaTeX sequences with over 95% accuracy, detailing data preparation, LaTeX normalization, model architecture, training, inference, and post‑processing techniques.

Deep LearningFormula RecognitionLaTeX
0 likes · 9 min read
A Seq2Seq Deep Learning Approach for Recognizing Mathematical Formulas in Images
DataFunSummit
DataFunSummit
Nov 22, 2020 · Artificial Intelligence

An Overview of NVIDIA Merlin Recommendation System Framework and Its Deep Learning Components

This article introduces NVIDIA's Merlin recommendation system framework, detailing its three core components—NVTabular for feature engineering, HugeCTR for high‑performance CTR model training, and Triton for inference—while discussing common pipeline challenges, performance advantages, and example implementations for deep‑learning‑based recommender models.

AIDeep LearningHugeCTR
0 likes · 12 min read
An Overview of NVIDIA Merlin Recommendation System Framework and Its Deep Learning Components
Sohu Tech Products
Sohu Tech Products
Nov 18, 2020 · Artificial Intelligence

Understanding Sequence‑to‑Sequence (seq2seq) Models and Attention Mechanisms

This article explains the fundamentals of seq2seq neural machine translation models, covering encoder‑decoder architecture, word embeddings, context vectors, RNN processing, and the attention mechanism introduced by Bahdanau and Luong, with visual illustrations and reference links for deeper study.

Deep LearningEmbeddingNeural Machine Translation
0 likes · 11 min read
Understanding Sequence‑to‑Sequence (seq2seq) Models and Attention Mechanisms
DeWu Technology
DeWu Technology
Nov 18, 2020 · Artificial Intelligence

Evolution and Technical Analysis of Dewu Photo Search

Dewu Photo Search evolved from a limited Aliyun‑based prototype to a self‑developed pipeline using EfficientNet detection and 128‑dim embeddings, boosting top‑1 shoe accuracy over 100 % and overall precision by up to 41 %, while reducing latency and improving scalability despite remaining stability challenges.

Deep LearningModel Optimizationfeature extraction
0 likes · 10 min read
Evolution and Technical Analysis of Dewu Photo Search
DataFunTalk
DataFunTalk
Nov 16, 2020 · Artificial Intelligence

Deep Semantic Relevance and Multimodal Video Search at Alibaba Entertainment

The presentation by Alibaba Entertainment's senior algorithm expert details the challenges of video search in the 4G/5G era and describes a comprehensive framework covering business overview, relevance and ranking, multimodal retrieval, deep semantic modeling, dataset construction, and practical deployment techniques.

Deep Learninginformation retrievalmultimodal
0 likes · 27 min read
Deep Semantic Relevance and Multimodal Video Search at Alibaba Entertainment
Suning Technology
Suning Technology
Nov 14, 2020 · Artificial Intelligence

Designing Real-Time AI Algorithms for Unmanned Retail Stores

This lecture details the end‑to‑end AI architecture for unmanned stores, covering algorithm module selection, calibration, face recognition, multi‑task detection, tracking, recommendation, data collection, augmentation, model training, and GPU‑accelerated deployment to achieve real‑time performance and high accuracy.

Deep LearningModel Deploymentdata augmentation
0 likes · 15 min read
Designing Real-Time AI Algorithms for Unmanned Retail Stores
JD Cloud Developers
JD Cloud Developers
Nov 4, 2020 · Artificial Intelligence

How Cloud Trade Fairs Use AI to Power Smart Recommendations

This article explains how a cloud‑based trade fair leverages AI techniques—including user and item profiling, multi‑level caching with Caffeine and Redis, and a Deep Interest Network model with attention mechanisms—to deliver personalized, high‑performance recommendations for exhibitors, buyers, and individual users.

AIDeep Learningcaching
0 likes · 15 min read
How Cloud Trade Fairs Use AI to Power Smart Recommendations
Didi Tech
Didi Tech
Nov 3, 2020 · Artificial Intelligence

Advances in Single‑Channel Speech Separation and Target Speaker Extraction with Iterative Refined Adaptation

The article surveys recent advances in single‑channel speech separation and target‑speaker extraction, explains the encoder‑separator‑decoder framework, compares frequency‑ and time‑domain methods, highlights models such as SpEx+, DPRNN‑Spe, and introduces Iterative Refined Adaptation, which iteratively improves speaker embeddings to boost SI‑SDR performance and enables effective speaker‑suppression for applications like in‑vehicle voice interaction.

AIDeep Learningaudio signal processing
0 likes · 13 min read
Advances in Single‑Channel Speech Separation and Target Speaker Extraction with Iterative Refined Adaptation
DataFunTalk
DataFunTalk
Oct 31, 2020 · Artificial Intelligence

Trajectory Classification for Road Closure Detection Using Bayesian and Deep Learning Approaches

This article investigates how to dynamically identify road closures by classifying vehicle, bicycle, and pedestrian trajectories, addressing sample imbalance and noisy labels with a label‑probability mixture Bayesian model and a deep‑learning image‑encoding pipeline, and compares their experimental results.

Artificial IntelligenceBayesian modelDeep Learning
0 likes · 12 min read
Trajectory Classification for Road Closure Detection Using Bayesian and Deep Learning Approaches
Suning Technology
Suning Technology
Oct 29, 2020 · Artificial Intelligence

Accelerating Deep Learning for Retail: Model Compression, Speed & Energy

This lecture outlines the key challenges of deep learning in retail—growing model size, speed, and energy consumption—and presents a comprehensive acceleration framework covering algorithmic optimizations like network design, pruning, and hardware acceleration, with practical examples such as MobileNet, model compression, and edge deployment.

Deep LearningHardware Optimizationmodel acceleration
0 likes · 15 min read
Accelerating Deep Learning for Retail: Model Compression, Speed & Energy
Tencent Advertising Technology
Tencent Advertising Technology
Oct 29, 2020 · Artificial Intelligence

Large-Scale User Visits Understanding and Forecasting with Deep Spatial-Temporal Tensor Factorization Framework

This article discusses a deep spatial-temporal tensor factorization framework for large-scale user visits understanding and forecasting, addressing challenges in advertising inventory prediction and demonstrating significant improvements over traditional methods.

Artificial IntelligenceData ScienceDeep Learning
0 likes · 9 min read
Large-Scale User Visits Understanding and Forecasting with Deep Spatial-Temporal Tensor Factorization Framework
Beike Product & Technology
Beike Product & Technology
Oct 24, 2020 · Artificial Intelligence

FrameX: An AI System for Intelligent Floorplan Analysis and Applications

FrameX is an AI-powered platform developed by Beike’s Data Intelligence Center that leverages vector floorplan data to automatically tag, score, interpret, cluster, and retrieve housing layouts, supporting numerous business scenarios through a layered architecture of data, feature, and application layers.

AIDeep LearningFloorplan Analysis
0 likes · 9 min read
FrameX: An AI System for Intelligent Floorplan Analysis and Applications
DataFunTalk
DataFunTalk
Oct 23, 2020 · Artificial Intelligence

Feedback‑Aware Deep Matching Model for Music Recommendation in Tmall Genie

This article presents DeepMatch, a behavior‑sequence based deep learning recall model enhanced with play‑rate and intent‑type embeddings, describes its self‑attention architecture, factorized embedding parameterization, multitask loss design, distributed TensorFlow training tricks, and demonstrates significant offline and online improvements in music recommendation performance.

Deep LearningSelf-AttentionTensorFlow
0 likes · 15 min read
Feedback‑Aware Deep Matching Model for Music Recommendation in Tmall Genie
Didi Tech
Didi Tech
Oct 21, 2020 · Artificial Intelligence

Deep Model Compression Techniques for Intelligent Automotive Cockpits

The article reviews deep‑model compression methods—ADMM‑based structured pruning, low‑bit quantization, and teacher‑student knowledge distillation—and their automated AutoCompress workflow, demonstrating how these techniques shrink neural networks enough to run real‑time driver‑monitoring and other intelligent cockpit functions on resource‑limited automotive hardware while preserving accuracy.

ADMMDeep LearningKnowledge Distillation
0 likes · 16 min read
Deep Model Compression Techniques for Intelligent Automotive Cockpits
DataFunTalk
DataFunTalk
Oct 20, 2020 · Artificial Intelligence

From Biological Neurons to Artificial Neural Networks: Perceptrons, Multilayer Perceptrons, and Backpropagation

This article traces the evolution of artificial neural networks from their biological inspiration, explains the McCulloch‑Pitts neuron model, details perceptron architecture and learning rule with a Scikit‑Learn example, and introduces multilayer perceptrons and the back‑propagation algorithm together with common activation functions.

AIBackpropagationDeep Learning
0 likes · 19 min read
From Biological Neurons to Artificial Neural Networks: Perceptrons, Multilayer Perceptrons, and Backpropagation
Amap Tech
Amap Tech
Oct 16, 2020 · Artificial Intelligence

Trajectory Classification for Road Closure Detection Using Bayesian and Deep Learning Approaches

The paper proposes classifying vehicle, bicycle, and pedestrian trajectories to detect road closures, introducing a probability‑mixture Bayesian model that mitigates label noise and class imbalance through joint feature densities, and a dual‑stream deep‑learning approach encoding trajectories as images, with experiments showing the Bayesian method outperforming the neural network on limited labeled data.

Bayesian modelDeep LearningGPS data
0 likes · 13 min read
Trajectory Classification for Road Closure Detection Using Bayesian and Deep Learning Approaches
Didi Tech
Didi Tech
Oct 16, 2020 · Artificial Intelligence

Mask Detection System and Visual AI Competition Achievements

Didi’s COVID‑19 mask‑detection system, built on a DFS‑based face detector and an attention‑enhanced ResNet‑50 mask classifier achieving over 99.5 % accuracy, has been deployed in vehicles, open‑sourced, and complemented by top‑ranked results in international visual AI contests, including first place in driver‑gaze prediction and podium finishes in emotion recognition and model‑compression challenges.

AIComputer VisionDeep Learning
0 likes · 22 min read
Mask Detection System and Visual AI Competition Achievements
Suning Technology
Suning Technology
Oct 15, 2020 · Artificial Intelligence

How AI Powers Offline Product Recognition in Smart Retail Stores

This lecture details the evolution of product recognition algorithms from traditional image classification to deep‑learning‑based object detection, discusses challenges in dense retail scenes, presents solutions like rotated bounding boxes and multi‑source sensor fusion, and explains practical deployment in digital and unmanned stores.

Deep Learningdense sceneobject detection
0 likes · 18 min read
How AI Powers Offline Product Recognition in Smart Retail Stores
Kuaishou Large Model
Kuaishou Large Model
Oct 15, 2020 · Artificial Intelligence

How Kuaishou’s Y‑Tech Advances Monocular Depth Estimation for Mobile AR

This article reviews Kuashou Y‑Tech’s ECCV‑2020 paper on monocular depth estimation, detailing its novel GCB‑SAB network, new HC‑Depth dataset, specialized loss functions and edge‑aware training, and demonstrates superior performance on NYUv2, TUM and real‑world mobile AR applications.

Attention MechanismComputer VisionDeep Learning
0 likes · 14 min read
How Kuaishou’s Y‑Tech Advances Monocular Depth Estimation for Mobile AR
DataFunTalk
DataFunTalk
Oct 4, 2020 · Artificial Intelligence

Reinforcement Learning for Product Ranking: Model Design, Experiments, and Online Deployment

This article presents a comprehensive study of using reinforcement learning to improve e‑commerce product ranking, covering the limitations of traditional scoring, the design of context‑aware models, a pointer‑network based sequence generator, various RL algorithms, extensive offline evaluations, and successful online deployment with future research directions.

Deep LearningPPOe‑commerce
0 likes · 28 min read
Reinforcement Learning for Product Ranking: Model Design, Experiments, and Online Deployment
DataFunTalk
DataFunTalk
Sep 29, 2020 · Artificial Intelligence

Deep Sparse Network (NON): A Novel Deep Neural Network Model for Recommendation Systems

This article introduces the Deep Sparse Network (NON), a new deep neural architecture for recommendation systems that combines field‑wise networks, across‑field interaction networks, and an operation‑fusion network, and demonstrates its superior performance through extensive experiments and ablation studies.

CTR predictionDeep Learningfeature interaction
0 likes · 14 min read
Deep Sparse Network (NON): A Novel Deep Neural Network Model for Recommendation Systems
Didi Tech
Didi Tech
Sep 29, 2020 · Artificial Intelligence

Technical Overview of Didi’s MJO 3D Panoramic Navigation, Main/Sub‑Road Yaw Detection, and Deep‑Learning‑Based Navigation Engine

Didi’s Navigation system combines a novel MJO 3D panoramic map with advanced data‑compression and octree rendering, precise main/sub‑road yaw detection using LSTM‑based models trained on GPS and image data, and a lightweight deep‑learning engine optimized for mobile CPUs/GPUs, delivering accurate, real‑time guidance for ride‑hailing and autonomous driving.

3D renderingDeep LearningGPS trajectory
0 likes · 21 min read
Technical Overview of Didi’s MJO 3D Panoramic Navigation, Main/Sub‑Road Yaw Detection, and Deep‑Learning‑Based Navigation Engine
Tencent Cloud Developer
Tencent Cloud Developer
Sep 23, 2020 · Artificial Intelligence

NLP Model Interpretability: White-box and Black-box Methods and Business Applications

The article reviews NLP interpretability techniques, contrasting white‑box approaches that probe model internals such as neuron analysis, diagnostic classifiers, and attention with black‑box strategies like rationales, adversarial testing, and local surrogates, and argues that black‑box methods are generally more practical for business deployment despite offering shallower insights.

Attention MechanismBERTDeep Learning
0 likes · 12 min read
NLP Model Interpretability: White-box and Black-box Methods and Business Applications
DataFunTalk
DataFunTalk
Sep 4, 2020 · Artificial Intelligence

Beam Search Aware Training for Optimal Tree-Based Retrieval Models

This article presents a comprehensive study of tree-based deep models for large-scale matching, introduces the theoretical framework of optimal tree models, proposes a Beam Search aware training algorithm (BSAT/OTM) to address training-test mismatch, and demonstrates significant recall improvements on Amazon Books and UserBehavior datasets.

Beam SearchDeep Learninglarge-scale matching
0 likes · 23 min read
Beam Search Aware Training for Optimal Tree-Based Retrieval Models
Beike Product & Technology
Beike Product & Technology
Sep 4, 2020 · Artificial Intelligence

Wide & Deep Model for Real‑Estate Purchase Intent Prediction

This article presents a comprehensive study of the Wide & Deep architecture applied to user purchase‑intent quantification in the real‑estate domain, detailing feature engineering, model design, training procedures, experimental results, and extensions with GRU‑based sequential modeling to improve accuracy.

CTR predictionDeep LearningReal Estate
0 likes · 15 min read
Wide & Deep Model for Real‑Estate Purchase Intent Prediction
Architects' Tech Alliance
Architects' Tech Alliance
Sep 3, 2020 · Artificial Intelligence

Deep Learning Specialization Infographic Overview

This article presents a comprehensive English summary of the deep learning specialization infographics originally shared by Andrew Ng, covering fundamentals, logistic regression, shallow and deep neural networks, regularization, optimization, hyperparameters, convolutional and recurrent networks, and practical advice for model building and evaluation.

CNNDeep LearningNeural Networks
0 likes · 21 min read
Deep Learning Specialization Infographic Overview
DataFunTalk
DataFunTalk
Sep 3, 2020 · Artificial Intelligence

Deep Learning Practices for Click‑Through‑Rate Prediction and Ranking at 58.com

This article describes how 58.com applied deep‑learning techniques—including feature engineering, sample construction, model evolution from Wide&Deep to DIN/DIEN and multi‑task learning—and system‑level optimizations to improve CTR/CPM performance in its large‑scale commercial ranking platform.

CTR predictionDeep LearningSystem optimization
0 likes · 38 min read
Deep Learning Practices for Click‑Through‑Rate Prediction and Ranking at 58.com
DataFunTalk
DataFunTalk
Sep 2, 2020 · Artificial Intelligence

CSCNN: Category‑Specific Convolutional Neural Network for Visual CTR Prediction in JD E‑commerce Advertising

This article presents CSCNN, a category‑specific convolutional neural network that integrates visual priors into click‑through‑rate (CTR) models for JD.com’s e‑commerce advertising, detailing its motivation, architecture, engineering optimizations, offline and online training strategies, and empirical performance gains on both public and industrial datasets.

CTR predictionDeep Learningcategory-specific CNN
0 likes · 19 min read
CSCNN: Category‑Specific Convolutional Neural Network for Visual CTR Prediction in JD E‑commerce Advertising
Taobao Frontend Technology
Taobao Frontend Technology
Sep 1, 2020 · Artificial Intelligence

Build a Browser‑Based MNIST Classifier with TensorFlow.js: A Step‑by‑Step Guide

Learn how to create a browser‑compatible MNIST image classification model using TensorFlow.js, covering data preprocessing with sprite images, model construction, training, and evaluation, while providing complete JavaScript code examples and practical tips for handling ArrayBuffer, DataView, and visualization.

Deep LearningJavaScriptMNIST
0 likes · 8 min read
Build a Browser‑Based MNIST Classifier with TensorFlow.js: A Step‑by‑Step Guide
58 Tech
58 Tech
Aug 31, 2020 · Artificial Intelligence

Deep Learning Practices for Commercial CTR Prediction at 58.com

This article details the end‑to‑end deep‑learning workflow for click‑through‑rate (CTR) prediction in 58.com’s commercial ranking system, covering system architecture, feature engineering, sample construction, model evolution from Wide&Deep to DIN/DIEN, and engineering optimizations that together yielded significant CPM and CVR improvements.

AdvertisingCTR predictionDeep Learning
0 likes · 38 min read
Deep Learning Practices for Commercial CTR Prediction at 58.com
DataFunTalk
DataFunTalk
Aug 29, 2020 · Artificial Intelligence

User Modeling for Search Ranking: Practices, Model Design, and Experimental Analysis at Alibaba

This article presents Alibaba's comprehensive approach to user modeling for search CTR/CVR ranking, detailing the abstraction of user information, multi‑scale behavior processing, enhanced transformer‑based model structures, client‑side click and exposure modeling, and experimental results showing significant AUC improvements.

AlibabaAttention MechanismCTR prediction
0 likes · 18 min read
User Modeling for Search Ranking: Practices, Model Design, and Experimental Analysis at Alibaba
Suning Technology
Suning Technology
Aug 29, 2020 · Artificial Intelligence

How AI Powers Large‑Scale Time Series Forecasting and Root‑Cause Analysis

This article describes Suning's AI‑driven end‑to‑end solution for massive time‑series monitoring, anomaly detection, forecasting with DeepAR, MQ‑RNN, MQ‑CNN, ensemble methods, root‑cause localization using Hotspot and Monte‑Carlo Tree Search, and the evolution of its large‑scale log analytics platform.

Deep LearningLog AnalyticsRoot Cause Analysis
0 likes · 17 min read
How AI Powers Large‑Scale Time Series Forecasting and Root‑Cause Analysis
DataFunTalk
DataFunTalk
Aug 27, 2020 · Artificial Intelligence

Computational Advertising vs Recommendation Systems: Key Differences and Popular Models

This article explains the fundamental differences between computational advertising and recommendation systems, outlines the distinct problems each field addresses, and surveys the most widely used advertising models—including traditional machine‑learning approaches, deep‑learning architectures, and hybrid solutions—providing practical insights for engineers in both domains.

AICTR modelsDeep Learning
0 likes · 11 min read
Computational Advertising vs Recommendation Systems: Key Differences and Popular Models
Sohu Tech Products
Sohu Tech Products
Aug 19, 2020 · Artificial Intelligence

ASR Error Correction with BERT, ELECTRA and a Fuzzy‑Phoneme Generator: Techniques from Xiaomi AI

This article describes how Xiaomi's AI team tackles Automatic Speech Recognition (ASR) query errors by analyzing error patterns, employing BERT, ELECTRA and a soft‑masked BERT model, generating synthetic noisy data with a fuzzy‑phoneme generator, and presenting experimental results and future research directions.

ASRBERTDeep Learning
0 likes · 18 min read
ASR Error Correction with BERT, ELECTRA and a Fuzzy‑Phoneme Generator: Techniques from Xiaomi AI
58 Tech
58 Tech
Aug 19, 2020 · Artificial Intelligence

Speech Recognition in 58.com: Application Scenarios, Data Collection, Kaldi Chain Model Practice, and End‑to‑End Exploration

This article presents a comprehensive overview of how 58.com leverages large‑scale voice data from call‑center, private phone, and micro‑chat platforms, detailing data collection, annotation, Kaldi‑based chain model training, lattice‑free techniques, and end‑to‑end Transformer‑CTC models to improve Chinese speech recognition performance.

ASRChineseDeep Learning
0 likes · 16 min read
Speech Recognition in 58.com: Application Scenarios, Data Collection, Kaldi Chain Model Practice, and End‑to‑End Exploration
Ctrip Technology
Ctrip Technology
Aug 13, 2020 · Artificial Intelligence

Hotel Recommendation System Architecture, Models, and Evaluation at Ctrip

This article presents a comprehensive overview of Ctrip's hotel recommendation system, covering its technical architecture, data processing pipelines, various ranking and embedding models—including FM, Wide&Deep, DeepFM, and FTRL—deployment methods such as PMML and TensorFlow Serving, offline and online evaluation results, and challenges like cold‑start and diversity.

CtripDeep LearningEmbedding
0 likes · 24 min read
Hotel Recommendation System Architecture, Models, and Evaluation at Ctrip
Alibaba Terminal Technology
Alibaba Terminal Technology
Aug 12, 2020 · Artificial Intelligence

How AI is Revolutionizing Automatic Logic Code Generation: Techniques, Tools, and Challenges

This article surveys the landscape of automatic program synthesis for logic code, covering visual programming, example‑driven generation, code‑completion models, intent inference, NL2SQL, NL2IFTTT, and advanced frameworks like TranX and Debuild, while highlighting current challenges and research directions.

Deep Learningcode-generationlogic code
0 likes · 18 min read
How AI is Revolutionizing Automatic Logic Code Generation: Techniques, Tools, and Challenges
iQIYI Technical Product Team
iQIYI Technical Product Team
Aug 7, 2020 · Artificial Intelligence

Boundary Content Graph Neural Network (BC‑GNN) for Temporal Action Proposal Generation

The Boundary Content Graph Neural Network (BC‑GNN) introduces a bipartite‑graph framework that jointly refines start/end boundary probabilities and segment‑content confidence, enabling more precise temporal action proposals and achieving state‑of‑the‑art results on ActivityNet‑1.3 and THUMOS14.

BC-GNNComputer VisionDeep Learning
0 likes · 10 min read
Boundary Content Graph Neural Network (BC‑GNN) for Temporal Action Proposal Generation
DataFunTalk
DataFunTalk
Aug 5, 2020 · Artificial Intelligence

EdgeRec: An Edge‑Computing Based Real‑Time Recommendation System

The article introduces EdgeRec, an edge‑computing powered recommendation framework that moves user‑interest perception and ranking to the client side to overcome latency in traditional cloud‑centric recommender systems, detailing its architecture, heterogeneous behavior modeling, attention‑based reranking, and experimental gains.

Deep LearningEdge Computingranking
0 likes · 13 min read
EdgeRec: An Edge‑Computing Based Real‑Time Recommendation System
58 Tech
58 Tech
Aug 3, 2020 · Artificial Intelligence

Intelligent Voice Quality Inspection System Architecture and Implementation at 58.com

The article details the design and deployment of an AI-powered intelligent voice quality inspection system at 58.com, covering its overall architecture, speech recognition, role identification, tag detection, rechecking platform, and backend infrastructure, and demonstrates its impact on call‑center efficiency and service quality.

AIBackend ArchitectureDeep Learning
0 likes · 12 min read
Intelligent Voice Quality Inspection System Architecture and Implementation at 58.com
Java Captain
Java Captain
Aug 2, 2020 · Artificial Intelligence

Java Spring Boot License Plate Recognition and Training System (Open‑Source)

This article introduces an open‑source Java Spring Boot project that implements a license‑plate detection and recognition system with training capabilities, detailing its features, architecture, supported plate types, software requirements, processing steps, installation guide, and reference resources.

Deep LearningImage ProcessingOpenCV
0 likes · 8 min read
Java Spring Boot License Plate Recognition and Training System (Open‑Source)
Tencent Advertising Technology
Tencent Advertising Technology
Jul 30, 2020 · Artificial Intelligence

Winning Strategies for the Tencent Advertising Algorithm Competition: Text Classification with Word2Vec and BiLSTM

The article details the Tencent Advertising Algorithm competition final, explains the chizhu team's approach of converting ad IDs into word sequences for text classification using large‑scale word2vec embeddings and a dual BiLSTM architecture, presents custom loss functions, training tricks, and shares full Python model code, achieving an overall rank of 11.

AdvertisingBiLSTMDeep Learning
0 likes · 9 min read
Winning Strategies for the Tencent Advertising Algorithm Competition: Text Classification with Word2Vec and BiLSTM
Amap Tech
Amap Tech
Jul 30, 2020 · Artificial Intelligence

Evolution and Practice of Scene Text Recognition Technology in Amap Map Data Production

Amap uses advanced scene text recognition combining detection and recognition modules, deep learning, data synthesis, and result fusion to automate map data production, achieving state-of-the-art performance and automating the majority of POI and road updates, significantly reducing labor costs.

Computer VisionDeep LearningOCR
0 likes · 18 min read
Evolution and Practice of Scene Text Recognition Technology in Amap Map Data Production
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 30, 2020 · Artificial Intelligence

How Amap’s Scene Text Recognition Powers Accurate Maps: Evolution and Future Challenges

This article explains how Amap leverages scene text recognition to automate map data production, detailing the evolution from traditional image algorithms to deep‑learning models, the current detection and recognition framework, performance results, and future research directions for handling blur, data scarcity, and semantic understanding.

AmapComputer VisionDeep Learning
0 likes · 19 min read
How Amap’s Scene Text Recognition Powers Accurate Maps: Evolution and Future Challenges
Alibaba Cloud Developer
Alibaba Cloud Developer
Jul 29, 2020 · Artificial Intelligence

How Gaode Maps Boosts Accuracy with Advanced Scene Text Recognition

This article explains how Gaode Maps leverages traditional and deep‑learning based scene text recognition techniques—including character detection, sequence models, data synthesis, and multi‑stage frameworks—to automate POI and road data production with high precision and speed.

Computer VisionDeep LearningOCR
0 likes · 20 min read
How Gaode Maps Boosts Accuracy with Advanced Scene Text Recognition
Huawei Cloud Developer Alliance
Huawei Cloud Developer Alliance
Jul 29, 2020 · Artificial Intelligence

Boosting Small Industrial Image Datasets with ModelArts Augmentation and Evaluation

This article describes a practical workflow for expanding a limited industrial solar‑panel defect dataset using flip augmentation, ModelArts smart labeling, and targeted data‑balancing techniques, then evaluates the impact on a ResNet‑50 classifier with detailed accuracy and recall metrics, demonstrating how thoughtful augmentation can improve defect detection performance.

Deep LearningImage ClassificationModelArts
0 likes · 10 min read
Boosting Small Industrial Image Datasets with ModelArts Augmentation and Evaluation
Didi Tech
Didi Tech
Jul 24, 2020 · Artificial Intelligence

DLFlow: An End-to-End Deep Learning Solution for Big Data Offline Tasks

DLFlow, an end‑to‑end framework from Didi’s user‑profile team, merges Spark and TensorFlow to automate feature preprocessing, large‑scale distributed training, and massive prediction for big‑data offline tasks, offering configuration‑driven pipelines, task scheduling, and easy deployment that dramatically speeds model development.

Deep LearningModel DevelopmentSpark
0 likes · 9 min read
DLFlow: An End-to-End Deep Learning Solution for Big Data Offline Tasks
ITPUB
ITPUB
Jul 23, 2020 · Artificial Intelligence

How Likee Scales Short‑Video Recommendations with Flink, Auto‑Stats, and Cache Tensor

This article details Likee's short‑video recommendation pipeline, covering the evolution of its feature‑engineering framework, the use of Flink for minute‑level statistical and second‑level session features, the integration of automatic statistical features into DNN models, multimodal feature extraction, and the cache‑tensor technique that dramatically improves online inference performance.

AIDeep LearningFlink
0 likes · 18 min read
How Likee Scales Short‑Video Recommendations with Flink, Auto‑Stats, and Cache Tensor