Tagged articles
1881 articles
Page 7 of 19
DaTaobao Tech
DaTaobao Tech
Apr 28, 2023 · Artificial Intelligence

Multi-Scenario Recommendation Model

The paper introduces SASS, a scenario-adaptive self-supervised recommendation model that uses contrastive pre-training and multi-layer gating to expand global samples and transfer scene-aware parameters, enabling a single model to deliver personalized recommendations across diverse Taobao ‘SuoSuo’ scenarios while mitigating data sparsity and cross-domain challenges.

AIDeep Learningdata modeling
0 likes · 23 min read
Multi-Scenario Recommendation Model
21CTO
21CTO
Apr 27, 2023 · Artificial Intelligence

John Carmack’s Bold Quest: Building AGI from Game Engines to Rocket Science

In an in‑depth interview, legendary game developer and VR pioneer John Carmack explains why he left Meta to pursue artificial general intelligence through his startup Keen Technologies, outlining his independent research approach, predictions for AGI breakthroughs, and the potential economic impact of truly general AI.

AGIArtificial IntelligenceJohn Carmack
0 likes · 22 min read
John Carmack’s Bold Quest: Building AGI from Game Engines to Rocket Science
DataFunTalk
DataFunTalk
Apr 26, 2023 · Artificial Intelligence

Serializing Advertising Placement with User Algorithms at Alibaba Health

Alibaba Health’s user algorithm leverages multi‑channel serialized ad placement, using vector‑based three‑tower models, knowledge distillation, and ROI‑oriented optimizations to sequence user touchpoints, improve conversion rates, and enhance model accuracy across diverse marketing channels.

AdvertisingKnowledge DistillationROI
0 likes · 15 min read
Serializing Advertising Placement with User Algorithms at Alibaba Health
DevOps
DevOps
Apr 25, 2023 · Artificial Intelligence

The Bitter Lesson: Why Brute‑Force Computation Outperforms Hand‑Crafted Knowledge in AI

Richard Sutton’s “The Bitter Lesson” argues that over the past seven decades the most powerful driver of AI progress has been general‑purpose compute and large‑scale search, which consistently surpasses methods that rely on human‑engineered knowledge across domains such as chess, Go, speech recognition, and computer vision.

AIDeep Learningbrute force
0 likes · 7 min read
The Bitter Lesson: Why Brute‑Force Computation Outperforms Hand‑Crafted Knowledge in AI
DataFunSummit
DataFunSummit
Apr 22, 2023 · Artificial Intelligence

Applying Causal Inference to Limited‑Resource Decision‑Making

This article explains the fundamentals of causal inference, illustrates its distinction from correlation modeling, and demonstrates how causal techniques can be applied to limited‑resource decision problems such as knapsack optimization, ride‑hailing subsidies, and flight pricing, while also covering experimental design, popular models, evaluation metrics, and open challenges.

Decision Optimizationcausal inferenceexperimental design
0 likes · 15 min read
Applying Causal Inference to Limited‑Resource Decision‑Making
Alimama Tech
Alimama Tech
Apr 19, 2023 · Artificial Intelligence

Potential Generalized Second Price (PGSP) Auction for Augmented Advertising

This paper proposes a two‑stage Potential Generalized Second Price auction for augmented ads, ranking guide ads by expected welfare from their linked second‑step ads, shifting billing to the second click to eliminate free‑riding, and demonstrates via offline and online experiments on Taobao that it boosts click‑through, revenue, and GMV while lowering CPC.

e‑commercemachine learningonline advertising
0 likes · 16 min read
Potential Generalized Second Price (PGSP) Auction for Augmented Advertising
DataFunSummit
DataFunSummit
Apr 13, 2023 · Databases

AI-Driven and Cost-Based Index Recommendation for Slow Queries at Meituan

This article describes Meituan's collaboration with East China Normal University to improve slow‑query performance by combining traditional cost‑based index recommendation with a novel AI‑plus‑data‑driven approach, detailing the overall architecture, modeling process, experimental results, and future plans for large‑model integration.

AIDatabase OptimizationIndex Recommendation
0 likes · 15 min read
AI-Driven and Cost-Based Index Recommendation for Slow Queries at Meituan
Tencent Cloud Developer
Tencent Cloud Developer
Apr 10, 2023 · Artificial Intelligence

How Computers Generate Realistic Images: An In‑Depth Guide to AI Image Generation, Diffusion Models, ControlNet, LoRA and More

This guide explains how AI creates photorealistic images, tracing the shift from VAEs and GANs to diffusion models, detailing latent diffusion, ControlNet conditioning, CLIP text‑image alignment, and lightweight fine‑tuning methods like DreamBooth and LoRA, plus practical tips for higher‑resolution results.

AI image generationControlNetDiffusion Models
0 likes · 22 min read
How Computers Generate Realistic Images: An In‑Depth Guide to AI Image Generation, Diffusion Models, ControlNet, LoRA and More
Meituan Technology Team
Meituan Technology Team
Apr 6, 2023 · Databases

AI-Driven Index Recommendation for Slow Queries at Meituan

This article details a joint research effort between Meituan and East China Normal University that combines cost‑based methods with AI‑driven, data‑centric models to automatically generate and evaluate missing indexes for billions of daily slow queries, improving recommendation accuracy and query performance.

AICost ModelIndex Recommendation
0 likes · 16 min read
AI-Driven Index Recommendation for Slow Queries at Meituan
DataFunTalk
DataFunTalk
Apr 5, 2023 · Artificial Intelligence

Advances in Causal Representation Learning: From i.i.d. to Non‑Stationary Settings

This article reviews recent developments in causal representation learning, explaining why causal reasoning is essential, describing methods for i.i.d. data, time‑series, and multi‑distribution scenarios, and illustrating applications such as domain adaptation, video analysis, and financial data with numerous examples and visualizations.

causal discoverycausal inferencedomain adaptation
0 likes · 22 min read
Advances in Causal Representation Learning: From i.i.d. to Non‑Stationary Settings
Top Architect
Top Architect
Apr 4, 2023 · Artificial Intelligence

Overview of Twitter's Open‑Source Recommendation Algorithm Architecture

Twitter has open‑sourced its recommendation algorithm, detailing a complex pipeline of candidate sources, graph‑based models, embedding techniques, and ranking services that together generate personalized home‑timeline tweets for users.

ArchitectureOpen-sourceRecommendation Algorithm
0 likes · 9 min read
Overview of Twitter's Open‑Source Recommendation Algorithm Architecture
Programmer DD
Programmer DD
Apr 4, 2023 · Artificial Intelligence

Inside Twitter’s Open‑Source Recommendation Engine: How It Ranks Your Timeline

Twitter has finally open‑sourced most of its recommendation algorithm, revealing a three‑stage pipeline that gathers top tweets, ranks them with machine‑learning models, and filters out unwanted content, while also exposing the massive graph data and ranking signals that power the For You timeline.

Recommendation AlgorithmTwittermachine learning
0 likes · 11 min read
Inside Twitter’s Open‑Source Recommendation Engine: How It Ranks Your Timeline
HelloTech
HelloTech
Apr 3, 2023 · Artificial Intelligence

Integrating Machine Learning with Elasticsearch for Enhanced Ranking Capabilities

At the 2023 Elastic China Developer Conference in Shenzhen, Peng Cheng of Hello Technology will demonstrate how migrating online machine‑learning predictions into Elasticsearch can exploit its distributed architecture to rank thousands of models, expand model types and computational depth, and unlock new growth opportunities for business applications, underscoring the event’s status as China’s premier Elasticsearch open‑source technology forum.

ElasticsearchTechnical Conferencemachine learning
0 likes · 2 min read
Integrating Machine Learning with Elasticsearch for Enhanced Ranking Capabilities
21CTO
21CTO
Apr 1, 2023 · Artificial Intelligence

Inside Twitter’s Open‑Source Recommendation Engine: How Tweets Are Ranked

Twitter has released the source code of its recommendation algorithm, revealing a three‑stage pipeline that collects, ranks with machine‑learning models, and filters tweets to balance content from followed and unfollowed accounts while optimizing for user engagement.

Artificial IntelligenceOpen-sourceRecommendation Algorithm
0 likes · 5 min read
Inside Twitter’s Open‑Source Recommendation Engine: How Tweets Are Ranked
Java Architecture Diary
Java Architecture Diary
Apr 1, 2023 · Artificial Intelligence

Inside Twitter’s Open‑Source Recommendation Engine: Architecture & Key Components

This article examines the open‑source Twitter recommendation algorithm released by Elon Musk, detailing its main services, machine‑learning models, data sources, programming languages, and the GitHub repositories that host the core components such as SimClusters, TwHIN, rankers, and the Rust‑based navi framework.

Backend ArchitectureOpen-sourceTwitter
0 likes · 5 min read
Inside Twitter’s Open‑Source Recommendation Engine: Architecture & Key Components
HomeTech
HomeTech
Mar 31, 2023 · Artificial Intelligence

Digital Transformation of Used‑Car Buying: Integrated Data, AI Valuation, and VR Visualization

The article describes how a comprehensive digital platform combines structured, semi‑structured, and panoramic data with machine‑learning valuation models, natural‑language processing, and VR technology to make used‑car condition information transparent, improve estimation accuracy, and enhance user decision‑making in the Chinese second‑hand car market.

AI valuationBig DataData Integration
0 likes · 15 min read
Digital Transformation of Used‑Car Buying: Integrated Data, AI Valuation, and VR Visualization
Python Programming Learning Circle
Python Programming Learning Circle
Mar 27, 2023 · Artificial Intelligence

Top 10 Machine Learning Algorithms: Concepts, Uses, and Key Characteristics

This article introduces the No‑Free‑Lunch principle in machine learning and provides concise explanations of ten fundamental supervised‑learning algorithms—including linear regression, logistic regression, LDA, decision trees, Naïve Bayes, K‑Nearest Neighbors, LVQ, SVM, random forest, and boosting—highlighting their mathematical basis, typical applications, advantages, and limitations.

Artificial IntelligenceData Sciencemachine learning
0 likes · 12 min read
Top 10 Machine Learning Algorithms: Concepts, Uses, and Key Characteristics
DaTaobao Tech
DaTaobao Tech
Mar 24, 2023 · Artificial Intelligence

Leveraging Popularity Bias with Decoupled Unbiased Recall Models

In a March 27 livestream, Alibaba senior algorithm engineer Chen Zhihong will explain how popularity bias affects recommendation pipelines, review existing mitigation techniques, and introduce a decoupled domain‑adaptive unbiased dual‑tower recall model that leverages bias while preserving recommendation fairness.

Unbiased Recallmachine learningpopularity bias
0 likes · 2 min read
Leveraging Popularity Bias with Decoupled Unbiased Recall Models
DataFunSummit
DataFunSummit
Mar 22, 2023 · Artificial Intelligence

Sales Forecasting in Alibaba Health's Pharmaceutical E‑commerce: Business Background, Algorithm Solutions, and Scenario Exploration

The article details a comprehensive presentation on Alibaba Health's pharmaceutical e‑commerce sales forecasting, covering supply‑chain challenges, the evolution of time‑series prediction methods, a full data‑to‑model pipeline, change‑point detection, handling imbalanced data, multi‑model fusion, and specialized seasonal and long‑sequence forecasting techniques.

Alibaba HealthSales ForecastingSupply Chain
0 likes · 16 min read
Sales Forecasting in Alibaba Health's Pharmaceutical E‑commerce: Business Background, Algorithm Solutions, and Scenario Exploration
Model Perspective
Model Perspective
Mar 21, 2023 · Artificial Intelligence

Master Linear Discriminant Analysis (LDA) with Python: Theory & Code

This article explains Linear Discriminant Analysis (LDA) as a pattern‑recognition technique that projects data onto a low‑dimensional space to maximize class separation, details its mathematical formulation with between‑class and within‑class scatter matrices, and provides a complete Python implementation using scikit‑learn on the Iris dataset, including visualization of the results.

LDALinear Discriminant AnalysisPython
0 likes · 6 min read
Master Linear Discriminant Analysis (LDA) with Python: Theory & Code
Python Programming Learning Circle
Python Programming Learning Circle
Mar 21, 2023 · Artificial Intelligence

A Survey of 10 Python Libraries for Explainable AI (XAI)

This article introduces Explainable AI (XAI), outlines its importance, describes a step-by-step workflow, and reviews ten Python libraries—including SHAP, LIME, ELI5, Shapash, Anchors, BreakDown, Interpret‑Text, AI Explainability 360, OmniXAI, and XAI—providing usage examples and code snippets.

Pythonexplainable AImachine learning
0 likes · 12 min read
A Survey of 10 Python Libraries for Explainable AI (XAI)
Baidu Geek Talk
Baidu Geek Talk
Mar 20, 2023 · Artificial Intelligence

How Graph Neural Networks Boost Anti‑Cheat in User Referral Activities

This article analyzes the use of graph neural network models, including GCN and multi‑graph SCGCN, to tackle cheating in referral‑based user acquisition by capturing user relationships, improving sample purity, and achieving up to a 50% increase in cheat‑sample recall.

GCNSCGCNanti-cheat
0 likes · 12 min read
How Graph Neural Networks Boost Anti‑Cheat in User Referral Activities
Efficient Ops
Efficient Ops
Mar 14, 2023 · Artificial Intelligence

How NetEase Games Built an AIOps Platform to Transform IT Operations

This article explains how NetEase Games leveraged AI, big data, and machine learning to create an AIOps platform that automates anomaly detection, log analysis, and fault localization, improving quality assurance, cost management, and operational efficiency across complex gaming infrastructures.

IT Operationsaiopsanomaly detection
0 likes · 12 min read
How NetEase Games Built an AIOps Platform to Transform IT Operations
58 Tech
58 Tech
Mar 14, 2023 · Artificial Intelligence

Dialogue Robot Technology Practices in Recruitment Platforms

The DataFun Summit 2023 featured a presentation by senior AI engineer Sang Hailong on building and applying dialogue robot technologies—including QABot, TaskBot, intent recognition, and intelligent double‑call—in 58.com’s recruitment platform, with experimental insights on ChatGPT integration.

AIChatGPTChatbot
0 likes · 3 min read
Dialogue Robot Technology Practices in Recruitment Platforms
Python Crawling & Data Mining
Python Crawling & Data Mining
Mar 11, 2023 · Artificial Intelligence

How to Overcome Data Scarcity in Machine Learning: Strategies and Techniques

Facing data scarcity in machine learning, this article explores why large datasets are essential, categorizes missing data and label gaps, and presents practical solutions such as dataset reuse, augmentation, multimodal learning, curriculum learning, semi‑supervised methods, active learning, transfer and meta‑learning to mitigate the problem.

Meta Learningdata augmentationdata scarcity
0 likes · 19 min read
How to Overcome Data Scarcity in Machine Learning: Strategies and Techniques
21CTO
21CTO
Mar 10, 2023 · Artificial Intelligence

Inside OpenAI’s Robotics: Lilian’s Journey, AGI Vision, and AI Safety Insights

The interview with OpenAI Robotics researcher Lilian reveals the team’s gender makeup, her work on robot hands, reinforcement‑learning breakthroughs, applied AI safety projects, bias mitigation efforts, and how personal learning blogs fuel continuous innovation in artificial intelligence.

AGIOpenAIRobotics
0 likes · 11 min read
Inside OpenAI’s Robotics: Lilian’s Journey, AGI Vision, and AI Safety Insights
Python Programming Learning Circle
Python Programming Learning Circle
Mar 6, 2023 · Operations

Intelligent Operations: AI‑Driven Anomaly Detection, Alarm Compression, and Log Analysis Techniques

This article presents an AI‑enhanced operations framework that combines metric anomaly detection, alarm compression, log anomaly detection, and intelligent analysis using machine learning methods such as DBSCAN clustering, SARIMAX modeling, Apriori association rules, and LSTM‑based log parsing to improve fault detection and reduce operational costs.

Operationsaiopsanomaly detection
0 likes · 15 min read
Intelligent Operations: AI‑Driven Anomaly Detection, Alarm Compression, and Log Analysis Techniques
DataFunSummit
DataFunSummit
Mar 3, 2023 · Artificial Intelligence

Intelligent Risk Control System Architecture and Development Trends

This article introduces the architecture of intelligent risk control, detailing its four-layer structure, the underlying data, feature, model, and decision components, platform interactions, and future development trends, highlighting how AI and big data enhance risk management efficiency and accuracy.

Big DataDecision Systemsfeature engineering
0 likes · 12 min read
Intelligent Risk Control System Architecture and Development Trends
Architects' Tech Alliance
Architects' Tech Alliance
Mar 2, 2023 · Artificial Intelligence

In‑Depth Analysis of AI Servers for ChatGPT: Architecture, Costs, and Market Trends

This article provides a comprehensive technical overview of AI servers used for large‑scale models like ChatGPT, covering GPU‑centric architectures, classification by application and chip type, hardware cost breakdowns, market demand forecasts, domestic vendor strengths, and the impact of export restrictions on advanced accelerator chips.

AI serversChatGPTExport restrictions
0 likes · 17 min read
In‑Depth Analysis of AI Servers for ChatGPT: Architecture, Costs, and Market Trends
DataFunSummit
DataFunSummit
Mar 1, 2023 · Artificial Intelligence

Automating High-Fidelity Digital Human Creation: Scanning, Driving, and Remaining Challenges

The article details YINGMOU's research on automating the production of high‑fidelity digital humans, covering their rapid 3‑5‑day pipeline, extensive face‑asset database, advanced light‑field scanning, automatic topology reconstruction, AI‑driven rigging, dynamic mapping, and the unresolved issues of hair and cloth.

AI automationPBR materialsfacial rigging
0 likes · 12 min read
Automating High-Fidelity Digital Human Creation: Scanning, Driving, and Remaining Challenges
Python Programming Learning Circle
Python Programming Learning Circle
Mar 1, 2023 · Artificial Intelligence

Introducing Streamlit: A Free Open‑Source Framework for Building Machine‑Learning Apps with Python

Streamlit is a free, open‑source Python framework that lets machine‑learning engineers quickly turn scripts into interactive apps, offering features such as top‑down script execution, widget‑as‑variable handling, caching, GPU support, and seamless integration with version‑control tools, all without requiring separate frontend development.

App DevelopmentData visualizationOpen-source
0 likes · 9 min read
Introducing Streamlit: A Free Open‑Source Framework for Building Machine‑Learning Apps with Python
Top Architect
Top Architect
Mar 1, 2023 · Artificial Intelligence

Understanding the Internals of ChatGPT: Neural Networks, Embeddings, and Training Techniques

This article provides a comprehensive overview of how ChatGPT works, covering its probabilistic text generation, transformer architecture, embedding representations, neural network training processes, and the underlying principles that enable large language models to produce coherent and meaningful human-like language.

AIChatGPTNeural Networks
0 likes · 80 min read
Understanding the Internals of ChatGPT: Neural Networks, Embeddings, and Training Techniques
Bilibili Tech
Bilibili Tech
Feb 28, 2023 · Artificial Intelligence

High‑Quality Automatic Speech Recognition (ASR) Solutions at Bilibili: Data, Model, and Deployment Optimizations

Bilibili’s high‑quality ASR system combines large‑scale filtered business data, semi‑supervised Noisy‑Student training, an end‑to‑end CTC model with lattice‑free MMI decoding, and FP16‑optimized FasterTransformer inference on Triton, delivering top‑ranked accuracy, low latency, and scalable deployment for diverse Chinese‑English video content.

ASRBilibiliEnd-to-End
0 likes · 18 min read
High‑Quality Automatic Speech Recognition (ASR) Solutions at Bilibili: Data, Model, and Deployment Optimizations
DataFunTalk
DataFunTalk
Feb 26, 2023 · Artificial Intelligence

Interactive Recommendation System for Meituan Food Delivery: Architecture, Challenges, and Evaluation

This article details Meituan's interactive recommendation system for its food‑delivery homepage feed, covering the motivation, challenges, system architecture, user intent modeling, evaluation metrics, experimental results, and future directions, illustrating how real‑time, user‑centric recommendations improve conversion and user experience.

Meituanfood deliveryinteractive recommendation
0 likes · 25 min read
Interactive Recommendation System for Meituan Food Delivery: Architecture, Challenges, and Evaluation
DaTaobao Tech
DaTaobao Tech
Feb 24, 2023 · Artificial Intelligence

Data Preprocessing and Statistical Analysis Techniques in Python

The article reviews essential Python data‑preprocessing and statistical‑analysis tools—including missing‑value imputation, outlier trimming, scaling, binning, knee‑point detection, correlation, chi‑square testing, linear regression, Wilson scoring, PCA weighting, text tokenization and sentiment analysis, plus visualization with matplotlib/seaborn and big‑data access via pyodps.

PythonStatistical Analysismachine learning
0 likes · 17 min read
Data Preprocessing and Statistical Analysis Techniques in Python
DataFunTalk
DataFunTalk
Feb 17, 2023 · Artificial Intelligence

Full‑Chain Linkage Techniques for Alibaba Mama Display Advertising: From Precise Value Estimation to Set‑Selection Models

The article presents a comprehensive technical roadmap for Alibaba Mama's display advertising cascade ranking system, introducing full‑chain linkage, precise‑value estimation models (PDM, ESDM) and set‑selection approaches (LDM, LBDM), and demonstrates how these innovations jointly improve CTR and RPM while outlining future research directions.

Advertisingmachine learningpre‑ranking
0 likes · 25 min read
Full‑Chain Linkage Techniques for Alibaba Mama Display Advertising: From Precise Value Estimation to Set‑Selection Models
DataFunTalk
DataFunTalk
Feb 16, 2023 · Artificial Intelligence

Differences Between Advertising Algorithms and Recommendation Algorithms

This article compares advertising and recommendation algorithms, highlighting distinct optimization goals, model design focuses, training methods, implementation principles, auxiliary strategies, and model characteristics, emphasizing how ads aim to increase revenue while recommendations prioritize user engagement and diversity.

AdvertisingCTRalgorithm
0 likes · 5 min read
Differences Between Advertising Algorithms and Recommendation Algorithms
vivo Internet Technology
vivo Internet Technology
Feb 15, 2023 · Artificial Intelligence

Optimizing CDN Bandwidth Utilization and Cost Reduction with Predictive Control (Yugong Platform)

By leveraging the Yugong Platform’s predictive control—combining Prophet‑based threshold forecasts, custom real‑time bandwidth models, and a token‑bucket mechanism—to smooth peaks and fill valleys, enterprises can dramatically improve CDN bandwidth utilization, automate adjustments, and substantially lower peak‑based billing costs.

CDNCost reductionFlow Control
0 likes · 23 min read
Optimizing CDN Bandwidth Utilization and Cost Reduction with Predictive Control (Yugong Platform)
Tencent Cloud Developer
Tencent Cloud Developer
Feb 10, 2023 · Artificial Intelligence

Technical Overview of Claude's RLAIF Approach and Comparison with ChatGPT

Claude, Anthropic’s ChatGPT‑like assistant, employs Constitutional AI and a Reinforcement‑Learning‑from‑AI‑Feedback (RLAIF) pipeline that substitutes costly human‑ranked data with AI‑generated critiques and revisions, yielding comparable reasoning ability to ChatGPT while markedly increasing harmlessness through transparent rule‑based training, chain‑of‑thought prompting, and open‑source reproducible methods.

AI AlignmentChatGPTClaude
0 likes · 19 min read
Technical Overview of Claude's RLAIF Approach and Comparison with ChatGPT
Architect's Guide
Architect's Guide
Feb 9, 2023 · Artificial Intelligence

Why ChatGPT Is So Powerful: A Technical Overview of NLP Model Evolution

This article explains why ChatGPT performs so well by tracing the evolution of natural‑language processing from rule‑based grammars through statistical n‑gram models to neural architectures like RNNs, LSTMs, attention mechanisms, Transformers, and the massive data and training methods that power modern large language models.

ChatGPTNLPTransformer
0 likes · 14 min read
Why ChatGPT Is So Powerful: A Technical Overview of NLP Model Evolution
Cloud Native Technology Community
Cloud Native Technology Community
Feb 7, 2023 · Cloud Native

Machine Learning‑Based Optimization of Kubernetes Resources

This article explains how machine learning can be applied to automatically optimize CPU and memory settings in Kubernetes clusters, covering both experiment‑driven and observation‑driven approaches, step‑by‑step procedures, best‑practice recommendations, and the benefits of combining both methods for efficient, scalable cloud‑native operations.

KubernetesPerformanceResource Optimization
0 likes · 11 min read
Machine Learning‑Based Optimization of Kubernetes Resources
21CTO
21CTO
Feb 6, 2023 · Artificial Intelligence

Understanding the Transformer: How Attention Powers ChatGPT and Modern AI

This article breaks down the Transformer architecture behind ChatGPT, explaining its attention mechanism, embedding, positional encoding, and multi‑head self‑attention, while highlighting the model's impact on AI research, data requirements, and future innovations.

Artificial IntelligenceAttention MechanismChatGPT
0 likes · 18 min read
Understanding the Transformer: How Attention Powers ChatGPT and Modern AI
DataFunSummit
DataFunSummit
Feb 5, 2023 · Artificial Intelligence

Key Takeaways from the Causal Inference Summit: Motivation, Applications, Challenges, and Links to A/B Testing, Machine Learning, and Deep Learning

After attending the DataFun causal inference summit, this article outlines why causal analysis matters, its typical use cases, practical challenges, its relationship with A/B testing, and how it integrates with machine learning and deep learning to improve decision‑making and model robustness.

A/B testingDeep LearningUplift Modeling
0 likes · 10 min read
Key Takeaways from the Causal Inference Summit: Motivation, Applications, Challenges, and Links to A/B Testing, Machine Learning, and Deep Learning
DataFunSummit
DataFunSummit
Feb 4, 2023 · Artificial Intelligence

Walle: An End‑to‑End, General‑Purpose, Scalable Edge‑Cloud Collaborative Machine Learning System

The article introduces Walle, Alibaba's four‑year‑old edge‑cloud collaborative machine‑learning platform that unifies compute containers, data pipelines, and a deployment platform to enable low‑latency, privacy‑preserving, and high‑throughput AI services across billions of mobile devices, and presents its architecture, design challenges, and evaluation results.

Cloud ComputingEdge ComputingMobile AI
0 likes · 25 min read
Walle: An End‑to‑End, General‑Purpose, Scalable Edge‑Cloud Collaborative Machine Learning System
DataFunSummit
DataFunSummit
Feb 2, 2023 · Artificial Intelligence

Exploring Super Automation in JD Supply Chain: Architecture, Applications, and Future Outlook

This article presents JD's super automation approach for its supply chain, detailing the business background, challenges, AI‑driven forecasting, procurement, intelligent allocation, inventory clearing, integrated decision making, and future directions toward fully automated, optimal end‑to‑end operations.

JD.comSupply Chainforecasting
0 likes · 17 min read
Exploring Super Automation in JD Supply Chain: Architecture, Applications, and Future Outlook
DataFunSummit
DataFunSummit
Feb 1, 2023 · Artificial Intelligence

OpenAI’s AI Text Classifier: Features, Limitations, and Real‑World Tests

OpenAI released an AI text classifier that predicts whether a passage was generated by AI, but its accuracy is limited—especially for short texts—so the article examines its training data, performance metrics, practical tests, comparisons with other detectors, and discusses broader implications for plagiarism and content creation.

AI detectionChatGPTOpenAI
0 likes · 16 min read
OpenAI’s AI Text Classifier: Features, Limitations, and Real‑World Tests
DataFunSummit
DataFunSummit
Feb 1, 2023 · Artificial Intelligence

Clustering-Based Global LSTM Models for Large-Scale Time Series Forecasting

The paper proposes clustering thousands of related time series and training separate global LSTM models for each cluster, showing that this reduces heterogeneity, leverages shared information, and improves forecasting accuracy compared to individual models, with extensive experiments on CIF2016 and NN5 datasets.

LSTMRNNclustering
0 likes · 33 min read
Clustering-Based Global LSTM Models for Large-Scale Time Series Forecasting
DataFunSummit
DataFunSummit
Jan 31, 2023 · Big Data

Data-Driven Production and Operations Management Decision-Making

This presentation explores how enterprises can transform massive operational data into precise decisions by applying statistical, machine‑learning, and optimization techniques across four topics: data‑to‑decision pipelines, promotion analysis metrics, long‑tail product pricing, and fast‑yet‑accurate unmanned‑warehouse management.

Data-drivenmachine learningoptimization
0 likes · 13 min read
Data-Driven Production and Operations Management Decision-Making
DataFunTalk
DataFunTalk
Jan 27, 2023 · Artificial Intelligence

GNN for Science: Foundations, Applications, and Recent Advances in Equivariant Graph Neural Networks

This article reviews the role of graph neural networks in AI for science, covering background, the evolution of GNN models, applications in physics and biomedicine, recent advances in Euclidean equivariant GNNs, and the authors' own contributions such as GMN and GROVER, concluding with key distinctions between traditional GNNs and science‑focused approaches.

AI for ScienceMolecular Representationequivariant GNN
0 likes · 16 min read
GNN for Science: Foundations, Applications, and Recent Advances in Equivariant Graph Neural Networks
DataFunTalk
DataFunTalk
Jan 25, 2023 · Artificial Intelligence

Between Heaven and Earth: Reflections of an Algorithm Engineer

The article argues that algorithm engineers should move beyond a narrow focus on deep‑learning models, emphasizing the importance of system architecture, data quality, and thoughtful problem framing to break through performance plateaus in advertising and recommendation systems.

AdvertisingData QualitySystem Architecture
0 likes · 10 min read
Between Heaven and Earth: Reflections of an Algorithm Engineer
DataFunSummit
DataFunSummit
Jan 18, 2023 · Artificial Intelligence

Interview on the Current State, Challenges, and Future Trends of Graph Algorithms

This interview summarizes experts' insights on graph algorithm technology, covering its early industrial adoption, data scale and sparsity challenges, various graph types and models, application scenarios such as recommendation and risk control, R&D workflow hurdles, and emerging research directions like pre‑training, explainability, and combinatorial optimization.

ApplicationsFuture Trendsgraph algorithms
0 likes · 14 min read
Interview on the Current State, Challenges, and Future Trends of Graph Algorithms
Model Perspective
Model Perspective
Jan 12, 2023 · Artificial Intelligence

Neural Networks Explained: Architecture, Training, and Reinforcement Basics

This article introduces neural networks, covering their layered structure, common types like CNNs and RNNs, key components such as activation functions, loss, learning rate, backpropagation, dropout, batch normalization, and extends to reinforcement learning concepts including MDPs, policies, value functions, and Q‑learning.

CNNNeural NetworksRNN
0 likes · 6 min read
Neural Networks Explained: Architecture, Training, and Reinforcement Basics
Efficient Ops
Efficient Ops
Jan 9, 2023 · Operations

How Guotai Junan’s AIOps Platform Achieved Top‑Tier Evaluation in Intelligent Operations

Guotai Junan’s Intelligent Operations Service Platform, powered by AI‑driven AIOps, passed the China Academy of Information and Communications Technology’s excellence assessment for anomaly detection, showcasing advanced data‑driven monitoring, digital‑transformation initiatives, and future plans for fault prediction, self‑healing, and comprehensive operations intelligence.

Digital TransformationIT OperationsIntelligent Operations
0 likes · 15 min read
How Guotai Junan’s AIOps Platform Achieved Top‑Tier Evaluation in Intelligent Operations
Efficient Ops
Efficient Ops
Jan 9, 2023 · Operations

Guotai Junan’s AIOps Success: Inside the Award‑Winning Intelligent Operations Platform

The article explains how AIOps—AI‑driven IT operations—has become a strategic trend, details Guotai Junan’s award‑winning intelligent operations platform that achieved the top‑level “exception detection” evaluation, and shares interview insights on implementation, challenges, and future directions.

Digital TransformationIT OperationsIntelligent Operations
0 likes · 16 min read
Guotai Junan’s AIOps Success: Inside the Award‑Winning Intelligent Operations Platform
58UXD
58UXD
Jan 9, 2023 · Artificial Intelligence

How to Transform VR Scenes into Visually Stunning Experiences in Four Simple Steps

This article outlines a four‑step workflow to improve VR scene quality—covering hardware upgrades, optimized shooting setups, advanced image preprocessing, and custom color‑filter styling—demonstrating how systematic enhancements and blind testing can produce visually appealing, high‑quality VR experiences.

Image ProcessingPipelineVR
0 likes · 6 min read
How to Transform VR Scenes into Visually Stunning Experiences in Four Simple Steps
DataFunTalk
DataFunTalk
Jan 7, 2023 · Artificial Intelligence

How to Better Leverage Data in Causal Inference

This presentation introduces two recent works from Ant Group that improve causal inference by explicitly using historical control data to reduce selection bias and by fusing heterogeneous multi‑source data, describing the GBCT and WMDL methods, their theoretical foundations, experimental results, and practical applications in finance.

Bias Correctioncausal inferencedata fusion
0 likes · 18 min read
How to Better Leverage Data in Causal Inference
DaTaobao Tech
DaTaobao Tech
Jan 6, 2023 · Artificial Intelligence

Two‑Stage Ranking Optimization in E‑commerce Search: From Coarse to Fine Ranking

The paper presents a two‑stage e‑commerce search framework where the coarse‑ranking stage is redesigned with multi‑objective optimization, expanded negative sampling, and listwise distillation—guided by a new global transaction hitrate metric—enabling it to surpass fine‑ranking on large candidate sets and boost overall GMV by about one percent.

Metricscoarse rankinge‑commerce
0 likes · 25 min read
Two‑Stage Ranking Optimization in E‑commerce Search: From Coarse to Fine Ranking
AntTech
AntTech
Jan 3, 2023 · Artificial Intelligence

Ray: The Distributed Framework Powering the Next Generation of Generative AI

Ray, an open‑source distributed computing framework originally created by Berkeley's RiseLab and heavily contributed to by Ant Group, underpins many AI workloads—from privacy‑preserving federated learning to large‑scale model training for ChatGPT—making it a critical yet often overlooked engine of the generative AI revolution.

OpenAIRaymachine learning
0 likes · 7 min read
Ray: The Distributed Framework Powering the Next Generation of Generative AI
DataFunTalk
DataFunTalk
Dec 28, 2022 · Artificial Intelligence

Automated Feature Engineering and Modeling for Credit Risk: A DataFun Case Study

This article explains how DataFun’s automated feature engineering and modeling platform dramatically reduces credit‑risk model development time from weeks to days by standardizing feature creation, integrating popular algorithms such as LR, XGBoost and LightGBM, and providing comprehensive evaluation, deployment and monitoring capabilities.

AIModel MonitoringRFM
0 likes · 14 min read
Automated Feature Engineering and Modeling for Credit Risk: A DataFun Case Study
DataFunTalk
DataFunTalk
Dec 26, 2022 · Artificial Intelligence

A Review of Causal Inference Methods: Potential Outcomes, Structural Causal Models, and Recent Advances

This article reviews the two main streams of causal inference—potential‑outcome (Rubin) models and structural causal (Pearl) diagrams—covers classic techniques such as A/B testing, instrumental variables, matching, difference‑in‑differences, synthetic controls, matrix completion, heterogeneous treatment effect estimation, and discusses modern machine‑learning‑based approaches and causal discovery algorithms.

A/B testingcausal inferenceeconometrics
0 likes · 33 min read
A Review of Causal Inference Methods: Potential Outcomes, Structural Causal Models, and Recent Advances
ITPUB
ITPUB
Dec 21, 2022 · Databases

How OpenMLDB Guarantees Real‑Time, Consistent Features for Machine Learning at Scale

This article explains the data and feature engineering challenges of deploying machine learning, introduces OpenMLDB’s open‑source architecture—including offline Spark‑based processing, a high‑availability online engine with dual‑layer memory indexes, snapshot/binlog persistence, and pre‑aggregation techniques—then showcases real‑world case studies and the project’s roadmap.

Feature StoreOpenMLDBReal-time analytics
0 likes · 15 min read
How OpenMLDB Guarantees Real‑Time, Consistent Features for Machine Learning at Scale
Bilibili Tech
Bilibili Tech
Dec 20, 2022 · Industry Insights

Can Recommendation Algorithms Speed Up Test Case Prioritization? A Bilibili Case Study

This article presents a detailed study on applying recommendation‑system techniques to test case prioritization for Bilibili's mobile apps, describing the problem definition, evaluation metrics, data processing, FM model selection, experimental results, practical deployment, and future research directions.

BilibiliSoftware Testingfactorization machine
0 likes · 13 min read
Can Recommendation Algorithms Speed Up Test Case Prioritization? A Bilibili Case Study
Volcano Engine Developer Services
Volcano Engine Developer Services
Dec 15, 2022 · Artificial Intelligence

How Adaptive Transfer Kernels Boost Low‑Resource Regression: IEEE TPAMI Insights

The paper introduces adaptive transfer kernel learning for transfer Gaussian process regression, defines transfer kernels mathematically, proposes three generalized forms and two improved kernels, proves their positive‑semi‑definiteness, and demonstrates superior performance on low‑resource regression tasks through extensive experiments.

Gaussian Processkernel methodslow-resource regression
0 likes · 9 min read
How Adaptive Transfer Kernels Boost Low‑Resource Regression: IEEE TPAMI Insights
JD Retail Technology
JD Retail Technology
Dec 12, 2022 · Artificial Intelligence

Keynote Presentations from the 2022 Global AI Technology Conference – First Industrial Vision Frontier Forum

The 2022 Global AI Technology Conference’s First Industrial Vision Frontier Forum in Hangzhou gathered leading experts to discuss advances in industrial AI visual defect detection, multimodal pre‑training models, smart meteorology, digital intelligence in retail, third‑generation compound semiconductor detection, meta‑imaging, and broader industrial AI applications, highlighting the future of intelligent manufacturing.

AIIndustrial VisionMeta Imaging
0 likes · 12 min read
Keynote Presentations from the 2022 Global AI Technology Conference – First Industrial Vision Frontier Forum
Tencent Tech
Tencent Tech
Dec 9, 2022 · Artificial Intelligence

How Tencent’s Angel PowerFL Team Dominated iDASH with Homomorphic Encryption

Tencent’s Angel PowerFL team clinched the iDASH homomorphic encryption champion and secured top spots in MPC and SGX tracks, showcasing innovative privacy‑preserving machine‑learning models, CKKS‑based encrypted inference, and a scalable SGX clustering solution that push the boundaries of secure computation.

Homomorphic EncryptionPrivacy ComputingiDASH
0 likes · 5 min read
How Tencent’s Angel PowerFL Team Dominated iDASH with Homomorphic Encryption
DataFunSummit
DataFunSummit
Dec 6, 2022 · Artificial Intelligence

Multimodal Reasoning, Logic Inference, and Machine Learning: An Integrated Survey

This article surveys the development of artificial intelligence from symbolic and connectionist perspectives, covering deductive and inductive reasoning, multimodal and cross‑modal inference, knowledge‑graph reasoning, text and visual understanding, and their applications in causal inference, dialogue consistency, and security vulnerability analysis.

Knowledge GraphsMultimodal Reasoningcausal inference
0 likes · 18 min read
Multimodal Reasoning, Logic Inference, and Machine Learning: An Integrated Survey
DataFunTalk
DataFunTalk
Dec 4, 2022 · Artificial Intelligence

Key Insights on Causal Inference: Motivation, Applications, Challenges, and Links to A/B Testing, ML, and Deep Learning

This article summarizes the motivations behind causal inference, its typical business applications such as intelligent decision‑making and prediction, the practical challenges of validation and data, and its relationship with A/B testing, machine learning, and deep learning, providing a concise overview for newcomers.

AB testingBusiness AnalyticsDeep Learning
0 likes · 10 min read
Key Insights on Causal Inference: Motivation, Applications, Challenges, and Links to A/B Testing, ML, and Deep Learning
ELab Team
ELab Team
Nov 29, 2022 · Frontend Development

How to Build Real-Time Mouse Gesture Recognition with TensorFlow.js

This article explains how to design, implement, and evaluate a mouse gesture recognition system using machine learning and geometric analysis, covering data preprocessing, model training with TensorFlow.js, cosine‑similarity matching, performance optimizations, and extensions to three‑dimensional VR/AR environments.

Cosine SimilarityTensorFlow.jsWeb Development
0 likes · 32 min read
How to Build Real-Time Mouse Gesture Recognition with TensorFlow.js
MaGe Linux Operations
MaGe Linux Operations
Nov 26, 2022 · Artificial Intelligence

The Timeless Foundations of Machine Learning: 6 Core Algorithms Explained

Andrew Ng’s latest AI newsletter article revisits six foundational machine‑learning algorithms—linear regression, logistic regression, gradient descent, neural networks, decision trees, and k‑means clustering—tracing their historical origins, core concepts, and lasting impact on modern AI applications.

Decision TreesNeural Networksgradient descent
0 likes · 20 min read
The Timeless Foundations of Machine Learning: 6 Core Algorithms Explained
Alibaba Cloud Developer
Alibaba Cloud Developer
Nov 25, 2022 · Artificial Intelligence

How Multi‑Objective Optimization Boosted Taobao Search’s Coarse Ranking

This report details the multi‑stage architecture of Taobao’s main search, introduces a new global‑transaction hitrate metric, analyzes offline and online evaluation gaps, and presents a series of model, loss‑function, and sampling improvements that together lifted overall conversion by about one percent.

coarse rankinge‑commercemachine learning
0 likes · 26 min read
How Multi‑Objective Optimization Boosted Taobao Search’s Coarse Ranking
NetEase Cloud Music Tech Team
NetEase Cloud Music Tech Team
Nov 18, 2022 · Artificial Intelligence

Machine Learning-Based Anomaly Detection for Core Business Metrics

The paper proposes a containerized, machine‑learning framework that fuses rule‑based and XGBoost‑driven anomaly detection to monitor daily active users on a cloud music platform, achieving 89 % recall, 81 % precision and up to 74 % recall improvement over traditional threshold methods, while outlining future model refinement and broader metric applicability.

3-sigmaData IntelligenceHolt-Winters
0 likes · 11 min read
Machine Learning-Based Anomaly Detection for Core Business Metrics
Hulu Beijing
Hulu Beijing
Nov 18, 2022 · Artificial Intelligence

How Video Search Engines Rank Results: From Click Models to Multi‑Goal Optimization

This article explains the architecture of video search engine ranking, covering optimization objectives such as relevance, click‑through rate and watch time, and detailing pointwise, pairwise and listwise learning approaches, model training pipelines, and online serving strategies.

click-through ratemachine learningmulti-objective optimization
0 likes · 17 min read
How Video Search Engines Rank Results: From Click Models to Multi‑Goal Optimization
DataFunTalk
DataFunTalk
Nov 17, 2022 · Artificial Intelligence

Enhance the Visual Representation via Discrete Adversarial Training

The Alibaba AAIG team proposes Discrete Adversarial Training (DAT), which leverages VQGAN‑based discretization to generate natural‑looking adversarial samples that improve visual representation robustness and transferability across classification, self‑supervised learning, and object detection tasks without sacrificing accuracy, achieving new state‑of‑the‑art results on multiple benchmarks.

Computer VisionRobustnessVisual Representation
0 likes · 12 min read
Enhance the Visual Representation via Discrete Adversarial Training
Baidu Intelligent Testing
Baidu Intelligent Testing
Nov 15, 2022 · Artificial Intelligence

Risk‑Driven Delivery and Quality Assessment Model for Automated Testing at Baidu

This article describes Baidu's risk‑driven delivery approach, detailing the three current testing challenges, the design of a quality‑assessment system that uses machine‑learning (logistic regression) for risk identification, control and decision, and the resulting improvements in testing efficiency and quality.

AISoftware Testingautomation
0 likes · 14 min read
Risk‑Driven Delivery and Quality Assessment Model for Automated Testing at Baidu
Practical DevOps Architecture
Practical DevOps Architecture
Nov 15, 2022 · Fundamentals

Comprehensive Programming Course Curriculum Overview

This article presents a detailed curriculum covering programming fundamentals, web development (HTML, CSS, JavaScript, jQuery), backend frameworks (Flask, Django), database concepts, data analysis with Python (NumPy, pandas, matplotlib, seaborn), machine learning, AI, and related project tutorials, along with a brief promotional note.

CurriculumData Sciencemachine learning
0 likes · 12 min read
Comprehensive Programming Course Curriculum Overview
Baidu Tech Salon
Baidu Tech Salon
Nov 14, 2022 · Artificial Intelligence

How Risk‑Driven Delivery Boosts Test Efficiency with AI‑Powered Quality Models

This article analyzes Baidu's risk‑driven delivery approach, detailing how machine‑learning models identify, control, and decide on testing risks, replace manual judgments, improve test efficiency and quality, and deliver measurable savings and bug interceptions across large‑scale software projects.

Risk managementSoftware Testingautomated delivery
0 likes · 13 min read
How Risk‑Driven Delivery Boosts Test Efficiency with AI‑Powered Quality Models
Model Perspective
Model Perspective
Nov 13, 2022 · Fundamentals

9 Compelling Reasons to Choose Python Over Matlab for Modeling

Python outshines Matlab in cost, openness, community support, cross‑platform flexibility, extensive libraries, especially for machine learning, and offers a richer ecosystem of IDEs and flexible coding styles, making it a superior, free alternative for modern scientific and engineering modeling tasks.

MATLABOpen-sourcePython
0 likes · 4 min read
9 Compelling Reasons to Choose Python Over Matlab for Modeling
iQIYI Technical Product Team
iQIYI Technical Product Team
Nov 11, 2022 · Artificial Intelligence

Award-Winning AI Dubbing System for Film and TV by iQIYI

iQIYI’s award‑winning AI dubbing system, IQDubbing, uses deep‑learning voice conversion, voiceprint, facial and audio processing to cut dubbing time to one‑sixth, slash costs, secure 21 patents, and has already dubbed over 90 films and 200 TV series across multiple languages.

AIDubbingInnovation
0 likes · 3 min read
Award-Winning AI Dubbing System for Film and TV by iQIYI
Zuoyebang Tech Team
Zuoyebang Tech Team
Nov 9, 2022 · Artificial Intelligence

Boost Data Annotation Efficiency with VAPAL: Active Learning Meets Virtual Adversarial Perturbation

This article explains how a pool‑based active learning framework that combines uncertainty sampling (using BADGE, ALPS, or virtual adversarial perturbations) with diversity‑driven clustering can dramatically cut labeling costs for Transformer‑based NLP models, and presents experimental results showing VAPAL’s competitive performance and early‑stage advantages.

NLPactive learningdata annotation
0 likes · 10 min read
Boost Data Annotation Efficiency with VAPAL: Active Learning Meets Virtual Adversarial Perturbation
JD Cloud Developers
JD Cloud Developers
Nov 7, 2022 · Artificial Intelligence

Detecting Time‑Series Anomalies Without Thresholds Using LSTM and Unsupervised Fusion

This article presents a threshold‑free anomaly detection framework for streaming time series that combines an LSTM‑based baseline module with an unsupervised detection module, detailing the architecture, training process, data preprocessing, and experimental results that demonstrate superior accuracy and F1 scores.

Deep LearningLSTMTime Series
0 likes · 15 min read
Detecting Time‑Series Anomalies Without Thresholds Using LSTM and Unsupervised Fusion
Airbnb Technology Team
Airbnb Technology Team
Nov 3, 2022 · Artificial Intelligence

T-LEAF: A Taxonomy Learning and Evaluation Framework for Airbnb Community Support Classification System

The T‑LEAF framework introduces quantitative metrics for coverage, usefulness, and consistency to iteratively develop Airbnb’s unified Contact‑Reason taxonomy, enabling faster feedback loops, reducing “Other” classifications, and improving both human annotation agreement and machine‑learning prediction accuracy in production.

Evaluation Frameworkclassificationcommunity support
0 likes · 14 min read
T-LEAF: A Taxonomy Learning and Evaluation Framework for Airbnb Community Support Classification System
Model Perspective
Model Perspective
Oct 30, 2022 · Artificial Intelligence

How ALE Plots Overcome Partial Dependence Limitations in ML

The Accumulated Local Effect (ALE) plot, introduced by Daniel W. Apley in 2016, addresses the correlation issue inherent in Partial Dependence Plots, offering unbiased, faster, and more accurate feature impact visualizations for machine‑learning models, especially in domains like financial risk control.

ALEfeature importancemachine learning
0 likes · 9 min read
How ALE Plots Overcome Partial Dependence Limitations in ML
Model Perspective
Model Perspective
Oct 26, 2022 · Artificial Intelligence

Master Machine Learning Algorithms: Types, Python Code & Real-World Examples

This article categorizes machine learning algorithms into supervised, unsupervised, and reinforcement learning, then details ten common algorithms—including linear regression, logistic regression, decision trees, SVM, Naive Bayes, K‑NN, K‑means, random forest, and dimensionality reduction—accompanied by clear Python code examples and illustrative diagrams.

AlgorithmsPythonUnsupervised Learning
0 likes · 14 min read
Master Machine Learning Algorithms: Types, Python Code & Real-World Examples
Model Perspective
Model Perspective
Oct 21, 2022 · Artificial Intelligence

How Explainable Boosting Machines (EBM) Combine Accuracy and Interpretability

Explainable Boosting Machines (EBM) integrate boosting trees into generalized additive models, using the FAST algorithm to efficiently detect high‑impact pairwise interactions, delivering near‑state‑of‑the‑art accuracy while preserving strong global and local interpretability, as demonstrated on breast‑cancer data.

Explainable Boosting MachineFAST algorithmgeneralized additive model
0 likes · 10 min read
How Explainable Boosting Machines (EBM) Combine Accuracy and Interpretability
DataFunSummit
DataFunSummit
Oct 19, 2022 · Artificial Intelligence

Series Six of the Integer Intelligence Autonomous Driving Dataset Collection – Overview and Highlights

This article presents a comprehensive overview of several publicly available autonomous driving datasets, focusing on Series Six of the Integer Intelligence collection, which includes StreetLearn, UTBM RoboCar, Multi‑Vehicle Stereo Event Camera, comma2k19, the Annotated Laser Dataset, Ford, and Oxford RobotCar, detailing their sources, download links, publication years, key features, and research relevance.

Computer VisionDatasetsRobotics
0 likes · 10 min read
Series Six of the Integer Intelligence Autonomous Driving Dataset Collection – Overview and Highlights
21CTO
21CTO
Oct 18, 2022 · Information Security

Can Google’s New KataOS Deliver a Provably Secure Embedded AI Platform?

Google’s experimental KataOS, built with Rust atop the formally verified seL4 microkernel and paired with the Sparrow reference implementation, aims to provide a provably secure operating system for embedded machine‑learning workloads, highlighting collaborations, architectural choices, and its place among historic microkernels.

KataOSRustSecure OS
0 likes · 5 min read
Can Google’s New KataOS Deliver a Provably Secure Embedded AI Platform?
Model Perspective
Model Perspective
Oct 17, 2022 · Fundamentals

Unlocking Bayes' Theorem: From Basics to Real-World Applications

Bayes' theorem, a cornerstone of probability theory, relates prior knowledge, likelihood, and evidence to compute posterior probabilities, highlighting why prior and likelihood differ, and explaining concepts such as prior, likelihood, posterior, and evidence with intuitive examples and their relevance to sequential data analysis.

Bayes theoremmachine learningposterior
0 likes · 5 min read
Unlocking Bayes' Theorem: From Basics to Real-World Applications
Baidu Geek Talk
Baidu Geek Talk
Oct 17, 2022 · Artificial Intelligence

OCR Technology: PaddleOCR and Paddle.js Integration

The article explains OCR fundamentals and details how Baidu’s open‑source PaddleOCR suite can be converted and run in browsers via the @paddlejs‑models/ocr SDK, describing model initialization, detection and CRNN‑based recognition pipelines, and presenting benchmark results that show the newer ch_PP‑OCRv2 model achieving higher accuracy and faster inference than the mobile variant.

AIComputer VisionOCR
0 likes · 9 min read
OCR Technology: PaddleOCR and Paddle.js Integration
Xianyu Technology
Xianyu Technology
Oct 13, 2022 · Artificial Intelligence

Design of a Generalized Recommendation Platform for Xianyu Marketplace

The article presents a generalized recommendation platform for Xianyu Marketplace that consolidates feature processing, candidate generation, recall, scoring, and experiment management into shared core components, enabling rapid onboarding of new scenarios, reducing engineering effort, and delivering over 8% CTR lift and 10% more impressions.

Xianyumachine learningonline marketplace
0 likes · 12 min read
Design of a Generalized Recommendation Platform for Xianyu Marketplace
DataFunTalk
DataFunTalk
Oct 12, 2022 · Artificial Intelligence

Feature Embedding Modeling for Recommendation Systems: Techniques, Models, and Practical Insights from Weibo

This article presents a comprehensive overview of feature embedding modeling in recommendation systems, discussing the necessity of feature modeling, three technical directions (gate threshold, variable‑length embeddings, and enrichment), detailed descriptions of models such as FiBiNet, FiBiNet++, ContextNet, and MaskNet, experimental findings, and a Q&A session that addresses practical challenges and future work.

CTR modelsWeibofeature embedding
0 likes · 34 min read
Feature Embedding Modeling for Recommendation Systems: Techniques, Models, and Practical Insights from Weibo