Tagged articles
128 articles
MaGe Linux Operations
May 17, 2017 · Big Data

How Big Data Turns Raw Information into Resource Optimization

The article argues that the ultimate value of big data lies in optimizing resource allocation: data is first crowdsourced at scale, then mined thoroughly for reliable insights, and finally applied across industries such as transportation, advertising, and finance.

Big Data · Resource Optimization · crowdsourcing
0 likes · 7 min read
Architects' Tech Alliance
May 2, 2017 · Big Data

Data Mining and Innovation in the Adult Entertainment Industry

The article examines how extensive data collection and analysis of adult performers and their content reveal surprising demographic patterns and drive innovative business models, product development, and technology adaptations within the adult entertainment industry, illustrating the practical impact of big‑data insights beyond traditional sectors.

Adult Industry · Analytics · Innovation
0 likes · 13 min read
21CTO
Apr 16, 2017 · Artificial Intelligence

Build a Movie Recommendation System with Pearson Correlation in Python

This article demonstrates a Python-based movie recommendation approach that crawls Douban user data, categorizes ratings, computes Pearson correlation to identify like‑minded users, and generates weighted movie suggestions, complete with code snippets for data handling, similarity calculation, and recommendation generation.

Pearson Correlation · Python · data mining
0 likes · 7 min read
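The similarity and weighting steps named in this summary can be sketched as follows. This is an illustration only, not the article's code: the `RATINGS` data, the function names `pearson` and `recommend`, and the choice to ignore users with non-positive correlation are all assumptions made for the example.

```python
from math import sqrt


def pearson(ratings_a, ratings_b):
    """Pearson correlation over the items both users rated (0.0 if undefined)."""
    shared = [item for item in ratings_a if item in ratings_b]
    n = len(shared)
    if n < 2:
        return 0.0
    xs = [ratings_a[item] for item in shared]
    ys = [ratings_b[item] for item in shared]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a user who rates everything the same has no correlation
    return cov / sqrt(var_x * var_y)


def recommend(target, all_ratings, top_n=3):
    """Rank movies the target has not rated by similarity-weighted average score."""
    weighted_sum = {}
    similarity_sum = {}
    seen = all_ratings[target]
    for user, ratings in all_ratings.items():
        if user == target:
            continue
        sim = pearson(seen, ratings)
        if sim <= 0:
            continue  # assumption: only like-minded users contribute
        for movie, score in ratings.items():
            if movie in seen:
                continue
            weighted_sum[movie] = weighted_sum.get(movie, 0.0) + sim * score
            similarity_sum[movie] = similarity_sum.get(movie, 0.0) + sim
    ranked = sorted(
        ((weighted_sum[m] / similarity_sum[m], m) for m in weighted_sum),
        reverse=True,
    )
    return [(movie, round(score, 2)) for score, movie in ranked[:top_n]]


# Hypothetical ratings on a 1-5 scale, standing in for crawled user data.
RATINGS = {
    "alice": {"A": 5, "B": 3, "C": 4},
    "bob": {"A": 4, "B": 2, "C": 3, "D": 5},
    "carol": {"A": 1, "B": 5, "C": 2, "D": 1},
}
```

With this toy data, `recommend("alice", RATINGS)` draws only on `bob`, whose ratings move in step with `alice`'s, and skips `carol`, whose correlation with `alice` is negative.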
Meitu Technology
Apr 6, 2017 · Artificial Intelligence

Meitu Internet Technology Salon: AI and Machine Learning Applications in Practice

The fourth Meitu Internet Technology Salon showcased practical AI and machine learning uses, highlighting Meipai’s text anti‑spam, hot‑topic detection, sentiment analysis, and personalized video search, while Baidu demonstrated ML‑driven business intelligence tools for multi‑source data mining, user profiling, and intelligent enterprise and HR management.

Artificial Intelligence · Business Intelligence · Sentiment Analysis
0 likes · 7 min read
Architects' Tech Alliance
Nov 24, 2016 · Big Data

Data Mining Overview: Process, Techniques, and Model Evaluation

This article provides a comprehensive introduction to data mining, covering its definition, goal setting, data sampling, exploration, preprocessing, pattern discovery, model building, evaluation methods, and the main analytical techniques such as classification, regression, clustering, association rules, feature and deviation analysis, and web mining.

Model Evaluation · association rules · classification
0 likes · 10 min read
StarRing Big Data Open Lab
Oct 20, 2016 · Artificial Intelligence

How Collaborative Filtering Powers Recommendations: From Manhattan to Cosine Similarity

This article walks through the fundamentals of recommendation systems, explaining collaborative filtering and various similarity measures—including Manhattan, Euclidean, Minkowski, Pearson correlation, and cosine similarity—while discussing their suitability for dense, sparse, or biased rating data and introducing K‑Nearest Neighbors for practical implementation.

collaborative filtering · data mining · machine learning
0 likes · 15 min read
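The distance measures listed in this summary are closely related: Manhattan and Euclidean distance are the Minkowski distance with order 1 and 2, while cosine similarity compares direction rather than magnitude. A minimal sketch follows, assuming two users' ratings are given as equal-length lists over the same items; the function names are chosen for the example and are not taken from the article.

```python
from math import sqrt


def minkowski(a, b, r):
    """Minkowski distance of order r; r=1 is Manhattan, r=2 is Euclidean."""
    return sum(abs(x - y) ** r for x, y in zip(a, b)) ** (1 / r)


def cosine_similarity(a, b):
    """Cosine of the angle between two rating vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)
```

Because cosine similarity depends only on the angle between the vectors, `[1, 2]` and `[2, 4]` count as identical in direction even though every Minkowski distance between them is nonzero.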
Architects Research Society
Sep 25, 2016 · Big Data

Overview of Data Mining Tasks, Processes, and Related Machine Learning Techniques

Data mining, an interdisciplinary field of computer science, involves tasks such as anomaly detection, clustering, classification, and regression, follows standardized processes like KDD, CRISP-DM, and SEMMA, and often leverages machine learning techniques—including supervised, unsupervised, and reinforcement learning—to extract valuable insights from complex datasets.

CRISP-DM · KDD · SEMMA
0 likes · 8 min read
Architecture Digest
Aug 15, 2016 · Big Data

Understanding Data: Types, Systems, and Big Data Technologies

This article explains what data is, classifies it into structured, semi‑structured and unstructured forms, describes data mining, databases, data warehouses, the full data lifecycle, and surveys the big‑data ecosystem including storage, batch and real‑time processing, resource scheduling, and visualization technologies.

Lambda architecture · data engineering · data mining
0 likes · 22 min read
Java High-Performance Architecture
Jun 22, 2016 · Databases

Why Apache TinkerPop Is Becoming a Top Graph Computing Framework

Apache TinkerPop, now a top-level Apache project, offers a powerful graph computing framework with Gremlin, supporting real-time transactional processing and batch analytics across languages, scalable from single machines to massive clusters, making it essential for data mining, analysis, and large‑scale graph applications.

Gremlin · TinkerPop · data mining
0 likes · 4 min read
ITPUB
Jun 11, 2016 · Big Data

How 58 Daojia Leverages User Portraits to Boost Operations and Fight Fraud

This article details 58 Daojia's data‑driven approach to building user‑portrait tags, covering tag construction, evaluation, and practical applications such as personalized recommendations, anti‑fraud measures, coupon distribution, and dynamic pricing, while outlining the underlying big‑data architecture and technical challenges.

Big Data · anti-fraud · data mining
0 likes · 18 min read
21CTO
Mar 18, 2016 · Artificial Intelligence

10 Essential Tips for Building High‑Performance Intelligent Recommendation Systems

This article outlines ten practical key points—including leveraging explicit and implicit feedback, hybridizing algorithms, handling temporal and geographic factors, exploiting social ties, solving cold‑start issues, optimizing presentation, defining clear metrics, ensuring real‑time updates, and scaling big‑data processing—to help engineers design effective intelligent recommendation systems.

cold start · data mining · evaluation
0 likes · 18 min read
Meitu Technology
Mar 11, 2016 · Artificial Intelligence

Meipai Personalized Recommendation Technology Journey

As Meipai’s user base exploded, the platform shifted from manual curation to sophisticated personalized recommendation algorithms—leveraging machine‑learning and data‑mining techniques, iterating through multiple generations, overcoming scalability and relevance challenges, and delivering rapid solutions that inspire future recommendation system designs.

Meipai · Recommendation Algorithm · algorithm evolution
0 likes · 1 min read
Qunar Tech Salon
Feb 6, 2016 · Big Data

An Introduction to Data Mining Algorithms and Their Real-World Applications

This article introduces the main types of data‑mining algorithms—classification, prediction, clustering, and association—explains supervised and unsupervised learning, and illustrates each with practical examples such as spam detection, tumor cell identification, wine quality assessment, fraud detection, recommendation systems, and more.

association analysis · classification · clustering
0 likes · 15 min read
Architect
Feb 1, 2016 · Big Data

An Introduction to Data Mining Algorithms and Their Real-World Applications

This article introduces the main types of data‑mining algorithms—classification, prediction, clustering, and association—explains supervised and unsupervised learning, and illustrates each with practical examples such as spam detection, tumor identification, wine quality assessment, fraud detection, recommendation systems, and authorship analysis.

anomaly detection · classification · data mining
0 likes · 14 min read
dbaplus Community
Dec 25, 2015 · Artificial Intelligence

Detecting Fraudulent ModemPOOL Terminals with K‑Means Clustering

This article details how telecom operators can identify fraudulent ModemPOOL (cat‑pool) terminals and predict churn using data‑driven clustering and day‑interval warning models, covering metric selection, data exploration, k‑means clustering, model deployment, and performance evaluation.

K-Means · Model Deployment · RFM
0 likes · 18 min read
Qunar Tech Salon
Aug 14, 2015 · Big Data

The Nine Laws of Data Mining: Principles, Processes, and Insights

This article presents nine fundamental laws of data mining—covering goals, knowledge, preparation, experimentation, patterns, insight, prediction, value, and change—explaining how business objectives and domain expertise drive each stage of the CRISP‑DM process and why technical metrics alone cannot guarantee success.

CRISP-DM · Predictive Modeling · business knowledge
0 likes · 19 min read
Suning Technology
Jun 16, 2015 · Artificial Intelligence

How Suning Leverages Query Logs to Auto‑Discover Product Synonyms

Suning’s search team automatically extracts domain‑specific synonym pairs from massive query‑click logs using candidate extraction, multi‑feature similarity calculations, and Apriori pattern mining, dramatically improving e‑commerce search recall and user experience.

data mining · e‑commerce · query logs
0 likes · 6 min read
Qunar Tech Salon
Mar 15, 2015 · Artificial Intelligence

Overview of Common Classification Algorithms in Data Mining

This article introduces the concepts of classification and prediction in data mining, outlines their workflow, and provides concise explanations of six widely used classification techniques—decision trees, K‑Nearest Neighbour, Support Vector Machine, Vector Space Model, Bayesian methods, and neural networks—highlighting their principles, advantages, and limitations.

Bayesian · data mining · decision tree
0 likes · 9 min read
Qunar Tech Salon
Mar 14, 2015 · Artificial Intelligence

Common Distance and Similarity Measures in Machine Learning and Data Mining

This article reviews the most frequently used distance and similarity formulas in machine learning and data mining, explaining their definitions, mathematical properties, practical examples, and when each metric is appropriate for measuring differences between data points or probability distributions.

Cosine Similarity · KL divergence · Mahalanobis distance
0 likes · 13 min read
Baidu Tech Salon
Oct 21, 2014 · Big Data

Baidu's Big Data Intelligence: From Data to Intelligence - QCon2014 Presentation

At QCon2014, Baidu Research’s Shen Zhiyong showcased the company’s massive big‑data engine—20,000 PB storage and daily processing of up to 100 PB—highlighting open platforms like Baidu Brain and real‑world prediction projects for tourism, the World Cup, disease outbreaks, and UN collaborations, while urging industry‑wide data‑driven transformation.

Baidu · Data Intelligence · data mining
0 likes · 8 min read