Tagged articles
4 articles
Page 1 of 1
Data Thinking Notes
Data Thinking Notes
Nov 12, 2023 · Artificial Intelligence

Unlocking LLM Power: Semantic Search, Private Knowledge Bases, and Text‑to‑SQL for Data Teams

This article explores how large language models can boost data workflows by using embeddings for semantic retrieval, building domain‑specific knowledge bases for private Q&A, generating SQL code from natural language, and automating exploratory data analysis, offering practical steps and visual examples.

EmbeddingKnowledge BaseLLM
0 likes · 7 min read
Unlocking LLM Power: Semantic Search, Private Knowledge Bases, and Text‑to‑SQL for Data Teams
Model Perspective
Model Perspective
Oct 12, 2022 · Fundamentals

Mastering Model‑Centric Statistics: From Exploratory Analysis to Bayesian Inference

This article explains how statistics—through data collection, exploratory analysis, descriptive metrics, visualization, and model‑centric inference—provides a framework for understanding and predicting phenomena, emphasizing the role of programming (e.g., Python) and Bayesian modeling principles such as simplicity and the Occam razor.

Data ScienceModelingexploratory data analysis
0 likes · 6 min read
Mastering Model‑Centric Statistics: From Exploratory Analysis to Bayesian Inference
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 23, 2022 · Fundamentals

Master Python Data Visualization: Line, Scatter, Histogram, and Heatmap Techniques

This guide walks you through creating various Python data visualizations—including line charts, scatter plots, histograms, bar, pie, and heatmaps—using pandas and seaborn, demonstrates code examples with the Iris, American Community Survey, and Boston housing datasets, and explains how to interpret the results.

MatplotlibSeabornexploratory data analysis
0 likes · 8 min read
Master Python Data Visualization: Line, Scatter, Histogram, and Heatmap Techniques
Python Crawling & Data Mining
Python Crawling & Data Mining
Jun 17, 2020 · Big Data

What Do Heart‑Disease Data Reveal? A Python‑Driven Exploratory Analysis

This article walks through a Python‑based exploratory analysis of a public heart‑disease dataset, loading the data, describing its 14 clinical features, visualizing gender, age, heart‑rate, blood‑pressure and cholesterol relationships, and presenting correlation insights to help understand patterns of disease prevalence.

Pythoncorrelationexploratory data analysis
0 likes · 18 min read
What Do Heart‑Disease Data Reveal? A Python‑Driven Exploratory Analysis