Tagged articles

Bright Data

5 articles · Page 1 of 1
DataFunSummit
DataFunSummit
Jun 19, 2026 · Artificial Intelligence

Mastering Data Acquisition for AI Agents: From Crawler Pitfalls to MCP Browser Control

The article distills three Bright Data webinars, detailing how to overcome traditional web‑crawling challenges with an adaptive Crawler API, integrate the Model Context Protocol (MCP) for human‑like browser control, and build a LangGraph‑powered AI search engine while addressing compliance, billing, and scaling considerations.

AI agentsAPI billingBright Data
0 likes · 15 min read
Mastering Data Acquisition for AI Agents: From Crawler Pitfalls to MCP Browser Control
DataFunSummit
DataFunSummit
Feb 13, 2025 · Big Data

E‑commerce Data Scraping: Fundamentals, Tools, Python Scripts, and Challenges

This tutorial explains e‑commerce web scraping fundamentals, covering definitions, tool types, data categories, step‑by‑step Python script creation with Requests, BeautifulSoup, and Selenium, provides sample code for Amazon, Walmart, and eBay, discusses challenges like dynamic pages and anti‑scraping measures, and recommends using specialized scraping APIs.

Bright DataPythonSelenium
0 likes · 15 min read
E‑commerce Data Scraping: Fundamentals, Tools, Python Scripts, and Challenges
DataFunTalk
DataFunTalk
Jun 14, 2024 · Artificial Intelligence

Midjourney’s Diverse Data Sources: Public Datasets, Academic Research, Partner and Proprietary Data

Midjourney enhances its AI models by integrating a wide range of data sources—including public datasets like ImageNet and COCO, academic research from top conferences, partner collaborations, and its own proprietary data—while continuously updating and managing these datasets for quality, privacy, and security.

AI trainingBright DataCOCO
0 likes · 9 min read
Midjourney’s Diverse Data Sources: Public Datasets, Academic Research, Partner and Proprietary Data