Repository History
Explore all analyzed open source repositories
Topic: Crawler

Trafilatura: Advanced Web Scraping and Text Extraction in Python
Trafilatura is a robust Python package and command-line tool designed for gathering text and metadata from the web. It simplifies web crawling, scraping, and content extraction, transforming raw HTML into structured data. Widely adopted by major companies and institutions, it offers high efficiency and accuracy for various text processing needs.
May 1, 2026
View Details

Scrapling: An Undetectable, Powerful, and Adaptive Python Web Scraping Library
Scrapling is a high-performance Python library designed for effortless web scraping. It stands out with its adaptive capabilities, automatically adjusting to website changes, and advanced stealth features to bypass anti-bot systems. This makes it a robust solution for modern web data extraction needs.
Oct 11, 2025
View Details
Page 1