Skip to main content

2 docs tagged with "data-extraction"

View all tags

[DRAFT] MinerU

MinerU is an open-source, high-quality tool for converting PDF documents to Markdown and JSON formats. It's designed to provide precise document content extraction with advanced AI-powered parsing capabilities.

APIFY Scraper

APIFY is a powerful web scraping and automation platform that provides a comprehensive suite of tools for data extraction, web automation, and data processing. It offers both cloud-based and self-hosted solutions for various scraping needs.