Skip to content
kumo

kumo, from the command line

Crawl a whole host into structured data. One pure-Go binary, no API key, output that pipes into the rest of your tools.

Crawl a whole host into structured data.

kumo scrape example.com --max-pages 20   # crawl a host into $HOME/data
kumo pages example.com -o table          # read back what you crawled
kumo page https://example.com/ -o json   # structure a single page

Each page is written as pages/<host>/<path>.md: a JSON front-matter block with the title, description, canonical, language, dates, OpenGraph, JSON-LD, and outbound links, followed by the main content as Markdown.

Where to go next

Getting started Install kumo, learn the model, and run your first command. Guides Task-oriented walkthroughs for the things people do with kumo.