Beaty is a web scraper that grabs content from your website and returns the data in a number of formats. Beaty is for small to medium websites, and simple to use. Just enter the URL of your website.
Beaty is in beta, and at present, there's a limit of 70 pages, and English is the only language supported. This is likely to change once Beaty is no longer in beta.
Beaty can't hope to crawl every website, but at present we're running at around a 96% success rate. We have noticed a few common issues you should avoid if possible.
Show your URLs
Sometimes Beaty can't find links from your website because they're hidden, possibly behind jQuery or similar. Beaty looks for anchor tags. Google is likely to have the same problem.
Use absolute rather than relative URLs
A relative URL is a URL that doesn't explicitly contain the protocol and domain. Some examples.
Relative URL. Example 1
Relative URL. Example 2
Relative URLs are perfectly valid, but can be confusing for search engine bots and indeed Beaty. Google recommends, where possible, using absolute rather than relative links.
If you have to use relative URLs, make them obvious. Include a preceding backslash and a relative path, as in Example 2.
We offer a range of flexible support
plans, including free.
Terms of Service
You must have a legal right to the content returned by Beaty, in that it should be your own content, open source, Creative Commons or similar. If in doubt, see the Beaty section of our T&Cs