Frequently Asked Questions
Why are my validations slow?
Rocket Validator speed depends on the speed at which your validated server can respond. To avoid clogging your servers, we rate limit our validations to 5 simultaneous connections, so the faster your server respond, the faster we can validate it.
Why can't some sites be scraped?
When you submit an URL for validation, we send a web spider to visit the site and get its links. Sometimes, this process that is called "scraping" fails, and it can be due to a number of reasons. Here are the most typical ones:
- Authorization required
- Sites must be publicly accessible to be validated.
- Could not be found (404)
- The URL you provided could not be found.
- Connection timed out
- The site is taking too long to respond. Retry again later, when the remote server has less traffic.
- Too many requests to remote server
- The remote server is denying us access because it is rate limiting us. Contact your sys admin to relax this rate limiting, or try again later.
- Blocked
- The remote server is blocking our requests. Consult the "If Rocket Validator is being blocked, how can I allow it to access my site?" section below to explicitly allow our requests.
- No internal links found
- The document returned for the starting URL contains no links, or the links are not on the same host. The spider will only follow internal links from the starting URL.
If none of this explains the reason why your site is not being scraped, contact us and we'll help you find the cause.
Can I validate pages that require authorization?
No, there is no direct way of validating pages that require login. Rocket Validator is a distributed system with different internal pieces: our own web crawler, the HTML validator and the A11Y validator. Requests to your web pages come from these three different origins, so authentication handling would need to be implemented in all of them, and would need to cover the different authentication systems most web sites implement.
Instead, we recommend other approaches to checking protected sites:
-
You can validate your local development server directly. This also makes it faster to fix the issues, as you don’t need to deploy them.
-
You can have a separate server that doesn’t need authentication.
-
You can have a copy of the templates for the restricted pages in a different directory that doesn't require authentication.
Does Rocket Validator cache pages?
No, we don't use any kind of cache neither on the link scraper, nor on the validator. If you're changing your page and you're getting the same validation results, the possible reason can be:
- Your server is using cache, like for example Varnish.
- Your pages are being rendered depending on the User-Agent string - you might be seeing your changes on the browser, but your server might be serving different content to the validator. The default User-Agent string for the validator is
Validator.nu/LV http://validator.w3.org/services
How long are reports stored?
Reports are stored for actively subscribed users for a limited time, depending on your subscription plan. See the pricing table. Once your subscription expires, reports are automatically deleted - you need an active subscription to keep your reports stored.
Can I share reports with other people?
Sure, you can share your validation reports with anyone. Just use one of the buttons at the bottom of the summary and you'll be able to send anyone the URL of the report. They'll be able to see the validation results just like you - but they won't be able to re-check any page or modify anything, only you can control this.
Can I export reports to Excel, CSV, or PDF?
You can download your reports data in Excel format. See this blog post for more information on exporting report data.
How can I exclude Rocket Validator from Google Analytics traffic?
You can detect the User Agent string, and exclude it from Google Analytics.
If Rocket Validator is being blocked, how can I allow it to access my site?
When initiating a site validation, Rocket Validator will scan the web pages in your site using its different systems. The link spider will find internal links, the HTML validator will check the HTML code, and the A11Y validator will check the accessibility of the page.
These processes can generate a high volume of visits to your site, which in some cases may result in Rocket Validator being blocked by your server.
To prevent this, you can add Rocket Validator to the list of allowed requests on your server in one of two ways:
- You can add the IP addresses of the Rocket Validator servers to the list of allowed requests on your server.
- You can add the User-Agent string to the list of allowed requests on your server.
To proceed, please consult with your system administrators to add the following IP addresses and/or User Agents to the list of allowed requests: