The Core

Why We Are Here => Hardware & Technology => Topic started by: Gurtie on April 30, 2015, 09:02:01 PM

Title: bulk broken link checker
Post by: Gurtie on April 30, 2015, 09:02:01 PM
I'm looking for a broken link checker which will allow me to upload a list of domains and will run through the queue providing a standard broken links check for each - xenu or screamingfrog but without having to check everything one at a time.

Anyone seen one? Happy to pay fo a one off or a subscription service if absolutely necessary
Title: Re: bulk broken link checker
Post by: rcjordan on April 30, 2015, 09:40:05 PM
Found one that says it does multi-site. No idea if it works.

http://www.deadlinkchecker.com/login.asp
Title: Re: bulk broken link checker
Post by: littleman on May 01, 2015, 01:24:26 AM
I'm pretty sure you could give Xenu a list of URLs from multiple sites from multiple sites and that it can hit them in parallel.
Title: Re: bulk broken link checker
Post by: Gurtie on May 01, 2015, 06:23:07 AM
Xenu - yup was playing with that but it seems to stop after the first site when I add a list

deadlink - thanks, will check it out

there's a github linkchecker at http://wummel.github.io/linkchecker/index.html if anyone else is looking for this - but I want to try and find something I can deal with myself before launching into trying to get a developer on it!
Title: Re: bulk broken link checker
Post by: rcjordan on May 01, 2015, 10:46:41 AM
>github linkchecker

I read over that one first, but could find nothing in the doc that indicated multi-domain crawling.  There is reference to a config setting (--stdin : Read list of white-space separated URLs to check from stdin) that has possibilities but it doesn't appear to crawl, just check specified pages.

Anther option that *might* work is regex, but that looks to be mostly for output, not input.
Title: Re: bulk broken link checker
Post by: JasonD on May 01, 2015, 11:21:43 AM
Ahrefs filter domains by 404
Title: Re: bulk broken link checker
Post by: Adam C on May 01, 2015, 03:02:46 PM
Deepcrawl.  Get an agency account.  Very good tool IMO.
Title: Re: bulk broken link checker
Post by: Rooftop on May 08, 2015, 02:29:10 PM
I've always meant to try 80legs for something like this, but never quite found the time to work it out.
Title: Re: bulk broken link checker
Post by: rcjordan on May 08, 2015, 02:35:38 PM
>80legs for something like this, but never quite found the time to work it out.

Ditto
Title: Re: bulk broken link checker
Post by: Rumbas on May 09, 2015, 02:24:25 PM
Waddabout Garreth: http://www.brokenlinkbuilding.com/
Title: Re: bulk broken link checker
Post by: JasonD on May 09, 2015, 02:38:44 PM
I'm playing around with a new code base.

Is there interest in 404s then?
Title: Re: bulk broken link checker
Post by: Gurtie on May 09, 2015, 03:43:24 PM
my orignal reason for asking was that a client has been contacted by someone asking him to part with £6500 a year for a login to a system which 'checks whether your sites have broken links'

For many reasons this is poor value for money for him, but as he has around 350 sites (all run from single CMS but managed by individual site managers and on seperate urls) I was looking for something easier than setting him up with 350 instances of WT to do the same job - if I could upload a list of the root url's and get back a list of pages on those sites with broken links on them once a quarter that would save him a fortune.

I think he's already actually signed up so less urgency now, but clearly there is a market for it - because he's signed up. I presume it has some type of pretty dashboard too (which I would have no need of)

sstrangely since I posted this a couple of other people have mentioned needing something similar for different reasons, but suspect they'll code their own. Its basically only taking a standard single site version and adding a queue to it, isn't it?

Title: Re: bulk broken link checker
Post by: Rooftop on May 09, 2015, 08:20:24 PM
If it is for their own site they surely don't need to crawl. That makes the job pretty easy. Extract anything with an http, put it in a table and start curling.
Title: Re: bulk broken link checker
Post by: Gurtie on May 10, 2015, 06:29:17 AM
yeah, but there's 350 sites.

Whatever way we check the data,  I don't want to sit there and do whatever it is 350 times. I just want to upload a list of domains and come back to a list of some type.