Hello
I need a program that can find pages on WordPress, BlogEngine and Movable Type blogs.
It has to work like this:
I input my keyword list (up to 500 different keywords) and then the proxies to use for searching. A feature for finding working proxies would be great but is not required. The program must then search the web (Google, Yahoo, etc.) for blogs of the type I specify, for example only WordPress blogs (identified by "Powered by WordPress" or similar). Once it has found a WordPress blog, it must harvest all pages from that domain that have a comment form, then move on to the next domain it found and do the same.
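To illustrate the kind of check I have in mind, here is a rough Python sketch of detecting the blog platform and a comment form from a page's HTML. The footprint strings, marker phrases and function names are only my assumptions for illustration; real blogs word these differently, and the final program does not have to be built this way.

```python
import re

# Hypothetical footprint strings; real blogs may word these differently.
PLATFORM_FOOTPRINTS = {
    "wordpress": ("powered by wordpress",),
    "blogengine": ("powered by blogengine",),
    "movabletype": ("powered by movable type",),
}

# Hypothetical phrases that suggest a comment form is on the page.
COMMENT_MARKERS = ("leave a comment", "leave a reply", "post a comment")


def detect_platform(html):
    """Return the first platform whose footprint appears in the page, else None."""
    text = html.lower()
    for platform, footprints in PLATFORM_FOOTPRINTS.items():
        if any(footprint in text for footprint in footprints):
            return platform
    return None


def has_comment_form(html):
    """Heuristic: the page has a <form> tag and one of the marker phrases."""
    text = html.lower()
    has_form = re.search(r"<form\b", text) is not None
    return has_form and any(marker in text for marker in COMMENT_MARKERS)
```

A page would only be saved when detect_platform() returns the blog type I selected and has_comment_form() is true for it.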
All of this has to be quick, and to avoid temporary IP bans it must support proxies for this kind of work.
I'm not looking for only 10 results here; I need a lot of results in a timely manner.
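As an example of the proxy support I mean, this is a minimal Python sketch that rotates through the proxy list round-robin, using a new proxy for each request. The class name ProxyRotator and the proxy addresses in the usage note are made up for illustration; it only uses the standard urllib.request module and is not meant as the required design.

```python
import itertools
import urllib.request


class ProxyRotator:
    """Hand out proxies from a list in round-robin order."""

    def __init__(self, proxies):
        if not proxies:
            raise ValueError("at least one proxy is required")
        self._cycle = itertools.cycle(proxies)

    def next_proxy(self):
        """Return the next proxy URL, wrapping around at the end of the list."""
        return next(self._cycle)

    def build_opener(self):
        """Return a urllib opener that sends HTTP and HTTPS through the next proxy."""
        proxy = self.next_proxy()
        handler = urllib.request.ProxyHandler({"http": proxy, "https": proxy})
        return urllib.request.build_opener(handler)
```

Usage would look something like rotator = ProxyRotator(["http://10.0.0.1:8080", "http://10.0.0.2:8080"]) followed by rotator.build_opener().open(url, timeout=10) for each search or page request, so no single IP sends too many requests in a row.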
My suggestion for how it could work:
1: Input keywords
2: Search for the keywords and check every URL found by loading its first page. If "Powered by WordPress" exists on that page, go on to harvest all pages from the domain (maybe with a site:[login to view URL] query, etc.).
3: Save all the pages it finds that have a comment form ("leave a comment" etc.).
4: While doing this, it must check the URLs saved earlier for duplicates, so it doesn't generate a huge file of duplicates.
5: Once done, the whole list can be exported to a .txt or Excel file.
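For steps 4 and 5, something along these lines is what I picture: a rough Python sketch that skips URLs already saved and exports the list. The names normalize_url and UrlStore are hypothetical, and what counts as a duplicate here (same URL apart from letter case of the host, a trailing slash, or a #fragment) is my own assumption. The CSV export is a stand-in for "Excel", since Excel opens .csv files.

```python
import csv
from urllib.parse import urlsplit, urlunsplit


def normalize_url(url):
    """Lower-case scheme and host, drop the fragment and any trailing slash."""
    parts = urlsplit(url.strip())
    path = parts.path.rstrip("/") or "/"
    return urlunsplit(
        (parts.scheme.lower(), parts.netloc.lower(), path, parts.query, "")
    )


class UrlStore:
    """Keep harvested URLs in order, ignoring ones already seen."""

    def __init__(self):
        self._seen = set()
        self.urls = []

    def add(self, url):
        """Save the URL and return True, or return False if it is a duplicate."""
        key = normalize_url(url)
        if key in self._seen:
            return False
        self._seen.add(key)
        self.urls.append(key)
        return True

    def export_txt(self, path):
        """Write one URL per line to a plain text file."""
        with open(path, "w", encoding="utf-8") as fh:
            for url in self.urls:
                fh.write(url + "\n")

    def export_csv(self, path):
        """Write one URL per row to a CSV file that Excel can open."""
        with open(path, "w", encoding="utf-8", newline="") as fh:
            csv.writer(fh).writerows([url] for url in self.urls)
```

Checking against a set keeps the duplicate test fast even when the list grows to many thousands of URLs.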
Only bid if you can do this.
Thanks