Any API to search Google Cache?
Asked Answered
T

2

7

I am trying to search within Google Cache, so I need to fire this query:

http://webcache.googleusercontent.com/search?q=cache:news.ycombinator.com/news+hacker+news

And get some content like timestamp from the page. But when I do this using curl (ruby), it gives a permission denied error, i.e. scraping is blocked and that was expected.

So, is there any way to search google cache (either an API or some kind of workaround scraping) and extract some information like timestamp?

Tapetum answered 23/10, 2010 at 4:28 Comment(0)
T
2

I didn't get any API but I can scrape it using hpricot or nokogiri in rails (curl in Rails gives permission denied error). I will put up the code once I figure out how to extract the time stamp from the above URL using these gems.

Any one has a better solution?

Tapetum answered 24/10, 2010 at 21:2 Comment(0)
B
0

This looks like it could do it: http://code.google.com/apis/soapsearch/reference.html#1_2

Bought answered 13/1, 2012 at 11:11 Comment(1)
"The SOAP Search API is no longer available."Deuteronomy

© 2022 - 2024 — McMap. All rights reserved.