Loading [MathJax]/extensions/tex2jax.js

Translate

Thursday, August 13, 2020

Reverse Engineering Google News Links for RSS Feeds

I was about to add an RSS feed for Reuters, but they decided to stop supporting it. Either way, I stumbled upon a procedure and some tools to scrap RSS feeds from Google News. Artem Bugara:
Base URL: https://news.google.com/rss/search
Add up the following to the url.
Query Parameter: q=when:24h+allinurl:reuters.com
That’s how you let Google know what you need.

when parameter is responsible of fetching the last X hours articles

allinurl parameter restricts search results to documents that contain all of the query words in the document URL


Country and language: ceid=US:en

If you need a Reuters feed from another country then just change the URL to one with a subdomain, and the country & language parameter.
Now, lets dive into some examples.

https://news.google.com/rss/search?q=when:24h+allinurl:reuters.com&ceid=US:en&hl=en-US&gl=US

https://news.google.com/rss/search?q=when:24h+allinurl:ru.reuters.com&hl=ru&gl=RU&ceid=RU:ru

https://news.google.com/rss/search?q=when:24h+allinurl:nacion.com&hl=es-419&gl=US&ceid=US:es-419

Perhaps the previous examples give a better notion on the given construction.

This procedure is pretty useful since many popular platforms that once supported RSS have stopped supporting it by mere conceit. RSS gives vast possibilities for delivering information, but it seems these publishers (like Reuters) have gotten greedy enough to delete their feeds in order to feed income by advertising or tracking. Now they publish their 'feeds' on their social media channels, in detriment of privacy and online safety.


By using RSS, you protect your privacy on the web, and optimize the information you obtain from the internet in a mailbox fashion.

No comments:

Post a Comment