
feedscraper

This tool is a Node.js-based command-line utility that converts RSS feeds to e-mails.

It scrapes the RSS feeds, then follows each item's link to the article and extracts the relevant content from the configured DOM node (see node in the configuration).
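The first half of that step, listing the articles a feed points to, can be sketched with the standard library alone. This is a simplified illustration, not the code in feedscraper.js: the names `listItems`, `readElement` and `decodeEntities` are hypothetical, and the regular expressions only cope with plain RSS 2.0 `<item>` elements. A real implementation would more likely use an XML parser, and the second half (picking the configured node out of the article page) needs an HTML parser that is not shown here.

```javascript
// Hypothetical helper: decode the five predefined XML entities.
function decodeEntities(text) {
  return text
    .replace(/&lt;/g, '<')
    .replace(/&gt;/g, '>')
    .replace(/&quot;/g, '"')
    .replace(/&apos;/g, "'")
    .replace(/&amp;/g, '&'); // last, so "&amp;lt;" stays "&lt;"
}

// Hypothetical helper: text of the first <name> element in an XML fragment,
// or null when the element is absent. CDATA sections are unwrapped as-is.
function readElement(xml, name) {
  const match = xml.match(new RegExp(`<${name}\\b[^>]*>([\\s\\S]*?)</${name}>`));
  if (!match) return null;
  const text = match[1].trim();
  const cdata = text.match(/^<!\[CDATA\[([\s\S]*?)\]\]>$/);
  return cdata ? cdata[1] : decodeEntities(text);
}

// Return the title and link of every <item> in an RSS 2.0 document.
function listItems(rssXml) {
  const items = [];
  const itemPattern = /<item\b[^>]*>([\s\S]*?)<\/item>/g;
  for (const match of rssXml.matchAll(itemPattern)) {
    items.push({
      title: readElement(match[1], 'title'),
      link: readElement(match[1], 'link'),
    });
  }
  return items;
}
```

Each returned `link` is the article URL that would then be fetched and searched for the feed's configured node.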

The articles are formatted as mail files and dropped into a maildir structure.
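As an illustration of what dropping a message into a maildir involves, here is a minimal sketch using only Node.js built-ins. It follows the usual Maildir convention (write the file under `tmp/`, then rename it into `new/`), which may differ in detail from what feedscraper.js does. The functions `formatMessage` and `deliver` and the shape of the `article` object are assumptions made for this example, and the header handling is simplified (no encoding of non-ASCII subjects).

```javascript
const fs = require('fs');
const os = require('os');
const path = require('path');

let deliveryCounter = 0; // keeps file names unique within one process

// Build a simple HTML mail from a hypothetical article object
// ({ title, date, html }) and the "email" section of the configuration.
function formatMessage(article, email) {
  const headers = [
    `From: ${email.from}`,
    `To: ${email.to}`,
    `Subject: ${article.title}`,
    `Date: ${article.date.toUTCString()}`,
    'MIME-Version: 1.0',
    'Content-Type: text/html; charset=utf-8',
  ];
  return headers.join('\r\n') + '\r\n\r\n' + article.html + '\r\n';
}

// Write the message to <maildir>/tmp, then rename it into <maildir>/new so
// that mail readers never see a partially written file.
function deliver(maildir, message) {
  for (const sub of ['tmp', 'new', 'cur']) {
    fs.mkdirSync(path.join(maildir, sub), { recursive: true });
  }
  // "/" and ":" are not allowed in Maildir file names; escape them in the host part.
  const host = os.hostname().replace(/\//g, '\\057').replace(/:/g, '\\072');
  const seconds = Math.floor(Date.now() / 1000);
  const name = `${seconds}.${process.pid}_${deliveryCounter++}.${host}`;
  const tmpPath = path.join(maildir, 'tmp', name);
  const newPath = path.join(maildir, 'new', name);
  fs.writeFileSync(tmpPath, message);
  fs.renameSync(tmpPath, newPath);
  return newPath;
}
```

Calling `deliver('Maildir/.RSS/', formatMessage(article, config.email))` would create the three subdirectories if needed and return the path of the new file under `new/`.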

Configuration

The configuration is stored in ~/.feedscraper.json:

{
	"email": {
		"from": "rss@pipoprods.org",
		"to": "rss@pipoprods.org"
	},
	"maildir": {
		"directory": "Maildir/.RSS/"
	},
	"user_agent": "Mozilla/5.0 (X11; Linux i686; rv:24.0) Gecko/20140723 Firefox/24.0 Iceweasel/24.7.0",
	"feeds": [
		{
			"title": "Bits from Debian",
			"url": "https://bits.debian.org/feeds/feed.rss",
			"node": "div.article",
			"tags": "Software, Debian",
			"folder": "Software"
		},
		{
			"title": "xkcd",
			"url": "http://www.xkcd.com/rss.xml",
			"node": "#comic",
			"tags": "Humor",
			"folder": "Humor"
		}
	]
}
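
A file like the one above could be loaded and sanity-checked as follows. This is a sketch under assumptions, not the tool's actual loader: `loadConfig` and `validateConfig` are hypothetical names, and the choice of which keys to treat as required (`email.from`, `email.to`, `maildir.directory`, and `title`, `url`, `node` for each feed) is a guess based on the example, not something the tool is known to enforce.

```javascript
const fs = require('fs');
const os = require('os');
const path = require('path');

// Return a list of human-readable problems; an empty list means the
// configuration passed these (assumed) checks.
function validateConfig(config) {
  const problems = [];
  const isText = (value) => typeof value === 'string' && value.length > 0;
  if (!isText(config.email && config.email.from)) problems.push('email.from is missing');
  if (!isText(config.email && config.email.to)) problems.push('email.to is missing');
  if (!isText(config.maildir && config.maildir.directory)) {
    problems.push('maildir.directory is missing');
  }
  if (!Array.isArray(config.feeds)) {
    problems.push('feeds must be an array');
    return problems;
  }
  config.feeds.forEach((feed, index) => {
    for (const key of ['title', 'url', 'node']) {
      if (!isText(feed[key])) problems.push(`feeds[${index}].${key} is missing`);
    }
  });
  return problems;
}

// Read and parse the configuration, by default from ~/.feedscraper.json.
function loadConfig(file = path.join(os.homedir(), '.feedscraper.json')) {
  const config = JSON.parse(fs.readFileSync(file, 'utf8'));
  const problems = validateConfig(config);
  if (problems.length > 0) {
    throw new Error(`Invalid configuration in ${file}: ${problems.join('; ')}`);
  }
  return config;
}
```

Checking the file up front means a typo in a key name is reported once at startup instead of surfacing later as a failed fetch or an empty mail.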