document updated 19 years ago, on Mar 8, 2005
David Newcum's RSS feeds
All RSS feeds are at least RSS v1.0 and include a date/time to support chronologically-sorted aggregators. Where explicit posting times aren't available, the time that the script first reads the news item is used. For info on why the feed was created, see the comments at the top of each script.
- andrewsullivan.rss (perl source) (updated every 20 minutes)
No official or third-party rss.
dailyrotten.rss (perl source)
Rss feed no longer available. Daily Rotten would prefer that everyone download the script I use and run it themselves, decreasing my bandwidth bills and increasing Rotten's. If you need any help whatsoever in installing the program, please don't hesitate to email me at interiot21@paperlined.org. Similar RSS feeds continue to be available at other sites, though my feed is the only one that contains full descriptions and links.
- dilbert.rss (perl source) (updated every hour)
Third-party ones available, but this one adds a first-seen date. I later concluded that this functionality should be built into the aggregator, not individual scrapers, but I don't want to throw away code.
- edmunds_headlines.rss (perl source) (updated every 2 hours during the workday)
No official or third-party rss.
- fark_photoshop.rss (perl source) (updated every 10 minutes)
Differs from the official feed in that it only includes the photoshop items, it includes the starting photo in the story's description, and if you click on the photo, you're taken to the "view voting results" version of the thread so you see the photos ranked by popularity right away.
- fuckedcompany.rss (perl source) (updated every 20 minutes)
No official rss. Compared to some 3rd party feeds, it includes the when/company/severity/points part, though a very similar feed can be found here.
- instapundit.rss (perl source) (updated every 20 minutes)
Official rss is truncted and lacks HTML markup. No third-party RSS found.
- kottke.rss (perl source) (updated every 30 minutes)
Official rss is truncted and lacks HTML markup.
- pennyarcade.rss (perl source) (updated every 20 minutes)
Official rss doesn't include <img ...> tag in the description. Third-party rss didn't use to give the proper link? Hrm, actually, it doesn't look too different. (well, my image can be clicked on, but... no other difference?). Mine has a date, but that's not really beneficial unless the post has an original date on it.
- politech.rss (perl source) (updated every 20 minutes)
- slashdot.rss (perl source) (updated every 10 minutes) (if this gets hammered, it will be removed from public use. Please download the source and generate your own personal output.)
Slashdot is really pissy about its .rss getting hammered due to automatic monitoring of it, so we'll hammer its /index.html instead because it's unmonitored.
Haha, it had to happen eventually, they blacklisted the IP of the scraper. If you want to run this program on your own server (possibly at a slightly less frequency than 10 minutes :), go ahead.
Email rss_feeds@paperlined.org for small change requests or patches, or amicable requests to use less bandwidth.
Other keywords: XML, RDF