As a gift to the community of content curators and RSS addicts, here is a simple script to generate a daily summary of all indie blogs visible in indieblog.page, because their RSS feeds only expose a few random posts while my FOMO obligates me to try to get them all π .
The script that resulted from my obsession is half cooked thanks to some LLM and is adapted to my needs so I'm excluding blog posts based on language or keywords in the title, feel free to adapt to your needs. So I encourage you to adapt the script to your need. And if not, just subscribe to the feed URL below.
I'm addicted to Changedetection for spying on website changes and internet search results for specific keywords, Occasionally also for monitoring price changes. It's quite handy to discover new links added to web directories, or stay updated with some websites that do not provide any RSS feed.
Context
I'm watching hundred of URLs.
I often spy on webrings and blogrolls to discover new interesting links, and also on search engines results for specific keywords.
I'm self-hosting Changedetection through Cloudron.
I'm mostly following through those watches via my RSS Reader, Miniflux.
For some specific changes, like weather bad conditions, I subscribe via ntfy.
Anyway, I've developed a few habits that fit my workflow so well for every new watch, which are:
Settings > General
This is where we set defaults for all future watches, it's pretty obvious you must start here. Here is my current setting:
Time between check: By forcing a convenient interval between checks, you try to find a balance between information overload and staying current. Pick your poison, but don't hesitate to override this setting at per-watch level.
Extract from document and use as watch title: it's convenient to let Changedetection take care of naming your watches based on the webpage titles rather than leaving the sometimes very long and non human-friendly URL as a default description.
Random jitter: this is handy to avoid stressing your I/O too much.
General > Group tag
This one is mostly for better organizing stuff, as I mentioned I follow those changes through RSS, I noticed it was harder to distinct between important and less important stuff because I was following the default RSS feed, but Changedetection provides distinct RSS feeds per groups/tabs of watches, and that's my preferred workflow now.
I'm trying to always set a label, I have around 15 in total, some for specific interests (privacy, discovery aka list of links, devops, music, ...) or specific people, locations and business updates. The rest is generally less important and is labelled with things like FOMO, misc, ...
Those group tags appear as labels next to the URLs you are watching.
If you want to watch a whole group through RSS, link is at the bottom right of the page on the group tab.
Filters & Triggers > Remove elements
It's common on bloated rich web pages to want to focus on specific parts, like everything between <header> and <footer> sections, so I sometimes have to add footer and header. It's mostly needed for sites like eBay, 2ememain, where we can buy and sell things.
Filters & Triggers > default filter and triggers
This is purely for spam reduction as I mostly want to know when something new is made.
Sometimes I also enable Sort text alphabetically depending how the page is updated by its author.
π Those new settings have been added recently and I'm also enabling them on new watches:
Extension
Try the web browser extension for Chromium based browsers, it makes watches one-click away.
Next
I've opened a discussion in Changedetection's repository to talk about how repetitive it feels to me, in the hope we can see something like template settings be proposed in the future, at least for the filters & triggers which I consider is not too hard to start with.
I'm following Joy of Tech comic via RSS in Miniflux but the image was never loading.
I found half a solution on this blog post of Jan-Lukas Else, unfortunately the proposed solution fails probably as a consequence of some changes in the format of Joy of Tech pages.
The fix is quite simple actually. Edit the feed settings, set the scraper rules to the following:
p.Maintext > img[src$=".png"]
And of course enable "Fetch original content" in the feed options.
I'm avid of content curation using RSS feeds. Let me share some of my tips here and some code. This is a living document so please come back for new tips π and explore my other articles on this topic.
Some of those tips rely on Userscripts which are snippets of code executed automatically on web pages, and usually it's very handy to customize your navigation. I'm using the Custom JavaScript block in Miniflux Settings. But some scripts won't work because of reliance on external resources, and in that case I'm using Tampermonkey for special cases that require loading external resources (think CSP & co).
Filter categories (remove empty ones) using Custom JavaScript block
There is by default no distinction between categories with or without content, and it can be annoying. I made a user script to remove categories with no content to read.
Before applying the script, we have some categories, including one with (0) unread entries.
After
The category with (0) unread entries is hidden.
Feed organizer - using Tampermonkey
This one is for grouping together all feed entries by feed/author in the main on unread, read, and starred pages. I needed this one because by default, in unread tab, the feed entries are mixed all together and I often wanna consume content per feed/author and not in chronological order.
Distinct boring from interesting feeds thanks to objective ranking - with Custom JavaScript in global settings
When opening the "Show all entries" view of a feed, this trick will show you if you shall keep this feed or not. The classification is based on the ratio of starred entries vs total. In this case, clearly, my assistant tells me it's quite π₯± boring. Other values are: Thinking π (in case we lack data), Interesting π (we star a lot of items), Thinking π€ (in case we stared at least some entries). Feel free to make it yours and customize the behavior!
This is a trick that works well with the majority of feeds so you can fetch the whole article content in your reader instead of just the excerpt.
Filter feed entries by title / content
I've customize the feed settings to exclude specific keywords, and on top of this I've also global rules which apply to all feeds, for excluding feed entries when keywords are found in their content or title. This makes it easy to exclude clickbait uninteresting or depressing content π
In this case I follow news with heavy metal album releases and I'm excluding specific genre like Death Metal. I'm also abusing the feature to avoid being spammed with recurrent news like Olympic games (Paris 2024). Finally there are already many reasons for me to be anxious, and I do not need more. The last rule saves me from the useless negative news. I keep fine tuning the list and I could improve this by including terms from public blacklists, like this.
Entry sorting - Application settings
This is a setting that helps well to decrease the FOMO-scrolling, by ensuring the top of your unread list stays the same. So it is very simple, sort by Older entries first! Easy.
I went up with this workflow that seem to pass the test of time and I'm sharing it with you :
I manage to keep up with Inbox zero, almost. The only mails I keep are mostly invoices/accounting related and require sometimes discussion with accountant and my partner. I try anyway to snooze them for later and schedule them as tasks.
I'm not subscribing to any newsletter, everything is read through MiniFlux (RSS curator. If you don't know about that, read -> what is RSS?), combined with RSS-Bridge and ChangeDetection and some tips, all self-hosted. For any newsletter that cannot be replaced with RSS feed, I rely on my hero https://kill-the-newsletter.com/. And if email has your preference over RSS, I recommend you BlogToTrottr to follow RSS/Atom in real-time by email.
I edit my Feed titles with emojis expressing how I feel about their interest : π (Boring?) π (Joyful read) etc.
I categorize feeds and label them also with emojis and sort them from the best to the worst. Those visuals clues really help. When I see nothing interesting for a while in any RSS I've subscribed to, I remove it from Miniflux.
I dreaming to keep a maximum of 20/30 RSS feeds of interest but reality is I have hundred. Among those, I find maybe a dozen to be absolutely fantastic and I'm even sharing them in my /links section.
I've configured Tampermonkey browser extension to take control of the rendering of my RSS feeds list and replace the whole page with "FOCUS", at least 80% of time.
Something too long to read but that looks interesting is immediately dropped from Miniflux and shared/saved into Shaarli for later read. The goal is to declutter my subscriptions inbox.
Articles I browser randomly and are already saved in Shaarli are saved with tags like "x2" if it's second time I save them, "x3" if it's the third save I save them, etc.
If I find any image or PDF of interest, I save them locally to my Dropbox folder of interest. I have a folder for Books, which is subject to automatic triage with some scripting, also my ePubs are automatically converted to PDF. Duplicate files are moved to subfolders like "x2" if it's second time I save the same book, "x3" if it's third time, etc.
Everything in Shaarli is manually labelled based on keywords because Shaarli does not support automatic labelling like Wallabag. I don't want to waste anytime adding manual labels so I'm likely about to integrate similar features either as a plugin for Shaarli, either in Tapas which is my new project I elaborate in Ideas.
I subscribe to only 1 podcast which is Bombo Podcast, the one of my partner.
I want to take time for whatever is worth reading, and skip the rest fast.