You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The size of my data base is growing quite high already. Since I expect to run the bot for decades, I would like to have a way to clean old articles (maybe only those without a match).
Options:
add an CLI argument to only clean the DB
add an CLI argument to run the Bot but also clean the DB
add an CLI argument to run the bot but also with a probability p (e.g. 0.01) clean the DB (my favorite)
To find old articles, you have to check for the last date in the feed and then remove everything older than that.
The text was updated successfully, but these errors were encountered:
Oooh, that's fun. You're matching against a much larger list of sources than I am, so I'd put this off for a bit because my database isn't growing so fast. Let me think on these approaches. My initial hunch is that you could probably just dump a set number of old (and maybe unmatched) articles once the database grows past a certain size, but I'd want that to be at least sort of transparent, and maybe configurable.
(My colleague has suggested a use case where we produce some kind of metrics out of the database, so I want to be able to retain stuff if necessary.)
The size of my data base is growing quite high already. Since I expect to run the bot for decades, I would like to have a way to clean old articles (maybe only those without a match).
Options:
To find old articles, you have to check for the last date in the feed and then remove everything older than that.
The text was updated successfully, but these errors were encountered: