I can't seem to get timely post federation to my instance (moist.catsweat.com) from lemmy.world. I might see a new post every few hours instead of every minute or so.
posts from other lemmy instances (lemmy.ca, lemmy.ml, dbzer0, etc) have no problem.
i see a ton of lemmy.world traffic in my nginx logs… the delay seems very specific to ‘new posts’. votes/comments seem to come through no problem once the post shows up.
i've checked a few other instances similar to mine, and they are not having this issue. so it seems to be something specific to lemmy.world and my instance.
i do recall a similar issue several months back, and it involved a lemmy.world admin resetting some outbound queue.
any ideas?
I’ll probably have to look at a storage backend other than prometheus; aiui it’s not really well suited for this task.
maybe something with influxdb+telegraf, although i haven’t looked at that yet.
Thanks for the attention to this! If I can help in any way just say so.
do you happen to have experience with setting up influxdb and telegraf? or maybe something else that might be better suited?
the metrics are currently in prometheus metrics format and scraped every 5 minutes.
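for anyone who wants to poke at the scraped data outside of prometheus, here is a minimal sketch of pulling one metric out of a text exposition payload with plain Python. the label name `remote_instance` and the sample values in the usage note are made up for illustration, the real exporter may label things differently. label values are returned raw (escape sequences are not decoded), and timestamps after the value are ignored.

```python
import re

# One sample line: metric name, optional {labels}, then the value.
# Anything after the value (an optional timestamp) is ignored.
SAMPLE_RE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(?:\{(.*)\})?\s+(\S+)')
# One label pair inside the braces: name="value" (escapes are kept as-is).
LABEL_RE = re.compile(r'([A-Za-z_][A-Za-z0-9_]*)="((?:[^"\\]|\\.)*)"')


def parse_samples(text, metric_name):
    """Return a list of (labels_dict, value) for one metric in a scrape payload."""
    samples = []
    for line in text.splitlines():
        line = line.strip()
        # Skip blank lines and "# HELP" / "# TYPE" comment lines.
        if not line or line.startswith("#"):
            continue
        match = SAMPLE_RE.match(line)
        if match is None or match.group(1) != metric_name:
            continue
        labels = dict(LABEL_RE.findall(match.group(2) or ""))
        samples.append((labels, float(match.group(3))))
    return samples
```

usage would look something like `parse_samples(scrape_body, "lemmy_federation_state_last_successful_id_local")`, where `scrape_body` is whatever text the metrics endpoint returned.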
my idea was to keep the current retention for most metrics and have longer retention (possibly with lower granularity for data older than a month).
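to make the retention idea concrete, here is a rough sketch of the thinning step in plain Python, independent of whichever database ends up storing it. the 30 day cutoff and 1 hour bucket are placeholder values, and keeping the last sample per bucket is an assumption that fits a value that only moves forward (like a last-successful id); for other metrics an average or max might make more sense.

```python
from datetime import datetime, timedelta, timezone


def downsample(samples, now, full_res_age=timedelta(days=30),
               bucket=timedelta(hours=1)):
    """Thin out old samples while keeping recent ones at full resolution.

    samples: iterable of (timestamp, value) with timezone-aware datetimes.
    Samples newer than `full_res_age` are kept untouched; older samples are
    reduced to the last sample seen in each `bucket`-sized window.
    """
    cutoff = now - full_res_age
    bucket_seconds = bucket.total_seconds()
    recent = []
    last_in_bucket = {}
    for ts, value in sorted(samples):
        if ts >= cutoff:
            recent.append((ts, value))
        else:
            key = int(ts.timestamp() // bucket_seconds)
            # Samples are sorted, so a later one in the same bucket overwrites.
            last_in_bucket[key] = (ts, value)
    old = [last_in_bucket[key] for key in sorted(last_in_bucket)]
    return old + recent
```

with a 5 minute scrape interval this turns 12 samples per hour into 1 for anything older than the cutoff.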
the current prometheus setup is super simple, you can see (an older copy of) the config here.
if you want to build a configuration for influxdb/telegraf that i can more or less just drop in there without too many adjustments that would certainly be welcomed.
the metric that would need longer retention is lemmy_federation_state_last_successful_id_local.
I don’t have experience, but I can make a project out of it nonetheless!