Post mortem on Mastodon outage with 30k users

community.hachyderm.io

84 points by kris-nova 3 years ago · 103 comments

CommanderData 3 years ago

Quite beefy hardware for on-prem. Perhaps someone could explain to me why 30k users, even assuming they were all concurrent, would be an issue for hardware that size?

Is the app stack naturally resource-heavy, or is this setup particularly different from how an instance should be configured?

  • tedivm 3 years ago

    There are two problems that come up with scaling-

    1. Those 30k users are following users on other servers, which pulls in the content of those users. I'm on hachyderm but I would guess only about 20% of the people I follow are. That means my user is pulling roughly 250 other users' data into the system. Of course most, if not all, of the people I follow are being followed by someone else, so it's not a pure multiplier. At the same time it does mean there's a lot more data moving in.

    2. NFS, which is where they had the problems, was being used as a media store. People on mastodon and twitter like sharing images and other media. Even people who run single user nodes but follow a lot of people end up using a ton of storage space. 30k people scrolling through timelines and actively pulling that data out, while queues are pushing data in, can be tough to scale. Switching to an object store really helped fix that.

    On top of that, the Mastodon app is very, very Sidekiq heavy. For those not familiar with Ruby, Sidekiq is basically a queue workload system (similar to Python's celery); you scale up by having more queue workers running (a minimal sketch follows at the end of this comment). The problem with NFS is that all of those queues are sharing the filesystem, making the filesystem a point of failure and scaling pain. Adding more queue workers makes the problem worse by adding to the filesystem load, rather than resolving the problems. Switching to an object store helps until the next centralized service (in this case Postgres) reaches its limits.

    So basically the 30k users each following their own set of users creates a multiplier on how many users the instance is actually working with. The more users on either side of the equation the more work that needs to be done. If this was a 30k user forum where every user existed on the instance the load would be significantly less.
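
    For those not familiar with Sidekiq, here is a minimal worker sketch to illustrate the queue-worker model described above. This is illustrative only - not Mastodon's actual code; the class name and fetch logic are invented, and only the 'pull'/'push' queue names follow Mastodon's convention.

    # Illustrative sketch only - not Mastodon's real worker code.
    require 'sidekiq'
    require 'net/http'
    require 'json'

    class FetchRemoteStatusWorker
      include Sidekiq::Worker
      sidekiq_options queue: 'pull', retry: 5

      def perform(remote_status_url)
        # Each job hits the network and then writes to the database / media store,
        # which is why adding workers also adds load on whatever storage they share.
        body = Net::HTTP.get(URI(remote_status_url))
        status = JSON.parse(body)
        # ... persist the status, enqueue media downloads, etc.
      end
    end

    # Scaling out is just running more worker processes, e.g.:
    #   bundle exec sidekiq -q pull -q push -c 25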

    • throw0101c 3 years ago

      > The problem with NFS is that all of those queues are sharing the filesystem, making the filesystem a point of failure and scaling pain.

      It is not NFS that is a SPOF, it is a single NFS server that is a SPOF. There exist distributed NFS systems (OneFS, Panasas) that can tolerate the loss of up to N servers before the service gets disrupted.

      • evankanderson 3 years ago

        I suspect distributed NFS won't help here when the problem is that a server gets slow / overloaded. In particular, this setup was actually I/O bound and there wasn't more I/O to be had.

        • throw0101c 3 years ago

          > In particular, this setup was actually I/O bound and there wasn't more I/O to be had.

          No more I/O to be had from that particular NFS server. If your mounts are distributed over 4+ servers, then you potentially have 4x the operations available.

  • bbu 3 years ago

    > Perhaps someone could explain to me why 30k users, even assuming they were all concurrent, would be an issue for hardware that size?

    The main problem was slow I/O because of faulty disks, which brought everything to a crawl.

  • denysvitali 3 years ago

    Ruby on Rails... On top of that, the federation part is basically ingestion from all the federated servers across the world, so PostgreSQL sees constant writes even when the instance's own users aren't posting anything.

    • bioemerl 3 years ago

      I'm absolutely amazed that people are going with federation of this form, with this massive overhead, instead of something like RSS beefed up into a standard, where you can just point your client at multiple web servers, have them use your server as an identity provider, and that's it.

      Imagine your subscriptions and account living on one server, then when you log in that server gives you the list and your client goes and gets all the data.

      We already had this sort of federation figured out, it's the open web. We just have to find a way to get the open web to provide the things that google, facebook, reddit, provided.

      Easy way to contribute content.

      Discovery for new content.

      Search ability.

      Kill the things centralized websites provided, let people host websites within that system, and let the clients handle dealing with the fact that there are all sorts of providers out there.

      • zimpenfish 3 years ago

        > that server gives you the list and your client goes and gets all the data.

        That would suck on a mobile device with limited bandwidth and/or data. Also, lots of repetitive fetching when people have a large number of followers, no?

        • CharlesW 3 years ago

          > That would suck on a mobile device with limited bandwidth and/or data.

          Why would it be any different than using a mobile feed reader to follow hundreds of blogs, or using a podcast app to follow hundreds of podcasts? Both are commonly done on mobile. Checking whether a feed has been modified since the last time the client checked costs virtually nothing.

          bioemerl: I've been casually digging into Mastodon and ActivityPub for the last few weeks and FWIW I think you're absolutely right — with the caveat that I may be missing something obvious, it seems very dumb for Mastodon/ActivityPub servers to be downloading and delivering content on behalf of client apps.

          • zimpenfish 3 years ago

            > Why would it be any different than using a mobile feed reader to follow hundreds of blogs

            In my experience, you generally do that through an aggregator which has already cached the articles for you and you're doing a bulk fetch from a single host - that is quite different to calling out to hundreds of individual hosts and fetching a page from each.

            > it seems very dumb for Mastodon/ActivityPub servers to be downloading and delivering content on behalf of client apps.

            Isn't that literally what an RSS aggregator does?

            • CharlesW 3 years ago

              > In my experience, you generally do that through an aggregator…

              My understanding is that a cloud-based aggregator (like Feedly) delivers feed and state information to clients, but not the content itself.

              To test this with blogs, I did a new install of Reeder, synced with Feedly, then turned off Wi-Fi. In my subscriptions, I got everything that would be in the feeds themselves (notably item titles and descriptions as created by publishers) but nothing beyond that. The offline experience was mostly useless, suggesting that the client does most of the heavy-lifting even when leveraging cloud-based aggregators.

              So is making a few REST API calls a significant savings over checking a couple hundred (or whatever) RSS feeds? With "If-Modified-Since" checks being so cheap, I'm not sure that inserting Mastodon instances as middleboxen makes sense. If all Mastodon did was store subscriptions and state info, it seems like we'd have a far more resilient microblogging ecosystem.
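
              To make the conditional-GET point concrete, here is a rough Ruby sketch (the feed URL and the stored validators are made-up examples) of how cheap an If-Modified-Since / If-None-Match check is compared with re-downloading a feed:

              # Rough sketch of a conditional feed poll; URL and validators are examples.
              require 'net/http'
              require 'uri'

              uri = URI('https://example.com/feed.xml')
              req = Net::HTTP::Get.new(uri)
              req['If-Modified-Since'] = 'Sat, 01 Jan 2022 00:00:00 GMT'  # saved from the last fetch
              req['If-None-Match']     = '"abc123"'                       # saved ETag, likewise

              res = Net::HTTP.start(uri.host, uri.port, use_ssl: true) { |http| http.request(req) }

              if res.is_a?(Net::HTTPNotModified)
                puts 'feed unchanged - a 304 with no body, a few hundred bytes on the wire'
              else
                puts "feed changed - downloaded #{res.body.bytesize} bytes"
              end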

              • zimpenfish 3 years ago

                > My understanding is that a cloud-based aggregator (like Feedly) delivers feed and state information to clients, but not the content itself.

                Depends entirely on the feed. Just checked my most recent 100 entries from Feedbin - that comes back with 23k of `summary` and 508k of `content`. That was one call to return 661k of JSON covering 28 feeds.

                > So is making a few REST API calls a significant savings over checking a couple hundred (or whatever) RSS feeds?

                I've made one call to get (assuming >2k in `content` is a full article) 74 full articles from 28 feeds. Add another 26 requests for the entries without full content and I'm looking at 27 total.

                Even just retrieving the RSS feeds directly would be 28 requests, plus 26 more for the non-included articles, for a total of 54 - double what I had to do via Feedbin. That's before we start considering response times and latency for each of those sites, which would drag things down even further.

                There's a reason why people use RSS aggregators!

        • bioemerl 3 years ago

          Is it really that big of a deal? We've advanced like 15 years since we were able to handle Twitter on our devices; why can't we handle many feeds from multiple servers?

        • ilyt 3 years ago

          That was solved with RSS readers.

          It would be akin to having a single-user Mastodon instance.

          • bioemerl 3 years ago

            RSS seems like a pretty bad standard that doesn't let you do a ton, and it doesn't solve problems like discoverability or your ability to post content.

            It's great for reading from lots of news sites at the same time, but it's clear that it hasn't taken off, and when that happens it's normally for a pretty good reason.

            • ilyt 3 years ago

              I think they are referring to the way it works, not the technical details of the implementation; it isn't really a "social network", since there's no ability to take any social action.

              As in, you can subscribe to whoever you want and no moderator will stop you, and then you can... just comment on a blog post or article, with no 3rd party involved aside from that particular space's moderators.

          • zimpenfish 3 years ago

            > That was solved with RSS readers.

            Wasn't it solved by moving to RSS aggregators and doing a single bulk fetch of your cached N blogs rather than calling out individually to N blogs?

      • bombcar 3 years ago

        That works best when you have few clients and lots of “sources”, but Mastodon is apparently designed for few sources and lots of clients - in which case updating the “server” once could be more performant.

    • CommanderData 3 years ago

      I hate to be that person, but I've seen complicated dynamic applications push much higher bandwidth and serve millions of concurrent users with similar if not smaller h/w requirements.

      It would be interesting to see Twitter's complete backend, and while Mastodon might not be apples to apples, I'd also be interested in a cost-per-user to infrastructure analysis.

      • lzooz 3 years ago

        The big difference being that Twitter uses Java, Scala, etc. Twitter used to use RoR too, and it went down literally every day. I'm talking 2012 or so, I think - bad memory haver here.

        • vidarh 3 years ago

          Twitter's primary problem was that they had not built a system designed to shard - not Rails. They'd have needed a rewrite no matter which framework they'd started with.

          I have no love for Rails, but blaming it for Twitter's old problems is not fair.

          That said, Mastodon has much of the same problem, and is only "saved" by the combination of federation and ten years of hardware advances. Thankfully, the federation means there's plenty of opportunity for people to experiment with other implementations of ActivityPub (or even implementations of the full Mastodon API), or fixes to it.

    • zimpenfish 3 years ago

      Looking at the number of requests my 5-user (following ~60 people in total), 90% idle Pleroma instance handles is a bit mad - 60-90 requests a minute.

      I dread to think how much a busy 30k user instance does.

      • throwawaymaths 3 years ago

        Pleroma also isn't RoR and is really designed to scale out better

        • zimpenfish 3 years ago

          > Pleroma also isn't RoR and is really designed to scale out better

          Sure ... but that's not what I was talking about. It's still talking ActivityPub, and that seems to me, from my low-use, few-user instance, to be an extremely chatty protocol. I don't know if the ActivityPub traffic scales linearly with users, but it would be a not inconsiderable number for 30k users.

          • throwawaymaths 3 years ago

            Exactly. We're agreeing! That's why I'm not surprised that the Mastodon server didn't do so well and that you might get more mileage out of your compute.

        • Throwawayaerlei 3 years ago

          My understanding is that Pleroma was first and foremost designed to have a small memory footprint for small instances. As an Elixir program running on the BEAM (Erlang) runtime it ought to scale a lot better, but serious work had to be put into that. Not sure it's running on any sites as big as the big Mastodon ones.

  • Kye 3 years ago

    Bad disks.

rubiquity 3 years ago

Not sure why Ruby on Rails is taking a beating in this comments section. The problem is clearly the 1Gbps network that is functioning at only 200Mbps and the worn-out/defective SSDs. Waiting around on I/O all day will bring any stack to a crawl.

  • vidarh 3 years ago

    There are a number of people here who'll complain about Rails whenever a Rails app is brought up, whether or not it has anything to do with the actual problem.

yoz 3 years ago

This is a superb write-up of an intense, exhausting situation. Great mixture of low-level detail and tactics, and high-level thinking about systems and people. Congratulations on managing that migration, and thank you for sharing this with us!

dboreham 3 years ago

Confused as to why they didn't just replace the bad SSDs with good ones?

Fwiw this sounds to me like what happens when you use "retail" SSDs (drives marketed for use in user laptops) underneath a high-write-traffic application such as a relational database. Often such drives will either wear out, or turn out to have pathological performance characteristics (they do something akin to GC eventually), or they just have firmware bugs. Use enterprise-rated drives for an application like this.

  • kris-novaOP 3 years ago

    Hi, I made the decision not to replace the drives. I also wrote the article, and am the admin of Hachyderm.

    So to be clear, we did try to "offline" a drive from the ZFS pool just to see if this was a viable path. The ZFS pool was set up a few years ago and has gone through a few iterations of disks. The mirrors were unbalanced. We had pairs of drives of one manufacturer/speed mirrored with pairs of drives from another manufacturer/speed. We know this configuration was wrong, again we didn't intend for our little home lab to turn into a small production service.

    I think after spending a few hours trying to "offline" the disk, and then repairing the already brittle ZFS configuration to get the database/media store back to a "really broken and slow but still technically working" state, we just decided to pull the plug and move to Hetzner. Offlining the disk caused even more cascading failures and took about 30 minutes just for the software. We could technically have shut down production to try it without the database running on it, but at that point we decided to just get out of the basement.

    If it had been as easy as popping a disk in/out of the R630 (like one would imagine), we would certainly have done that.

    To be honest I am still very interested in performing more analysis on ZFS on a 6.0.8 Linux kernel. I am not convinced ZFS didn't have more to do with our problems than we think. I will likely do a follow up article on benchmarking the old disks with and without ZFS in the future.

    zfs-2.1.4-1 zfs-kmod-2.1.6-1 6.0.8-arch1-1

    • rbanffy 3 years ago

      > We had pairs of drives of one manufacturer/speed mirrored with pairs of drives from another manufacturer/speed.

      The different speed is an issue, but I always recommend mixing pairs so that you don't end up like me, when all the spinning metal in the same RAID-5 array failed within a short period. Wasn't a great day.

      Lucky me I had a contingency plan.

    • ilyt 3 years ago

      Throw ZFS away, put in X drives, make RAID10+LVM with X-1 drives (Linux supports odd numbers in RAID10), never think about it again. It's simple to set up, simple to debug, and you don't need a ZFS expert for something as simple as a disk replacement. In cases like what happened here, there is a --write-mostly option that tells Linux RAID to prefer the other disks for reads, so you can see whether unloading the drive changes anything. Maybe RAID6 if you're not screaming for performance but want some more space.

      Focus your efforts on making robust backups instead. You don't want to be that only guy in org who knows how to do ZFS things when it breaks.

      We're running a few racks of servers; ZFS is delegated to big boxes of spinning rust where its benefits (deduplication/compression) are used well, but on a bunch of SSDs it is just overkill.

  • ilyt 3 years ago

    Then you will have the same problems, but now you can bother the manufacturer about them!

    Also, unless there is something horribly wrong with how often data is written, that SSD should run for ages.

    We ran (for a test) consumer SSDs in a busy ES cluster and they still lasted like 2 years just fine.

    The whole setup was a bit overcomplicated too. RAID10 with 5+1 or 7+1 (yes, Linux can do 7-drive RAID10) with a hot spare would've been entirely fine, easier, and most likely faster. You need backups anyway, so ZFS doesn't give you much here, just extra CPU usage.

    Either way, monitor wait time per drive (an easy way is to just plug collectd [1] into your monitoring stack; it is light and can monitor a ton of different metrics).

    * [1] https://collectd.org/

  • Dynamicd 3 years ago

    It costs money.

    Remember this isn't a company: it's hobbyists/enthusiasts putting their own resources into something, or running with donations when available. There's no venture capital to absorb operating losses here. Remember the old "storm/norm/conform/perform" analogy. We are still very much pushing along into norm territory, and articles like this will help establish a conform phase ... but it will take time.

  • zufallsheld 3 years ago

    > Confused as to why they didn't just replace the bad SSDs with good ones?

    Probably because they wanted to migrate to Hetzner anyway and took the chance to do it now instead of later.

    But I do agree that it would probably have been a better idea.

lima 3 years ago

Hetzner is great, but it may not be the best choice for a social network that hosts user content and may attract controversy.

As a mass-market hosting provider, Hetzner is subject to constant fraud, abuse and hacked customer servers, and in consequence, their abuse department is very trigger-happy and will usually shoot first and ask questions later. They can and will kick out customers that cause too much of a headache, regardless of their ToS.

Their outbound DDoS detection systems are very sensitive and prone to false positives, such as when you get attacked yourself and the TCP backscatter is considered a portscan. If the system is sufficiently confident that you are doing something bad, it automatically firewalls off your servers until you explain yourself.

Likewise, inbound abuse reports sometimes lead to automated or manual blocks before you can respond to them.

They also rate-limited or blocked entire port ranges in the past to get rid of Chia miners and similar miscreants, with no regard for collateral damage to other services and without informing their other customers.

Their pricing is good and service is otherwise excellent, and if you do get locked out, you can talk to actual humans to sort it out. But, only after the damage is already done. If you use them, have a backup plan.

  • Kye 3 years ago

    The main project instances run on Hetzner. It seems like it's either not an issue or Hetzner's anti-abuse systems are able to tell the difference.

asim 3 years ago

As someone who scaled Ruby on Rails in the prime era, 2007-2009, I'll tell you the problems have not changed. It's very straightforward horizontal scaling followed by load balancing across multiple nodes. Load comes down to having enough cores, fast enough disks, and enough egress bandwidth. Everything else is purely caching in front of a poorly performing Ruby web server and minimising disk or database reads (a small example follows at the end of this comment).

The write up is cool. Reminiscent of things we used to do back in that early rails 2-3 era. Just funny we're back where we started.

TLDR: if you want to run Ruby on Rails on bare metal, be ready to run something with 8+ cores, 10k RPM disks minimum, and more bandwidth than you can support out of your basement.
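
On the caching point above, a minimal read-through caching sketch in Rails - the controller, model, and cache key here are invented examples of the pattern, not anything from Mastodon:

# Minimal read-through caching sketch; 'Status', the controller, and the key are invented.
class TimelineController < ApplicationController
  def show
    # Serve the rendered timeline from the cache store (Redis/memcached) and only
    # hit the database when the entry is missing or has expired.
    @statuses = Rails.cache.fetch("timeline/#{params[:id]}", expires_in: 30.seconds) do
      Status.where(account_id: params[:id]).order(created_at: :desc).limit(40).to_a
    end
  end
end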

  • pantulis 3 years ago

    Also, serving images from shared NFS mounts was not the best of ideas. I remember when S3 came out and the Rails attachment plugin ecosystem quickly added support; it was a godsend.

    • ilyt 3 years ago

      Even some hacked setup with 2-way sync is better. The dumbest setup (but still somehow the least of the problems in that architecture) I saw was running syncthing [1] on a few servers, just syncing everything and hoping conflicts won't happen...

      Like, I did it, but I wouldn't recommend it: restarting an NFS server is gnarly, HA support on the OSS side is... not really there last time I checked, and it's an overall PITA.

      * [1] https://syncthing.net/

neonsunset 3 years ago

Weak technology stack, and a deeply flawed concept of federation that enables local centralization of control, discord-mods-meme style, with all the corresponding issues.

Mastodon should have been based on a DHT, with each "terminal" aka "profile" having much higher autonomy.

Otherwise, it just gives more tools to the people who left Twitter to continue doing the same societal damage.

p.s.: it is time to stop writing back-ends in Ruby when every other popular alternative (sans Python-based ones) is more powerful and scalable.

  • denysvitali 3 years ago

    IMHO decentralization is not the way to go, which is why I started OpenDolphin [1].

    If you end up having a "decentralized system" with 30k users per instance, you basically just have a centralized system that federates with other instances. Sure, it is kind of decentralized, but the admins of that 30k instance are effectively able to read the DMs, impersonate users and delete their content.

    I personally think (and I'm trying to formalize my ideas somehow with OpenDolphin) that a centralized instance that is only used to serve the __signed__ / encrypted content solves some parts of the decentralization issues we're seeing here - whilst still giving the users some of the features of decentralized platforms.

    If you like / dislike the idea, help us out! We're trying to build a community to build together something great. Every contribution counts (:

    And btw, yes, I do agree: that hardware for 30k users doesn't make any sense - it really shows that something isn't optimized :(

    [1]: https://about.opendolphin.social/

    • vidarh 3 years ago

      A centralised instance of any kind is a non-starter for a lot of those of us who have moved to Mastodon. And yes, the admins of that 30k instance are able to do all kinds of things, and their users are able to leave if they do. I'd be all for improvements around signing and encryption, but not at the cost of centralisation (for my part, I run my own Mastodon server, but I'm also tinkering with my own ActivityPub implementation).

      • denysvitali 3 years ago

        Care to elaborate on why a centralized instance is a non-starter? It looks like Mastodon's approach is decentralized, yes, but it suffers from the same censorship issues as Twitter - instances are often blacklisted on Mastodon.

        • vidarh 3 years ago

          Because if I wanted that I'd just stay with Twitter, and that's the attitude of a lot of us.

          I run my own instance. I can't do that on Twitter. If I get tired of it, I can move to another instance and most of my followers will migrate automatically (not all, yet, because not all ActivityPub software supports the move mechanism, but when I moved to my personal instance 80%+ already moved automatically, and it'll go up).

          Instances can be blacklisted on Mastodon, but that has far less of an impact unless you're so particularly abusive that a large portion of the network blocks you.

          If you don't provide means to do robust blocking and moderation that will be another reason for people to stay away. Mastodon has these "issues" because it is what users want. Anyone can say exactly what they want, but anyone can also choose to block you, or your instance, if they don't want to listen. Gab is an example that is on Mastodon and is widely blocked because most people don't want to interact with them, and that's our right, but they're still free to talk to those who want to deal with them.

        • ilyt 3 years ago

          Users choosing to block someone vs server admins deciding, on behalf of their users, to block a whole other server.

          It would be fine if it was just "don't pull this server to public feeds", but as it is now it just repeats the problem of giving moderators too much power over users.

          • vidarh 3 years ago

            It can be just "don't pull this server to public feeds". In fact it can be a lot more granular. You can:

            * "Silence" it - makes posts invisible on your instance for everyone not following them
            * "Suspend" it - removes all content
            * Reject media files - avoids downloading media files locally from this instance
            * Reject reports from instances that are abusing the reporting

            But limiting an instance is often insufficient, because it doesn't stop your users from subsequently replying etc. and indirectly dragging the conversation onto your server, and it doesn't stop the remote instance's users from harassing your users. And so it's often users pushing for harsher blocking.

            In any case on Mastodon people have choice - if you don't like your local admins mod decisions, you can move elsewhere and take your followers with you, so if anything mods have far less power on Mastodon than most places.

            • ilyt 3 years ago

              > In any case on Mastodon people have choice - if you don't like your local admins mod decisions, you can move elsewhere and take your followers with you, so if anything mods have far less power on Mastodon than most places.

              The "I now have to run my own server, and tell my followers to move with me" is kinda the blocker here

              >In any case on Mastodon people have choice - if you don't like your local admins mod decisions, you can move elsewhere and take your followers with you, so if anything mods have far less power on Mastodon than most places.

              The amount of power they have is the same or even bigger, just over a smaller number of users.

              • vidarh 3 years ago

                > The "I now have to run my own server, and tell my followers to move with me" is kinda the blocker here

                Not what I suggested. You have 3k+ instances to pick from before you need to opt to run your own. And Mastodon has built in functionality to move followers for you. Could be smoother, but it works - I've used it. Took me <1h from initiating the move until most of my followers were following my new account with no actions on their side.

                > The amount of power they have is the same or even bigger, just over a smaller number of users.

                The very fact that people can move diminishes that power for any admin who cares about retaining users.

            • denysvitali 3 years ago

              But that's not the reason you switched from Twitter to Mastodon, right? Because, as you're describing it, the problems are similar - although on a smaller scale.

              The main difference is that with Mastodon it seems like you have a choice to move away from a rogue instance

              • vidarh 3 years ago

                To me the problems being "on a smaller scale" makes them fundamentally different. On Twitter you're beholden to the moderation decisions of a single company. On Mastodon I'm not. That's a giant difference. The problems are not remotely comparable.

                On top of that, once I moved, I found engagement was just way higher.

                But the decentralised and open nature is the driver for me with Mastodon. Knowing that I can control my own experience, both in terms of moderation and in terms of being able to mix and match software which suits me (e.g. I plan to add ActivityPub support to my blog and use it for a comment section).

                • ilyt 3 years ago

                  Okay, but I don't want any wanker to moderate what I decide to watch. Public streams, sure, there are reasons for that, moderate away, but if the solution is "well, you really need to just run your own server infrastructure" that's a shitty solution

                • denysvitali 3 years ago

                  Thanks for your inputs!

    • tedivm 3 years ago

      > Sure, it is kind of decentralized, but the admins of that 30k instance are effectively able to read the DMs, impersonate users and delete their content.

      There's a working group on end to end encryption already and I do believe they will solve that problem.

      Archiving content is trivial and can be automated. It's also pretty easy to migrate between instances - I started on one run by a friend which I felt was too small, moved to one of the biggest ones, and then ended up on hachyderm. I may end up moving again, as I feel like the service is getting rather big - it's one of the largest instances now, and there are benefits to being on smaller instances that tend to push people towards them.

    • CharlesW 3 years ago

      I think it's neat to start a Twitter competitor and build it in the open, but I don't understand how you're going to get traction without federation or some kind of undeniable USP. Even Twitter had to "federate" with SMS at first.

      Tip: On https://about.opendolphin.social/about, the word "Retribution" is probably not the word you want here. "Compensation", maybe?

      • denysvitali 3 years ago

        Thing is... I don't know of any particular killer feature currently. It also shouldn't be up to me to decide, tbh, as I would like to shape a product that is built by the users for the users.

        Thanks for the tip, I'll update the website [1] soon.

        [1]: https://github.com/OpenDolphin/website

    • ilyt 3 years ago

      I think just having a public key posted in a DNS record + the subscribers saving both the "human name" and the public key would be enough; even if someone takes over a domain, every subscriber will get an alert that the key has changed. Maybe have a backup method of just storing `/.well-known/<social-network>/username` with the same info for those that don't/can't fuck with DNS.

      Building on that, the list of address+pubkey pairs can be put somewhere searchable (a DHT on the server nodes?) so if someone moves shop they can still be found. Then the client could either subscribe directly to whoever they want (akin to RSS), or get on someone's instance (which would be akin to an RSS aggregator) to participate in their community.
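
      A rough Ruby sketch of the DNS part of that idea - to be clear, this is a hypothetical scheme, not an existing standard; the `_social` record name and the `k=` format are made up:

      # Hypothetical scheme only. Assume a TXT record like:
      #   _social.alice.example.com  TXT  "k=<base64-encoded public key>"
      require 'resolv'
      require 'base64'

      def fetch_published_key(domain, user)
        records = Resolv::DNS.open do |dns|
          dns.getresources("_social.#{user}.#{domain}", Resolv::DNS::Resource::IN::TXT)
        end
        txt = records.flat_map(&:strings).find { |s| s.start_with?('k=') }
        txt && Base64.decode64(txt.delete_prefix('k='))
      end

      # The client keeps {handle => key} locally and alerts the user if the key ever changes;
      # a /.well-known/<social-network>/<user> file would be the non-DNS fallback.
      key = fetch_published_key('example.com', 'alice')
      puts key ? 'key published' : 'no key record found'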

      • vidarh 3 years ago

        Mastodon uses webfinger for this, which scales much better (though I find it annoying that webfinger wasn't specified with a URL format that'd make it easy to provide static replies like what you suggested), and supports aliases and redirection.

        E.g. if you go here: https://galaxybound.com/.well-known/webfinger?resource=acct:...

        You'll get redirected here: https://m.galaxybound.com/.well-known/webfinger?resource=acc...

        Which will return a webfinger profile, which includes this activitypub profile (curl command line because you need the "Accept:" header):

        curl -H "Accept: application/activity+json" https://m.galaxybound.com/users/vidar

        If you look at the "publicKey" key in that JSON doc, you'll see:

        { "id": "https://m.galaxybound.com/users/vidar#main-key", "owner": "https://m.galaxybound.com/users/vidar", "publicKeyPem": "-----BEGIN PUBLIC KEY-----\nMIIBIjANBgkqhkiG9w0BAQEFAAOCAQ8AMIIBCgKCAQEAvHAMudraHmCA2u8wcI2/\n7iB0Td73AF7tk3LC7AuUxzCMu0Ipf/mNVdh+2nFOQ0pY+7E/wEgGeFg5BlMNC7hK\nebfYtY5ZFm7upVqQ0OXTP2hC+fnDTNcpmnPqmwMRdL54YtiZvQtRzW++ZKquWjER\nInM97NsjW0H0yuVkMETfPX1ilWkzTlWa9m0+H5Tmaz3EbKk3VanJXSLKNLnSq2lK\nbKcr2kD/U5UobzJDaaKjK+tW50iTOMiFEXLAT+4Po375WlgxKchahvtPyioiDm6j\noneGAbKRM/eAiHf7EhP76zyoL/LNG0XP4rDEFr4Ia4vo4HG1zhbGZ+815Tip4lNW\nYwIDAQAB\n-----END PUBLIC KEY-----\n" }

        .. which is used to check the signature of my posts when they federate.

        From the profile you also know where to get my posts, and can get them either as JSON or RSS if you want to pull them without actually following me.

        If someone "moves shop" there'll be an alias in place. E.g.:

        curl -H "Accept: application/activity+json" https://mastodon.social/users/vidarh | jq .movedTo

        gives: "https://m.galaxybound.com/users/vidar"

        Now, something like a DHT to replace webfinger so a move can be done without cooperation from the original instance might be interesting, but Mastodon works pretty well here already.
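
        For anyone who'd rather see that lookup as code, a rough Ruby sketch mirroring the curl command above (no error handling; it assumes the server responds with the actor document as shown):

        # Fetch the ActivityPub actor document and pull out the public key that a
        # receiving server uses to verify the signature on federated posts.
        require 'net/http'
        require 'json'
        require 'openssl'
        require 'uri'

        actor_uri = URI('https://m.galaxybound.com/users/vidar')
        req = Net::HTTP::Get.new(actor_uri)
        req['Accept'] = 'application/activity+json'  # needed, as noted above, to get JSON back

        res   = Net::HTTP.start(actor_uri.host, actor_uri.port, use_ssl: true) { |h| h.request(req) }
        actor = JSON.parse(res.body)

        puts actor.dig('publicKey', 'id')            # ".../users/vidar#main-key" per the JSON above
        key = OpenSSL::PKey::RSA.new(actor.dig('publicKey', 'publicKeyPem'))
        puts key.public?                             # => true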

  • CharlesW 3 years ago

    Disambiguating DHT in case it's helpful for others, even though I'm still unclear on how this would be better than what ActivityPub does now: https://en.wikipedia.org/wiki/Mainline_DHT

  • MuteXR 3 years ago

    How would a DHT be any better than AP? It's not even close to doing the same thing, very few people would want something that comes with all the massive disadvantages that a DHT would bring.

  • aprdm 3 years ago

    Shopify, Basecamp, and GitHub sort of disagree with you.

cyberphobe 3 years ago

These comments make me want to log off for a bit.

Post: we hit scaling issues caused by our failing disks and running image hosting and databases over NFS

HN: It’s obviously Ruby on Rails fault

  • kris-novaOP 3 years ago

    Hi. I wrote the post. Additionally, I am responsible for operating Hachyderm (Ruby on Rails) and GitHub (Ruby on Rails), in both my free time and my day job.

    I can say with certainty that Ruby specifically was not the bottleneck in our case. I do think that the rails paradigm can often lead to interdependent systems. We see this at GitHub and we also see this in Mastodon. Service A will do reads/writes against the same tables in the database that Service B also does. When service A is moved to an isolation zone, it can still impact Service B's performance.

    In other words, I think any stateful framework with the flexibility that Ruby on Rails offers encourages bad behavior that can contribute to a noisy-neighbor problem.

    The point I am trying to drive home is that I agree. I can confidently say that Ruby on Rails is not the culprit in our case. To be honest I just ignore anyone who is quick to point fingers and assign blame either technically or personally.

    Sorry Hacker News got you down. If it helps, my family and I are making Sunday morning pancakes with my puppy Björn today, and we are all wishing you the best day ever.

    • cyberphobe 3 years ago

      aw thanks! I follow you on mastodon and it’s been great watching the updates as hachyderm grows

    • rbanffy 3 years ago

      Is there any good architectural breakdown of Mastodon software?

  • Kye 3 years ago

    It's surreal. I thought maybe I misread the post or something.

musk_micropenis 3 years ago

30,000 users seems like a ludicrously small number of users to hit scaling problems. It sounds like Mastodon has not been designed for scale from the ground up, which is surprising for a project that hopes to be a popular social network.

  • pacbard 3 years ago

    The number of users of an instance is not equal to the workload for the instance because of how federation works.

    The frontend serves 30,000 users. The backend processes their posts alongside the posts of everyone that replies to them and all posts from the people that the “home” users follow. So, while the home user base is 30,000, the effective load on the backend is much higher, depending on how many followers/followings a person has.

    You can find posts from people who were hosting their own single user instance that went down because of a popular post that got federated across multiple instances.

    • ilyt 3 years ago

      > You can find posts from people who were hosting their own single user instance that went down because of a popular post that got federated across multiple instances.

      Either I'm misunderstanding something about it, or this still points to Mastodon being slow and badly engineered.

      Like, the updates are per server, not per subscriber, right? That's at worst a few thousand requests, all of which can essentially be served from RAM. Even if you get 10k responses to your comment, 10k responses within, say, 5 minutes is still only like 30-40 req/sec, which shouldn't be much even for Ruby.

    • rvz 3 years ago

      > You can find posts from people who were hosting their own single user instance that went down because of a popular post that got federated across multiple instances.

      That doesn't give confidence to the typical self-hosting Mastodon user who goes 'viral' somehow. They should expect that if one of their posts goes viral across instances, they'll have to become a sysadmin for the day to bring the instance back up and scale it to handle the traffic.

      No wonder normal users are not self-hosting their own instances to fully own and self-verify themselves on Mastodon, and instead have to search for an instance to re-centralize to.

      That is very disappointing and not a great sell for Mastodon so-called 'verification', but not at all surprising.

    • AnIdiotOnTheNet 3 years ago

      So what I'm hearing is that the whole thing was designed as though any significant scaling was never really expected. That's not exactly encouraging for its future.

  • CharlesW 3 years ago

    The largest Mastodon instance is mastodon.social, which had roughly 900K users as of 11/22. This article, together with the three linked near the top, is gold for understanding what experienced (but new-to-Mastodon) system architects may go through as they grow a Mastodon instance from an interesting experiment that lives in their bedroom to a system that reliably supports tens of thousands of users.

  • throwaway349902 3 years ago

    The CTO of gab.com did a great breakdown on the scaling issues of ActivityPub federation: https://www.youtube.com/watch?v=3kDtZ8MBWy8

  • rvz 3 years ago

    Of course. This doesn't come as a surprise; as others have already said, Mastodon is not designed to handle hundreds of thousands of users and scale up, hence smaller volunteer-run instances are frequently running into issues with such a low number of users.

    Mastodon is still a solution in search of a problem, already reminding us that federated social networks fail in the long term, eventually withering away and re-centralizing around larger instances.

  • huimang 3 years ago

    Mastodon was designed to be a system of federated instances, not necessarily a few instances scaled up to hundreds of thousands of users each. More people should be hosting themselves or joining smaller instances.

  • MuteXR 3 years ago

    Mastodon isn't designed for massive instances, people are supposed to find smaller ones instead of centralizing. If you "scale", it goes against what Mastodon is even for.

  • tcfhgj 3 years ago

    it has not been designed for scale because a single instance temporarily couldn't scale up fast enough?

    • musk_micropenis 3 years ago

      No, because a single instance needed to "scale up" at a very small user count.

      • dang 3 years ago

        Trollish usernames aren't allowed on HN because they subtly troll every thread they post to. I've therefore banned this account. If you don't want it to stay banned, we can rename it for you, as long as it's to something non-trollish.

        https://hn.algolia.com/?sort=byDate&dateRange=all&type=comme...

      • rbanffy 3 years ago

        It really needed to replace a defective set of SSDs. They migrated because they were already planning to, but, with a replacement set of disks, it could continue serving more users for longer and migrate on a more relaxed schedule.

  • aprdm 3 years ago

    You’re free to share your expertise in its repo or put some MRs

    • smcn 3 years ago

      This here is the benefit of Mastodon. If Twitter sucks, I can't help make it better, but I am able to throw code at Mastodon (or insert ActivityPub software of choice).

  • wlonkly 3 years ago

    Sometimes (like in this case) the scaling problem is "obtaining hardware".

    Hardware having been obtained (via Hetzner, instead of in Kris Nova's home lab), the instance has scaled.

    • musk_micropenis 3 years ago

      Before the scaling problems were hit, they were hosting on 4 very powerful machines with loads of RAM and all-SSD storage. I don't see any reasonable world in which those machines aren't enough to power 30,000 users.

      • progval 3 years ago

        The issue was the media storage. Mastodon stores lots of small files in deeply nested structures, so filesystems, especially networked filesystems, are very badly suited to it. Not to mention the disks themselves having issues.

        The article mentions requests blocking for 10 to 20 seconds, which causes everything else to go slower.

        • vidarh 3 years ago

          > Mastodon stores lots of small files in deeply nested structures

          The deep nesting and overall layout of the cache drives me nuts. The storage layout is designed for the state of filesystems circa the late 90's, pre-reiser or ext3, with the major deficiency of pretty much demanding an unnecessarily expensive object storage setup.

          Since most of the space on an instance like this goes to caching remote content, it also ought not to try to make caching decisions in the app, but leave the actual caching to a proper cache. That would've been trivial if the paths of the cached objects were mapped to the media-proxy URLs it falls back to when the file has not already been downloaded. Instead a bunch of effort goes into building a cache setup that people often end up putting on expensive object storage, when it's data that shouldn't even need redundancy.

        • ilyt 3 years ago

          Eh, we served way bigger things, with millions of small files, via NFS just fine - out of non-SSD storage too. Their issue, as written in the article, was the SSDs failing.

          It was a migration from one clustered filesystem to another, and the temporary step was just the Apache equivalent of try_files plus an NFS mount alongside the new FS while the migration was running in the background.

          If anything filesystems are better with some nesting vs "one big dir"...

      • wlonkly 3 years ago

        I believe it was I/O bound, on old, failing SSDs. Even after migrating object storage out to DigitalOcean's S3-equivalent, Postgres couldn't keep up on the terrible disks.

  • VWWHFSfQ 3 years ago

    Yeah, and take a look at the hardware specs that were hosting this thing. And the site fell over at 30k users??? Something is wrong with the software.

    • CharlesW 3 years ago

      When you RTFA, you'll learn that the site didn't fall over because of the Mastodon software.

      • rbanffy 3 years ago

        It’s very hard to scale when your persistent storage is failing and timing out.

valeg 3 years ago

We have come full circle. I remember seeing lots of Twitter's fail whale when it was run on RoR.
