The Facebook crawler is hammering the internet

twitter.com

27 points by jedisct1 4 months ago · 10 comments

jedisct1OP 4 months ago

Interestingly, they rotate user agents.

A few days ago they were identifying as “FaceBot” (not “FacebookBot”).

When that started getting blocked, they switched to reusing the “Facebookexternalhit” user agent, which they also use for redirects and which people are less likely to block.
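
For anyone filtering on the user agent, the rotation described above means matching a single name is not enough. Here is a minimal sketch that checks a User-Agent string against the three names mentioned in this thread. The function name `looks_like_facebook_crawler` and the token list are assumptions for illustration, not an official list from Meta, and a user agent can be spoofed by anyone.

```python
# Tokens taken from the names mentioned in this thread; not an official list.
ASSUMED_FACEBOOK_UA_TOKENS = ("facebot", "facebookbot", "facebookexternalhit")


def looks_like_facebook_crawler(user_agent):
    """Return True if the User-Agent contains any of the assumed tokens.

    The comparison is case-insensitive because the capitalization seen in
    logs varies ("Facebookexternalhit" vs "facebookexternalhit/1.1").
    """
    ua = (user_agent or "").lower()
    return any(token in ua for token in ASSUMED_FACEBOOK_UA_TOKENS)
```

This only reflects what the client claims to be; it says nothing about where the request really came from.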

BoredPositron 4 months ago

Meta as a company always acts like the most unprofessional idiots if they feel under pressure. They have all the time and resources to do it right and they never do.

emot 4 months ago

Isn't this the crawler that generates previews on Facebook? They have others for training and AI stuff. Oh well, one never knows with Meta...

—— The facebookexternalhit/1.1 user agent you're seeing in the logs is a Facebook crawler, specifically used by Facebook’s servers to fetch content (like Open Graph metadata) when:

Someone shares a link on Facebook or Messenger

Facebook needs to generate a preview (title, image, description) for that URL
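
To make the preview step concrete, here is a rough sketch of pulling Open Graph metadata out of a page, which is roughly what any link-preview fetcher has to do. It uses only Python's standard `html.parser`; the class and function names are made up for illustration, and this is not a description of Facebook's actual implementation.

```python
from html.parser import HTMLParser


class OpenGraphParser(HTMLParser):
    """Collects <meta property="og:..." content="..."> tags."""

    def __init__(self):
        super().__init__()
        self.properties = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr_map = dict(attrs)
        prop = attr_map.get("property") or ""
        content = attr_map.get("content")
        if prop.startswith("og:") and content is not None:
            # Keep the first value seen for each property.
            self.properties.setdefault(prop, content)


def extract_open_graph(html):
    """Return a dict of og:* properties found in the HTML text."""
    parser = OpenGraphParser()
    parser.feed(html)
    return parser.properties
```

A preview needs only a handful of these values (typically `og:title`, `og:image`, `og:description`), so one fetch per shared link should be enough.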

N19PEDL2 4 months ago

Genuine question: how do they know the bot is from Facebook, apart from what's written in the user agent?

  • extraduder_ire 4 months ago

    They're cropped off to the side, but I assume the IPs making those requests are in a block owned by Facebook.
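
    The IP check suggested here can be sketched with Python's standard `ipaddress` module. The ranges below are placeholders from the reserved documentation blocks, NOT Meta's real prefixes; you would substitute whatever ranges the operator actually announces. The function name is also just an assumption for illustration.

```python
import ipaddress

# Placeholder ranges (reserved documentation blocks), NOT real Meta ranges.
# Replace with the prefixes the crawler's operator actually announces.
ASSUMED_CRAWLER_BLOCKS = ["192.0.2.0/24", "2001:db8::/32"]


def ip_in_blocks(ip, blocks=ASSUMED_CRAWLER_BLOCKS):
    """Return True if the address falls inside any of the given CIDR blocks."""
    addr = ipaddress.ip_address(ip)
    for block in blocks:
        net = ipaddress.ip_network(block)
        # Only compare addresses and networks of the same IP version.
        if addr.version == net.version and addr in net:
            return True
    return False
```

    Unlike the user agent, the source address is hard to fake for a full HTTP request, so combining both checks gives a much stronger signal than either alone.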

bediger4000 4 months ago

Would a 404 or a 403 be more appropriate? What if you just want Meta crawlers to go away forever?
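
For what it's worth, 403 tells the client the server refuses the request, while 404 claims there is nothing at that URL. Whether either one makes a crawler give up is not something this thread answers. Here is a small WSGI middleware sketch that makes the status a parameter so both can be tried; the name `block_crawlers` and the token list are assumptions for illustration.

```python
from http import HTTPStatus

# Tokens taken from the names mentioned in this thread; not an official list.
BLOCKED_UA_TOKENS = ("facebot", "facebookbot", "facebookexternalhit")


def block_crawlers(app, status=HTTPStatus.FORBIDDEN):
    """Wrap a WSGI app and answer matching user agents with `status`."""

    def middleware(environ, start_response):
        ua = environ.get("HTTP_USER_AGENT", "").lower()
        if any(token in ua for token in BLOCKED_UA_TOKENS):
            body = status.phrase.encode("ascii")
            headers = [
                ("Content-Type", "text/plain"),
                ("Content-Length", str(len(body))),
            ]
            start_response(f"{status.value} {status.phrase}", headers)
            return [body]
        # Everyone else reaches the real application untouched.
        return app(environ, start_response)

    return middleware
```

Usage would look like `app = block_crawlers(app, HTTPStatus.NOT_FOUND)` to answer with 404 instead of the default 403.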
