Question 1

What is Meta-ExternalAgent?

Accepted Answer

Meta-ExternalAgent is one of Meta's documented crawlers. Meta states its purpose is “training foundation AI models or improving products by indexing content directly.” It is operated by Meta (Facebook, Instagram, WhatsApp).

Question 2

What is the Meta-ExternalAgent user-agent string?

Accepted Answer

The documented user-agent string is: meta-externalagent/1.1 (+https://developers.facebook.com/docs/sharing/webmasters/crawler)

Question 3

Does Meta-ExternalAgent respect robots.txt?

Accepted Answer

Honored in general, but read the fine print. Meta does not give Meta-ExternalAgent a blanket compliance guarantee; instead it names sibling bots that may bypass robots.txt — Meta-ExternalFetcher (user-requested fetches) and FacebookExternalHit (security/integrity checks). Meta-ExternalAgent itself is not called out as a bypasser, so a User-agent: Meta-ExternalAgent group is the right control.

Question 4

How do I block Meta-ExternalAgent in robots.txt?

Accepted Answer

Add these two lines to your robots.txt: "User-agent: Meta-ExternalAgent" followed by "Disallow: /". To explicitly allow it instead, use "Allow: /".

Meta-ExternalAgent

What is Meta-ExternalAgent?

The Meta-ExternalAgent user-agent string

How do I block Meta-ExternalAgent in robots.txt?

Block Meta-ExternalAgent

Allow Meta-ExternalAgent

Does Meta-ExternalAgent respect robots.txt?

Should you block Meta-ExternalAgent?

Official documentation

What does your robots.txt say about Meta-ExternalAgent?