Commit graph

6 commits

Author SHA1 Message Date
Robert R George
a98a8ee93f
Update robots.txt to prevent crawling of domain blocks (#26470)
Co-authored-by: Claire <claire.github-309c@sitedethib.com>
2024-12-02 08:03:24 +00:00
Foritus
405f141fe0
Change: Block GPTBot (#26396) 2023-08-09 11:58:46 +02:00
ThibG
c4f2433300 Disallow robots from indexing /interact/ (#10666)
This does not provide any new information and may just triple the number
of crawled pages
2019-05-02 00:10:19 +02:00
nightpool
a5992e5883 Change robots.txt to exclude only media proxy URLs (#10038)
* Revert "Change robots.txt to exclude some URLs (#10037)"

This reverts commit 80161f4351.

* Let's block media_proxy

/media_proxy/ is a dynamic route used for requesting uncached media, so it's
probably bad to let crawlers use it

* misleading comment
2019-02-14 03:11:47 +01:00
Eugen Rochko
80161f4351
Change robots.txt to exclude some URLs (#10037)
- Exclude static assets
- Exclude uploaded files
- Exclude alternate versions of the profile page
- Exclude media proxy URLs
2019-02-13 21:28:18 +01:00
Eugen Rochko
9c4856bdb1 Initial commit 2016-02-20 22:53:20 +01:00