TrouBot

What is TrouBot ?

TrouBot is an automatic program (sometimes refered as robot, spider or web crawler) that's used for the Trouba Search Engine. The function of TrouBot is to access both PC and mobile websites and establish an index database which enables users to search the content of your site on Trouba Search. You can identify crawling from TrouBot / Trouba by analyzing the User-agent field.

How to block TrouBot from visiting your website ?

TrouBot complies with the internet robots protocol. You can use the robots.txt file to prevent TrouBot from accessing your website or to prevent TrouBot from accessing parts of your website.

Note: Banning TrouBot from accessing your site will make the pages on your site and all search engine services provided by Trouba unsearchable within the Trouba Ecosystem.

Robots.txt setup recommendations: You can set different crawling rules according to different user-agents, and you can directly prevent the crawling of TrouBot by using the following line:

Block TrouBot from accessing your website:

                User-agent: TrouBot
                Disallow: /
            

Allow only parts of the website:

                User-agent: TrouBot
                Allow: /show-this/
                Disallow: /nothing-to-see/
            

How much impact does the TrouBot have on a Web Server

In order to have a better retrieval results for the target resources, TrouBot needs to maintain a certain amount of crawling of your website. We try to impact as less as possible any website, and we will make adjustments based on comprehensive factors such as server capacity, website quality, and website updates. If there is any unreasonable behavior of TrouBot’s visiting, please send your concerns to troubot@trouba.ch

Why can my website still be found on Trouba Search when it has been banned via the robots.txt

It takes time to update data in the search engine index database. Although TrouBot has stopped accessing the pages on your site, it may take several months to clear the page index information that has already been established in the Trouba Search Engine database. Besides, please check if your robots.txt configuration is correct. If your removal is very urgent, please make your request to troubot@trouba.ch