Skip to content

Commit

Permalink
Add GarlikCrawler, ImplisenseBot and WikiDo (fnando#336)
Browse files Browse the repository at this point in the history
  • Loading branch information
paolodona authored and fnando committed Mar 5, 2018
1 parent 6f627f6 commit 784d33b
Show file tree
Hide file tree
Showing 3 changed files with 10 additions and 0 deletions.
4 changes: 4 additions & 0 deletions CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,9 @@
# Changelog

## UNRELEASED

- Add GarlikCrawler, ImplisenseBot and WikiDo bots.

## v2.5.3

- Add Google Site Verification to the bot list.
Expand Down
3 changes: 3 additions & 0 deletions bots.yml
Original file line number Diff line number Diff line change
Expand Up @@ -73,6 +73,7 @@ feedfetcher-google: "Google Feedfetcher"
findxbot: "Findxbot"
flipboardproxy: "FlipboardProxy"
friendfeedbot: "FriendFeed"
garlik: "GarlikCrawler"
genieo: "Genieo Web filter bot"
getprismatic.com: "getprismatic.com"
gigabot: "Gigabot spider"
Expand All @@ -98,6 +99,7 @@ hubspot: "HubSpot"
ia_archiver: "Internet Archive (WayBackMachine)"
icoreservice: "iCoreService"
idmarch: "idmarch.org/bot.html"
implisensebot: 'ImplisenseBot'
inagist: "URL resolver"
insieve: "Insieve Bot"
insitesbot: "Insitesbot"
Expand Down Expand Up @@ -248,6 +250,7 @@ webscout: "Webscout"
wesee: "WeSEE"
wget: "wget unix CLI http client"
whatsapp: "WhatsApp"
wikido: "WikiDo"
wordpress: "WordPress spider"
woriobot: "woriobot"
wormly: "WormlyBot"
Expand Down
3 changes: 3 additions & 0 deletions test/ua_bots.yml
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,7 @@ DOMAINAREANIMATOR: 'Domain Re-Animator Bot (http://domainreanimator.com) - suppo
DOT_BOT: 'Mozilla/5.0 (compatible; DotBot/1.1; http://www.opensiteexplorer.org/dotbot, help@moz.com)'
DUCKDUCKGO: 'DuckDuckBot/1.0; (+http://duckduckgo.com/duckduckbot.html)'
FACEBOOK_BOT: 'facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)'
GARLIK: 'GarlikCrawler/1.2 (http://garlik.com/, crawler@garlik.com)'
GOOGLE_BOT: 'Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)'
GOOGLE_PAGE_SPEED_INSIGHTS: 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.4 (KHTML, like Gecko; Google Page Speed Insights) Chrome/22.0.1229 Safari/537.4'
GOOGLE_SITE_VERIFICATION: Mozilla/5.0 (compatible; Google-Site-Verification/1.0)
Expand All @@ -29,6 +30,7 @@ GOOGLE_STRUCTURED_DATA_TESTING_TOOL: 'Mozilla/5.0 (compatible; X11; Linux x86_64
GRAPESHOT: 'Mozilla/5.0 (compatible; GrapeshotCrawler/2.0; +http://www.grapeshot.co.uk/crawler.php)'
JOBSEEKER: 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/538.1 (KHTML, like Gecko) JobBot/5.0 (compatible; +http://www.jobseeker.com.au/bot.html) Safari/538.1'
LINKDEXBOT: 'Mozilla/5.0 (compatible; linkdexbot/2.0; +http://www.linkdex.com/bots/)'
IMPLISENSEBOT: 'ImplisenseBot 1.0'
LOAD_TIME_BOT: 'Mozilla/5.0 (compatible; LoadTimeBot/0.9; +http://www.loadtime.net/bot.html)'
LTX71: 'ltx71 - (http://ltx71.com/)'
MAIL_RU: 'Mozilla/5.0 (compatible; Linux x86_64; Mail.RU_Bot/2.0; +http://go.mail.ru/help/robots)'
Expand Down Expand Up @@ -60,6 +62,7 @@ TRAACKR: 'Traackr.com'
WATCHSUMO: 'Mozilla/5.0 (compatible) WatchSumo/1.0.0 (http://www.watchsumo.com)'
WEBCEO: 'Mozilla/5.0 (compatible; online-webceo-bot/1.0; +http://online.webceo.com)'
WHATSAPP: 'WhatsApp/2.17.38 Mozilla/5.0 (Linux; U; Android 6.1; en-us; DV Build/Donut) AppleWebKit/537.36 (KHTML, like Gecko) Safari/537.36'
WIKIDO: 'WikiDo/1.1 (http://wikido.com; crawler@wikido.com)'
WORIOBOT: 'Mozilla/5.0 (compatible; woriobot +http://worio.com)'
YAHOO_AD_MONITORING: 'Mozilla/5.0 (compatible; Yahoo Ad monitoring; https://help.yahoo.com/kb/yahoo-ad-monitoring-SLN24857.html)'
YAHOO_SLURP: 'Mozilla/5.0 (compatible; Yahoo! Slurp; http://help.yahoo.com/help/us/ysearch/slurp)'
Expand Down

0 comments on commit 784d33b

Please sign in to comment.