Skip to content

Commit c123374

Browse files
authored
Update pattern (#248)
1 parent b489f2c commit c123374

File tree

5 files changed

+23
-4
lines changed

5 files changed

+23
-4
lines changed

CHANGELOG.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,10 @@
11
# Changelog
22

3+
## [5.1.2](https://github.com/omrilotan/isbot/compare/v5.1.1...v5.1.2)
4+
5+
- Add bots: Cypress, Detectify, InternetMeasurement, BuiltWith
6+
- Recognise browser: Zip Recruiter job search app, Ecosia android in-app browser
7+
38
## [5.1.1](https://github.com/omrilotan/isbot/compare/v5.1.0...v5.1.1)
49

510
- Reduce pattern size by introducing the substring ".com" and improve generic pattern

fixtures/browsers.yml

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -159,6 +159,9 @@ DuckDuckGo:
159159
- Mozilla/5.0 (Linux; Android 10) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/84.0.4147.111 Mobile DuckDuckGo/5 Safari/537.36
160160
- Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15) AppleWebKit/605.1.15 (KHTML, like Gecko) Mobile/15E148 DuckDuckGo/7
161161
- Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_4) AppleWebKit/605.1.15 (KHTML, like Gecko) Mobile/15E148 DuckDuckGo/7
162+
Ecosia:
163+
- Mozilla/5.0 (Linux; Android 10; K) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/111.0.0.0 Mobile Safari/537.36 (Ecosia [email protected])
164+
- Mozilla/5.0 (Linux; Android 8.0.0; Moto Z2 Play) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/101.0.4951.41 Mobile Safari/537.36 (Ecosia [email protected])
162165
Edge:
163166
- Mozilla/5.0 (iPhone; CPU iPhone OS 13_3 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/13.0 EdgiOS/44.10.19 Mobile/15E148 Safari/605.1.15
164167
- Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_6) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/81.0.4044.92 Safari/537.36 Edg/81.0.416.45
@@ -176,7 +179,6 @@ Electron Application:
176179
- Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Atom/1.45.0 Chrome/69.0.3497.128 Electron/4.2.7 Safari/537.36
177180
- Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) discord/0.0.306 Chrome/78.0.3904.130 Electron/7.1.11 Safari/537.36
178181
- Mozilla/5.0 (Windows NT 6.1; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) electron/1.0.0 Chrome/53.0.2785.113 Electron/1.4.3 Safari/537.36
179-
- Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Cypress/3.3.2 Chrome/61.0.3163.100 Electron/2.0.18 Safari/537.36
180182
Elinks:
181183
- ELinks/0.12~pre6-1ubuntu1 (textmode; Ubuntu; Linux 3.11.0-13-generic i686; 100x25-2)
182184
"ESPN: Sports News & Highlights App":
@@ -677,6 +679,9 @@ Yandex:
677679
- Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87 YaBrowser/17.3.1.873 (beta) Yowser/2.5 Safari/537.36
678680
Yandex Browser:
679681
- Mozilla/5.0 (Windows NT 6.2; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/28.0.1500.95 YaBrowser/13.10.1500.9323 Safari/537.36
682+
Zip Recruiter (Job Search phone app):
683+
- Job Search/24.4.0 Mozilla/5.0 (Linux; Android 14; SM-S918U Build/UP1A.231005.007; wv) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/122.0.6261.119 Mobile Safari/537.36
684+
- Job Search/95.0 (iPhone; CPU iOS 17_3_1 like Mac OS X)
680685
ZZZ Glitches and Misidentified Browsers - These browsers are legit user agent even though they are included in some bot listings (not all bots identify themselves):
681686
- (iPhone; CPU iPhone OS 16_4_1 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/16.4 Mobile/15E148 DuckDuckGo/7 Safari/605.1.15
682687
- ; Android 11) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/96.0.4664.45 Mobile DuckDuckGo/5 Safari/537.36
@@ -695,5 +700,7 @@ ZZZ Glitches and Misidentified Browsers - These browsers are legit user agent ev
695700
- User-Agent:Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/79.0.3945.74 Safari/537.36 Edg/90.0.818.62
696701
- User-Agent:Mozilla/5.0 (Windows NT 10.0; WOW64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/92.0 Safari /537.36
697702
ZZZ Insignificat bots - These bots have very low appearance rate and are not worth blocking:
703+
- Mozilla/5.0 (compatible; MSIE 7.0; Windows NT 5.1; .NET CLR 1.1.4322) 360JK yunjiankong 427691
698704
- Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; WOW64; Trident/5.0; Banca Caboto s.p.a.)
705+
- Mozilla/5.0 (Unknown; Linux x86_64) AppleWebKit/538.1 (KHTML, like Gecko) HLB/1.247
699706
- Opera/9.70 (Linux armv7l ; U; turbotabbee/TSV2.0/1.02Q; fr) Presto/2.2

fixtures/crawlers.yml

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -109,6 +109,9 @@ BUbiNG:
109109
- BUbiNG (+http://law.di.unimi.it/BUbiNG.html)
110110
Buck:
111111
- Buck/2.2; (+https://app.hypefactors.com/media-monitoring/about.html)
112+
BuiltWith:
113+
- BW/1.1; bit.ly/3eZNDnO
114+
- BW/1.1; rb.gy/oupwis
112115
Burp Collaborator Scanner:
113116
- Godzilla/17.0 (Unknown Operator; Nexus 5X Build/MMB29P) NoWebKit/5.36 (DOM, like Rhino; n7vbji2k0wt3rsbev55blgx17sdk8lwek97zvo.burpcollaborator.net) TOR/540.5.35487 Torrent 2654.76
114117
- http://5iojs2zo8rkhlrrwmm7oyl75twztnlo9ex9kz8o.burpcollaborator.net/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/75.0.3770.142 Safari/537.36

package.json

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
{
22
"name": "isbot",
3-
"version": "5.1.1",
3+
"version": "5.1.2",
44
"description": "🤖/👨‍🦰 Recognise bots/crawlers/spiders using the user agent string.",
55
"keywords": [
66
"bot",

src/patterns.json

Lines changed: 6 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,10 +5,10 @@
55
"(?:^|[^g])news",
66
"(?<! (?:channel/|google/))google(?!(app|/google| pixel))",
77
"(?<! cu)bot(?:[^\\w]|_|$)",
8-
"(?<! ya(?:yandex)?)search",
8+
"(?<!(?: ya| yandex|^job) ?)search",
99
"(?<!(?:lib))http",
1010
"(?<![hg]m)score",
11-
"@",
11+
"(?<!android)@",
1212
"\\(\\)",
1313
"\\.com",
1414
"^12345",
@@ -22,6 +22,7 @@
2222
"^avsdevicesdk/",
2323
"^biglotron",
2424
"^bot",
25+
"^bw/",
2526
"^clamav[ /]",
2627
"^client/",
2728
"^cobweb/",
@@ -89,9 +90,11 @@
8990
"classifier",
9091
"cloud",
9192
"crawl",
93+
"cypress/",
9294
"dareboost",
9395
"datanyze",
9496
"dejaclick",
97+
"detect",
9598
"dmbrowser",
9699
"download",
97100
"evc-batch/",
@@ -112,6 +115,7 @@
112115
"library",
113116
"mail\\.ru/",
114117
"manager",
118+
"measure",
115119
"neustar wpm",
116120
"node",
117121
"nutch",

0 commit comments

Comments
 (0)