diff --git a/filter-lists/20190106115600_filters-sorted-by-hits-manual-tags-2nd-round.csv b/filter-lists/20190106115600_filters-sorted-by-hits-manual-tags-2nd-round.csv index 2991986147f62c68ce7d47bba998f1a3ccb3f021..2512800229227c046921e96f04f15b043dd3f768 100644 --- a/filter-lists/20190106115600_filters-sorted-by-hits-manual-tags-2nd-round.csv +++ b/filter-lists/20190106115600_filters-sorted-by-hits-manual-tags-2nd-round.csv @@ -682,66 +682,66 @@ Updated for new ccnorm. RF 20160820â€" 677 500 1 0 0 1 0 default 20160929181057 disallow 11 Sock sockpuppetry 678 758 1 0 0 0 0 default 20160426065334 11 Block evading editor hidden_vandalism 679 776 1 0 0 1 0 default 20160829162005 disallow 10 Prolific socker IV (update) sockpuppetry -680 290 0 0 0 1 0 default 20100312022427 10 172 Filter -681 36 1 0 0 1 0 default 20090830005032 warn,disallow 10 SEO push University of Atlanta +680 290 0 0 0 1 0 default 20100312022427 10 172 Filter politically_motivated not quite sure what the issue is, but ‘political party’, ‘canada’ and ‘trudeau’ (current prime minister of canada) which are the patterns looked for signal politically motivated vandalism +681 36 1 0 0 1 0 default 20090830005032 warn,disallow 10 SEO push University of Atlanta conflict_of_interest or ‘seo’ (which I haven’t used during the second labeling sofar) 682 817 1 0 0 0 0 default 20170104011726 disallow 10 Obamacare vandalism politically_motivated -683 824 1 0 0 0 0 default 20170418153256 disallow 10 Zeitgeist -684 343 0 0 0 1 0 default 20100920023444 10 Keemstar vandalism -685 603 1 0 0 0 0 default 20160405110310 warn 10 Special case of reverting XLinkBot reverts +683 824 1 0 0 0 0 default 20170418153256 disallow 10 Zeitgeist hidden_vandalism +684 343 0 0 0 1 0 default 20100920023444 10 Keemstar vandalism general_vandalism not quite sure what the issue is, but ‘vandalism’ appears in the name of the filter, hence the label; this seems to be an internet celebrity; +685 603 1 0 0 0 0 default 20160405110310 warn 10 Special case of reverting XLinkBot reverts hidden_vandalism 686 533 1 0 0 1 0 default 20160614033928 disallow 9 Balun link spammer spam 687 291 1 0 0 1 0 default 20100808081955 9 Prolific Seelincolshire Sock puppetry sockpuppetry -688 810 1 0 0 1 0 default 20161213162548 9 Suix vandal -689 315 1 0 0 1 0 default 20100808081913 9 Garbled content +688 810 1 0 0 1 0 default 20161213162548 9 Suix vandal hidden_vandalism +689 315 1 0 0 1 0 default 20100808081913 9 Garbled content hidden_vandalism 690 74 0 0 0 1 0 default 20090320212342 9 Unwikified text added to end of article good_faith_wiki_syntax -691 353 1 0 0 1 0 default 20100819063503 disallow 9 Blackout vandalism -692 468 1 0 0 1 0 default 20160614032907 9 Fixing vandal -693 486 1 0 0 1 0 default 20160614033248 disallow 9 Chilean vandal +691 353 1 0 0 1 0 default 20100819063503 disallow 9 Blackout vandalism hidden_vandalism +692 468 1 0 0 1 0 default 20160614032907 9 Fixing vandal hidden_vandalism +693 486 1 0 0 1 0 default 20160614033248 disallow 9 Chilean vandal hidden_vandalism 694 248 1 0 0 1 0 default 20110310110852 disallow 9 Memphis Music Spam spam -695 764 1 0 0 1 0 default 20170402074700 disallow 9 Shirik's Refdesk Filter -696 530 1 0 0 1 0 default 20130228170555 disallow 8 Writ Keeper's test filter -697 22 1 0 0 1 0 default 20160812194447 disallow 8 New accounts mentioning the abuse filter -698 284 1 0 0 1 0 default 20100214192857 disallow 8 Image vandalism -699 802 1 0 0 0 0 default 20170716170553 8 Teenage Fairytale Dropouts LTA -700 611 1 0 0 1 0 default 20160818174003 disallow 8 Persistent talk page abuse from IP ranges III -701 371 1 0 0 1 0 default 20110310111531 8 Reference desk trolling -702 908 1 0 0 0 0 default 20180809162142 8 Si Thu Moe Min -703 153 1 0 0 1 0 default 20090928230922 disallow 8 Sock deterrance -704 932 0 0 1 0 0 default 20180912225448 8 "Adding ""theknot.com"" into BLP articles" -705 679 1 0 0 1 0 default 20150518215516 8 Possible offensive redirect -706 729 1 0 0 0 0 default 20160126162749 8 Vandalism from Santa Cruz Coe -707 514 0 0 0 1 0 default 20170402073819 warn 7 Legoktm's test filter -708 8 0 0 0 1 0 default 20090819175717 7 Self-redirect -709 324 1 0 0 1 0 default 20160904145949 disallow 7 Possible edits by Fraberj -710 327 1 0 0 1 0 default 20101107191330 disallow 7 Special TFA filter -711 857 1 0 0 1 0 default 20170522203213 disallow 7 DigitalRavan -712 108 0 0 0 1 0 default 20090327021159 7 Hangon Tag -713 927 1 0 1 0 0 default 20181028205638 7 LTA 927 -714 695 1 0 0 1 0 default 20181104191958 7 Long term vandal -715 444 1 0 0 1 0 default 20130404081005 disallow 7 Pezzuto, Ivo references -716 208 0 0 0 1 0 default 20090830005614 7 Removal of WebCite URLs -717 245 1 0 0 1 0 default 20110310110845 disallow 7 AN and AN/I abuse -718 275 1 0 0 1 0 default 20100329231511 6 New users moving featured articles -719 38 1 0 0 1 0 default 20090807112247 warn,disallow 6 Argentina nuclear energy IP hopper -720 57 0 0 0 1 0 default 20090327103250 6 Creation of attack page titles -721 70 1 0 0 1 0 default 20090321081152 6 Unusual move reason -722 347 1 0 0 1 0 default 20190105155520 warn,disallow 6 Yet another test filter -723 612 1 0 0 1 0 default 20140602034412 disallow 6 Wrestling pusher -724 870 1 0 0 0 0 default 20180821180549 disallow 6 nowiki phishing -725 903 1 0 0 0 0 default 20181108021307 disallow 6 neocatechumenal way -726 401 0 0 0 1 0 default 20120808231528 disallow 6 """Red hair"" vandalism" -727 418 1 0 0 1 0 default 20170402073605 disallow 6 User is Shirik (test filter) -728 421 0 0 0 1 0 default 20120808231722 6 Engvar filter -729 428 1 0 0 1 0 default 20120808231734 6 Image abuse -730 207 0 0 0 1 0 default 20090830005600 6 Non-admins reviewing unblock requests -731 214 0 0 0 1 0 default 20090728230153 6 Creating articles with title contained in username -732 512 1 0 0 1 0 default 20160614033654 5 Long-term pattern abuse IV -733 20 0 0 0 1 0 default 20160812193140 5 "Saying ""The abuse filter will block this""" -734 540 1 0 0 1 0 default 20130329222146 disallow 5 Temporary filter -735 801 1 0 0 1 0 default 20161113203914 5 Prolific_socking_IPs -736 49 1 0 0 1 0 default 20160813234857 5 Hina spam -737 571 1 0 0 1 0 default 20160614034525 disallow 5 Persistent Disruption -738 69 1 0 0 1 0 default 20090321185406 5 Unusual title change 2 -739 842 1 0 0 0 0 default 20170314161944 disallow 5 Talk page abuse +695 764 1 0 0 1 0 default 20170402074700 disallow 9 Shirik's Refdesk Filter hidden_vandalism +696 530 1 0 0 1 0 default 20130228170555 disallow 8 Writ Keeper's test filter test +697 22 1 0 0 1 0 default 20160812194447 disallow 8 New accounts mentioning the abuse filter hidden_vandalism +698 284 1 0 0 1 0 default 20100214192857 disallow 8 Image vandalism image_vandalism +699 802 1 0 0 0 0 default 20170716170553 8 Teenage Fairytale Dropouts LTA long_term_abuse +700 611 1 0 0 1 0 default 20160818174003 disallow 8 Persistent talk page abuse from IP ranges III talk_page_vandalism +701 371 1 0 0 1 0 default 20110310111531 8 Reference desk trolling trolling +702 908 1 0 0 0 0 default 20180809162142 8 Si Thu Moe Min hidden_vandalism +703 153 1 0 0 1 0 default 20090928230922 disallow 8 Sock deterrance sockpuppetry +704 932 0 0 1 0 0 default 20180912225448 8 "Adding ""theknot.com"" into BLP articles" spam theknot.com seems to be a wedding planing website? +705 679 1 0 0 1 0 default 20150518215516 8 Possible offensive redirect hidden_vandalism +706 729 1 0 0 0 0 default 20160126162749 8 Vandalism from Santa Cruz Coe hidden_vandalism +707 514 0 0 0 1 0 default 20170402073819 warn 7 Legoktm's test filter test +708 8 0 0 0 1 0 default 20090819175717 7 Self-redirect good_faith_redirect +709 324 1 0 0 1 0 default 20160904145949 disallow 7 Possible edits by Fraberj hidden_vandalism +710 327 1 0 0 1 0 default 20101107191330 disallow 7 Special TFA filter hidden_vandalism TFA=Today’s featured article +711 857 1 0 0 1 0 default 20170522203213 disallow 7 DigitalRavan hidden_vandalism +712 108 0 0 0 1 0 default 20090327021159 7 Hangon Tag good_faith_template +713 927 1 0 1 0 0 default 20181028205638 7 LTA 927 long_term_abuse +714 695 1 0 0 1 0 default 20181104191958 7 Long term vandal long_term_abuse +715 444 1 0 0 1 0 default 20130404081005 disallow 7 Pezzuto, Ivo references hidden_vandalism +716 208 0 0 0 1 0 default 20090830005614 7 Removal of WebCite URLs unclear I’m not quite certain why is this an issue +717 245 1 0 0 1 0 default 20110310110845 disallow 7 AN and AN/I abuse hidden_vandalism AN=Administrator’s noticeboard +718 275 1 0 0 1 0 default 20100329231511 6 New users moving featured articles hidden_vandalism +719 38 1 0 0 1 0 default 20090807112247 warn,disallow 6 Argentina nuclear energy IP hopper hidden_vandalism +720 57 0 0 0 1 0 default 20090327103250 6 Creation of attack page titles doxxing as per comments; the title doesn’t quite fit the patterns and discussion, imho +721 70 1 0 0 1 0 default 20090321081152 6 Unusual move reason hidden_vandalism or ‘page_move_vandalism’ as per title +722 347 1 0 0 1 0 default 20190105155520 warn,disallow 6 Yet another test filter test +723 612 1 0 0 1 0 default 20140602034412 disallow 6 Wrestling pusher hidden_vandalism +724 870 1 0 0 0 0 default 20180821180549 disallow 6 nowiki phishing phishing +725 903 1 0 0 0 0 default 20181108021307 disallow 6 neocatechumenal way hidden_vandalism +726 401 0 0 0 1 0 default 20120808231528 disallow 6 """Red hair"" vandalism" silly_vandalism +727 418 1 0 0 1 0 default 20170402073605 disallow 6 User is Shirik (test filter) test +728 421 0 0 0 1 0 default 20120808231722 6 Engvar filter good_faith not quite sure what “changes between accepted styles by new users†are (see comments), but hey +729 428 1 0 0 1 0 default 20120808231734 6 Image abuse image_vandalism +730 207 0 0 0 1 0 default 20090830005600 6 Non-admins reviewing unblock requests good_faith_template +731 214 0 0 0 1 0 default 20090728230153 6 Creating articles with title contained in username self_promotion +732 512 1 0 0 1 0 default 20160614033654 5 Long-term pattern abuse IV long_term_abuse +733 20 0 0 0 1 0 default 20160812193140 5 "Saying ""The abuse filter will block this""" silly_vandalism +734 540 1 0 0 1 0 default 20130329222146 disallow 5 Temporary filter hidden_vandalism +735 801 1 0 0 1 0 default 20161113203914 5 Prolific_socking_IPs sockpuppetry +736 49 1 0 0 1 0 default 20160813234857 5 Hina spam spam +737 571 1 0 0 1 0 default 20160614034525 disallow 5 Persistent Disruption hidden_vandalism +738 69 1 0 0 1 0 default 20090321185406 5 Unusual title change 2 hidden_vandalism +739 842 1 0 0 0 0 default 20170314161944 disallow 5 Talk page abuse talk_page_vandalism 740 338 0 0 0 1 0 default 20100618045125 5 Vuvuzela vandalism 741 858 0 0 0 1 0 default 20181211182505 5 Anime vandal filter 742 610 1 0 0 0 0 default 20160818173520 disallow 5 Turkish Vandal diff --git a/thesis/5-Overview-EN-Wiki.tex b/thesis/5-Overview-EN-Wiki.tex index 552d06de427e10712af56b86934e112d4ffb0f31..ae3e0c071f53fe2680b00da64414465313305e87 100644 --- a/thesis/5-Overview-EN-Wiki.tex +++ b/thesis/5-Overview-EN-Wiki.tex @@ -470,10 +470,10 @@ Multiple filters have the comment "let's see whether this hits something", which \section{Patterns in filters creation and usage} * What are typical filter usage patterns? - ** switched on for a while, then deactivated and never activated again?: 81 (bad charts), 167 (two brief disables underway), 302 (switched off on the grounds of insufficient activity); 904 (to track smth) - ** switched on for a short while and then powered down: mostly stuff merged to other filters; or for which the community decides filter is not an appropriate solution (308), 199 ('Unflagged Bots'); or decides to not implement the thing (that way); 290 (disabled, since relevant pages were protected) - ** or switched off after a short while because there were no hits: 304, 67, 122 - ** or switched off after a longer while, because it was not tripped frequently, in order to save conditions from the condition limit: 211 ("Disable, appears to be inactive (log only filter). If you are using this filter, please let me know, and I'll reenable it -Prodego") + ** switched on for a while, then deactivated and never activated again?: 81 (bad charts), 167 (two brief disables underway), 302 (switched off on the grounds of insufficient activity); 904 (to track smth); + ** switched on for a short while and then powered down: mostly stuff merged to other filters; or for which the community decides filter is not an appropriate solution (308), 199 ('Unflagged Bots'); or decides to not implement the thing (that way); 290 (disabled, since relevant pages were protected); 207 ("Copy of another one we disabled. Unneeded, a bot already sees this. -Prodego") + ** or switched off after a short while because there were no hits: 304, 67, 122, 401 ("Red hair" vandalism) + ** or switched off after a longer while, because it was not tripped frequently, in order to save conditions from the condition limit: 211 ("Disable, appears to be inactive (log only filter). If you are using this filter, please let me know, and I'll reenable it -Prodego"); 20 ("A waste of processor time, deleted -Prodego") ** switched off bc merged to another filter 440 was merged in 345 ** on for a short while and off again bc?? (false positives is a plausible option here): 394 ** switched on and still on: 11 (verify), 79 (with brief periods of being disabled for couple of minutes/hours, probably in order to update the pattern), 164, 642 (if we ignore the 2min period it was disabled on 13.4.2018), 733 (2.11.2015-present), 29 (18.3.2009-present), 30 (18.3.2009-present), 33 (18.3.2009-present), 39 (18.3.2009-present), 50 (18.3.2009-present), 59 (19.3.2009-present), 80 (22.3.2009-present) @@ -484,7 +484,7 @@ Multiple filters have the comment "let's see whether this hits something", which * What do filters target: general behaviour vs edits by single users ** there are quite some filters targeting particular users: 290 (targets an IP range), 177 ('User:Television Radio'), 663 ('Techno genre warrior ', targets specific IP ranges) - ** there are also some targetting particular pages (verify!), although this clashed with the guidelines: 264 "Specific-page vandalism" (it's hidden though, so we don't know what exactly it's doing) + ** there are also some targetting particular pages (verify!), although this clashed with the guidelines: 264 "Specific-page vandalism" (it's hidden though, so we don't know what exactly it's doing); 401 ("Red hair" vandalism); there's smth with the main page; ** and there are some filtering in general ** there are also filters such as 199 (Unflagged bots) which were implemented in order to track something which was not quite malicious or abusive and were thus deemed inappropriate use of filters by the community and consequently (quite swiftly) deleted ** some target insults in general and some contain regexes containing very specifically insults directed towards edit filter managers (see filter 12)