From fe927c90116dc17d5c89c1d38be15d6938aef351 Mon Sep 17 00:00:00 2001 From: Lyudmila Vaseva <vaseva@mi.fu-berlin.de> Date: Thu, 10 Jan 2019 18:11:51 +0100 Subject: [PATCH] Update next steps --- todo | 13 +++++++++++++ 1 file changed, 13 insertions(+) diff --git a/todo b/todo index 686f2a3..46f5c64 100644 --- a/todo +++ b/todo @@ -1,10 +1,23 @@ # Next steps * Look at filters: what different types of filters are there? how do we classify them? + * classify in "vandalism"|"good_faith"|"biased_edits"|"misc" for now * syntactic vs semantic vs ? (ALL CAPS is syntactic) * are there ontologies? * how is spam classified for example? +* check filter rules for edits in user/talks name spaces (may be indication of filtering harassment) +* add "af_deleted" column to filter list +* add also "af_enabled" column to filter list; could be that the high hit count was made by false positives, which will have led to disabling the filter (TODO: that's a very interesting question actually; how do we know the high number of hits were actually leggit problems the filter wanted to catch and no false positives?) + +* Setup CSCW latex template up + +* add a README to github repo + +* Read these two pages +https://en.wikipedia.org/wiki/Wikipedia:Vandalism +https://en.wikipedia.org/wiki/Wikipedia:Vandalism_types + * look at AbuseFilter extention code: how is a filter trigger logged? https://github.com/wikimedia/mediawiki-extensions-AbuseFilter/blob/master/includes/AbuseFilter.php -- GitLab