From ff0067384381231a23de6d91185b2c11b4939f2b Mon Sep 17 00:00:00 2001 From: Lyudmila Vaseva <vaseva@mi.fu-berlin.de> Date: Thu, 13 Jun 2019 10:27:05 +0200 Subject: [PATCH] Add notes --- thesis/4-Edit-Filters.tex | 11 ++++++++--- thesis/introduction.tex | 6 ++++++ 2 files changed, 14 insertions(+), 3 deletions(-) diff --git a/thesis/4-Edit-Filters.tex b/thesis/4-Edit-Filters.tex index 716fe8f..fabf091 100644 --- a/thesis/4-Edit-Filters.tex +++ b/thesis/4-Edit-Filters.tex @@ -428,22 +428,27 @@ So, Huggle let the user define their own queue, but only along predefined parame \section{Fazit} %Conclusion, resume, bottom line +In short, in this chapter we found/worked out following salient characteristics of edit filters: .... +Why does this system continue to exist in times of fancier (syn!) machine learning based tools? + + maybe it's a historical phenomenon (in many regards): * perhaps there were differences that are not essential anymore, such as: * on which infrastructure does it run (part of the core software vs own computers of the bot operators) * filters are triggered *before* an edit is even published, whereas bots (and tools) can revert an edit post factum. Is this really an important difference in times when bots need a couple of seconds to revert an edit? * perhaps the extension was implemented because someone was capable of implementing and working well with this type of systems so they just went and did it (do-ocracy; Wikipedia as a collaborative volunteer project); * perhaps it still exists in times of fancier machine learning based tools (or bots) because rule-based systems are more transparent/easily understandable for humans and writing a regex is simpler than coding a bot. - -%TODO maybe put here the comparison table I've started as a feedback from the status presentation +* hypothesis: it is easier to set up a filter than program a bot. Setting up a filter requires "only" understanding of regular expressions. Programming a bot requires knowledge of a programming language and understanding of the API. Question: -Oftentimes edit filter managers are also bot operators; how would they decide when to implement an filter and when a bot? +Oftentimes edit filter managers are also bot operators; how would they decide when to implement a filter and when a bot? +%TODO: ask people! (on IRC?) \begin{comment} \url{http://www.aaronsw.com/weblog/whorunswikipedia} "But what’s less well-known is that it’s also the site that anyone can run. The vandals aren’t stopped because someone is in charge of stopping them; it was simply something people started doing. And it’s not just vandalism: a “welcoming committee†says hi to every new user, a “cleanup taskforce†goes around doing factchecking. The site’s rules are made by rough consensus. Even the servers are largely run this way — a group of volunteer sysadmins hang out on IRC, keeping an eye on things. Until quite recently, the Foundation that supposedly runs Wikipedia had no actual employees. This is so unusual, we don’t even have a word for it. It’s tempting to say “democracyâ€, but that’s woefully inadequate. Wikipedia doesn’t hold a vote and elect someone to be in charge of vandal-fighting. Indeed, “Wikipedia†doesn’t do anything at all. Someone simply sees that there are vandals to be fought and steps up to do the job." +//yeah, I'd call it "do-ocracy" \end{comment} diff --git a/thesis/introduction.tex b/thesis/introduction.tex index 8405519..3dc6c19 100644 --- a/thesis/introduction.tex +++ b/thesis/introduction.tex @@ -59,6 +59,12 @@ To this end, we study the academic contributions on Wikipedia's quality control \begin{comment} +# Motivation + +* What is the role of filters among existing (algorithmic) quality-control mechanisms (bots, semi-automated tools, ORES, humans)? Which type of tasks do filters take over? +* How have these tasks evolved over time (are they changes in the type, number, etc.)? +* What are suitable areas of application for rule-based systems such as filters in contrast to the other ML-based approaches? + Questions from Confluence Q1 We wanted to improve our understanding of the role of filters in existing algorithmic quality-control mechanisms (bots, ORES, humans). Q2 Which type of tasks do these filters take over in comparison to the other mechanisms? How these tasks evolve over time (are they changes in the type, number, etc.)? -- GitLab