diff --git a/thesis/2-Background.tex b/thesis/2-Background.tex
index a60285b34f3346d3b303514e840f08168570e03c..f9f687a3305f006ea3b023815765de975d91a640 100644
--- a/thesis/2-Background.tex
+++ b/thesis/2-Background.tex
@@ -14,6 +14,12 @@ and also some accounts on the emerging machine learning service ORES~\cite{HalTa
 Time and again, the literature also refers to more ``manual'' forms of quality control by editors using watchlists to keep an eye on articles they care about or even accidentally discovering edits made in bad faith~\cite{Livingstone2016}, \cite{AstHal2018}.
 There is one mechanism though that is conspicuously missing from all these reports: edit filters.
 
+\begin{figure}
+\centering
+ \includegraphics[width=0.9\columnwidth]{pics/funnel-diagramm-no-filters.JPG}
+ \caption{State of the scientific literature: edit filters are missing from the quality control frame}~\label{fig:funnel-no-filters}
+\end{figure}
+
 At first, scientific studies on Wikipedia largely ignored algorithmic quality control mechanisms.
 Their contribution to the encyclopedia and therefore their impact were considered insignificant. %quote?
 This has gradually changed since around 2009 when the first papers specifically dedicated to bots (and later semi-automated tools) were published.
diff --git a/thesis/4-Edit-Filters.tex b/thesis/4-Edit-Filters.tex
index 606a7191f921fa057d15124c6ef8fed4f3a64580..84c99c7e838ccdd2a07aea2a06f1871b46ee82ed 100644
--- a/thesis/4-Edit-Filters.tex
+++ b/thesis/4-Edit-Filters.tex
@@ -330,42 +330,63 @@ If such an account is compromised, it loses its edit filter manager rights and g
 
 \section{Edit filters' role in the quality control frame}
 
+Let us review in this section what we have learnt so far and how edit filters fit into Wikipedia's quality control ecosystem.
+%TODO: explain table with text
+
+As shown in Figure~\ref{fig:funnel-with-filters}, edit filters are crucial since they become active before any of the other mechanisms.
+
+\begin{figure}
+\centering
+ \includegraphics[width=0.9\columnwidth]{pics/funnel-diagramm-with-filters.JPG}
+ \caption{Edit filters' role in the quality control frame}~\label{fig:funnel-with-filters}
+\end{figure}
+
+\begin{comment}
+\begin{longtable}{ l | c | c | c | c }
+ & Filters & Bots & Semi-Automated tools & ORES \\
+\end{longtable}
+\end{comment}
+
+\begin{verbatim}
                      | Filters                          | Bots                          | Semi-Automated tools    | ORES
 --------------------------------------------------------------------------------------------------------------------------------------------
                      |                                  |                               |                         |
                      | - based on REGEXes               | - rule/ML based               | - rule/ML based         | - ML framework
-properties           | - part of the "software"/        | - "bespoke code": run on      | - heuristics obfusca-   | - can be used by other tools
-                     |   platform (MediaWiki ext)       |   user's infrastructure       |   ted by the interface  |
-                     | - public filters are directly    | - no requirement for the code |   (but often configura- |
+properties           | - part of the "software"/        | - "bespoke code": run on      | - heuristics obfusca-   | - not used directly, but can
+                     |   platform (MediaWiki ext)       |   user's infrastructure       |   ted by the interface  |   instead be incorporated in
+                     | - public filters are directly    | - no requirement for the code |   (but often configura- |   other tools
                      |   visible for everyone           |   to be made public           |   ble)                  |
-                     |   interested                     | - you can relatively easily   |                         |
-                     |                                  |   get all the filters; you    |                         |
-                     |                                  |   easily get all the bots     |                         |
+                     |   interested                     | - you can relatively easily   | - mostly based on a     |
+                     | - trigger *before* an edit is    |   get all the filters; you    |   centralised queue     |
+                     |   published                      |   cannot easily get all bots  | - trigger after an edit |
+                     |                                  | - trigger after an edit is    |   is published          |
+                     |                                  |   published                   |                         |
+                     |                                  |                               |                         |
+---------------------|------------------------------------------------------------------------------------------------------------------------
                      |                                  |                               |                         |
----------------------|-------------------------------------------------------------------------------------------------------------------------
-                     |                                  |                               |                         |
                      | - edit filter managers           | - no special permissions/     | - *rollback* perm       | - mostly Scoring platform
                      |   group (EN Wiki)                |   rights needed               |                         |   team (?) Who does this?
                      |   (abusefilter-modify perm)      | - a bot gets a bot flag       |                         |
                      |                                  |                               |                         |
----------------------|-----------------------------------------------------------------------------------------------------------
+---------------------|------------------------------------------------------------------------------------------------------------------------
                      |                                  |                               |                         |
                      | - become an edit filter manager  | - get an approval to run the  | - learn the tool        | - understand ML
-What are the hurdles | - you have to *only* understand  |   bot from the BAG            | - install Windows^^     | - formal requirements to
+What are the hurdles | - you have to *only* understand  |   bot from the BAG            | - install Windows^^     | - formal requirements to
 to participate       |   REGEXes (relatively simple?    | - programming knowledge       |   (some don't support   |   develop ORES?
                      |   although relatively fast quite | - understand APIs, ..         |   other OS)             |
                      |   confusing)                     | - (but there is a lot to      | - get the rollback perm |
                      |                                  |   understand with all the     |                         |
                      |                                  |   mechanisms)                 |                         |
                      |                                  |                               |                         |
--------------------------------------------------------------------------------------------------------------------------------------
+-----------------------------------------------------------------------------------------------------------------------------------------------
                      |                                  |                               |                         |
-                     | - censorship infrastructure?     | - "botophobia"                | - gamification          |
-Concerns             | - powerful, can in theory block  |                               |                         |
+                     | - censorship infrastructure?     | - "botophobia"                | - gamification          | - general ML concerns:
+Concerns             | - powerful, can in theory block  |                               |                         |   hard to understand
                      |   editors based on hidden        |                               |                         |
                      |   filters                        |                               |                         |
                      |                                  |                               |                         |
------------------------------------------------------------------------------------------------------------------------
+------------------------------------------------------------------------------------------------------------------------------------------------
+\end{verbatim}
 
 \subsection{Alternatives}
 %TODO: where should this go? Already kind of mentioned in the introducing a filter part