From d280717b32da82f859cb68a9c565cddaca615d85 Mon Sep 17 00:00:00 2001 From: Lyudmila Vaseva <vaseva@mi.fu-berlin.de> Date: Sun, 14 Jul 2019 08:37:06 +0200 Subject: [PATCH] Start methodology chapter --- thesis/3-Methods.tex | 112 ++++++++++++++++++++++++++---------------- thesis/references.bib | 2 +- 2 files changed, 70 insertions(+), 44 deletions(-) diff --git a/thesis/3-Methods.tex b/thesis/3-Methods.tex index a847386..2a0fd8a 100644 --- a/thesis/3-Methods.tex +++ b/thesis/3-Methods.tex @@ -1,39 +1,46 @@ \chapter{Methods} \label{chap:methods} -\begin{comment} -vgl \cite{GeiHal2017} -iterative mixed method -combination of: -* quantitative methods: mining big data sets/computational social science -"begin with one or -more large (but often thin) datasets generated by a software platform, which has recorded digital -traces that users leave in interacting on that platform. Such researchers then seek to mine as much -signal and significance from these found datasets as they can at scale in order to answer a research -question" -* more traditional social science/qualitative methods, e.g. interviews, observations, experiments +This chapter describes the methodology applied throughout the thesis. + +\section{Open Science} + +The whole work tries to adhere to the principles of open science. %TODO what are the principle of open science? refs are missing +All the computations I have done and other artefacts I have used or compiled are openly accessible in the project's repository~\cite{github}. +And have been openly accessible since the very beginning. +Everyone interested can follow the process and/or use the data or scripts in order to verify my computations (syn) or run their own and thus continue this research along one of the directions suggested in section~\ref{sec:further-studies} or in a completely new one. -\cite{Geiger2014} -"the idea that Wikipedia only takes place on wiki- -pedia.org – or even entirely on the Internet – is a huge misunderstanding (Konieczny, 2009; -Reagle, 2010). Wikipedia is not a virtual world, especially one located entirely on the wiki." -e.g. in order to get hold of abuse_filter_history I had to engage with -- wikipedia.org -- mediawiki.org -- irc channels -- phabricator -- gerrit -- toolserver/cloudservices ----- -other spaces Wikipedia takes place -- mailinglists -- WomenEdit/offenes Editieren @Wikimedia -- Wikimania -- Wikimedia's office and daily work -\end{comment} \section{Grounded Theory} + +I have employed some methods used by grounded theory %TODO check whether it's written with caps +scholars, most prominently/above all their coding processes. +There are different branches? in grounded theory that diverge slightly or more prominently. +I followed the guidelines and .. proposed/described by Charmaz in~\cite{Charmaz2006}. + + \section{Trace Ethnography} +A second important theoretical framework constitutes the trace ethnography. +The concept was first introduced/used by Geiger and Ribes in their 2010 work ``The work of sustaining order in Wikipedia: the banning of a vandal''~\cite{GeiRib2010} and introduced in detail in a 2011 paper~\cite{GeiRib2011}. +The scholars define trace ethnography as a methodology which +``combines the richness of participant-observation +with the wealth of data in logs so as to reconstruct +patterns and practices of users in distributed +sociotechnical systems'' +and is especially practical for research in distributes technical systems (doppelt gemoppelt with quote) where direct partipants observation is impractical, costly and tend to miss phenomena due to.. +They use documents and document traces: ... %TODO which ones +in order to reconstruct quite exactly single strands of actions and comprehend how different agents on Wikipedia work together towards the blocking of a single malicious user. +They (syn!) refer to ``turn[ing] thin documentary traces into “thick descriptions†of actors and events". +What is more, these traces are used by Wikipedians themselves in order to do their work efficiently. +Geiger and Ribes underline the importance of insider knowledge when reconstructing actions and processes based on the traces, +the need for ``an ethnographic understanding of the activities, people, systems, and technologies which contribute to their production''. + +They alert that via trace ethnography only that can be observed which is recorded by the system and records are always incomplete. +%TODO pitfalls of using data produced for other purposes? + +The researchers also warn of possible privacy breaching through thickening traces: +although records they use to reconstruct paths of action are all open, the thick descriptions they compile can suddenly expose a lot of information about single users which never existed in this form before and who never gave their informed consent for their data being used this way. + \cite{GeiRib2011} Introduce the methodology (and the concept) of trace ethnography. @@ -60,20 +67,6 @@ at a fine level of granularity" "turn thin documentary traces into “thick descriptions†[10] of actors and events" -"traces can only -be fully inverted through an ethnographic -understanding of the activities, people, systems, and -technologies which contribute to their production." - -traditional ethnographic observation is costly and inpractical in distributed settings (and may miss phenomena that occur between sites) - -Critique: -"it only can observe what the system -or platform records, which are always incomplete." - -Concerns: -- ethical: breaching privacy via thickening the traces; no possibility for informed consent - \cite{GeiHal2017} "when working with large-scale “found data†[36] of the traces users leave behind when interacting on a platform, how do we best operationalize culturally-specific @@ -107,5 +100,38 @@ question" * more traditional social science/qualitative methods, e.g. interviews, observations, experiments \end{comment} +\begin{comment} +vgl \cite{GeiHal2017} +iterative mixed method +combination of: +* quantitative methods: mining big data sets/computational social science +"begin with one or +more large (but often thin) datasets generated by a software platform, which has recorded digital +traces that users leave in interacting on that platform. Such researchers then seek to mine as much +signal and significance from these found datasets as they can at scale in order to answer a research +question" +* more traditional social science/qualitative methods, e.g. interviews, observations, experiments + +\cite{Geiger2014} +"the idea that Wikipedia only takes place on wiki- +pedia.org – or even entirely on the Internet – is a huge misunderstanding (Konieczny, 2009; +Reagle, 2010). Wikipedia is not a virtual world, especially one located entirely on the wiki." +e.g. in order to get hold of abuse_filter_history I had to engage with +- wikipedia.org +- mediawiki.org +- irc channels +- phabricator +- gerrit +- toolserver/cloudservices +---- +other spaces Wikipedia takes place +- mailinglists +- WomenEdit/offenes Editieren @Wikimedia +- Wikimania +- Wikimedia's office and daily work +\end{comment} + \section{Cooking Data With Care} or Critical data science? Or both? + +\section{Validation} diff --git a/thesis/references.bib b/thesis/references.bib index 08ecc14..38f3388 100644 --- a/thesis/references.bib +++ b/thesis/references.bib @@ -132,7 +132,7 @@ @misc{github, key = "Github Repository", author = {}, - title = {}, + title = {Github Repository of the thesis}, year = 2019, note = {\url{https://github.com/lusy/wikifilters}} } -- GitLab