From eb96f039d4d5b2c5cd4e1b108e2d88e446493f22 Mon Sep 17 00:00:00 2001
From: Lyudmila Vaseva <vaseva@mi.fu-berlin.de>
Date: Tue, 26 Feb 2019 07:53:29 +0100
Subject: [PATCH] Write out Data section

---
 article/literature.bib  |  8 ++++++++
 article/proceedings.tex | 14 ++++++++++++--
 2 files changed, 20 insertions(+), 2 deletions(-)

diff --git a/article/literature.bib b/article/literature.bib
index 4a5baf7..4867e52 100644
--- a/article/literature.bib
+++ b/article/literature.bib
@@ -5,6 +5,14 @@
   publisher =    {SAGE}
 }
 
+@misc{github,
+  key =          "Github Repository",
+  author =       {},
+  title =        {},
+  year =         2019,
+  note =         {\url{https://github.com/lusy/wikifilters}}
+}
+
 @book{Lessig2006,
   title =        {Code version 2.0},
   author =       {Lawrence Lessig},
diff --git a/article/proceedings.tex b/article/proceedings.tex
index 4c3a021..2dd945f 100644
--- a/article/proceedings.tex
+++ b/article/proceedings.tex
@@ -245,12 +245,22 @@ examples of disruptive editing:
 
 \section{Data}
 
-What is the best place herefor?
+%What is the best place herefor?
 
+The data upon which the current analysis builds presently consists of 2 sets/data base tables.
+These are \emph{abuse\_filter} and \emph{abuse\_filter\_action} from \emph{enwiki\_p}, downloaded on 6.01.2019 via quarry~\footnote{\url{https://quarry.wmflabs.org/}}.
+The complete files can be found in the repository for the present paper~\cite{github}. % TODO add a more specific link
+
+These tables, along with \emph{abuse\_filter\_log}, are created and used by the AbuseFilter MediaWiki extention (quote!) % TODO: is that really so
+.
+A fourth table appears to have existed (\emph{abuse\_filter\_history}, quote!), but unfortunately this is no longer the case. (However, I suspect that same data can be obtained by joining \emph{abuse\_filter\_log} and \emph{abuse\_filter\_action}, and querriyng cleverly, not quite sure though.)
+
+\begin{comment}
 * Describe analysed filter data set
-  * Where does it come from? How was it obtained? For what purpose does it exist? How does it look like?
+  * How does it look like? : describe schema?
 * What other data sources can I explore?
   * Interview with filter managers? with admins? with new editors?
+\end{comment}
 
 %************************************************************************
 
-- 
GitLab