Understanding Normiefication
A Cross-Platform Analysis of the QAnon Conspiracy Theory
The results from this project are also uploaded on this webpage.
Team Members
Sal Hagen, Daniel de Zeeuw, Stijn Peeters, Emilija Jokubauskaite, Ángeles Briones, Rachel Blennerhassett, Carmen Ferri, Flora Woudstra Hablé, Esther Blokbergen, Birgitte Haanshuus, Marlou Poncin, Willem Hilhorst, Ryan Tsapatsaris
Visualizations by Ángeles Briones
Contents
Summary of Key Findings
Normiefication is a vernacular concept that describes a process of normalization where “underground” content from fringe subcultural online communities travels to and is popularized on mainstream social media platforms and news media. The concept borrows from subcultural actor language where “the mainstream” is said to be populated by “normies”: regular people that are not familiar with the Internet’s latest subcultural trends. Our research explores the various pathways of normiefication in an empirical manner, inquiring whether such a process can indeed be said to exist, and how exactly it works in a particular case. To this end, it takes the far-right QAnon conspiracy theory as a case study. Does this case warrant the concept of normiefication?
To a large extent, our findings can be said to confirm the “normiefication thesis”. We found that the QAnon conspiracy first appears on the imageboard 4chan (28 October 2017) before migrating to Reddit and 8chan (November 2017), more “mainstream” platforms (YouTube and Facebook), and eventually news media (New York Times, Washington Post, CNN). The results imply that platforms like YouTube and Reddit could operate as “bridges,” forming intermediaries that connect the “deep vernacular” with the “surface” web. Further, the findings suggest a slight “Streisand effect” after mainstream media covered the conspiracy, since it aligns with increased mentions of QAnon on other platforms, thus providing “oxygen” to once-fringe ideas (Phillips 2018). This reverberation occurred on more mainstream platforms like YouTube and Reddit, but also on QAnon’s birthplace, 4chan. As an exception to this finding, QAnon-related activity on 8chan seemed less affected by mainstream media coverage, implying a core group of somewhat isolated yet strongly committed Q-theorists.
1. Introduction
The QAnon conspiracy was born on 28 October 2017 with a 4chan post alleging that a “deep state” is working against Trump and his supporters (Bank, Stack & Victor, 2018). The name of this conspiracy derives from author of the original post, who claimed to be a White House insider with “Q level security clearance”. The actual contents of the deranged conspiracy are not of interest here. Rather, of interest is how it spread across the Internet. While from the outsider’s perspective, it initially seemed like the far-right conspiracy theory lived and died on 4chan, on 31 July 2018, “offline” QAnon supporters suddenly appeared at a Trump rally in Tampa, Florida. Subsequently, news outlets like CNN started reporting on the once-fringe idea. How did such an unexpected diffusion transpire?
Figure 1: The first ‘Q’ post on 4chan/pol/, retrieved from archive.4plebs.org.
This diffusion of subcultural ideas can perhaps be captured with the notion of normiefication. Normiefication has its roots in the actor-language “normie”, a word often used on the imageboards 4chan and 8chan to describe people who are not part of their online subculture and remain within the realms and discourses of the “mainstream” (Nagle, 2017). In light of the growing influence of these sites on various world-historical events like Trump’s election as US president, the purpose of this project is to test and better understand this process as one of normiefication, where content travels across different platforms. This is especially interesting when deeply vernacular concepts and conspiracy theories, which have extremely convoluted explanations and origins, start to appear among people attending Trump rallies.
Figure 2: Initial representation of the different layers from the deep vernacular to the mainstream surface web (De Zeeuw, 2019).
The notion of normiefication reflects an heuristic that conceptualises the lower layers of a “deep vernacular Web” that boil up to and eventually popularise on the mainstream “surface Web” (De Zeeuw and Tuters, 2019 [forthcoming]). We also suggested that some platforms (e.g. Reddit and
YouTube) act as intermediaries between the deep and surface layers, exercising a “bridge” function. Since this cultural divide between Internet layers exists in the domain of the imaginary (in the sense that it is not at all technically or institutionally specified, but reflects the way users imagine the space they inhabit), the process of normiefication could perhaps be empirically grounded using digital methods.
The curious fringe-to-mainstream path that the QAnon conspiracy took forms an interesting case study into how a niche concept can move through the different strata of the Web and ends up being reported on by the mainstream media. Tracing such diffusion might shine light on how fringe areas of the Web might form hotbeds for the spread of outlandish ideas and their subsequent normalization. It also shines light on the role of the “mainstream”, which might willingly or unwillingly provide oxygen to fringe ideas. Uncritical reporting on antagonistic web communities, “trolls”, or the so-called “alt-right” has been criticised by Whitney Phillips for unintentionally amplifying the often harmful messages of fringe actors (2018). This has led her and others to call for a more informed mapping of the various collective configurations that exist within these online spaces:
Taking the time to map — to accurately map — the repeated, fractured, reconfiguring mobilizations emerging from anonymous and pseudo-anonymous spaces online allows us to understand where we are and how we got here. [...] fully contextualizing our present moment—particularly given how tenuous facts in our present moment have become—puts us in a better position to safeguard the actual record, and to carefully parse symptom from disease. (Phillips, Coleman & Beyer, 2017)
Instead of focusing on anonymous and pseudonymous spaces (like 4chan and 8chan) as isolated spaces, here we aim to assess the “actual record” of their alleged influence (or a lack thereof) through a comparative cross-platform approach. To do so, we compare QAnon-related data from 4chan, 8chan, Reddit, Youtube, Facebook, and online news media.
Figure 3: An article on QAnon in The Washington Post (Stanley-Becker, 2018).
2. Initial Data Sets
We scrutinised the prevalence of the QAnon conspiracy across six online spheres. Logically, 4chan and 8chan were included in the dataset because they are the platforms on which the nebulous ‘Q’ supposedly posted. Reddit and Youtube were chosen because of their alleged role as bridges for the popularisation of fringe far right ideas and cultural productions like memes (Zannettou et al., 2018; Lewis, 2018). Facebook was chosen as a means to study the conspiracy’s dissemination on a popular social media platform. Lastly, articles on QAnon from online news media were included to study when the conspiracy ultimately boiled up to the “surface”. Because of the limited availability of recent Reddit data, we were obliged to handle October 2018 as the cut-off point in the timeframe, and October 2017 served as a logical starting point.
2.1 4chan
Data was collected from 4chan/pol/ board with the tool 4CAT (Peeters & Hagen, 2018). We merged two datasets. Firstly, we collected “Calm Before the Storm” threads, specific posts that are dedicated to the discussion of QAnon. These were collected within the timeframe of 28 November 2013 (the first appearance) to 8 January 2019 by getting all the threads that had either “cbts” or “calm before the storm” in the title of the first post (the OP). Secondly, we retrieved all posts that mentioned “q” and “qanon” (see appendix I for the specific queries).
2.2. 8chan
8chan data was collected via a collection of posts from “QAnon.news”, a site of unclear origin that collects QAnon-related discussions and information and offers it as packaged archives. Qanon.news collects any QAnon-related discussion, and describes the data as a "complete archive" of Q posts. As historical data from 8chan itself is not readily available, this archive offered a useful alternative. After tinkering with the data, there seemed to be little reason to doubt their veracity and completeness, but follow-up research would need to more thoroughly verify the integrity of these archives.
In total, the archive offered by qanon.news contained 4,563 threads, virtually all from the board /qresearch/ - a board (i.e. subforum) on 8chan dedicated to discussing “research” about QAnon's posts and theories. The data covers the period between November 2017 and November 2018.
The posts contained within the archive came primarily in the form of JSON files of a non-standard format. After inspection of a sample of the data, the relevant data fields were filtered from the full data set and reduced to a spreadsheet containing one record per thread, listing the thread title and metadata (date of posting, amount of replies and the relative size of the thread compared to the longest thread found within the dataset). Additionally, as part of the hyperlink analysis of the full social media platform dataset (see section 5.3) domain names used in the thread's first post were extracted and included separately.
2.3 Reddit
Reddit data was collected through Reddit content hosted on Google BigQuery and collected by Pushshift (Baumgartner, 2018). Firstly, all subreddits were queried for mentions of “qanon” and “q” from October 2017 until October 2018 (see appendix I for the exact queries), treating this as the main data for analysis. Some data was found to be missing from the dataset, since Pushshift data is only collected at the end of every month, while some subreddits (e.g. r/cbts_stream or r/thegreatawakening) were terminated (“banned”) before the end of the month (respectively, March and September in our case).
The initial dataset was narrowed down in two ways. The top 20 overall subreddits with the highest frequencies of the terms of “qanon” and “q” were used for the analysis of the change of the use of these terms over time (presented in section 4.2), focusing on the data of subreddits that were generally the most engaged in the discussion. A different approach was taken to extract the most relative subreddits for each month: for each month of the 13-month-period the top 20 subreddits were found that mention “qanon” or “q” most frequently. From each of them, posts that contain the terms were extracted (17933 posts in total).
Youtube data was collected through three different steps. Firstly, we qualitatively compiled a list of channels dedicated to QAnon-related discussions by simply searching for “qanon” and exploring the channels of recommended videos. Next, we retrieved the YouTube domains from high scoring Reddit post related to QAnon from r/CBTS, r/the_great_awakening and r/The_Donald. This resulted in a curated list of QAnon Youtubers. This resulted in the following channels:
- Lionel Nation (~202,000 subscribers)
- Destroying the Illusion (~136,000 subscribers)
- JustInformed Talk (~108,000 subscribers)
- Prayingmedic (~107,000 subscribers)
- TracyBeanz (~106,000 subscribers)
- SphereBeing Alliance (~94,000 subscribers)
- Lift The Veil (~53,000 subscribers)
- Bill Smith (~45,000 subscribers)
The video data from Youtube channels on the curated list was downloaded and proceeded to download the data of these channels. Secondly, we queried a dataset of thousands of videos by far-right Youtube channels compiled by an outside expert on whether they mentioned “q” or “qanon” in the title or description (see queries in appendix I). Thirdly, we collected video data from the first 500 results when searching for “qanon” using the YouTube Data Tools (Rieder, 2015). After merging these three datasets, we deleted false positives and duplicate videos manually.
2.5 Facebook
The initial dataset for Facebook was collected using the tool netvizz (Rieder, 2013). Due to known issues with large-scale data collection on this platform (Rieder, 2018), we were only able to query a limited number of public pages. These were identified by searching with the terms ‘qanon’, ‘qarmy’, and ‘wwg1wga’ (“where we go one we go all,” a popular slogan associated with QAnon). Pages with more than 1000 likes were selected for the dataset. These include pages like “QAnon Curator”, “QAnon Army”, and “QAnon France”. This only provided a limited view on Q-related activity on Zuckerberg’s platform, but we decided to include it nonetheless as it could provide an at least exploratory view on the prevalence of the conspiracy on Facebook.
Finally, we collected articles on the QAnon conspiracy by online news media. We used three different methods to do so in order to make the dataset as comprehensive as possible. Firstly, the
LexisNexis Academic newspaper archive was queried for all English articles related to the search term “QAnon”. The
LexisNexis results were subsequently compiled in Google Sheets, and we removed some duplicates from the resulting list of about 700 articles. The list was also further filtered by deleting articles that referred to “QAnon” as an Armenian musical instrument, and a few other irrelevant results. Secondly, Google News was queried for articles relating to QAnon to enrich the
LexisNexis results. Thirdly, we added articles from news sources that were frequently mentioned on Reddit in relation to the far-right conspiracy. To do so, we extracted the top fifty domain names that had been mentioned in posts mentioning ‘Q’ or ‘QAnon’ from r/news r/politics, and r/worldnews. We chose these subreddits as they are fairly popular and would thus likely lead to provide ‘mainstream’ sources, instead of the fringe ones found on e.g. r/thegreatawakening. From the popular domains, we filtered out non-news websites (like
YouTube). Then, we used Google and its “site:” operator with “qanon” (e.g. “site:vox.com qanon”) to retrieve articles from these websites on the topics. Relevant articles on the first page of the Google search results page that were not yet in the dataset were added. These three steps resulted in a total of 383 articles, from sources ranging from Quartz to The Washington Post.
3. Research Questions
- Can the concept of normiefication be empirically traced?
- When and how did QAnon “normiefy” and move from its subcultural origins to mainstream strata of the Web?
4. Methodology
4.1 Merging and Visualizing the Data
We merged all the datasets as defined in section 2 in one large datasheet. Since the objective was to trace the prevalence of the QAnon conspiracy, we had to determine what counted as a “frequency unit” with which we could measure this prevalence. We did so for each platform. While this would allow to map the frequencies of the Q-related units, we also wanted to determine how much these units were engaged with across platforms. As such, we also determined an “engagement unit”. The engagement units per platform were translated to a score between 1 and 100 so we could plot the cross-platform data on the same panes.
The frequency units and engagement units we ultimately settled on are as follows:
- 4chan
- Frequency unit: The first post of a thread (the OP) on 4chan/pol/ containing “qanon” or “q” in the post title or body (see appendix I for the exact query) or the first post of a /cbts/ “general” thread.
- Engagement unit: The amount of replies in the thread (thread length).
- 8chan
- Frequency unit: The first post of a thread (the OP) on 8chan/qresearch/ containing “qanon” or “q” in the post subject or body (see appendix I for the exact query).
- Engagement unit: The amount of replies in the thread (thread length).
- Reddit
- Frequency unit: A post on the top 20 subreddits per each month that mention “qanon” or “q” most frequently (see appendix I for the exact query).
- Engagement unit: The score of the post using Reddit’s voting mechanism (i.e. upvotes - downvotes).
- YouTube
- Frequency unit: A single video in our merged YouTube dataset.
- Engagement unit: The amount of comments on the video.
- Facebook
- Frequency unit: A post on the selected Facebook groups and pages.
- Engagement unit: The amount of comments on the post.
- Online news media
- Frequency unit: A news article.
- Engagement unit: The CrowdTangle score of the article. This is a metric based on the amount of engagements of the article on other websites like Twitter, Facebook, and Reddit.
Using this data, two graphs were created with RAWGraphs and Adobe Illustrator. Firstly, we used the frequency units and created histograms per platform, where each bar represented one day. These histograms were then merged on one pane for comparison. Secondly, we generated scatter plots per platform, where each dot represented one frequency unit, the x-axis represented the date, and the y-axis represented the corresponding engagement unit. We then added a “contour plot” on top of these scatterplots to highlight the concentration of and engagement with the frequency units over time. For both graphs, we used the timeframe of October 2017 until October 2018.
4.2 Additional Analyses
Supplementary to the main analysis, some additional analyses were conducted, touching upon the contents of the data. Firstly, in order to map out the issue spaces of the conspiracy on Reddit we visualised which subreddits mentioned “Q” and “QAnon” over time. The dataset of all Reddit posts in the 13-month-period containing these terms (presented in section 2.3) was grouped per subreddit. A streamgraph was made with RAWgraphs to show the pervasiveness of the conspiracy across subreddits over time.
Next, the change between the prevalence of terms “Q” and “QAnon” on Reddit over time was analysed. It was done because the latter nomenclature arguably marks a point of crystallization since it only appeared when the conspiracy had been around for a few months. The frequencies of mentions for “q” and “qanon” in the top 20 subreddits most engaged with the topic were calculated separately. A bumpchart was then made (using RAWgraphs) showing the moment of “QAnon” overtaking “Q”, which can be referred to as the crystallization of the term.
Lastly, to gain insight into what information sources were linked to most often in the given issue space, we extracted hyperlink networks from the merged datasets. For each post, the linked domain names (e.g. digitalmethods.net) were extracted from the post’s text content. These links were then mapped as a Gephi network, consisting of a set of “seed” nodes (the social media platforms) and nodes representing the links themselves. Weighted edges between seed nodes and link nodes represented the prominence of the given domain names on the respective platforms. The link network graph thus represents (1) on which social media platforms a given site was linked to and (2) how often it was linked to per platform. This data could then be used to get an impression of information source popularity per site (see section 5.3)
5. Findings
5.1 General Findings
Figure 4: The prevalence of the QAnon-conspiracy across 4chan, 8chan, Reddit, YouTube, Facebook, and online news media from 28 October 2017 to October 2018.
Although complicated by limited data access, our research suggests that the QAnon conspiracy indeed diffused through a process similar to what we conceptualise as normiefication. It originated on 4chan, after which it quickly found a public of devotees on Reddit, 8chan, and
YouTube. After “leaving” 4chan, the conspiracy was relatively consistently discussed on the latter three platforms, although various Q-related subreddits were banned, limiting the prevalence of the conspiracy theory on Reddit. Interestingly, it was only until February 2018 that the Q-related discussion on 8chan started to become common.
Nine months after its inception, online news media started to widely cover the conspiracy. Notably, this sparked some Q-related discussion on 4chan again, even though it had been absent for a while. This implies the online news media indeed provided “oxygen” to the prevalence of such conspiracies, as we will further discuss below.
Figure 5: The prevalence and engagement with the QAnon conspiracy from 28 October 2017 to October 2018 across 4chan, 8chan, Reddit, YouTube, Facebook, and online news media. Green bar denotes the start of media attention.
To list the findings more in-depth, this section briefly covers the findings per platform.
5.2.1 4chan
Figure 6: Thread frequency and engagement (thread length) of QAnon threads on 4chan/pol/ from October 2017 to October 2018.
QAnon began on the 4chan/pol/ board in October 2017 and there are two concurrent peaks of QAnon posts on the platform from the beginning of October to the end of November. QAnon activity on 4chan drops off sharply for both peaks at the end of November 2017. This corresponds with a post from Q on the CBTS thread announcing that 4chan as a platform had been “compromised” and that Q was moving to 8chan - as is reflected in our visualisations. There was a small bump in activity on 4chan in August 2018, which corresponds the media coverage of the conspiracy. Overall, the engagement (i.e. thread length) is fairly varied, but the dedication was especially high when the CBTS threads were still prevalent in the early phases. This suggest conversation after Q’s “compromise” became more casual, in contrast to the intense “research” and speculation before.
5.2.2 8chan
Figure 7: Thread frequency and engagement (thread length) of QAnon threads on 8chan/qresearch/ from October 2017 to October 2018.
On 8chan/pol/, there is a clear increase in posts relating to QAnon in December 2017, which corresponds with the announcement on 4chan that Q was “moving” to 8chan. The /qresearch/ board then quickly becomes consistently active from January 2018 onwards. The most engaged with threads in the data set are again so-called “general” threads, specifically intended for Q-related discussion. As indicated in the graph below, there is an assortment of less popular threads dedicated to discussing specific theories or bits of information as well. However, virtually all threads in the data set over the last two thirds of the 8chan data reach ~750 posts (91% of all threads); the limit after which 8can threads are archived or deleted and comments are disabled. This further confirms that activity remains consistent after Q-related discussion moves from 4chan to 8chan, easily filling up thread after thread. Interestingly, 8chan activity is far more consistently “dedicated” compared to the 4chan activity, where engagement fluctuates more.
Noteworthy is that in contrast with some of the other investigated platforms, there are no clear peaks in activity. Activity plateaus at its peak around July-August 2018, which corresponds with increased mainstream media coverage of QAnon. While this indicates that outside coverage does have some impact on the volume of 8chan-based discussion, given the consistently high activity before this happened the bulk of posts can perhaps be attributed to a “hard core” of posters that have been active on the platform since 4chan passed its baton.
5.2.3 Reddit
Figure 8: Post frequency and engagement (score) of QAnon related posts on Reddit from October 2017 to October 2018.
On Reddit, the first mentions of QAnon appear already on the 28th of October, 2017 the subreddits of r/4chan4trump and r/TranscribersOfReddit. This happens on the same day that the first Q post was made on 4chan/pol. In the few upcoming days it appears and grows on larger subreddits such as r/the_donald and r/conspiracy. Consistent Q specific activity then ensued on the subreddits r/cbts_stream (early 2018) and r/greatawakening (mid-2018). The former subreddit was banned in March 2018 and the latter was banned on September 27th 2018. The frequency of QAnon related posts dramatically increases following Trump’s Rally in Tampa, Florida on 1 August 2018. Around this time, links to Q-related news coverage on r/news and r/politics also spiked. The most upvoted Reddit post in the sample was titled “QAnon Fan Arrested for Threatening Massacre at
YouTube Headquarters”, which featured a news report. Because of outliers like these, the scatterplot in fig. 8 appears condensed.
A process of normiefication can arguably also be observed within Reddit. While the conspiracy was first mainly mentioned in more niche subreddits, like r/cbts_stream and r/thegreatawakening, larger and more popular subreddits like r/politics only jumped on the bandwagon in August 2018, at the same time when media sources started reporting on the topic (fig. 9). The disappearance of the niche is of course stimulated by the bans of the Q-specific subreddits.
Figure 9: the growth of the terms “q”or “qanon” in comments on different subreddits for the time period of October 2017 until September 2018. Yellow marks subreddits important for the build-up of the conspiracy, red marks terminated (“banned”) subreddits.
Interestingly, August 2018 was again cemented as a tipping point since from this month onwards, ‘QAnon’ was used more than ‘Q’ in reference to the conspiracy. Indeed, this succession occurs at the same point in which QAnon is receiving coverage from mainstream media outlets.
