Академический Документы
Профессиональный Документы
Культура Документы
A meme is defined as an element of culture or behaviour that spreads by non-genetic means, but to many the
term is now synonymous with the Internet image meme. The audience for this developing art form rapidly
grew with the mainstream adoption of social media, but little research has been done on memes despite
their increasing cultural significance and use in marketing products and ideologies.
Our analysis uses a dataset of over 400,000 images that resulted from a targeted collection of the most
popular memes posted throughout 2018 to the content aggregator Reddit. An artificial intelligence named
MemeVision was trained to identify memes using a three-branch approach; based on the colors, shapes and
words within an image. Combining this with the metadata attached to each image we can estimate the total
number of times the various instances of a meme were viewed each day of the year. p27-30
We define a new unit of measurement - the Magnitude Estimation of Meme Prominence (MEMP). This is
far less volatile than the number of views a meme receives; the relationship between views and MEMP is
loosely analogous to how net trade volume affects the price of a stock or currency. As such we get a clearer
view of trends over time and a more accurate way to determine the most prominent meme of the year (by
which we mean the most prominent image template – we are not referring to an individual instance of a
meme and we did not include other media types such as video or music memes). p31-32
The meme economy is much like any other: a handful commands the disproportionate share of audience. Of
the 250 most prominent memes, just 10 account for roughly one third of the views. Furthermore 7% of
those views (around 219m) are accounted for by the most prominent meme of the year, Drake’s Preference,
which uses two images from Drake’s 2015 music video Hotline Bling. p7
Other noteworthy memes:
- Surprised Pikachu received the most views within a single week. Circumstantial evidence suggests this
could have been part of a marketing campaign to promote the Detective Pikachu movie. p18
- Change My Mind was the most prominent meme using an image from that year; it is based on a photo
of conservative commentator Steven Crowder taken February 2018. p9
- Theresa May Dancing was the most prominent meme of a politician, far more so than any of the
numerous different memes featuring Donald Trump. p9
When grouping together memes based on a common subject, we found the following:
- SpongeBob SquarePants was the most prominent franchise; memes using images from this cartoon
collectively received over 626m views throughout the year. p8
- Stefán Karl Stefánsson was the most prominent individual; various memes using images of the actor,
best known for portraying Robbie Rotten in Lazytown, peaked shortly after his death in August. p10
- Spooktober, a festival of spooky memes, received the most views within a single week. p11
Other findings include:
- The impact of social media influencer Pewdiepie mentioning a meme is inconsistent. p17
- Once a meme has peaked it takes on average one month for it to half in prominence. p18
- A language analysis of all instances of the Pepe The Frog meme, infamous since being labelled a hate
symbol in 2016, found an estimated 17% are political and 20% include hateful terminology. p20
Wider trends in the types of images used:
- The majority of memes use a screenshot from a video, often including a caption of dialogue. p13
- Memes popularised years ago have a greater tendency to use animal photography, while those of more
recent years increasingly use illustrations and animated characters. p15
- The older image macro format has lost popularity in favour of formats based on tabular comparison,
rewriting dialogue and the use of text to label subjects of an image. p14
- We predict an increasing use of animation, superheroes and non-western media to source images. p24
We suggest the potential of a meme is best assessed on the following criteria: the immediate impact and
emotional resonance of the image, the mutability of the premise and how many competitors there are. p23
We will continue this quantitative research to further human understanding of memes. Our website and social
media pages include datasets, graphics and analysis we did not have room to include in this report.
iiMemeIndex.com 1
CONTENTS
Executive Summary 1
Contents 2
Introduction 3
Definitions 4
M ethodology 6
Results
i. Most successful image meme 7
ii. Most successful meme franchise 8
iii. Most successful meme photo taken in 2018 9
iv. Most successful human 10
v. Most viral meme event 11
vi. Most successful classic meme 12
M eme Demographics
i. Image source 13
ii. Meme format 14
iii. How image source varies by meme format 15
Analysis
i. How dominant is SpongeBob? 16
ii. Can a celebrity make or break a meme? 17
iii. Was Surprised Pikachu a marketing campaign? 18
iv. How long does a meme last? 19
v. How prevalent are political Pepe the Frog memes? 20
vi. What terms are used in memes? 22
Conclusions
i. How to judge the potential of a meme 23
ii. What to expect from the future of memes 24
Internet Image M eme Index 25
Acknowledgements 26
Appendix A: Detailed M ethodology
i. Data collection 27
ii. Identifying images 28
iii. Estimating views 30
iv. Quantifying meme prominence 31
v. Final ranking 32
Appendix B: Detailed Results
i. Top 30 ranked species 33
ii. Top ranked species that use images from 2018 34
iii. Top 20 ranked families 34
iv. Meme calendar 2018 35
v. Selection of Species compared to Families 36
Appendix C: Additional Example M emes 37
Appendix D: Image Credits 40
iiMemeIndex.com 2
IN TRODU CTION
Internet Image Memes (henceforth iimemes) are now so ingrained in popular culture that in common
parlance the term ‘meme’ has become synonymous with the iimeme. Thus one could conclude that the
iimeme has risen to become the apex meme, overshadowing all other forms of meme in a Darwinian way
that would make Richard Dawkins proud.
Driven by the mainstream adoption of social media, the audience for iimemes has rapidly expanded over the
past decade. In many ways the iimeme is an antidote to an increasingly detached way of living, a social
lubricant that provides connection through a shared idea. Thus in turn we have seen the increasing use of
iimemes to market both products and ideologies.
The iimeme is yet to achieve the reverence afforded to other art forms, instead often being dismissed as a
lowbrow triviality. But there has been a notable rise in the number of published academic articles
mentioning the term “internet meme(s)”, which in 2012 surpassed the number mentioning the one time
icon of the future “flying car(s)” (figure 1).
This report details a rigorous quantitative analysis of iimemes during 2018, and aims to provide a greater
understanding to this increasingly important aspect of popular culture.
Fig 1: Comparison of academic publications mentioning internet memes and flying cars2
1
Oxforddictionaries.com. Meme was originally defined in Dawkins, Richard. "The Selfish Gene” (1976)
2
Data from Google Scholar; English language articles only, includes patents, excludes citations
iiMemeIndex.com 3
DEFIN ITION S
iiMeme Species
A group of iimemes are of the same
species if they incorporate at least part of
the same static image.
iiMeme Family
A group of iimeme species are of the
same family if their defining images share
a common subject or source material.
Fig 2: Taxonomy of the iimeme
Note these definitions are based solely on the image used – not what is mentioned in any text. For
instance an iimeme may contain a joke about Donald Trump, but it only falls into the Donald Trump family
if it features an image of him. Because a single iimeme can be composed of multiple images it is possible
for it to belong to more than one species or family. For clarity we italicise names of species and families.
These three iimemes all belong to the Drake’s Preference species because they incorporate the same
image of musical artist Drake. It does not matter that the image has been cropped, has a different
brightness level, or has the head of Santa Claus edited on. The third example also belongs to Stefán’s
Preference because it also incorporates the image defining that species.
The first two of these imemes belong to the Stefán’s Preference species; which follows the same formula
as Drake’s Preference but uses different images so is a separate species. All three of these iimemes
belong to the Stefán Karl Stefánsson family because they use images that portray Stefán Karl Stefánsson.
iiMemeIndex.com 4
DEFINITIONS
We define this new unit of measurement to provide a better reflection of how popular opinion concerning a
species or family changes over time. The calculation of MEMP comprises multiple elements designed to
simulate short and long-term memory, as well as offset bias in the data collection.
The relationship between number of views and MEMP (figure 3) can be seen as loosely analogous to the
relation between trade volume and price for a stock or currency (figure 4). With high views being similar
to a net positive trade volume, and low views being similar to a net negative trade volume.
The primary driver of this measure is the number views that iimemes in that species or family receive.
The relationship is non-linear and relies upon additional variables. The full formula we used for MEMP
calculations can be found in the detailed methodology section of the appendices.
Example of the Elon Musk Smoking Weed Example of the Elon’s Preference
species, which uses the viral image taken species, another offshoot of the
from his appearance on the Joe Rogan Drake’s Preference species.
podcast on 6 th September 2018.
3
Chart by TradeView, via Binance
iiMemeIndex.com 5
The full methodology can be found in Appendix A
METHODOLOGY
Datas et
During 2018 we collected 418,874 images posted to Reddit,
currently the 19 th most visited website in the world4. The site is
divided into many Subreddits for different topics; we picked 69
to monitor and collect from based on the number iimemes
posted to them.
Roughly 12% of the images we collected were hosted externally
on Imgur, which provides precise view counts. Correlating these
with metadata provided by Reddit allowed us to estimate views
for the remaining 88% of images.
Identifying Images
We identify whether images in our dataset belong to any iimeme
species or families using an artificial intelligence we developed Fig 5: Artistic impression of
named MemeVision. MemeVision identifying images
It has been trained to use a three-branch approach to narrow
down potential matches using the words, colors and shapes of an
image. For each potential match it performs a more
computationally intensive process to confirm or deny the match.
F inal Rankings
Finally having calculated a time series of MEMP levels for every species and family, we need to convert
this into a single value that summarises their overall prominence throughout 2018.
When looking back at the year people will more likely recall those memes that reached very high
prominence, even if for only a short time, and so a simple average of the annual MEMP levels is not
adequate. However simply ranking by the peak MEMP achieved throughout the year does not reflect the
differing consistencies. Therefore our final rankings place equal importance on the peak and year average.
4
Data from alexa.com/topsites February 2019
iiMemeIndex.com 6
Total Views are lower bound estimates for combined views on Reddit,
Facebook, Instagram & Twitter. The true global total is likely higher.
RESULTS
The original Is This A Pigeon? image (left) which is edited by meme authors to convey how other things
are misidentified. One the most viewed versions of the meme during 2018 comments on the reaction
people have to celebrities committing suicide (center). Finally one of the many crossover versions of this
meme (right) references the popular Steamed Hams segment from an episode of The Simpsons.
Fig 7: MEMP levels of the five most prominent iimeme species during 2018
iiMemeIndex.com 7
RESULTS
The most prominent iimeme families are all based around popular culture franchises, the most successful of
which is SpongeBob SquarePants. The prominence of SpongeBob SquarePants peaked towards the end of
the year following the passing of creator Stephen Hillenburg on 26 th November.
Fig 8: MEMP levels of the four most successful iimeme families of 2018
5
To be ranked we require a family to contain at least three species each contributing at least 1% of the total
MEMP of that family. For instance a Drake family would not satisfy this requirement so is not included in the
family ranking, even though the Drake’s Preference species is on its own more prominent than some of the
top 20 ranked families (see Appendix B for full list).
iiMemeIndex.com 8
RESULTS
Fig 9: Prominence of Change My Mind (left), Theresa May Dancing (center) and Rewind Time (right)
with blue vertical line marking the date each photo was published. Note different axis scales for each.
iiMemeIndex.com 9
RESULTS
6
Human meme = Humeme
iiMemeIndex.com 10
RESULTS
The single most viral meme of the year was Surprised Pikachu,
we discuss its astronomical rise in more detail in the analysis
section. In the conclusions section we discuss what qualities a
meme needs in order to succeed.
What is striking about the top ten most viral memes is that only
four were in response to events; Spooky Memes, Stefán Karl
Stefánsson, This Is America, and Rewind Time. For the other
six their viral success appears to have emerged organically. Is
This A Pigeon? for instance was popularised as a meme many
This Spooky Memes example was years earlier but was revitalized this year, while Gru’s Plan uses
one of the most viewed of all a screenshot of a film released years ago.
images in the family.
7
This is why the SpongeBob SquarePants family is not in this ranking. While it had a large absolute rise in
MEMP over 7 days it was starting from such a high point that the percentage change was relatively small.
iiMemeIndex.com 11
RESULTS
The modern meme market is increasingly volatile, new memes rise and fall in a matter of weeks. Through
this all a handful of Classic Image Macros remain highly prominent, the most successful of which was
Confession Bear. We discuss how the Classic format differs from others in the Meme Demographics section.
Example of Confession Bear, Two of the several variants of Socially Awesome Awkward
successful partly because of Penguin, all of which come under the same species because
how guilty that bear looks. they use the base image of a penguin.
Some may be surprised that Confession Bear was ranked 9 th among all image memes8 despite being years
old (ancient by meme standards). The explanation for this is straightforward; the modern meme market is
far more competitive and fractured, in part because of a larger pool of meme authors. New memes are
constantly being tested and when one catches on it is often feverishly utilised that meme authors tire of it
after a short time. Some perform so well that they retain significant prominence even after their initial
boom, but the majority end up fading off into relative obscurity.
Meanwhile one of the single most subscribed to feeds for memes on the Internet (the AdviceAnimals
Subreddit) is consistently posting the memes using the same handful of classic image macros. This is a
significant number of people who seem to prefer familiarity in their memes; week in week out the same
few images are used in their most popular memes, and over a year that adds up. In a way this is a parable
for how first past the post electoral systems function.
8
Confession Bear would rank even higher if we based MEMP calculation solely on Reddit data, however as
explained in our methodology the formula includes an element to offset the bias towards Reddit.
iiMemeIndex.com 12
MEME DEMOGRAPH ICS
i. Image source
Table and pie chart summarising where images originate from for the top 250 species:
Share of Number of
Source Subsection Views Species
Source Description Example (species rank)
iiMemeIndex.com 13
MEME DEMOGRAPHICS
Reaction 100 An image used to show a reaction to a situation. The Theresa May
simplest and oldest iimeme format. Dancing (34)
Classic An image with one line of text overlaid at the top and Confession Bear
49
Image Macro another at the bottom, usually in Impact font. (9)
A meme that relies upon rewriting the original text that Patrick’s Wallet
Rewrite 39
appears in the image, including video subtitles9. (14)
At one time Internet image memes would have been considered synonymous with the Classic Image Macro;
many of which reached mainstream notoriety such as the Success Kid or Bad Luck Brian. Their rigid format
makes them easy to create; there are many websites that can be used to generate one in seconds.
However the Classic Macro has in recent years lost popularity in favour of more complex formats that can
communicate more, but often require at least a basic understanding of image-editing software to create.
The rise of the Modern Macro, Rewrite and Tabular formats in recent years is likely due to a larger pool of
meme authors who are computer literate, as well the natural progression of the art form. The Reaction
format is the most widespread and straight forward, although increasingly canny meme authors use image-
editing software to alter and breath new life into stale memes.
9
When a species can be considered more than one of these formats we categorise based on the most common
or fundamental usage. For instance Captain Picard WTF can be used as a Reaction, but it more often used as
a Classic Macro. Also meme authors sometimes label the subjects of Patrick’s Wallet in a way that
resembles a Modern Macro, but in every case they reword the captions and so it is fundamentally a Rewrite.
10
Most square four panel webcomics are considered Rewrite rather than Tabular because rearranging them
into a single row or column does not make them harder to understand.
iiMemeIndex.com 14
MEME DEMOGRAPHICS
iiMemeIndex.com 15
AN ALYSIS
i . H ow d ominant is S pongeBob ?
The meme economy is much like
any other; a handful of memes
occupying the top positions
command a disproportionate share
of the audience.
During 2018 the iimemes within
either top 250 species or 25
families collectively received a
minimum of 4.733 billion views.
The SpongeBob SquarePants family
alone accounted for a staggering
13.2% of those views, as well as
containing 15 of the top 250
ranked species, more than any other
family. The top ranked species,
Drake’s Preference, accounted for
4.6% of those views.
More broadly around 55% of those
views, 2.598 billion, are accounted
for by iimemes within either the top
5 families or 10 species.
The most appropriate probability
distribution to model the share of
prominence across different memes
Fig 15: Venn diagram showing the relative scale of and
would appear to be the Pareto
crossover between the top 5 families and top 10 species
distribution (figure 17).
(within the total of all 250 species and 25 families).
iiMemeIndex.com 16
ANALYSIS
Figure 18: Prominence of 10 memes before and after being reviewed by Pewdiepie
(split into two graphs based on date of review for ease of viewing).
- The impact of a review was not the same for all memes.
- More often than not a meme rises in prominence for the few days after being reviewed.
- Not all memes were reviewed at the same point of their cycle: some while they were rising, others
while they were peaking or falling. It is unclear whether the highest gainers rose because of the
review, or rather because they were reviewed while they were rising anyway.
- At the high end Is This A Pigeon? rose by over 500% in the week after being reviewed.
- At the low end Kowalski Analysis dropped by 50% in the week after being reviewed.
- Reviews in the first half of the year (left) were followed by a greater rise than those in the second half
(right) on average; this could simply be due to when during their lifecycle those memes were reviewed.
- The best impact would arguably be on Hard To Swallow Pills, given that this meme was falling but this
trend reversed immediately after the review.
- The worst impact would arguably be on Netflix Adaptation, given that this meme was rising but started
to fall immediately after the review.
This may have been the cleanest possible data set with which to study this question, but it is still not good
enough to get a definitive answer given the many other factors at play. We do not have enough data points to
control for how the meme is trending when it gets reviewed. Another important factor is likely how
enthusiastic Pewdiepie’s audience is for each meme.
In conclusion his influence is inconsistent - he does not have the power to create or destroy a meme.
We do not believe that any one person can control the tides of the meme economy.
iiMemeIndex.com 17
ANALYSIS
Figure 19: Prominence of 10 viral memes relative to each of their peaks, horizontal black line marking 50%
of peak prominence, vertical black line markets peak (split into two graphs for ease of viewing).
We find that the median time from a meme first hitting 50% prominence to hitting peak prominence is 8
days. While the median time from peak prominence to going back down to 50% prominence is 28 days.
This lopsided result is unsurprising given that our measure of prominence attempts to simulate collective
memory; the amount of time it takes for something to become well known is generally less than the time it
takes for it to be forgotten.
By this crude measure we can say on average a meme will be at its best for around 36 days.
Epic Handshake uses an image of Carl The Speed At Which Lobsters Die (103 rd
Weathers and Arnold Schwarzenegger ranked species) uses a screenshot of Josh
shaking hands from the film Predator. Peck from the series Drake and Josh.
iiMemeIndex.com 18
ANALYSIS
Fig 20: Views and Prominence of Surprised Pikachu. Peak views occurred Nov-6.
Blue vertical line marks the Detective Pikachu trailer release on Nov-12.
11
Detective Pikachu first trailer - youtube.com/watch?v=1roy4o4tqQM
12
Screen Rant on test screenings - screenrant.com/detective-pikachu-early-reactions-positive/
13
Nintendo Life on test screenings - nintendolife.com/news/2018/11/early_detective_pikachu_movie_screening_
14
Forbes article by Jay McGregor on Reddit corporate manipulation -
forbes.com/sites/jaymcgregor/2017/02/20/reddit-is-being-manipulated-by-big-financial-services-companies
15
Video interview by Tim Pool explaining how Reddit can be manipulated - youtu.be/NuR3PIwYMPs?t=843
iiMemeIndex.com 19
ANALYSIS
16
BBC report on ADL branding Pepe The Frog a hate symbol - bbc.co.uk/news/world-us-canada-37493165
17
Of the 50 most used BTTV memes/emotes 13 are Pepe variants - stats.streamelements.com/
18
The NPC meme is a modern repackaging of the philosophical zombie theory that suggests some proportion
of humans are non-sentient, thus operating similarly to how a video game NPC is programmed to behave.
iiMemeIndex.com 20
ANALYSIS
collected groups of images that are sure to be political in nature from two Subreddits; The Donald and
PoliticalHumor, which are dedicated to rightwing and leftwing content respectively. Lastly to get a baseline
for comparison we compiled a control group of all memes belonging to either the most prominent family
(SpongeBob) or any of the ten most prominent species (see Appendix B for full list).
First we calculate how many
images within each group Political terms Hateful slurs
contain explicit political or Images from: Explicit Total (estimate) Explicit Total (estimate)
hateful language. Given the two The_Donald 63.8% 100% 1.4% 2.1 - 2.4%
Subreddits are in reality 100%
PoliticalHumor 56.5% 100% 0.6% 0.9 - 1.0%
political, this implies our
explicit figures should be NPC memes 60.4% 94.7 - 100% 1.1% 1.7 - 1.9%
inflated by a factor of 1.56– Pepe memes 9.5% 14.8 - 17% 11.1% 17 .3 - 20%
1.77 to get the true total. As
Control group 3.6% 5.6 - 6.4% 1.0% 1.5 - 1.7%
such we arrive at upper
estimates of around 17% of Pepe memes conveying political sentiments. Lacking a comparison group that
we know to be 100% hateful, we assumed the same inflation factor to get an estimate of 20% containing
hateful language. Only 1.3% contained both explicit political and hateful terms. By far the most common
slur was homophobic, accounting for well over half of all hateful terms; unsurprising given that this term
was the only hateful slur listed by previous studies as one of the 15 most common profanities for both
Twitter19 and Facebook20.
To get a sense of how subjects of discussion vary between the groups we can observe two word clouds21
based on the text appearing in NPC memes (figure 22), which show several political themes, and Pepe
memes (figure 21), which imply themes around adolescence.
Pepes are more prominent than memes from huge franchises such as Game of Thrones or Star Trek. While
we can say with some certainty that most Pepe memes are non-political in nature, even just 17% of them
equates to tens of millions of views and three political Pepes for every one NPC meme. They are prevalent
enough to impact political discourse so it is understandable how the meme was labelled a hate symbol, but
to assume it is used exclusively or even mostly in this way would be to mislabel large numbers of mostly
young people with views they do not hold.
Fig 21: Word Cloud using Pepe memes Fig 22: Word Cloud using NPC memes
19
Wang, W., Chen, L., Thirunarayan, K., & Sheth, A. P. (2014). Cursing in English on Twitter. Proceedings
of the 17th ACM Conference on Computer Supported Cooperative Work & Social Computing, 415-424.
20
Chris Kirk: slate.com/blogs/lexicon_valley/2013/09/11/top_swear_words_most_popular_curse_words_on_facebook.html
21
The size a term appears in the word cloud is proportional to how often that term appears in the text.
iiMemeIndex.com 21
ANALYSIS
Fig 23: Word Cloud using text from all memes in the top 10 ranked species (listed in Appendix B)
Fig 24: Word Cloud using all text from Fig 25: Word Cloud using all text from
memes in the SpongeBob family memes in the Star Wars family
iiMemeIndex.com 22
CON CLU SION S
Having observed how memes rise and fall we propose several criteria for judging meme success.
Impact – An image is more likely to be noticed if it immediately impacts on the brain. This can be
improved with: bright colours, recognisable characters, or cute things. Surprised Pikachu has all three.
Resonance – An image is more likely to be remembered if it portrays a clear basic emotion that will
resonate with our own feelings. Drake’s Preference conveys pleasure and displeasure, Epic Handshake
conveys friendship, Confession Bear conveys guilt and Surprised Pikachu conveys surprise.
Mutability – A meme will only last and spread if the premise behind it allows many different potential
ideas; simplicity is better. Is This A Pigeon? is one thing mistaken for another, Lisa Simpsons’s
Presentation is giving any opinion, Surprised Pikachu is surprise – all very simple premises.
Competition – If two memes share the same premise they must compete for a limited number of ideas.
Surprisingly there were no very popular surprise based memes during 2018, hence potentially why
Surprised Pikachu was able to succeed. On the other hand there were many memes using the identical
presentation premise, three of which featured Lisa Simpson, Paul Ryan and Bernie Sanders. Given these
three had identical levels of competition and mutability, the most successful was the one featuring the
brightly coloured recognisable animation character with a clearer emotional expression.
Three similar presentation memes; Lisa Simpson (rank 5), Paul Ryan (64) and Bernie Sanders (187).
The more impactful and resonant the image, the more prominent that meme ended up.
Three similar preference memes; Drake’s Preference (rank 1), Distracted Boyfriend (7) and Left Exit
Off Ramp 12 (26). The simpler the emotion and colour of the image, the more prominent that meme
ended up. But the fact they all rank quite highly is a testimant to how powerfully simple this premise is.
Maturity – This is less about the potential of a new meme, more about the staying power of an existing
meme. Drake’s Preference now has dozens of fairly prominent competitors that follow exactly the same
formula, but unlike those competitors it was the first to popularise the premise. Similarly many classic
memes continue to succeed in the face of new memes because there is a first to market advantage.
Put simply a meme is just a shared idea; so the more it has been shared, the more it is a meme.
iiMemeIndex.com 23
CONCLUSIONS
Tabular Formats – This format is easy to edit and easy to understand. We will continue to see new
tabular memes appear, often using the same existing premises, but perhaps with more striking images.
Animated Characters – The success of memes from franchises like The Simpsons or SpongeBob
SquarePants has led to creators to increasingly mine older animated film and television for images, the
most successful during 2018 being Tom & Jerry (the 17 th ranked family). Whether it is from an old classic
or a new CGI live action remake of an old classic, memes of animated characters will dominate.
Superhero Films – No film of 2018 was a greater source of memes than Avengers: Infinity War and its
antagonist Thanos. Given how many superhero films are produced at least one each year will end up being
great meme fodder, due to the combination of larger than life characters, overly dramatic stories, as well as
a combined audience of children and diehard adult fans.
Global Media – The influence of Western media will slowly decline as the number of Internet users in
other parts of the world increases. For instance This Is Beyond Science picturing actor Rajinikanth in 2.0
became a viral success very shortly after that film was released. Another prominent meme Am I A Joke To
You? pictures actor Rapulana Seiphemo in South African soap opera Generations: The Legacy.
Old Favourites – Mature memes, like Harold, will persist for a long time yet.
One of the most viewed memes Reality is often disappointing This Is Beyond Science
from the Tom & Jerry family one of many Thanos memes (rank 117)
iiMemeIndex.com 24
Inte rne t Ima ge M e me Inde x
The Internet Image Meme Index conducts quantitative research to further human understanding of memes.
Our website and social media pages contain datasets, graphics and analysis not included in this report.
iiMemeIndex.com 25
A c know l e dge me nts
This research was possible because of previous work by the following groups22.
Reddit for the use of their API and for creating a well organised content aggregator.
Imgur for the use of their API and for creating a well organised image-hosting site.
The developers of the software we used, particularly those mentioned in the footnotes of pages 28-30.
KnowYourMeme and its contributors for creating a well organised meme encyclopaedia.
Everyone who made a meme during 2018.
A rti c l e 13
22
None had direct involvement with, or in any way endorse the findings of, this report.
iiMemeIndex.com 26
APPEN DIX A: DETAILED METH ODOLOGY
i. Data c ollection
During 2018 we collected 418,874 images posted to Reddit, currently the 19 th most visited website in the
world23. The site is divided into many Subreddits for different topics; we picked 69 to monitor and collect
from based on the number of iimemes posted to them.
While Facebook, Twitter and Instagram are also used for image sharing and receive more web traffic, we
chose Reddit as it has several key advantages allowing a vastly more efficient data collection process.
Each social media platform has a different user base and culture, meaning they vary in how they relate to
one another and wider culture. With respect to meme trends the relationship between platforms is often
summarised in the Life of a Meme meme. While this may be a broad caricature it does contain a large
grain of truth; iimeme trends more often than not appear on Reddit before most other platforms. To offset
the bias in our dataset we designed the MEMP formula to model how trends echo ‘downstream’ from
Reddit to other platforms.
In wider society 4chan is infamous as the source of many political trolling campaigns; but it is also the
origin point of many mainstream meme trends. Though compared to the other platforms mentioned the site
gets minimal traffic (around 120,000 th most visited site globally23) and the most popular posts receive a far
greater number of views when reposted to Reddit.
23
Data from alexa.com/topsites February 2019.
iiMemeIndex.com 27
APPENDIX A: DETAILED METHODOLOGY
ii . Image Identification
We have developed an artificial intelligence named MemeVision that is trained to recognise whether an
image belongs to any of over a thousand iimeme species and families. It does this using a three-branch
method based upon the colors, shapes and words within an image.
24
Scikit-learn: Machine Learning in Python, Pedregosa et al., JMLR 12, pp. 2825-2830, 2011
25
Chapelle, Olivier, Patrick Haffner, and Vladimir N. Vapnik. "Support vector machines for histogram-based
image classification." IEEE transactions on Neural Networks 10.5 (1999): 1055-1064.
iiMemeIndex.com 28
APPENDIX A: DETAILED METHODOLOGY
Segmentation – To improve accuracy the Color and Shape branches will look at the raw image along
with five segmentations of the input image. These are the two halves obtained by a horizontal split (H1 &
H2), two halves obtained by vertical split (V1 & V2), and the mid horizontal (M). In the example shown
in figure 31, the H2 segment would provide an almost exact match for the base image of the character
Fry from Futurama that is used for the Not Sure If species. Prior to this an algorithm is run to detect and
crop any unnecessary borders to image, if this occurs both the raw and cropped versions are examined.
Words – This branch uses text extracted from an image (using Tesseract optical character recognition27)
to narrow down potential matches. It uses a rule-based approach to natural language processing, detecting
combinations of n-grams (words and phrases) that imply certain species, implemented using NLTK 28.
Thresholds for a likely match are calibrated using additional metadata and previously classified images.
A complicating factor is that many images are distorted making accurate text extraction difficult. To
account for this fuzzy logic is used to detect words that are close to what is needed, but perhaps one or two
letters different due to the character recognition being confused by noise in the image. This is
implemented using fuzzyset29.
Confirmation – Once each branch has delivered potential matches, a brute force comparison algorithm
(implemented using OpenCV 30) gives a similarity score for each. Images are either considered to certainly
have at least one match, certainly have no matches, or be an edge case that requires human review.
Development – The calibration of parts of this process required a sizeable dataset of classified images,
something not available at the start of the development. However the SVMs could be trained to a
workable level of accuracy with just one example image. So initially we simply gave the model one
labelled example of each species. With this we had a Color branch that was able to classify a decent
number of iimemes that were easier to identify. We were then able to train a more robust Color branch,
along with the Shape and Word branch.
26
Tensorflow was developed by Google Brain - tensorflow.org
27
Tesseract OCR is developed by Google - github.com/tesseract-ocr
Python wrapper developed by Matthias Lee - github.com/madmaze/pytesseract
28
Bird, Steven; Klein, Ewan; Loper, Edward (2009), Natural Language Processing with Python. O’Reilly Media Inc.
29
Python fuzzyset developed by Mike Axiak - github.com/axiak/fuzzyset
30
Open CV developed by Intel - opencv.org
Python implementation developed by Olli-Pekka Heinisuo - github.com/skvark/opencv-python
iiMemeIndex.com 29
APPENDIX A: DETAILED METHODOLOGY
We used the Reddit API31 to collect metadata relevant to each image including upvotes, which are used by
the site to rank popularity. Reddit does not provide view counts, but 12% of images we collected were
hosted externally on Imgur, a site that does provide a view count. Using the Imgur API to collect view count
for those 12% of images then allowed us to extrapolate the relationship between upvotes and views, giving
us an estimate of views for the remaining 88% of images.
Reddit is divided into Subreddits with varying audiences and patterns of behaviour; this is a very significant
factor when estimating views. Figure 32 displays a handful of the larger Subreddits we collected from and
shows the ratio of upvotes to views ranges from around 4% to almost 13%.
When estimating views we also consider that the metadata on an Imgur post may not align precisely with
the corresponding Reddit post; for instance someone may post to Reddit a link to an Imgur post that was
created a year earlier. To account for this we do not correlate total views and total upvotes, instead we
make multiple observations of each post and correlate the differences between observations, which gives a
more accurate relationship between the two.
On a side note we found that on average an image received around 50% of its views within the first 8
hours, and at least 98% of its views within the first 24 hours.
Fig 32: How Subreddits vary by number of subscribers and the proportion of
viewers who upvote posts
31
Python Reddit API Wrapper developed by Bryce Boe - github.com/praw-dev/praw
iiMemeIndex.com 30
APPENDIX A: DETAILED METHODOLOGY
This is the downstream echo term, a weighted average of past baseline MEMP values,
multiplied by the strength term b. For our analysis we set memory m to 4 weeks.
Weights are determined by a Barthann window; α 0 = 0.62 , α1 = 0.48 , α 2 = 0.38 .
€
The strength term is inversely proportional to zt a measure of the concentration of iimemes
1 at that time. If every instance of the species/family is found in the same Subreddit then it
b(t) ∝ will get the maximum concentration € score and€thus the minimum
€ strength term. Given that
zt
zt has a big impact on final MEMP we apply significant smoothing to reduce volatility.
€
The final MEMP term is the baseline plus the downstream echo plus
f (t) c , the crash factor. c is a relatively insignificant term that is almost
€ € = g(t) + d µ (t) + c(t) always zero, except in cases where there is a long period of zero views
when it becomes negative to speed the decline and simulate a crash.
€ €
€
Fig 33: Sigmoid weight function Fig 34: Barthann weight function
with memory set to 20 days with memory set to 28 days
iiMemeIndex.com 31
APPENDIX A: DETAILED METHODOLOGY
v. F inal r anking
Having calculated a MEMP time series for every species and family we finally need to provide some
ranking for the most significant of 2018. For this we give equal weighting to two metrics: the peak and
year average. This balances the recognition of species/families that perform consistently with those that
have great viral success for a short time but a lower annual average.
The index for the species/family x is calculated as such:
fx (t)
max fx (t) ∑ N
t ∈2018€ t ∈2018 S is the set of all species (or families)
i(x) = +
⎛ N is the number of time periods in 2018
max(max f s (t)) f s (t)⎞
s ∈S t ∈2018 max⎜⎜ ∑ ⎟
⎟
s ∈S N
⎝ t ∈2018 € ⎠
€
The index for species x is the maximum MEMP of x in 2018, divided by the maximum MEMP of any
species in 2018, plus the average MEMP of x in 2018, divided by the maximum average MEMP of any
€ species in 2018. For family calculations we divide by the maximum of any family during 2018.
€ €
€
Now to fill the rest of the space on this page here are the two most prominent iimemes using webcomics:
The Scroll of Truth (species rank 38) Two Red Buttons (species rank 32)
iiMemeIndex.com 32
APPEN DIX B: DETAILED RESU LTS
i. T op 3 0 ranked s pecies
iiMemeIndex.com 33
APPENDIX B: DETAILED RESULTS
iiMemeIndex.com 34
APPENDIX B: DETAILED RESULTS
January - - - - - -
We Don’t Do That
June Thanos - ∞ 182 -
Here
Stefán Karl €
August - 10 Excuse Me, WTF? 12 -
Stefánsson
Theresa May
November Surprised Pikachu 3 - 34 -
Dancing
iiMemeIndex.com 35
APPENDIX B: DETAILED RESULTS
Eric Andre Show (family 8) and Stefán Karl Stefánsson (family 10) and
Who Killed Hannibal (species 16) Stefan’s Preference (species 28)
This Is America (family 16) and Star Trek (family 17) and
This Is America Execution (species 79) Captain Picard WTF (species 51)
Mario Bros (family 18) and Elon Musk (family 19) and
Mario Bros Views (species 70) Elon Musk Smoking Weed (species 70)
iiMemeIndex.com 36
APPEN DIX C: ADDITION AL MEME EXAMPLES
iiMemeIndex.com 37
APPENDIX C: ADDITIONAL MEME EXAMPLES
Thanos – Strongest Wills They Hated Jesus Intelligent Students Crying Kid
iiMemeIndex.com 38
APPENDIX C: ADDITIONAL MEME EXAMPLES
American Chopper Argument Trump – Enemy of the People Excuse Me, WTF?
Change Language
Lesson in Trickery How Old Is…?
iiMemeIndex.com 40