Академический Документы
Профессиональный Документы
Культура Документы
net/publication/323960601
The Good, the Bad and the Bait: Detecting and Characterizing Clickbait on
YouTube
CITATIONS READS
6 1,811
4 authors:
Some of the authors of this publication are also working on these related projects:
ReCRED: From Real-world Identities to Privacy-Preserving and Attribute-based CREDentials for Device-centric Access Control View project
All content following this page was uploaded by Savvas Zannettou on 23 March 2018.
kiss
whatsahpopt
boyfriend
challeenxgi
japanes
indianl
grin
tri
prankt
girlfriend
indal
goniae
imposs
wroanrig
y
sexy
h
body
suit
ot
ent
ctive
ing
dog
sex
th
gir
gir
no
gon
prett
beac
vir
h
maill
sc
s
cloth
garm
swim
blo
attra
(a) (b) (c)
0.6 Clickbait non-Clickbait Category Clickbaits (%) Non-clickbaits (%)
0.5 0.9
0.8 Entertainment 406 (38%) 308 (32%)
0.4 0.7
0.3 0.6 Comedy 318 (29%) 228 (24%)
0.5
0.2 0.4 People & Blogs 155 (14%) 115 (12%)
0.3 Autos & Vehicles 33 (3%) 49 (5%)
0.1 0.2
0.0 0.1 Sports 29 (3%) 114 (12%)
0.0
s
Likes
ents
es
View
ading
Bait
bnail
bait
ed
Title
Dislik
Click
Comm
Click
Thum
Misle
(d) (e)
Fig. 2 & TABLE I: Analysis of the manually reviewed ground truth dataset. Normalized mean scores for: (a) stems from headline text; (b)
tags derived from thumbnails; (c) stems from tags that were defined by uploaders; (d) video statistics; and (e) comments that contain words
for flagging suspicious videos. Table 1 shows the top five categories (and their respective percentages) in our ground truth dataset.