Академический Документы
Профессиональный Документы
Культура Документы
Research by
Rich Gordon & Syndio Social, LLC.
1
A key question by the late ’90s:
How do you build the online audience?
• Data showed clearly that what news sites were
doing clearly wasn’t enough
• Blogs offered some clues
o Individual bloggers generated huge audiences
o News Web sites found that a single link from an “A
list” blogger could drive enormous traffic
• News sites found that half or more of their
traffic arrived from search engines or inbound
links.
© Rich Gordon / Syndio Social 2010
And then …
… the Facebook phenomenon
2
2007: For MMC and NAA,
I began exploring “new communities”
• Wrote the “Online Community
Cookbook” under contract to
the NAA
• Took on a part-time
assignment as director of new
communities for MMC
• Discovered Linked, by Albert-
Laszlo Barabasi, and the
“groundbreaking science of
networks”
© Rich Gordon / Syndio Social 2010
A brief introduction
the science of networks
3
The science of networks:
a brief historical overview
• Its roots: 18th century mathematics
• Understanding of networks has exploded in
the past decade – with applications to many
disciplines:
o psychology, sociology, biology, neurology, ecology,
business, marketing, political science and more
o Northwestern researchers: leaders in this field
• The first 50 years of network research focused
on interpersonal networks (or social networks)
© Rich Gordon / Syndio Social 2010
Social networks:
What researchers found
Alice
Mary
1 2
Bob
Ann
Joe
Joan
Phil
3
Cindy
Mike
4
What we know
about social networks
1. People cluster together: if I am your friend, and you
are Mike’s friend, there is a very good chance I am
Mike’s friend, too
2. Interpersonal networks are “small worlds” – in
general you can connect any two individuals
through a small number of “hops” or “handshakes”
3. Connectors are the reason that “small worlds” exist.
They are likely to have a much larger group of
friends and acquaintances than most people.
4. These connectors are network hubs that connect
clusters to one another.
o Without connectors, interpersonal networks would have more
than “six degrees of separation”
Source: Albert-Laszlo Barabasi, Linked
5
What researchers learned
about the World Wide Web
1. Web sites cluster together: if my site links to yours, and your
site links to Mike’s, there is a very good chance my site links
to Mike’s, too
2. The Web is a “small world” – in general you can connect any
two sites through a small number of “hops” or “handshakes”
3. Connectors are the reason that the Web is a “small world.”
These are the Web sites that are most likely to be linked to
other sites – they are “shortcuts” across the Web network
o 80% of Web links go to 15% of Web pages*
4. These connectors are network hubs that connect network
clusters to one another.
o These sites also tend to get a disproportionate share of Web traffic
6
In a world of infinite choice,
how can attention be more concentrated?
Top 10 11-100
0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100%
Source: Matthew Hindman, “A Mile Wide and an Inch Deep: Measuring Media Diversity Online and Offline”
7
How digital content is different:
the network perspective
• Content networks (links) are:
o Transparent (they are easily visible to others)
o Persistent (they remain live for an extended
period – perhaps forever)
• Interpersonal networks now can also be
transparent and persistent
o Bloggers who frequently cite and comment on
one another
o Content sharing on social networking sites like
Twitter, Facebook, etc.
© Rich Gordon / Syndio Social 2010
8
In the mass media age (1950-1995?),
newspapers & TV had it pretty easy
Because of technology
constraints …
Cost of presses
Limited broadcast spectrum
... media choice was
limited ...
… mass media were the
hubs ...
it was relatively easy to
capture attention, large
audiences, handsome
profits
© Rich Gordon / Syndio Social 2010
9
Why attention is more concentrated:
the mathematics of networks
Many
links
8’4”-
Height: Links in a network:
Linear power law
Few
links
1’10”-
1 6.7 billion 1 ∞
Rank among people in the world Rank among Web sites
♦
10
Research objectives
Discover the network of links connecting
the news websites in Chicago’s news ecosystem
11
Choosing our seed sites:
two primary sources
Crawl demonstration
S1
S2 S3
12
Crawl demonstration
S1
S2 S3
Crawl demonstration
S1
S2 S3
13
Crawl demonstration
S1
S2 S3
Crawl demonstration
S1
S2 S3
14
Crawl demonstration
S1
S3
The new core acts as a second list of seed sites; it is crawled again to identify the
next level of the community. After 3 iterations, the final network is exposed
IssueCrawler findings
15
General network overview
Key statistics
Total Nodes: 277 Total Links: 24,598
Density: 0.1273 Total Site Relationships:
1,232
Site categorizations
To simplify analysis, sites were coded by Category
Category Description
Legacy Web publications corresponding to a mainstream or traditional
media brand [Ex: ChicagoTribune.com, CBS2Chicago.com]
Legacy- Web publications/brands owned by a mainstream or traditional
Affiliated media brand [Ex: Chicagonow.com, Vocalo.org]
Micropublisher Web-only/Web-first publishers focused on a particular topic,
audience or geographic area [Ex: GapersBlock.com]
Organization/ Organizations, companies, institutions or non-profits that
Institution historically would have needed media intermediaries but now
publish online [Ex: FieldMuseum.org, CityofChicago.org]
National Brand Websites of national scope with local presence [Ex:
CitySearch.com, HuffingtonPost.com, SBNation.com]
Service Websites that provide services to Web publishers [Ex:
WordPress.com, Twitter.com, Quantcast.com]
16
Site categorizations
The Legacy, Legacy-affiliated, Micropublisher, and
Organization/Institution categories were further coded by Scope
Scope Description
Geo-Publisher Web publications focused on one or more specific
geographic areas within the Chicago region (Ex:
DailyHerald.com, EvanstonNow.com,
AdentroDePilsen.org)
Niche Web publications focusing on a topic or audience
Publisher segment (Ex: ChicagoBusiness.com,
TheExpiredMeter.com, BleedCubbieBlue.com,
NewCity.com, Catalyst-Chicago.org)
Mass Media Websites branded with a major mass media outlet (Ex:
ChicagoTribune.com, NBCChicago.com, Newshour.org)
A F
D H
B G
17
Organizations/Institutions,
a mix of other sites are top authorities
Top 5 Authorities
Transitchicago.com
Chicagotribune.com
Gapersblock.com
Mcachicago.org
Metrarail.com
Top 5 Hubs
Gapersblock.com
Badatsports.com
Saic.edu
Uchicago.edu
Macfound.org
18
Betweenness:
Intermediaries and switchboards
•Flow betweenness measures the number of ‘paths’ passing through
each site; betweenness measures the number of ‘shortest paths’
passing through each site
Intermediaries: sites that are deeply embedded between
otherwise unconnected sites (high flow betweenness)
Switchboards: sites that connect readers in the fastest
possible way to otherwise unconnected communities (high
betweenness)
A F K M
D H I J N
B G L O
Top 5 Intermediaries
Addthis.com
Windycitizen.com
Ravinia.org
Chicagoartistsresource.org
Cityofchicago.org
19
Gapersblock.com, Windycitizen.com,
Organizations/Institutions are switchboards
Top 5 Intermediaries
Gapersblock.com
Transitchicago.com
Windycitizen.com
Saic.edu
Chicagoartistsresource.org
Eigenvector centrality:
the key ingredient in search algorithms
20
Eigenvector centrality
is a measure of prestige
A
• A site’s centrality is a function of the centralities
of the sites it links to B C
• Sites with a high eigenvector centrality tend to
be considered “prestigious”
Organization/Institutions
tend to be the most prestigious
Top 5 Authorities
Transitchicago.com
Metrarail.com
Newcity.com
Rtachicago.com
Nictd.com
21
When we set a ‘link volume’ threshold, we
see clustering by content and affiliation
NewCityChicago
Sports
Tribune Co.
Music
Micropublisher core
Periphery
Niche publishers
central cluster
Periphery
22
Thank you!!
Rich Gordon
richgor@northwestern.edu
23