Академический Документы
Профессиональный Документы
Культура Документы
Rich Social Network Data Schema to aid designing, collec1ng and evalua1ng social network data
Data
Strategy
Design
&
Collec1on
9me
Hypothesis Evalua1on
9me
Hypothesis Evalua1on
Mar1n EvereM David Krackhardt Nicholas Mullins S.D. Berkowitz Ronald B urt Barry Wellman Anatol Rapoport Stanley Wasserman J. A. Barnes Katherine Faust Nan Lin Peter Marsden Tom A. B. Snijders M ark G ranoveMer Stephen Garry Robins Linton Freeman David Knoke BorgaV Kathleen Carley Karen Harrison White Cook Douglas R. White
Mar1n EvereM
David
Krackhardt
Nicholas
Mullins
S.D.
Berkowitz
Ronald
B urt
Barry
Wellman
Anatol
Rapoport
Stanley
W asserman
Mul1level
J.
A.
Barnes
Katherine
Faust
Nan
Lin
Analysis
&
SIENA
Peter
Marsden
Tom
A.
B.
Snijders
M ark
G ranoveMer
Stephen
Garry
Robins
Linton
Freeman
David
Knoke
BorgaV
Kathleen
Carley
Karen
Harrison
White
Cook
Douglas
R.
White
Exchange
&
Trust
The
Strength
of
Weak
Ties
(economic
networks)
ERGMs
Dynamic
Network
Analysis
Mar1n EvereM
David
Krackhardt
Nicholas
Mullins
S.D.
Berkowitz
Ronald
B urt
Barry
Wellman
Anatol
Rapoport
Stanley
W asserman
Mul1level
J.
A.
Barnes
Katherine
Faust
Nan
Lin
Analysis
&
SIENA
Peter
Marsden
Tom
A.
B.
Snijders
M ark
G ranoveMer
Stephen
Garry
Robins
Linton
Freeman
David
Knoke
BorgaV
Kathleen
Carley
Karen
Harrison
White
Social
Networks
&
Cook
Douglas
R.
White
Social
Structure
&
Cogni1on
Exchange
&
Trust
ERGMs
Network
Realism
Inter-organisa1onal
poli1cal
networks
&
Terrorist
Networks
the
Internet
Dynamic
Network
Analysis
Social Constructs / Persistent Social Forma1ons Formal Organisa1ons & Social Networks Consensus Analysis
Privacy Concerns
?? No Standard Representa1on ??
Approach
Taken
1. Searched
for
publically
available
social
network
datasets
(20-30
dierent
datasets)
2. Accesses
datasets
&
related
publica1ons.
Reviewed
structure
and
collec1on
approach
3. Created
draf
schema
4. Added
110
more
datasets
to
analysis.
Rened
/
iterated
schema
design
5. Published
dataset
wiki
/
solicited
input
from
social
network
analysis
community
(INSNA)
6. Completed
schema
design
TBC
Dataset
Wiki:
hPp://dl.ucd.ie
Examples: UK MPs on TwiMer (Personal TwiMer Accounts) (Men1ons) Co-authorship in network science (Academic Journal Authors) (Co-Authorship) Infec1ous SocioPaMerns (Visitors to Science Gallery) (face-to-face proximity)
Is bipar1te? .
Examples: Terrorist Network Nodes Types: Terrorist, Leader, Poli1cian, Ci1zen Primary School Cumula1ve Network Node Types: Teacher, Student Edge Type: Physical Interac1on between student and teacher
. . . .
.
.
Is bipar1te?
Examples: The Policy Network of Toxic Chemicals Regula1on in Germany in the 1980s Edge Types: Shared CommiMee Membership, Informa1on Exchange Students data sets (van de Bunt) Edge Types: Unknown, best friend, friend, friendly rela1on, neutral, troubled rela1on, item non-response, actor non-response
Is bipar1te?
Examples:
Enron
Email
Dataset
Nodes:
Senior
Enron
Employees
Edge
Types:
Email
Sent,
Email
Recieved
Weight:
#
of
Emails
sent
Dining-table partners in a girls dormitory at a New York State training school Nodes: Girls in a New York state dormitory Edge Types: preferred dining partner Weight: order of preference
Is bipar1te?
Examples:
Lawyers
data
(Lazenga)
Node
APributes:
seniority,
formal
status,
oce
in
which
they
work,
gender,
law
school
aMended,
individual
performance
measurements
(hours
worked,
fees
brought
in),
aVtudes
concerning
management
policy
Node AMributes . .
Irish Poli1cians & Organisa1ons on TwiMer Communi9es: Poli1cal Alia1on (Fine Gael, Fianna Fil, Labour, Sinn Fin, )
Is bipar1te?
Examples: Kapferer Tailor Shop Interac1ons recorded at two dierent 1me points seven months apart; a strike happened in between (snapshot) Southern Women Network It contains the observed aMendance at 14 social events by 18 Southern women. (event driven)
Is bipar1te?
Examples: Norwegian Boards (Aug09) Board membership evolu1on from 1999 to 2009 (con1nuous or real-1me)
Is bipar1te?
Examples: Wiki-Vote Nodes: Wikipedia Editors Edges: Vo1ng Behaviour Parallel Data: Vote outcome MathSciNet: Co-authorship network Node: Journal Ar1cle Authors Edges: Co-authorship Parallel Data: Detailed informa1on about MathSciNet papers: numerical IDs of papers, authors, and categories
Parallel
Is bipar1te?
Examples: Extended Epinions dataset Nodes: Consumers on trust site Epinions.com Edges: Trust / Distrust Parallel Data: Details of all product reviews hosted on the Epinions website
Parallel
Is bipar1te?
Examples: Newcomb Fraternity 15 weekly sociometric preference rankings from 17 men aMending the University of Michigan in the fall of 1956; data from week 9 are missing. Enron Email Dataset (Boundary Condi1ons)
Parallel
Collec1on Metadata
Is bipar1te?
Examples: Yahoo! Messenger User Communica1on PaMern Dataset contains a small sample of the Yahoo! Messenger community's communica1on (IM) log at a high level for a period of 4 weeks. Specically, this dataset only records the rst communica1on event from one user to another on a par1cular day, and generates such records for a period of 28 days.
Parallel
Collec1on Metadata
Is bipar1te?
Parallel
Collec1on Metadata
Thank You
Ques2ons