Академический Документы
Профессиональный Документы
Культура Документы
Chun-Yuen Teng
Yu-Ru Lin
Lada A. Adamic
School of Information
University of Michigan
Ann Arbor, MI, USA
School of Information
University of Michigan
Ann Arbor, MI, USA
chunyuen@umich.edu
yuruliny@gmail.com
ABSTRACT
The recording and sharing of cooking recipes, a human activity dating back thousands of years, naturally became an
early and prominent social use of the web. The resulting
online recipe collections are repositories of ingredient combinations and cooking methods whose large-scale and variety yield interesting insights about both the fundamentals of
cooking and user preferences. At the level of an individual
ingredient we measure whether it tends to be essential or can
be dropped or added, and whether its quantity can be modified. We also construct two types of networks to capture the
relationships between ingredients. The complement network
captures which ingredients tend to co-occur frequently, and
is composed of two large communities: one savory, the other
sweet. The substitute network, derived from user-generated
suggestions for modifications, can be decomposed into many
communities of functionally equivalent ingredients, and captures users preference for healthier variants of a recipe. Our
experiments reveal that recipe ratings can be well predicted
with features derived from combinations of ingredient networks and nutrition information.
General Terms
Measurement; Experimentation
Keywords
ingredient networks, recipe recommendation
1.
INTRODUCTION
The web enables individuals to collaboratively share knowledge and recipe websites are one of the earliest examples of
collaborative knowledge sharing on the web. Allrecipes.com,
ladamic@umich.edu
large communities: one savory, the other sweet. The substitute network, derived from user-generated suggestions for
modifications, can be decomposed into many communities of
functionally equivalent ingredients, and captures users preference for healthier variants of a recipe. Our experiments
reveal that recipe ratings can be well predicted by features
derived from combinations of ingredient networks and nutrition information (with accuracy .792), while most of the
prediction power comes from the ingredient networks (84%).
The rest of the paper is organized as follows. Section 2 reviews the related work. Section 3 describes the dataset. Section 4 discusses the extraction of the ingredient and complement networks and their characteristics. Section 5 presents
the extraction of recipe modification information, as well as
the construction and characteristics of the ingredient substitute network. Section 6 presents our experiments on recipe
recommendation and Section 7 concludes.
2.
RELATED WORK
3.
DATASET
3.1
Data preprocessing
40
20
0
10
% in recipes
30
midwest
mountain
northeast
west coast
south
bake
boil
fry
grill
roast
simmer marinate
method
3.2
Regional preferences
4.
p(a, b)
,
p(a)p(b)
where
p(a, b) =
p(a) =
# of recipes containing a
,
# of recipes
p(b) =
# of recipes containing b
.
# of recipes
5.
RECIPE MODIFICATIONS
tiger prawn
lobster tail
artichoke
greek yogurt
white mushroom
haddock
button mushroom
goat cheese
port wine
watercres
sea scallop
triple sec
sour mix
sweet
white rum
club soda
butter
cranberry juice
ice
pomegranate juice
pink lemonade
banana liqueur
shallot
juiced
tequila
smoked ham
asparagus
brie cheese
watermelon
hazelnut
orange juice
eggnog
lemon juice
juice
superfine sugar
plum
white chocolate
artificial sweetener
semisweet chocolate
chocolate coffee
cake flour
raspberry jam
hazelnut liqueur
cocoa powder
almond paste
milk chocolate
vanilla wafer
peach
cantaloupe
pie shell
pistachio nut
bourbon whiskey
vanilla yogurt
blackberry
fig
golden syrup
kiwi
banana
pear
prune
chocolate wafer
candied cherry
apricot jam
apple juice
currant
orange gelatin
tapioca
turbinado sugar
pie crust
cream of tartar
german chocolate
flour
baking soda
strawberry preserve
coffee powder
vanilla
extract
semisweet chocolate chip
maple extract
white chip
chocolate chip
vanilla frosting
unpie crust
unbleached flour
applesauce
tapioca flour
fruit
brownie mix
flax seed
oat bran
skim milk
teriyaki sauce
yeast
sunflower kernel
matzo meal
pie filling
wheat
cream cheese
wheat bran
beaten egg
sourdough starter
neufchatel cheese
pretzel
chocolate pudding
decorating gel
cook
cornflakes cereal
cornmeal
pancake mix
beer
vegetable
ranch dressing
corn
green bean
spiral pasta
salt
cornflake
olive
kidney bean
white corn
vinegar
ketchup
pickle relish
crescent roll
butter cooking spray
potato chip
dill pickle
bean
barbeque sauce
rye bread
butter cracker
green chile
baby pea
sausage
brown mustard
stuffing
picante sauce
turkey gravy
cheese
lean beef
ranch bean
macaroni
taco seasoning
taco sauce
elbow macaroni
kernel corn
catalina dressing
ham
onion
whole wheat tortilla
green chily
mexican cheese blend
butter bean
stuffed olive
egg noodle
mild cheddar cheese
colby cheese
beef gravy
sour cream
vidalia onion
processed cheese
stuffing mix
barbecue sauce
cream corn
biscuit mix
buttermilk biscuit
onion salt
sourdough bread
tuna
refried bean
enchilada sauce
lima bean
garlic salt
steak sauce
yellow mustard
mustard
pimento pepper
french dressing
sauerkraut
corned beef
vegetable combination
corn chip
tortilla chip
guacamole
beef chuck
powder
pepperjack cheese
kaiser roll
pimento
pickle
bacon grease
hoagie roll
tomato juice
english muffin
pepperoni sausage
pizza sauce
chili bean
mixed vegetable
onion flake
seasoning salt
pimiento
pepper
pizza crust
bread dough
chuck roast
wax bean
roast beef
beef consomme
corn tortilla
saltine cracker
biscuit
bratwurst
round steak
sandwich roll
white bread
apple jelly
baking mix
black olive
beef bouillon
pinto bean
parsley flake
meat tenderizer
dressing
marinara sauce
spaghetti sauce
salami
pepperoni
tomato sauce
potato
green bell pepper
venison
broccoli floweret
cottage cheese
liquid smoke
cracker
smoked sausage
worcestershire
sauce
chicken
spicy brown mustard
lasagna noodle
spaghetti
polish sausage
swiss cheese
provolone cheese
seashell pasta
bacon dripping
steak
onion separated
seasoning
horseradish
pork chop
italian bread
toothpick
saltine
adobo seasoning
noodle
long grain
salad green
black bean
navy bean
cornbread
beef brisket
mexican corn
pasta sauce
basil sauce
manicotti shell
red bean
seafood seasoning
browning sauce
chicken bouillon
iceberg lettuce
italian sauce
ziti pasta
turkey breast
lean turkey
lettucechili sauce
cheese ravioli
tomato
mozzarella cheese
white hominy
baby carrot
barley
beef broth
green pea
poultry seasoning
kielbasa sausage
monosodium glutamate
chile sauce
alfredo sauce
pasta shell
tube pasta
tomato paste
fajita seasoning
mustard powder
pasta
italian sausage
white potato
nutritional yeast
rump roast
lean pork
crouton
dill seed
bagel
grape jelly
creole seasoning
okra
red potato
smoked paprika
celery salt
mixed nut
celery
romano cheese
rotini pasta
banana pepper
cabbage
popped popcorn
hot sauce
onion powder
celery seed
caesar dressing
lentil
marjoram
beef sirloin
bacon bit
cocktail sauce
pork sparerib
miracle whip
potato flake
yellow cornmeal
milk
margarine
cereal
egg
candy
dill
ditalini pasta
rigatoni pasta
ricotta cheese
pearl barley
ham hock
green salsa
chive
steak seasoning
cider vinegar
caraway seed
bread flour
crab meat
herb stuffing
savory
meatball
jalapeno pepper
cauliflower
sirloin steak
oil
pearl onion
cod
cauliflower floret
oyster
wild rice
catfish
white vinegar
unpie shell
lemon pepper
water chestnut
pickling spice
yellow squash
chorizo sausage
beef stock
cajun seasoning
coleslaw mix
vegetable bouillon
hungarian paprika
french bread
parmesan cheese
spinach
artichoke heart
russet potato
flounder
caulifloweret
molasse
yellow onion
yellow pepper
poblano pepper
crawfish tail
radishe
cannellini bean
burgundy wine
pork shoulder
broccoli floret
beet
chicken liver
white onion
cheese tortellini
fusilli pasta
andouille sausage
chicken wing
giblet
romaine
cumin
cayenne pepper sage
cooking oil
mustard seed
salmon steak
pumpkin seed
rye flour
oatmeal
nilla wafer
vanilla
blue cheese
unpastry shell
topping
garlic paste
egg substitute
cooking spray
sunflower seed
food coloring
ramen noodle
chutney
black pepper
french baguette
white bean
chicken breastmushroom
scallion
chicken thigh
sesame seed
cashew
softened butter
cherry gelatin
milk powder
brown sugar
butterscotch chip
red grape
basil pesto
oregano
chicken broth
italian seasoning
bay
tomatillo
avocado
low fat
canola oil
dijon mustard
acorn squash
whole milk
red apple
pineapple chip
wheat germ
lemon gelatin
baking apple
peanut butter
vanilla pudding
maple syrup
tart apple
anise seed
turmeric
garam masala
lobster
caramel
chocolate mix
butter shortening
ginger paste
mixed fruit
maple flavoring
honey
asafoetida powder
green tomato
mango chutney
pita bread
chipotle pepper
cucumber
pesto
escarole
kale
clam
clam juice
red pepper
brown rice
white pepper
wonton wrapper
serrano pepper
green lettuce
baby corn
salad shrimp
curry powder
marshmallow
orange marmalade
marshmallow creme
mixed berry
rice flour
lime gelatin
recipe pastry
green grape
ring
pastry
apple
peppercorn
green apple
apricot preserve
soy milk
potato starch
nut
lard
peanut butter chip
vegetable shortening
toffee baking bit
lemon peel
lemon yogurt
sour milk
pumpkin
1% buttermilk
evaporated milk
baking cocoa
corn syrup
apple butter
cornstarch
cinnamon water
vegetable oil
oat sugar
persimmon pulp
raspberry preserve
cinnamon sugar
raspberry gelatin
allspice
mace
gingersnap cooky
strawberry gelatin
strawberry jam
black walnut
coconut extract
shortening
pecan
anise extract
orange peel
butterchocolate
extract cookie
bourbon
baking chocolate
rhubarb
graham cracker
buttermilk
date
chocolate pudding
mix
chocolate frosting
candied citron
fruit cocktail
cinnamon
red candy
chocolate sandwich
cooky
lemon extract
chicken drum
red lentil
panko bread
habanero pepper
snow pea
bamboo shoot
cumin seed
fenugreek seed
curry
cooking sherry
romaine lettuce
fennel seed
oyster sauce
ghee
spaghetti squash
eggplant
plum tomato
bell pepper
thyme
garbanzo bean
farfalle pasta
brown lentil
bay scallop
carrot
gingerroot
coriander seed
bean sprout
smoked salmon
basmati rice
anchovy
green olive
chickpea
rutabaga
vegetable stock
parsley
red wine
red snapper
saffron thread
herb
prosciutto
collard green
green cabbage
fish stock
round
fontina cheese
basil
zucchini
vermicelli pasta
asiago cheese
linguine
vegetable broth
shrimp
pork loin
hoisin sauce
yogurt
soy sauce
cardamom
fettuccini pasta
parsnip
tarragon vinegar
turnip
cilantro
coriander
black peppercorn
allspice berry
sugar pumpkin
cardamom pod
splenda
clove
mandarin orange
silken tofu
peppermint extract
hot
red lettuce
adobo sauce
brussels sprout
seed
linguini pasta
orzo pasta
penne pasta
roma tomato
caper
cherry tomato
leek
phyllo dough
ears corn
halibut
sugar snap pea
chipotle chile powder
bok choy
chinese five spice powder
ginger
grape
walnut oil
pastry shell
salt pepper
tofu
napa cabbage
tarragon
red cabbage
sherry
rice wine
raspberry vinegar
sweet potato
crystallized ginger
apricot nectar
golden raisin
mixed spice
food cake
orangeangel
extract
rum extract
apple cider
apricot
miso paste
rice vinegar
chicken stock
pork tenderloin
sesame oil
fish sauce
beef tenderloin
saffron
flank steak
curry paste
jasmine rice
chile paste
rice noodle
grapefruit
orange zest
nectarine
pound cake
gruyere cheese
serrano chile pepper
low fat cottage cheese
lemon gras
peanut oil
fettuccine pasta
swiss chard
creme fraiche
pancetta bacon
debearded
squid
lamb
chile pepper
pork roast
kaffir lime
butternut squash
mirin
ginger root
coconut milk
tahini
spanish onion
scallop
mussel
arborio rice
rosemary
red onion
bulgur
salmon
portobello mushroom
new potato
tamari
sake
kalamata olive
feta cheese
asparagu
marsala wine
quinoa
corn oil
chili oil
whipping cream
macadamia nut
puff pastry
cream
lime peel
lime zest
whiskey
strawberry
cherry
blueberry
yam
tea bag
semolina flour
raspberry
orange sherbet
gelatin
grape juice
cream of coconut
amaretto liqueur
chocolate syrup
coffee liqueur
ladyfinger
lime juice
mango
skewer
wooden skewer
chicken leg
crimushroom
couscou
heavy cream
honeydew melon
orange liqueur
vanilla bean
white sugar
mascarpone cheese
pine nut
tilapia
cornish game hen
kosher salt
papaya
zested
orange
greek seasoning
english cucumber
coconut oil
malt vinegar
pineapple ring
coconut cream
egg yolk
bittersweet chocolate
balsamic vinegar
orange roughy
lime mint
sauce
champagne
sour cherry
garlic
gorgonzola cheese
sea salt
baby spinach
grapefruit juice
vodka
lemonade
pineapple juice
spiced rum
rum
maraschino cherry
vanilla vodka
lemon
simple syrup
carbonated water
fennel
italian parsley
tuna steak
vermouth
gin
limeade
triple sec liqueur
ginger ale
butterscotch schnapp
olive oil
grape tomato
chestnut
leg of lamb
melon liqueur
coconut rum
grenadine syrup
lemon lime soda
peach schnapp
arugula
white wine
trout
baking potato
pork sausage
creamed corn
canadian bacon
chili
beef
tomato soup
cheddar cheese
biscuit dough
process cheese
american cheese
chunk chicken
hot dog
tater tot
grit
Figure 2: Ingredient complement network. Two ingredients share an edge if they occur together more than
would be expected by chance and if their pointwise mutual information exceeds a threshold.
0.6
0.1
0.2
0.3
0.4
0.5
no modification
with modification
0.0
luncheon meat
rating
5.1
Adjustments
5.2
0.50
0.02
0.05
0.10
0.20
garlic
broth
cheddarcinnamon chicken
bacon
chip honey
mushroom parmesan chocolate
cream
cheese
cornstarch
worcestershire s.
potato
lemon juice
garlic powder
chicken breast
milk
carrot
sour cream
tomato
flour
brown sugar
vanilla extract
basil
pecan
nutmegwater
butterwhite sugar
onion
celery
mayonnaise
sugar
oregano
cs. sugar black pepper
egg
salt
walnut
baking powder
pepper
olive oil
green bell pepper
baking soda
parsley
shortening
vegetable oil
0.01
1.00
soy sauce
margarine
0.01
0.02
0.05
0.10
0.20
0.50
1.00
5.3
addition
0.41
deletion
0.22
-0.15
increase
0.61
0.79
0.09
decrease
0.68
0.11
0.58
0.39
substitution. Thus, we found an alternative source for extracting replacement relationships users comments, e.g.
I replaced the butter in the frosting by sour cream, just to
soothe my conscience about all the fatty calories.
To extract such knowledge, we first parsed the reviews
as follows: we considered several phrases to signal replacement relationships: replace a with b, substitute b for a,
b instead of a, etc, and matched a and b to our list of
ingredients.
We constructed an ingredient substitute network to capture users knowledge about ingredient replacement. This
weighted, directed network consists of ingredients as nodes.
We thresholded and eliminated any suggested substitutions
that occurred fewer than 5 times. We then determined the
weight of each edge by p(b|a), the proportion of substitutions of ingredient a that suggest ingredient b. For example,
68% of substitutions for white sugar were to splenda, an
artificial sweetener, and hence the assigned weight for the
sugar splenda edge is 0.68.
The resulting substitution network, shown in Figure 5,
exhibits strong clustering. We examined this structure by
applying the map generator tool by Rosvall et al. [13], which
uses a random walk approach to identify clusters in weighted,
directed networks. The resulting clusters, and their relationships to one another, are shown in Fig. 6. The derived clusters could be used when following a relatively new recipe
which may not receive many reviews, and therefore many
suggestions for ingredient substitutions. If one does not have
all ingredients at hand, one could examine the content of
ones fridge and pantry and match it with other ingredients
found in the same cluster as the ingredient called for by
the recipe. Table 2 lists the contents of a few such sample
ingredient clusters, and Fig. 7 shows two example clusters
extracted from the substitute network.
Table 2: Clusters of ingredients that can be substituted for one another. A maximum of 5 additional
ingredients for each cluster are listed, ordered by
PageRank.
main
chicken
olive oil
sweet
potato
baking
powder
almond
apple
egg
tilapia
spinach
italian
seasoning
cabbage
other ingredients
turkey, beef, sausage, chicken breast, bacon
butter, apple sauce, oil, banana, margarine
yam, potato, pumpkin, butternut squash,
parsnip
baking soda, cream of tartar
pecan, walnut, cashew, peanut, sunflower s.
peach, pineapple, pear, mango, pie filling
egg white, egg substitute, egg yolk
cod, catfish, flounder, halibut, orange roughy
mushroom, broccoli, kale, carrot, zucchini
basil, cilantro, oregano, parsley, dill
coleslaw mix, sauerkraut, bok choy
napa cabbage
Finally, we examine whether the substitution network encodes preferences for one ingredient over another, as evidenced by the relative ratings of similar recipes, one which
contains an original ingredient, and another which implements a substitution. To test this hypothesis, we construct
a preference network, where one ingredient is preferred to
another in terms of received ratings, and is constructed by
creating an edge (a, b) between a pair of ingredients, where a
and b are listed in two recipes X and Y respectively, if recipe
ratings RX > RY . For example, if recipe X includes beef,
ketchup and cheese, and recipe Y contains beef and pickles, then this recipe pair contributes to two edges: one from
pickles to ketchup, and the other from pickles to cheese. The
aggregate edge weights are defined based on PMI. Because
PMI is a symmetric quantity (PMI(a; b) = PMI(b; a)), we
introduce a directed PMI measure to cope with the directionality of the preference network:
p(a b)
,
PMI(a b) = log
p(a)p(b)
where
p(a b) =
6.
RECIPE RECOMMENDATION
We use the above insights to uncover novel recommendation algorithms suitable for recipe recommendations. We
use ingredients and the relationships encoded between them
in ingredient networks as our main feature sets to predict
recipe ratings, and compare them against features encoding nutrition information, as well as other baseline features
such as cooking methods, and preparation and cook time.
vegetable shortening,..
pumpkin seed,..
lemon cake mix,..
baking powder,..
dijon mustard,..
black olive,..
golden syrup,..
lemonade,..
graham cracker,..
coconut milk,..
almond extract,..
vanilla,..
peach schnapp,..
cranberry,..
strawberry,..
almond,..
milk,..
lemon juice,..
cinnamon,..
apple juice,..
bread,..chocolate chip,..
corn chip,..
olive oil,..
sour cream,.. apple,..
white wine,.. champagne,..
flour,..
cottage cheese,..
egg,..
chicken broth,..
garlic,.. sauce,..
sweet potato,..
onion,..
tomato,..
brown rice,..
celery,..
pepper,..
spaghetti sauce,..
hot,..
cheese,.. chicken,..
spinach,..
seasoning,..
black bean,..
red potato,..
italian seasoning,..
cream of mushroom soup,..
sugar snap pea,..iceberg lettuce,.. curry powder,..
imitation crab meat,..
pickle,..
quinoa,..
tilapia,.. cabbage,..sea scallop,.. smoked paprika,..
hoagie roll,..
honey,..
pie crust,..
whipping cream
ginger root
evaporated milk
heavy cream
cardamom
cinnamon
milk
clove
whole milk
soy milk
skim milk
ginger
nutmeg
allspice
mace
6.1
combined
ing. networks
nutrition
full ingredients
0.80
0.75
0.70
0.65
baseline
0.60
Accuracy
group
1.0
1.0
nutrition
nutrition (6.5%)
carbs (20.9%)
cholesterol (17.7%)
0.8
calories (19.7%)
importance
importance
sodium (16.8%)
0.6
0.4
0.2
fat (12.4%)
0.4
0.0
20
40
60
80
100
feature
network
0.7
substitution (39.8%)
cooccurrence (30.9%)
0.6
complement (29.2%)
0.5
0.4
0.3
0.2
0.1
0.0
20
40
10
12
feature
Figure 9:
Relative importance of features in the
combined set. The individual items from nutrition information are very indicative in differentiating highly rated recipes, while most of the prediction
power comes from ingredient networks.
importance
fiber (12.3%)
0.2
0.0
60
80
100
feature
Figure 10: Relative importance of features representing the network structure. The substitution network has the strongest contribution (39.8%) to the
total importance of network features, and it also has
more influential features in the top 100 list, which
suggests that the substitution network is complementary to other features.
divided the dataset into a training set (2/3) and a testing
set (1/3). The prediction performance is evaluated based on
accuracy, and the feature performance is evaluated in terms
of relative importance [8]. For each single decision tree, one
of the input variables, xj , is used to partition the region associated with that node into two subregions in order to fit
to the response values. The squared relative importance of
variable xj is the sum of such squared improvements over
all internal nodes for which it was chosen as the splitting
variable, as:
X 2
ik I(splits on xj )
imp(j) =
k
6.2
0.6
Results
Figure 11:
Relative importance of features from
nutrition information. The carbs item is the most
influential feature in predicting higher-rated recipes.
.746. In contrast, the nutrition information and ingredient
networks are more effective (with accuracy .753 and .786, respectively). Both of them have much lower dimensions (from
tens to several hundreds), compared with the full ingredients
that are represented by more than 2000 dimensions (1000
ingredients per recipe in the pair). The ingredient network
features lead to impressive performance, close to the best
performance given by the combined set (.792), indicating
the power of network structures in recipe recommendation.
Figure 9 shows the influence of different features in the
combined feature set. Up to 100 features with the highest
relative importance are shown. The importance of a feature
group is summarized by how much the total importance is
contributed by all features in the set. For example, the
baseline consisting of cooking effort and cooking methods
contribute 8.9% to the overall performance. The individual
items from nutrition information are very indicative in differentiating highly-rated recipes, while most of the prediction
power comes from ingredient networks (84%).
Figure 10 shows the top 100 features from the three networks. In terms of the total importance of ingredient network features, the substitution network has slightly stronger
contribution (39.8%) than the other two networks, and it
also has more influential features in the top 100 list. This
suggests that the structural information extracted from the
substitution network is not only important but also complementary to information from other aspects.
Looking into the nutrition information (Fig. 11), we found
that carbohydrates are the most influential feature in predicting higher-rated recipes. Since carbohydrates comprise
around 50% or more of total calories, the high importance
of this feature interestingly suggests that a recipes rating
can be influenced by users concerns about nutrition and
diet. Another interesting observation is that, while individual nutrition items are powerful predictors, a higher prediction accuracy can be reached by using ingredient networks
alone, as shown in Fig. 8. This implies the information
about nutrition may have been encoded in the ingredient
network structure, e.g. substitutions of less healthful ingredients with healthier alternatives.
Constructing the ingredient network feature involves reducing high-dimensional network information through SVD,
as described in the previous section. The dimensionality can
be determined by cross-validation. As shown in Fig. 12, features with a very large dimension tend to overfit the training
0.80
0.79
Accuracy
0.78
network
0.77
combined
substitution
complement
cooccurrence
7.
0.76
10
20
30
40
50
60
70
Dimensions
Figure 12:
Prediction performance over reduced
dimensionality. The best performance is given by
reduced dimension k = 50 when combining all three
networks. In addition, using the information about
the complement network alone is more effective in
prediction than using other two networks.
Color Key
svd dimension
82
0.5
433
0.5
Value
splenda
olive oil
applesauce
honey
butter
brown sugar
milk
half and half
chicken broth
buttermilk
sour cream
evaporated milk
vanilla extract
vanilla
kale
almond
beef
cream of chicken soup
almond extract
chocolate pudding
lemon extract
lime juice
walnut
coconut extract
turkey
chicken
sausage
italian sausage
pork
chicken breast
194
65
splenda
olive oil
applesauce
honey
butter
brown sugar
milk
half and half
chicken broth
buttermilk
sour cream
evaporated milk
vanilla extract
vanilla
kale
almond
beef
cream of chicken soup
almond extract
chocolate pudding
lemon extract
lime juice
walnut
coconut extract
turkey
chicken
sausage
italian sausage
pork
chicken breast
6
19
43
Figure 13:
Influential substitution communities.
The matrix shows the most influential feature dimensions extracted from the substitution network.
For each dimension, the six representative ingredients with the highest intensity values are shown,
with colors indicating their intensity. These features
suggest that the communities of ingredient substitutes, such as the sweet and oil in the first dimension, are particularly informative in prediction.
data. Hence we chose k = 50 for the reduced dimension of
all three networks. The figure also shows that using the
information about the complement network alone is more
effective in prediction than using either the co-occurrence
and substitute networks, even in the case of low dimensions. Consistently, as shown in terms of relative importance
(Fig. 10), the substitution network alone is not the most effective, but it provides more complementary information in
the combined feature set.
Color Key
0.5
Value
ingredient
41
CONCLUSION
8.
ACKNOWLEDGMENTS
This work was supported by MURI award FA9550-08-10265 from the Air Force Office of Scientific Research. The
methodology used in this paper was developed with support from funding from the Army Research Office, MultiUniversity Research Initiative on Measuring, Understanding, and Responding to Covert Social Networks: Passive and
Active Tomography. The authors gratefully acknowledge D.
Lazer for support.
9.
REFERENCES