Академический Документы
Профессиональный Документы
Культура Документы
FOOD TECHNOLOGY
Variables Selected
A great many variables could be
considered. Hence, it was necessary to
be selective and try to choose the more
important ones.
l Test Products. The test product
was not really a source of variation, but
remained constant throughout the main
series of experiments. We settled on a
sweetness difference in an orange
drink. One can reliably predict that
children will like a sweeter drink, at
least within the normal range. This
proved to be the case.
Preliminary testing of drinks with
various sweetness differences indicated
the adjustments needed. For example, a
drink sweetened with the recommended
amount of sugar com- pared to one
made with only 50% of that amount
produced highly significant differences
no matter what rating scale was used.
Needed was a difference that was
definite but not overwhelming, so that
the possible effects of the variations of
interest could emerge. The final choice
was an orange- flavored drink sweetened with the recommended amount of
sugar, compared to a drink with 80% of
that amount.
l Scale Type. Differences in scale
type were the main issue ad- dressed in
these experiments. After preliminary
work with older children, we concentrated on three scale types (Fig. 3) the
standard hedonic scale with the usual
verbal categories, a pictorial or face
scale, and a child-oriented verbal scale
we developed.
Over the years, researchers have
investigated test language suitable for
children. After reviewing childThe author is President, Peryam & Kroll,
Marketing and Sensory Research, 6323 N.
Avondale Ave., Suite 121, Chicago, IL 60631
P&K scale
Like extremely
Super good
Really good
Like moderately
Good
Like slightly
Dislike slightly
Dislike moderately
Bad
Really bad
Dislike extremely
Super bad
Face scale
Fig. 3 Three Types of Rating Scale Were Used: the traditional hedonic scale, the
P&K scale developed for this study, and the typical face scale. After testing, scale
values of 1 to 9 were assigned (starting with 1 at the top) for the purposes of
FOOD TECHNOLOGY 79
80 FOOD TECHNOLOGY
Can children discriminate? How far down the age scale does the
capacity for discrimination exist?
There has never been much argument here. Children can definitely
discriminate. At least they have preferences. Observations of the
behavior of even infants indicate the capability of choice in terms of
rejection and acceptance.
l About 1955, investigators at Eli Lilly, Inc., developed a procedure
for working with children 2 3 years old to evaluate formulas for
vitamin preparations (Peryam, 1989). They used one-on-one interviewing and the paired-comparison method and claimed to have
obtained results useful in product development.
l Investigators at the University of Florida did extensive testing of
various citrus products with preschool children 6ges 3 5 (Morse,
1953). They found lots of discrimination, as well as puzzling aberrations.
They used, and endorsed, paired comparisons, which produced the only
meaningful results. However, they also tried a method which was
essentially the triangle test, although not labeled as such. Their conclusion that the method was too complicated for kids should not surprise
anyone.
l Work with preschoolers ages 3 5 used fruit as stimuli and an
interesting variation of the rank-order method (Birch, 1979). The child
was presented with a number of different kinds of fruit and asked to
select the one liked best. This was then removed, and the one liked best
among the rest was chosen, and so on. Whatever the utility of the
findings, there was discrimination, which replicate testing showed was
reliable.
l Colwill (1987) reviewed scaling methods for obtaining information
about consumers' likes and dislikes. He recommended using picture
scales, preferably with five or seven points, for testing preliterate
children.
Can one use a measuring device more sophisticated than simple paired
comparisons? Can children differentiate degrees of liking and/or
disliking?
Usually investigators have found that children do have such ability, but
the extent of that ability, as well as how it might be affected by any one
of many variables, is seldom considered.
l Some years ago, Bert Krieger, a researcher with a candy manufacturer, was faced with the problem of evaluating formulation changes in
chocolate bars (Moskowitz, 1985). He dealt with children 5 7 years old
as well as older children, using a picture scale that showed the Snoopy
cartoon character in a series of nine poses ranging from up-eared elation
to droopy disgust. His subjects were able to discriminate.
l Another researcher (Wells, 1965) used a scaling method to evaluate
children's feelings about cereals. He was not concerned with the foods as
eaten, but evaluated children's ideas about familiar cereals and their
feelings about TV commercials. Some of the subjects were in the 5 7
age range. The study used 7-point face scales showing a youngster (a
boy for boys, a girl for girls) in poses ranging from grinning happiness to
hold-the-nose distaste. The children could discriminate, and the results
were meaningful.
Are the results of testing children useful in solving typical product
development problems?
The sponsors must be getting something useful, or why would so much
be attempted? Some of the published studies actually address the
question, e.g., the previously cited work by Krieger, who achieved
comparative evaluation of formulas for chocolate bars.
Summary
Briefly summarizing the literature, we note that:
l There is consensus that children can discriminate, particularly in
regard to degree of liking.
l Children are able to show degree of preference if the proper
measuring device is used.
l Children can provide useful information about products if the right
methods are employed.
l Children require special handling, i.e., handling that is different
from the procedures routinely employed with adults. One must pay
attention to such things as gaining confidence, providing motivation, and
expressing tasks in language children understand. This recognition
appears throughout the literature.
Testing Procedure
The test subjects were prerecruited
from families on our extensive roster of
consumer panelists. Usually, the
computer knows which families have
children and their ages. All had to like
orange drinks, which was no problem.
Otherwise the only concern was age,
sex, and availability to fit into the
schedule. An important proviso was that
no child should be invited to participate
in more than one test, which would
raise questions about training effect.
In all cases, a subject tried the pair of
samples, high sweet vs low sweet,
twice, using a different scale for each
pair, then made a paired-comparison
choice after each pair. Except for those
on the mode of presentation, the
experiments included all three scale
types hedonic, P&K, and face. The
design required that the scales be used
equally often and appear equally often
as the first or second pair. Furthermore,
for each scale type the high-sweet and
low- sweet samples were served first or
second equally often.
Sex differences did not seem important in the context of this investigation,
but our recruiters attempted to have
equal numbers of girls and boys in each
of the age groups. This was not
achieved exactly, but it was close. They
also tried to get an even distribution of
ages within each age group. Again, this
was not exact but was very close.
The drinks were prepared in quantity
ahead of time, chilled to refrigerator
temperature, and held at that temperature throughout testing. They were
poured just before serving. A sample as
served was about 1% oz of drink in a
small plastic glass. The samples were
identified by code number, but only for
the convenience of the operators and to
avoid errors. If a subject even saw the
codes, it was accidental.
All interviewing was conducted oneon-one, except for the sessions
82 FOOD TECHNOLOGY
Analyses
There is a qualification to note here.
Some findings, in the sense of the
objectives of the research, rely on what
may be called soft data; however, they
were derived from hard data.
l Hard Data. For the paired
comparison, the significance of the
proportions of choice was deter- mined
by the z-test. For the scalar measures,
the significance of the difference
between the average rating for the highsweet and low-sweet drinks was
determined using the t-by-difference
test, which was natural, since each
subject had tried both samples. Using
the variances of the distributions was
also considered, but the figures were
volatile and hard to interpret. With
scales of this kind, the variance is
highly dependent on the average rating,
being quite low when the upper end of
the scale is approached, but increasing
Results
What, if anything, was discovered in
this study? Are any conclusions
definitive, settling certain points once
and for all? Not likely! But there are
results that can direct future research on
the subject.
l Paired-Comparison. The paired
comparisons were always made after
the pair of drinks had been presented
and rated. The results, summarized
across all tests, are shown in Table l.
Overall, there was a highly significant
difference well below the 0.1% level
which was due in part to the large
number of subjects (N). As expected,
the high-sweet sample was preferred,
which validated the product variable.
Other conclusions come from comparing different subgroups.
Test order, whether the first or second
pair of the session, made no difference.
There was no difference in discrimination between boys and girls.
Children 8 10 years old were
definitely more discriminating than the
younger kids, who failed to establish a
significant difference. Their failure
might have been due to interference by
the scaling task. The difference between
ages might have been expected.
Scale type may also have made a
difference, although evidence is
borderline. When the comparison was
84 FOOD TECHNOLOGY
Continued on page 86
References
86 FOOD TECHNOLOGY