Академический Документы
Профессиональный Документы
Культура Документы
net/publication/223892117
CITATIONS READS
22 1,308
1 author:
Kenji Tanaka
The University of Tokyo
87 PUBLICATIONS 555 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Kenji Tanaka on 28 February 2019.
A sales forecasting model for new-released and nonlinear sales trend products
Kenji Tanaka *
Department of Systems Innovations, Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo Bunkyo-ku, Tokyo 113-8656, Japan
a r t i c l e i n f o a b s t r a c t
Keywords: This paper proposes a sales forecasting model for new-released products with a knowledge-based data-
Sales forecasting base. Though forecasting future demand is an essential part of business planning and operation, most
New-released products major forecasting methods are applied only to regular consuming items, showing gradual sales trend
Nonlinear caused by seasonal cycles. Most retail products are, however, irregular consuming products characterized
Book
by fluctuated and nonlinear sales trends. In this study, using high correlations between short- and long-
Consumer electronic device
term accumulated sales within similar products groups, a new forecasting model is presented. Based on
the correlation database of short- and long-term accumulations, the model provides a prediction of long-
term forecast using the sales result of the product’s very early release. For practical use, the model is
designed to deal with the following three points: accuracy; timing of forecast release; and the broad cov-
erage of items. As a case study, we applied the model to books and consumer electronics sold in Japan.
The model enables us to obtain a practical sales forecast throughout the lifecycle of the item one or
two week after its release, and furthermore provides valuable information on reprint decision-making.
This experiment will prove the reliability in accuracy and efficiency of the proposed method in compar-
ison with existing established ones.
Ó 2010 Elsevier Ltd. All rights reserved.
0957-4174/$ - see front matter Ó 2010 Elsevier Ltd. All rights reserved.
doi:10.1016/j.eswa.2010.04.032
7388 K. Tanaka / Expert Systems with Applications 37 (2010) 7387–7393
ratio of book return. According to annual report of The Research among relevant item groups. Fig. 1 shows an example of conver-
Institute for Publications (2007), approximately 40% of copies are gence transition in ratio of domestic sales in 182 days and first
returned to publishers as surplus-goods in Japan, which causes 7 days (Japanese 7660 literature titles). These high correlations
serious problems. Nagai (2004) estimated the total amount of are shown in Japanese book sales results without major structural
wasted book value at $13 billion USD in 2003. One of the main rea- changes in the following fields: existing retail process and existing
sons behind this is the misleading forecasts of players involved. item categories which are used for relevant grouping.
Publishers, wholesale-distributors and shop-chains have each pro- The basic concept of proposed forecasting method is illustrated
duce forecasts of their own. in Fig. 2. Left figure of Fig. 2 is a scatterplot of accumulated sales
The main objective of this study is to propose a forecasting results of two sales periods (nth day and mth day) with a best fit
method which can deal with the following three features: forecast- line. Using the correlation, the inclination (k) of best fit line with-
ing new-released items sales; providing forecast on proper timing out intercept is figured out. The target mth day sales forecast ðRm Þ
for management decision; and the simple process without collab- is calculated with the inclination k and nth day sales result (Rn).
orative setting of each item in order for broad coverage of items. We call this method ‘‘NM method”. This method predicts the
This study will verify its performance by applying it to book items mth day accumulated sales forecast ðRm Þ based on the nth day
forecasting. The evaluation of its accuracy and potential of contin- accumulated sales result Rn
uous improvement will be stated.
Rm ¼ kðgroup;m;nÞ Rn ð1Þ
P P P
N i X mi X ni X mi i X ni
2. Proposed forecasting model kðgroup;m;nÞ ¼ P iP 2 ð2Þ
2
N i ðX ni Þ X
i mi
2.1. Proposed method
mth and nth are the number of days after the item release, Rn is an
Miyata, Tanaka, Sato, and Nishi (2008a, 2008b) reported that accumulated sales at nth day after the item release. ðRm Þ is the
there remains a high correlation between two sales-period results predicted value of Rm. k(group,n,m) is the NM coefficient of mth day
Titles Titles
200 200
100 100
Day1
50 50
0 0
1 5 9 13 17 21 25 29 33 37 1 5 9 13 17 21 25 29 33 37
Accumulated sales multiples of day182 by day1 Accumulated sales multiples of day182 by day5
Titles Titles
200 200 Day7
150 150
50 50
0 0
1 5 9 13 17 21 25 29 33 37 1 5 9 13 17 21 25 29 33 37
Accumulated sales multiples of day182 by day3 Accumulated sales multiples of day182 by day7
Fig. 1. Convergence example of the ratio of two accumulated sales in 182 days and short period sales.
nth-mth day accumulated sales results plot Concept of proposed forecast method
Sales Sales
(Items) (items)
Forecast
R*m = k Rn R*m
Sales
mth day
Result
Rn
R*m = k Rn
Table 1 day) are set in the first step. And then the latest accumulated sales
NM coefficient tables example. result Rn is obtained based on dynamic sales data. Thirdly, a refer-
m=1 2 3 4 5 6 182 ence group is selected and the coefficient k(group,n,m) is determined.
n=1 – 2.28 3.52 4.56 5.24 5.75 10.74 Rm is calculated according to Eq. (1). Rm should be updated when
2 – – 1.51 1.97 2.27 2.49 4.65 the dynamic database is updated.
3 – – – 1.33 1.54 1.69 3.16 Before the calculation process, the NM coefficient tables should
4 – – – – 1.17 1.30 2.43 be prepared with sample sales data. NM groups are defined by exist-
5 – – – – – 1.11 2.08
181 – – – – – – 1.00
ing categories or by experts. Then sample items’ data of each group
are collected, and NM coefficients of every nth–mth pair are calcu-
Sample items 71 titles. lated by Eq. (2). Reference groups are prepared in two patterns: gen-
erated by existing books categories or corrected by experts’
knowledge. For a newly released book, we have to decide the existing
forecast from nth day result in the ‘‘group”. N is the number of items group in which the book belongs to based on categories, publishers,
in the group. authors and so on. If the forecast is not adequate, new advanced NM
Each nth–mth pair has a NM coefficient in a group. NM coeffi- group can be then corrected by experts (Fig. 4). For example, certain
cient table is prepared for each group as shown in Table 1. key words or topics such as movies, TV shows, new prize can be used
in determining new advanced NM groups.
2.2. Proposed forecasting workflow In addition to original grouping, experts are able to use their
knowledge to maximize forecast accuracy.
Fig. 3 shows the forecasting workflow of the MN method. There Unlike the existing method which requires constant tunings,
are two main flows. One is the forecast calculation flow with dy- MN method allows the main forecast flow to be simple once the
namic sales database. The other is coefficient calculation flow. As coefficient tables are prepared, enabling to deal with various num-
for the former flow, the forecasting target item and period (mth bers of items.
by Existing Categories
Dynamic Define NM groups or Knowledge based features
Sales Data Obtain N th day
accumulated sales R N
-Tested Case
-Daily update Collect sample items
Sample Sales Data
of each group
-Sample cases
Select a Reference Group -Annual Data
of NM coefficient tables
Calculate each groups’
NM coefficient
Pick the NM coefficient
Of N th -M th day
Calculate NM
coefficient
NM coefficient table
Calculate Mth day forecast
R * m = k ( group ,m ,n) ⋅ R n
day=day+1
Output sales
forecast
ISBN-code 4-06-370431-9
Title Name Doctor K
Original
Author Kazuo Mahune
Title Data Category Publisher Series Author
items Publisher Kodansha
Release date 2007/07/01
Category Literature
2.3. Verification indicator category, a main category in the market, is selected as a testing
case.
In order to evaluate the accuracy of the method, two indicators
are used. One is mean absolute percentage error (MAPE);
3.1.1. Conditions of verification
n
1X
Ri ðmth dayÞ 1 100 The case data was collected by a Japanese leading book whole-
MAPE ð%Þ ¼ ð3Þ
N i¼1 Ri ðmth dayÞ saler company. Since its share holds half of the Japanese book mar-
ket, its data is considered as a reliable representative of the
Ri is the actual sales volume of the item ‘‘i” on mth day after release. industry. The daily sales data of all 7660 literature titles published
ðRi Þ is the forecasted sales volume of the item ‘‘i” on mth day after in 2006 in Japan are collected as sample cases. Based on this, we
release. N is the number of test items. predicted sales forecasts of 70 titles published by one company
The other indicator is the Proportion of Items with error rate in June 2007.
Under a certain Criteria (PIUC). This criterion is determined in ref- Since the 3rd–7th day is the first decision-making timeframe for
erence to the return goods ratio of the items. It indicates the cov- reprinting (Fig. 5), the forecast of nth (first 7 days) to mth (182th
erage range of items suitable to the forecasting model. day) is used for verification. Most books have their sales peaks
within three weeks after their releases, and publishers have to de-
3. Results cide how many copies to reprint without losing their business
chances. The target number of reprinting is decided with 6th
The MN model is applied to Japanese book industry and Japa- month forecast. According to Publishing Science Research Institute
nese consumer electronic devices industry. In order to be practical, (2007) the present book return ratio is about 40%. Then the target
helpful tools for management decision, the real market data and accuracy level is set at 40%.
decision-making timeframe were used. One of the key points of proposed method is to forecast not
weekly or daily sales but accumulated sales at certain point. For
3.1. Book products forecasting case comparison, two daily time series sales forecasting models, expo-
nential smoothing method (a = 0.5) and moving average method
In this section, the proposed NM forecasting method is verified (3-day average), are adopted. The accumulated forecast is calcu-
through its application to the Japanese book industry. Literature lated from their daily forecast. The duration of moving average
Sales Target
(items) Estimate
Forecasting Period
Period
Accumulated
Sales
Daily
Sales
period and the smoothing constant a is determined by the 7660 method is 546% (moving average), 204% (exponential method)
sample case analysis. The 3 days among 1- to 7-day cases and 0.5 and 41% (NM method Publisher group).
among 0–1.0 cases obtained the best MAPE results. Since there The results of PIUC on the 7th day are indicated in Table 2. The
are only 7-day data points for calculation, complex models are tend PIUC of NM method is 59%, which is ten times as many as those of
to amplify the small noise. That is one reason why simple two the other two methods. Thus, the NM method proved its perfor-
methods are used. In addition to that, those two methods are com- mance, while the existing method’s error rates never reached few-
mon methods for Japanese book wholesaler and publishers. er than 40% for over 90% of the titles.
For NM method forecasting, two existing-category-groups and
five knowledge-based groups were prepared as NM reference
groups. In other words, each item belonged to two existing-cate- 3.1.3. Accuracy improvement by grouping
gory-groups and one knowledge-based group and therefore was The performance of NM forecasting method is determined by
tested in three different ways. the reference grouping. In this section, a comparison is made be-
tween the existing group and knowledge-based group to recognize
whether there is room for improvement by grouping. It enables the
NM methods to incorporate the effective knowledge on the sales
3.1.2. Comparison to existing forecasting methods
trend.
Forecast result examples of Title A and Title B during the first
There are various kinds of grouping axis: season of released,
three weeks are shown in Fig. 6. The difference between the fore-
the author, publisher, volume of print, price, appearance of cover
cast curve of each method and sales result is the number of error
and so on. All of these attributes contribute to accuracy improve-
books. Title A is the top sales title of the 70 titles. Title B is the
ment to some extent. Among those grouping axis, we focus on
35th sales title (average title). Publisher group is referred to the
the knowledge-based grouping as first step. The groups gener-
NM method forecasting here. Because of their sales peak of each ti-
ated by wholesaler experts, who are in charge of company inter-
tle, error numbers of books of both existing forecasting methods
nal stock categories, are adopted. Various factors are taken into
are over 10 times as many as sales result.
account by them, but main ruling factors are found as follows:
The comparison of MAPE transitions during first 14 days are
target customer, shelf category at book store and publisher pro-
shown in Fig. 7. While the existing forecasting methods have an er-
motion process. According to their rules, the knowledge-based
ror range up to 1000%, the NM methods showed the high accuracy
categories are prepared in addition to the two existing groups
performance throughout first 14 days. The 7th day MAPE of each
of all literature and publisher A. They are general, comics, teens,
encyclopedic and academic (Fig. 8). As Table 3 shows, the stan-
1000% dard deviations of teens, comic, academic and encyclopedic
900% NM Publisher groups are succeeded to be stabilized compared with that of for-
800% mer groups.
Moving Average The results of MAPE are shown in Table 4 and Fig. 9. The MAPE
700%
600%
of knowledge-based groups are always superior to those of other
Exponetial smoothing
two groups, all literature group and Publisher group. The MAPE im-
MAPE
500%
prove by 15% from 41% of Publisher group to 25%. It needs only two
400%
days for MAPE to reach under 40%. The results of PIUC are shown in
300%
Table 5. The PIUC has risen by 28% from 59% of Publisher group to
200%
87%.
100%
0%
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Original Group Knowledge based 5 categories
N th day (the forecast is calculated) Sales
Main Customer Peak
Fig. 7. MAPE transition of 70 titles.
General Novels Public 2-6
Table 3
Knowledge-based NM reference groups.
Reference groups All literature Publisher A Generala Teensa Comica Encyclopedica Accademica
NM reference groups and standard deviations
Number of reference cases 7206 664 281 71 117 73 63
Number of tested cases 69 69 30 13 10 8 8
Standard deviationb 5.88 1.55 1.75 0.41 1.04 1.41 0.75
a
Knowledge-based reference groups by wholesaler experts.
b
Standard deviation is calculated from the 182-day sales by 7 days.
7392 K. Tanaka / Expert Systems with Applications 37 (2010) 7387–7393
Forecasting methods MAa Exp.b NM NM MN In this section, the NM method is applied to consumer electronic
(%) (%) (all (publisher) (knowledge- products sales forecast to prove its generality over the industry. The
novels) (%) basedc) (%) sales data is collected by an electronic device company from 2008
(%)
to the beginning of 2009. There were 21 new-released items during
Publisher all 72 cases 546 204 61 41 25 the period. We tested the latest two items with other 19 sample
Knowledge-based group data. The accumulated sales forecast at the 100th day is predicted
General (30 cases) 492 194 33 26 29 based on 10 days sales result. We prepared three NM groups gener-
Teens (13 cases) 856 325 146 98 13
Comic (10 cases) 581 196 72 45 21
ated by existing product features in advance. The following three
Encyclopedic (8 cases) 474 171 36 23 49 are the groups: all 21 items, OEM group and usage group. OEM
Academic (8 cases) 271 92 41 19 12 group is divided into five OEM manufacturers, and usage group
Average 535 196 66 42 25 has four categories based on marketing targets. Two existing estab-
a
lished forecasting methods, exponential smoothing method
MA; moving average.
b (a = 0.5) and moving average method (3-day average), are calcu-
Exp; exponential smoothing.
c
NM (knowledge-based group) is calculated with each reference group NM lated to compare with the model. The return goods ratio of the
coefficient. items is about 30–40%. We set the target forecast at 30%.
The result of MAPE is indicated in Table 6. NM method (usage
group) maintains within 20% MAPE after 4th day. According to
the two test case results, NM method was efficient in practical
250% reproduction decision-making compared to the existing two
NM All literature
methods.
200%
NM Publisher
4. Conclusion
150% NM Knowledge based
MAPE
Table 6
MAPE transition of electronic consumer devices.
Forecasting method Day 1 (%) Day 2 (%) Day 3 (%) Day 4 (%) Day 5 (%) Day 6 (%) Day 7 (%) Day 8 (%) Day 9 (%) Day 10 (%)
NM all 21 items 32 31 31 26 23 18 16 20 21 19
NM OEM group 51 51 50 37 31 25 24 27 28 25
NM usage group 32 27 21 16 12 12 12 16 17 13
Moving average 501 388 313 249 214 182 160 151 144 133
Exponential smoothing 298 183 128 76 81 58 81 68 62 60
K. Tanaka / Expert Systems with Applications 37 (2010) 7387–7393 7393
or wholesalers, are preparing their trial use. Review of real time Elkateb, M. M., Solaiman, K., & Al-Turki, Y. (1998). A comparative study of medium-
weather-dependent load forecasting using enhanced artificial/fuzzy neural
marching experiment will be studied.
network and statistical techniques. Neurocomputing, 23, 3–13.
Faraway, J., & Chatfield, C. (1998). Time series forecasting with neural networks: A
comparative study using the airline data. Applied Statistics, 47, 231–250.
References Kirby, H. R., Watson, S. M., & Dougherty, M. S. (1997). Should we use neural
networks or statistical models for short-term motorway traffic forecasting?
International Journal of Forecasting, 12, 43–50.
Agrawal, D., & Schorling, C. (1996). Market share forecasting: An empirical Mentzer, J. T., & Bienstock, C. C. (1996). Sales forecasting, management. Thousand
comparison of artificial neural networks and multinomial logit model. Journal Oaks, CA: Sage.
of Retailing, 72(4), 383–407. Miyata, H., Tanaka, K., Sato, I., & Nishi, I. (2008a). Reprint recommendation
Callen, J. L., Kwan, C. Y., Yip, C. Y., & Yuan, Y. (1996). Neural network forecasting of processing system outputs specific number of reprint candidate book,
quarterly accounting earnings. International Journal of Forecasting, 12, 255–267. JP2008282269-A.
Chang, P. C., & Lai, C. Y. (2005). A hybrid system combining self organizing maps Miyata, H., Tanaka, K., Sato, I., & Nishi, I. (2008b). Sales forecast system for
with case-based reasoning in wholesaler’s new-release book forecasting. Expert estimating sales book, JP2008176662-A.
Systems with Applications, 29, 183–192. Nagai, S. (2004). Electric tag implication for publishing industry logistics. Japan
Chang, P. C., Lai, C. Y., & Lai, K. R. (2006). A hybrid system by evolving case based Publisher Infrastructure Centre Survey Report (pp. 1–22). Tokyo: Japan Publisher
reasoning with genetic algorithm in wholesaler’s returning book forecasting. Infrastructure Centre.
Decision Support System, 42, 1715–1729. Prybutok, V. R., Yi, J., & Mitchell, D. (2000). Comparison of neural network
Chang, P. C., & Liao, W. (2006). Combining SOM and fuzzy rule base for flow time models with ARIMA and regression models for prediction of Huston’s daily
prediction in semiconductor manufacturing factory. Applied Soft Computing, maximum ozone concentration. European Journal of Operations Research, 122,
6(2), 198–206. 31–40.
Chang, P. C., & Liu, C. H. (2008). A TSK type fuzzy rule based system for stock price Publishing Science Research Institute (2007). Annual report of publishing indicator
prediction. Expert Systems with Applications, 34(1), 135–144. 2007. Japan: Domestic Publishing Association.
Chu, C.-W., & Zhang, G. P. (2003). A comparative study of linear and nonlinear The Research Institute for Publications (2007). The annual report of publications 2007
models for aggregate retail sales forecasting. International Journal of Production (p. 5). Japan: The Research Institute for Publications.
Economics, 86, 217–231. Trappey, V. C., & Wu, H.-Y. (2008). An evaluation of the time-varying extended
Darbellay, G. A., & Slama, M. (2000). Forecasting consumers’ expenditure: A logistic, simple logistic, and Gompertz models for forecasting short product
comparison between econometric and neural network models. International lifecycles. Advanced Engineering Informatics, 22(4), 421–430.
Journal of Forecasting, 12, 255–267.