Available online at www.sciencedirect.com

ScienceDirect
Procedia Computer Science 100 (2016) 1071–1084

Conference on ENTERprise Information Systems / International Conference on Project
MANagement / Conference on Health and Social Care Information Systems and Technologies,
CENTERIS / ProjMAN / HCist 2016, October 2016

Big Data Analytics in Support of the Decision Making Process
Nada Elgendy a,*, Ahmed Elragal a,b

a Department of Business Informatics & Operations Management, German University in Cairo (GUC), Cairo, Egypt
b Department of Computer Science, Electrical, and Space Engineering, University of Technology, Luleå, Sweden

Abstract

Information is a key success factor influencing the performance of decision makers, specifically the quality of their decisions.
Nowadays, sheer amounts of data are available for organizations to analyze. Data is considered the raw material of the 21st century,
and abundance is assumed with today's [...] billion devices [aka Things] already connected to the Internet. Accordingly, solutions
need to be studied and provided in order to handle and extract value and knowledge from these datasets. Furthermore, decision
makers need to be able to gain valuable insights from such rapidly changing data of high volume, velocity, variety, veracity, and
value by using big data analytics. This paper aims to research how big data analytics can be integrated into the decision making
process. Accordingly, using a design science methodology, the "Big - Data, Analytics, and Decisions" (B-DAD) framework was
developed in order to map big data tools, architectures, and analytics to the different decision making phases. The ultimate objective
and contribution of the framework is using big data analytics to enhance and support decision making in organizations, by
integrating big data analytics into the decision making process. Consequently, an experiment in the retail industry was administered
to test the framework. Accordingly, results showed added value when integrating big data analytics into the decision making
process.
 
© 2016 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license
(http://creativecommons.org/licenses/by-nc-nd/4.0/).
Peer-review under responsibility of the organizing committee of CENTERIS 2016

Keywords: Big data analytics; decision making; B-DAD framework; design science



* Corresponding author.
E-mail address: nadaelgendy@guc.edu.eg

ISSN 1877-0509
doi:10.1016/j.procs.2016.09.251
Nada Elgendy and Ahmed Elragal / Procedia Computer Science 100 (2016) 1071–1084

1. Introduction

Digital technologies have changed the way organizations are built and function, triggering the need for novel
solutions and a wide array of functioning applications (Brunswicker et al.). As storage capabilities have
exponentially increased and methods of data collection have changed, enormous amounts of data have become easily
available. Every second, more and more data is being created from various sources. This data needs new ways to be
stored and analyzed in order to extract value. Furthermore, organizations need to get as much value as possible from
the huge amounts of stored data (Elgendy and Elragal). Additionally, companies and individuals possess more
technologies and devices, which create and capture more data in different categories. A single user nowadays can own
a desktop, laptop, smartphone, tablet, and more, where each device carries very large amounts of valuable data. These
types of data are now being referred to as big data, or data with such volume, variety, and velocity that it becomes
difficult to manage with current tools (Russom).
Big data can include text with social sentiments, clickstreams, audio and video, website log files, as well as spatial
and geolocation data, multimedia, XML data, etc. (Chang et al.). Such data requires a new type of big data
analytics due to its size, variety, and rapid change, as well as different storage and analysis methods. Additionally,
these enormous amounts of big data need to be properly analyzed in order for valuable and pertaining information to
be extracted. Therefore, with the increasing demand for utilizing big data and taking advantage of its opportunities,
organizations are seeking clear and simple solutions and guidelines for big data management. Accordingly, the
research question of this paper is: How to integrate big data analytics into the decision making process? The aim of
this research is to develop and test a framework for the integration of big data tools and techniques into the decision
making process. By adopting this framework, decision makers should be able to enhance the quality of the decision
making process, and potentially the quality of the decision as a byproduct. The framework incorporates different
important aspects of big data analytics, such as the data analytics lifecycle, necessary infrastructure and architecture,
as well as required tools, all mapped to the different decision making phases.

2. Background

The term "Big Data" applies to datasets that grow so large that they become awkward to work with using traditional
database management systems. Moreover, the size of big data has expanded beyond the ability of commonly used
software tools and storage systems to capture, store, manage, as well as process the data within a tolerable elapsed
time (Kubick). Three main features characterize big data: volume, variety, and velocity, or the three V's (Fan
et al.). The volume of the data is its size, while velocity refers to the rate with which data is changing, or how
often it is created. Finally, variety regards the different formats and types of data, as well as the different kinds of uses
and ways of analyzing the data. Additionally, IBM added a fourth V, which is veracity (Jagadish). Additionally,
the value of the data has also been considered by some researchers to be a fifth V (Chang et al.).
Big data analytics is where advanced analytic techniques are applied on big data sets. Analytics based on large
data samples can help reveal and leverage business change. However, the larger the set of data, the more difficult it
becomes to manage (Russom). Sophisticated analytics can substantially improve decision making, minimize
risks, and uncover valuable insights from the data that would otherwise remain hidden. Sometimes decisions do not
necessarily need to be automated, but rather augmented by analyzing huge, entire datasets using big data techniques
and technologies, instead of just smaller samples that individuals with spreadsheets can handle and understand
(Manyika et al.).
Moreover, the managerial decision making process has been an important and thoroughly covered topic in research
throughout the years. Simon's four phases of decision making (intelligence, design, choice, and implementation) are
popularly adopted by decision makers in different domains (Turban et al.). Furthermore, according to Jagadish,
there are many steps to the big data analysis pipeline, and each step comes with its challenges and required
decisions. These decisions range from what data to acquire, to how to represent the data in a suitable manner for
analysis after extracting, cleaning, and integrating the data with other sources, to how to make decisions based on the
results of the analysis. In order for the big data analysis to produce real value, all of these challenges and decisions
have to be effectively planned and accommodated for.

Decision makers are constantly on the lookout for chances to make more informed decisions, and they need to be
able to understand and utilize big data in order to further enhance the traditional decision making process. Thus,
research needs to cover how the big data analytics tools and methods can be integrated with the decision making
process in order to enhance decision making and provide valuable insights for decision makers.

3. The B-DAD Framework

Our research follows the design science methodology, so accordingly, Peffers et al.'s six-stage design
science process is adopted for building and evaluating the framework. The first two stages, identifying the problem
and defining the objectives of a solution, were completed through exploratory research. Consequently, using the
applicable knowledge from the knowledge base and the business needs of the environment, an artifact, namely the
B-DAD framework, was developed. This was attained through perusing the literature and research, as well as testing
some of the big data analytics technologies to add to the framework. Accordingly, both research rigor and relevance
are attained.
Subsequently, after the B-DAD framework is developed, it is evaluated and demonstrated by using it to apply big
data analytics in order to support decision making. This demonstration takes the form of experiments on real data
and actual business cases in order to provide a sufficiently relevant context for evaluation. Finally, the evaluation of
the framework was accomplished by observing how well the framework performed in applying big data analytics throughout
the decision making process in order to support making a more insightful decision. Additionally, the smoothness of
the process and the applicability of the framework in the different scenarios were observed. As a result, we iterated
back to the framework design and development phase in order to incorporate the modifications resulting from the
experiments into the final B-DAD framework. The process is elaborated and communicated in detail below.

3.1. Framework Development

The B-DAD, or the "Big - Data, Analytics, and Decisions" framework was developed in order to map big data
tools, architectures, and analytics to the different decision making phases. The "Big" is hyphenated because it refers
to the following three aspects as being big, not only the data, and additionally maps the incorporation of these aspects
together. Hence, the data is big, the analytics are big, and the resulting decisions are also big. Thorough analysis and
synthesis of relevant literature, investigating state-of-the-art technologies in big data analytics and practices, have
contributed to the development of our framework. However, the framework is in no way inclusive of all the big data
tools, technologies, and analytics, and rather serves as a conceptualization of some of the possible approaches to
performing big data analytics in support of the decision making process. Additionally, the framework assumes that
the decision domain is already known, and does not need to be first explored in order to extract a problem which needs
to be solved or a question which needs to be answered. The framework is depicted in Fig. 1.
The first phase of the decision making process is the intelligence phase, where data which can be used to identify
problems and opportunities is collected from internal and external data sources. In this phase, the sources of big data
need to be identified, and the data needs to be gathered from different sources, processed, stored, and migrated to the
end user. Accordingly, the first step in the framework is identifying the big data which will be used for the analysis.
The main difference in this step from Fayyad et al.'s KDD process lies in the diversity of the types of data
which will be identified and their various sources. In addition to relational data and common transactional or
operational data, there is social media data, text, images, and audio. Additionally, there is data which results as the
output of machines and devices such as system log files, sensor data, satellite data, and mobile or GPS data. Moreover,
geospatial data has become very important for analysis, along with internet data, clickstream files, and XML.
Such big data needs to be treated accordingly, so after the data sources and types of data required for the analysis
are defined, the chosen data is acquired and stored, similar to the acquiring phase in Fisher et al.'s big data
pipeline and Oracle's integrated information architecture. The acquired data can then be stored in any of the
big data storage and management tools. These tools can range from traditional DBMSs, such as the open source
MySQL or PostgreSQL, to EDWs and columnar or MPP databases such as Cassandra, PADB, and SAND.
Additionally, a distributed file system like HDFS can be used for storing big data, as well as NoSQL databases such
as MongoDB, CouchDB, or HBase, which is built on top of HDFS. In the framework, the examples of specific storage
tools are depicted in a slightly darker blue color than the generic technologies.

Fig. 1. B-DAD Framework

After the big data is acquired and stored, it is then organized, prepared, and processed, as in the data preparation
phase in EMC's data analytics lifecycle, the processing and transformation phases in the KDD process, and
the organizing phase in Oracle's integrated information architecture. This is achieved across a high-speed
network using ETL/ELT or big data processing tools. Hadoop and MapReduce, as well as in-memory management,
can be used for data processing. Moreover, the data can be queried and computations and processing can be applied
using several different languages, ranging from Pig and Hive, to R for statistical computing, to SQL and SQL-H for
directly accessing Hadoop data. Such tools, along with others, can enable big data discovery and preparation for the
desired analyses.
Some vendors have also provided a variety of tools, platforms, or appliances to support big data across the storage
and management, as well as the discovery and organization steps. These allow for a more comprehensive big data
solution with more features in a single package, rather than having to mix and match technologies. Examples include
Vertica, Greenplum, IBM Netezza, Teradata Aster, and SAP HANA.
The next phase in the decision making process is the design phase, where possible courses of action are developed
and analyzed through a conceptualization, or a representative model of the problem. The framework divides this phase
into three steps: model planning, data analytics, and analyzing. In the model planning step, a model for data analytics
is selected and planned. This is similar to the model planning phase in EMC's data analytics lifecycle, as well
as the model selection phase in the KDD process. In this step, the models and algorithms which are found to be
appropriate, based on the types of data available and the analyses or output intended, are selected and planned for. A
variety of some of the models and analyses which can be chosen are depicted in the framework.
Traditional data mining and advanced analytics techniques such as classification, clustering, regression, and
association rules can be chosen, along with machine learning and AI techniques such as neural networks, decision
trees, and pattern-based analytics. Moreover, time series analysis can be used for analyzing sequences of data points
which represent values at successive times. Furthermore, text analysis from documents or social media, social
network analysis, and sentiment analysis can also be selected if the big data is in the form of text or we are dealing
with social media data. Additionally, graph analyses can be used for representing complex networks, and path analyses
can describe directed dependencies among variables. Moreover, density-based or spatial analyses can be applied for
clustering dense areas or dealing with spatial or geographical data, and clickstream analyses can be used for web data
and analyzing mouse clicks.
Subsequently, in the data analytics step, the selected model is applied. It may also be accompanied with OLAP,
and predictive analytics can be further applied to analyze current and historical data and results, so as to make
predictions about the future. Furthermore, in-memory analytics and processing can be used with big data in order to
enhance and speed the access to and scoring of the analytic models. Several analytical tools and technologies can be
used in this step, such as HANA, Greenplum, Aster, Kognitio, Revolution R Enterprise (which is built upon the R
language), Teradata Warehouse Miner (TWM), MADlib, Mahout, RapidMiner, Radoop (which is a RapidMiner
extension that integrates the data analytics capabilities of Hive and Mahout to provide a data analytics solution for
Hadoop), as well as Pentaho, which can perform predictive analytics and OLAP, and can integrate Hadoop, as well as
NoSQL and analytic databases.
In the analyzing step, the output of the previous step and the results of the analytics are analyzed, similar to the
analyzing step in Oracle's integrated information architecture. Accordingly, the possible courses of action to
be taken are defined. These courses are then chosen from in the following phase.
Consequently, the next phase in the decision making process is the choice phase, where methods are used to
evaluate the impacts of the proposed solutions, or courses of action, from the design phase. In the framework, this
phase is divided into two steps: evaluate and decide. In the evaluate step, which is comparable to the KDD process'
evaluation in the interpretation/evaluation step, the proposed courses of action and their impact are evaluated and
prioritized. This could be done using reporting, dashboards, simulations of the solutions, what-if scenarios, cognitive
maps, heuristics, KPIs, as well as advanced or interactive data visualization. Some of the big data visualization tools
available include Gephi, which is mainly a graph-based visualizer and data explorer, Prefuse, Tableau, QlikView,
Spotfire, SAS Visual Analytics, Centrifuge, JMP, and ADVIZOR. Additionally, Pentaho also provides big data
visualization, as well as reporting and dashboard features.
Accordingly, the next step in the choice phase is to decide on the best course of action, similar to the decision step
in Oracle's integrated information architecture. This is where the decision actually takes place, based on the
results of evaluating the possible courses of action, and finally choosing the best or most appropriate one.
Finally, the last phase in the decision making process is the implementation phase, where the proposed solution
from the previous phase is implemented. In this step, the results of the choice are operationalized, or put to action, as
in the last phase of EMC's data analytics lifecycle. Hence, big data tools and technologies can be used in
monitoring the results of the decision, as well as in providing real-time or periodical feedback on the outcomes of the
implementation.

Fig. 2. Data Model

3.2. Framework Evaluation: Retail Experiment

After the development of the IT artifact, the framework then needs to be evaluated. Therefore, an experimental
evaluation method was chosen in order to test the B-DAD framework. Hence, an instantiation of the framework was
taken and applied to actual data. Moreover, some of the available solutions such as Aster, Cloudera, HDFS, TWM,
RapidMiner, Pentaho, Gephi, Tableau, and several DBMSs were tested in a lab experiment, and the integration and
flow between varieties of the provided tools were examined.
The experiment was performed in the area of retail in order to evaluate the B-DAD framework. Accordingly, the
decision domain is testing promotion effectiveness, and the impact of sentiments and social media on sales. Therefore,
the decision would be which products promotions should be offered on, when they should be offered, and whether
social media marketing campaigns are efficient and should be focused on or not. By analyzing the available data about
item purchases, as well as the feedback and posts of customers, in addition to their response to social media, such
knowledge should be gained in order to support our decisions. Each phase of the framework is elaborated below.
However, the implementation phase was not tested, as in that case the decision would have to actually be executed
and monitored over time, which would not be feasible within the scope of our experiment.

1) Intelligence Phase:
In the first phase of the framework, the intelligence phase, the big data which will be used needs to be collected. In
this experiment, a mix of relational data, social media data, and text is used. The data model is shown in Fig. 2. In
order to get relational data about retail purchases, we chose one of the largest hypermarkets in Egypt to acquire data
from their Point of Sales (POS) and Enterprise Resource Planning (ERP) systems. The hypermarket deals with more
than [...] items and has two large branches divided into different sections. The number of daily visitors can
reach approximately [...] visitors. Thus, for our experiment we took a sample of their POS and ERP data covering
the six months from January to June. However, the data was in Arabic, which made it very difficult to
deal with due to the encodings of the files and the records. The data was then stored in a Teradata DBMS using
Teradata loading tools.
As for the social media data, we needed customer posts and comments from the hypermarket's Facebook fan page.
No tweets related to the hypermarket were found on Twitter. Consequently, we used the Facebook API in order to
extract the posts and comments within the time duration of the sales data we have available. Accordingly, we gathered
and stored the fan page's statuses, posts, and related comments, and the fans' posts and their comments for the specified
time period.

2) Design Phase:
Subsequently, the second phase in the framework tested in the experiment is the design phase. This is where the
model planning takes place, and the relationships which need to be identified are defined. In order to find the
relationships between the different retail attributes we have, such as the discounts and purchases, several models were
planned for. The models and analyses used are briefly described below.
a) Visualization Analysis:
First of all, we needed to understand the distribution of the discounts, quantities purchased, and the total value of
item purchases across the different sections and branches, as well as over the six-month time period, so we started by
using Tableau for visualizing the relationships.
Through visualization, we could see the hypermarket branches and department sections with the most sales,
discounts, and profit. Moreover, a time-series analysis was performed to visualize the
relationship between the discounts offered, quantities sold, and the total value of sales across the given time period.
Some interesting information, such as the products and times affecting the peak sales, as well as their relationship
with the promotions and discounts offered across time, was extracted.
b) Correlation and Regression Analysis:
In order to further explore the relationship between the variables, a correlation analysis was performed on TWM.
In the resulting correlation matrix, we saw that the variables with the highest correlation to the discounts are the
quantity and the sales values. Moreover, logistic regression was performed on TWM to measure the relationship
between whether or not there is a discount and the remaining independent variables. Accordingly, the prediction of
having a discount or not was found to be based on nine independent variables, which were determined to be used in
the model.
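
The correlation step can be illustrated with a minimal sketch. It assumes the Pearson coefficient (the text does not say which coefficient TWM computes), and the column values below are hypothetical stand-ins, not the hypermarket's data.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


# Hypothetical aggregates (illustrative values only).
discount = [0.0, 5.0, 10.0, 15.0, 20.0]
quantity = [10.0, 14.0, 19.0, 27.0, 30.0]
unit_price = [3.0, 9.0, 2.0, 8.0, 4.0]

r_quantity = pearson(discount, quantity)   # strong positive relationship
r_price = pearson(discount, unit_price)    # close to zero
```

Computing this coefficient for every pair of variables gives a correlation matrix of the kind described above; the variables most correlated with the discount are those with the largest absolute value in the discount row.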
c) Cluster Analysis:
Subsequently, a cluster analysis was performed in order to group the items together based on their similarities
according to the discounts. We started by using the cluster analysis in TWM, with k-means being the chosen algorithm
and the number of clusters being two, since we are focusing on where to provide the discounts, which have the Boolean
values of 0 and 1 (no discount and discount). The clustering resulted in Cluster [...], which has [...] of the items, and
Cluster [...], which has [...] of the items.
Afterwards, clustering is performed with Weka, using the same k-means algorithm with k also equal to two
clusters according to the categories of the discount. The clusters are also scored using the classes to clusters
evaluation, where the discount is the class. Accordingly, after the cluster model is created from the training data, it is
evaluated by assigning each value of the class to a cluster and evaluating the correctness of clustering the points based
on whether the discount value of the point matches that of the class it was clustered in or not. Here, Cluster [...] has
[...] of the instances and Cluster [...] has [...] of the instances, which differs from the results of TWM. This may be
as TWM uses the Mahalanobis distance in calculating the distance between the points and the mean of each cluster,
while in Weka the Euclidean distance was used.
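
The classes to clusters evaluation described above can be sketched as follows. This is a simplified illustration under stated assumptions: each cluster is mapped independently to its majority class, which may differ from the mapping procedure Weka actually uses, and the cluster assignments and discount labels are hypothetical.

```python
from collections import Counter


def classes_to_clusters(cluster_ids, class_labels):
    """Map each cluster to its majority class and report the error rate.

    Simplification: clusters are mapped independently of one another.
    """
    counts_by_cluster = {}
    for cid, label in zip(cluster_ids, class_labels):
        counts_by_cluster.setdefault(cid, Counter())[label] += 1
    mapping = {
        cid: counts.most_common(1)[0][0]
        for cid, counts in counts_by_cluster.items()
    }
    errors = sum(
        1 for cid, label in zip(cluster_ids, class_labels)
        if mapping[cid] != label
    )
    return mapping, errors / len(class_labels)


# Hypothetical output of a two-cluster k-means run and the true discount flag.
cluster_ids = [0, 0, 0, 1, 1, 1, 1, 0]
has_discount = [0, 0, 1, 1, 1, 1, 0, 0]

mapping, error_rate = classes_to_clusters(cluster_ids, has_discount)
```

In this toy run one discounted item lands in the no-discount cluster and one undiscounted item lands in the discount cluster, so two of the eight instances count as incorrectly clustered.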


Fig. 3. K-Means Clustering Visualization
Fig. 4. EM Clustering Visualization

Fig. 3 represents the visualization of the clustered instances. The x-axis represents the item code, the y-axis
represents the total value, and the colors represent the discount category. The red points are the ones that were assigned
to Cluster [...], where they should not have a discount, while the blue points are the ones that were assigned to Cluster
[...], where they should have a discount. However, due to the classes to clusters evaluation, the squares are the
incorrectly clustered instances, where the red squares represent the items that actually do have a discount, but were
put in the no discount cluster, while the blue squares represent the items that don't have a discount, but were put in
the discount cluster. Thus, promotions could be put on the blue squares. Moreover, the errors, or increase in the amount
of red squares, may be due to the aspect that about [...] of the promotion items which the hypermarket discounts are
determined through a promotion plan with the suppliers, rather than by the hypermarket itself.
Additionally, by changing the clustering algorithm, we can find different results. Since having only two clusters
was not very representative, we can use the Expectation Maximization (EM) method instead of k-means. The EM
clustering method goes through each instance and assigns it with a probability distribution, in order to check the
probability of that instance belonging to each of the clusters. The number of clusters was determined by cross
validation. The algorithm divided the data into ten different clusters, where the overall percentages of the cluster
distributions are balanced.
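
To make the soft assignment concrete, here is a minimal sketch of EM for a one-dimensional Gaussian mixture. It is an illustration only: the number of components is fixed by hand rather than chosen by cross validation, the data and starting values are hypothetical, and Weka's EM clusterer handles many attributes and other details not shown here.

```python
import math


def gaussian_pdf(x, mean, var):
    """Density of a normal distribution with the given mean and variance."""
    return math.exp(-((x - mean) ** 2) / (2 * var)) / math.sqrt(2 * math.pi * var)


def em_1d(data, means, variances, weights, iterations=20):
    """Fit a 1-D Gaussian mixture; returns parameters and per-point probabilities."""
    means, variances, weights = list(means), list(variances), list(weights)
    k = len(means)
    resp = []
    for _ in range(iterations):
        # E-step: probability that each point belongs to each cluster.
        resp = []
        for x in data:
            dens = [weights[j] * gaussian_pdf(x, means[j], variances[j]) for j in range(k)]
            total = sum(dens)
            resp.append([d / total for d in dens])
        # M-step: re-estimate each cluster from its softly assigned points.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            spread = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj
            variances[j] = max(spread, 1e-6)  # floor avoids a zero variance
            weights[j] = nj / len(data)
    return means, variances, weights, resp


# Hypothetical total values forming two obvious groups.
values = [1.0, 1.2, 0.8, 5.0, 5.2, 4.8]
means, variances, weights, resp = em_1d(values, [0.0, 6.0], [1.0, 1.0], [0.5, 0.5])
```

Each row of `resp` is the probability distribution over clusters for one instance; assigning the instance to its most probable cluster gives a hard clustering like the one visualized for the EM results.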
Fig. 4 represents the visualization of the clustered instances based on the EM algorithm. The x-axis represents the
item code, the y-axis represents the total value, and the colors represent the ten clusters. As we can see, the blue
Cluster [...] instances have a higher item code, the magenta Cluster [...] instances have a lower total value, and the red
Cluster [...] instances have a higher total value.
d) Association Analysis:
The next analysis performed on the data is association rule mining in order to discover interesting relations and
detect the most common combinations between the variables of our data. We first converted the numerical code
attributes to nominal attributes, and then discretized the remaining numerical variables using equal frequency binning
in order to convert them to categorical variables for the analysis. Consequently, we then performed association rule
mining using Weka and its Apriori algorithm, with the specified minimum support being [...] and the minimum
confidence being [...]. If the support was raised, then no rules of value would result.
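
The two preparation and scoring ideas in this step, equal frequency binning and the support and confidence of a rule, can be sketched as below. This is not the Apriori search itself, only the measures it thresholds on; the records, attribute names, and bin count are hypothetical.

```python
def equal_frequency_bins(values, n_bins):
    """Assign a bin index to each value so that bins hold roughly equal counts.

    Note: tied values may fall into different bins in this simple version.
    """
    order = sorted(range(len(values)), key=lambda i: values[i])
    bins = [0] * len(values)
    for rank, i in enumerate(order):
        bins[i] = min(rank * n_bins // len(values), n_bins - 1)
    return bins


def support(records, itemset):
    """Fraction of records that contain every item in the itemset."""
    itemset = set(itemset)
    return sum(1 for r in records if itemset <= r) / len(records)


def confidence(records, antecedent, consequent):
    """Support of antecedent and consequent together, divided by antecedent support."""
    ante = support(records, antecedent)
    both = support(records, set(antecedent) | set(consequent))
    return both / ante if ante > 0 else 0.0


# Hypothetical discretized records, each a set of attribute=value items.
records = [
    {"quantity=low", "discount=no", "branch=2"},
    {"quantity=low", "discount=no", "branch=1"},
    {"quantity=high", "discount=yes", "branch=1"},
    {"quantity=low", "discount=no", "branch=2"},
    {"quantity=high", "discount=no", "branch=2"},
]

bins = equal_frequency_bins([5, 1, 9, 3, 7, 11], 3)
rule_support = support(records, {"quantity=low", "discount=no"})
rule_confidence = confidence(records, {"quantity=low"}, {"discount=no"})
```

A rule is kept only if its support and confidence both reach the chosen minimums, which is why raising the minimum support can leave no rules at all.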
Consequently, several interesting rules were extracted. For example, if the quantity purchased is low, then there is
no discount with a confidence of [...], and the branch is the second branch with a confidence of [...]. Additionally,
the Candy, Chocolate & Gum section doesn't have a discount on [...] of the products. Moreover, if the branch is
the first branch, then there is no discount on the products [...] of the time, while if it is the second branch, then it
doesn't have a discount on the products [...] of the time. Furthermore, [...] of the time, if the purchases are made from
the Cosmetics section, then they were made in the first branch. Many more association rules can be extracted from
the results; however, we could not extract rules based on the purchases of certain items together or perform a market
basket analysis, since the data did not include individual shopping cart purchases, and was rather aggregated daily
invoices.
e) Decision Tree:
Finally, a decision tree was built as a type of classification analysis. TWM was used to build the tree model by
splitting on the gain ratio. The dependent variable chosen was the Boolean discount variable, while the independent
variables were the branch, item code, quantity of the item purchased, the section and supplier code of the item, the
unit price, and the sales and total values. The confusion matrix of the decision tree shows that the accuracy of the
model is high, with the percentage of correct classifications being [...], and the percentage of incorrect
classifications being [...]. Accordingly, the resulting rules from the decision tree showed the factors that affect the
decision of whether or not an item should have a discount.
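
The gain ratio used as the splitting criterion can be sketched for a single categorical attribute. This shows the standard definition (information gain divided by split information); how TWM handles numeric attributes or missing values is not covered, and the sample rows are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def gain_ratio(feature_values, labels):
    """Information gain of splitting on the feature, divided by split information."""
    n = len(labels)
    groups = {}
    for value, label in zip(feature_values, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / n * entropy(g) for g in groups.values())
    info_gain = entropy(labels) - remainder
    split_info = entropy(feature_values)
    return info_gain / split_info if split_info > 0 else 0.0


# Hypothetical rows: the discount flag is fully determined by the branch
# and unrelated to the section.
discount = [1, 1, 0, 0]
branch = [1, 1, 2, 2]
section = ["a", "b", "a", "b"]

branch_ratio = gain_ratio(branch, discount)    # perfectly informative split
section_ratio = gain_ratio(section, discount)  # uninformative split
```

At each node the tree would split on the attribute with the highest gain ratio, here the branch.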
f) Social Media Analysis & Text Mining:
Afterwards, we needed to perform text mining on the Facebook data to analyze what people were saying about the
hypermarket and its products, as well as their responses to the hypermarket's posts and marketing. Accordingly, we
needed to find the most frequent words used in the posts, as well as the associations between these words. Hence, we
could gain knowledge about what sentiments, products, or branches people were discussing and what they were
saying about them. For example, if we found that people were frequently dissatisfied with or complaining about a
certain product, or they were happy with a certain promotion, this knowledge can be added to our previous models in
order to enhance our decision.
We started by using RapidMiner in order to perform the text mining. The process starts by reading the data from
the database and selecting the desired attributes for the analysis, and appending the selected data into an example set.
Afterwards, the data is processed as a document, where the text is tokenized and an operator is used for filtering the
Arabic stopwords and removing them from the document. Next, the n-grams are generated and the tokens are filtered
by length to remove too long or too short tokens. After the document processing, the numerical and nominal attributes
need to be converted to binomial in order to perform association rule mining. Finally, the FP-Growth algorithm is
used to extract the frequent itemsets and an operator is used to create association rules.
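
The document-processing steps above (tokenize, remove stopwords, generate n-grams, filter by length) can be sketched in a few lines. This is a plain-Python stand-in for the RapidMiner operators, not their implementation; the function name, the length limits, the whitespace tokenizer, and the English sample text and stopword list are all assumptions made for illustration (the actual posts were in Arabic).

```python
def preprocess(text, stopwords, min_len=2, max_len=25, max_n=2):
    """Tokenize, drop stopwords, add n-grams up to max_n, then filter by length."""
    tokens = [t for t in text.lower().split() if t not in stopwords]
    terms = list(tokens)
    for n in range(2, max_n + 1):
        # Join each run of n consecutive tokens into a single n-gram term.
        terms += ["_".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]
    return [t for t in terms if min_len <= len(t) <= max_len]


# Hypothetical English stand-ins for an Arabic post and stopword list.
stopwords = {"the", "on", "is"}
terms = preprocess("The offer on olive oil is valid until Friday", stopwords)
```

Each post then becomes a set of terms, which is the binomial (present or absent) representation that frequent itemset mining expects.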
However, after several trials, each running for days, the process never got past the FP-Growth operator. The longest run
time was over two days, yet no results were created. This is most likely due to the Arabic language being in Unicode
and being very difficult to process. Additionally, the FP-Growth operator may not be able to work properly with
Arabic terms.
On the other hand, the results of the document processing were analyzed, where the Term Frequency-Inverse
Document Frequency (TF-IDF) algorithm gives the relative importance of a word to a given document, compared to
the importance of that word to all of the documents. Accordingly, we could see the number of occurrences of the most
frequent words in the Facebook posts and comments. For example, several interesting words occurred frequently such
as "promotion", "quantity", "valid until", "while goods last", "oil", "offer", "good", "sugar", "Ramadan", "branch",
"oranges", "beautiful", "prices", "Friday", "olives", "olive oil", "lavender", "kilo", "watermelon", "pomegranate",
"magazine", etc.
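
A minimal sketch of TF-IDF weighting follows. It uses one common variant (relative term frequency times the natural log of inverse document frequency); RapidMiner's exact weighting and normalization may differ, and the tokenized posts are hypothetical.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: weight} dict per document (each document is a token list)."""
    n_docs = len(documents)
    # Number of documents in which each term appears at least once.
    doc_freq = Counter(term for doc in documents for term in set(doc))
    scores = []
    for doc in documents:
        counts = Counter(doc)
        scores.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Hypothetical tokenized posts.
posts = [
    ["offer", "oil", "offer"],
    ["offer", "sugar"],
    ["offer", "prices", "oil"],
]
weights = tf_idf(posts)
```

A word that appears in every post, such as "offer" here, gets a weight of zero, while a word concentrated in one post, such as "sugar", gets a high weight in that post.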
g) Sentiment Analysis:
Next, we performed sentiment analysis on RapidMiner in order to discover the positive and negative opinions of
the users on the hypermarket's Facebook page. We started by creating a set of labeled data by taking a sample of
posts and storing them in documents in different polarity folders based on whether they are positive or negative posts.
The documents are then processed and the words are tokenized, filtered by length, stemmed using light Arabic
stemming, and the Arabic stop words are removed. Additionally, the word vectors are pruned to get rid of very
common and very infrequent terms which occur less than [...] or over [...] of the time. Subsequently, a vector wordlist
is created from the data and stored, as well as a Naïve Bayes classification model using cross-validation. Accordingly,
the data is divided into a training set and a test set, and the model is built using the training set, and is then applied on
the test set and its performance is measured by evaluating its accuracy.
Out of the [...] documents which were predicted to be negative, [...] were correctly classified as negative, leading to
a precision of [...] and a recall of 100%, since all of the negative documents were classified as negative. On the
other hand, out of the [...] positive documents, [...] documents were classified correctly, resulting in [...] recall, while
all of the documents which were predicted to be positive actually were, resulting in 100% precision. Therefore, the
total accuracy of the model is [...].
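
The relationship between these measures can be sketched with hypothetical counts. The labels below are invented to reproduce the same pattern (every negative post is caught, but some positive posts are labeled negative); they are not the experiment's figures.

```python
def precision_recall(actual, predicted, target):
    """Precision and recall for one class, from parallel label lists."""
    pairs = list(zip(actual, predicted))
    tp = sum(1 for a, p in pairs if a == target and p == target)
    fp = sum(1 for a, p in pairs if a != target and p == target)
    fn = sum(1 for a, p in pairs if a == target and p != target)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


def accuracy(actual, predicted):
    """Fraction of documents whose predicted label matches the actual label."""
    return sum(1 for a, p in zip(actual, predicted) if a == p) / len(actual)


# Hypothetical test set: 4 negative and 6 positive posts,
# with 2 positive posts misclassified as negative.
actual = ["neg"] * 4 + ["pos"] * 6
predicted = ["neg"] * 4 + ["neg"] * 2 + ["pos"] * 4

neg_precision, neg_recall = precision_recall(actual, predicted, "neg")
pos_precision, pos_recall = precision_recall(actual, predicted, "pos")
overall = accuracy(actual, predicted)
```

With these counts the negative class has perfect recall but imperfect precision, and the positive class has perfect precision but imperfect recall, mirroring the pattern reported above.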
Next, the built model was applied to the unlabeled posts in order to classify them as positive or negative. However,
despite this being a traditional and commonly used form of sentiment analysis, it was not very practical in our case. It
requires storing each post in a separate document, and manually labeling the positive and negative training set.
Moreover, after applying the model and classifying the documents as positive or negative, we have to manually go
back to the document and open it to see the post that was classified.
Accordingly, Repustate is a sentiment analysis and social media analytics website. It has an API for sentiment
analysis in five different languages, including Arabic. By using a trial of Repustate on the hypermarket's Facebook
page, sentiment analysis was performed on the posts. Accordingly, we could see the negative posts, which can be
filtered, for example by time, in order to see which times throughout the day most of the negative comments were
posted. From a simple visualization of the negative posts along with the users who posted them, we can easily find
out the people who are complaining about products or services, as well as their negative feedback and opinions. For
example, we can see a user providing feedback on the poor delivery, in addition to the post that the quality of the
second branch is worse than that of the first branch.
Additionally, we were able to view the positive posts, which are filtered according to the gender of the users who
posted them. Furthermore, we can view the neutral posts, and the posts can also be filtered by the date on which they
are posted, as well as by the device used for posting, whether it is by phone or by other devices. Finally, we can view
the number of positive or negative posts in order to monitor user sentiments over time and take actions when posts
reach a certain limit on a day. Moreover, the Repustate promoter score calculates how likely users are going to speak
positively about the hypermarket, with [...] being the best score and [...] being the worst score. Accordingly, this can
allow for taking corrective or preventive measures in order to increase positive posts and reduce negative posts.

3) Choice Phase:
In the choice phase, we need to choose a proposed solution from the results of the previous phase, thus in our case,
when, on what items, and through which means to provide promotions. However, we do not have enough background
knowledge or details about the hypermarket's functions or KPIs to use them for evaluation. Therefore, we chose to
use visualization as the main means for the evaluation step in this experiment, as it was found to be the more
value-adding method in our case.
Moreover, each of the analyses in the previous phase resulted in visualizations which could additionally be used
for the choice phase. Additionally, we created new visualizations in this phase in order to see the relationships between
the different variables, as well as their fluctuation over time, especially during certain periods. Moreover, to assess
the impact of social media advertising on customer purchasing, we took particular instances as examples. For instance,
Tableau was used to visualize the more effective method of promotion for a certain juice, in addition to the effect of
customer posts and opinions on other purchases.
Additionally, the user interactions with the Facebook posts over time were also visualized, so that we can see the Facebook activity of the users and focus on days with high activity for adding posts or for analyzing the user buzz. Furthermore, we can view the number of likes and shares for each post, and we can understand which posts are most interesting to users, as well as the comments and feedback which grab their attention or which they support. Moreover, the number of likes on a promotion post portrays user interest or a positive opinion about this particular promotion, and a lack of user interaction with posts can portray disinterest.
Therefore, by analyzing the sales of items and discounted items over time, and by incorporating the sentiments and feedback of users, we were able to determine the effectiveness of online promotions and sentiments on purchasing patterns, as well as determine the dates and items during and upon which promotions should be offered.
Nada Elgendy and Ahmed Elragal / Procedia Computer Science 100 (2016) 1071 1084 1081

3.3. Experiment Findings

In this experiment, it was shown that we could make our decision of what items to offer promotions on and when, as well as use social media as a context for marketing, for observing its effect on purchasing, and for gaining customer feedback. Moreover, our framework was followed by using the mapped big data analytics tools and methods within their designated phases of the decision making process. Accordingly, the big data was stored and processed, the analytics were performed, the results were analyzed, and the decision making was enabled as well as enhanced. Furthermore, the decision was supported by additional information extracted through big data analytics. Overall, the steps of the framework went smoothly and were valuable and insightful. The necessary modifications are elaborated below.
However, there were still some limitations and drawbacks, not related to the framework, faced during the experiment. These are mainly due to the Arabic language being very difficult to work with. First of all, as in several other languages, it is debatable whether a word or phrase is positive or negative; it depends on the context and on whether the Arabic is formal or slang. However, Arabic is tougher than other languages with respect to the variety of forms a root word can take based on the tense, context, and sentence grammar. Additionally, the same form of a word can have several meanings. For example, the word "helwa" can be a noun meaning candy, an adjective describing taste and meaning sugary or sweet, or even an adjective meaning nice or great. Therefore, in our case, for example, we cannot know if the intended usage refers to an item, the taste of an item, or an expression of liking. Saad and Ashour highlighted the complexity of the Arabic language in that it has a very complex morphology, and it is a highly derivational language with widespread synonyms as well as variations in the lexical category (whether noun, verb, etc.) in different contexts. Moreover, the encoding of the Arabic characters poses a problem, as it has different encodings according to the machine platform, and text preprocessing, mining, and information retrieval can lead to incorrect results if the encoding is not correct. Furthermore, as in our text mining case, Arabic encodings are very difficult to process and may not be supported by several tools.
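The encoding problem can be made concrete with a short sketch. It assumes that the transliterated word quoted above corresponds to one common Arabic spelling (written here with Unicode escapes), and it contrasts UTF-8 with the Windows Arabic code page (cp1256) as one example of a platform-dependent encoding; neither choice is taken from the experiment. The helper `strip_diacritics` is a hypothetical preprocessing step, not a function of any tool used in the study.

```python
import unicodedata

# Assumed spelling of the word quoted in the text (hah, lam, waw, teh marbuta).
WORD = "\u062D\u0644\u0648\u0629"
# The same word written with short-vowel marks (damma, sukun, fatha).
WORD_WITH_MARKS = "\u062D\u064F\u0644\u0652\u0648\u064E\u0629"

utf8_bytes = WORD.encode("utf-8")
cp1256_bytes = WORD.encode("cp1256")  # Windows Arabic code page

# The same text yields different byte sequences under the two encodings.
same_bytes = utf8_bytes == cp1256_bytes

# Decoding UTF-8 bytes with the wrong codec does not recover the word,
# so the mistake can pass silently into later text mining steps.
garbled = utf8_bytes.decode("cp1256", errors="replace")
restored = cp1256_bytes.decode("cp1256")


def strip_diacritics(text):
    """Drop combining marks such as the Arabic short-vowel signs."""
    return "".join(ch for ch in text if not unicodedata.combining(ch))


print(same_bytes, garbled == WORD, restored == WORD)
print(strip_diacritics(WORD_WITH_MARKS) == WORD)
```

Fixing the encoding explicitly when reading the data, and normalizing optional marks before matching words, are two simple precautions against the incorrect results mentioned above.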
However, despite the complexity, we were still able to extract several important insights from the social media analysis. Accordingly, by merging the results of the different analyses, we could gain unprecedented insights upon which to base our decisions. While we were able to understand the relationships between attributes such as quantities, discounts, total values, branches, and sections from performing analytics on the relational data on its own, it was highly supported by the Facebook posts and comments. Otherwise, we would not have been able to understand the effect of social media on the customers' purchasing patterns, and we would have viewed the spikes in sales at certain times after posting the online promotions without understanding the underlying reasons. Additionally, we would not otherwise have been able to incorporate the user sentiments into our decisions and understand how they feel about certain services, promotions, and items.

Results

From the experiment, several observations and enhancements regarding the framework have been identified. First of all, as previously stated in the framework development section, the framework serves as a conceptualization of some of the possible approaches to performing big data analytics in support of the decision making process. It was not intended to be inclusive of all the big data tools, technologies, and analytics. There are already several of these available, and they are constantly increasing, so there cannot be a comprehensive list of all possible solutions. Additionally, several solutions which are not released as big data tools can be used as well for certain purposes. In our experiments, although Weka is not, strictly speaking, a big data analytics tool, and is rather an open source machine learning tool, it has several very useful and simple-to-use features and analyses. Accordingly, it is not included in the framework as a big data tool; however, it was used in the experiments for additional knowledge, perspectives, and visualizations.
Moreover, it was found that visualization, which was intended for use in the intelligence phase during data discovery and in the choice phase during evaluation of the possible courses of action, can also serve as an analysis on its own in the design phase, from which valuable insight can be extracted. Consequently, we could visualize important relationships beforehand that could be further incorporated into the additional analyses, or visualize the results of the analyses for simpler and more comprehensive understanding. Accordingly, it needed to be added to the design phase in the framework. The same goes for statistics, which were intended to be applied in the intelligence phase during data discovery. However, it was found that statistical analyses can be used in the design phase as a model or a form of analysis on their own, or integrated with the other analyses. Hence, in the data analytics step, not only predictive analytics can be used to add value and gain insight, but descriptive analytics can be applied as well.
Furthermore, it was found that there do not need to be two different steps in two different phases for analyzing and evaluating the results of the big data analytics, and that they can rather be merged into a single step for simplicity. Additionally, the analysis and evaluation should be of the analytics itself and of how it relates to the decision domain, rather than of the possible courses of action. This is because, after the analytics are performed, the results are analyzed in order to gain insights which can add valuable knowledge and aid in making the necessary decision. However, the best scenario is not always to first define the possible courses of action and then evaluate each one in order to select the best one in the decision. Sometimes, there are too many possible courses, or these possible courses are known beforehand.
Also, big data analytics differs from the traditional, more structured method of finding a business problem, getting data, and analyzing it, and rather goes for a more unstructured approach where we try to gather all sorts of data, perform analyses to extract whichever knowledge and information we can out of it, and accordingly see how we can apply these unprecedented insights and how they can help in our decision domain. Therefore, we do not always have to know beforehand the decision which will be made, and this case needs to be supported by the framework. Thus, both analyzing and evaluating should be merged into a single step where either one or both could be applied on the results of the analyses.
Additionally, whilst the framework is not intended to be a tool for making the optimal, structured, and informed decisions, it is a conceptualization of mapping the different tools to the decision making process and of how big data analytics can be used for decision support. Moreover, the framework was found to be a somewhat more flexible or iterative process, rather than a sequential one. Accordingly, we should be able to move back and forth between the phases and steps. Since, as previously stated, big data analytics is not very predictable beforehand and may follow an unstructured approach where we do not know in advance the decision to be made, it makes sense to have to go back to prior stages at times. For example, while performing the analyses in the design phase, we might find that we need more data, or data of a different kind, to enhance our models or add additional knowledge. Thus, the framework should allow for the flexibility of moving back and forth through the steps.
Conclusively, the findings of the experiment were incorporated into the modified framework. The new, tested B-DAD framework is shown in the accompanying figure. As depicted, the intelligence phase is the same, without any modifications. However, the design phase now only includes the model planning and the big data analytics steps. Statistics and visualization have also been added as analyses. Moreover, descriptive analytics has been added to the analytics step. Furthermore, the analyzing and evaluating steps have been merged into a single step in the choice phase. Here, the step does not refer to analyzing and evaluating the possible courses of action, but rather to analyzing the results of the prior analyses and evaluating their impact on the decision domain. Finally, two-sided arrows have been added between the phases to represent the flexibility of moving back and forth throughout the framework.

Fig.: Modified B-DAD Framework

Conclusion

In this research, we have examined the innovative topic of big data, which has recently gained lots of interest due to its perceived unprecedented opportunities and benefits. In the information era we are currently living in, voluminous varieties of high velocity data are being produced daily, and within them lie intrinsic details and patterns of hidden knowledge which should be extracted and utilized. Hence, big data analytics can be applied to leverage business change and enhance decision making, by applying advanced analytic techniques on big data and revealing hidden insights and valuable knowledge.
By applying such analytics to big data, valuable information can be extracted and exploited to enhance decision making and support informed decisions. Accordingly, we followed the design science methodology in order to answer our research question of "How to integrate big data analytics into the decision making process?" Consequently, the contribution of this research is the developed and tested B-DAD framework, which guides us through the decision making process, supported by big data analytics and the various big data tools and methods.
Design science research should provide additions to the knowledge base as well as applications in the appropriate environments. Hence, this research contributes to theory and to the knowledge base by perusing the literature and collecting various theories, methods, frameworks, and data analysis techniques from previous research in the knowledge base in order to build the B-DAD framework. Accordingly, it aggregates several parts of these aspects and integrates and incorporates them into a single framework. Moreover, testing it on a real organizational scenario, which adds rigor and conceptuality, strengthened the evaluation of the framework.
Additionally, the research also provides contributions to the environment. The B-DAD framework can be applied not only in research, but also in the industry and within organizations. One of the prolonged and constant aims of decision makers and practitioners within organizations is to enhance decision making and gain the highest levels of unprecedented knowledge and insights possible. Accordingly, this research has provided people and organizations with the B-DAD framework, which shows them how to integrate and apply big data analytics throughout the phases of the decision making process in order to provide enhanced and more insightful decisions.
While it was shown that big data analytics could enhance decision making and enable the extraction of unforeseen insights and knowledge, it is no easy task. Other than the time and resource limitations related to most research, one of the main difficulties faced in our case was access to [big] data.
We believe that big data analytics is of great significance in this era of data overflow, and can provide unforeseen insights and benefits to decision makers in various areas. If properly exploited and applied, big data analytics has the potential to provide a basis for advancements on the scientific, technological, and humanitarian levels.

References

Brunswicker, S., Bertino, E., Matei, S. Big Data for Open Digital Innovation: A Research Roadmap. In: Big Data Research.
Chang, R.M., Kauffman, R.J., Kwon, Y. Understanding the paradigm shift to computational social science in the presence of big data. In: Decision Support Systems.
Fan, S., Lau, R., Zhao, J.L. Demystifying Big Data Analytics for Business Intelligence Through the Lens of Marketing Mix. In: Big Data Research.
Elgendy, N., Elragal, A. Big Data Analytics: A Literature Review Paper. In: Advances in Data Mining. Applications and Theoretical Aspects, Springer International Publishing.
EMC. Data Science and Big Data Analytics. In: EMC Education Services.
Fayyad, U., Piatetsky-Shapiro, G., Padhraic, S. From Data Mining to Knowledge Discovery in Databases. In: American Association for Artificial Intelligence.
Fisher, D., DeLine, R., Czerwinski, M., Drucker, S. Interactions with Big Data Analytics. In: ACM Interactions.
Jagadish, H.V. Big Data and Science: Myths and Reality. In: Big Data Research.
Kubick, W.R. Big Data, Information and Meaning. In: Clinical Trial Insights.
Manyika, J., Chui, M., Brown, B., Bughin, J., Dobbs, R., Roxburgh, C., Byers, A.H. Big Data: The Next Frontier for Innovation, Competition, and Productivity. In: McKinsey Global Institute Reports.
Oracle. An Enterprise Architect's Guide to Big Data: Reference Architecture Overview. In: Oracle White Papers in Enterprise Architecture.
Peffers, K., Tuunanen, T., Rothenberger, M.A., Chatterjee, S. A Design Science Research Methodology for Information Systems Research. In: Journal of Management Information Systems.
Russom, P. Big Data Analytics. In: TDWI Best Practices Report.
Saad, M.K., Ashour, W. Arabic Morphological Tools for Text Mining. In: International Conference on Electrical and Computer Systems (EECS).
Turban, E., Aronson, J.E., Liang, T., Sharda, R. Decision Support and Business Intelligence Systems. Prentice Hall Publications, Upper Saddle River, NJ, USA.

