Академический Документы
Профессиональный Документы
Культура Документы
Europäisches Patentamt
(19) European Patent Office
(43) Date of publication: (51) Int Cl.7: G06F 3/00, H04N 7/173,
05.10.2005 Bulletin 2005/40 H04H 9/00
(21) Application number: 04007969.1
(57) The present invention relates to an emotion and an editing unit (6) for changing said multimedia con-
controlled system for processing multimedia data com- tent (7) in accordance with the emotional state of the
prising a multimedia system (8) for presenting multime- user (1) in order to present the changed multimedia con-
dia content (7) to a user (1), an emotion model means tent (9) by the multimedia system (8).
(5) for determining the emotional state of the user (1) The present invention further relates to a method
during the presentation of the multimedia content (7) for executing the steps on this system.
EP 1 582 965 A1
2
3 EP 1 582 965 A1 4
Fig. 5 is a block diagram showing the different im- on the emotion data submitted by the emotion model
plementations of the system of the present inven- means 5. The changed multimedia content 9 can then
tion. be presented again on the multimedia system 8 to the
user 1.
[0021] Fig. 1 shows the system according to the 5 [0025] Hereby, the changing of the multimedia con-
present invention. tent may happen before, during or after the presentation
A multimedia content 7 is presented by a multimedia of the multimedia content 7 to a user 1. In a pre-process-
system 8 to a user 1. Such multimedia content may be ing for example the reactions and emotions of at least
movies, music, web pages, films, or the like and the mul- one user not being the end-consumer on a multimedia
timedia system may be a TV, a computer, or a home 10 content may be measured, the content is changed ac-
environment comprising loudspeakers, display, a HiFi- cordingly and afterwards presented to the final consum-
system, lights or any other devices for presenting and er. A next possibility is to change the multimedia content
emphasising multimedia content 7. at the same time the user 1 is watching it, that means
[0022] An emotion model means 5 serves for deter- e.g. changing the quality, the sound, the volume or the
mining the emotions felt by the user 1 to which a multi- 15 like of the multimedia content depending on the reaction
media content 7 is presented by the multimedia system of the user. A third possibility would be to store the
8. Hereby, the emotion model means 5 comprises an changed multimedia content in order to show it to the
acquisition system 2, a processing means 3 and a trans- user 1 at a later time.
formation means 4. A person during the consumption of [0026] Fig. 2 shows a first embodiment of the present
multimedia content reacts instinctively to the shown 20 invention. Hereby, the multimedia system 8 presents
contents with some emotions, e.g. anger, happiness, multimedia content to the user 1 and the emotion model
sadness, surprise or the like. Such mental activity is not means 5 detects the emotional state of the user during
constant and its follows the personal attitude of the user. the presentation of the multimedia content 7 and sub-
Additionally, such mental activities produce physiologi- mits these emotional states to the editing unit 6. In the
cal reactions in the human body. The acquisition system 25 editing unit 6 suitable algorithms process the collected
2 serves for detecting and measuring such physiological information to create e.g. digests of the viewed video
reactions of a user. Such reactions may be the heart contents. Depending on the algorithm itself different
rate, blood pressure, temperature, facial expression, kinds of digests can be produced for different purpose,
voice, gesture or the like. These reactions can be meas- e.g. e-commerce trailers, a summary of the multimedia
ured by e.g. biosensors, infrared cameras, microphones 30 contents, a very compact small representation of digital
and/or any other suitable sensing means. These phys- pictures shown as a preview before selection of the pic-
ically measurable reactions are measured by the acqui- tures themselves, or the like.
sition system 2 and are then further submitted to the [0027] Such an automatic system can be applied to
processing means 3, which processes and evaluates both a personal and public usage. For personal usage,
the data submitted by the acquisition system 2. If e.g. 35 the video e.g. can fit the personal needs or attitude of a
the acquisition system 2 samples the heart beat rate, single person, in the other case it is preferable to apply
then the processing means 3 may calculate the average the method to a vast public for producing a statistic result
value. The evaluated data are then further submitted to before automatically addressing different classes of
the transformation means 4 which, depending on the people, e.g. by age, interest, culture, gender etc.
submitted biological reactions, detects the emotional 40 [0028] The digest generation can be static or dynam-
state of the user. The transformation means 4 therefore ic. Static means, that the digest of a video is generated
comprises algorithms for effectuating such detection of only once and cannot be changed later. A dynamic gen-
the emotional state of the user 1. eration allows adapting the digest to changing emotions
[0023] Such an algorithm for detecting the emotional of a person every time the video is watched by this per-
state of the user 1 on the basis of the data acquired by 45 son. That means the digest follows the emotions of a
the acquisition means may be as follows: To every emo- person during lifetime.
tional state e.g. anger, happiness, fear, sadness or the [0029] As emotions are used to generate a video di-
like a certain value of each measured parameter or a gest, such a digest is a very personal one. Therefore,
certain interval of values of a measured parameter is the privacy problem has to be solved e.g. by protecting
assigned. To the emotional state of fear a high heart beat 50 a digest by cryptographic mechanisms.
rate, a high blood pressure and the like may be as- [0030] Another possibility is to use the emotions de-
signed. So the measured physical values are compared tected for classifying and analysing multimedia con-
with the values or intervals of values assigned to each tents. Hereby, the emotional state is embedded in the
emotional state and this way the actual emotional state multimedia content 7 by the editing means 6, that is to-
of the user can be detected. 55 gether with the multimedia content data are stored,
[0024] The emotional data determined by the emotion which reveal the emotional state, the grade of interest
model means 5 are then submitted to an editing unit 6 or the like of the user 1. Such information can then be
in order to change the multimedia content 7 depending used for post-processing analysis of the multimedia
3
5 EP 1 582 965 A1 6
content or for activating intelligent devices in a room en- [0035] Fig. 4 is an overview over the different appli-
vironment. This for example opens the possibility to cations of the system of the present invention. Hereby,
search for and look up information inside a multimedia the emotion data and the multimedia contents are both
content by emotion parameters, the user can e.g. fed to the editing unit 6 and then processed in order to
search for a scene where he was scared or where he 5 receive different applications. In the first possibility, em-
was amused. bedding data is used to associate data to the digital con-
[0031] Fig. 3 shows a second embodiment of the tent and the obtained data is then processed to create
present invention, where the multimedia content 7 pre- a digital video digest, e.g. a short video digest, a preview
sented by the multimedia system 8 is changed in real video digest or a longer synthesis of the video. In a sec-
time in a closed loop control. That means, that the mul- 10 ond possibility, emotion data and the digital stream are
timedia contents is automatically and dynamically used to produce on the fly a different view. In a third pos-
adapted to the user during playing time. sibility, the quality adaptive mechanism is implemented
[0032] A first possibility is to detect the actual grade and in this case the multimedia content is processed us-
of interest and tension of the user and accordingly ing the emotion information to produce a changing video
change to another scene or programme or continue with 15 stream.
the presentation of the multimedia content. Hereby, the [0036] Fig. 5 is an overview of the different implemen-
analysis of the user's emotion has to be done either at tations of the system, especially of the emotion model
some predefined points of the multimedia content e.g. means 5. As already explained the system mainly con-
at the beginning of every new scene or after a prede- sists of the acquisition 2, the processing means 3, the
fined time period, e.g. every two minutes or the like. By 20 transformation means 4 and the editing unit 6. The dif-
detecting the actual grade of interest of the user, the at- ferent parts of the system according to the present in-
tention of the user can be analysed and it can be pro- vention can be located remotely from each other. In im-
ceeded to the next scene if the user is not interested in plementation a) the acquisition system 2 is placed di-
the actual multimedia content. Instead of changing the rectly at the user 1 and the processing means 3 and the
scenes, also an automatic zapping mode of TV pro- 25 transformation means 4 are placed remote from the ac-
grammes or a selected number of movies may be im- quisition means 2, which only submits physical param-
plemented. Depending on the emotional state of the us- eters. In this case the acquisition may be a simple brace-
er and the emotional content associated to movies or let which transmits the bio or haptic data to the process-
TV programmes, an intelligent system might change au- ing means 3 and transformation means 4 which then fur-
tomatically to a set of user's video choices. 30 ther processes these data to detect the emotional state
[0033] A second possibility of application of the closed of the user 1.
loop control is to adapt the quality or the bandwidth for In implementation b), the acquisition system 2, the
the reproduction or the transmission of the multimedia processing means 3 and the transformation means 4
content. Streaming audio or video contents can be sent are all located near the user 1. Hereby, all these means
with a very high or low quality depending on the level of 35 may be implemented in a smart wearable bracelet or
attention of the user. That means, if a low level of interest another wearable system transmitting the emotions al-
by the user is detected then the quality is lowered and ready detected and processed to the editing unit 6 lo-
on the other hand if the grade of interest of the user is cated remotely.
high then also the quality is increased. This is particu- In implementation c) no date from the user 1 are trans-
larly advantageous for multimedia devices with limited 40 mitted, but the acquisition system 2 is located remotely
network bandwidth and power such as portable phones, from the user. In this case an acquisition system 2, such
PDHs and the like. as cameras, microphones or the like may be used. Also
[0034] A third possibility is to adapt the closed loop a combination of the implementations a) to c) can be
control to an interacting home entertainment environ- possible.
ment. The multimedia system 8 may comprise not only 45
display and loudspeakers but also other environmental
devices such as lights or the like. According to the user's Claims
emotional state, a home entertainment system might re-
ceive and process emotion information and adapt the 1. Emotion controlled system for processing multime-
environment to such stimula. If for example the user is 50 dia data comprising
watching a scary movie and the physical changes relat- a multimedia system (8) for presenting multimedia
ed to fear, such as a high heart beat rate or a high blood content (7) to a user (1),
pressure are submitted to the emotion model means 5 an emotion model means (5) for determining the
and then afterwards the emotional state of fear is sub- emotional state of the user (1) during the presenta-
mitted to the editing means 6, then the multimedia sys- 55 tion of the multimedia content (7) and
tem 8 may adapt the environment by dimming down the an editing unit (6) for changing said multimedia con-
light or increasing the sound in order to emphasise the tent (7) in accordance with the emotional state of
emotion felt by the user. the user (1) in order to present the changed multi-
4
7 EP 1 582 965 A1 8
5
EP 1 582 965 A1
6
EP 1 582 965 A1
7
EP 1 582 965 A1
8
EP 1 582 965 A1
9
EP 1 582 965 A1
10
EP 1 582 965 A1
11
EP 1 582 965 A1
12