Академический Документы
Профессиональный Документы
Культура Документы
ARTHA
(Artificially Responding THinking Agent)
An automated call centre response system
Category: Artificial Intelligence
Speech to text:
E.g.
Spoken language is collection of
acoustic signals, which can be 1. ELEVEN AX L
digitized. These samples can be
EH V AX N
converted into a sentence of distinct
2. ELEVEN(2) IY L
words by matching segments of the
EH V AX N
incoming signals with a stored library
3. EXIT EH G
of phonemes. There are readily
available libraries, which can do this. Z AX T
“Dragon Speech” is one such set of 4. EXIT(2) EH K
library. But these libraries have some S AX T
limitations too. They need to take a 5. EXPLORE IX K
sample of speaker voice before S P L AO R
properly converting the speech. But 6. FIFTEEN F IH F
in a typical call centre scenario it is T IY N
required to convert the speech
immediately from the first word itself. Here is a sample dictionary. With this,
This can be overcome if some using phone sets words are derived.
probability of error is accepted for
first few sentences.
Text to grammar:
The phoneset is the list of 'phones', or
speech sounds, that the engine can The computer does not understand
recognize. When you build acoustic any text, which is obtained from
models and pronunciations for words, speech conversion. For that we need
they can be made to use any set of to have a way of extracting the
units, but they must be the same units. meaning of the text. Given text is
The acoustic models will search for broken down into words. These words
the speech sounds (phones), and the can be matched with a predefined tree
word pronunciations are also given in of grammar and the machine can
terms of the phones in the phone set. arrive at its meaning.
The default phoneset for American To achieve this we first represent the
English that comes with Sphinx2 text into annotated form. This
contains the following phones: AA annotated form is called a corpus. In
AE AH AO AW AX AXR AY B CH principle, any collection of more than
D DH DX EH ER EY F G HH IH IX one text can be called a corpus,
IY JH K L M N NG OW OY P R S (corpus being Latin for "body", hence
SH T TH UH UW V W Y Z ZH. a corpus is any body of text). But the
term "corpus" when used in the
context of modern linguistics tends
There is also the silence phone, SIL, most frequently to have more specific
and a number of "noise" phones: connotations than this simple
ARTHA: Artificially Responding THinking Agent
on - pr - preposition --
meaning Pattern AI Handler
the - det - determiner
At later stages the LEARNER can be application, the processing time needs
improved to just accept the datasheet to be within few seconds. But the
of the product, and prepare the neural networks and other techniques
knowledge base. used in ARTHA involve huge
computational complexities. Hence
suitable methods should be designed
MONITOR: to reduce the complexity of the
computations
This is a book-keeping module, which
keeps track of ARTHA’s 7. Conclusions
performance. Any unanswered query
gets logged in the monitor,which can By looking at the above examples, it
be viewed by the administrator and is clear that such Automatic Response
inputs can be given to the LEARNER systems are not far from being
about it. The MONITOR will also implemented in large scale in near
monitor the “health” and future. But doing a generic system
“intelligence” level of the BRAIN. involves lots of innovation and huge
amount of man-hours for the
6. Further Enhancements implementation.
http://www.sls.lcs.mit.edu/sls/whatw
edo/architecture.html
Spoken Language Systems, MIT.
Related work
http://www.ling.gu.se/~lager/taglog.
The research on Artificial Response html
systems started long back, leading to A LOGICAL APPROACH TO
some of the very capable systems. COMPUTATIONAL CORPUS
Major research institutes which are at LINGUISTICS.
the forefront are: MIT, CMU and
Stanford Universities. There is lot of http://www.aaai.org/AITopics/index
research going on in military .html
applications as well. American Association for Artificial
Intelligence.
Some of the similar systems built
are: VOYAGER, JUPITER. http://www.cogs.susx.ac.uk/users/ge
offs/ChrisDoc.html
The performance issues CHRISTINE Corpus, Stage I:
Documentation.
The speed of processing is a key issue
in ARTHA. As it is a call centre
ARTHA: Artificially Responding THinking Agent
http://www.cs.washington.edu/resea
rch/jair/home.html
Journal of Artificial Intelligence.
ftp://ftp.sas.com/pub/neural/FAQ.ht
ml
Neural Networks FAQs, GOOGLE.
http://www.zsolutions.com/sowhy.ht
m
Neural Networks and Data Mining, Z-
Solutions.