What Is Artificial
Intelligence?
The O’Reilly logo is a registered trademark of O’Reilly Media, Inc. What Is Artificial
Intelligence?, the cover image, and related trade dress are trademarks of O’Reilly
Media, Inc.
While the publisher and the authors have used good faith efforts to ensure that the
information and instructions contained in this work are accurate, the publisher and
the authors disclaim all responsibility for errors or omissions, including without
limitation responsibility for damages resulting from the use of or reliance on this
work. Use of the information and instructions contained in this work is at your own
risk. If any code samples or other technology this work contains or describes is sub‐
ject to open source licenses or the intellectual property rights of others, it is your
responsibility to ensure that your use thereof complies with such licenses and/or
rights.
978-1-491-96540-5
[LSI]
ogies, or do they act on their own?). Each of these axes is a spec‐
trum, and each point in this many-dimensional space represents a
different way of understanding the goals and capabilities of an AI
system.
On the strength axis, it’s very easy to look at the results of the last 20
years and realize that we’ve made some extremely powerful pro‐
grams. Deep Blue beat Garry Kasparov in chess; Watson beat the
best Jeopardy champions of all time; AlphaGo beat Lee Sedol, argua‐
bly the world’s best Go player. But all of these successes are limited.
Deep Blue, Watson, and AlphaGo were all highly specialized, single-
purpose machines that did one thing extremely well. Deep Blue and
Watson can’t play Go, and AlphaGo can’t play chess or Jeopardy,
even on a basic level. Their intelligence is very narrow, and can’t be
generalized. A lot of work has gone into using Watson for applica‐
tions such as medical diagnosis, but it’s still fundamentally a
question-and-answer machine that must be tuned for a specific
domain. Deep Blue has a lot of specialized knowledge about chess
strategy and an encyclopedic knowledge of openings. AlphaGo was
built with a more general architecture, but a lot of hand-crafted
knowledge still made its way into the code. I don’t mean to trivialize
or undervalue their accomplishments, but it’s important to realize
what they haven’t done.
We haven’t yet created an artificial general intelligence that can solve
a multiplicity of different kinds of problems. We still don’t have a
machine that can listen to recordings of humans for a year or two,
and start speaking. While AlphaGo “learned” to play Go by analyz‐
ing thousands of games, and then playing thousands more against
itself, the same software couldn’t be used to master chess. The same
general approach? Probably. But our best current efforts are far from
a general intelligence that is flexible enough to learn without super‐
vision, or flexible enough to choose what it wants to learn, whether
that’s playing board games or designing PC boards.
Assistants or Actors?
Press coverage of AI focuses on autonomous systems, machines that
act on their own. With good reason: that’s the fun, sexy, and some‐
what scary face of AI. It’s easy to watch AlphaGo, with a human
servant to make its moves, and fantasize about a future dominated
by machines. But there’s something more to AI than autonomous
devices that make humans obsolete. Where is the real value, artifi‐
cial intelligence or intelligence augmentation? AI or IA? That ques‐
tion has been asked since the first attempts at AI and is explored in
depth by John Markoff in Machines of Loving Grace. We may not
want an AI system to make decisions; we may want to reserve deci‐
sion making for ourselves. We may want AI that augments our intel‐
ligence by providing us with information, predicting the
consequences of any course of action, and making recommendations,
but leaving decisions to the humans. The Matrix notwithstanding,
a future in which artificial intelligence is at our service,
augmenting our intelligence rather than overruling it, is much more
likely than a future in which we’re the servants of an overreaching
AI.
A GPS navigation system is an excellent example of an AI system
that augments human intelligence. Given a good map, most humans
can navigate from point A to point B, though our abilities leave a lot
to be desired, particularly if we’re in unfamiliar territory. Plotting
the best route between two locations is a difficult problem, particu‐
larly when you account for problems like bad traffic and road condi‐
tions. But, with the exception of autonomous vehicles, we’ve never
connected the navigation engine to the steering wheel. A GPS is
strictly an assistive technology: it gives recommendations, not com‐
mands. Whenever you hear the GPS saying “recalculating route,” a
human has made a decision (or a mistake) that ignored the GPS rec‐
ommendation, and the GPS is adapting.
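Route planning of this kind reduces to shortest-path search over a weighted road graph. As a minimal sketch (the road network, node names, and travel times below are invented for illustration, not taken from any real navigation system), Dijkstra's algorithm finds the cheapest route:

```python
import heapq

def shortest_path(graph, start, goal):
    """Dijkstra's algorithm over a dict-of-dicts weighted graph.
    Returns (total_cost, route), or (float('inf'), []) if unreachable."""
    queue = [(0, start, [start])]  # (cost so far, node, route taken)
    visited = set()
    while queue:
        cost, node, route = heapq.heappop(queue)
        if node == goal:
            return cost, route
        if node in visited:
            continue
        visited.add(node)
        for neighbor, weight in graph.get(node, {}).items():
            if neighbor not in visited:
                heapq.heappush(queue, (cost + weight, neighbor, route + [neighbor]))
    return float('inf'), []

# Toy road network: edge weights are travel minutes.
roads = {
    "A": {"B": 5, "C": 10},
    "B": {"C": 3, "D": 11},
    "C": {"D": 4},
}
print(shortest_path(roads, "A", "D"))  # → (12, ['A', 'B', 'C', 'D'])
```

Traffic and road conditions fit naturally into this framing: they simply change the edge weights, and "recalculating route" is a fresh search from the driver's actual position.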
Over the past few years, we’ve seen many applications that qualify as
AI, in one sense or another. Almost anything that falls under the
rubric of “machine learning” qualifies as artificial intelligence:
indeed, “machine learning” was the name given to the more success‐
ful parts of AI back when the discipline fell into disrepute. You don’t
have to build something with a human voice, like Amazon’s Alexa,
to be AI. Amazon’s recommendation engine is certainly AI. So is a
web application like Stitchfix, which augments choices made by
fashion experts with choices made by a recommendation engine.
We’ve become accustomed to (and are frequently annoyed by) chat
bots that handle customer service calls, more or less accurately.
You’ll probably end up talking to a human, but the secret is using
the chat bot to get all the routine questions out of the way. There’s
no point in requiring a human to transcribe your address, your policy
number, and other standard information: a computer can do it at
least as accurately, if not more so.
The next generation of assistants will be (and already is) semi-
autonomous. Several years ago, Larry Page said the Star Trek com‐
puter was the ideal search engine: it was a computer that understood
humans, had already digested all the information available, and
could answer questions before they were even asked. If you have
used Google Now, you were probably surprised the first time it told
you to leave early for an appointment because the traffic was bad.
Producing Results
Unlike the dark times of the AI winter, when data was limited and
computers were slow, we’re seeing successful AI systems every‐
where. Google Translate is nowhere near as good as a human trans‐
lator, but it frequently gives you a usable translation. While it hasn’t
been on the radar anywhere near as much, speech recognition sys‐
tems are also commonplace, and surprisingly accurate; a year ago,
Google claimed that an Android phone could correctly understand
92% of the questions it was asked. Given that a computer can cor‐
rectly turn a question into text, the next step is to turn that question
into an answer.
Similarly, image recognition and image processing have become
commonplace. Despite some highly publicized and embarrassing
mistakes, computer vision systems can identify faces with an accu‐
racy that was unthinkable a few years ago. Granted, constraining the
problem properly plays a huge role in this success: Facebook can
identify faces in your photos because it’s assuming that the people in
your photos are likely to be your friends. Computer vision is (or will
be) central to many applications of AI, from the mundane to the
scary. Vision is obviously critical to autonomous vehicles; it’s also
critical to surveillance, auto-targeting drones, and other uncomfort‐
able applications.
Deep learning and neural networks have attracted a lot of attention
in the past year: they have enabled progress in computer vision, nat‐
ural language, and other fields. However, almost anything that falls
under the rubric of machine learning is artificial intelligence: classi‐
fication and clustering algorithms, various kinds of decision trees,
genetic algorithms, support vector machines, hierarchical temporal
memory, and many others. These techniques can be used by themselves,
or in combination with others. IBM’s Watson is a good example
of ensemble learning: it is a rule-based system that incorporates
many other algorithms, depending on the problem it is solving. The
rules are largely hand-crafted, and the other algorithms need to be
painstakingly tuned to get good results.
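Ensemble learning, at its simplest, means combining the outputs of several models and letting them vote. As a minimal sketch (the three "experts" here are trivial stand-in rules, not Watson's actual components), a majority vote over independent classifiers:

```python
from collections import Counter

def majority_vote(classifiers, x):
    """Combine several classifiers by letting each one vote;
    the most common prediction wins (ties broken by order seen)."""
    votes = [clf(x) for clf in classifiers]
    return Counter(votes).most_common(1)[0][0]

# Three toy "experts", each labeling a number by a different rule.
by_sign      = lambda x: "positive" if x > 0 else "negative"
by_threshold = lambda x: "positive" if x > -2 else "negative"
by_parity    = lambda x: "positive" if x % 2 == 0 else "negative"

experts = [by_sign, by_threshold, by_parity]
print(majority_vote(experts, -1))  # two of three vote "negative"
```

Real ensembles weight the voters, or train a further model to arbitrate among them, which is where the hand tuning described above comes in.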
Impressive as Watson is, systems that require huge amounts of hand
tuning are at best a stepping stone toward intelligence. Any general
AI, and most narrow AIs, will probably combine many algorithms,
rather than using a single, yet-to-be-discovered master algorithm.
But the tuning required to get good results is a major limitation:
Demis Hassabis, who leads DeepMind, the company behind AlphaGo,
says that tuning is “almost like an art form.” Is it really “artificial
intelligence” if getting
good results requires years of work, and only a few specialists (Has‐
sabis says a few hundred) are capable of doing that work? The cre‐
ation of an engine like Watson is science, but it also requires a lot of
art. In addition, the need for manual optimization suggests that AIs
built this way are inherently narrow, designed to solve a single prob‐
lem. It’s very difficult to imagine optimizing a “general intelligence”
engine that can work on any problem. If you’re optimizing, you’re
almost certainly optimizing for something, for some specific appli‐
cation.
Do advances in AI depend on better algorithms, or better hardware?
The answer to that question is “both,” if the question is even mean‐
ingful. Even though clock speeds have stalled, our ability to put
more and more on a chip hasn’t stalled: AlphaGo’s 280 GPUs could
easily mean 200,000 cores. More important, though, we’ve seen a lot
of improvement in mathematical libraries and tools for using GPUs.
We may also see the use of ASICs and FPGAs (application-specific
integrated circuits and field-programmable gate arrays) in future AI
engines. In turn, ASICs and FPGAs will be critical to embedding AI
in hardware systems, many of which (think autonomous vehicles)
will need to run in hard real-time.
But even if the hardware is better, we will still need algorithms that
can be distributed across thousands or millions of nodes; we will
need algorithms that can reprogram FPGAs on the fly, to adapt the
hardware to the problems they are solving. MapReduce became
popular for data analysis because it suggested a way to parallelize a
large class of problems. Parallelism obviously works for AI; but what
are its limits? The hard fact of parallelism is that the part of the pro‐
gram that can’t be parallelized kills you. And the hallmark of most
parallel algorithms is that you need a phase that collects partial
results and generates a single result. AlphaGo may look at thousands