Академический Документы
Профессиональный Документы
Культура Документы
Ashley R. Kelly
Nike A. Abbott
University of Waterloo
Department of English
Language & Literature
University of Waterloo
Department of English
Language & Literature
University of Waterloo
Department of English
Language & Literature
arkelly@uwaterloo.ca
nabbott@uwaterloo.ca
raha@uwaterloo.ca
Chrysanne DiMarco
University of Waterloo
David R. Cheriton
School of Computer Science
cdimarco@uwaterloo.ca
ABSTRACT
1. INTRODUCTION
General Terms
Theory, Design
Keywords
ontology, rhetoric, rhetorical gures
Dr. Chrysanne DiMarco is the Director of the Inkpot Natural Language Research Group.
Permission to
to make
make digital
digital or
or hard copies of all or part
Permission
part of
of this
this work
work for
for
personal or
or classroom
classroom use is granted without fee
personal
fee provided
provided that
that copies
copies are
are
not
forfor
profit
or commercial
advantage
and that
not made
madeorordistributed
distributed
profit
or commercial
advantage
andcopies
that
bear
thisbear
notice
the full
the first on
page.
copy
otherwise,
or
copies
thisand
notice
andcitation
the fulloncitation
theTo
first
page.
To copy
republish,
on serverstoorpost
to redistribute
lists,
priortospecific
otherwise,toorpost
republish,
on serverstoor
to requires
redistribute
lists,
permission
and/or
a fee.permission
SIGDOC 2010,
27 - 29, 2010, S.Carlos,
requires prior
specific
and/orSeptember
a fee.
SP,
Brazil.2010, September 2729, 2010, S.Carlos, SP, Brazil.
SIGDOC
Copyright
...$5.00.
Copyright2010
2010ACM
ACM978-1-4503-0403-0
978-1-4503-0403-0$5.00.
123
2. RELEVANT LITERATURE
Natural Language Processing (NLP) is an area of study,
in the discipline of Computer Science, that treats the manipulation and generation of human-readable language. While
there have been some eorts to treat rhetoric in the eld
of NLP, as Harris & DiMarco (2009) argue, these treatments
fail to understand the primary scholarship and rhetoric to
any signicant degree, rarely going beyond a generalized
notion of style derived from linguistics. Often the degree
to which language scholarship is consulted is limited to the
study of linguistics [11].
While linguists and computational linguists have worked
together on developing models for software programs, rhetoric
can more fully treat the complexity of human speech. Many
linguistic schemes follow tractable characteristics of words
and word groupings. Rhetoric is more dicult to schematize because it seeks to uncover the exigence and purpose of
a gure or rhetorical device as well as the linguistic strategies a gure employs for eectiveness. Furthermore, many
characteristics of gures (especially tropes) are conceptual
and therefore more dicult to formally specify. However,
encouraging progress has been made towards computational
124
3. OUR APPROACH
3.1
Method
3.2
Primary Texts
125
3.3
Software
In working toward an ontology of rhetorical gures we determined to rst aggregate this large collection of disperse
information into database software. Our database was designed and developed specically for our project by a software engineer external to the research group, but who continues to work closely to meet our specic needs. The software application, dubbed Rhetg, is a Python-based application running on top of Google App Engine. As a platform,
Googles App Engine is a scalable, cloud-computing service
that allows web applications to be stored on Googles infrastructure.
The rst instantiation of the software was operationalized
as a data collection method and is constantly being developed to include more sophisticated features such as search
and sort algorithms. Phase two development of the software,
including search and sort, will be complete in late September
2010. Further development of the database will continue on
App Engine and will be a completely web-based application
accessible by anyone on any computing platform capable of
standard web-browsing.
Ultimately we will integrate open source, web-based ontology editing software (prot`eg`e, for example), but until the
categories and individual entries have been further rened
we are focused on the theoretical and conceptual issues, and
not the programmatic concerns, of an the ontologys development. That is, we see the theoretical concerns of classifying this extremely large and diverse group of data as
preceding any attempt to implement our database work in
ontology-editing software.
Fortunately (but also unfortunately) there are literally
millennia of taxonomic research into guration. Some categorizations are very robust. Others, however, are more controversial. A signicant number of gures occupy dierent
categories for dierent theorists. A signicant number, too,
are gures to some theorists, but not to other theorists. The
task is appreciable.
Theoretical work is, of course, essential to complete prior
to further software development as the theoretical work is
the basis for software design decisions. Unlike other ontologies in the eld of computer science, such as WordNet, this
ontology functions as much more than a simple lexical ontology; instead, our work towards an ontology blends the
notion of an ontology in computer science and a more classical denition (e.g., what features of a gure are essential,
individualizing?), well-suited to rhetorical studies. To better address the quiddity of gures we then turn to rhetorical
studies and will return to a discussion of software at the end
of our paper.
4. RHETORICAL FIGURES
In Silva Rhetoric, Burton describes rhetorical gures as
being like wildower seeds tossed on fertile ground, which
is to say that these gures multiplied into a garden of enormous variety over time [1]. Burton goes on to provide a
general introduction to rhetorical guresan excellent introduction for those new to rhetoricfrom their early classication and naming in Greek and Roman. Figures are strategies
or devices that are employed in language to achieve particular stylistic eects. As we will see, there is a much richer
life to these gures than is often assumed under the labeled
stylistics, but that denition will suce for now.
126
5. THE ONTOLOGY
The domain within which our ontology will be constructed
is that of rhetorical gures of speech, as dened and described above. Included within the scope of our work are
the gures, their individualizing characteristics, and the relationships among gures.
Uses of this ontology, we hope, will be multiple; the endgoal is to provide a comprehensive and robust collection of
terms and a suite of tools to manipulate and explore gures
and their relations to one another.
For the initial organization of the ontology we intend to
devise classes into a hierarchical taxonomy, using a top-down
approach of dening general concepts (e.g., linguistic domain) and subclasses (e.g., syntactic, semantic, morphological). Instances can be created from the work that we have
been conducting in the construction of our database.
5.1
5.2
Our initial ontology is envisioned as treating specic aspects of rhetorical gures as classes. That is, certain characteristics of rhetorical gures will be broadly dened and
organized based on certain linguistic or rhetorical characteristics which will, in turn, provide a governing set of rules
and constraints with which to organize our ontology. For
the purposes of this paper we will explicate three of the key
classes from which we will be able to create a formal taxonomy from our database of rhetorical gures. These three
conceptual classications are: type of (scheme, trope or,
chroma) distinctions, linguistic domain, and kind of relationships. Each of the classication schema are described,
dened, and illustrated below.
This example, from Shakespeares Julius Caesar, also contains the metaphor of a corpse as a piece of earth.
127
use for their eectiveness that are dened in this classication system. Unlike the previous method of classication,
linguistic domain, there are no subclasses within this taxonomy. Rather, we have constructed a list of common strategies employed by rhetorical gures as a method to group
them based broadly on their techniques. It is noteworthy
that this system is currently under going signicant revision, but the general structure and intentions will remain in
subsequent versions.
Repetition refers to when a gure uses repetition of
sounds (consonants, vowels, or syllables), words, syntax (phrases or clauses) or semantics (concepts).
e.g., RhymeHickory Dickory Dock. The mouse ran
up the clock.
Symmetry refers to when a gure pairs two constructions in an inverse (mirrored) way.
e.g. Antimetabole: When the going gets tough, the
tough get going.
Opposition refers to when a gure oppose two structures or concepts. e.g., Oxymoron: A wanton modesty.
Proud humility.
Identity refers to when a gure uses two or more identical elements.
e.g., Ploce: O villain, villain, smiling, damned villain!
(Hamlet [5]).
Similarity (partial-identity) refers to when a gure
uses resemblance of a concept.
e.g., Simile: My love is like a red, red rose.
Omission refers to when a gure omits expected elements.
e.g., Asyndeton (which omits the expected conjunction, and): I came, I saw, I conquered.
Series refers to when a gure establishes a series (through
words or concepts). e.g., Abecedarian: Adorable, beautiful, charming, delightful, exciting, fantasticyou run
the gamut from A to Z [1].
In this system is it evident that there are classical inuences; Peacham, for example, created multiple divisions in
this classication of rhetorical gures, including gures of
repetition and gures of omission. We hope that our work
here builds on, rather than rehashes, previous work on classication to provide a useful category for broadly dening
the strategies employed by rhetorical gures for their eectiveness. Furthermore, we hope that this system can be
paired with the linguistic domain classication to reference
the levels at which gures may be functioning in both kind
and linguistic-level.
6. FUTURE WORK
Presently we are continuing to rene the entries for each
rhetorical gure, drawing on the primary texts noted above,
as well as expert knowledge from language-related disciplines (i.e., rhetoric, linguistics, and literary studies). In
rening each gures entry we hope to ensure that its subsequent placement in the ontology, when populated, will accurately reect strategies and intentions employed by the
gure. That is, each gure will have a clear placement in
128
terms of the primary linguistic domain within which it operates, what kind of strategies it uses, and so on with the
categories described earlier in the paper.
Prior to populating an ontology we hope to ensure that
there is a high inter-annotator agreement on each gures
classication in the elds that we hope to use as classes
for the ontology (e.g., type of, linguistic domain, and kind
of). To do this we would have annotators independently
select the classications for a set of gures and then determine Cohens kappa coecient for statistical validation of
our taxonomy.
Ontology tools that we will create will be integrated into
the web-accessible database that has already be implemented.
In this way we hope that our work will be included in broader
research contexts than our own and, additionally, provide a
useful tool for other researchers. Our application has yet to
be developed as we are still working towards the completion
of version of our database of rhetorical gures.
Additionally, as we populate and rene the database we
are engaged in an iterative design process wherein the categories and systems of classication that will come to shape
our ontology are continually being revised and rened.
7. FINAL REMARKS
While the work described above is highly theoretical and
may rst appear to be situated rmly in the domain of
rhetorical scholarship alone, the impact of rhetorical gures
reaches much further. Rhetorical gures are present in all
communication and, as Lako & Johnson have famously described in their seminal text Metaphors We Live By, gures
shape how we think about and through the world. Though
Lako & Johnson seem unaware of the many precedents,
rhetoricians like Giambattista Vico, I.A. Richards, and Kenneth Burke had all advanced similarly compelling arguments
about guration and thought.
In related Inkpot projects we have explored the context of
health communications and as many other scholars studying health communications have demonstrated [17], the design of health communication has a signicant impact on
patients and rhetorical gures are inescapable components
of the communicative design of language, especially in those
areas concerned with persuasion.
Health care communication is only one example of where
rhetorical gures are crucial elements of a persuasive text.
Indeed, the presence or absence of carefully chosen rhetorical
gures has a profound impact on all genres of communication. Technical writers may nd particularly strategies of
repetition or ellipsis present in technical communications,
for example. Researchers situated more rmly in computer
science may nd our initial classication schemes to be valuable in detecting gures in a corpus and subsequently diagnosing, modifying, or mining that corpus. With further development of the database and ontology software researchers
will be able to sort and search through gures based on a
multitude of classications.
The end goals of the Rhetorical Figure Ontology Project
are to support researchers in rhetoric, poetics, cognitive science, human-languages technologies, and any of the other
elds (and subelds) for which rhetorical gures have either been traditionally important or in which rhetorical gures are more recently becoming important. As an ongoing
project, community feedback will certainly inform our work
and looking to form partnerships with others working in re-
lated elds.
Figures are extraordinarily pervasive in language, and extraordinarily uid; a gure for one generation is literal for
the next (becoming, e.g., a dead metaphor). This situation makes them very challenging to explore. But it also
makes them utterly essential for an adequate and responsible account of discourse, and for any investigations, of any
sort, into style. They cluster in certain ways relative to affect (anger, joy, sadness), for instance, and relative to genre
(newspaper articles, scientic reports, blogs), and relative to
functions (persuasion, description, argumentation). Computationally detecting, or reproducing, or modifying any of
these phenomena will be greatly facilitated by the ontology
we are building.
8. ACKNOWLEDGMENTS
We wish to thank Silva Rhetoric (that is, its mastermind, Gideon O. Burton), which not only served as the inspiration for this database, but as the primary source for
its phase-one population, and as an invaluable training tool
for its contributors. We would also like to thank the entire Inkpot Natural Language Research Group for their input and support in the development of this project, Garret Kelly for the design and development of our software,
and the University of Waterloo Computer Science Computing Facility sta for their ongoing support. This research
is funded by the Social Sciences and Humanities Research
Council of Canada (SSHRC).
9. REFERENCES
[1] G. O. Burton. Silva Rhetoricae. Brigham Young
University, February 2007. http://rhetoric.byu.edu.
[2] H. Caplan. An English Translation of: Ad C.
Herennium Libri IV De Ratione Dicendi. William
Heinemann Ltd, London, 1964.
[3] C. DiMarco, S. Banks, O. Gladkova, and M. Skala. A
ne-grained multi-level ontology for computational
stylistics. Unpublished., 2009.
[4] C. C. Dumarsais. Trait
e des Tropes. Veuve Gaspard
Fritsch, Leipsic, 1757.
[5] G. P. Epp. Rhetorical Figures. University of Alberta,
1994. http://www.ualberta.ca/ gepp/figures.html.
[6] J. Fahnestock. Rhetorical Figures in Science. Oxford
University Press, Oxford, 1999.
[7] J. Gawryjolek. Automated annotation of rhetorical
gures. Masters thesis, University of Waterloo, 2009.
[8] J. Gawryjolek, C. DiMarco, and R. Harris. An
annotation tool for automatically detecting rhetorical
gures. Pasadena, July 2009. Computational Models
of Natural Argument (CMNA).
129