Вы находитесь на странице: 1из 8

Development of an Electronic Arabic Dictionary

Siti Noormaya binti Bilmas*, Muhammad Nasir Ibrahim**, Mariani Idraos***

*Department of Mechatronic: Pusat Latihan Teknologi Tinggi ADTEC Batu


Pahat, KM8 Mukim Minyak Beku, Jalan Tanjung Laboh 83000, Batu Pahat,
Johor.
Email: snoormaya@yahoo.com
** Faculty of Electrical Engineering, Universiti Teknologi Malaysia, 81310 UTM
Johor Bahru, Malaysia.
Email: mnasir@fke.utm.my
** Faculty of Petroleum and Renewable Energy Engineering, Universiti
Teknologi Malaysia, 81310 UTM Johor Bahru, Malaysia.
Email: mariani@fpree.utm.my

Abstract

Currently, the arrangement of common Arabic dictionary is based on the first letter of
the words according to the Arabic alphabet sequence. For this ‘Electronic Arabic
Dictionary’ it is developing based on the arrangement of Hans Wehr Dictionary edited by
J. M. Cowan. Since publish in 1964, this dictionary is the only one that have been
arranged according to the Arabic root word. So far, there is no electronic version for this
dictionary format. Therefore, the objective of this project is to develop the software
format for Hans Wehr dictionary and as the first step is to design the database structure
based on the root word arrangement concept. The content of this electronic dictionary is
fully based on the Hans Wehr dictionary content. Beside provide the meaning for each
word, this dictionary also give information about the root word derivation form,
derivation words and its usage example.

Keywords: database structure, Arabic root word, electronic dictionary.

1. Introduction
Compared to the paper dictionary, electronic dictionary turn out to be favourite as
consulting tools for most users, especially for student [7]. It is efficiently used in learning
of foreign language such as Arabic language [9]. Recently, there are many electronic
Arabic dictionary are being developed to be on line or off line usage. But all these
dictionaries have a similarity at the aspect of the data arrangement. All the words are
arranged according to its first character of the word.
This paper discusses new arrangement for this electronic Arabic dictionary. The
arrangement that proposed is an adaptation from a dictionary that written by Hans Wehr
originated from Germany around 1964. This dictionary is arranged the Arabic words
according to its root word. So, far this is the only dictionary that used this kind of system
arrangement. This arrangement is a kind of unique that gives the advantage on more
information given along the meaning and the user will more understand how to use the
word for the correct grammar. This kind of system is still not produced in software
format. As far as this project concern is to develop a database structure that based on this
system. Root word base system.
2. Hans Wehr Dictionary
The main reference for this project is Hans Wehr Dictionary. This dictionary presents
the vocabulary and phraseology of modern written Arabic. The word arrangement is
according to its root words. Under a given root the sequence of entries is as follows as
Figure 1. In the presentation of the entries in the dictionary, homonymous roots are given
separately in only a few especially clear instances [1].

‫ﻛﺘﺐ‬
Kataba u (katb, ‫ ﻛﺘﺒﺔ‬kitba, ‫ ﻛﺘﺎﺑﺔ‬kitāba)
To write, pen, write down, put down in
writing, note down, inscribe, enter,
record, book, register (‫ ه‬s.t.h.); to
compose, draw up, indite, draft (‫ ه‬s.t.h);
to bequeath, make over by will (‫ه ل‬
s.t.h. to s.o); to prescribe (‫ ه ﻋﻞ‬s.t.h. to
s.o); to foreordain, destines (‫ ل‬or ‫ ه ﻋﻞ‬s.t.h. to s.o.;
of God); pass. kutiba to be
fated, be foreordained, be destined (‫ل‬
to s.o.) | ‫ ﻛﺘﺐ ﻋﻞ ﻧﻔﺴﻪ ا ن‬to be firmly
resolved to…, make it one’s duty to …;
‫ ﻛﺘﺐ ﻋﻨﻪ‬to write from s.o.’s dictation;
‫( ﻛﺘﺐ ﻛﺘﺎﺑﻪ‬kitābahū) to draw up the
Marriage contract for s.o., marry s.o.
(‫ ﻋﻞ‬to) II to make (‫ ە‬s.o.) write (‫ ه‬s.t.h.);
to form or deploy in squadrons (‫ ه‬troops)
III to keep up a correspondence, exchange letters,
correspond (‫ ە‬with s.o.) IV to dictate (‫ ە ه‬to s.o.
s.t.h.), make (‫ ە‬s.o.) write (‫ ه‬s.t.h.) VI to write to
each other, exchange letters, keep up a
correspondence VII to subscribe VIII to write ( ‫ه‬
s.t.h.); to copy (‫ ه‬s.t.h.), make a copy (‫ ه‬of s.t.h.);
to enter one’s name; to subscribe (‫ ل‬for); to
contribute, subscribe (‫ ب ل‬money to); to be
entered, be recorded, be registered X to ask (‫ە‬
s.o.) to write (‫ ه‬s.t.h.); to dictate (‫ ە ه‬to s.o. s.t.h.),
make (‫ ە‬s.o.) write (‫ ه‬s.t.h.); to have a copy made
(‫ ە‬by s.o.)

‫ ﻛﺘﺎب‬kitāb pl. ‫ﻛﺘﺐ‬ kutub


piece of writing, record, paper; letter, note,
message; document, deed; contract (esp. marriage
contract); book; ‫ اﻟﻜﺘﺎب‬the Koran; the Bible | ‫اﻫﻞ اﻟﻜﺘﺎب‬
ahl al-k. the people of the Book, the adherents of
a revealed religion, the kitabis, i.e., Christians and
Jews; ‫ ﻛﺘﺎب اﻟﺰواج‬k. azzawāj marriage contract; ‫اﻟﻄﻼق‬
‫ ﻛﺘﺎب‬k. aṭ-ṭalāq bill of divorce; ‫( ﻛﺘﺎب ﺗﻌﻠﳰﻰ‬ta’limi)
textbook; ‫ ﻛﺘﺎب الاﻋامتد‬credentials (dipl.); ‫ داراﻟﻜﺘﺐ‬library

‫ ﻛﺘﱮ‬kutubi pl. –ya bookseller, bookdealer


Root word
Form
Derivation word
Word pronunciation
*others are the meaning
Figure 1: Hans Wehr dictionary entry [1]

As for the Arabic verb morphology it will includes expansion of the root. Some
expansions are lexical derivations which are result a "new words". But others are
variation of the verb's conjugation. Western scholars have assigned Roman numerals to
the various patterns of derivation, which are called "forms". Also, the root is designated
"Form I". A particular form does not have a consistent meaning across verbs, although, it
has a "usual" meaning. In addition, no verb root has all the derivations. The forms
numbered beyond Roman numeral X are rare and obsolescent, for which reason many
elementary grammars omit them. The list of this form can be seen through Table 1. The
boldfaced transliteration indicates a word that would actually exist in an Arabic lexicon
for this particular root (‫ ل‬- ‫ ع‬- ‫( )ف‬f - ʻ - l - right-to-left) [1].
The middle vocal in “form I” active voice can be a, i or u, depending on the root
applied in “form I”. The exact vocalization depends on the word form. Forms XI
(ifʻālla), XIII (ifʻawwala), XIV (ifʻanlala), and XV (ifʻanlā) are very rare. Common uses
of those stems include [1]:
• faʻʻala is often used to make an intransitive verb transitive. Eg: karuma is "be
noble" but karrama is "make (someone) to be noble", or, more idiomatically, to
"honour".
• infaʻala gives a passive meaning. Eg: kasara "break" and inkasara "be broken".
• ifʻalla is used only to render stative verbs meaning "to be or become X" where X
is a colour or physical defect, eg: iḥmarra "turn red, blush" or iṭrašša "go deaf".
Then for the derivation, it derives from the root word. The meaning of the derivation
word has much similar meaning to the root word meaning.

I faʻala
II fa’’ala
III fāʻala
IV ʼafʻala
V tafaʻʻala
VI tafāʻala
VII infaʻala
VIII iftaʻala
IX ifʻalla
X istafʻala
XII ifʻawʻala
Table 1: List of “Forms”

3. Database Design
There are four main parts of this database. There are the root word meaning, the root
word form and its meaning, the example that show how to use the root word and the list
of derivation words.
As for the database structure design purposes, Microsoft Access is used. The process is
simplified as in the flowchart shown in Figure 2. For the table structure design, the first
step is to create the table [8].
For this database, there are four table are created. First table is the list for root word,
second table is listed the form, third tables list the examples and the fourth table list the
derivation words. This is the step for dividing the information into tables. For each table
the data have been divided into several fields. For field name that assign to each table
area as the Table 2.
The ID field is used to keep track the number of data that have been stored. For the
DATA NO, this is the field for unique number that have been created based on the
reference list. The format is shown as Table 3.
START

TABLE 1
Table structure
ID DATA ROOT PRO MEANING
NO WORD NOU
Input data NCE
TABLE 2
ID DATA FORM MEANING
Design relationship
NO
TABLE 3
ID DATA EXAM MEANING
Testing NO PLE
No TABLE 4
Yes ID DATA DERIV PRONO MEANING
NO ATION UNCE
END

Figure 2: Database design methodology

Table 2: List of Table Field


2 0 0 0 0 0 5 6
Character Root word sequence number
sequence based on the entries
number based arrangement in the dictionary Data Number: 22000063
on the first for each section of the 28
letter of the Arabic character
Root word ‫كتب‬
root word. Arabic character sequence ‫ ك‬- 22
The order is
Entry number in the
according to 000063
the 28 Arabic dictionary
character
Table 3: Data No Format
Table 4: DATA NO creation

The first two bit is for the character sequence that has been assigned according to the
28 Arabic characters. It is start with the alphabet ( ‫ ) ا‬, the sequence is 1 and end with
alphabet ( ‫ ) ي‬the sequence number is 28. Then for the next 6 bits is assigned according
to the entries arrangement in the Hans Wehr Dictionary for each character sequence. This
six (6) bit is reserved for the number counting to maximum value of 999 999 or entry.
For the example how the DATA NO is created can be seen in Table 4. Then for the data
entry, the data example is as the Figure 1.
The next field is ROOT WORD, PRONOUNCE, MEANING, FORM, EXAMPLE and
DERIVATION. After the tables have been established, the process of entering data to the
tables. The tables which have been filled by the data sample can be seen as the Table 5 to
Table 8.

root word
ROOT
ID DATA NO PRONOUNCE MEANING
WORD
10 05000057 ‫جدل‬ jadala u i (jadl)to twist tight, tighten, strech (‫ ه‬a rope); to braid, plait (‫ ه‬s.th. The hair, etc.)
11 06000026 ‫حتم‬ hatama i (hatm) - to decree, make necessary, prescribe (‫ عل ه‬s.th. for s.o.), make (‫ ه‬s.th.) a
duty, a necessity ( ‫ عل‬s.th. for s.o.); to impose, enjoin (‫ عل ه‬s.th. for s.o.); to decide,
determine definitely (‫ ب‬s.th.)
Table 5: Root Word Table
Form
ID DATA NO FORM MEANING
1 02000056 II to adjudge, adjudicate, award
2 02000056 VII to be cut off; to be finished, be done; to be deci
3 02000111 V to fritter away one's fortune, be overgenerous; t
Table 6: Form Table

EXAMPLE
ID DATA NO EXAMPLE MEANING
1 02000056 ‫ انبت قد وبينھم بينه‬it's all over between him and
‫االمر‬ them, they are through with
each other
2 02000111 ‫جھده بذل‬ to take pains;
3 02000111 ‫مساعدة كل بذل‬ to grant every assistance;
Table 7: Example Table

DERIVATION
ID DATA NO DERIVATION WORD PRONOUNCE MEANING
1 01000002 ‫اّب‬ ab August (month; Syr., Leb., Jord., Ir.)
2 01000030 ‫ابوة‬ ubuwa fatherhood, paternity
Table 8: Derivation Table

4. Unicode
For the data entry it consist two types of characters which are Roman and Arabic. The
computer which use Windows XP and above, the character recognition function is
embedded. For the Roman character the computer used ASCII code, where as for the
Arabic character, Unicode is being used.
Unicode is an industry standard character. This standard character set actually a
unique number that created to be deal with computer. The main objective of Unicode is
to provide a unique number for every character, no matter what the platform, no matter
what the program and no matter what the language [5].
Arabic Unicode Standard currently is version 5.0. For standard character of Arabic
language it is lies in range of 0600 – 06FF [3]. Then, for the supplement character for
Arabic language is in range of FB50 – FDFF [4]. The Unicode is arranged in matrix
form. For example for the letter (‫ ) ك‬the Unicode is 0643.

5. Result and Discussion


As the process of creating and entering data is finished, next step is to do the
relationship among the tables. The relationship is done according to its primary key for
each table[8]. In this database structure, the primary key is the DATA NO. The DATA
NO for every table are linked together so that the information in the tables is related to
each other.
The main idea of this database is to find the information by its root word. Therefore,
the query designs also according the specification. The database structure is translated
into flowchart as shown in Figure 3. This flowchart shows that information can be access
by the root word input.
For testing purposes, an interface is created (Figure 4). There two ways of searching
the data. If the user is not familiar with the root word, the user can choose to list out all
the root words and the search according to the list (Figure 5) . For the user that know the
root word, can directly type the root word to access the meaning. The dialog box will
appear for user to enter the input using Arabic character (Figure 6). For the other
information such as list of form, examples and derivation word of each root word can also
be accessed. Figure 7 to Figure 10 is the example of display for the information in the
database.

START

ROOT
WORD

LIST OF ROOT WORD

DISPLAY ROOT WORD,


N Y PRONOUNCE,
MEANING
MEANING.

N Y DISPLAY FORM
FORM
NUMBER, MEANING.

DISPLAY LIST OF
N Y EXAMPLES,
EXAMPLE
MEANING.

DISPLAY
N Y DERIVATION WORD,
PRONOUNCE,
DERIVATION
MEANING.

END

Figure 3: Database structure output

Figure 5: List of root words display


Figure 4: Main interface display

Figure 6: The dialog box for the input


Figure 9: The list of form display

Figure 7: The meaning of the root word


display

Figure 8: The list of example display

Figure 10: The derivation display

6. Conclusion
In spite of religious and academic, now proved Arabic language is becoming
commercial in Malaysia [6]. Therefore, the development of electronic module as a tool of
reference really needed as a way to improve the information sharing [10].
For that reason, Hans Wehr electronic dictionary database is developed as a new
approach to the arrangement of the Arabic dictionary. This database is design to be used
for the future development of electronic dictionary, whether for the on-line or off-line
usage.

7. Acknowledgement
The authors would like to express their thanks to Minister of Higher Education (MOHE)
and UTM for the Research Grant Vot 78626 to support this paper for the National
Conference of Engineering and Technology (NCET) 2011.

8. Reference
[1] Hans Wehr. “Dictionary of Modern Written Arabic: Arabic-English” edited by J.M.Cowan, 3th Edition,
New York, 2004.

[2] G.W. Hansen, J.V. Hansen, “Database Management and Design”, 2nd Edition, New Jersey, 1996.

[3] Arabic, Range: 0600-06FF, retrieved from www.w3c.org.ma/Talks/opstic3/data/arabicfomb.pdf on


January 2009.

[4] Arabic, Range: FB50-FDFF, retrieved from www.w3c.org.ma/Talks/opstic3/data/arabicfoma.pdf on


January 2009.

[5] “An Introduction to Unicode” retrieved from


http://scripts.sil.org/cms/scripts/page.php?site_id=WSI_Guidelines_Sec_6_2 on February 2009.

[6] Azman Che Mat, Hj. Azman Zakaria, “The Importance of Arabic Language in Malaysia Tourism
Industry”, Canadian Social Science, vol. 5, no. 4, pp 12-17, August 2009.
[7] Yu Ping Fang, Du Jia Li, "Towards the Electronic Dictionaries in Modern Commercial Context: A
Comparative Perspective", 2010 Third International Symposium on Electronic Commerce and Security, 29-31
July 2010.

[8] Database Basic Concept, retrieved from http://www.geekgirls.com/database_dictionary.htm on February


2009.

[9] Jen-Kai Liang, Tzu-Chien Liu, Hsue-Yie Wang ,Tak-Wai Chan,"Integrating Wireless Technology in
Pocket Electronic Dictionary to Enhance Language Learning", Fifth IEEE International Conference on
Advanced Learning Technologies (ICALT’05), 2005.

[10] Zhao, Jingying; Guo, Hai; Zheng, Zhenhong; Jiang, Nan; "The Implemention of Chinese-Tai Lue
Electronic Dictionary Based on C#",2010 International Conference on Machine Vision and Human-machine
Interface, 24-25 April 2010.

Вам также может понравиться