
TERMOSTAT ANALYSIS

DARSSY CARANTÓN RODRIGUEZ


MARIAM ADARVE CORREA
MELISSA RICARDO BRANCAMONTE
MISHELL HENAO CRUZ

M.A. LING. CARLOS PÉREZ PÉREZ

ESCUELA COLOMBIANA DE CARRERAS INDUSTRIALES (ECCI)

GENERAL BUSINESS TRANSLATION

MEDELLIN - ANTIOQUIA

2020
INTRODUCTION

In this assignment, we present the process of searching for and analyzing terminology and
statistics in marketing documents, together with a useful tool for this process: TermoStat Web.
TermoStat is an online term extractor that uses a hybrid method to identify candidate terms. It
takes into account not only the structure of potential terms but also the relative frequencies
of these candidates in the text being processed (called the analysis corpus) and in a very
large collection of newspaper articles (called the reference corpus). This method allows
TermoStat to find both multi-word and single-word candidate terms in a single
extraction process.
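
The idea of comparing relative frequencies in an analysis corpus and a reference corpus can be illustrated with a minimal sketch. This is not TermoStat's actual scoring method, which is not described in this work; the plain frequency ratio, the function names, the smoothing value and the toy sentences are all assumptions made for illustration.

```python
from collections import Counter


def relative_frequencies(tokens):
    """Map each token to its share of all tokens in the list."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {word: n / total for word, n in counts.items()}


def frequency_ratio(analysis_tokens, reference_tokens, smoothing=1e-6):
    """Rank analysis-corpus words by how over-represented they are.

    Uses a plain ratio of relative frequencies. This is an illustrative
    stand-in, not the measure TermoStat itself applies.
    """
    analysis = relative_frequencies(analysis_tokens)
    reference = relative_frequencies(reference_tokens)
    scores = {
        # The smoothing term avoids division by zero for words
        # that never occur in the reference corpus.
        word: freq / (reference.get(word, 0.0) + smoothing)
        for word, freq in analysis.items()
    }
    return sorted(scores, key=scores.get, reverse=True)


# Toy data, invented for illustration only.
analysis_corpus = "brand loyalty drives brand equity in the market".split()
reference_corpus = "the market opened in the morning and the news spread".split()
ranked = frequency_ratio(analysis_corpus, reference_corpus)
```

In this toy case, a word such as "brand", which is frequent in the analysis corpus and absent from the reference corpus, ranks first, while a general word such as "the" ranks last.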

Human evaluation of these results is always necessary to determine whether the units
identified are really terms, and whether they are pertinent in a given context.
METHODOLOGY

The first step was to search for three PDF documents on marketing in English, using
Google Scholar. We then converted the three documents to .docx format with the Smallpdf
tool and finally to .txt format, so that they could be joined into a single document and
analyzed in TermoStat, the online term extractor, in search of simple and complex terms.
After obtaining the results, we saved the different categories (terms, statistics, structure
and Bi-grams) and took a screenshot of the semantic cloud; all of this was saved
in a folder called Resources.
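
The step of joining the three .txt files into one can be sketched as follows. This is a minimal sketch, assuming the files are UTF-8 encoded; the function name and the file names in the usage comment are hypothetical and are not the ones used in this work.

```python
from pathlib import Path


def merge_text_files(paths, output_path):
    """Concatenate text files into one corpus file, separated by blank lines."""
    parts = [Path(p).read_text(encoding="utf-8").strip() for p in paths]
    merged = "\n\n".join(parts) + "\n"
    Path(output_path).write_text(merged, encoding="utf-8")
    return merged


# Hypothetical usage (file names are placeholders):
# merge_text_files(["doc1.txt", "doc2.txt", "doc3.txt"], "corpus.txt")
```

The resulting single file is the one that would be uploaded to TermoStat as the analysis corpus.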

We then created an Excel document called termostat_results with two sheets. The first,
Statistics, contains the frequency, percentage and examples given by TermoStat; the second,
Terms, contains all the simple and complex terms that TermoStat returned. With the help of
filters, we then translated the terms from French to English, which was the corpus language.

With the information from Statistics, we made two bar graphs, one showing frequency and
the other percentage. The simple terms were then extracted and used to make two pie charts,
one showing frequency and the other percentage. Finally, the complex terms were used to
make two more pie charts in the same way.

With the information from Terms, two tables were made: in the first, five representative
examples were chosen for each category of simple terms, and in the second, five
representative examples were chosen for each category of complex terms.

Finally, the collected data were analyzed and conclusions were drawn.
ANALYSIS

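[Figure: FREQUENCY. Horizontal bar chart of term frequency by structure category, axis from 0 to 300. Category labels shown: Adjective Adjective Noun Noun; Adjective Noun Preposition Adjective Noun Noun; Adverb; Adjective Coord_Conjunction Adjective Noun; Noun Preposition Noun Noun; Verb; Noun Noun Noun; Adjective; Noun; Adjective Noun. Data labels shown, in ascending order: 10, 11, 14, 17, 17, 17, 19, 25, 26, 42, 46, 53, 53, 53, 76, 99, 198, 230, 244.]

[Figure: %. Bar chart of percentage by structure category, categories 1 to 19, axis from 0 to 25.]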

In this graph we can observe that Adjective Noun terms dominate, with a frequency of
244, which represents 19% of the terms in the text. The second most frequent category
is Noun, with a frequency of 230. These two categories characterize the use of terms in the
selected texts.
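
The percentage for a category can be checked from its frequency with a short calculation. This is a minimal sketch, assuming that the 19 data labels visible in the frequency chart are the complete set of category counts; the variable and function names are ours.

```python
# The 19 data labels from the frequency chart, in ascending order.
# Assumption: these are all the category counts in the extraction.
counts = [10, 11, 14, 17, 17, 17, 19, 25, 26, 42,
          46, 53, 53, 53, 76, 99, 198, 230, 244]


def percentage_share(frequency, all_frequencies):
    """Return the share of one category as a percentage of the total."""
    return 100 * frequency / sum(all_frequencies)


total = sum(counts)                        # total number of candidate terms
top_share = percentage_share(244, counts)  # share of the largest category
second_share = percentage_share(230, counts)
```

Under this assumption the total is 1250, so the largest category accounts for about 19.5% of the terms, in line with the 19% reported above, and the second accounts for about 18.4%.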

In this graph we found that nouns represent 59% of the simple terms, a wide margin
over adjectives, verbs and adverbs, which together account for only 41%.
On the other hand, complex terms show much greater variation than simple terms, which
makes the graphs more complex and gives a larger amount of data to analyze. In this
group, the most important category is Adjective Noun, which accounts for 27% of the
total, while many other complex term categories represent only 1% each in the graph.

CONCLUSION

In conclusion, the expected results were obtained: a sound analysis was carried out by
following each of the steps provided by the teacher in class, resulting in a well-developed
piece of work.

The analysis was one of the main topics, since it yielded both quantitative and qualitative
results. The different categories, such as terms, statistics, structure and Bi-grams, were
shown through graphs. Translations were also made, and simple and complex terms were
extracted.

Frequency was another very important element in reaching these results: it made it
possible to observe that adjectives and nouns have very different frequency levels,
with nouns being the more frequent of the two.

Each graph was analyzed to draw conclusions from it, based on real data extracted from
the graphs. At the same time, we were able to see the importance of verbs and adjectives
within simple and complex terms, and the frequency of each.
