Академический Документы
Профессиональный Документы
Культура Документы
2011
8132
81.1-923
- 38
:
,
.. ,
, -
..
.., ..
-38 : .
: , 2011. 161 .
ISBN 978-5-88267-316-0
,
,
,
.
,
.
8132
81.1-923
.., 2011
ISBN 978-5-88267-316-0
.., 2011
, 2011
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1. . . . . .
1.1. . . . . . . . . . .
1.2. ,
:
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.3. . . . . . . . .
1.4. . . . . . . . . . . . . . . . .
1.4.1. . . . . . . . . . . . . . . . . . . . . .
1.4.2. . .
1.4.3. . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.4.3.1. . . . . . . . . . . . . . . . . . . . . . . . . . . .
1.4.3.2. . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2. . . . . . . . . . . . . . . . . . . . . . . . .
2.1. . . . . . .
2.1.1.
2.1.2. . . . . . . . . . . . . . . . .
2.1.3. :
, , , . . . . . .
2.2. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2.2.1. . . . . . . . . . . . . . .
2.2.2. . . . . . . . . . . . . . . . . . . . . . . . .
2.2.3. . . . . . . . . . . . . . . . . . .
2.2.4. . . . . . . . . . . .
3. . . . . . . . . . . . . . . . . . .
3.1. . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.1.1. . . . . . . . . . . . . . . . . . . . .
3.1.2. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.1.3. . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.1.4.
(WWW) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3.2. . . . .
3.2.1. . . . . . . . . . . . . . . . .
3.2.2. . . . . . . . . . . . . . . . . . . . . . . . . .
3.2.2.1. . . . . . . . . . . . . . . . . . . .
3.2.2.2. . . . . . . . . . . . . . .
3.2.2.2.1. . . . . . . . . . . . . . .
3.2.2.2.2. . . . . . . . . . . . . . . . . . . .
3
5
7
7
11
14
17
17
20
26
26
29
33
33
33
36
38
42
42
45
50
52
55
55
55
56
69
71
75
75
82
82
86
86
89
3.2.3.
3.3.
3.3.1.
3.3.2.
3.3.3.
3.3.3.1.
3.3.3.2.
3.3.4.
3.3.4.1.
3.3.4.2.
3.3.5.
. . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . .
,
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. .
. .
,
. . . . . . . . . .
. . . . . . . . . .
, . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
92
94
94
95
98
100
113
116
117
123
128
147
148
153
,
,
,
,
. , ,
, ,
.
.
,
, ,
.
,
, ,
. 3.1
, . 3.2
,
. 3.3
,
. , ,
,
.
5
, . ,
.
,
, ,
.
:
;
;
;
;
, .
,
,
,
, .
1.
1.1.
,
( )
. ,
, ,
, , ,
,
,
.
. , ,
. , : ,
, ,
, , [42].
( ),
,
[62]. . .
: ,
[51]. ..
,
, ,
,
,
[31].
( ),
7
,
, .
.
:
1) ()
;
2)
,
;
3)
,
.
,
( ) (. corpus manager).
,
,
.
.
. ,
, , ,
,
..
8
, .
.
.
3.3.
,
.
, , ,
: 1)
; 2)
.
,
,
, ,
.
[49].
,
,
,
[8].
.
.
,
,
9
[42].
.
( ,
)
, , ,
;
, .
,
,
.
,
,
:
, ,
. , ,
;
. ,
.
,
.
, ,
, ,
(
, ., 1998).
,
, , ,
, , ,
10
. 2
,
.
,
, , ,
, ,
.
.
1.2. ,
:
, ,
.
.
, ,
:
XVIII XIX ,
, .
,
, .
, [49].
1. :
(- ).
,
, - .
,
. ,
11
XIX
() ,
.
( ). .
.
,
, (
),
.
, XIX ,
.
- .
, ,
, .
,
,
.
,
.
.
2. , .
XIX
, .
, .
,
.
,
, . ,
12
.
, ..
.
XVIII , .
(Dictionary of the English language, 1755),
,
, ,
.
,
.
,
.
(
OED)
.
,
.
KWIC
(Key Word In Context). ,
,
, ,
.
.
,
.
.
13
.
Computer-Assisted Language
Learning (CALL),
, .
3. : .
XIX .
,
, :
.
,
, .
(, ,
, ). (,
)
.
.
.
,
,
.
.
1.3.
1960- .
14
.
,
. ,
.
, ,
, ,
.
,
().
1990 600
.
[44]:
-1965
10
1966-1970
20
1971-1975
30
1976-1980
80
1981-1985 160
1986-1990 320
,
.
(
, )
(British National Corpus BNC),
(International Corpus of
English ICE), (Bank of
English), (Corpus of
Contemporary American English COCA) .
(. 1).
1990-
16
. .
1992
[60].
,
, , .
1.4.
1.4.1.
.
,
, .
, (
) ,
. ,
, 1 . (
, --,
).
. ,
() 100 .
.
( )
, ,
. . (, ,
http://ruscorpora.ru, BNC,
http://www.natcorp.ox.ac.uk/ http://sara.natcorp.ox.ac.uk).
,
() .
, ,
17
. ,
-
, , , ..,
[31].
. ,
, ()
, ,
, .
,
( ):
1. ,
.
2.
, ad hoc ( ),
,
[31].
,
.
,
,
.
.
,
.
18
,
(, ,
..),
. 60- ,
, ,
,
[51]. ,
60-
. ,
, 15
(), 6 80
:
1) : ;
2) : ;
3) : ;
4) ;
5) , , ;
6) - ;
7) , , ;
8) ( , ,
, );
9) ;
10) ;
11) ;
12) ;
13) ;
14) ;
15) .
. ,
,
19
,
, ,
[31].
XXI
, , ,
,
[2]. ,
,
,
.
, ,
.
.
(
)
.
,
,
[20].
, , ,
.
1.4.2.
,
:
1) , (
), , (, ,
, ..);
20
2) .
,
(
treebanks,
). ,
.
,
,
,
.
,
(. 1).
1
21
()
..
,
, .
( , LOB),
,
,
(, BNC .).
,
, . ,
,
,
.
,
(,
,
).
.
22
-, -
, -
.
,
.
,
, , .
(), - [38],
,
[54].
.
.
( ),
.
, , ,
.
- (http://www.philol.msu.ru/~lex/corpus/)
[2].
.
on-line
.
.
on-line -.
, ,
23
, , ,
, .
.
.
.
.
,
. ,
.
: ,
.
,
(, ) ,
.
[3],
,
[17; 2].
.
,
.
- [17].
.
24
,
,
..
[2].
.
.
/
. (
)
,
, ,
, ,
.
, ,
.
[17].
. ,
: ,
, .
()
:
,
, , .
,
--
,
, 2000 (). ,
, ,
. , .
25
, , ,
( ) [5]
.
1.4.3.
1.4.3.1.
:
1) ,
-,
- , -
;
2) ,
, .
,
( ,
, , ..),
, , .
,
[2].
(alignment) .
,
( ), .
,
.. .. [17],
.
1) ;
2) ;
3) ;
26
4) ,
;
5) ;
6)
.
- [2].
,
, ,
, .
-
,
.
. :
;
;
,
;
;
,
,
,
.
. 27
,
[17].
,
[4].
(,
).
,
-
. ,
,
.
,
,
[25; 7].
. , -
, .
.
28
,
.
. ,
, .
,
,
,
, .
1.4.3.2.
,
,
. ,
,
[42].
,
. ,
, .
- (The London-Lund Corpus)
(The Survey
of English Usage). ,
. 1960
. .
1 . .
, ,
.
29
()
1979 . -
. 34 ,
,
. .
(1980) [59].
,
,
[44].
,
, ,
, ,
- . -
.
-
, ,
. ,
actually, really, you know, you see, I mean, well,
, ,
..
- -; , ,
, .
, .. ,
,
.
.
,
30
,
. , -
, ,
. ,
,
.
.
:
1. ?
2. ?
3. ?
4. ?
5. ?
6. ?
7. ,
?
8.
, (, ,
, , ..)? [17]
,
(
) ,
. ,
. , ..
.. , 70- XX X.
. 1 . ,
. 15 ,
31
,
225 . 8
21 9 .
.
.
Z.
.
, , ,
[17].
,
, . , , 10 .
(1990- )
50
[26].
,
,
.
(
)
2008 3,9% .
:
(64,3%), (8,1%), (27,6%)
[27].
,
,
..
32
2.
2.1.
2.1.1.
.
,
. XX
.
,
,
.
(. . 1.4.1),
.
.
, .
-
, , , :
1. ? ,
?
2. ?
?
3.
?
4.
, ,
, ?
.
? ,
33
.
, , , , .
, ? , ,
.
, , , ,
, .
. , ,
,
.
.
:
1.
.
2. .
, , ,
, , -,
.
3. .
,
.
4. .
,
( ),
34
(, ), ,
( MS-DOS),
..
:
(, ..),
, ()
, ( (,
), , ,
, , , ,
, , ..).
, .
.
5. .
(). 3 :
, ;
; ,
.
( ,
,
, ), ( ,
, , ).
.
(
,
,
)
.
6. :
).
7.
-
(corpus manager),
( ).
35
8. .
, .
.
9. ,
, ,
,
, - ..
,
,
.
.
2.1.2. .
,
. .
:
1. ?
2. (
)?
3. ?
4.
, ?
.. 19651980 ,
.
. .. (1977) [13].
36
, ,
( ) .. [17].
( ), (,
) .
.
,
, , .
,
[17]. :
1.
( , , , , ,
, ..)?
2.
(, 10- , 50- ,
..)?
3.
? ?
.
,
, ,
, ,
.
(The American Heritage Intermediate
Corpus). 5 . () 22
() .
221 ,
37
.
19 . .
1045 . 10 .
500 [17].
2.1.3. :
, , ,
,
,
. ,
, ..
(, ),
. ,
,
.
,
. , ,
. .: chap.
chapter chap,
. Jan.
January
, .
, ,
.
, , ,
, ,
(, ),
.
, ..
38
, .
.
, 'walk'
: 'walk', 'walked', 'walks', 'walking'.
, 'walk', , .
,
.
, ,
, () .
,
, , ,
,
. ,
,
. , "better"
"good", .
"walk" "walking",
,
.
.
:
[The] [quick] [brown] [fox] [jumped] [over] [the] [lazy] [dogs].
, SnowballAnalyzer,
:
[quick] [brown] [fox] [jump] [over] [lazy] [dog].
:
[the] [quick] [brown] [fox] [jump] [over] [the] [lazy] [dog].
.
39
.
"All
women were walking in the streets". ()
<>, '*' ,
.
()
. , "were" "be",
, ,
; "streets" "street",
, ,
. ..
"<*all>"
"all" <*> <Quant> DET PRE SG/PL
"<women>"
"woman" N NOM PL
"<were>"
"be" <SV> <SVC/N> <SVC/A> V PAST VFIN
"<walking>"
"walk" <SV> <SVO> PCP1
"<in>"
"in" PREP
"<the>"
"the" <Def> DET CENTRAL ART SG/PL
"<streets>"
"street" N NOM PL
"<$.>"
(, )
.
( ).
()
.
40
.
,
(treebanks),
,
.
.
, (
)
.
(),
,
, ..
:
1.
,
.
, ,
. , , ,
, New York Baden Baden
. , ,
, ,
, .
2. ,
, ,
, ,
[42].
41
2.2. .
2.2.1.
.
(tagging, annotation)
,
. , ,
, ,
,
,
(taggers)
(parsers).
()
, ,
(, , , ,
/, ..).
,
( ,
..).
, ,
(,
, ).
(, )
.
.
,
42
,
.
, ,
(,
) .
. ,
. ,
, ,
. ().
.
. ,
, , , noun,
verb .. ,
, .
,
,
.
, ,
,
.
. ,
information distribution , ,
noun, ,
. ,
lexical frequent
,
43
, .
deal . ,
, . ,
a good deal of trouble, ,
good . ,
,
,
. , deal a good deal of trouble
. ,
good ,
adjective
. ,
, ,
,
,
44
. ,
,
,
, .
,
: ,
,
, , (
: , , ,
, ).
2.2.2.
:
, , , ,
, .
:
1) () ;
2) ;
3) ;
4) ;
5) () ;
6) .
part-ofspeech tagging (POS-tagging), .
, ,
. : ,
,
-,
45
, , -,
.
: , ,
. 1980
,
..
:
the_AT
jury_NN
further_RB
said_VBD
in_IN
term-end_NN
presentments_NNS that_CS the_AT *city_NP *executive_NP *committee_NP
,_, which_WDT had_HVD over-all_JJ charge_NN of_IN the_AT election_NN
,_, deserves_VBZ the_AT praise_NN and_CC thanks_NNS of_IN the_AT
*city_NP of_NP *atlanta_NP for_IN the_AT manner_NN in_IN which_WDT
the_AT election_NN was_BEDZ conducted_VBN |
.
XML- (. 1).
<text> , <p>
, <s> , <w> , <pun>
. <w> <ana>
<lemma> , <pos> , <gram> .
3.
,
.
(,
, ..).
46
. 1.
( . 3)
,
.
:
;
;
;
;
;
.
. 2
.
47
Long ago, in the city of Babylon, the people began to build a huge
tower which seemed to reach the heavens soon.
. 2.
( , -3)
, ,
, .
, ,
(), ,
, ..
.
.
1) ( , ..);
2) - (
, , ..);
48
3) (,
..).
-
:
( )
, , ;
( ,
) ;
( )
;
;
;
,
.
:
- (,
, );
(,
);
- ()
(, , );
(,
) ( . http://ruscorpora.ru, ).
, :
. ,
, ;
.
, .
,
, , ..
49
2.2.3.
, ,
, (
, ,
, ),
(, , , , ,
), -
(, , ,
). ,
.
,
.
, , -,
; , .
, :
, (E external);
(I internal).
. E-:
1 (origin) , ;
E2 (state) ,
( );
3 (aims) ,
I-:
I1 (topic) ;
I2 (style) (, ) [57].
, ,
:
:
1) : , , ( );
50
2) ;
3) (
);
4) : ,
5 . ;
5 15 . ; 15 . .
:
;
;
.
,
:
1) : , ,
, , ,
, , , ;
2) : , ,
, , , , , ,
, , , , , , , .;
3) :
[27].
: ;
XVII ; XIX ; /: ;
, 1920-1940- ;
/ 1960-1980- ; /: ;
/: ; :
; : ; :
; .
.
, ,
:
1) -, , , -, , ,
,
, -, . (
21);
51
2) ;
3) ;
4) ( . http://ruscorpora.ru/corporaparameter.html)
2.2.4.
, ,
,
.
.
. ,
, ,
.
[39].
, ,
.
, ..
.
-
.
. ,
,
, ,
.
.
52
(
),
,
[2].
,
.
,
.
, ,
.
.
.
- ,
Text Encoding
Initiative (TEI) EAGLES (Expert Advisory Group
on Language Engineering Standards).
, . ,
, : ,
(E external) (I
internal).
..
(http://ruscorpora.ru/sbornik2005/05savchuk.pdf)
.
SGML XML. EAGLES
XML
(., , Corpus Encoding Standard for XML
(XCES)). XML
. 2.2.2 (. . 1).
53
,
,
.
,
.
.
(multilingual morphosyntactic specifications) MULTEXT-East Version 4
(http://nl.ijs.si/ME/V4/).
54
3.
3.1.
3.1.1.
,
.
,
( -).
,
,
.
:
KWIC (Key Word In Context),
;
, ;
( );
,
;
;
;
,
(, ),
;
;
55
, ,
;
;
(txt, doc, rtf,
html, xml .);
( ) ,
, .
SARA, XAIRA (BNC), Manatee/Bonito, CQP, DDC.
() .
,
ndex.Server 3.8 Professional.
ndex.Server
() .
ndex.Server,
- .
,
,
. , ,
,
, , , ..
, ,
,
( )
[27].
3.1.2.
56
, .
.
.
:
1) (,
..);
2)
;
3) (, , );
4)
(
,
);
5) :
(, );
(
, , ..);
6)
()
;
7) :
;
.
, Bonito/Manatee1.
,
.
1
Bonito , Manatee
.
57
Bonito
. Bonito
: (Bonitosrv)
(GUI graphical user interface) Bonito,
. NLPlab (Natural Language Processing Laboratory)
. ()
.
SUSANNE (Surface and Underlying
Structural Analysis of Natural English) (http://www.grsampson.net/).
. 130 .
,
SUSANNE.
Bonito
(, , );
;
;
( .);
;
.
;
;
.
;
;
, ;
;
58
( );
.
, ;
.
;
;
;
.
,
,
( )
(. 3).
. 3. - Bonito
"justice"
59
:
(P-filter)
;
(N-filter)
;
(Collocations)
( )
.
,
. (
"From:" "To:"). ,
,
. ,
, N- , N- .
,
(Query History),
,
. "" ,
, ,
.
"name:",
"" () (named queries).
,
. ,
,
.
,
"play" :
[word="$1" | word="$1s" | word="$1ed" | word="$1ing"]
60
,
"$" "1".
.
$1, $2 ..
.
,
.
: (!),
, (:) ,
. "regular
verb", "play"
:
!regular verb: play
Bonito.
1.
"run". :
announced that he would not <run> for reelection . Georgia
medical benefits paid out would <run> 1 billion or more in the
May , said today Jones will <run> well ahead of his GOP
opponents
reports that he had decided to <run> and wanted Mr. Screvane ,
investigation Street car tracks <run> down the center of Pennsylvania
.
"run" .
2.
2.1. "run in". :
contest . The Orioles got a <run in> the first inning when Breeding
record in the 600 - yard <run in> the Knights of Columbus track
61
The Bears added their last <run in> the sixth on Alusik 's double
for the third Indianapolis <run in> the ninth . Despite the 45
's first major league home <run in> the fifth put the Sox back
()
, .
2.2. , "take (smth)
out".
"take".
.
(P-filter). "From:" "To:"
"2",
"" ( "out").
"out". :
for governor would force it to <take> petitions out into voting
the peasant . Nonetheless , they <take> time out -- much time -Mister McBride . You do that or <take> you out a permit right now
,
.
3.
"runs? in".
"?",
, "s"
.
. :
tied the game , and single <runs in> the eighth and ninth gave
record in the 600 - yard <run in> the Knights of Columbus track
their eight hits for two <runs in> the sixth . Chuck Hinton
The Bears added their last <run in> the sixth on Alusik 's double
's first major league home <run in> the fifth put the Sox back
4.
62
"run(|s|ning)".
( | ) ( "").
"run" "runs" "running". :
announced that he would not <run> for reelection . Georgia
medical benefits paid out would <run> 1 billion or more in the
the group are interested in <running> on the required non lawyer and former FBI man is <running> against the Republican
tied the game , and single <runs> in the eighth and ninth gave
5.
"[lemma="be"] within <head>".
:
<head>DECISIONS <ARE> MADE</head>Asked to elaborate
<head>LEADERSHIP <IS> HOPEFUL</head>The housing
Nations .<head>FORMULA <IS> DUE THIS WEEK</head>The
year .<head>COULD <BE> SCRAMBLE</head>Some predict
ends .<head>CHOICE <WAS> EXPECTED</head>The selection
<head>TOBACCO ROAD <IS> DEAD . LONG LIVE TOBACCO
,
(
, <head>). ,
,
.
6.
"[tag="VVZv"]".
:
charge of the election , " <deserves> the praise and thanks of the
However , the jury said it <believes> " these two offices should be
of Fulton County , which <receives> none of this money " . The
when the new management <takes> charge Jan. 1 the airport be
face is a state law which <says> that before making a first
63
.
"VVZv" , (Zv)
(VV).
SUSANNE. ,
, , ,
.
7.
"View Attributes"
"lemma" "tag".
"[lemma="be"]".
:
in which the election <was/be/VBDZ> conducted . The September
-October term jury had <been/be/VBN> charged by Fulton Superior
stration and election laws " <are/be/VBR> outmoded or inadequate
" these two offices should <be/be/VB0> combined to achieve greater
Department, the jury said, " <is/be/VBZ> lacking in experienced
.
RegEx
,
(
), , ,
.
RegEx
, .
(web ),
.
.
64
,
. ,
"" : *.jpg, ,
,
(*), "jpg".
.
,
. -
(, , ..),
, ,
. -
,
(). ,
" , " "
" "" (
).
,
.
, .
PHP4 MySQL
RegEx.
RegEx
.
, .
:
1) , ,
;
2) ( ,
, ..);
3) ( ,
"" - , ).
65
, , .
, RegEx ,
.
: .*?+[]{}|$^
, "abcd" ""
, "abcd".
".", . , "."
"", "", "".
,
(, , ).
, ,
.
. , "[]" ,
, "",
",," , , "". - ,
( ),
.
. , "[-]"
"" "", "[-0-9]" -
.
,
. RegEx
,
:
[:alpha:] ;
[:digit:] ;
[:alnum:] ;
[:space:] ;
66
[:punct:] .
, ,
,
, .
"[^ ]",
, , "[^"
"]". , "[^]"
, "" "",
, "" "".
,
.
. "*" ,
.
, "19*8" ,
"1", "9" , , "8".
. "+"
.
, ,
: "[-]+-[-]+".
.
"?". ,
( !) .
, "[]?"
"", "" "".
.
"{}".
67
.
:
{n,m} , "" n
m ;
{n} "" n ;
{n, } "" n .
"[]", ,
. ,
, .
RegEx
, .
"|". :
"1|2|3" [123];
"^|^" , "" "";
"||||" ,
"|".
"||||" ""
.
"()".
: "(||||)".
.
, . ,
. , "^"
, "$" .
68
, "^"
, "", "$"
"", "^$"
"".
3.1.3.
()
.
, ,
. Collins Cobuild
English Dictionary concordance
: an alphabetical list of the words in a book or a set of books
which also says where each word can be found and often how it is used.
.. (. 4).
KWIC
,
;
. .
.
, ?
. .
,
. 4.
(. 4)
(. 5).
69
View 1 examples
id=http://piligrim.iatp.by/article.html title=" "
&; quot;.
,
. , ,
. , , , ,
. , ,
,
.
. . . ,
1849 ,
. 5. CQP
KWIC,
,
, .
, , ..
, (
) ,
, . ,
,
.
;
(HTML, XML);
, (, ,
, , ..);
.
70
. .
. 3.3.3.2.
3.1.4.
(WWW)
(-)
.
,
,
.
,
.
.
, ,
( ,
).
-
.
. 2001 [45].
,
,
- (,
). , , . ,
,
.
-
.
71
,
(directories,
-).
,
( search
engines), -.
,
, ,
.
,
, ,
.
( , )
: Google, Fast Search (AllTheWeb), AltaVista, WiseNut,
HotBot, MSN Search, Teoma.
: (ndex, Yandex), (Rambler), !
(Aport).
:
1. , ()
(
) .
.
2.
(. index database),
, , ,
, -,
(
72
, ),
.
3. ,
( ) ,
.
-:
.
( )
,
.
.
.
.
(, , , ,
..) ( )
, .
( )
()
, -: , ,
,
..
,
.
.
73
, ( )
, , ,
. ,
, , ,
. ,
(, ),
.
,
, .
.
(stop-words),
, (, ..)
. (
), , . , ,
,
-, .
1.
,
().
( )
, ,
, ,
,
, ,
, .
, ,
1
., , http://www.searchengineworld.com/robots/norobots.htm
74
, URL ( ), (),
, , ( ),
,
- ,
, ,
,
( , ).
.
, ,
.
, .
.
[15].
3.2.
3.2.1.
.
.
(The Corpus of
Contemporary American English COCA)
,
http://corpus.byu.edu/coca/,
. . (Brigham Young University, )
2008 . COCA 410 .
1990 ,
, , ,
75
.
, .
DeReK (das Deutsche Referenz Korpus),
http://www.ids-mannheim.de/kl/projekte/korpora/.
,
(), ,
4
.
( 16. 08. 2010). , ,
,
.
SGML
,
TEI.
COSMAS II, ,
.
(BNC)
, 100 .
.
.
1991 1994 . ,
, 90 %
,
, ,
,
. .
,
.
:
.
76
(. 4).
.
CLAWS, .
1,7%.
,
, -
, (, VVD-VVN
,
).
4,7% .
,
.
, .
,
, .
, ,
: , ,
, .
, 1975 ,
,
. BNC 1964 ,
25% . 75 %
(,
, , , ,
). ( ), ,
, , , , ,
77
, ,
, .
10-
: 1) ,
, , 2) ,
, , , -
.
1) .
124 , ,
,
, ,
,
, ..
,
. ,
, ,
.
,
.
,
[42].
2) - .
:
, ,
, - , ;
, , , ,
;
, , ,
;
78
, , ,
.
SGML
, SARA (SGML Aware Retrieval
Application). SARA
/, ,
.
XAIRA (XML Aware Indexing
and Retrieval Architecture).
(esk nrodn korpus) ( ).
,
.
. . .
1994
, .
http://uchk.ff.cuni.cz.
, ,
.
, ,
: ( 100 . ),
(750 . ) .
1750 . . ,
.
. ,
1990-2000
,
1950 .
79
,
(. 2).
2
:
..
15 %
85 %
60 %
25 %
3,6 %
3,4 %
3,4 %
4,6 %
: (
), ( SGML)
(
).
,
( ), ,
, .
, : , ,
, , .
SGML.
,
(, ,
, , , ..).
,
SGML.
80
: SYN2000 (100 .
) PUBLIC (20 . ),
SYN2000,
. PUBLIC
:
;
( 60 );
.
.
SYN2000 ,
.
BONITO (. . 3.1.2).
:
1. holub ().
2. holub vejce ( )
( )
81
[lemma="holub"] [lemma="vejce"]
(...)
3. (A) (C),
(Y), (S)
[tag="ACYS.*"]
4.
3.2.2.
3.2.2.1.
(Upsal'skij korpus russkix tekstov)
().
82
http://www.slaviska.uu.se/korpus.htm. 600 ,
1 . ,
. ,
.
, , ,
,
. (
, ) 1985 1989
1960 1988 .
,
.
,
, , (which felt to
be more important),
. , ,
,
1.
( 25 : , ,
..), ( 40 ).
,
.
,
( vs.
), , , [2].
.
(. 3).
1
,
[2, . 129],
.
83
3
@
@@
&
*
\
{
:
&Perestrojka vse glubhe zatragivaet hiznennye interesy millionov, obqestva v
celom. Estestvenno, l~di xott luwe usnit' sut' i naznaenie processov obnovleni,
blihnie i dal'nie celi preobrazovanij, opredelit' svoe otnowenie k nim
,
- SFB 441
1990- -
(http://www.sfb441.uni-tuebingen.de/b1/rus/korpora.html#uppsalakorpus).
.
(TnT).
,
. .
: (KOI8
Windows-1251) .
:
,
.
:
, ,
, ,
: (
), XX XIX .
84
CQP,
,
.
XX-
2000-2002
.
(
11 . )
13
1994-1997 (23110
), (,
), (, , -5) ,
,
( ) .
(,
), .
,
-1 (
).
: (, ,
) -,
, , ..;
, ,
.
200
. ,
(
),
.
85
9 :
1. ,
,
, ;
2. - ,
, .
,
: , , .
3. ,
:
- , , ..
.
4. - ,
,
.
5. , ,
,
(, ).
6. .
7. .
8. - .
9. , , ,
, .., ,
,
.
3.2.2.2.
3.2.2.2.1.
,
,
86
.
2000 ,
1980- [32].
() - ,
.
http://ruscorpora.ru/ 2004 .
, ,
: ,
, , ,
.
,
,
:
1) , 176 .
( http://ruscorpora.ru/
2011 );
2) ,
(,
-,
,
, , -,
);
3)
(, ,
, )
, ( 20 .);
4) ,
,
, , ,
[11].
. ,
1951-2007 .
87
97,5 . ,
2000 .
53 .
XVIII (1,1 . ), XIX
(23,3 . 2,5 .
) 1- XX
(25,4 . ).
(. 4).
4
18
1800-1830
1831-1870
1871-1900
1901-1910
1,1
2,1
11,3
10,3
3,4
1911-1920
1921-1930
1931-1940
1941-1950
1951-1960
3,9
8,6
4,9
3,3
3,5
1961-1970
1971-1980
1981-1990
1991-2000
2001-2007
5,8
7,0
7,2
23,2
46,8
, ,
200 ,
( ) (1-2 )
.
,
:
1) , ..
;
2)
,
;
3) ,
;
4)
;
5) , ,
88
,
[11].
:
,
( );
- ,
;
,
;
;
,
,
( ,
, , ,
. .);
,
;
,
1930-2000- [27].
3.2.2.2.3.
, ,
,
.
.
89
,
, .
, , .
,
,
, (,
, ) [11].
. ,
, ,
, ,
[57],
.
,
, ,
-
[28].
,
6- ,
("Pear film"). 8 ,
, 20
. 595
, , 327
, ,
.. , ,
[22, . 10].
,
,
90
. ,
.
[28].
,
. ()
.
,
,
[38].
2007 .
30 ,
-.
,
. ,
520 (, , ,
),
. 266
, 223 .
2202 ,
134 ( 4500 , 39 .
) ( 2009 ).
:
(,
, , ,
,
91
)
[38].
3.2.3.
,
, (
),
, .
,
. .. .
, .
-
2007 . 78
9-11 , .
/
intermediate (26%) / upper-intermediate (74%).
38 . .
,
. ,
, ,
().
EFL (English as a Foreign Language),
.
,
92
[18].
, -
1 XVI-XVII . (),
http://project.phil.pu.ru/skat.
,
.
XVXVII .
, , ,
,
. 2006
500 . [1].
, ,
, , .
, (,
, , ..),
(
, ).
,
(-
), (
( . ) ,
, . , , -
.
1
93
),
, [10].
3.3.
3.3.1.
, , ,
, ,
, ,
.
, ,
,
.. ,
,
, ,
.
,
.
.
.
, ,
.
-
. (,
..)
:
94
, ,
.
(, )
, .
(, )
, ,
, .
.
,
( ,
, ).
?
() ,
,
.
,
: ,
,
(). .
3.3.2.
1.
, ..
,
, . , ,
,
, ,
.
95
. ,
,
.
[49]. , ,
,
,
,
(
, -, .
. 3.1.4):
*Harry reminds me of himself [Postal 1970]; . : Joe reminds
me of himself.
*John will leave until tomorrow [Lakoff 1970, . 148]; . : I
will leave until tomorrow.
*What an idiot I thought Tom was [Postal 1968, . 75]; . :
What an idiot I thought the main character to be. [19].
.. ,
,
[30, . 318]: ...
, -
.
,
, . ,
,
,
:
* , [ 2006, . 71]: .
: : -
96
- ,
(. , 1990-2000).
* , [ 2006, . 71]: . :
<...>
<.. > (. , 2003).
<...> (
, , ..), [:
164]: . : <...> ,
, 2010 (, 2001.10.30).
[30]
.
,
[42].
2.
,
, ,
.
(,
,
)
.
,
..
3.
,
, ,
/, ,
..
.
97
,
, , , .
,
. , CHILDES,
,
, ,
[52].
,
,
[46].
COLT (The Bergen Corpus of London Teenage Language)
(13-17 )
[58]. ,
, ,
,
[53].
3.3.3. ,
,
, ,
.
. ,
:
,
;
;
98
(
, ,
);
;
;
;
[9].
. ,
,
,
.
. , . , . Corpus Linguistics.
Investigating language structure and use [40] ( Corpus
Linguistics) ,
-,
:
1. ?
2.
?
3. (
, , ..)?
4. ,
?
5. ?
6. - ,
? [40]
99
,
, .
, (KWIC),
, .
3.3.3.1.
,
.
,
, , ,
. ,
, .
. ,
,
[2]. , deal 8-
-
1500 ,
.
. ,
,
, . ,
Sketch Engine,
(), -
.
,
deal Corpus Linguistics.
100
,
. ,
deals 3-
. Deal dealing
. ,
, ,
,
, .
, deal
,
,
.
182
deal LOB
, look,
500 . .
,
.
.
5 ,
TACT (Text-Analysis Computing Tools),
DEAL
--.
. , deal
( nn) 115 ,
(np) 1 , (vb) 66 ..
DEAL ,
.
101
5
DEAL
-- (LOB)
deal_nn
Deal_np
deal_nns
deal_vb
dealing_vbg
deals_vbz
dealt_vbd
dealt_vbn
115
1
5
66
51
20
14
17
DEAL
- ,
(overall)
.
DEAL
--,
(deal deals).
--
, ,
, ,
deal deals
(. 6).
6 () (raw
counts)
(normed
counts).
; ,
14 deal,
4, 16 ..
102
6
DEAL ,
100 .
88000
34000
54000
34000
160000
88000
154000
58000
()
DEAL
14
4
4
5
DEAL ( 100
. )
15,9
11,8
7,4
14,7
16
10,0
11
12,5
24
5
15,6
8,6
:
88 . , 34 . ,
160 . .
.
, ,
. ,
DEAL
, ,
,
, DEAL
. , ,
DEAL .
()
.
103
, 1 . , ,
, 100 . ,
15,9 100
. , 10 100 .
.. ,
.
deal
,
.
6 ,
4-5 .
DEAL,
24 ( ). ,
--
DEAL ,
-.
7
deal
-
1 . (IPM)
4000000
90
119
2000000
107
63
2000000
74
176
7 , - deal
deals ,
.
.
104
-,
( ) , deal/deals
, deal/deals
(119 . 90
.). ,
. deal/deals
,
(176 74 . ).
, deal/deals
,
(107 63 . ).
DEAL
: ,
, . ,
, .
,
DEAL
. ,
, , ,
.
, ,
.
,
,
, ,
.
,
,
.
105
()
.
(collocates) ,
() .
(collocation)
(
). ,
,
. ,
DEAL
, .
8 deal
-. ,
,
.
,
deal. , ,
8, , good
deal. ,
deal. ,
of deal.
deal,
(, ). ,
deal great (45 .
), good (23 . ).
(package that)
, 8,
, 3 .
.
106
8
deal
(5,7 . ) -
1
.
( 2,7 . )
Great
Good
of
more
in
to
( 3 . )
great
good
the
big
of
to
about
more
with
45
23
39
7
3
3
40
28
8
3
28
7
5
3
3
107
,
, .
, a good/great deal of a good
deal of work a good deal of attention.
. ,
more a great deal more
tolerance a good deal more inhabited. in to
deal ,
.: a great deal in common, differ a great deal in their understanding, a
great deal to be desired, a great deal to offer.
8 ,
.
great good. ,
good deal great deal ( 68
. ).
(96 ),
good/great + deal.
the 8 . big
3 . , 37
.
,
,
. , the c DEAL
, part of the deal is Isnt
that the deal? big .
no big deal what's the big
deal?
, , ,
: property deal, record deal, cash
deal, land deal, mining deal.
DEAL,
.
108
deal
, .
4 table 1
box, ().
,
deal , 5
,
.
Corpus Linguistics
deal.
.
, 4.
2-3 2030.
deal.
9 7 deal,
.
7 ,
. , ,
Webster's Third
Dictionary 2 Random House Dictionary
21.
deal ,
. deal
, ,
. ,
16- 23- .
,
, : big deal
. ,
,
109
, ,
.
9
deal
large but
indefinite
amount
agreement/
arrangement
distribution of
cards in a
game
treatment
received
act of
distributing
pine or fir
wood
act of buying
or selling/
a business
transaction
Longman
Encyclo.
Third
House
Lang. Culture
1989
981
1993
1992
entry 1
sense 13
entry 3
sense 3
entry 1
sense 2
entry 1
sense 3
entry 1
sense 21
entry 1
sense 16
entry 1
sense 18
entry 1
sense 16
entry 1
sense 3
____
entry 1
sense 4
entry 1
sense 18
entry 1
sense 21
entry 2
sense 1
entry 2
sense 4
entry 1
sense 15
entry 1
sense 17
entry 2
3 senses
entry 1
sense 13
entry 3
sense 2
____
entry 1
sense 6
____
entry 2
sense 2
____
entry 4
2 senses
entry 3
sense 2
entry 2
sense 1
entry 1
sense 5
entry 1
sense 6
entry 1
sense 23
entry 2
3 senses
entry 1
sense 17
1993
entry 3
sense 1
entry 2
sense 1
,
.
,
, ,
, ..
, [31].
, .
,
deal, (
110
!).
. ,
deal ,
. ,
,
deal,
, .
,
: ,
,
.
DEAL
DEAL Corpus Linguistics
. deal with ,
.
deal with ,
, .
- 157
. 58 .
,
deal only (2,6 c
. ).
deal in (4,6 . ).
deal with
,
, , , ...
,
111
,
deal with : The second
controversy dealt with the source of nitrogen in plants; An important point
to note is that the preceding discussion has dealt with thermodynamic
acidity...; Other environmental effects are dealt with in other chapters.
deal with :
When they had dealt with the fire another crisis arose; Moreover many
losses are due to chilling and crushing, both factors that can be dealt with
by good environmental control and housing.
-
: He didnt have the right
temperament to deal with the Hennigs of this world; She would have rather
there been a fight, anger or even tears and pleadings. These she could
deal with, not this deadly coldness exhibited by Alice; But suffering is also
a fact. It has to be dealt with.
, deal with ,
, :
Hansie De Beer runs the farm, hes the one Mehring usually deals with;
The son handed me a small suitcase with the distant eyes of a man dealing
with a chauffeur.
, deal with
.
, deal with,
( ,
..). ,
.
,
. ,
, .
112
3.3.3.2.
, ,
.
, , ..
, -,
, -,
,
,
, ,
(, ,
) [36].
.
(MI, t-score, log-likelihood),
.
( ,
)
, ..
, .
( [36]).
- CQP
(http://corpus1.leeds.ac.uk/ruscorpora.html)
(The corpus of Russian newspapers) 20012004 78 . ,
113
()
.. .
19 ,
.
.. [37] ,
.
() ( 1981-1984)
, ,
(,
; ,
). , ,
.
.. [6].
,
, .
, :
, , , , , , , ,
, , , , , , , , ,
, . 10 10
( 106) ,
MI ( ), Joint
; Freq1
, ..
; LL score, MI, T-score log-likelihood,
MI t-score . ,
, , , ,
, ,
MI. , MI
0 1 ,
.
114
10
( )
Joint
Freq1
LL score
MI
T-score
76
30,19
11,03
3,00
54
12,43
10,35
2,00
45
728
135,77
10,09
6,70
100
18,00
10,05
2,45
174
26,31
9,84
3,00
142
16,92
9,54
2,45
128
13,90
9,43
2,23
171
4747
469,90
9,31
13,06
14
457
37,19
9,08
3,73
148
10,37
8,90
2,00
, ,
MI ,
.
: ,
,
. ,
, , ,
, .
(), ,
,
. ,
.
,
,
. , ,
(
115
),
, , .
. ,
,
,
. , ,
, , ,
.
. , ,
.. , ,
,
,
. ,
,
,
[36].
http://www.aot.ru/cgibin/bigrams.cgi.
3.3.4. ,
,
, ,
(, .).
,
. ,
, .
116
,
,
, , .
.
, . Corpus
Linguistics
( )
.
,
.
,
,
, un-, -ment.
3.3.4.1.
()
, ,
.
,
, . ,
civilization , civilize,
kindness , kind.
117
Corpus Linguistics
: -tion/-sion, -ness, -ment, -ity (
).
,
, ,
(search template),
(mansion, nation, city).
.
-.
- ( 500 .
). 1 .
.
11
(2,7 . /)
44 000
1 . /
(3 . /)
(0,5 . /)
11 200
11 300
11
. ,
,
. ,
,
. Corpus
Linguistics
118
,
.
,
, , . ,
500 . : movement
( 900 . ), activity, information,
development, relation equation. ,
,
. , movement 100 .
60 ,
development 10 .
.
,
,
1.
movement: The legs and hips, or arms and shoulders, may be used to
initiate movement in any direction.
..
,
, . ,
.
.. :
,
,
(
, .: . . ., 1973. . 313).
, ,
, .
, ,
[33].
119
, ,
.
, -
. , ,
,
,
. ,
, move: Garth
whistled breathily to himself and moved his hand crabwise along the table
(fiction); Its how much they move it that counts (spoken).
activity, development information: The third
Important aspect of information is speed; Sometimes algae can stop the
development and growth of these plants; The experimental results can be
described quantitatively by defining the size and activity of the shoot and
root systems.
,
, : I do hope you know
that never in this country do we develop the sort of mob war that makes a
protest against something however unjust develop into an organized riot
(spoken); Ive informed the Soviet government of that visit (spoken);
Aye, the big fellow is active again youll be pleased to know. (fiction).
, ,
.
,
.
, , ,
120
, ,
.
12
-tion/-sion
-ment
-ness
-ity
68%
15%
2%
15%
51%
21%
13%
15%
56%
24%
5%
15%
12, ,
.
1.
-tion/-sion
,
(68%).
2. -ment
, .
3. -ness ,
.
,
,
,
, 12. ,
-tion/-sion
,
(relate/relation educate/education),
.
-ment
. ,
-ment,
- .
121
, , movement,
government, achievement, agreement, argument. ,
-ment ,
: amazement, agreement, astonishment,
disappointment, embarrassment, excitement.
,
:
Patrick
shrugged
in
embarrassment; The assembly cried out savagely and Ralph stood up in
amazement.
: I
can quite see theres cause for disappointment.
-ness
,
-ness , : awareness,
bitterness, goodness, happiness, politeness, weakness.
,
: The bitterness in his heart was now mixed with a kind of childlike
excitement; He could see Phylliss face in profile, and it radiated energy
and happiness.
,
, : We feel
frustrated and bitter and annoyed; Im not too happy about this lauding of
language...
122
.
, ,
.
. ,
,
.
3.3.4.2.
, ,
,
,
.
, ,
,
. ,
Corpus Linguistics,
.
.
, n,
, v.
, .
,
:
123
1. ,
? ,
grasshopper ecology animal groups
?
, grasshopper animal
,
general ecology small
groups. , grasshopper
ecology
(grasshoppers) (ecology).
.
.
2. .
, -
.
, ,
. , he
,
grasshopper. ,
, .
3. . ,
?
: He had left home a little before eight; Joanne and her mother
were talking.
. ,
(perfect progressive)
. ,
.
124
,
.
,
,
,
,
(nouny) . 13
-
.
13
-
2,2 : 1
1,2 : 1
1,2 : 1
2,9 : 1
1,5 : 1
1,6 : 1
2,5 : 1
1,3 : 1
1,3 : 1
,
,
,
,
,
,
.
.
,
,
125
. ,
,
.
-,
.
, .
-
Corpus Linguistics
(. . 13), ..
. -
(2,9 : 1
1,5 : 1),
,
(. . 3.3.4.1). , ,
10
:
1. In planning a livestock building or conversion, the
psychological and health requirements of the livestock should undoubtedly
be given absolute priority together with the basic needs of the stockman.
,
, .
,
,
: planning, conversion.
,
.
7 5 ,
:
126
127
3.3.5. ,
.
, ,
; ,
; ,
, ,
( ) [23].
: , ,
"-", .. ) ,
; ) , ; )
; ) , [12].
,
,
.
4 : 1) ; 2)
-
;
3)
; 4)
.
, . [61], ,
,
,
,
, , .
Corpus Linguistics ,
128
:
1)
(
). ,
,
,
;
2) -
, ,
,
, , ,
.
, ,
. Corpus Linguistics
(
-
-- (LOB)
: 1)
? 2)
( )
, ? 3)
?
,
,
129
: , .
(he, she, it, we, they
..), (this, that, these, those),
the such .
(pronominalization).
,
, , ,
[50]. 3 ,
: ,
.
, : The
room is large. It is light and clean.
,
, ,
, .
, ..
, : That winter. It was awful.
,
, .
, ,
. , ,
.
Corpus Linguistics
,
:
4.
Thortec International Inc. said it reached agreements with an
investor group and Wells Fargo Bank under which it will receive loans
and an equity infusion in return for stock that will reduce the number of
shares in public hands by as much as 85 percent. The engineering and
130
consulting firm, which has been plagued by losses for five years, said the
restructuring is required to relieve its debt burden and "acute shortage of
cash."
5.
A: Right, I'm ready. Have you locked the back door? [pause] I
thought we were walking.
B: Well do you want to walk or do you want to go in the car?
A: Well I have to go to the paper shop.
B: Well I'll drop you at the paper shop while I go round.
A: Oh that's a good idea.
.
(Thortec International Inc., agreements, an investor group
.),
(I, you, we, that). , ,
.
,
I you,
, -
, .
. , -
,
, : I, you, the
back door, the paper shop,
, : agreements, an investor group.
.
131
,
,
. Corpus
Linguistics 4 :
1) : :: ;
2) : ,
;
3) : ,
;
4)
.
. ,
( 4)
, ,
.
: Thortec International Inc., an investor group, Wells Fargo
Bank, loans, an equity infusion, stock.
, ,
. ,
it ,
, : Thortec International Inc.
, ,
.
, .. ,
. ,
it Thortec International
Inc.
,
. ( 5)
I you . The
132
. ,
it
Thortec International Inc.
The engineering and consulting firm
.
133
.
,
.
:
,
200 40 ,
- --.
: (5 ),
(9 ), (10 )
(16 ). Corpus Linguistics
:
1) ,
;
2) ( ),
;
3) ( ),
,
,
. ,
, ,
, ,
;
4) (, , ),
,
I you
, 3- ,
,
134
,
;
5) ( ),
,
;
6) ,
.
.
,
,
,
.
, ,
, , ,
,
, .
6
, ,
. (them),
,
.
("ANAPHORIC") .
,
:
<<<Ref = anaphoric
<<<Status = given
135
. 6.
.
,
.
.
,
Corpus Linguistics,
. 7
136
.
.
,
.
, ,
.
.
, 70 %
.
, 65%
.
.
70
60
50
40
30
20
10
0
. 7. : ::
,
, ,
137
.
45
40
35
30
25
20
15
10
5
0
. 8.
8
, ,
,
.
, ,
.
,
,
(she, he it). 6
138
,
.
,
).
6.
A: What are you doing [this afternoon]?
B: I'm going [home]. I've got to teach about half past one. Can you
pick [your own trousers] up?
A: no I don't think it'll be likely. I've got [this meeting at three
thirty] and...
B: well what are we doing [this weekend]? [Tomorrow] I've got
[dancing class] in [the morning]
A: well I've nothing down anyway at all.
B: well [they]'re open [tomorrow afternoon] up till three o'clock,
you remember [last time] we went, and [they] were closed.
. ,
, :
,
.
Corpus
Linguistics
,
.
,
. 14,
139
.
,
. ,
.
, ,
.
.
,
, , ,
,
.
14
4,5
5,5
11,0
9,0
, ,
,
. , 5
, ,
,
.
.
A: Right, I'm ready. Have YOU locked [the back door]?
I thought we were walking.
B: Well do you want to walk or do you want to go in [the car]?
A: Well I have to go to [the paper shop].
140
, ,
, .
,
: : Honey,
can I use that ashtray, please; : Thanks,
mom, ok, talk to you later; (familiariser):
Got a ticket, mate?
:
25% , ,
,
.
141
(. 15),
( , ,
) .
15
68%
11,5%
11,25%
Mom!
9, 25%
, ,
,
(love, dear), .
, ,
.
:
1. sir , ,
,
.
2. ,
, .
,
142
[24].
() (. . 3.2.2.2.2)
,
, , :
.
, 19 ( 30-35 ,
19), ,
, (, , , ).
( , , , , , , ,
..)
:
,
, ,
.
,
(, () , () ),
- , :
1) -!/ / ?/ / /
?/ ? / ? ?
/-/ / /
/ / / / / /
/ / / / /
/ / / / /
/ / //
2) / / / / /
/ / / / /
/ / / / //
143
3) ? ! <> ?
, ? ? ?
/ . <> ,
? ?<> <> ? /
? / ? <> /
?
4) / /
/ / / <> / /
5) / <> - /
,
,
(
)
.
, ( 30
)
:
? ?
? ,
, -
, ,
, , .
:
?
, ,
19:
? ,
? ,
, , ! , ,
, . ,
144
,
:
, ,
.
,
, :
, ,
. ,
.
, ,
,
.
22 . 36 . (
) .
,
, .
( 5, 6, 7),
, ,
.
, ,
(3-7), .
, ,
,
[21]. ( ) ,
,
,
, . ,
19 ,
[24].
,
145
.
,
(, )
- ,
(
) .
146
,
, . .
,
. ,
,
,
.
, ,
, ,
, ,
.
,
, , ,
,
,
. ,
XXI .
147
1. .., .., ..
//
2006.
: - .-. -, - , 2006. . 16-24.
2. .. . ., 2007.
3. .., .., ..
. ., 1993.
4. ..
//
2004. ., 2004. . 55-64.
5. ..
// : . /
. .. (. ), .. , ..
. : , 2010. . 194-213.
6. .. . (
) -
. , 1995.
7. ., .. -
//
2006. ., 2006. . 81-87.
8. .. : .
. .: , 2008.
9. .. //
2006.
.: - .-. -; - , 2006. . 88-91.
10. ..
() //
2006. .: - .-. -; - ,
2006. . 92-93.
11. .., ..
//
2008 6-10 2008 . ., 2008. . 161-169.
12. .. -
. . 2.
// . .
, 39. ., 1982.
148
13. .. (.). . .,
1977.
14. .. : -.
. ., 2005.
15. .. - //
:
"-2005" (, 1-6
2005 .) ., 2005. . 166-171.
16. ., .., ..
: . . .:
, 2005.
17. .., ..
: . . .: ,
2004. 208 .
18. ..
: //
V -
:
. .:
, 2010. . 84-96.
19. .
// : .
. : , 2009. . 215-247.
20. .., ..
//
2001
. , 2001.
21. .. // / .
. .. . .: , 2000. . 453501.
22. .. . .: , 2002. 581 .
23. / .
.. . .: . , 1990.
24. ..
//
. 7 (14).
(2008)
/ . . . . . ., 2008. C. 352-355.
149
//
2006. .: - .-. -; - ,
2006. . 318-331.
31. .. //
-2002. .: , 2002.
32. .. :
. 2005. http://ruscorpora.ru/sbornik2005/03sitch.pdf
33. .. , ,
// XX : . .
.: , 1995. . 35-73.
34. ..
: . .
. . . ., 2008.
35. ..
// . 2006. . 8.
1. . 67-73.
36. ..
// Slavica Helsingiensia 34. :
. , 2008. . 343-357.
150
37. .. . 2002.
http://www.artint.ru/projects/frqlist.asp
38. .. :
// . . .
9. ., 2009.
39. Atkins S., Clear J., Ostler N. Corpus Design Criteria. Literary and
Linguistic Computing. 1992. Vol. 7, No. 1. Pp. 1-16.
40. Biber D., Conrad S., Reppen R. Corpus Linguistics. Investigating
language structure and use. Cambridge University Press, 1998.
41. esk nrodn korpus vod a pruka uivatele FF UK / Kocek,
J. Kopivov, M. Kuera, K. NK 2000.
42. Finegan E. LANGUAGE: its structure and use. N.Y.: Harcourt
Brace College Publishers, 2004.
43. Johannessen J.B. Corpus linguistics or corpora in linguistics? //
NODALIDA 2005, the 15th Nordic Conference of Computational
Linguistics. Joensuu, Finland, May 20-21, 2005.
44. Johansson S. Some aspects of the development of corpus linguistics
in the 1970s and 1980s // Ldeling A., Kyt M., eds. Corpus Linguistics.
An International Handbook. Volume 1. Berlin & New York: Walter de
Gruyter, 2008. Pp. 33-53.
45. Kilgarriff A. Web as corpus // Proc. of Corpus Linguistics 2001
conference (Lancaster University). Lancaster, 2001. Pp. 342-344.
46. Kyt M., Rissanen M. A language in transition: The Helsinki
Corpus of English texts, ICAME Journal, 1992. 16: 7-27.
47. Lakoff G. Pronominalization, Negation, and the Analysis of
Adverbs // Jacobs & Rosenbaum, eds. Readings in English
transformational grammar, Ginn & Co, Waltham, MA, 1970. Pp. 145165.
48. Leech G. The Distribution and Function of Vocatives in American
and British English Conversation // In Hasselgrd and Oksefjell eds., 1999.
Pp. 107-120.
49. Ldeling A., Kyt M., eds. Corpus Linguistics. An International
Handbook. Volumes 1, 2. Berlin & New York: Walter de Gruyter, 2008.
http://alknyelvport.nytud.hu/muhelyek/elte.../HSK-CorpusLinguistics.../file
50. McCarthy M.J. Discourse Analysis for Language Teachers.
Cambridge: Cambridge UP, 1991.
151
152
http://ruscorpora.ru
70 .
http://www.natcorp.ox.ac.uk/
http://sara.natcorp.ox.ac.uk/
100 .
http://corpus.nytud.hu/mnsz/
100 .
()
http://www.corpusdelespanol.org/
100 .
http://www.korpus2000.dk/
50 .
CORIS/CODIS
http://www.cilta.unibo.it/ricerca.htm
100 .
(LIVAC
Synchronous Corpus)
http://www.rcl.cityu.edu.hk/livac/
720 .
(150 . )
(Institut fr
Deutsche Sprache, Mannheim, Germany)
http://corpora.ids-mannheim.de/~cosmas/
1610 .
http://www.fida.net/eng/
100 .
153
174
93
74
54
43
31
30
27
25
22
18
17
16
15
14
13
13
13
13
13
13
12
12
11
11
11
10
10
10
154
4
155
1233
331
157
154
38
33
27
23
23
156
:
,
. , A users guide to the
Grammatical Tagging of the BNC,
BNC.
.
, ,
.
AJ0 (, good, old, beautiful)
AJC (, better, older)
AJS (, best, oldest)
AT0 (, the, a)
AV0 : ,
AVP AVQ (. ) (, often, well, longer (adv.), furthest)
AVP (, up, off, out)
AVQ Wh- (, when, where, how, why, wherever)
CJC (, and, or, but)
CJS (, although, when)
CJT that
CRD (, one, 3, fifty-five, 3609)
DPS (, your, their, his)
DT0
DTQ , Wh (, which,
what, whose, whichever)
EX0 there
ITJ (, oh, yes,
mhm, wow
NN0 , (, aircraft, data,
committee)
NN1 (, pencil, goose,
time, relevation)
NN2 (, pencils,
geese, times, relevations)
NP0 , (, London, Michael,
Mars, IBM)
ORD (, first, sixth, 77th, last)
PNI (, none, everything, one [as
158
pronoun], nobody)
PNP (, I, you, them, ours)
PNQ Wh- (, who, whoever, whom)
PNX (, myself, yourself, itself)
POS 's '
(, 'Peter's or somebody else's', : NP0 POS
CJC PNI AV0 POS)
PRF of. -
, "of" .
PRP ( of) (, about, at, in, on, on behalf of, with)
PUL : [
PUR : ]
TO0 to
VBB (to be) ,
is, ' s: am, are, 'm, 're be
VBD , (to be) (2-
): was were
VBG , "-ing": being
VBI , : to be
VBN : been
VDB , (do): do
VDD : did
VDG "-ing": doing
VDI : to do
VDN : done
VDZ "-s" : does, 's
VHB : have, ve
VHD : had, 'd
VHG "-ing": having
VHI : to have
VHN : had
VHZ (-s): has, 's
VM0 (, will, would, can,
could, 'll, 'd)
VVB (, forget, send, live,
return)
VVD 2 3 (, forgot,
sent, lived, returned)
VVG -ing (, forgetting, sending,
living, returning)
VVI (, forget,
send, live, return)
VVN II (3 ) (,
forgotten, sent, lived, returned)
VVZ -s (. ; . )
159
160
: ..
17.01.2011.
.. / 3. 500 .
664025, . , . , 8
161