Академический Документы
Профессиональный Документы
Культура Документы
3–39
www.elsevier.nlrlocaternpe
a
´
Laboratoire de Physique Theorique, LAPTH, Chemin de BelleÕue, B.P. 110, F-74941 Annecy-le-Vieux,
Cedex, France 1
b
Department of Theoretical Physics, Tata Institute of Fundamental Research, Homi Bhabha Road, 400 005
Mumbai, India
Received 3 May 1999; accepted 5 August 1999
Abstract
We revisit the effect of the large stop mixing on the decay and production of the lightest SUSY
Higgs at the LHC. We stress that whenever the inclusive two-photon signature is substantially
reduced, associated production, Wh and tth, with the subsequent decay of the Higgs into photons
is enhanced and becomes an even more important discovery channel. We also point out that these
reductions in the inclusive channel do not occur for the smallest Higgs masses where the
significance is known to be lowest. We show that in such scenarios the Higgs can be produced in
the decay of the heaviest stop. For not too heavy masses of the pseudo-scalar Higgs where the
inclusive channel is even further reduced, we show that large stop mixing also allows the
production of the pseudo-scalar Higgs through stop decays. These large mixing scenarios therefore
offer much better prospects than previously thought. As a by-product we have recalculated t˜1 t˜1) h
production at the LHC and give a first evaluation of the t˜1 t˜1) Z. q 2000 Elsevier Science B.V. All
rights reserved.
1. Introduction
1
´ a` l’Universite´ de Savoie.
URA 14-36 du CNRS, associee
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 4 9 7 - 6
4 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
have rather high masses. In the minimal scenario of SUSY, the light Higgs mass can not
exceed ; 130 GeV. Considering the existing LEP2 2 direct searches which indicate a
mass greater than about 90 GeV means that the lightest SUSY Higgs is confined to a
small mass range. Yet this range of Higgs masses poses considerable problems for the
™™
LHC. For a review see Ref. w5x. The dominant decay into bb is not exploitable,
especially in the inclusiÕe production channel gg h bb, due to the huge QCD
background. One therefore has to rely on the much smaller two-photon signal w6,7x.
However, especially for the LHC, the two-photon decay of the light Higgs to which
dedicated detectors are being designed constitutes a challenge. Moreover many effects
either due to the direct w8–14x or indirect Žloop. w13,15–18x contributions of the rich
SUSY spectrum enter the predictions of the two-photon rate of the supersymmetric
Higgs. These can lead to a substantial reduction of the supersymmetric Higgs signal as
compared to the standard model Higgs. Take for instance the rather simple scenario
w13,19x where all sparticles, apart from the parameters of the Higgs sector, are very
heavy and where mixing effects are negligible. This is the scenario which has been
extensively investigated by the ATLAS w20,21xrCMS w22x collaborations which leads to
the much celebrated MA –tan b Higgs discovery potential of the LHC. For short, we will
refer to this model as Class-H scenario. In this scenario the two-photon Higgs signal can
be much reduced compared to the S M Higgs especially as one lowers the mass of the
pseudo-scalar boson, A0 . Nonetheless, even in this scenario, this channel covers a large
part of the MA –tan b discovery plane, while when MA gets small so that the two-photon
signal gets too small, one can extend the discovery potential by exploiting the signatures
of the then not too heavy additional Higgses w19–22x. It is therefore important to inquire
how much the important two-photon signal can get reduced and equally important to
investigate when this reduction occurs, whether new mechanisms for Higgs production
open up or are enhanced. Could the latter then make up for the loss in the former?
Considering that a general SUSY model furnishes an almost untractable number of
parameters to give an unambiguous answer, apart from the Class-H scenario only partial
investigations w15–18x within specific models have been conducted. To quantify how the
rate of the two-photon signal can be affected as the SUSY parameters are varied, it is
instructive to take as a reference point the signal for an S M Higgs with a mass that of
the lightest SUSY Higgs. In Ref. w17x this has been done within the mSUGRA
™™
hypothesis w23,24x but considering only the dominant inclusive Higgs production
channel: gg h gg . One does find indeed that this ratio can be much smaller than
unity even for relatively large MA Žwhich is generic in mSUGRA. and hence making it
more difficult to search for the SUSY Higgs than for the same mass S M Higgs.
However, it is known w25x that within mSUGRA other channels for Higgs production
may open up, like the cascading of the heavier neutralino to a lighter one and a Higgs,
thus offering the fantastic possibility of not only discovering supersymmetry but
allowing an easy detection of the Higgs w26x before its observation in the two-photon
channel. Recently, it has been argued w18x, that even in the large MA region, the
so-called decoupling limit w27x, if one introduces large mixing in the stop sector a very
substantial reduction can also ensue in the inclusiÕe two-photon Higgs signal. This
2
The limit on the mass of the SUSY Higgs depends on the SUSY parameters. For an update on the limits
on the Higgs mass see Refs. w1–4x.
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 5
effect together with the issue of the mixing in the Higgsinorgaugino sector had been
studied previously by comparing the rates with and without mixing w15,16x. It was found
that there were small regions in parameter space where the rate for the two-photon
Higgs signal could be either very much reduced or very much enhanced by the
inclusion of mixing.
When large reductions in an important channel occur it is crucial to find out how
other channels are affected. What has not been stressed in the previous studies w15–18x,
especially in the case of large mixing, is the importance of the associated Higgs
production w28–33,35x Žthe K-factor for WH in the SM is calculated in Ref. w34x. and
even if no efficient b-tagging were possible, how in these scenarios these processes can
salvage the Higgs signal. Within the S M and in the no-mixing scenarios, both CMS
w22,36,37x and ATLAS w21,38x have now shown that associated Higgs production
ŽWh,Zh and tth., with the subsequent decay of the Higgs into photons, can provide an
invaluable Higgs signal, when enough luminosity has been accumulated. This is
because, although associated production has lower rates than the inclusive channel, the
corresponding signals are not plagued by as much background. The CMS analysis w22x
for the S M Higgs shows that already with an integrated luminosity of 30 fby1 the gg l
Ž l s e, m . leads to a significance higher than 5 Žthus an observable Higgs signal. for the
range of light Higgs masses we are interested in. For a high luminosity of 100 fby1 this
significance improves to more than 10 and is higher than the significance in the
inclusive channel for practically all Higgs masses in the range of interest. Although it is
known that the ATLAS analyses are less optimistic 3 when it comes to the two-photon
signal, either in the associated or inclusive channel w21x, it remains that at high
luminosity the associated production provides a better reach in the MA –tan b plane w21x.
One should therefore also inquire in the case of the SUSY Higgs if the rates for
associated production are reduced together with the inclusive rates or if they can rather
help the discovery potential. At the same time if the rates for the SUSY Higgs are very
much affected this generally means that some sort of non-decoupling of some of the
SUSY particles is taking place. These particles should then be observed directly.
Moreover since their coupling to the Higgs can not be negligible, these same particles
could trigger Higgs production, through their decays for instance or through new
associated productions. Another important aspect to address is the impact of stop mixing
on the Higgs mass and its conjunction with the reduction in the inclusive channel.
Indeed, the significance in the inclusive two-photon channel is very much dependent on
the Higgs mass, even in the narrow range allowed by SUSY w21,22x, contrary to the
associated two-photon channel where the significances are rather flat as a function of the
Higgs mass in the range of interest w21,22x.
The present paper revisits the case of the large mixing in the stop sector w15,16,18x,
how the WhrZh and tth associated production saves the day when the inclusive channel
drops to critical levels and how other new channels for Higgs production open up. To set
the stage, Section 2 is intended as a reminder of how much a reduction in the usual light
Higgs signals can occur and is tolerable in the Class-H scenario. This will serve as a
benchmark when we study whether the other scenarios could give reductions which are
3
The differences between ATLAS and CMS are quantified in w39x.
6 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
much worse than those obtained with lowering MA , a situation of some concern
especially if no new production mechanism is exploitable. We will also present some
approximations for evaluating the reductions due to MA which will be useful even when
we study the stop mixing case.
Our analysis of the large mixing scenario in the stop sector is contained in Section 3.
We first consider the large MA limit. While we confirm that large reductions in the
inclusive channel can occur, we point out that in most cases these are no worse than
what is obtained with a low MA in the no-mixing case. Moreover we will show that if
tan b ) 3 an increase in the inclusiÕe channel is possible. This increase is not possible
for low values of tan b as studied in w18x because the effect is associated with a too low
Higgs mass already excluded by LEP2. We also carefully analyse for which Žlight.
Higgs masses these reductions occur. We will show that contrary to the no-mixing
scenario where the signals in the inclusive channels are lowest for the lowest Higgs
masses, in the case of large stop mixing the most drastic drops in the inclusive channels
do not occur for the lightest Higgs mass possible. Indeed the effect of mixing tends to
increase the mass of the Higgs compared to its value in the absence of mixing.
Considering that the significances for the S M Higgs in the inclusive channel are lowest
for the lowest possible Higgs masses in our range, ; 90–130 GeV, means that the
largest reductions do not necessarily correspond to the lowest Higgs signal. For instance
a reduction of 0.4 may well be tolerated for a Higgs mass of 110 GeV but a reduction of
0.8 may be ’’too much’’ when it occurs in conjunction with m h s 90 GeV. More
importantly we find that at the same time as the inclusive channel decreases, the
associated production increases and has much better significances than with an
S M Higgs or with a corresponding SUSY Higgs where the stop mixing have been
switched off. We will explain why this is so. It should also be pointed out that the large
reductions in the inclusive channel occur mostly when one of the stops becomes rather
light, below about 200 GeV. In many instances, as first suggested by Refs. w40,41x,
associated t˜1 t˜1 h production can provide a new channel to search for the light Higgs. We
will quantify how much one can benefit from this additional channel. Most studies
w15,16,18x have assumed equality of all soft squark masses which almost invariably
leads to a maximal mixing angle <sin2 u t˜ s 1 <, where u t˜ is the mixing angle in the stop
sector. Maximal mixing should be viewed as a very special singular point in the large
array of the SUSY parameters and even though justified for the first two families as
suggested by the mSUGRA w24x Žfor a review see Ref. w23x. scenario is quite unnatural
for the third family especially in view of the large Yukawa coupling. We show,
nonetheless, that maximal mixing is not always required for the reductions in the
two-photon rates to occur. However, moving away even slightly from this singular
mixing angle, while not changing much the previous conclusions, can open yet another
Higgs production channel. We point out that provided m t̃ 2 is not too large, say
m t̃ 2 ( 500 GeV, so that its production rate is large, t˜2 can provide a source of Higgs
through its decay into the lighter stop thanks to a sizeable Yukawa t˜2 t˜1 h coupling. This
coupling is controlled by the same parameters that make the t˜1 t˜1 h coupling large and
™ ™™
which lead to a reduction of the inclusive channel. We will compare the rate for this new
Higgs production mechanism s Ž pp t˜2 t˜2) t˜2 t˜1) h q t˜2) t˜1 h. with the associated light-
est stop pair production mechanism s Ž pp t˜1 t˜1) h. w40,41x and show that the cascade
decay of the t˜2 can be substantial. This is akin w25x to the mixing in the higgsino–gaugino
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 7
™
sector which has been shown w26x to allow a direct Higgs production through the
cascade decay x 20 x 10 h. We then move to the analysis of the combined effect of
allowing for smaller pseudo-scalar masses together with large stop mixing. For moderate
MA our conclusions are little changed, the associated productions offering always a good
channel. When MA gets rather small Ž MA ; 250 GeV., the usual reduction, as compared
to the S M , in both the inclusive and associated production occurs. This is irrespective
of mixing and can be explained along the lines of what happens in the Class-H scenario.
Including the large mixing effects from the stops could decrease even further the signal
from the inclusive channel, but the same effect again helps increase the associated
production channel. Therefore the reach in this channel alone is better than what
previously studied by ATLAS w20,21x and CMS w22x in the MA –tan b plane for the
no-mixing Class-H scenario. Luckily in these situations with both a low MA and large
™ ™ ™ ™ ™
mixings in the stop sector we find that beside the new channels for Higgs Ž h. production
s Ž pp t˜2 t˜2 t˜2 t˜1 h. and s Ž pp t˜1 t˜1) h., one can also have s Ž pp t˜2 t˜2 t˜2 t˜1 A..
™
There are even instances where the pseudo-scalar Higgs triggers h production through
A Zh. Independently of the extreme mixing scenario studied here we advocate to
exploit the potentially large Yukawa coupling of the stops to search for the HiggsŽes.
through the cascade decays of these third generation squarks. In all our discussion we do
not mention rescuing the Higgs signal through its decay into bb in the associated
production w42x which would be possible provided good b-tagging is available as
discussed by ATLAS w21,43x. This is a difficult issue w44x especially at high luminosity
and further simulation studies are needed. Note also, as we will discuss in the next
section, that there can be scenarios where the branching ratio of the Higgs into bb also
is reduced. We will not investigate this possibility. Section 4 gives our conclusions.
In order to compare the various effects of lowering the masses of the SUSY particles,
we start by briefly reviewing the situation when the masses of all sparticles but those of
the Higgs sector are set to a high scale, M˜ S s 1 TeV. The mass of the pseudo-scalar
Higgs is let free. Moreover in the illustration we have also taken the Higgs mixing
parameter such that m s y180 GeV and the SUŽ2. gaugino mass M2 s 500 GeV with
the traditional GUT assumption on the gaugino masses which at the electroweak scale
translates as
5
M1 s tan2u W M2 . Ž 2.1 .
3
Therefore strictly speaking we have allowed rather light charginos and neutralinos. All
the tri-linear A-terms were set to zero. These kind of scenarios w13,19,20,22x, with high
masses of sfermions, have been assumed in the simulation searches for the HiggsŽes. by
the ATLASrCMS Collaboration leading to the much advertised MA –tan b plots.
Meanwhile it has been known for some time that as MA increases one reaches a
decoupling regime w27x whereby at low energy only the lightest neutral Higgs appears in
the spectrum with the important property that its couplings are essentially the same as
those of the standard model. This kind of S M-like Higgs should be easiest to discover
8 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
Fig. 1. Variation of R g ggg with MA , for tan b s 2.5 Žfull. and tan b s10 Ždotted..
at the LHC. However, as the mass of the pseudoscalar decreases the production rates of
the lightest Higgs also decrease. The reduction in the inclusive two-photon rate of the
Higgs, as compared to the S M , is defined through the ratio
G SUSY Ž h ™ gg . = BR Ž h ™ gg . .
SUSY
Ž h ™ gg . = BR Ž h ™ gg .
R g ggg s Ž 2.2 .
G SM SM
This ratio is calculated by taking the same mass for the S M Higgs as the one that is
derived for the SUSY Higgs once all the SUSY parameters are set. Throughout this
paper we use HDECAY w45x to calculate all the couplings, widths and branching ratios of
the Higgs. This program incorporates the leading two-loop corrections for the Higgs
masses following Refs. w46,47x. We show in Fig. 1 how this ratio decreases with MA .
This ratio can drop to as little as ; 30% for MA s 200 GeV and tan b s 10. Though
trivial in this case, it is useful to point for later that as the pseudo-scalar Higgs mass
decreases so do the other Higgs masses,4 therefore the most drastic drops occur for the
lowest range of the lightest Higgs, see Fig. 2. This is particularly drastic for tan b s 2.5
GeV, where the drop occurs around m h ; 90 GeV. It is for these low masses that the
significance of the S M Higgs is also lowest w20,22x and therefore for this low tan b this
would constitute the worst scenario for the discovery of the lightest SUSY Higgs
through its two-photon decay.5
What is troublesome for a low MA is that the branching fraction into two photons is
the main reason behind the drop, as shown in Fig. 2. This ratio is defined as
BRSUSY Ž h ™ gg . .
Ž h ™ gg .
Rgg s SM Ž 2.3 .
BR
For instance for MA s 200 GeV and tan b s 2.5, the ratio of the branching fraction
into photons, Rgg , is reduced to about 0.5 with respect to what it would be in the S M .
This reduction accounts for much of the reduction in R g ggg , R g ggg s 0.4. Therefore one
4
In the analysis we have required Mh )90 GeV.
5
Of course, for MA ( 2 m t there is a chance of discovering the other Higgses.
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 9
Fig. 2. Ža. R g ggg versus Mh as MA varies from 200 GeV to 1000 GeV for different values of tan b .
m sy180 GeV. The lowest values of R g ggg and Rgg correspond to MA s 200 GeV. Žb. R g ggg versus Rgg .
expects also a considerable drop in the Higgs signal even in the associated channel Wh
and tth with the subsequent decay of the Higgs into two photons. These channels have
been shown to be invaluable w20–22,38x especially when a high luminosity has been
accumulated.
To get an understanding of these gross features and compare with what happens in
other scenarios, it is worth discussing how the various couplings, tth, bbh and WhrZh
that enter both the associated production and, at the loop level, the inclusive production
are influenced by a change in MA . This is best illustrated and most transparent in the
large MA , so-called decoupling, limit which has been shown to be already operative at
200 GeV w27x. Take the tth coupling which differs from the S M by the factor R,
g cos a
Vt t h s R mt with R s , Ž 2.4 .
2 MW sin b
where a is the usual angle that appears in the diagonalization of the CP-even neutral
Higgs mass matrix. As was shown elsewhere w48,49x, in this limit and up to radiatiÕe
corrections we may introduce the factor r,
where r collects all MA dependence and other radiative corrections which also occur
in the computation of the Higgs masses. Neglecting the latter we have
2 MZ2 tan2b y 1
r, 0 0. Ž 2.6 .
MA2 tan2b q 1
Then the reduction factor which appears in tth is
2
1 q tan2b r
R s 2 2
, R,1y . Ž 2.7 .
1 q tan b q r q 2 r 1 q tan2b
Likewise it is found that in hbb there is an enhancement factor which especially, for
larger values of tan b , is more substantial than the reduction in the top vertex
tan2b
Rbbh , 1 q r . Ž 2.8 .
1 q tan2b
On the other hand, the WWhrZZh vertex, controlled by sinŽ a y b ., is much less
affected: it only shows a quadratic dependence on r:
r2 tan2b
RV V h , 1 y 2
. Ž 2.9 .
2 Ž 1 q tan2b .
™
In the S MG Ž h gg . is dominated by the W loop which interferes destructively
with the top. Since, in this scenario the dominant W coupling is hardly affected at
™
moderate MA the little change in the top Žbottom. coupling has negligible effect on
G Ž h gg .. However, this is not the case for the branching fraction into photons. Here,
since the total width is dominated by the width into bb, which is larger than in the S M ,
™
the branching ratio into photons will be reduced, especially as tan b increases, see Eq.
Ž2.8.. On the other hand, we expect a slight decrease in the G Ž h gg .. This is because
™
vertex is reduced. Therefore the main effect in the production rate pp h gg is due
to the reduction in Br Ž h gg .. This very crude argument gives the correct order of
™™
it is dominated by the top loop in the S M , and therefore it is reduced roughly as the tth
magnitude in the different drops in R g ggg and Rgg shown in the figures. Writing for
example
G SM Ž h ™ bb . , 1 y 4 M 2
tan2b Ž tan2b y 1 .
Ž h ™ bb .
Z
Rgg , 1 y 2 2 Ž 2.10 .
G SUSY M A Ž 1 q tan2b .
we recover Rgg s 0.483 for MA s 200 GeV and tan b s 2.5 which compares very well
with the full calculation. Moreover in a first approximation, the change in the width into
gluons can be mostly accounted for by the change in the tth vertex, in which case we
may write
4 MZ2 tan2b y 1
R g ggg , 1 y . Ž 2.11 .
MA2 1 q tan2b
For larger tan b and especially for low values of MA the approximation is acceptable
but not as good. This is partly due to the effect of radiative corrections on the h bb ™
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 11
coupling through the diagonalisation of the neutral Higgs mass matrices.6 Especially for
large tan b these corrections are no longer so suppressed compared to the MA2 correc-
tions w17,49–51x. In our case the effect is rather marginal since the only mixing
parameter, m , is rather small compared to the SUSY scale. However, let us stress that in
all the analyses in this paper even when considering large values of the tri-linear
coupling Žsee next section. the branching ratio into bb is hardly affected. Because our
aim is to concentrate on the effect of the tri-linear coupling of the top sector we do not,
in the present paper, analyse the case with very large tan b as these would require to
analyse the sbottom sector and also for large m possible reductions in the h bb
branching ratio.
™
When considering the associated channels, beside the reduction in the two-photon
branching ratio, a further, even though slight, reduction factor affects tth production
while we expect Wh to be much less affected. This is borne out by the numerical
analysis shown in Figs. 1–3. Once again we define, for the associated productions, ratios
normalized to the S M rates for the same Higgs mass:
s SUSY Ž pp ™ Wh. = BR Ž h ™ gg .
SUSY
Ž pp ™ Wh . = BR Ž h ™ gg .
R Wgg s Ž 2.12 .
s SM SM
™
and similarly for the associated top production: R t tgg . At the level of the cross sections,
s Ž pp Wh,Zh,tth., the ratios are assumed to be given by the ratios of the squares of
the WWh and tth couplings. We clearly see, Figs. 1–3, that a lowering of MA in case of
no mixing not only results in a lowering of m h but also in a reduction of both the
inclusive and associated two photon channels, as compared to the S M signal for the
same m h . The worst hit channels are the direct production and the tth. The Wh channel
is slightly less affected. Note that ttgg versus R g ggg shows almost no tan b depen-
dence, Fig. 3. This is due to the dominance of the top loop in the gg h production,
controlled by the same vertex that enters the associated tth cross section. These
™
reductions occur for the lightest Higgs mass and are due essentially to the drop in the
branching ratio of the Higgs into two photons, see Figs. 1–3. Since as we pointed earlier
the significances in the associated channels are rather flat with respect to the Higgs mass
in the range we are interested in, this explains why the 5s discovery region based on the
associated channel in the MA –tan b plane w20,22x are almost independent of tan b
Ž2.5 ( tan b ( 10.. On the other hand, the discovery region based on the inclusive
channel shows a strong difference between low and high tan b values. This is due
essentially to the low significances for low Higgs masses which translates into low
significances for tan b in case of no mixing for a fixed value of MA . Therefore although
the reduction due to a low MA is slightly worse for tan b s 10 than for tan b s 2.5, Fig.
1, the significance in the direct channel is much better for tan b s 10 Ž m h ; 110 GeV.
than for tan b s 2.5 Ž m h ; 93 GeV.. This observation is to be kept in mind and shows
the importance of localising where in terms of m h any reduction, especially, in the
inclusive direct channels occurs. Take for instance the CMS analysis w22x. It is found
that already with a low luminosity of 30 fby1 the significance for the S M Higgs is
6
In this discussion this applies especially to the off-diagonal terms of the Higgs mass matrix.
12 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
Fig. 3. Variation of R g ggg versus R Wgg , R t tgg with 200( MA (1000 GeV and tan b s 2.5 Žfull., 10 Ždotted..
larger in the associated channel than in the inclusive channel for m h - 105 GeV and is
above 5. Translated to Class-H this means that for MA 0 450 GeV associated production
allows observability of the light Higgs for all values tan b s 2.5–10 whereas direct
production extends the reach in MA for tan b s 10 Ž MA ; 400 GeV.. With a higher
luminosity of 100 fby1 , the CMS analysis shows that the reach in MA is better in the
associated channel for all values of tan b and especially for low tan b . In terms of the
ratio Rgg , this analysis translates into discovery for Rgg ) 0.4 corresponding to MA ) 220
GeV Ževen slightly better for tan b s 2.5, see Fig. 3.. Note that one can recover the
obsvervability region of the SUSY Higgs of the CMS analysis by combining our results
for the ratios R with their analysis for the S M . As stated earlier the ATLAS w21x
analysis requires higher luminosities and the above numbers correspond roughly to a
luminosity of 300 fby1 to take full advantage of the associated production.7
7
For lower luminosities the ATLAS significances in the associated channels are based on a Poisson
statistics. We thank Guillaume Eynard for providing us with his code and the ‘‘data’’ for the SM Higgs in the
separate channels WhrZh and tth, see also Ref. w38x.
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 13
To discuss the stop sector and define our conventions, we turn to the weak eigenstate
basis where the mass matrix in the t˜L ,t˜R involves the SUSY soft-breaking masses: the
common SUŽ2. mass m ˜ Q˜ 3 and the UŽ1. mass m˜ U˜3 R , beside the mixing, m˜ 2t˜L R
4
˜ 2Q˜ 3 q m2t q 12 MZ2 1 y
m2t̃ L s m ž sin2u W cos Ž 2 b .
/ Ž 3.1 .
3
m
m2t̃ L R s ym t A t q
ž / ' ym t A˜t . Ž 3.2 .
tan b
One sees that apart from the soft SUSY-breaking parameters, m ˜ Q˜ 3 , m˜ U˜3 R and the
tri-linear top term Ž A t ., there appears also the ubiquitous tan b and the higgsino mass
term m.
The stop mass eigenstates are defined through the mixing angle u t̃ , with the lightest
stop, t˜1 ,
y2 m t A˜t 2 m2t̃ L R
tan Ž 2 u t˜ . s or sin Ž 2 u t˜ . s .
MZ2 cos2 b 8 sW2 m2t˜1 y m2t˜2
m2Q˜ 3 y mU2˜3 R q
˜ ˜
2 ž 1y
3 /
Ž 3.4 .
For further reference note, in the case of equal soft SUSY breaking masses for the
left and right sector of the stop Ž m˜ 2Q˜ 3 s m˜ U2˜3 R ., that apart from the case of extremely
small mixing A˜t s O Ž MZr10., one has maximal mixing: sinŽ2 u t˜. , 1. In this case we
have
m t 12 A˜t tan b 2 q 1
tan Ž 2 u t̃ . , . Ž 3.5 .
MZ MZ tan b 2 y 1
Mixing in the stop sector not only allows one of the stops to be rather light, but this
light stop can have rather large Yukawa couplings. Let us therefore discuss this
coupling. The stop–stop Higgs couplings, like the stop mass matrix, emerge essentially
from the F-terms in the scalar potential Žthere is a residual D-term component A MZ2 ..
14 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
With the angle a in the Higgs mixing matrix, the t˜1 t˜1 h coupling is Žwe write the
potential.
m t cos a
Vt˜1 t˜1 h s yg
MW sin b ž Ž A t y mtan a . sin u t˜cos u t˜ y m t
MZ2 sin b 1 2 2
q
m t cos a
sin Ž a q b .
žž 2
y
3
sin2u W cos 2u t˜ q
/ 3
sin2u W sin2u t˜
// .
Ž 3.6 .
The vertex does involve some important parameters which stem from the Higgs
sector, notably the angle a . In the decoupling limit w27x which we are most interested in
and up to radiative corrections Ž3.6. reads
1 mt m r
Vt˜1 t˜1 h s qgR
MW ½ m2t q sin u t˜cos u t˜ sin u t˜cos u t˜ m 2t˜1 y m2t˜2 y
ž ž / tan b /
1 2 2
qMZ2 Ž Ž 2 q r . cos 2b y 1 .
žž 2
y
3
sin2u W cos 2u t˜ q
/ 3
sin2u W sin2u t˜
/5 .
Ž 3.7 .
We see that in the limit r < 1 where r is neglected, the t˜1 t˜1 h very much simplifies.
Note that neglecting the correction due to r, the coupling no longer depends on m.
Notice also that Eq. Ž3.7. shows that this correction is reduced as tan b gets larger.
Discarding the r correction altogether, we end up with the compact formula
2 2
g ž m˜ y m˜ / q m q M
t1 t2 1 2
Vt˜1 t˜1 h ,
MW ž sin2 Ž 2 u t˜ .
4
2
t
2
Z cos Ž2 b . žž 2
y
3
sin2u W /
2
=cos 2u t˜ q
3
sin2u W sin2u t˜
// . Ž 3.8 .
We also confirm that the tan b dependence in the vertex is also hardly noticeable. Eq.
Ž3.8. makes it clear that even for maximal mixing, sin2 2 u t̃ ; 1 the contribution of the
stops and that of the top cancel each other thus leading to a very small vertex. The dip
occurs for values of the mixing angle such that
2
4 m 2t
sin 2 u t̃ , . Ž 3.9 .
m 2t˜2 y m2t˜1
On the other hand, when the mixing is negligible, the vertex is accounted for almost
entirely by the top mass and therefore has the same strength as the tth vertex.
The t˜2 t˜2 h vertex can be obtained from t˜1 t˜1 h by sin u t˜ cos u t˜ and m t˜1 m t˜2 .
Therefore if the t˜2 t˜2 h and t˜1 t˜1 h vertices were to be added, the mixing terms do not
l l
survive, as expected since the latter mix the left and right states. This is to be kept in
™ ™
mind. In situations where the stop masses are of the order of the top mass so that they
both contribute to h gg or h gg , the effect of mixing will, to a large extent, be
washed away.
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 15
Already at this point we can attempt to predict the general features in R g ggg and Rgg
that will be introduced by large mixing in the stop sector. Consider the large MA limit
where the t˜1 t˜1 h vertex is most transparent, see Eq. Ž3.8.. Naturally the stop will
™ ™
contribute if its mass is not too large and if its coupling to the Higgs is also large. When
there is no mixing, only the diagonal m 2t term in Eq. Ž3.8. will, in both G Ž h gg . and
™
G Ž h gg ., interfere constructiÕely with the top quark contribution. We therefore
expect an enhancement of G Ž h gg ., that is of the inclusive production. On the other
™
hand, the fact that the toprstop loops and W interfere destructively, means that
G Ž h gg . will get smaller. Nonetheless since the W loop is much larger than the top
loop, the reduction in the two-photon decay width will be modest compared to the
enhancement in the two gluon width. Considering that at large MA the width into bb
™™
Žthus the total width. is hardly affected by mixing and hence sensibly the same as in the
S M , direct production s Ž pp h gg . is enhanced. At the same time associated
WhrZh and tth with the subsequent two-photon decay of the Higgs will be reduced
somehow. For moderate mixing the t˜1 t˜1 h vertex gets vanishingly small: here no effect is
to be expected, either in any of the associated productions nor in the direct production.
When the mixing gets very large so that now, it is the term in m 2t̃ 2 in Eq. Ž3.8. which
dominates, the sign of the interferences between the stop and the top quark loop gets
reversed. In this situation direct production can get extremely small, the stop loop
cancelling the top loop. In the two photon decay, on the other hand when this
cancellation takes place it still leaves the large W contribution. Nonetheless, the increase
in Rgg will be modest compared to the dramatic decrease in R g ggg . Since the total
width is hardly affected by these mixing effects the direct inclusive production will be
much reduced. However, associated WhrZh and tth gets enhanced in these situations.
Because t˜1 is relatively light and its coupling to the Higgs Ž h. large, associated stop
cross sections can, exceptionally, be of the order of that of the associated top cross
section w40,41x or even larger. At the LHC this cross section is essentially induced by
gluon gluon fusion and is therefore directly proportional to the square of the t˜1 t˜1 h
vertex. We have recalculated this cross section with the help of a modified version of
CompHep w55–58x to properly take into account the radiative corrections to the Higgs
mass and couplings. For our analysis we have found it useful to calculate the cross
section at the LHC by taking, as a reference point, the m2t term only in the t˜1 t˜1 h vertex,
Eq. Ž3.8.. The cross section can then be easily evaluated by specifying as independent
input parameters m t̃ 1 and m h only. The corresponding cross sections are shown in Figs.
4 and 5. We have made a polynomial fit, in the variables m h y m t˜1 to these cross
sections that reproduces the full results with a precision better than 2%, which is well
within the uncertainty due to the choice of scale and structure function. Once a set of
SUSY parameters is given, apart from the stop masses and tan b it will also furnish the
corresponding Higgs mass, m h , and the proper t˜1 t˜1 h vertex can be evaluated. One can
™
then properly normalise our cross sections. Considering the relative complexity of the
pp t˜1 t˜1 h cross section this method is much more efficient when we are scanning over
™
many SUSY parameters as done in the present analysis since we do not have to
recalculate the pp t˜1 t˜1 h for each scan. Our results agree with those shown in Refs.
16 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
Fig. 4. t˜1 t˜1 h at the LHC as a function of m t˜1 and for a range of SUSY Higgs masses. The t˜1 t˜1 h vertex is set in
the limit of large MA with no mixing and no D-term, see text for how to normalise it when the SUSY
parameters are fixed. Also shown is t˜1 t˜1 Z. For the latter the vertex has been computed with cos 2u t˜ s1r2, i.e.
maximal mixing. For other values of the mixing, rescale by using the vertex Žcos 2u t̃ r2y 23 sW 2 .
. We have taken
the CTEQ4 structure function with a scale set at the invariant mass of the subprocess.
w40,41x as well as in Ref. w59x, however the largest cross sections shown in w40,41x do
not pass our constraint on the Higgs mass m h ) 90 GeV andror Dr Žsee below..
2 1
qMZ2 sin2 u t˜ Ž cos2 b q rcos 2b . ž 3
sin2u W y
4 /5
™ qgR 4 M1 W
sin4u t˜ m2t˜1 y m2t˜2 .
ž / Ž 3.10 .
It is crucial to note that within the approximation of neglecting the r terms and the
D-terms, this coupling does not survive in the maximal mixing scenario, it is propor-
™
tional to sinŽ4u t̃ .. Nonetheless because of it its Yukawa nature this can be a rather large
coupling and therefore phase-space allowing Br Ž t˜2 t˜1 h. can be large. Considering
Fig. 5. As in Fig. 4 but as function of the Higgs mass. The tth is also shown for comparison for the same set
of structure functions and by only taking into account the gluon gluon processes. For the latter including the
small quark initiated process, our results agree with Ref. w60x. Also shown is ttZ.
18 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
Fig. 6. Next-to-leading order t˜1 pair production at the LHC, for three representative values of the gluino mass.
We used the code given to us by Michael Spira w61,62x.
that t˜2 pair production exceeds 1pb for m t˜2 ( 500 GeV Žsee Fig. 6., t˜2 can trigger Higgs
™
Ž h. production.8
Contrary to x 20 x 10 h whose branching ratio can reach 100% for some of the SUSY
parametersw25x and thus very efficiently triggers Higgs production, Br Ž t˜2 t˜1 h. w26x ™
™ ™
can never reach 100%. This is because, independently of other decay modes into the b,b˜
™
sector, there is always the competing larger decay rate t˜2 t˜1 Z. Indeed when the
splitting is large t˜2 t˜1 Z can be approximated by t˜2 t˜1 f 0 , f 0 being the neutral
Goldstone boson, with an effective coupling g 4 M1 W sin2 u t˜Ž m2t˜2 y m2t˜1 . s gr2 MW m t Ž A t q
mrtan b ..
™
When MA is small, t˜2 can also provide a welcome source of pseudo-scalar Žand
heavy Higgses. through t˜2 t˜1 A, H. What’s more, the strength of the t˜2 t˜1 A ™
8
t˜1 t˜2 is completely negligible at the LHC w61x. ŽFor review on QCD corrections to SUSY particles
production at hadron colliders, see Ref. w62x..
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 19
™
The decay t˜2 t˜1 H is generally smaller and vanishes when the mixing is maximal.
In the decoupling limit this becomes
mt At
Vt˜1 t˜2 H ; igcos2 u t˜
2 MW ž tan b /
ym . Ž 3.12 .
Of course, to calculate the branching ratios of t˜2 into Higgses we have evaluated all
possible widths of t˜2 , without QCD corrections though. We have checked our numbers
against those of w63–67x as well as the output of GRACE w69x. ŽFor a description of the
program see, for example, Ref. w68x..
For a general recent review of stop decays see Refs. w54,70x. ŽRef. w70x contains a
nice review of QCD corrections to stop decays..
™
For further reference note that whenever stop mixing is not excessively small, we can
reach Br Ž t˜2 t˜1 h. ; 10%. Associated t˜2 t˜1 A in mSUGRA has also been entertained
recently w59x. However, in the mSUGRA scenario the mixing is generally not large and
the stops are usually heavy leading to small cross section for Higgs production through
stops. But then in this same scenario large drops in the inclusive production due to stop
mixing hardly occur either.
Large values of the t˜1 t˜1 h vertex which lead to the largest pp t˜1 t˜1 h and the ™
sharpest drop in R g ggg occur when the mixing is large with a large splitting between the
two stop physical masses. It is, however, for this configuration that one has some strong
constraints which preclude the highest values of the cross section. For instance, one has
to be wary that imposing a lower bound on the Higgs mass, from its non-observation at
LEP2 say, can restrict drastically the sin2 u t˜–m t˜2 parameter space. This constraint is very
much dependent on tan b . Much less dependent on tan b but a quite powerful one, for
the values of m t̃ 1 that we have entertained, is the constraint coming from Dr w71–
75,77–79x. ŽFor earlier analyses, see Ref. w76x.. Taking the present limit Dr - 0.0013
applicable to New Physics with a light Higgs w80x, which here means essentially the
contribution from stops and sbottoms Žand marginally the Higgs sector 9 . generally
excludes region of the parameter space where the t˜1 t˜1 h is largest. In our Dr constraint
we have relied on the two-loop calculation of w81,82x, which can enhance Dr by as
much as 10% even with a heavy gluino.
One more constraint one needs to mention. In the stop sector and in the presence of
large mixing as is the case here, one often has to check whether the parameters do not
induce colour and charge breaking global minima ŽCCB. w83–88x. ŽFor a recent
summary see Ref. w89x.. It has been argued that the constraints based on the global
minima may be too restrictive w90–92x. It was shown that for a wide range of
9
For light stops in the decoupling limit the sbottom–stop contribution when substantial gives a positive
contribution, whereas the Higgs sector contributes a negligible negative contribution.
20 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
parameters, the global CCB minimum becomes irrelevant on the ground that the time
required to reach the lowest energy state exceeds the present age of the universe. Taking
the tunneling rate into account results in a milder constraint which may be approximated
w90–92x by
We start our analysis by considering the case with tan b s 2.5. Although this value is
not far from being excluded by the direct LEP2 searches w1–4x, depending on the exact
SUSY parameters, we study it here in order to compare our results with those in w18x and
to show a feature which is not present for higher values of tan b .
3.5.1. The case of a common mass in the third generation squark sector
We first revisit the case w15,16,18x of allowing, at the electroweak scale, a common
mass for all the supersymmetric masses of the third generation squarks: m ˜ Q˜ 3 s m˜ U˜3 R s
m˜ D˜ 3 R ˜ 3˜
s m . Taking a common value for the&SUŽ 2. and UŽ1 .
<
masses
<
shows that unless
the effective tri-linear term is negligible, A t; 0, this leads to sin2 u t˜ s 1, see Eq. Ž3.4..
We note that contrary to what is claimed in w18x this situation, although common for the
first two generation of squarks, occurs only in exceptional situations in a model such as
mSUGRA. Moreover in mSUGRA A t is controlled almost entirely by m1r2 , the
common gaugino mass, and thus would not be excessively large w95–100x. Leaving this
aside, this assumption helps keep the number of parameters to a minimum while
concentrating on the impact of mixing. To that effect we have set, apart from the
common third family scalar quark m ˜ 3̃ which was allowed to vary in the range 100–1000
GeV and MA s 1 TeV, all other sfermion masses to 500 GeV. Moreover we have
assumed the unification condition for the gaugino masses and set the Higgsino mass
M2 s ym s 250 GeV. We then scanned over A t , y1000( A trŽGeV. ( 1000 and m ˜ 3˜ .
Note that since we are scanning over both positive and negative values of A t , some
important mixing effects sensitive to the sign of A t = m are covered even though we
have fixed the sign of m. Among the 2 = 10 4 generated point for each tan b half passed
all the constraints.
First, as shown in Fig. 7, we do confirm that the reduction in the two-photon signal in
the direct channel is most dramatic for the lowest values of the stop mass, although a
low stop mass does not always mean that a reduction has to occur. As a matter of fact
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 21
Fig. 7. R g ggg versus m t˜1 for tan b s 2.5, m sy250 GeV and MA s1 TeV.
there are more points that generates a low m t̃ 1 and give R g ggg 0 0.6, say, than those that
give R g ggg ( 0.6. Note that most points clustering around values corresponding to little
mixing or large stop masses. Therefore the very rare situations corresponding to very
sharp drops could be interpreted as at best unnatural. It is also worth pointing out that
values such that R g ggg 0 1.0 are not obtained for tan b s 2.5. We have verified that
while, in principle, this was possible for tan b s 2.5 this possibility was ruled out by the
requirement of having m h 0 90 GeV.
As stressed numerous times, for the intermediate mass Higgs in the direct channel
decaying into two photons, the significance depends crucially on the Higgs mass. It is
therefore important to localise for which values of the Higgs mass, the reductions are
most drastic. For tan b s 2.5 we see, Fig. 8, that this reduction gets worse, R g ggg , 0.2
for Higgs masses clustered around ; 103 GeV.
It is important to note, on the other hand, that for Higgs masses around 90 GeV
where the Ž S M . Higgs signal is most difficult to extract, the effect of the stop is rather
negligible Žhere there is no mixing hence the low mass of the Higgs which does not get
further radiative corrections.. Therefore this is a welcome point. As compared to the
case of CLASS-H with MA s 180 GeV and tan b s 2.5, for which Mh s 90 GeV, R g ggg
reaches 0.3 whereas for the same Higgs mass Žand tan b . our points cluster around one.
An even more important remark concerns the behaviour of the branching ratio into two
photons. We find, see Fig. 8, that the branching ratio into photons in this SUSY scenario
increases at the same time as the direct production decreases, in sharp contrast to what
happens in CLASS-H when MA decreases. This confirms our expectations. The R g ggg
versus Rgg can be considered as a signature of this scenario. In the corresponding
scatter plot of Fig. 8, the points fall almost along a line and shows that when
R g ggg ( 0.8, Rgg 0 1. Considering that in this large MA scenario and even in the
presence of large mixing the tth and Wh are sensibly the same as in the standard model,
the associated Higgs production with the Higgs decaying into two-photon should pose
no problem with the high luminosity LHC. We do not show the ratios for ttgg and Wgg
as these are given essentially by the ratio Rgg , see Fig. 8. To conclude, for this value of
tan b s 2.5, when 90 ( m h ( 100 GeV observability of the lightest SUSY Higgs Žh. is
quite similar to that of the S M . Above these values, if the direct production is not
possible, the branching into photons is larger than the S M and thus associated produc-
tion provides more chance of detecting the Higgs. For instance, taking the S M Higgs
22 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
Fig. 8. As in Fig. 7 but for Ža. R g ggg versus Mh , Žb. R g ggg versus Rgg and Žc. Rgg versus Mh .
CMS analysis w22x with a luminosity of 100 fby1 as a guide, shows that it is only in the
range 100–105 GeV where values below 0.6 are possible for R g ggg that the Higgs may
not be observed in the direct channel. The same analysis shows, however, that with the
values that we obtain in the associated channels that there is no problem of cornering the
Higgs. Note that for the most critical drop in the direct channel we have obtained a
enhancement factor of up to 1.35 in the associated production. For such values even the
ATLAS simulation w21,38x with a luminosity of 100 fby1 indicates observation in the
associated channels.
Finally, another note of optimism in the case where the drop in the inclusive
production is severe is that production of h in association with stops could help also. As
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 23
shown in Fig. 9, whenever R g ggg ( 0.6, s Ž t˜1 t˜1 h. is in excess of 100 fb and can reach as
much as ; 740 fb. As a comparison, for these extreme cases for which m h ; 100–105
GeV, one has s Ž tth. , 500 fb. Considering that, see Fig. 7, these helpful t˜1 t˜1) h cross
sections are for values of m t̃ 1 ( 250 GeV for which R g ggg ( 0.6, t˜1 with our choice of
parameters will decay exclusively into c x 10 . It remains to be seen whether this
constitutes a viable signal and whether we could use the Higgs decays into bb, which by
the way is not much affected at these low values tan b by these mixing effects. The
signal would be bb q jets q pu T . Note that the continuum t˜1 t˜1 Z is quite small. For
m t̃ 1 s 120 GeV and maximal stop mixing angle, after folding with Br Ž Z bb . the
continuum leads to a dismal raw cross section of about 1 fb.
™
3.5.2. Lifting the degeneracy in the third family scalar masses
We have already argued that the scenario with exactly equal squark masses for the
third generation is very special and even unnatural. Taking a more general framework,
™
we move away from the case of maximal mixing. As we have discussed this can open
up new possibilities, notably t˜2 t˜1 h decays. For illustration, we have taken m
˜ t˜3 R s 200
GeV, m ˜ b˜ 3 R s 500 GeV and allowed 50 ( m˜ Q˜ 3 ( 500 GeV. In order to compensate for
the deviation from maximal mixing, the trilinear coupling was allowed to vary in the
range y2000( A t ( 2000 GeV. However, very few points with < A t < 0 1200 GeV pass
our constraints, essentially from Dr . As expected the general features found in the case
of maximal mixing are still present here, even though with our parameters the drops are
not as dramatic as in the maximal mixing case. Another observation is that m h ) 105
GeV is not generated. This is because contrary to the previous case the stop masses do
not extend to 1 TeV and hence the radiative corrections to the Higgs mass are not
optimal. Nonetheless as seen in Fig. 10 a ratio R g ggg as low as 0.4 is possible and
occurs for low t˜1 masses. Again this drop occurs for a small range of Higgs masses
sensibly the same as in the maximal mixing case, m h ; 103–104 GeV, Fig. 10.
However, when this occurs one is saved by the fact that the branching ratio into photons
is larger than in the S M , Fig. 11. Moreover we still find that when R g ggg gets too
24 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
Fig. 10. Ža. R g ggg versus m t˜1 for tan b s 2.5, m sy250 GeV and MA s1 TeV, when we allow different
scalar masses for the third generation as given, see text. Žb. As in Ža. but for R g ggg versus Mh .
™ ™ ™
small pp t˜1 t˜1) h is of the order 100 fb reaching a maximum of 200 fb when R g ggg is
lowest, Fig. 12. The main novelty here is pp t˜2 t˜2) t˜2 t˜1) h, with Br Ž t˜2 t˜1 h. s ™
O Ž10%.. Because this stems from a two-body cross section, it can lead to quite large
s Ž t˜2 t˜1) h. reaching as much 600 fb, and therefore in many instances larger than the
continuum t˜1 t˜1) h, Fig. 12. What is also worth noting is that these large cross sections do
not necessarily occur when one has large drops in the inclusive two-photon channel.
Moreover the signature in this channel should be cleaner, taking advantage of the
cascade decays of the other t˜2 starting with t˜1 Z,b˜ 1W,bx˜q 1,2 , . . . Of course there are
points where neither t˜2 t˜1 h10 nor t˜1 t˜1 h exceeds 10 fb, Fig. 12. However, in this case the
reduction in R g ggg is quite modest.
10
With our set of parameters one would expect that some points with maximal mixing are generated.
However, we have checked that these do not pass all the constraints. This explains why we never get a
vanishingly small t˜2 t˜1 h cross section.
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 25
Fig. 11. As in Fig. 10 but for R g ggg versus Rgg and Rgg versus Mh .
™ ™ t˜ t˜ h. and Žc.
™ ™ ™ ™
Fig. 12. As in Fig. 10 but for Ža. R g ggg versus s Ž pp t˜1 t˜1 h., Žb. R g ggg versus s Ž pp 2 1
s Ž pp t˜1 t˜1 h. versus s Ž pp t˜2 t˜1 h.. s Ž pp t˜2 t˜1 h. ' s Ž pp t˜2 t˜1) hq t˜2) t˜1 h..
Rgg is about 0.75, which may also preclude detection in the associated Higgs produc-
tion, Fig. 13. With these values occurring at such low values of m h , even CMS w22x with
30 fby1 will miss the Higgs, but again there should be no problem in the associated
production after collecting ; 100 fby1 . Still, whenever mixing becomes important and
reduces R g ggg significantly, associated production should be no problem. For instance
™
when R g ggg is below 0.4, Rgg 0 0.95. In these configurations Br Ž h gg . benefits ™
™
from the increase in G Ž h gg . which is not completely offset by the increase in
G Ž h bb .. In these configurations with small t˜1 , t˜1 t˜1 h could help with s Ž t˜1 t˜1 h. s
100–780 fb, Fig. 14.
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 27
Fig. 13. Ža. R g ggg versus Mh for tan b s 2.5, m sy250 GeV and MA s 350 GeV, with equal squark masses.
Žb. As in Ža. but for Rgg versus Mh . Žc. As in Ža. but for R g ggg versus Rgg .
We may argue that had we taken a much lower value of MA we would have
introduced a larger reduction in Rgg which may affect dangerously the associated
production. We would then be in a situation where the inclusive cross section is down
because of large mixing in the stops and the associated production small mainly because
the branching into photons is down as a result of MA being low. Note however that in
these situations we would be far from the decoupling regime, with all Higgses being
relatively light and a very light stop having large couplings to the Higgses. One
™™
consequence of this light spectrum is that, even in the case of maximal stop mixing
where t˜2 t˜1 h is inhibited, the t˜1 t˜2 A coupling is large, Eq. Ž3.11. and can be such that
it triggers t˜2 t˜1 A. This is because large mixing and large splitting between the stop,
28 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
allows enough phase space for a relatively light pseudo-scalar. To illustrate this fact, we
have lowered MA to 250 GeV. The gross features found for MA s 350 GeV are still
Fig. 15. Ža. Rgg versus Mh for tan b s 2.5, m sy250 and MA s 250 TeV, with equal squark masses. Žb. As
in Ža. but for R g ggg versus Rgg
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 29
™
present as concerns the inclusive production of h Fig. 15, with an overall reduction
factor due to Br Ž h gg . which is slightly larger. At the same time the location of the
drops are shifted to slightly lower values of m h , which is a direct consequence of a low
MA . However, as shown in Fig. 16, s Ž t˜2 t˜1 A. can be quite large and often exceeds t˜1 t˜1 h.
Note that s Ž t˜2 t˜1)A. may be large even for points where the inclusive two-photon cross
section is lowest, whereas t˜2 t˜1) h is largest for regions where the inclusive cross section
is most affected. Therefore we see that combining different channels in this scenario
offers much better prospects than in the no-mixing case with the same low value of MA .
To start with, when the direct production is very much reduced, associated production
has a better significance in the case of very large stop mixing compared to the no mixing
™ ™
case for the same MA . Another interesting point is that although the main decay of A
™ ™
will be into bb, we also find that A Zh can be substantial. For instance, the chain
s Ž pp t˜2 t˜2 t˜2 t˜1 A t˜2 t˜1 Zh. can reach as much as 350 fb Žfor this point m t˜1 s 129
™
GeV, m t˜2 s 396 GeV.. For larger values of the stop masses Ž m t˜1 s 235 GeV, m t˜2 s 525
GeV., the same chain corresponds to 43 fb. The decay t˜2 t˜1 H is also possible, but the
corresponding cross section, s Ž t˜2 t˜1) H . is below 10 fb. This is because the branching
ratio into H is about a factor cos 2 2 u t̃ down compared to the branching ratio into A,
while H and A are almost degenerate in mass, Eqs. Ž3.11. – Ž3.12.. To end this section
Fig. 16. As in Fig. 16 but for Rgg versus s Ž t˜1 t˜1 h. and s Ž t˜2 t˜1 A..
30 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
™
let us mention that when the mass of the pseudo-scalar gets small, below 2 m t , one
should also investigate direct gg A, H production. A low mass t˜1 has no effect either
on the production or decay Žwe are in a scenario where m t̃ 2 ) MA . of A, the usual
™
channels should not be much affected. For H, one needs to critically review how the
production is affected and whether H t˜1 t˜1 can be exploited. The phenomenology is
certainly richer here and the HiggsŽes. should not be missed.
3.6. tan b s 5
We now move to a larger tan b . We go through basically the same steps as those in
Subsection 3.5. For the same scenarios we will scan over the same mass ranges. One
general new feature will have to do with the fact that for larger tan b we obviously have
larger Higgs masses. In most cases this will help. However, on the whole similar
conclusions will be reached.
3.6.1. The case of a common mass in the third generation squark sector with large MA
Again R g ggg is most affected when the t˜1 mass is smallest, Fig. 17. In the maximal
mixing case, one new feature compared to tan b s 2.5 is that the ratio R g ggg can be
larger than one, for small m t̃ 1, reaching almost ; 1.3. This is even more welcome that it
Fig. 17. Ža. R g ggg versus m t˜1 for tan b s 5, m sy250 GeV and MA s1 TeV. Žb. As in Ža. but for R g ggg
versus Mh .
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 31
occurs for Higgs masses in the range 92–98 GeV, Fig. 17. As a matter of fact, this is
consistent with the argument we gave earlier: in this case t˜2 is not too heavy so that the
top and t˜1 loop interfere and since the scale in the stop sector is not too high, the
radiative corrections to the lightest Higgs mass are far from maximal. Considering that,
especially in the lower end of this range, the significance in the direct channel are
usually Ž S M or no-mixing. smallest, such scenarios can make it easier to discover h
even in the direct channel. Of course, light t˜1 Žwith much heavier t˜2 . can also lead to a
much reduced R g ggg . When this happens it occurs for higher Higgs masses, clustered
around Mh s 115 GeV. Though for this range of m h significances in the direct
production are much better, for certain values of the parameters the drop is too severe:
R g ggg - 0.4. But again this occurs simultaneously with an enhanced Rgg : Rgg ) 1.2,
™
Fig. 18. Again the smaller R g ggg the larger Rgg . As with the lower tan b when the
direct production drops, s Ž pp t˜1 t˜1 h. increases. When R g ggg - 0.6 this cross section
is in excess of 100 fb up to , 650 fb, for the smallest value of R g ggg , Fig. 19. Note
also that when R g ggg ) 1 this additional cross section is below 100 fb.
3.6.2. The case of a common mass in the third generation squark sector with MA s 350
GeV
The discussion is essentially the same as the one we presented for tan b s 2.5 with
MA s 350 GeV. The overall reduction factor from the lowering of MA which affects
Fig. 18. Ža. As in Fig. 17 but for R g ggg versus Rgg . Žb. As in Ža. but forRgg versus Mh .
32 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
Fig. 19. As in Fig. 17 but for Rgg versus s Ž t˜1 t˜1 h..
Rgg is sightly smaller Žabout 0.76. but then the reductions in R g ggg are for m h ; 115
GeV, Fig. 20. Note also that s Ž t˜1 t˜1 h. production, Fig. 21, is only slightly smaller than
with tan b s 2.5 Žthis is due to a higher Higgs mass. and therefore is a useful addition
when the direct channel drops too much. For R g ggg - 0.2 one gets as much as 400 fb.
For larger tan b and small MA , de-excitation of t˜2 into t˜1 is not as efficient as for the
Fig. 20. Ža. As in Fig. 18 but for R g ggg versus Rgg . Žb. As in Ža. but forRgg versus Mh .
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 33
Fig. 21. As in Fig. 17 but for Rgg versus s Ž t˜1 t˜1 h..
lower tan b with the rather moderate values of m that we have considered in this study.
This is evident from Eq. Ž3.11., but as we see t˜1 t˜1 h still plays its role.
Fig. 22. Ža. R g ggg versus Rgg for tan b s 5, m sy250 GeV and MA s1 TeV, when we allow different scalar
masses for the third generation as given, see text. Žb. As in Ža. but for R g ggg versus t˜2 t˜1 h.
34 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
Fig. 23. As in Fig. 22 but for R g ggg versus t˜1 t˜1 h and t˜2 t˜1 h versus t˜2 t˜1 h.
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 35
Fig. 24. Ža. R g ggg versus m h for tan b s10, m sy250 GeV and MA s1 TeV, Žb. As in Ža. but for R h ™ gg
versus Mh , Žc. As in Ža. but for R g ggg versus Rgg .
tan b s 5 are recovered again, Fig. 24. Note that we do not get more noticeable
reduction either in R g ggg or Rgg due to the larger tan b , and also that R g ggg ) 1 are
possible. Similar observations to those made for tan b s 5 can be made here even when
we consider different splitting and lowering of masses, especially as concerns the
importance of t˜2 t˜1) h. Some of these results are summarised in Fig. 24.
4. Conclusions
We have in this paper reinvestigated the fate of the photon signal of the lightest
SUSY Higgs at the LHC when large tri-linear mixing terms in the stop sector are
36 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
present. Previous investigations w15,16,18x had drawn a very pessimistic picture of these
scenarios. Our analysis shows that if we exploit all the consequences of these scenarios
and not pick out only the Higgs signal in the inclusive channel these models have an
excellent discovery potential. First, the large reductions in the inclusive two-photon
signal not only require large mixing but also that one of the stops be relatively light.
Although this has not been stressed in the text, a first signal of these scenarios will be
t˜1 t˜1) production with a cross section of order ; 100pb. Even though it may be argued
that regions with the largest drops in the inclusive two-photon channel correspond to a
very light stop and are likely to lead to a signature, jets q pu T , which is difficult. In any
case it should be stressed that a hallmark of these scenarios is that whenever the signal
™
and makes it up for the drop in the former channel. Moreover when s Ž pp h gg . ™™
in the inclusive channel drops that in the associated WhrZh and tth channels increases
gets too small the continuum pp t˜1 t˜1 h w40,41x reaches values of order few 100 fb.
™
More importantly we find that since these situations imply a large mass splitting
between the two stops, t˜2 t˜1 h can be substantial leading to another source of Higgs
™
production with a yield larger than in the continuum and with a better signature than the
t˜1 t˜1) h continuum. We have shown that t˜2 t˜1) h occurs whenever the stop mixing angle
does not take its maximal value, <sin2 u t̃ < s 1, which is often unnaturally assumed on the
basis of equal soft SUSY breaking masses for the SUŽ2. and UŽ1. sfermions of the third
generation, at the electroweak scale. We have also shown that although when MA gets
small the two-photon signals Žboth direct and associated. get further reduced Žthis
for low values of tan b , one can trigger A production through the cascade t˜2 t˜1 A,
beside the usual channels for A productions. Moreover one should not forget that
™
happens even in the absence of mixing., with large tri-linear mixing terms and especially
especially with not too small tan b , tan b ) 3, scenarios with light stops Žbut small
mixing. do give an increase in the direct channel, but then a decrease in the associated
two-photon channels. The overall conclusion we can draw almost resembles that of a
no-lose scenario: whenever an effect reduces a particular signal it opens up new
channels or enhances other channels. We have not discussed the use of h bb in the™
associated tth channel which in these scenarios should allow detection. This requires
rather good b-tagging facilities, as shown in w43x. This should certainly add to the
discovery potential. The new associated stop Higgs signatures deserve a full simulation
to critically quantify how beneficial these additions can be. In general there is a lack of
detailed study of stop phenomenology at the LHC despite some important theoretical
issues related to the third generation sfermions. As has been pointed out by several
authors w101–106x, the idea of an inverted hierarchy of the SUSY spectrum whereby the
third generation sfermions are, at the electroweak scale, much lighter than the first two
is compelling and quite plausible. This helps solve the flavour problem in SUSY since
very large masses for the superpartners of the first two generations can suppress FCNC,
contributions to electric dipole moments and lepton flavour violations. This would still
not go against naturalness since these particles couple weakly to the Higgs, at the heart
of the fine-tuning problem. Naturalness does on the other hand require the stops and
sbottoms Žand the electroweak gauginos higgsinos. to be rather light, like in the
scenarios we have studied and could also with a light stop make electroweak baryogene-
sis w107–112x work.
´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39 37
Acknowledgements
We would like to thank Guillaume Eynard for providing with his programme for
calculating the significance of the two-photon associated Higgs signal using the ATLAS
simulation and for very useful discussions. We also thank Michael Spira for promptly
providing us with the code for the NLO stop pair production at the hadron colliders and
Andrei Semenov for advice on the use of CompHep. We are also grateful to Elzbieta
Richter-Waş for helpful discussions and communication,as well as providing us with an
advance copy of the ATLAS TDR on Higgs Physics. K.S. acknowledges the hospitality
of LAPTH where part of this work was done. This work is done under partial financial
support of the Indo-French Collaboration IFCPAR-1701-1 Collider Physics.
References
w1x F. Gianotti, Opal talk for the LEP Committee, March 1999, http:rralephwww.cern.chrALPUBrsemi-
narrlepc_mar99.pdf.
w2x N.J. Kjaer, Delphi talk for the LEP Committee, March 1999, http:rrdelphiwww.cern.chrdelfigsrfig-
uresrniels990324.ps.gz.
w3x G. Bobbink, L3 talk for the LEP Committee, March 1999, http:rrl3www.cern.chrconferencesrpsr
Bobbink_LEPC9903.ps.
w4x D.A. Glenzinski, Opal talk for the LEP Committee, March 1999, http:rrwww1.cern.chrOpalrplotsr
glenzinskirmain.ps.gz.
w5x J.F. Gunion, H.E. Haber, G.L. Kane, S. Dawson, The Higgs Hunter’s Guide ŽAddison-Wesley, Reading,
1990..
w6x R.K. Ellis, I. Hinchliffe, M. Soldate, J.J. van der Bij, Nucl. Phys. B 297 Ž1988. 221.
w7x J. Gunion, G. Kane, J. Wudka, Nucl. Phys. B 299 Ž1988. 231.
w8x J. Gunion, H. Haber, Nucl. Phys. B 278 Ž1986. 449.
w9x H. Baer, D. Dicus, M. Drees, X. Tata, Phys. Rev. D 36 Ž1987. 1363.
w10x K. Griest, H. Haber, Phys. Rev. D 37 Ž1988. 37.
w11x H. Baer, M. Bisset, D. Dicus, C. Kao, X. Tata, Phys. Rev. D 47 Ž1992. 1062.
w12x A. Djouadi, J. Kalinowski, P.M. Zerwas, Z. Phys. C 57 Ž1993. 569.
w13x H. Baer, M. Bisset, C. Kao, X. Tata, Phys. Rev. D 46 Ž1992. 1067.
w14x H. Baer, M. Bisset, D. Dicus, C. Kao, X. Tata, Phys. Rev. D 47 Ž1992. 1062.
w15x B. Kileng, Z. Phys. C 63 Ž1993. 87.
w16x B. Kileng, P. Osland, P.N. Pandita, Z. Phys. C 71 Ž1996. 87, hep-phr9506455.
w17x G.L. Kane, G.D. Kribs, S.P. Martin, J.D. Wells, Phys. Rev. D 50 Ž1996. 213.
w18x A. Djouadi, Phys. Lett. B 435 Ž1998. 101, hep-phr9806315.
w19x Z. Kunszt, F. Zwirner, Nucl. Phys. B 385 Ž1992. 3.
w20x E. Richter-Wa̧s et al., ATLAS Internal note, PHYS-No-074 Ž1996..
w21x For an update on the Higgs analysis in ATLAS see, the ATLAS Technical Design Report, the ATLS
Collaboration, 1999, to appear. http:rratlasinfo.cern.chrAtlasrGROUPSrPHYSICSrHIGGSrhiggs-
wwwr Analyses-notes.htmlrhiggs-sugra.ps.gz.
w22x For an update on Higss searches in CMS see, R. Kinnunen, D. Denegri, CMS Note 1997r057, April
1997.
w23x H.P. Nilles, Phys. Rep. 110 Ž1984. 1.
w24x R. Arnowitt, A. Chamseddine, P. Nath, Applied N s1 Supergravity ŽWorld Scientific, Singapore,
1984..
w25x H. Baer, M. Bisset, C. Kao, X. Tata, Phys. Rev. D 50 Ž1993. 316.
w26x G. Polesello, L. Poggioli, E. Richter-Wa̧s, J. Sderqvist, ATLAS Internal Note, PHYS-No-111, Oct.
1997.
w27x H. Haber, in: Perspectives in Higgs Physics II, ed. G.L. Kane ŽWorld Scientific, Singapore, 1998..
w28x S.L. Glashow, D.V. Nanopoulos, A. Yildiz, Phys. Rev. D 18 Ž1978. 1724.
38 ´
G. Belanger et al.r Nuclear Physics B 568 (2000) 3–39
a
Dipartimento di Fisica, UniÕersita` di Milano-Bicocca and INFN Sezione di Milano, Milano, Italy
b
Joint Institute for Nuclear Research, Dubna, 141980, Russia
Received 30 June 1999; received in revised form 14 September 1999; accepted 20 September 1999
Abstract
The process of large-angle high energy electron–positron scattering with emission of one hard
photon almost collinear to one of the charged particles’ momenta is considered. The differential
cross section with radiative corrections due to emission of virtual and soft real photons calculated
to a power accuracy is presented. Emission of two hard photons and total expressions for radiative
correction are given in leading logarithmical approximation. The latter are illustrated by numeric
estimates. A relation of the results with structure function formalism is discussed. q 2000 Elsevier
Science B.V. All rights reserved.
1. Introduction
1
On leave of absence from the Institute of Physics and Technology, Almaty.
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 2 5 - 2
V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59 41
ey Ž p 1 . q eq Ž p 2 . ™e y
Ž pX1 . q eq Ž pX2 . q g Ž k 1 . Ž 1.
at the tree level. We define the collinear kinematical domains as those in which the hard
$
photon is emitted close Žwithin a narrow cone with opening angle u 0 < 1. to the
X
$
incident Ž u 1Ž2. s p 1Ž2. k 1 - u 0 . or the outgoing electron Žpositron. Ž u 1Ž2. s pX1Ž2. k 1 - u 0 .
direction of motion. Because of the symmetry between electron and positron we may
restrict ourselves to a consideration of only two collinear regions, which correspond to
42 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
the emission of the photon along the electron momenta. The two remaining contributions
to the differential cross section of the process Ž1. can be obtained by the substitution Q ,
d scoll s 1qQ
Q
ž ll /
p1
pX1
p2
pX2
d s g Ž k 1 I p 1 . qd s g Ž k 1 I pX1 . 4 . Ž 2.
To begin with, let us recall the known expression w15,16x in Born approximation for the
general kinematics, i.e. assuming all the squares of the momenta transferred among
fermions to be large compared to the electron mass squared:
a3 d 3 pX1d 3 pX2 d 3 k 1
d s 0g s T dG , d Gs d 4 Ž p 1 q p 2 y pX1 y pX2 y k 1 . , Ž 3.
8p 2 s ´ 1X ´ X2 v 1
2
S 16 m2 s t1
Ts
tt 1 ss1
ss1 Ž s 2
q s12 . q tt1 Ž t 2
q t 12 . q uu1 Ž u 2
q u12 . y
x 2X 2 ž t1
q
s
q1
/
2 2 2
16 m 2 s t 16 m2 s1 t1 16 m2 s1 t
y
x 1X 2 ž t
q q1 y
s x 22 / ž t1
q
s1
q1
/ y
x 12 ž t
q
s1
q1 ,
/
s s1 t1 t u1 u
Ss4 q y y q q ,
x1 x2 x 1X x 2X x 1 x 1X x 2 x 2X x 2 x 1X x 1 x 2X
2 2 2
s s Ž p1 q p 2 . , s1 s Ž pX1 q pX2 . , t s Ž p 2 y pX2 . ,
2 2 2
t 1 s Ž p 1 y pX1 . , u s Ž p 1 y pX2 . , u1 s Ž p 2 y pX1 . ,
X
x i s 2 pi k 1 , x 1,2 s 2 pX1,2 k 1 .
In the collinear kinematical domain in which k 1 I p 1 the above formula takes the form
a 3 d3 k1 1 d 3 pX1d 3 pX2
d s 0g Ž k 1 I p 1 . s FF d 4 Ž Ž 1 y x . p 1 q p 2 y pX1 y pX2 .
p 2s v1 x1 ´ 1X ´ X2
Ž 4.
2
1q Ž1yx . 2 m2
s dWp 1d s 0 Ž Ž 1 y x . p 1 , p 2 . , Fs y ,
x Ž1yx . x1
2
s1 t
Fs
ž t
q
s1
q1 ,
/
where
´ 1X 1yx
s1 s s Ž 1 y x . , y1 s s2 ,
´ a
´ X2 2 y 2 x q x 2 q cx Ž 2 y x .
y2 s s , a s 2 y x q cx , v1 s ´ x ,
´ a
m2
s s 4´ 2 , x1 s
s
2
x Ž 1 y c1 b . , bs ( 1y
´2
,
V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59 43
Ž1 yx . Ž1 yc.
2
$ $
t s t 1 Ž 1 y x . s ys , c s cos Ž p 1 pX1 . , c1 s cos Ž p 1 k 1 . ,
a
3
a 1yx d k1
dWp 1 s 2
F . Ž 5.
2p x1 v1
Here yi are the energy fractions of the scattered leptons and d s 0 Ž p 1Ž1 y x ., p 2 . is the
cross section of the elastic Bhabha scattering process.
Throughout the paper we use the following relations among invariants:
s1 q t q u1 s 4 m2 y x 1 f 0, s q t 1 q u s 4 m2 q x 1 f 0.
In the case k 1 I pX1 we have
a 1 d3 k1
d s 0g Ž k 1 I pX1 . s X F˜ Ž 1 y x . d s 0 Ž p1 , p 2 . , Ž 6.
2p 2 x 1 v1
2
1q Ž1yx . 2 m2
F̃ s y .
x x 1X
These expressions could also be inferred by using the method of quasi-real electrons w17x
and starting from the non-radiative Bhabha cross section.
After integration over a hard collinear Ž k 1 I p 1 . photon angular phase space, the cross
section of radiative Bhabha scattering in the Born approximation is found to be
2
d s 0g 4a 3 1 q Ž 1 y x . 1yx
s L0 y 2
d xd c k 1 Ip 1 s x x
2
3 y 3 x q x 2 q 2 cx Ž 2 y x . q c 2 Ž 1 y x Ž 1 y x . .
=
ž Ž 1 y x . Ž 1 y c . a2 /
= Ž 1 q O Ž u 02 . . , Ž 7.
2
where L0 s ln Ž ´u 0rm . . And in the case k 1 I pX1 it reads
2 2
d s 0g a 3 1q Ž1yx . 1yx 3 q c2
LX0 y 2
d xd c X
k 1 Ip 1
s
4s x x ž 1yc /Ž 1 q O Ž u 02 . . , Ž 8.
´ 1X u 0 2
LX0 s ln ž / , ´ 1X s ´ Ž 1 y x . .
m
The simplest way to reproduce these results is to use the physical gauge for the real
photon which in the beam cms sets the photon polarization vector to be a space-like
3-vector el having density matrix
0, if m or n s 0, k1
Ý emlenl ) s
l
½ dmn y nm nn , m s n s 1,2,3, ns
v1
,
44 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
t 1 u1
Ý < el < 2 s y2, Ý < p1 el < 2 s ´ 2 Ž 1 y c12 . , Ý < pX1 el < 2 s s
,
l l l
Ý Ž p1 el . Ž pX1 el . ; u ™ 0u .
)
Ž 9.
l
These properties enable us to omit mass terms in the calculations of traces and, besides,
to restrict ourselves to the consideration of singular terms Žsee Eq. Ž10.. only, both at
the Born and one-loop level. As shown in Ref. w18x, this gauge is proved useful for a
description of jet production in quantum chromodynamics; it is also very well suited to
our case because it allows to simplify a lot the calculation with respect, for instance, to
the Feynman gauge. What is more, it possesses another very attractive feature related
with the structure of the correction to be mentioned below Žsee Appendix A..
™
With these tools at our disposal let us turn now to the main point. The contributions,
which survive the limit u 0 0, arise from the terms containing
2 2
Ž p1 e . e2 Ž pX1 e .
, , . Ž 10 .
x 12 x1 x1
Other omitted terms Žin particular those which do not contain a factor xy1
1
. can be safely
neglected since they give a contribution of the order of u 02 which determines the
accuracy of our calculations
a 2
m
1qO žpu / 0 Ls ,
´
< u 0 < 1. Ž 11 .
In the realistic case this corresponds to an accuracy of the order of per mille.
3. Crossing relations
In this and the next section we shall consider the case k 1 I p 1. In the case of photon
emission along pX1 one can get the desired expression by using the left-to-right
permutation
l yp < M <
l yp /
X
p1
< M < 2k 1 I pX1 s Q 1 2
ž p2 X
2
k 1 I p1 . Ž 12 .
From now on we deal with scattering type amplitudes ŽFD. with the emission of a
hard photon by the initial electron. This is possible due to the properties of the physical
gauge. The contribution of annihilation type amplitudes may be derived by applying the
momenta replacement operation as follows:
2
D < M < annihilation s Q Ž pX1 l yp . 4D < M <
2
2
scattering '
2
Q1 4 D < M < scattering . Ž 13 .
V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59 45
In considering FD with two photons in the scattering channel Žbox FD. one may
l
examine only those with uncrossed photons because a contribution of the others may be
obtained by the permutation p 2 ypX2 . Thus the general answer becomes
1
½
< M < 2k 1 I p 1 s Re Ž 1 q Q1 . w G q L x q
s1 t
Ž 1 q Q1 . Ž 1 q Q 2 . s1 t Ž B q P . 5 ,
Ž 14 .
with the permutation operators acting as
Q1 F Ž s1 ,t 1 , s,t . s F Ž t , s,t 1 , s1 . , Q 2 F Ž s,u, s1 ,u1 . s F Ž u, s,u1 , s1 . .
One-loop QED RC Žwhich are described by seventy two Feynman diagrams. can be
classified out into the two gauge invariant subsets Žsee Fig. 1.:
Fig. 1. Some representatives of FD for radiative Bhabha scattering up to second order: Ž1. is the vertex
insertion; Ž2. is the vacuum polarization insertion; graphs denoted by Ž3., Ž4. are of the L-type, Ž5. is of
G1-type, Ž6. is of G 2-type, Ž7. is of B-type and Ž8. is of P-type.
46 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
2
For realistic applications one should also add to P the contributions due to m and t leptons and hadrons.
V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59 47
where the coefficients are associated with scalar, vector and tensor integrals over loop
momentum
d 4 k Ž 1,k m ,k m k n . d4 k 1
H ip 2
s Ž B, B m , B mn . , J0 s H ip 2
,
b 0 b1 b 2 b 3 b1 b 2 b 3
b 0 s k 2 y l2 , b 1 s k 2 q 2 pX1 k , b 2 s k 2 y 2 pX2 k , b 3 s k 2 y 2 qk q t ,
m
q s pX2 y p 2 , B m s Ž apX1 q bpX2 q cp 2 . ,
B mn s a g g mn q a1X 1X pX1m pX1n q a 22 p 2m p 2n q a 2X 2X pX2m pX2n
mn mn mn
q a1X 2 Ž pX1 p 2 . q a1X 2X Ž pX1 pX2 . q a 22X Ž p 2 pX2 . .
For P-type FD Žsee Fig. 1Ž8.. with uncrossed photon legs we have
s13 y u13 2Ž 2 y x .
D < M < 2P s 2 9p 2a 4 Y Ž E y E1 . q
t x 1Ž 1 y x . 1yx
™
Here we are using the definition Žwith tensor structures giving no contributions in the
limit u 0 0 dropped.
d 4 k Ž 1,k m ,k m k n .
H ip 2
s Ž E, E1 p 1m q Ek k 1m , E11 p 1m p 1n q Ek k k 1m k 1n
a0 a1 a2 a3 a4
qE1 k Ž p 1m k 1n q p 1n k 1m . . ,
a 0 s k 2 y l2 , a1 s k 2 y 2 p 1 k , a2 s k 2 y 2 k Ž p1 y k 1 . y x 1 ,
a3 s k 2 q 2 p 2 k , a4 s k 2 y 2 qk q t.
Note that in the evaluating of P-type FD we are allowed to put k 1 s xp1 , thus keeping
only p 1 momentum containing terms in the decomposition.
Collecting all the contributions Žfor the explicit expressions of all the coefficients see
Appendix A. given above we arrive at the general expression for the virtual corrections
with r s x w1 q Ž ´urm. 2 x < srm2 ,
2Re Ý Ž M0) M . k Ip 1 1
11 4 2
2 a p 2yx w p2
s
x1
FF
½ 1yx F
F q 2 Ll Ž 2 y L t y L t 1 y L s y L s1 q L u q L u1 . q
3
48 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
r
qLi 2 Ž x . y 101
18 q ln q L2u1 y L2t y L2s1 q Lr ln Ž 1 y x . q 113 L t
1yr
s1 1 t 3 y u13 s1 2 u1 Ž u12 q s12 . y ts12 u1
yq q ln2 q Pq3 ln q ln2
t F s12 t t 2
4 t s1 t
2 u1 Ž u12 q t 2 . y t 2 s1 u1 s1 u1 t u1 s1 t
q
4 ts12
ln2
s1
q
2t
ln
t
q
2 s1
ln
s1
y 34 p 2
ž t
q
s1 /5 ,
Ž 20 .
where we have used the following definitions:
x p2
qs Li 2 Ž 1 y r . y q Li 2 Ž x . q Lr ln Ž 1 y x . ,
ryx 6
p 1
=
a
Re
ž 1 y P s1 /
y 1 y 13 L s1 q 59 ,
w19,20x may be applied in the present paper: the factorization of the two in the
differential cross section is present and we are, hence, allowed to consider a soft photon
emission restricted as usual by
D´
< 1. Ž 21 .
´
Thus the soft correction can be written as
wsoft Ž k 1 I p 1 . s y
4p 2
Hv-D ´ 'k 2
q l2 ž y
p1 k
q
pX1 k
q
p2 k
y
pX2 k / ,
where M B denotes the matrix element of the hard photon emission at the Born level and
in the kinematics k 1 I p 1 it reads
2 11a 3p 3
Ý < M < B2 s x1
F F. Ž 23 .
Now let us check the cancellation of the terms containing Ll. Indeed it takes place in the
sum of contributions arising from emission of virtual and soft real photons. To show that
we bring the soft correction into the form
a D´
wsoft Ž k 1 I p 1 . s
p ½ž
2 ln
´
q Ll /Ž y2 q L s q L s1 q L t q L t 1 y L u y L u1 .
2p 2 y1
qln y 2 Ž L u y L t y L s1 . q ln Ž y 1 y 2 . y y 12 ln2
3 y2
1 q c1X 2X 1 q c1X 1 y c 2X
qLi 2 ž 2 / q Li 2 ž 2 / q Li 2 ž 2 /
1 y c1X 1 q c 2X
yLi 2 ž 2 / y Li 2 ž 2 /5 , Ž 24 .
where c i are the cosines of emission angles of ith particle with respect to the beam
direction Ž p 1 in cms., c1X 2X is the cosine of the angle between scattered fermions in cms
of the colliding particles and yi are their energy fractions and in the case k 1 I p 1 we
have
1 q c1X 2X 1yx 1 y cX2 y1Ž 1 q c .
cX1 s c, s1y , s . Ž 25 .
2 y1 y 2 2 2 y2 Ž 1 y x .
Then the cancellation of infrared singularities in the sum is evident from comparison
of Eqs. Ž20., Ž24.. The terms with lnŽ D ´r´ . should be cancelled when adding a
contribution of a second hard photon having energy above the registration threshold D ´ .
50 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
2
R s 2Re Ý Ž M0) M . q < M < soft
2 11a 4p 2 2yx w D´
s
x1
FF
½ 1yx F
F q 4ln ž / ´
y1 q L t 1 q 12 yln Ž 1 y x . ž
s s1
q2ln / q 113 L t q Ž Lr y L t . ln Ž 1 y x . y L t ln Ž y 1 y 2 . q ln2
yu yt
yu p2
qln y 1 ln Ž 1 y x . q ln Ž y 1 y 2 . 1 q ln ž / y q Li 2 Ž x . y 101
18 y q
s 3
r y1 yu 1 q c1X 2X
qln y 12 ln2 q ln Ž 1 y x . ln q Li 2 ž /
1yr y2 s 2
1 q c1X 1 y c 2X 1 y c1X 1 q c 2X
qLi 2 ž 2
3
/ q Li 2 ž 2 / y Li 2 ž 2 / ž /
y Li 2
2
1 t y u13 s1 2 u1 u12 q s12
Ž . y ts12 u
2 1
q Pq3 ln q ln
F s12 t yt 4 t 2 s1 t
2 u1 Ž u12 q t 2 . y t 2 s1 yu s1 u1 t yu
q ln2 q ln q ln
4 ts12 s 2t t 2 s1 s
s1 t
y 34 p 2
ž t
q
s1 /5 , Ž 26 .
1
d s Ž k 1 I p1 . s R dG .
2 p 5s
11
5. Kinematics k 1 I pX1
We put here a set of replacements one can use in order to obtain the modulus of
matrix element squared and summed over spin states for the case k 1 I pX1 , starting from
l l
the analogous expression for k 1 I p 1 ŽEq. Ž14.. and using the replacement of momenta
p 1 ypX1 , p 2 ypX2 . The last operation results in the following substitutions:
x y ™ x
1yx
, x 1 yx 1X , s s1 , ™
u u1 , t t, t1 t1 .l l ™ ™
Ž 27 .
Then under these permutations the expression for virtual corrections given in Eq. Ž20.
gets transformed yielding the following result for the collinear domain k 1 I pX1:
2 11a 4p 2 2 y x w̃
2Re Ý Ž M0) M . k Ip s 1
X
1
x 1X
F˜F˜
½ 1 y x F˜
F˜ q 2 Ll Ž 2 y L t y L t 1 y L s y L s1
p2 yx j
qL u q L u1 . q
3
q Li 2 ž 1yx / y 101
18 q ln ž /
jq1
V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59 51
s
qL2u y L2t y L2s y Lj ln Ž 1 y x . q 113 L t q ln2
yt
3 3
1 t yu s 2 u Ž u q s . y ts 2
2 2
q Pq3 ln q
F˜ s2 t yt 4t2s
u 2 uŽ u2 q t 2 . y t 2 s yu s u
=ln2 q 2
ln2 q ln y q˜
t 4 ts s 2t t
t yu s t
q
2s
ln
s
y 34 p 2 ž t
q
s /5 , Ž 28 .
with
s 3 y u3 p 1 t 3 y u3 p 1
P̃ s
st 2
a ž 1 y Pt /
y 1 y 13 L t q 59 q 2
s t a
Re
ž 1 y Ps
y1
/
s t 2
1yx 1 x 1X
1
y Ls q
3
5
9 , F˜ s ž q q1 , / w˜ s y q , js
t s x j m2
and F˜ ,q˜ derived upon applying a set of replacements from Eq. Ž27. on the quantities
F ,q .
The contribution from the soft photon emission is described by
a D´ 1yc
wsoft Ž k 1 I pX1 . s ž
4 ln q Ll /ž y1 q L s q ln q 12 ln Ž 1 y x . /
p ´ 1qc
1yc 1yc
qL2s q 2 L s ln y 12 ln2 Ž 1 y x . q ln Ž 1 y x . q ln2
1qc 2
2
1qc 2p 1qc 1yc
yln2 y q 2Li 2 ž / y 2Li 2 ž / . Ž 29 .
2 3 2 2
The total correction for the case k 1 I pX1 has the following form:
R˜ s 2Re 2
Ý Ž M0) M . q < M < soft
2 11a 4p 2 2 y x w˜ D´
s
x 1X
F˜F˜
½ 1 y x F˜
F˜ q 4ln ž / ´
1yc p2 yx
= y1 q L s q 12 ln Ž 1 y x . q ln
ž / q q Li 2 ž /
1qc 3 1yx
j s
y 101
18 q ln ž /
jq1
y 2ln2 Ž 1 y x . q 113 L t y Lj ln Ž 1 y x . q ln2
yt
1qc 1yc
y 23 p 2 q ln Ž 1 y x . y q˜ q 2Li 2 ž / y 2Li 2 ž /
2 2
52 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
1 t 3 y u3 s 1 u 1
q P˜ q 3 ln q ln2 ž / Ž 2 u Ž u 2 q s 2 . y ts 2 . q 4 ts 2
F˜
2 2
s t yt 4t s t
yu s u t yu s t
=ln2 ž /Ž
s
2 uŽ u2 q t 2 . y t 2 s . q
2t
ln
t
q
2s
ln
s
y 34 p 2 ž t
q
s /5 ,
Ž 30 .
1
d s Ž k 1 I pX1 . s R˜ d G .
2 p 5s 11
Performing the integration over a hard photon angular phase space Žinside a narrow
cone. we put the RC to the cross section coming from virtual and soft real additional
X
Fig. 2. The ratio J r L t JL Žsee Eq. Ž31.. versus x s v 1 r ´ for the case: Ža. k 1 I p1 ; Žb. k 1 I p1 .
V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59 53
Turning to the structure of the result obtained, it should be noted that all the terms
quadratic in large logarithms L t 1 ; L s1 ; L u 4 Lr are mutually cancelled out as it
should be.
From the formula Ž26. it immediately follows that Župon doing an integration over a
hard photon angular Žwithin a narrow cone. phase space. the w-term that is not
proportional to F , which is in fact the kernel of the non-singlet electron structure
function, is not dangerous in the sense of a feasible violation of the expected Drell–Yan
form of the cross section, because it does contribute only at next-to-leading order.
Performing the above mentioned integration and confining ourselves to LLA we get
for the sum of virtual and soft photons
d s g Ž SqV . d s 0g a D´
s L 4ln q 113 y 12 ln Ž 1yx . yln Ž y 1 y 2 . . Ž 32 .
d xd c d xd c p ´
The LLA contribution coming from the emission of a second hard photon with total
energy exceeding D ´ consists of a part corresponding to the case in which both hard
photons Žwith total energy ´ x . are emitted by the initial electron w10x
d s 2g d s 0g a x PQŽ 2 . Ž 1 y x . D´
s L 2
q 12 ln Ž 1 y x . y ln y 34 , Ž 33 .
d xd c d xd c p 4 Ž1 q Ž 1 y x . . ´
1qz2 1qz
PQŽ2. Ž z. s2
1yz
Ž 2ln Ž 1 y z . y ln z q 32 . q 2
ln z y 1 q z ,
and the remaining part which describes the emission of a second hard photon along
scattered electron and positrons. The latter, upon combining with the part of contribu-
tions of soft and virtual photons to our process,
d s 0g 3 a D´
L ln q 34 ,
d xd c p ´
may be represented via electron structure function in the spirit of the Drell–Yan
approach
2
d s 0g a 1q Ž1yx .
¦ ;d xd c k 1 Ip 1
s
2p x
L0 d z 2 d z 3 d z 4 D Ž z 2 . D Ž z 3 . D Ž z 4 .
H
d s 0 Ž p 1 Ž 1 y x . , z 2 p 2 ;q1 ,q2 .
= , Ž 34 .
dc
54 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
™0
P Ž1 ,2. Ž z . s lim d Ž 1 y z . PŽD1,2 . q Q Ž 1 y D y z . PQŽ1 ,2. Ž z . 4 ,
D
1qz2
PDŽ1. s 2ln D q 32 , PQŽ1. Ž z . s ,
1yz
2 2p 2
PDŽ2. s Ž 2ln D q 32 . y , ...
3
These functions describe the emission of Žreal and virtual. photons both by final
electron and by positrons. The multiplier before the integral stands for the emission of a
hard photon by the initial electron. Thus Eq. Ž34. actually represents the partially
integrated Drell–Yan form of the cross section. Quite the same arguments are applicable
to the second case in which a hard photon is emitted by the final electron.
The cross section of the hard sub-process eŽ p 1 z 1 . q eŽ p 2 z 2 . eŽ q1 . q eŽ q2 . enter-
ing Eq. Ž34. has the form
™
d s 0 Ž z 1 p 1 , z 2 p 2 ;q1 ,q2 .
dc
2
8pa 2 z 12 q z 22 q z 1 z 2 q 2 c Ž z 22 y z 12 . q c 2 Ž z 12 q z 22 y z 1 z 2 .
s 2
. Ž 36 .
s z1Ž 1 y c . Ž z1 q z 2 q c Ž z 2 y z1 . .
The momenta of scattered electron q1 and positron q2 are completely determined by the
energy–momentum conservation law
2 z1 z 2 $
q10 s ´ , q10 q q20 s ´ Ž z 1 q z 2 . , c s cosq1 , p1 ,
z1 q z 2 q c Ž z 2 y z1 .
$ $
z 1 sinq1 , p1 s z 2 sinq2 , p1 .
In general their energies differ from those detected in experiment ´ 1X , ´ X2 , namely
´ 1X s q10 z 3 , ´ X2 s q20 z 4 ,
whereas the emission angles are the same in LLA.
Collecting the two expressions presented in Eqs. Ž32., Ž33. one can rewrite the result
in LLA as
ds g d s 0g
d xd c k 1 Ip 1
s ž d xd c / k 1 Ip 1
1 q d14 ,
²d s 0grd xd c : a x PQŽ 2 . Ž 1 y x .
d1 s
ž d s 0grd xd c / k 1 Ip 1
y1q
p
L 2
3 y ln Ž y 1 y 2 . q
4 Ž1 q Ž 1 y x .
2
.
.
Ž 37 .
V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59 55
²d s 0grd xd c : a x PQŽ 2 . Ž 1 y x .
d 1X s
ž d s 0grd xd c / X
k 1 Ip 1
y1q
p
L 2
3 q
4 Ž1 q Ž 1 y x .
2
.
,
2
d s 0g a 1q Ž1yx .
¦ ;
d xd c X
k 1 Ip 1
s
2p x
LX0 d z 1d z 2 d z 4 D Ž z 1 . D Ž z 2 .
H
d s 0 Ž z 1 p 1 , z 2 p 2 ;q1 ,q2 .
= D Ž z4 . , Ž 38 .
dc
with LX0 s L0 q 2lnŽ1 y x ..
For the case when the energies of scattered fermions are not detected, the expressions
Ž34., Ž38. may be simplified due to Hd z D Ž z . s 1 and z 3 , z 4-independence of the
integrand in k 1 I p 1 kinematics Ž z 4-independence in the k 1 I pX1 case..
The x-dependence of d 1 is shown in Fig. 3 for different values of the cosine of
scattering angle c. For a hard photon emission by final particles the correction d 1X
strongly depends on the experimental conditions of particle detection: the energy
thresholds of detection of scattered fermions. This dependence for d 1 is much weaker,
namely about 1%.
In conclusion let us recapitulate the results given in Eqs. Ž37., Ž38.. They both respect
the Drell–Yan form for a cross section in LLA. Nevertheless a certain deviation away
from RG structure function representation at a second order of PT in k 1 I p 1 kinematics
is observed. The term destroying expectations based on RG approach comes from
Fig. 3. The x-dependence of d 1 Žsee Eq. Ž37.. for different values of the cosine of scattering angle c.
Other parameters chosen are: u 0 s 0.1, ´ s1 GeV.
56 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
definite contribution of a soft photon emission, the term with lnŽ y 1 y 2 . in Eq. Ž37. which
for such a kind of process the validity of the Drell–Yan form for a cross section was not
proved so far. Another possible way out is a careful analysis of a conflict between a soft
and hard collinear photon emission. We have used the factorized form of a soft photon
emission Ž22. under the condition Ž21.. But, to the moment, this representation in the
peculiar case at hand is not rigorously proved as well.
The accuracy of our calculations of virtual and soft photon corrections is determined
by the omitted terms of the order of
a m2 a
ž
1 q O u 02
p
Ls ,
s p
Ls ,
/ Ž 39 .
which corresponds to a per mille level. The accuracy of the correction coming from two
hard photon emission is determined by O ŽŽ arp .lnŽ4ru 02 .. and at 1% level.
Acknowledgements
We thank A.B. Arbuzov for a critical reading of the manuscript, many valuable
comments and participating at the early stage of the investigation. We are also indebted
to L. Lipatov for many discussions elucidating factorization issues. One of us ŽEAK. is
grateful to the physical department of Insubria University ŽComo. for a warm hospitality
during accomplishment of the final part of this work and to the Landau Network-Centro
Volta grant for financial support. The support of EAK by INTAS grant 93-1867 ext. and
of EAK and BGS by Russian Foundation for Basic Research grant 99-02-17730 is
acknowledged.
Appendix A
Here we give the expressions for the quantities associated with G-type integrals:
1 p2
Jsy y2 Ll L t 1 q 2 L t 1 Lr y L2t y 2Li 2 Ž x. y ,
x 1 t1 6
1 r dz ln z A x Aqq
J1 s
t1 x 1
H0 1yz 1ylz
s
t1 x 1 ž 1q
ryx / s
t1 x 1
,
1 r dz z ln z
Jk s y H ,
t1 x 1 r 0 1yz 1ylz
1 r dz z ln z
J11 s y
t1 x 1
H0 Ž1yz . Ž1yl z . ž 1q
1yz / ,
V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59 57
1 r zd z z ln z
. ž /
J1 k s H 1q ,
t1 x 1 r 0 Ž1yz . Ž1yl z 1yz
p2 x x1
A s Li 2 Ž 1 y r . y q Li 2 Ž x . q Lr ln Ž 1 y x . , ls , rs .
6 r m2
Ž A.1 .
In the limit r 4 1 we have
F s x 1 A 2 q t 1 x 1 Ž J11 y J1 q xJ1 k y xJk . s y 12 q O Ž ry1 .
and that is the reason why w-structure does contribute only to next-to-leading terms.
In general the expression for 5-denominator one-loop scalar, vector and tensor
integrals are some complicated functions of five independent kinematical invariants Žin
the derivation we extensively use the technique developed in Ref. w22x.. In the limit
m 2 < x 1 < s ; yt they may be considerably simplified because of singular 1rx 1
terms only kept:
1 1
Es D 0124 q D 0123 ,
s1 t
1
E1 s yxEk s Ž D 0134 y Ž 1 y x . D 0234 y xD1234 q x 1 E . ,
2 x1
1 x x 2p 2
D 0124 s L2r q 2 Lr ln y ln 2
y ,
xt1 x 1 1yx 1yx 3
1 p2
Re D 0123 s L2s1 y 2 L s1 Lr y 2 L s Ll q q 2Li 2 Ž x . ,
sx1 6
1 5p 2
Re D 0234 s L2s1 q 2 L s1 Ll y 2 Lr L s1 q 2 L s1 L t y ,
s1 t 6
1 7p 2
Re D 0134 s L2s q 2 L s Ll y 2 Ž L t 1 q ln Ž x . . L s q 2 L s L t q ,
st 6
1 7p 2
Re D 1234 s y yL2s q 2 L s Ž L t 1 q ln Ž x . . q 2 L s1 Ll y . Ž A.2 .
s1 xt 1 6
The structure E11 q xE1 k has the form 1rŽ s x 1 . f Ž x, x 1 . and will vanish after performing
the operation Ž1 q Q 2 . s1 tP given in Ž14. which yields a contribution of P-type graphs
with crossed and uncrossed photon legs.
The following coefficient for the scalar integral is obtained in the calculation of
B-type FD:
1 p2
Bs L2s1 q 2 L s1 Ll y 2 L s1 Lr q 2 L s1 L t q . Ž A.3 .
s1 t 6
58 V. Antonelli et al.r Nuclear Physics B 568 (2000) 40–59
1 2 1 1 r
a1X 2 s y Ž Lt y L s . qp 2 q Ž Ls y Lt . y s ž /
Lr y L s1 ,
2 u12 1
tu1 1
1t ry1
1 3
4p 2
J0 s 2 L2s1 y 2 L s1 Lr y Li 2 Ž1yr . y . Ž A.5 .
s1 3
As has been mentioned in the text, the physical gauge exploited provides a direct
extraction of the kernel of the structure function out of the traces both in the tree- and
loop-level amplitudes. The pattern emerging
Ž pˆ 1 y kˆ 1 q m . eˆ Ž pˆ 1 q m . eˆ Ž pˆ 1 y kˆ 1 q m .
2
s 4 Ž p 1 e . Ž pˆ 1 y kˆ 1 . y e 2x 1 kˆ 1 f Ž 1 y x . Ypˆ 1 ,
2yx
kˆ 1 eˆ Ž pˆ 1 q m . eˆ Ž pˆ 1 y kˆ 1 q m . f Ž 1 y x . 2 ž /
W y Y pˆ 1 , Ž A.6 .
1yx
shows this clearly.
References
a
CERN, TH DiÕision, GeneÕa, Switzerland
b
C.N. Yang Institute for Theoretical Physics, State UniÕersity of New York at Stony Brook, Stony Brook, NY
11794-3840, USA
Received 19 August 1999; accepted 9 September 1999
Abstract
1. Introduction
A direct measurement of the gluon distribution in the proton is both very interesting
and very difficult. This is true, in particular, for the spin-dependent gluon density D g of
a longitudinally polarized proton. So far, the only information on the nucleon polarized
parton distributions comes from polarized deep-inelastic lepton–nucleon scattering
ŽDIS.. Here, in principle, D g could be determined from scaling violations; however, in
practice this procedure is afflicted by very large uncertainties due to the limited accuracy
of the data and, in particular, to the fact that so far only fixed-target DIS experiments
have been carried out in the polarized case, which, consequently, have a rather limited
lever arm in Q 2 . Thus, the proton spin-dependent gluon density D g is experimentally
constrained only very little so far.
In order to perform a direct determination of the gluon distribution, one needs to
consider physical processes, which are predominantly initiated by gluons at the parton
level; the contamination from quark-initiated subprocesses must be under good theoreti-
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 5 7 5 - 1
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 61
cal control; and finally, the process must take place at a measurable rate, taking
experimental efficiencies into proper account. Only a few processes are known that meet
all these requirements. One example is the production of heavy flavours in photon–
hadron collisions; the cross section for this process in the case of polarized scattering
has recently been computed w1,2x in QCD at next-to-leading order ŽNLO.. The COM-
PASS collaboration at CERN w3x, and possibly even HERA in the polarized configura-
tion w4–6x, will be able to exploit charm production in order to constrain the polarized
gluon density. Such measurements are on the other hand severely limited by the low
experimental efficiency of charmed-meson tagging.
A second possibility is given by jet hadro- and photoproduction; also in this case,
QCD cross sections for polarized collisions are now known to NLO w7,8x. It will indeed
be attempted to determine D g, at the forthcoming polarized RHIC pp collider w9,10x, by
a measurement of the spin asymmetry in jet production, and the recent study of Ref. w7x
has demonstrated that this approach has very promising prospects.
™
In the unpolarized case, the classical tool for determining the gluon density at
intermediate and large x has been prompt-photon production, pp g X and pN g X,
in fixed-target experiments w11x. Indeed, data on prompt photons have been the
™
backbone of the gluon determination in many analyses of parton densities. The main
™ ™
reason for this is that, at leading order, a photon in the final state is produced in the
reactions qg g q and qq g g, with the contribution of the former subprocess being
obviously sensitive to the gluon and usually dominant over that of the latter. It is the
‘point-like’ coupling of the photon to the quark in these subprocesses that is responsible
for a much cleaner signal than, say, for the inclusive production of a p 0 , which proceeds
necessarily through a fragmentation process.
The cleanliness of the signal is of course an advantage that also counts at colliders.
The aim of this paper is to provide a detailed study, performed in perturbative QCD at
NLO accuracy, of the production mechanism for prompt photons at polarized hadronic
colliders, such as RHIC. In the rest of this introduction, we will briefly review the
present theoretical knowledge of prompt-photon production. In particular, we will
explain why, in our opinion, a theoretical reappraisal of this matter is needed, before the
data-taking will start at RHIC.
Prompt-photon data obtained at hadronic colliders have been used as a constraint for
the unpolarized gluon density w11x. Thinking of the polarized case again, it is clear that
the production of photons with polarized beams at RHIC is likely to be a very promising
source of information on D g. We note that having pp reactions Žas opposed to pp ones
™
as hitherto in the unpolarized case. is also an advantage, since the competing LO
subprocess qq g g does not receive any contributions from valence–valence scatter-
ing here. Compared to jets, the prompt-photon reaction shows a larger spin asymmetry,
even though of course the jet rate is much higher at a given p T , resulting in smaller
statistical errors. Eventually, both reactions, plus also charm production and p 0 produc-
tion in polarized pp scattering, can be utilized to constrain D g at RHIC, and it will be
interesting to see how far the various channels will provide compatible pieces of
information, andror whether they will complement one another.
Unfortunately, the cleanliness of the prompt-photon signal alleged above is limited.
As is well known, photons can also be produced through a fragmentation process, in
which a parton, scattered or produced in a QCD reaction, fragments into a photon plus a
number of hadrons. The problem with the fragmentation component in the prompt-pho-
62 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
™
w12,13x. Secondly, all QCD partonic reactions contribute to the fragmentation compo-
™
nent; thus the advantage of having a priori only one partonic reaction Ž qq g g .
competing with the signal Ž qg g q . is lost, even though some of the subprocesses
relevant to the fragmentation part at the same time result from a gluon initial state.
Numerical studies w14–17x for photon production in unpolarized collisions, based on
predictions w14,18–20x for the photon fragmentation functions that turned out to be
compatible with the sparse LEP data, demonstrate that the fragmentation component is
not messing up things too much, even though it cannot be neglected in a careful study.
In the fixed-target regime, it amounts to an effect of about 20%. At collider energies, it
would easily make up for about half of the observed photons; however, here the
situation is saved by the so-called ‘isolation’ cut, which is imposed on the photon signal
in experiment. Isolation is an experimental necessity: in a hadronic environment the
study of photons in the final state is complicated by the abundance of p 0 ’s, eventually
decaying into pairs of g ’s. The isolation cut simply serves to improve the signal-to-noise
ratio: if a given neighbourhood of the photon is free of energetic hadron tracks, the event
is kept; it is rejected otherwise. In principle, there is a large freedom in the choice of
™
specific isolation cuts, the only requirement being that they must strongly suppress the
background p 0 gg , while keeping the signal at a measurable rate. Traditionally,
isolation is realized by drawing a cone of fixed aperture in azimuthal angle-pseudorapid-
ity space around the photon, and by restricting the hadronic transverse energy allowed in
this cone to a certain fraction Žof the order of less than 10%. of the photon transverse
energy. In this way, it is clear that the fragmentation contribution, resulting from an
essentially collinear process, will be diminished w21x. In actual numbers, it is not
expected w15,16x that it will remain responsible for more than 15–20% of the photon
signal after isolation.
Studies of the backgrounds to prompt-photon production expected for RHIC have
been reported in Refs. w22–26x, based on parton-shower Monte Carlo event generators. It
is anticipated w22–24x that isolation cuts will also have to be applied in prompt-photon
measurements at polarized RHIC. When working out theory predictions for the
Žun.polarized cross sections and the resulting spin asymmetry, it is crucial that the
calculation properly treats all effects mentioned so far, in particular those related to
fragmentation and isolation.
It is the objective of this paper to do just this. Of course, it must be pointed out that
into these calculations in the approximation that the isolation cone be rather narrow.
possibly be saved even by ‘fine’-tuning the unpolarized gluon density! Clearly, if this
situation persists, we will have to worry about whether one can really interpret, in the
polarized case, future RHIC data straightforwardly in terms of D g. A possible remedy
for this trend has been brought forward in terms of a smearing of the transverse
momenta of the initial partons participating in the hard scattering w50–53x, required to be
substantially larger than what is already introduced by the NLO calculation. This
approach still remains to be set on a more solid foundation – eventually it should be
accounted for to some extent by a k T-resummation calculation w54x with perturbative as
well as non-perturbative components. Furthermore, threshold resummations w55,56x,
aiming at the high-pT end, have been shown w57x to lead to a certain improvement in the
fixed-target regime. We also note that possible inconsistencies between the various data
sets themselves have been pointed out w17x. It remains to be seen whether or not the
d sA B Ž K A , K B ; Kg . s dx 1 dx 2 f aŽ A. Ž x 1 , m F . f bŽ B . Ž x 2 , m F .
H
=d sˆaisol
b ,g Ž x 1 K A , x 2 K B ; K g ; m R , m F , mg .
where A and B are the incoming hadrons, with momenta K A and K B respectively, and
a sum over the parton indices a, b and c is understood. In the first term on the r.h.s. of
Eq. Ž1., denoted as the direct component, the subtracted Žfactorized. partonic cross
sections d sˆaisol
b,g get contributions from all the diagrams with a photon leg. On the other
hand, the subtracted partonic cross sections d sˆaisol
b,c appearing in the second term on the
r.h.s. of Eq. Ž1. Ždenoted as the fragmentation component., get contribution from the
pure QCD diagrams, with one of the partons eventually fragmenting in a photon, in a
way described by the Žperturbatively uncalculable but universal. parton-to-photon
fragmentation function DgŽ c.. Eq. Ž1. is to be regarded as a generic expression for the
cross section: it will apply to unpolarized as well as polarized cross sections; in the latter
case one simply has to substitute the parton densities f iŽ h. and the partonic cross sections
d sˆaisol
b, r with their spin-dependent counterparts, D f i
Ž h.
and d Dsˆaisol
b, r respectively. Note,
however, that the parton-to-photon fragmentation functions DgŽ c. are always the unpolar-
ized ones since we are not measuring the polarization of the produced photon.
As the notation in Eq. Ž1. indicates, the isolation condition is embedded into the
partonic cross sections. As mentioned in the introduction, for all the isolation conditions
known at present, except that of Ref. w44x, as well as for the case of totally inclusive
Žnon-isolated. photon production, neither the direct nor the fragmentation components
are separately well defined at any fixed order in perturbation theory: only their sum is
physically meaningful. In fact, the direct component is affected by quark-to-photon
collinear divergences, which are subtracted by the bare fragmentation function that
appears in the unsubtracted fragmentation component. Of course, this subtraction is
arbitrary as far as finite terms are concerned. This is formally expressed in Eq. Ž1. by the
presence of the same scale mg in both the direct and fragmentation components: a finite
piece may be either included in the former or in the latter, without affecting the physical
predictions. The need for introducing a fragmentation contribution is physically better
motivated from the fact that a QCD hard scattering process may produce, again through
a fragmentation process, a r meson that has the same quantum numbers as the photon
and can thus convert into a photon, leading to the same signal.
Owing to the presence of the fragmentation remnants, which surround the emitted
photon, the effect of the isolation cuts will be a stronger suppression of the fragmenta-
tion component relative to the direct component, with respect to the case of totally
inclusive photon production. Since the parton-to-photon fragmentation functions are
extremely poorly known, one may adopt two opposite points of view.
1. Define the isolation cuts in order to suppress as much as possible the fragmentation
component. The resulting cross section will be useful to measure the incoming gluon
density or to test the predictions of the underlying theory. In this context, the
unknown fragmentation functions are regarded as uncertainties affecting the theoreti-
cal predictions.
2. Define the isolation cuts in order to keep a non-negligible contribution from the
fragmentation component. The comparison between data and the resulting cross
section will eventually be used to extract the parton-to-photon fragmentation func-
tions. This strategy makes most sense if the initial state is as clean as possible, which
is the case for eqey collisions.
66 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
The former criterion leads to the so-called cone approach w21,40,58–61x. After
tagging the photon, one draws a cone of half-angle R 0 around it. The word ‘cone’ can
be misleading, being motivated by eqey physics. Here, the cone is drawn in the
pseudorapidity-azimuthal angle plane, and corresponds to the set of points
½ ( 2
CR 0 s Ž h , f . N Ž h y hg . q Ž f y fg . ( R 0 ,
2
5 Ž 2.
where hg and fg are the pseudorapidity and azimuthal angle of the photon, respectively.
The quantity in Eq. Ž2. is boost-invariant, and is therefore suited to be used in collider
physics. For the photon to be defined as isolated, the total amount of hadronic transverse
energy ET ,had Ž R 0 . found in this cone must fulfill the following condition:
ET ,had Ž R 0 . ( e c p Tg , Ž 3.
where e c is a small number, and p Tg is the transverse momentum of the photon. This
isolation prescription was proven to be infrared-safe at all orders of perturbation theory
in Ref. w62x. The smaller e c , the tighter the isolation. Loosely speaking, for vanishing e c
™
the direct component behaves like log e c , while the fragmentation component behaves
™
like e c log e c . Thus, for e c 0 Eq. Ž1. diverges. This is obvious since the limit e c 0
corresponds to a fully isolated cross section, which cannot be a meaningful quantity,
whether experimentally Žbecause of limited energy resolution. or theoretically Žbecause
there is no possibility for soft particles to be emitted into the cone..
On the other hand, if one actually wants to measure the fragmentation functions, then
the so-called democratic approach should be adopted w63,64x. The basic idea here is to
treat the photon as a QCD parton in a jet-clustering algorithm, and then to impose a cut
on the hadronic energy contained in the ‘jet’, which also contains the photon. This
approach has so far been used only in eqey physics, and we will not discuss it any
further in this paper.
In the spirit of the cone approach, an alternative definition of the isolated photon has
been proposed w44x. After drawing a cone of half-angle R 0 around the photon axis, all
the cones of half-angle R ( R 0 are considered; their definition is identical to the one
given in Eq. Ž2., with R 0 replaced by R. Denoting by ET ,had Ž R . the total amount of
hadronic transverse energy found in each of these cones, the photon is isolated if the
following inequality is satisfied:
ET ,had Ž R . ( eg p Tg Y Ž R . , Ž 4.
for all R ( R 0 . A sensible choice for the function Y is the following:
n
1 y cos R
Y Ž R. s
ž 1 y cos R 0 / , n s 1. Ž 5.
It has been proved in Ref. w44x that such a choice allows the definition of an
isolated-photon-plus-jet cross section, which is infrared-safe to all orders in QCD
perturbation theory and still does not receive any contribution from the fragmentation
mechanism. In this case, therefore, only the first term on the r.h.s. of Eq. Ž1. is different
from zero, and it does not contain any mg dependence. The reader can find all the
technical details concerning the isolation prescription based on Eq. Ž4. of Ref. w44x.
Here, we will just recall the main ideas. The fundamental property of the function Y is
R ™0
lim Y Ž R . s 0, Ž 6.
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 67
the function being different from zero everywhere except for R s 0. This implies that
the energy of a parton falling into the isolation cone CR 0 is correlated to its distance Žin
the h – f plane. from the photon. In particular, a parton becoming collinear to the photon
is also becoming soft. When a quark is collinear to the photon, there is a collinear
divergence; however, if the quark is also soft, this divergence is damped by the quark
vanishing energy Žprovided that the energy vanishes fast enough; this condition is not
very restrictive, and the form in Eq. Ž5. easily fulfills it.. When a gluon is collinear to
the photon, then either it is emitted from a quark, which is itself collinear to the photon
– in which case, what was said previously applies – or the matrix element is finite.
Finally, it is clear that the isolation condition given above does not destroy the
cancellation of soft singularities, since a gluon with small enough energy can be emitted
anywhere inside the isolation cone. The fact that this prescription is free of final-state
QED collinear singularities implies that the direct part of the cross section is finite. As
far as the fragmentation contribution is concerned, in QCD the fragmentation mecha-
nism is purely collinear. Therefore, by imposing Eq. Ž4., one forces the hadronic
remnants collinear to the photon to have zero energy. This is equivalent to saying that
the fragmentation variable z is restricted to the range z s 1. Since the parton-to-photon
fragmentation functions do not contain any d Ž1 y z ., this means that the fragmentation
contribution to the cross section is zero, because an integration over a zero-measure set
is carried out.
We stress that the function given in Eq. Ž5. is to a very large extent arbitrary. Any
sufficiently well-behaved function, fulfilling Eq. Ž6., could do the job, the key point
being the correlation between the distance of a parton from the photon and the parton
energy, which must be strong enough to cancel the quark-to-photon collinear singularity.
We also remark that the traditional cone-isolation prescription, Eq. Ž3., can be recovered
from Eq. Ž4. by setting Y s 1 and eg s e c . In the rest of this paper, as a short-hand
notation, we will indicate the ‘traditional’ isolation obtained by imposing Eq. Ž3. as
definition A, and that obtained by imposing Eq. Ž4. as definition B.
At first sight, the new isolation approach appears to be stricter than the traditional
one. On the other hand, the fact that for the new constraint one also considers the angle
between the photon and hadrons in the cone, is a real virtue here: for the traditional
criterion, one would reject a hadron of, say, 2 GeV wherever it is located in the cone,
just because its energy exceeds the limit. Of course, if the cone size is 0.7, and the
hadron has a distance of 0.6 with respect to the photon – why should one want to reject
such an event? This situation is improved with the new constraint: hadrons in the cone
that are still quite far away from the photon are allowed to have more energy than those
close to the photon. In this way, one can well allow a hadron to have 2 GeV, or even
more, at a distance of 0.6. This little example implies that a detailed comparison
between the traditional and the new isolation methods is certainly of some interest, and
this will also be performed in this paper. One can then eventually decide which isolation
to use in actual experiment.
mention the fact that the code outputs the kinematical variables of the photon and of the
final-state partons, plus a suitable weight. Therefore, the isolation condition, the
jet-reconstruction algorithm, and any cuts matching the experimental setup can be
implemented as the final step of the computation algorithm. This allows us to plot as
many observables as we want in one single computer run.
It is easy to see that one can extend the ideas behind the ‘narrow-cone’ approxima-
tion, used for the definition-A code, also to the isolation given by definition B. In this
way, we have been able to compare extensively the results of the two codes. We found
excellent agreement of the two programs over a wide range of kinematical variables, and
also for cone openings of even R 0 s 0.7, if only central values of rapidity are
considered. This suggests the correctness of the two – entirely independent – codes. It
also implies that the ‘narrow-cone’ approximation has a rather large region of validity
and can be well used for practical applications. We recall, however, that the correspond-
ing code is only suitable for fully inclusive photon observables, and not for photon-plus-
jet ones.
In this section, we study the inclusive properties of isolated photons. More exclusive
observables, such as correlations between the photon and the accompanying jets, will be
discussed in Section 4. We will consider centre-of-mass energies spanning the range
'S s 200–500 GeV. We will carefully investigate the differences induced by the
different isolation prescriptions we deal with in this paper. We will address the issue of
the perturbative stability of our results, and study the dependence of the cross sections
upon the polarized parton densities.
Unless otherwise specified, we will use the following parameters, as a default for our
calculations:
1 GeV
R 0 s 0.4, ec s , definition A; Ž 7.
p Tg
R 0 s 0.4, eg s 1, n s 1, definition B. Ž 8.
It is worth emphasizing at this point that we have chosen eg 4 e c : for traditional
isolation A, e c has to be small – otherwise, isolation is totally ineffective. For isolation
B, on the other hand, eg may be chosen large, as we discussed previously. A large eg
only means that one still allows considerable amounts of hadronic energy in the cone,
provided it is deposited far away from the photon. We add that it is actually desirable
theoretically in any isolation to have a ‘large’ value of e Žs eg or e c .: soft-gluon
emission into the cone generates logarithms w21,40,44x of e , with an extra power at each
further order of perturbation theory, which for very small e eventually threaten to spoil
the perturbative expansion. A study on the structure of the logarithms appearing in the
isolated-photon cross section in eq ey collisions has been given in Ref. w62x. Pending a
70 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
more thorough investigation of these points in the case of isolation B, it seems likely
that being able to choose eg s O Ž1. is clearly a virtue of this isolation method.
The default value for the factorization and renormalization scales will be indicated by
m 0 , which will be taken equal to the transverse momentum of the photon in the case A,
and equal to half of the total transverse energy of the event in the case B. These two
choices, which slightly differ beyond LO, are due to the different structure of the codes
computing the isolated-photon cross section in cases A and B, as described in Section 2.
It would be possible to set m 0 s p Tg in case B Žhowever, this choice, although formally
correct, is less appropriate than the one adopted here: since the code is fully exclusive in
the variables of the photon and of the final-state partons, the reference scale, which is
directly related to the hardness of the process, should also depend upon the transverse
momenta of the partons.; this would result in differences with our default choice that are
completely negligible when compared to the other sources of theoretical uncertainty
affecting the cross section. We will adopt throughout the two-loop expression for a S , the
LQC D value being that associated with the parton densities used. Our default parton
density sets will be the NLO ‘standard’ set of Ref. w70x ŽGRSV STD. and MRST w53x
for the polarized and unpolarized scattering respectively. In the case of the definition A,
we will use the NLO GRV w18,19x set of parton-to-photon fragmentation functions.
We have to note here that, while the value of LQCD associated with the MRST set
Ž L5 MS s 220 MeV. is close to the central value of the latest PDG world average Žat two
loops, L5 MS s 237q26 w x.
y24 MeV 71 , all the available polarized density sets have a value
which is much lower, consistent with that extracted from DIS data some years ago.
Thus, by adopting the value of LQC D associated with a given set, we have the
unpleasant situation in which, in the computation of asymmetries, the numerator and the
denominator have different L’s. Still, we preferred not to violate the correlation
between the parton densities and LQC D . This correlation is expected to be particularly
strong in the case of the gluon density, which is of great importance here. Since a
smaller LQC D entails a smaller strong coupling, our predictions for asymmetries would
have become somewhat larger than the ones we present below, had we decided to adopt
the same value of LQC D in the polarized and unpolarized cross sections. This situation
has already been encountered in Ref. w7x, for jet physics. There, it has been shown that
using the same LQC D in the polarized and unpolarized cross sections would increase the
asymmetry by 15% Žrelatively. at the most. In the case of photon production, the
difference is even smaller. As we will see, the effect is therefore completely negligible,
with respect to the differences in the predictions of the asymmetries induced by the
choice of different parton densities.
In what follows, in order to assess the importance of the radiative QCD corrections,
we will often compare the NLO and Born results. Throughout the paper, by ‘Born
result’ we will mean the prediction obtained by convoluting the lowest-order partonic
cross sections Ž O Ž a em a S . and O Ž a S2 . for the direct and the fragmentation contributions
respectively. with the NLO-evolved parton densities and, if needed, fragmentation
functions. Also, the two-loop expression of a S will be used. There is of course a lot of
freedom in the definition of a Born-level result. However, we believe that with this
definition one has a better understanding of some issues related to the stability of the
perturbative series. This is especially true in polarized physics, where the data are not
sufficient to determine the parton densities with a good accuracy, and where large
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 71
Žartificial. differences can arise between sets fitted at LO or NLO to the available DIS
data. For a detailed discussion on this point, see for example Ref. w7x.
We start by considering the transverse momentum spectrum of isolated photons. In
the lower part of Fig. 1 we plot the Born Žhistogram with symbols. and NLO Žsolid
histogram. results for the polarized cross section, obtained at 'S s 500 GeV with the
isolation definition B. A cut hg - 0.35 has been imposed, which is suitable for the
PHENIX experiment. As can be seen from the figure, the inclusion of the radiative
corrections gives a sizeable effect as far as the normalization is concerned Žin the first
bin, the ratio of NLO over Born result is about 1.8., while the shape is almost unaffected
Žthe Born being only slightly harder than the NLO result.. Since the radiative corrections
are large, one may wonder whether the NLO result is a sensible quantity to compare
with data. A rigorous answer to this question can only come from a complete NNLO
calculation. Lacking that, we study the scale dependence of our results, as customary in
perturbative QCD, to see whether the inclusion of radiative corrections leads to a milder
dependence upon the scales, as compared to the one of the Born result. Here, it is
especially important to study the separate dependence upon the renormalization and
factorization scales, because cancellation effects between the two may hide some
problems. We present the scale dependence of the p Tg spectrum in the upper and central
parts of Fig. 1. There, we show the ratio of the cross section obtained by setting the
Fig. 1. Transverse-momentum spectrum of the isolated photon, in the case of definition B, for polarized pp
collisions at 'S s 500 GeV. The polarized parton densities used are GRSV STD. The scale dependences of
the Born and NLO results are also shown; see the text for details.
72 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
scales equal to m 0r2 and 2 m 0 , over the cross section for the default values of the scales.
We stress that only one scale is varied at a time. The renormalization Žfactorization.
scale variation corresponds to the dotted Ždashed. curves; the curves decreasing for
increasing p Tg correspond to m R s m 0r2 and m F s 2 m 0 , respectively. From the figure,
it is apparent that the inclusion of the radiative corrections reduces the scale dependence
in the whole p Tg range considered, with a possible exception in the case of the m F
dependence, for p Tg equal to 16–20 GeV, where there is basically no m F dependence.
The reduction is stronger in the case of m F dependence than in the case of m R
dependence. The fact that there is a point in the p Tg spectrum where there appears to be
no factorization scale dependence is purely accidental; it can be traced back to the
behaviour of the parton densities with respect to the hard scale. In fact, the gluon density
increases with the scale in the x range corresponding to the low-pTg region, while it is
decreasing when the scale is increasing for larger x values, probed when a harder
photon is produced. We can conclude from Fig. 1 that the perturbative expansion seems
to be reliable in this case; in all cases, the radiative corrections reduce the size of the
dependence of the p Tg spectrum upon the scales. We must comment on the fact that this
conclusion is not specific to the kinematical configuration considered in Fig. 1: we
verified that the same kind of behaviour can be seen in a larger hg range Žwe studied
the case y1 - hg - 2., and also at lower centre-of-mass energies Ž'S s 200 GeV..
Furthermore, almost the same results are obtained in the case of unpolarized collisions.
We now turn to the case when the photon is isolated according to definition A. The
results are presented in Fig. 2. In the lower part, we display the ratio of the cross section
over that obtained with definition B. In this case, the scales are fixed to their default
values. The result at the Born level is again displayed as a histogram with symbols. The
Born result in the case of definition A is always higher than that relevant to definition B.
This is easy to understand, since at this order the result for the direct part is independent
of the isolation condition, and the photon isolated with definition A gets a contribution
from the fragmentation part, which is not present in the case of definition B. Things of
course change at NLO: having an additional parton around, the isolation condition is
effective also in the direct part. We must also remark that, in the case of isolation A, the
fragmentation contribution is only included at LO. A consistent computation at NLO
would presumably produce a slightly larger cross section Žfor example, in the case of
unpolarized collisions, the inclusion of radiative corrections in the fragmentation compo-
nent enhances the full cross section at high p T by about 3%.. The effect is much larger
in the case in which there is no isolation condition, and the photon is fully inclusive. We
will further comment on this fact below. As in the previous case, we also studied the m R
and m F dependence of the spectrum; in doing so, the factorization and renormalization
scales of the direct and of the fragmentation components have been set to the same
value: mXF s m F and mXR s m R Žsee Eq. Ž1... The results are displayed in the upper parts
of Fig. 2. Note that for definition A we have an additional pair of lines Ždot-dashed.,
corresponding to the results obtained by varying the final-state factorization scale mg ,
which enters the fragmentation functions. The m F and m R dependence is very similar to
the one relevant to definition B, displayed in Fig. 1, and the same comments made
previously apply here. On the other hand, the mg dependence is extremely small, and
gives a negligible contribution to the theoretical error affecting the cross section. The
almost identical scale dependence in the case of definitions A and B also implies that the
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 73
Fig. 2. Transverse-momentum spectrum of the isolated photon, in the case of definition A. The ratio of the
cross section over that obtained with definition B is shown in the lower part. The rest of the figure displays the
scale dependence, at the Born and NLO levels.
ratio of cross sections plotted in the lower part of Fig. 1 is, to a good extent, independent
of the scale choice.
For completeness, we present in Fig. 3 the corresponding predictions for the fully
inclusive non-isolated prompt-photon cross section. We first discuss the scale depen-
dence of the results, displayed again in the upper two parts of the figure. At low p Tg , the
m R dependence turns out to be larger than that of the isolated-photon cross sections,
while at large p Tg the two appear to be pretty similar Žthe isolation condition is less and
less restrictive as the transverse momentum of the photon is increased, since it is more
and more difficult to have a hard parton, in the surroundings of the photon, that does not
pass the isolation cuts.. The mg dependence of the fully inclusive cross section is much
larger than that of the isolated-photon cross section obtained with definition A. How-
ever, its effect is still smaller than that due to m R and m F .
The comparison of the non-isolated cross section with the one isolated according to
definition B is shown in the lower part of Fig. 3. One immediately notices a striking
feature: at NLO, the isolated cross section becomes larger than the unpolarized one at
large p Tg . Clearly, this finding is at odds with the physical expectation that any
meaningful isolation cut should reduce the number of events with respect to the number
obtained for no isolation at all. The origin of the problem we encounter resides in the
fact that the fragmentation contribution to the non-isolated cross section Žand, obviously,
also for the results for isolation A presented in Fig. 2. has only been calculated at the
74 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
LO level since, as we pointed out earlier, the NLO corrections to the fragmentation
contribution have not been calculated so far in the polarized case. We expect that once
the proper NLO fragmentation component is included in the calculation of the polarized
cross section, the disagreeable feature of Fig. 3 will disappear. This view is corroborated
by the observation that we find exactly the same pattern in the unpolarized case: there,
everything can be calculated consistently at NLO, and the non-isolated cross section
turns out to be larger than the one obtained for both types of isolation we consider.
However, we checked that, if we compute the fragmentation contribution only at LO in
the unpolarized case, we indeed obtain a non-isolated cross section that is smaller than
the isolated one for definition B, much as happens in Fig. 3. The figure clearly points
out the importance of consistency in the NLO calculation. A NLO calculation of the
fragmentation component of the polarized prompt-photon cross section is highly desir-
able for the future. On the basis of Fig. 3 we would predict non-negligible positive
corrections to the LO result. For the time being, our present results for isolation A and
the non-isolated case have only limited reliability. Fortunately, fragmentation is really
important only in the non-isolated case, which is not the one relevant to experiment. For
a Žtraditionally. isolated cross section, it contributes a relatively small fraction of the full
result, and this fraction decreases rapidly towards larger p Tg . We are therefore fairly
confident that our predictions for definition A are numerically not too far off the true
NLO answer.
For these reasons, we refrain from performing a detailed study of the uncertainty in
the cross section for isolation A. We only state that we have also calculated the
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 75
Fig. 4. Average transverse energy in the cone annuli around the photon.
unpolarized and the polarized cross sections, using set I of the photon fragmentation
functions of Ref. w20x. We find that the fragmentation component to the cross section
decreases by about 23% at p Tg f 10 GeV, and by about 50% at the high-pTg end. For
the full Ži.e. direct plus fragmentation. cross section, the effect is obviously much
smaller, generally below 2%.
There is another striking property of the curve in the lower part of Fig. 2: it is Õery
close to unity over the whole range of p Tg Žin fact, from what we just discussed, we
would expect it to be even closer to unity at large p Tg , had we been able to include the
fragmentation component at NLO in case of definition A.. In other words, the two types
of isolation, albeit so different from a physics point of view, lead to almost identical
cross sections. To some extent, this is certainly due to the choices we made for e c , eg in
Eqs. Ž7., Ž8.: had we chosen, say, eg s e c there1 , then isolation B would have become
stricter than isolation A, and the corresponding curve in Fig. 2 would have been above
unity everywhere. Our choices in Eqs. Ž7., Ž8. presumably created a certain ‘balance’
between the two isolations. However, we found that there is more to the similarity of the
two isolated cross sections. When performing runs at larger values of R 0 , we found that
the cone-size dependence of the cross section is extremely mild for both types of
isolation. This indicates that isolation is most effective close to the photon and does not
affect the cross section too much at larger distances from the photon. To illustrate this
point, we plot in Fig. 4, as a function of the distance R from the photon, the amount of
hadronic transverse energy deposited on average in a cone annulus between R y D R and
R q D R, where D R s 0.025. We do this for isolation of type B, considering two
realistic values of the isolation-cone size, R 0 s 0.4 and R 0 s 0.7, and one extreme value
R 0 s 0.005. We have chosen 'S s 500 GeV, and the photon variables have been
1
This is simply an example; as discussed at the beginning of this subsection, this is actually not a desirable
choice.
76 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
™
Fig. 4 that this extra particle is usually soft: the point is rather that, thanks to the
collinear singularity of the corresponding 2 3 matrix elements at R s 0, the extra
particle simply happens to be close to the photon more often than far away from it, so
that on aÕerage more energy is deposited close to the photon. Incidentally, one can
convince oneself that, for small R and D R < R, the quantity shown in Fig. 4 has to be
proportional to D Rrs Ž R 0 . = Ž d s Ž u .rdu .usR , where u is the angle between the
momenta of the photon and the other particle in the cone, and s Ž R 0 . is the total cross
section for a given isolation-cone size R 0 and a given kinematical range for the photon
variables Žhere, p Tg ) 10 GeV and y1 - hg - 2.. For a quark parallel to the photon
Žwhich is the only configuration producing a collinear singularity at u s 0., one thus
finds immediately that the curves in Fig. 4 should fall A 1rR if R ) R 0 , i.e. outside the
isolation cone. This is exactly the pattern observed in the figure. Inside the cone the
™
isolation is effective, and for the isolation of definition B employed here, one expects
the curve to fall like some power of R as R 0. Finally, we also note that the
normalization factor 1rs Ž R 0 . is the reason why the histograms for the three different
R 0 in Fig. 4 do not exactly coincide even if R ) 0.7. A comparison of the histograms
relevant to R 0 s 0.4 and R 0 s 0.7 at R s 1 nicely demonstrates how weak the depen-
dence of the isolated-photon cross section on the cone size is.
We finally study the distribution of the photon in pseudorapidity hg . We impose here
the transverse momentum cut p Tg ) 10 GeV. Our results are summarized in Fig. 5; they
have been obtained in the case of definition B. The upper histograms correspond to
NLO, while the lower ones represent the Born level results. The solid histograms have
been obtained with default scales. They are pretty similar in shape; the Born is only
slightly broader, with a deeper dip at hg s 0. The size of radiative corrections is as
expected from what we previously found for the p Tg spectrum: the NLO result is about
a factor 1.6 higher than the Born one. The pairs of dotted histograms are obtained by
setting the renormalization scale equal to m 0r2 and 2 m 0 . It is clear that also in this case
the radiative corrections have the effect of reducing the scale dependence; by varying
m R , one obtains NLO cross sections that differ from the default one by 10% at most. The
effect of varying the factorization scale is much smaller than that associated with
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 77
Fig. 5. Pseudorapidity spectrum of isolated photons Ždefinition B.. The cross sections obtained by varying the
renormalization and factorization scales are also shown.
In spin physics, experiments usually focus on spin asymmetries, since many system-
atic uncertainties cancel out in this ratio of polarized and unpolarized cross sections. In
what follows, we will therefore study the quantity
d DsrdpTg
Ap Tg s , Ž 9.
d srdpTg
as a function of p Tg . We will also consider a similar asymmetry, with p Tg replaced by
hg . More studies on asymmetries will be presented in Section 4. In Eq. Ž9., it is
understood that the same kinematical cuts are applied to both the numerator and the
78 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
protons are equal, L is L °' L . If this were not the case, the quantity 2 s L in Eq.
here that the luminosities relevant to parallel and antiparallel spins of the incoming
Fig. 6. Asymmetry as a function of pTg , for various polarized densities, at different centre-of-mass energies.
The minimally observable asymmetry Ždot-dashed histogram. is also shown.
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 79
of the asymmetry is concerned, the main difference being in the absolute normalization.
For this reason, we will limit ourselves in this paper to three sets: GRSV STD, our
default set; GRSV MAXg w70x, which has a much larger gluon density and is thus
expected to return the highest cross sections; and set C of Ref. w72x ŽGS-C., which has a
rather small gluon density whose shape is dramatically different from that of all the
other sets, turning negative at high x for Q 2 not too large.
In Fig. 6 we present our results for the asymmetry as a function of p Tg . A cut
hg - 0.35 has been applied. In the left part of the figure we plot the asymmetries
obtained at 'S s 200 GeV, while in the right part we present the results for 'S s 500
GeV. The solid, dashed and dotted histograms correspond to the NLO predictions
obtained with GRSV STD, GRSV MAXg and GS-C respectively. The corresponding
symbols Žsee the labels on the figure. are the results obtained at the Born level. Finally,
the dot-dashed histogram is the minimally observable asymmetry, as defined in Eq. Ž10..
We have chosen L s 100 pby1 , P s 1 and e s 1. Of course, the latter two choices are
not realistic; however, in adopting this ‘ideal-world’ situation, we can estimate the
optimally achievable accuracy for a given integrated luminosity. Note that the assumed
value for L is conservative; one expects in the best case to eventually obtain L s 160
pby1 rpolarization at 'S s 200 GeV and L s 400 pby1rpolarization at 'S s 500
GeV. In any case, it is straightforward to rescale our predicted minimally observable
asymmetry 2 if one prefers other values for L , P or e . We also emphasize that we have
chosen rather small bins in p Tg , D p Tg s 2 GeV. It would certainly seem advantageous
in the actual data analysis to increase the bin size when going to larger p Tg , as is indeed
a commonly adopted procedure in the unpolarized prompt-photon experiments.
From Fig. 6, we see that the shapes of the asymmetries obtained using the GRSV
STD and GRSV MAXg sets are quite similar, but the difference in normalization is
sizeable; this is consistent with what we observed in our study of jet physics w7x. On the
other hand, the result obtained for GS-C looks completely different. At NLO, this
asymmetry becomes negative in the region 10 - p Tg - 30 GeV at 'S s 200 GeV, and in
the region 30 - p Tg - 45 GeV at 'S s 500 GeV. This is due in the first place to large
cancellations between the contributions of various partonic channels: while the GRSV
STD and GRSV MAXg results are dominated by the contribution of the qg-initiated
subprocess, this is not true in the case of GS-C, where the gluon is so small that
quark–quark scatterings Žof opposite sign. are in absolute value of the same order or
larger, in particular in the central pseudorapidity region, which is of interest here. For
the same reason, the asymmetries for set GS-C obtained at the Born level turn out to be
always substantially larger than those obtained at NLO. The issue of Žnon-.dominance of
the qg subprocess, which is obviously of key interest for the extraction of the polarized
gluon density from isolated-photon data, will be examined in more detail in the
following.
2
Our cuts used in Fig. 6, in particular hg - 0.35, actually correspond to those of the PHENIX experiment.
Note, however, that all our results have been integrated over the full 2p of azimuthal angle, whereas the
PHENIX electromagnetic calorimeter only covers half the azimuth. This implies that for a correct comparison
our minimally observable asymmetry has to be multiplied by a factor of '2 , in addition to introducing the
appropriate values for L , P, e . Our results for Ž Ap Tg . min are then found to be consistent with those reported
in w22,23x.
80 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
Fig. 8. pTg and hg spectra of isolated photons in polarized and unpolarized collisions. The results obtained by
retaining only the contribution of the qg-initiated partonic subprocess are also shown Žsymbols..
that is obtained by keeping only the qg-initiated subprocess. One can clearly see that in
the polarized case, for the two GRSV density sets, the qg subprocess alone produces a
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 83
result that is almost identical to the full answer. As we have checked, this comes about
to some extent because the other subprocesses all give small contributions, but also
because they tend to cancel one another to a good approximation. This explains why for
set GS-C a different pattern is found: slightly different quark densities and a vastly
different gluon distribution make the cancellation of the non-qg channels imperfect, and
the gluon density is not large enough to render the signal from qg scattering dominant,
except for large <hg <. However, in view of Fig. 6, this finding does not really create a
problem: if the gluon is indeed as small as embodied in the GS-C set, the measurement
at RHIC will anyway only give an asymmetry compatible with zero, and we will not be
in a position to actually unfold D g from the data. If, on the other hand, D g is sizeable,
it is an encouraging result that the polarized cross section provides a very direct measure
of it. Note that in the unpolarized case the qg channel is generally responsible for only
O Ž80%. of the cross section.
We finally mention that we have also computed the asymmetries by isolating the
photon according to definition A. In the case of the GRSV sets, the results are almost
identical to those shown here. In the case of GS-C, some difference can be seen in the
central pseudorapidity region, where the asymmetry tends to be smaller in the case of
definition A. Part of this effect results from the different scale choices adopted for the
two definitions, as discussed at the beginning of this section.
4. Isolated-photon-plus-jet observables
In the production process, the transverse momentum of the prompt photon is balanced
by that of the high-pT outgoing hadrons. It may be decided to neglect the properties of
these hadrons, and to study the inclusive production of the photon, as we did in Section
3. On the other hand, the study of the correlations between the photon and the associated
hadrons gives a more thorough information on the underlying dynamics. Also, from the
experimental point of view, photon-plus-hadron events can be used as a means of
calibrating the hadronic calorimeter. In this section, we will consider photon-plus-jet
observables. Our predictions are relevant to the STAR experiment w24x at RHIC, where
one of the main goals is indeed to determine D g from prompt-photon-plus-jet events.
As is customary in any fixed-order computation in perturbative QCD, our predictions are
given at the parton level Ži.e. our jet-finding algorithm deals with partons and not with
hadrons.. We will adopt here a k T-algorithm, namely that proposed in Ref. w73x, with
D s 1. We will only discuss the case when the photon is isolated following the
prescription B; unless otherwise specified, we will adopt the isolation parameters given
in Eq. Ž8.: R 0 s 0.4, eg s 1, n s 1. Since our computation is based on five-leg
amplitudes, we are able to predict the photon-plus-one-jet observables at NLO, and the
photon-plus-two-jet observables at LO.
The number of jets that accompany the photon does depend not only upon the
dynamics, but also upon the jet-finding algorithm and the kinematical cuts imposed on
84 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
Table 1
Total rates Žin pb; the entries relevant to unpolarized scattering have been multiplied by 10y3 . for
isolated-photon-plus-jet events, at 'S s 500 GeV
pT j 010 GeV pT j 012 GeV pT j 014 GeV
0yjet 1yjet 2yjet 0yjet 1yjet 2yjet 0yjet 1yjet 2yjet
GRSV STD 115.1 168.0 15.87 136.7 151.4 10.92 178.0 113.3 7.692
GRSV MAXg 206.7 294.7 29.28 244.6 265.8 20.30 317.2 199.1 14.41
GS-C 27.73 41.57 1.121 37.30 32.52 0.604 49.61 20.50 0.312
MRST=10y3 11.61 13.27 0.730 14.37 10.77 0.471 18.45 6.854 0.310
the jets. This is documented in Table 1, where the total rates are presented for events
satisfying
p Tg 0 10 GeV, y1 ( hg ( 2, p T j 0 10,12,14 GeV, y1 ( hj ( 2,
Ž 11 .
in the case of polarized and unpolarized pp collisions at 'S s 500 GeV Ž p T j and hj are
the jet transverse momentum and pseudorapidity, respectively.. The cuts on hg ,hj
considered here are relevant to the STAR detector. As one might expect, there is only a
small fraction of events where the photon is accompanied by two jets Žin the case of
two-jet events, the transverse-momentum cut is applied to both jets; it is clear that the
inclusion of radiative corrections for photon-plus-two-jet observables will not change
what was said before.. On the other hand 3 , a sizeable number of events falls in the class
denoted by ‘0-jets’, which is constituted by those events where the jetŽs. do not pass the
imposed transverse-momentum or pseudorapidity cuts; at the lowest transverse-momen-
tum cut, this is mainly due to the fact that the pseudorapidity cut is not symmetric
around h s 0. Since the number of jets is directly related to the hardness of the event,
large differences can be seen in the ratio of two-jet over one-jet rates, when different
™
parton densities are considered. In particular, this ratio is the larger the slower the gluon
density approaches zero for x 1; while GRSV STD and GRSV MAXg return almost
the same ratio Žthe shape of their gluon being basically the same., the result for GS-C is
much smaller, since the gluon density in this case has a dip at intermediate x values. It
follows that a first rough piece of information on the behaviour of the gluon density at
large x can be obtained by simply looking at the total photon-plus-jet rates. We also
notice that the result for the ratio of one-jet over two-jet rates in the case of unpolarized
scattering Žfourth row in Table 1. lies in between that of the GRSV sets and that of the
GS-C set, consistently with the fact that the shape of the MRST gluon density is softer
than that of GRSV STD and harder than that of GS-C. This also implies that the ratio of
rates corresponding to different numbers of jets is not very sensitive to the polarization
of the beams.
We must stress that, in the case of p T j 0 10 GeV and p Tg 0 10 GeV, the zero-jet and
one-jet rates are rather pathological in perturbative QCD Žon the other hand, their sum is
well-behaved.. Indeed, when equal transverse-momentum cuts are imposed on the
3
By definition, for a given row in Table 1, the sum ‘0-jet’q‘1-jet’q‘2-jet’ is the same for each of the
three pT j cuts, and corresponds to the inclusive isolated-photon rate.
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 85
photon and the hardest jet, large logarithms appear in the cross section, which in
principle should be resummed. Roughly speaking, at any fixed order in perturbation
theory, for ‘symmetric cuts’ the radiation of real gluons cannot compensate the large and
negative contribution of the virtual diagrams. The mechanism is identical to the one that
can be observed in two-jet correlations, in the case when the two jets have the same
minimum transverse momentum cut. This matter was discussed at length in Ref. w74x, to
which we refer the reader for more details. For illustration, we consider here the total
rate Žno h cuts have been applied; these would just change the absolute normalization
which is of no interest in what follows.:
sg j Ž D . s s Ž p Tg 0 10 GeV, p T j 0 10 GeV q D . Ž 12 .
as a function of D, for both polarized and unpolarized collisions, at different centre-of-
mass energies. By definition, the jet is the hardest of the jets of the event. The results are
displayed in Fig. 9. The plots in this figure are completely analogous to the ones in Fig.
4 of Ref. w74x. The main point is that a negative slope is here visible at D s 0, implying
that the cross section decreases here with the decreasing cut on p Tg , clearly signalling
a failure of the perturbative expansion. We remark, however, that, at variance with the
case of jet–jet correlations, in the case of isolated-photon-plus-jet production, a value of
D s 1 GeV already appears to be perfectly safe. We also remind the reader that, even in
the case of equal transverse-momentum cuts, the perturbative expansion is reliable
everywhere except in some corners of the phase space Žexamples will be given below..
Inspecting Fig. 9, we finally note that, when going from 'S s 200 GeV to 'S s 500
GeV, the cross section increases much more in the unpolarized than in the polarized
Fig. 9. Total photon-plus-jet rates, in polarized and unpolarized collisions, at different centre-of-mass energies.
86 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
case. This implies that, as in the case of inclusive observables discussed in the previous
section, at fixed final-state kinematical variables the asymmetries for photon-plus-jet
observables are larger at the smaller centre-of-mass energies.
We now turn to the issue of the perturbative stability of our results for correlations
between photon and jets. Since we are interested in NLO predictions, we will only
consider photon-plus-one-jet quantities; in the case when two jets are present in the
event, only the hardest jet is retained. As in subsection 3.1, we will assume that we
obtain a Žrelatively sound. indication of the stability of the cross sections if the
variations induced by changing the renormalization and factorization scales with respect
to their default values are small. In Fig. 10 we present the result for the invariant mass
distribution of the photon-jet system, in the case of polarized collisions at 'S s 500
GeV. The pseudorapidities of both the photon and the jet are required to be in the range
y1 ( h ( 2, and we impose p Tg 0 10 GeV and p T j 0 10 GeV. The results of both the
NLO computation Župper curves. and Born computation Žlower curves, which have been
rescaled in order to make them clearly distinguishable from the NLO ones. are
displayed. Similarly to the case of the inclusive transverse momentum distribution of
isolated photons Žcf. Fig. 1., in most of the range in Mg j the change of cross section
induced by a variation of the renormalization scale is of the order of 10% at NLO, and
larger at the Born level. However, a quite dramatic effect is seen at threshold, when
radiative corrections are included: the cross section in the first bin becomes negative,
and the scale dependence displays a pathological behaviour in this range, the cross
Fig. 10. Renormalization-scale dependence of the invariant mass spectrum of the photon–jet system, at the
Born and NLO levels. The result for unequal lower transverse-momentum cuts is also shown Žinset..
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 87
section becoming smaller for decreasing renormalization scales. This effect is exactly a
consequence of the fact that the minimum transverse-momentum cuts on the photon and
the jet are equal. Indeed, in the inset of Fig. 10 we show Žin the threshold region. the
invariant mass of the photon–jet system in the case when p T j 0 12 GeV. It is obvious
that here the scale dependence is as expected, and the cross section in the first bin
Žalthough not visible in the figure. remains positive. However, the first two bins show a
scale sensitivity comparable to the one of the Born result. In fact, close to the threshold
the NLO result is effectively a LO one, since the threshold at the Born level is in this
case at Mg j s 24 GeV. In the invariant mass range not close to threshold, the scale
dependence in the case of unequal transverse-momentum cuts is practically identical to
the one displayed in the main body of Fig. 10. We also studied the factorization scale
dependence of the invariant mass distribution. In the region not close to the threshold,
there is a clear improvement when going from LO to NLO; again, the results for equal
and unequal transverse-momentum cuts are very similar. At threshold, the same consid-
erations as given above apply. As in the case of the single-inclusive photon transverse-
momentum spectrum displayed in Fig. 1, the cross section for larger Žsmaller. factoriza-
tion scales is larger Žsmaller. than the default one at small invariant masses, while it is
smaller Žlarger. than the default for large invariant masses. This behaviour is almost
entirely due to scale dependence of the parton densities, as already discussed in the
single-inclusive case.
We performed a thorough study of the renormalization and factorization scale
dependence of many photon-jet observables. In particular, we considered the photon
Žjet. transverse momentum and pseudorapidity distributions, when cuts on the recoiling
jet Žphoton. are imposed, as suggested in Ref. w43x. Among the photon-jet correlations,
we considered the transverse momentum of the pair p TŽ g j., the azimuthal distance in the
transverse plane Dfg j , the distance in the h – f plane D Rg j , and the variables
p Tg ehg q p T j eh j p Tg eyhg q p T j eyh j
x1 s , x2 s , Ž 13 .
'S 'S
which, at the Born level, coincide with the Bjørken-x values of the incoming partons. In
all these cases, a reduction in the relative size of the scale dependence is seen at the
process.: a change of the scales within the limits as above induces a variation of the
™
results of about 10% or less. In the regions of the phase space where the partonic
contributions start at the 2 3 level Žfor example, p TŽ g j. ) 0, Dfg j - p , D Rg j - p ., our
NLO results have a scale dependence larger than elsewhere, of the order of 15% to 20%,
since they are effectively LO. Finally, as in the case of the invariant-mass distribution,
there are corners of the phase space where the perturbative results are not reliable in the
case of equal transverse-momentum cuts. Among those, the case of D Rg j s p is
particularly interesting, since here the Born threshold falls inside the range available at
NLO. This case has been described, on general grounds, in Ref. w75x.
asymmetries on the choice of polarized parton densities, at the Born and NLO levels.
Here, we restrict ourselves to 'S s 200 GeV. We verified that the pattern when going to
'S s 500 GeV is similar to the one already described in the preceding section; namely,
at fixed final-state kinematics we get smaller asymmetries, with however also smaller
minimally observable asymmetries. In Fig. 11 we show the asymmetries as functions of
the invariant mass Žleft. and x 1 Žright.. The photon-plus-jet events have been selected by
imposing equal transverse-momentum cuts on the photon and on the hardest jet
Ž p T ) 10 GeV; pseudorapidities are restricted to the range y1 - h - 2.. As discussed
previously, this choice only affects the threshold region of the invariant mass, where our
predictions should not be considered as reliable. As in the case of single-inclusive
quantities, the results for GRSV STD and GRSV MAXg are pretty similar in shape,
although sizeably different in normalization. On the other hand, GS-C has a clearly
distinguishable signature, showing a dip at intermediate values of the invariant mass and
in the region around x 1 s 0.1. It is very easy to trace the origin of this behaviour back to
the shape of the GS-C gluon. The NLO results are smaller than those at LO, as they
already were in the case of inclusive observables. The difference between Born and
NLO results is not big at small invariant masses and in the whole x 1 range, while it
grows larger in the tail of the invariant-mass distribution, since the K-factor of the
unpolarized cross section is larger in this region than that of the polarized cross section.
Figure 11 also presents the minimally observable asymmetry Ždot-dashed histograms.,
Fig. 11. Asymmetries at 'S s 200 GeV, as functions of invariant mass Žleft. and x 1 Žright.. The NLO
Žhistograms. and Born Žsymbols. results are both shown. The minimum observable asymmetry is displayed by
the dot-dashed histogram.
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 89
calculated according to Eq. Ž10.. A bin size of 2 GeV has been chosen for Mg j , and of
0.05 for log x 1; as before, L s 100 pby1 and e s P s 1. It is apparent that, if the
polarized densities are as predicted by the GS-C set, the measurement of the asymmetry
at RHIC will produce a result compatible with zero, even if quite large integrated
luminosities are attained. This result is consistent with what we already found in Section
3.2; however, in the case of single-inclusive variables the situation appeared to be
slightly better, when the enlarged pseudorapidity range y1 - h - 2 was considered.
In this context, we would like to comment on the findings of Ref. w43x, where it was
observed that, by looking at photon-plus-jet events, instead of considering only the
inclusive variables of the photon, one gets larger asymmetries. Also, photon-plus-jet
observables enhance the sensitivity to the shape of the parton densities and can be used
for a more straightforward deconvolution of D g from data w24x. Although we agree with
these observations, we doubt that photon-plus-jet correlations will give us a better
chance of measuring the gluon density than inclusive observables. In fact, there is in
practice the problem that the Žtheoretical. minimally observable asymmetry is larger in
the case of photon-plus-jet quantities than in the case of inclusive-photon quantities. The
situation is summarized in Fig. 12, for the case of the hg spectrum with p Tg 0 10 GeV.
In the case of the photon-plus-jet observable, the following cuts have been imposed on
the jet: p T j 0 10 GeV, hj ( 0.5. From the figure, it is apparent that, in spite of the fact
that the asymmetry is increased when cutting on the jet variables, the measurement
would be more difficult, since the minimally observable asymmetry is enhanced by a
Fig. 12. Asymmetries at 'S s 200 GeV, as functions of photon pseudorapidity, with Ždotted. and without
Žsolid. a transverse-momentum cut on the recoiling jet. The corresponding minimally observable asymmetries
Ždashed and dot-dashed, respectively. have been rescaled in order to coincide with the respective asymmetries
at hg s 0.
90 S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92
larger factor with respect to the asymmetry. Of course, this is a purely theoretical
estimate, which assumes that the experimental efficiency is the same in the case of
inclusive-photon or photon-plus-jet events Žhere taken to be equal to 1.. This is probably
unrealistic, but it seems unlikely that the photon-plus-jet efficiency will be higher than
that for photon-tagging only. Therefore, without any detailed study at the detector level,
it seems improbable that the photon-plus-jet asymmetries will be the preferred tool for
pinning down the polarized gluon density. We finally also have to add that in Ref. w43x
the photon transverse momentum was constrained in a bin around p T j s 10 GeV of
width 1 GeV, instead of having p Tg ) 10 GeV as in Fig. 12. In view of the discussion
relative to Fig. 9, the kinematical constraints imposed in Ref. w43x appear to be more
problematic from the perturbative point of view than those adopted in this paper for
producing Fig. 12.
5. Conclusions
Acknowledgements
We are happy to thank G. Ridolfi for his collaboration at an early stage of this work.
We are also grateful to N. Saito for useful information. The work of S.F. is supported in
S. Frixione, W. Vogelsangr Nuclear Physics B 568 (2000) 60–92 91
References
Abstract
1. Introduction
Almost exactly thirty years ago the study of the spectrum of string theory Žknown at
the time as the dual resonance model. revealed w1,2x a huge degeneracy of states
growing as an exponential of the mass. This led to the concept of a limiting ŽHagedorn.
temperature THag in string theory. Only slightly more recently Bekenstein w3x proposed
that the entropy of a black hole should be proportional to the area of its horizon in
Planck units, and Hawking w4x fixed the constant of proportionality after discovering that
black holes, after all, do emit thermal radiation at a temperature THaw ; Ry1BH .
When string and black hole entropies are compared one immediately notices a
striking difference: string entropy 1 is proportional to the first power of mass in any
1
The self-interaction of a string lifts the huge degeneracy of free string states. One then defines the entropy
of a narrow band of string states, defined with some energy resolution M s Q D E < M, as the logarithm of the
number of states within the band D E.
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 5 9 6 - 9
94 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
2 X
Below, we shall use the precise definition l s ''2 a " , but, in this section, we neglect factors of order
unity.
3
One uses here the fact that, during an adiabatic variation of g, the entropy of the black hole
dy 1
SBH ; ŽArea.r GN ; R BH r GN stays constant. This result Žknown to hold in the Einstein conformal frame.
applies also in string units because S BH is dimensionless.
4
The variation of g can be seen, depending on one’s taste, either as a real, adiabatic change of g due to a
varying dilaton background, or as a mathematical way of following energy states.
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 95
each Žquantum. string state should turn into a Žquantum. black hole state at sufficiently
strong coupling, while, conversely, if g is decreased, each black hole state should
‘‘decollapse’’ and transform into a string state at sufficiently weak coupling. For all the
reasons mentioned above, it is very natural to expect that, when starting from a black
hole state, the critical value of g at which a black hole should turn into a string is given,
in clear relation to Ž1.3., by
g c2 M ; M s , Ž 1.4 .
and is related to the common value of string and black-hole entropy via
1 1
g c2 ; s . Ž 1.5 .
SBH Ss
Note that g c2 < 1 for the very massive states Ž M 4 M s . that we consider. This justifies
our use of the perturbative relation between GN and a X .
In the case of extremal BPS, and nearly extremal, black holes the conjectured
correspondence was dramatically confirmed through the work of Strominger and Vafa
w12x and others w13–15x leading to a statistical mechanics interpretation of black-hole
entropy in terms of the number of microscopic states sharing the same macroscopic
quantum numbers. However, little is known about whether and how the correspondence
works for non-extremal, non-BPS black holes, such as the simplest Schwarzschild black
hole 5. By contrast to BPS states whose mass is protected by supersymmetry, we shall
consider here the effect of varying g on the mass and size of non-BPS string states.
Although it is remarkable that black-hole and string entropy coincide when R BH s l s ,
this is still not quite sufficient to claim that, when starting from a string state, a string
becomes a black hole at g s g c . In fact, the process in which one starts from a string
state in flat space and increases g poses a serious puzzle w7x. Indeed, the radius of a
typical excited string state of mass M is generally thought of being of order
1r2
R rw
s ; l s Ž MrM s . , Ž 1.6 .
X
as if a highly excited string state were a random walk made of MrMs s a Mrl s
segments of length l s w16–19x. ŽThe number of steps in this random walk is, as is
natural, the string entropy Ž1.1... The ‘‘random walk’’ radius Ž1.6. is much larger than
the Schwarzschild radius for all couplings g ( g c , or, equivalently, the ratio of
self-gravitational binding energy to mass Žin d spatial dimensions.
dy2 1
GN M R BH Ž M . M Ž4yd .
2
Ž R rw
s .
dy2
;
ž R rw
s
/ ;g 2
ž /
Ms
Ž 1.7 .
remains much smaller than one Žwhen d ) 2, to which we restrict ourselves. up to the
transition point. In view of Ž1.7. it does not seem natural to expect that a string state will
‘‘collapse’’ to a black hole when g reaches the value Ž1.4.. One would expect a string
state of mass M to turn into a black hole only when its typical size is of order of
5
For simplicity, we shall consider in this work only Schwarzschild black holes, in any number d ' Dy1
of non-compact spatial dimensions.
96 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
6
With the proviso that the consistency of our analysis is open to doubt when d08.
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 97
that the size of a typical self-gravitating string remains fixed at the random walk value
Ž1.6. when g ( g c . However, when g gets close to a value of order g c , the ensemble of
self-gravitating strings becomes Žsmoothly in d s 4, but suddenly in d 0 5. dominated
by very compact strings of size ; l s Žwhich are then expected to collapse with a slight
further increase of g because the dominant size is only slightly larger than the
Schwarzschild radius at g c ..
Our results confirm and clarify the main idea of a correspondence between string
states and black hole states w7,9–11,20x, and suggest that the transition between these
states is rather smooth, with no apparent hysteresis, and with continuity in entropy, mass
and typical size. It is, however, beyond the technical grasp of our analysis to compute
any precise number at the transition Žsuch as the famous factor 1r4 in the Bekenstein–
Hawking entropy formula..
The aim of this section is to estimate the distribution function in size of the ensemble
of free string states of mass M, i.e. to count how many massive string states have a
given size R. This estimate will be done while neglecting the gravitational self-interac-
tion. The effect of the latter will be taken into account in a later section.
Let us first estimate the distribution in size by a rough, heuristic argument based on
the random walk model w16–19x of a generic excited string state. In string units
Ž l s ; My1s ; 1 , the geometrical shape in d-dimensional space of a generic massive
. Ž .
string state can be roughly identified with a random walk of M steps of unit length. We
can constrain this random walk to stay of size Q R by considering a diffusion process,
starting from a point source at the origin, in presence of an absorbing sphere SR of
radius R, centered on the origin. In the continuous approximation, the kernel K t Ž x,0.
giving the conditional probability density of ending, at time t, at position x, after having
started Žat time 0. at the origin, without having ever gone farther from the origin than the
distance R, satisfies: Ži. the diffusion equation E t K t s D K t , Žii. the initial condition
K 0 Ž x,0. s d Ž x ., and Žiii. the ‘‘absorbing’’ boundary condition K t s 0 on the sphere SR .
The kernel K t can be decomposed in eigenmodes,
K t Ž x ,0 . s Ý cn Ž x . cn Ž 0 . eyE n t , Ž 2.1 .
n
E0 s c1rR 2 , where c1 s O Ž1. is a numerical constant. This scaling is evident for the
Dirichlet problem Ž2.2., whatever be the shape of the boundary. Finally, remembering
that the number of time steps is given by the mass, t s M, we expect the looked for
conditional probability, i.e. the fraction of all string states at mass level M which are of
size Q R, to be asymptotically of order
2
f Ž R . ; eyc 1 M r R . Ž 2.3 .
This estimate is expected to be valid when MrR 2 4 1, i.e. for string states which are
much smaller Žin size. than a typical random walk R 2rw ; M Žbut still larger than the
string length, R R 1.. In the opposite limit, R 2 4 M, the kernel K t Ž x,0. can be
approximated by the free-space value K tŽ0. Ž x,0. s Ž4 p t .yd r2 expŽyx 2rŽ4 t .., with
t s M, so that the fraction of string states of size R R 4 R rw will be of order
2
; eyc 2 R r M , with c 2 s O Ž1..
As the result Ž2.3. will be central to the considerations of this paper, we shall now go
beyond the previous heuristic, random walk argument and derive the fraction of small
string states by a direct counting of quantum string states. For simplicity, we shall deal
with open bosonic strings Ž l s ' '2 a X , 0 ( s ( p .
X m Ž t , s . s Xcm
m
Ž t , s . q X˜ m Ž t , s . , Ž 2.4 .
m
Xcm Žt ,s . s x m q 2 aX p m t , Ž 2.5 .
a nm
X̃ m Ž t , s . s i l s Ý eyi nt cos n s . Ž 2.6 .
n/0 n
m Ž
Here, we have explicitly separated the center of mass motion X cm with w x m, p n x s i h mn .
from the oscillatory one X˜ Žw a m , a n x s m dmqn h .. The free spectrum is given by
m m n 0 mn
Here Nn ' a†n P a n is the occupation number of the nth oscillator Ž a nm s 'n a nm ,w a nm ,anm† x
s h mn dn m , with n,m positive..
The decomposition Ž2.4. – Ž2.6. holds in any conformal gauge ŽŽ Et X m " Es X m . 2 s 0..
One can further specify the choice of world-sheet coordinates by imposing
nm X m Ž t , s . s 2 a X Ž nm p m . t , Ž 2.8 .
where n m is an arbitrary timelike or null vector Ž n P n ( 0. w24x. Eq. Ž2.8. means that the
n-projected oscillators nm a mm are set equal to zero. The usual ‘‘light-cone’’ gauge is
obtained by choosing a fixed, null vector nm . The light-cone gauge introduces a
preferred Ž‘‘longitudinal’’. direction in space, which is an inconvenience for defining the
Žrms. size of a massive string state. As we shall be interested in quasi-classical, very
massive string states Ž N 4 1. it should be possible to work in the ‘‘center of mass’’
gauge, where the vector n m used in Eq. Ž2.8. to define the t-slices of the world-sheet is
taken to be the total momentum p m of the string. This gauge is the most intrinsic way to
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 99
describe a string in the classical limit. Using this intrinsic gauge, one can covariantly 7
define the proper rms size of a massive string state as
1 2
R 2 ' ² Ž X˜H m
Ž t , s . . :s ,t , Ž 2.9 .
d
where X˜H m
' X˜ m y p m Ž p P X˜ .rŽ p P p . denotes the projection of X˜ m ' X m y X cm
m Ž .
t
m
orthogonally to p , and where the angular brackets denote the Žsimple. average with
respect to s and t . The factor 1rd in Eq. Ž2.9. is introduced to simplify later formulas.
So defined R is the rms value of the projected size of the string along an arbitrary, but
fixed spatial direction. ŽWe shall find that this projected size is always larger than
' 3 a Xr2 ; i.e. string states cannot be ‘‘squeezed’’, along any axis, more than this..
In the center of mass gauge, pm X˜ m vanishes by definition, and Eq. Ž2.9. yields
simply
1
R 2 s l s2 R , Ž 2.10 .
d
with
` ayn P a n q a n P ayn
R' Ý . Ž 2.11 .
ns1 2 n2
The squared-size operator R, Eq. Ž2.11., contains the logarithmically infinite contribu-
tion Ý 1rŽ2 n.. Without arguing with the suggestion that this contribution may have a
physical meaning Žsee, e.g., Ref. w25x., we note here that this contribution is state-inde-
pendent. We are interested in this work in the relative sizes of various highly excited,
quasi-classical states. A concept which should reduce to the well-defined, finite rms size
of a classical Nambu string in the classical limit. We shall therefore discard this
state-independent contribution, i.e. work with the normal-ordered operator
` a†n P a n ` Nn
: R :s Ý s Ý . Ž 2.12 .
ns1 n ns1 n
™
We shall assume that we can work both in the center-of-mass Žworld-sheet. gauge
Ž pm a mm 0. and in the center-of-mass ŽLorentz. frame ŽŽ p m . s Ž M,0... This means
that the scalar product in the level occupation number Nn runs over the d spatial
dimensions: Nn s a†n P a n s Ý dis1 Ž a ni . † a ni . The ‘‘wrong sign’’ time oscillators a n0 are set
equal to zero. The Virasoro constraints then imply, besides the mass formula a X M 2 s N
y 1, the usual sequence of constraints on physical states, L n < f : s 0, with L n
s 12 Ý m Ý dis1 a nym
i
a mi . These constraints mean that the d oscillators a ni at level n are
not physically independent.
The problem we would like to solve is to count the number of physical states, in the
Fock space of the center-of-mass oscillators a ni , having some fixed values of N and R
Žwe henceforth work only with the normal-ordered operator Ž2.12. without adorning it
with the : : notation.. The Virasoro constraints make this problem technically quite
7
In an arbitrary conformal gauge, the definition Ž2.9. is gauge-dependent Žin spite of the use of the
mŽ .
orthogonal projection. because both the definition of Xcm t , and that of the Ž s ,t .-averaging depend on the
choice of world-sheet gauge. Even if we were using the Žmore intrinsic but more complicated. average with
weight ydet ga b d s dt s Ž Es X m . 2 d s dt , the dependence upon X cm
' m Ž .
t would remain.
100 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
difficult. However, we know from the exact counting of physical states Žwithout size
restriction. in the light-cone gauge that the essential physical effect of the Virasoro
constraints is simply to reduce the number of independent oscillators at any level n from
d s D y 1 Žin the center-of-mass gauge. to d y 1 s D y 2. If we Žformally. consider d
as a large parameter 8 , this change in the number of effective free oscillators should have
only a small fractional effect on any other coarse-grained, counting problem. We shall
assume that this is the case, and solve the much simpler counting problem where the d
oscillators a ni are considered as independent 9. To solve this problem we pass from a
microcanonical problem Žfixed values of N and R . to a grand canonical one Žfixed
values of some thermodynamical conjugates of N and R .. Let us introduce the formal
‘‘partition function’’
Z d Ž b ,g . ' Ý exp Ž yb N Nni y g R Nni ., Ž 2.13 .
Nni 4
where the sum runs over all sequences Žlabeled by n 0 1 and i s 1, . . . ,d . of indepen-
dent occupation numbers Nni s Ž a in . † a ni s 0,1,2, . . . , and where N w Nni x and R w Nni x are
defined by Eqs. Ž2.7., and Ž2.12., with Nn ' Ý dis1 Nni. Note that Ž2.13. is not the usual
thermodynamical partition function, and that b is not the usual inverse temperature.
Indeed, b is a formal conjugate to N , a X M 2 and not to the energy M. In particular,
because the degeneracy grows exponentially with M Žand not M 2 . its Laplace trans-
form Ž2.13. is defined for arbitrary values of b . We associate with the definition Ž2.13.
that of a formal grand canonical ensemble of configurations, with the probability
p Nni 4 s Zy1 i
d Ž b ,g . exp Ž yb N Nn y g R Nn
i
. Ž 2.14 .
of realization of the particular sequence Nni of occupation numbers. The mean values of
N w Nni x and R w Nni x in this ensemble are
E cd Ž b ,g . E cd Ž b ,g .
Nsy , Rsy , Ž 2.15 .
Eb Eg
where we denote
cd Ž b ,g . ' ln Z d Ž b ,g . . Ž 2.16 .
The second derivatives of the thermodynamical potential cd Ž b ,g . give the fluctuations
of N and R in this grand canonical ensemble,
2
E 2 cd Ž b ,g . 2
E 2 cd Ž b ,g .
Ž DN . s , Ž D R. s . Ž 2.17 .
Eb2 Eg2
Let us define as usual the entropy SŽ b ,g . as the logarithm of the number of string
configurations having values of Nand R equal to N and R, Eqs. Ž2.15., within the
8
It would be interesting to see if one can technically implement a large d approach to our counting
problem.
9
We tried to work in the light-cone gauge, with dy1 independent oscillators. However, the necessary
inclusion of the longitudinal term My2 Ž pP X˜ . 2 in Ž2.9., which is quadratic in the longitudinal oscillators
ayn s p
Ž q .y1 Ltransverse
n , leads to a complicated, interacting theory of the dy1 transverse oscillators.
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 101
precision of the rms fluctuations Ž2.17. w26x. This definition means that, in the
saddle-point approximation, Z d Ž b ,g . , exp w S y b N y g R x, i.e.
cd Ž b ,g . , S Ž b ,g . y b N y g R , Ž 2.18 .
or
E cd Ž b ,g . E cd Ž b ,g .
S , cd Ž b ,g . y b yg . Ž 2.19 .
Eb Eg
In other words, the entropy SŽ N, R . is the Legendre transform of cd Ž b ,g ..
Because of the Žassumed. independence of the d oscillators in Ž2.13., one has
`
yd
Z d Ž b ,g . s Ł w1 y eyŽ b nq g r n. x , Ž 2.20 .
ns1
i.e.
cd Ž b ,g . s d c 1 Ž b ,g . , Ž 2.21 .
with
` g
c 1 Ž b ,g . s y Ý
ns1
ž
ln 1 y exp yb n y
n / . Ž 2.22 .
We shall check a posteriori that we are interested in values of b and g such that
with
p2
Cs d, Dsp d . Ž 2.27 .
6
ŽThe notation D in Ž2.27. should not be confused with the space-time dimension d q 1..
The thermodynamic potential Ž2.26. corresponds to the mean values
1
Cy
2
D bg ' D
N, , R, , Ž 2.28 .
b2 2d
and to the entropy
2 CyD d D2
S, '
,2 C N 1y , Ž 2.29 .
b 8C R
i.e.
d 3d 1 d aX 1r2
l s2
S,2 p ( 6
N 1y
4 R
,2 p ž / 6
M 1 y 34
R2
. Ž 2.30 .
(
The lowest-order term 2 p d Nr6 is the usual ŽHardy–Ramanujan. result for d
independent oscillators, without size restriction. The factor in bracket, 1 y
Ž3r4.Ž l s2rR 2 ., with l s2 s 2 a X , gives the fractional reduction in the entropy brought by
imposing the size constraint R 2 , dy1 l s2 R. Under the conditions Ž2.23. the fluctua-
tions Ž2.17. are fractionally small. More precisely, Eqs. Ž2.17. yield
2 2
Ž DN . Ms Ž D R. b Ž R 2rl s2 .
;b; , ; ; . Ž 2.31 .
N2 M R2 d Ž MrMs .
As said above, though we worked under the Žphysically incorrect. assumption of d
independent oscillators at each level n, we expect the result Ž2.29. to be correct when
d 4 1. ŽWe recall that the exact result for S in absence of size restriction is
(
2 p Ž d y 1 . Nr6 .. Note the rough physical meanings of the auxiliary quantities b , g
and d : b ; Ž N .y1r2 ; Ž MrM s .y1, d ; Ry1 ; l s2rR 2 , g ; Ž N .1r2rR
R 2 ; M l s5rR 4 .
Summarizing, the main result of the present section is that the number Ž‘‘degeneracy’’.
of free string states of mass M and size R Žwithin the narrow bands defined by the
fluctuations Ž2.17.. is of the form
D Ž M , R . ; exp c Ž R . a 0 M , Ž 2.32 .
X
where a 0 s 2 p ŽŽ d y 1. a r6.1r2 and
c1 R2
c Ž R. s 1 y ž R2 /ž 1 y c2
M2 / , Ž 2.33 .
with the coefficients c1 and c 2 being of order unity in string units. ŽWe have added, for
completeness, in Eq. Ž2.32. the factor 1 y c 2 R 2rM 2 which operates when one consid-
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 103
In this section we shall estimate the mass shift of string states Žof mass M and size
R . due to the exchange of the various long-range fields which are universally coupled to
the string: graviton, dilaton and axion. As we are interested in very massive string states,
M 4 M s , in extended configurations, R 4 l s , we expect that massless exchange domi-
nates the Žstate-dependent contribution to the. mass shift.
The evaluation, in string theory, of Žone loop. mass shifts for massive states is
technically quite involved, and can only be tackled for the states which are near the
leading Regge trajectory w27–30x. ŽIndeed, the vertex operators creating these states are
the only ones to admit a manageable explicit oscillator representation.. As we consider
states which are very far from the leading Regge trajectory, there is no hope of
computing exactly Žat one loop. their mass shifts. We shall resort to a semi-classical
approximation, which seems appropriate because we consider highly excited configura-
tions. As a starting point to derive the mass-shift in this semi-classical approximation we
shall use the classical results of Ref. w31x which derived the effective action of
fundamental strings. The one-loop exchange of gmn , w and Bmn leads to the effective
action
I eff s I0 q I1 , Ž 3.1 .
where I0 is the free ŽNambu. string action
2
I1 s 2p HHd s 1 d 2s 2 GF Ž X 1 y X 2 . g 1 g 2 Ctot Ž X 1 , X 2 . ,
( ( Ž 3.3 .
(g (g
1 2 Ctot Ž X 1 , X 2 . s 32 GN T 2 Ž Eq X 1m Eq X 2 m . Ž Ey X 1n Ey X 2 n . . Ž 3.4 .
104 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
Here, T s Ž2 p a X .y1 is the string tension, GN is Newton’s constant 10 and E "s ErEs "s
1 m
2 Et " Es . Let us define P " s P " s
Ž . m Ž ".
by Ž l s s '2 a X as above.
2 E " X m s l s P"
m
, Ž 3.5 .
so that, for an open Žbosonic. string Žwith a 0m ' l s p m .,
q`
"
m
P" s Ý a nm eyi n s . Ž 3.6 .
y`
Using the definition Ž3.5. and inserting the Fourier decomposition of GF yields
4 GN dD k 1 2
I1 s H Ž 2p . D 2 HHd s 1 d 2s 2 Ž Pq Ž X 1 . P Pq Ž X 2 . .
p k yi ´
10
Normalized, in any dimension, by writing the Einstein action as Ž16 p GN .y1Hd D x g RŽ g .. '
11
For simplicity, we call ‘‘graviton’’ the exchanged particle, which is a superposition of the graviton, the
dilaton and the axion.
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 105
gives the decay rate, i.e. the total rate of emission of massless quanta.. As L0 y 1 is the
‘‘Hamiltonian’’ that governs the t-evolution of an open string, the vanishing of
Ž L0 y 1. < N : for any physical state ensures that ² N < W Ž k,1,2. < N : is t-translation
invariant, i.e. that it depends only on the difference t 12 ' t 1 y t 2 , and not on the
average t ' 12 Žt 1 q t 2 .. This means that the double world-sheet integration d 2s 1 d 2s 2
s dt 1 d s 1 dt 2 d s 2 s dt dt 12 d s 1 d s 2 on the right-hand side of Eq. Ž3.8. contains a
formally infinite infra-red ‘‘ volume’’ factor Hd t which precisely cancels the integral
Hd t on the left-hand side to leave a finite answer for d M 2 .
™ ™
It is also important to note the good ultraviolet behaviour of Eq. Ž3.8.. The ultraviolet
™
limit k ` corresponds to the coincidence limit Žt 2 , s 2 . Žt 1 , s 1 . on the world-sheet.
™
Let us define u ' sq q y y
1 y s 2 , Õ ' s1 y s 2 and consider the coincidence limit u 0,
Õ 0. In this limit the vertex insertion factors Ž D Pq . 2 Ž D Py . 2 tend to zero like u 2 Õ 2 ,
while the Green’s function blows up like wŽ D X . 2 xyŽ Dy2.r2 A Ž uÕ .yŽ Dy2.r2 . The result-
ing integral, Hdu dÕ Ž uÕ .yŽ Dy6.r2 , has its first ultraviolet pole when the space-time
dimension D s d q 1 s 8. This means probably that in dimensions D 0 8 the exchange
of massive modes Žof closed strings. becomes important. Our discussion, which is
limited to considering only the exchange of massless modes, is probably justified only
when D - 8.
Following the Žapproximate. approach of Section 2 we shall estimate the average
mass shift d M 2 Ž R . for string states of size R by using the grand canonical ensemble
with density matrix
y1
r ' Ž Z d Ž b ,g . . exp Ž yb N yg : R : . , Ž 3.10 .
where the operators N and : R:, defined by Eqs. Ž2.7. and Ž2.12., belong to the Fock
space built upon d sequences of string oscillators a ni Žformal ‘‘center-of-mass’’
oscillators.. For any quantity Q Žbuilt from string oscillators. we denote the grand
canonical average as ² Q :b ,g ' tr Ž Q r .. Using the t-shift invariance mentioned above,
Eq. Ž3.8. yields
2 GN dd k d v
d Mb2,g s y H Ž 2p .
p l s2 dq1
w k 2 yv 2 yi´ x
l s2 M v t 12
= dt 12 d s 1 d s 2 eyi
H ²W Ž k ,1,2 . :b ,g , Ž 3.11 .
where we have separated k m in its center-of-mass components k 0 s v , k i s k, and
where t 12 ' t 1 y t 2 as above.
We shall estimate the grand canonical average ²W :b ,g in a semi-classical approxima-
tion in which we neglect some of the contributions linked to the ordering of the operator
W, but take into account the quantum nature of the density matrix r , Eq. Ž3.10.. The
discreteness of the Fock states built from theŽ a in . †, and the Planckian nature of r will
play a crucial role in the calculation below. ŽBy contrast, a purely classical calculation
would be awkward and ill-defined because of the problem of defining a measure on
classical string configurations, and because of the Rayleigh–Jeans ultraviolet catastro-
phe.. To compute ²W :b ,g it is convenient to define it as a double contraction of the
coefficient of zm11 zm22 zm33 zm44 in the exponentiated version of W,
m1
Wz ' :exp zm11 D Pq m2
q zm22 D Pq m3
q zm33 D Py m4
q zm44 D Py q ik P D X˜ : Ž 3.12 .
106 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
We shall define our ordering of W by working with the normal ordered exponentiated
operator Ž3.12. Žand picking the term linear in zm11 zm22 zm33 zm44 .. The average ²Wz :b s
tr ŽWz r . Žwhere, to ease the notation, we drop the extra label g . can be computed by a
generalization of Bloch’s theorem. Namely, if A denotes any operator which is linear in
the oscillators a ni , we have the results
1 1
² e A :b s exp 2
² A2 :b ; ² :e A : :b s exp 2
² : A2 : :b , Ž 3.13 .
as well as their corollaries
1 1
² e A : 0 s exp 2
² A2 : 0 ; e A s :e A : exp 2
² A2 : 0 , Ž 3.14 .
™
where ²W : 0 denotes the vacuum average Žobtained in the zero temperature limit
by1 0.. The simplest way to prove these results is to use coherent-state methods w32x
Žsee also,Ref. w24x and Appendix 7.A of Ref. w33x.. For instance, to prove the second
equation Ž3.13. it is sufficient to consider a single oscillator and to check that Ždenoting
q s eye , with e s b n q grn Žlabel n suppressed., so that Z s Ž1 y q .y1 , and < b . '
exp Ž ba† . <0:.
† † †
² e c1 a e c 2 a :b s Zy1 tr Ž e c1 a e c 2 a q a a
.
2
d b )
b † †
s Zy1 H p
eyb Ž b< ec a 1 e c2 a q a a < b .
d2 b )
b )
s Ž1yq. H p
eyb Ž b< ec b 1 e c 2 a < qb .
d2 b ) )
s Ž1yq. H eyŽ1 yq . b b
e c1 b qc 2 q b
p
s e c1 c 2 q rŽ1yq . , Ž 3.15 .
and to recognize that qrŽ1 y q . s w e e y 1xy1 is the Planckian mean occupation number
² a† a:b .
If we apply the second equation Ž3.13. to an expression of the type
4
Wz s :exp
ž Ý
is1
z i A i q B :,
/
one gets a Wick-type expansion for the coefficient Žsay W1234 . of z 1 z 2 z 3 z4 :
1
w BBx
W1234 s e 2
Ž w A1 A 2 x w A 3 A 4 x q 2 terms q w A1 B x w A 2 B x w A 3 A 4 x
q5 terms q w A1 B x w A 2 B x w A 3 B x w A 4 B x . , Ž 3.16 .
where w AB x denotes the ‘‘thermal’’ contraction w AB x ' ² : AB: :b .
The looked-for grand canonical average of W Ž k,1,2., Eq. Ž3.9., is given by replacing
B s ik P D X˜ and A1 s A 2 s D Pq
m
, A 3 s A 4 s D Pyn
in Eq. Ž3.16.. This leads to
1
²W :b ,g s ey 2 ² :Ž kPD X˜ . 2 2
2
: :b ,g
½² :Ž D P q . : :b ,g ² : Ž D Py . : :b ,g q . . . 5 , Ž 3.17 .
where the ellipsis stand for other contractions Žwhich will be seen below to be
subleading..
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 107
The calculation of the various contractions w AB x ' ² : AB: :b in Eqs. Ž3.16., Ž3.17. is
easily performed by using the basic contractions among the oscillators a n ,a†m Ž n,m ) 0.
Žwhich are easily derived from the definition Ž3.10. of the density matrix.
† † d i j dn m
² :a ni Ž a mj . : :b ,g s ² : Ž a mj . a in : :b ,g s
, Ž 3.18 .
een y1
where e n s b n q grn. The other contractions w aax and w a†a† x vanish. In terms of the
a-oscillators, the basic contraction reads w a ni a mj x s d i j dnqm
0 < n <rŽexpŽ e < n < . y 1., where
now n and m can be negative but not zero . Using these basic contractions, and the
Ž .
oscillator expansion Ž2.6. of X˜ m Žand noting that, in the center-of-mass frame only the
spatial components of X˜ m survive. one gets
` x n Ž 1,2 .
2
² : Ž k P D X˜ . : :b ,g s 2 k 2 l s2 Ý e , Ž 3.19 .
ns1 n Ž e y 1.
n
with
x n Ž 1,2 . s cos 2 n s 1 q cos 2 n s 2 y 2 cos n s 1 cos n s 2 cos n t 12 . Ž 3.20 .
Similarly, the oscillator expansion Ž3.6. yields
` n pq
n Ž 1,2 .
² : D Pq . 2 : :b ,g s 4 d Ý , Ž 3.21 .
ns1 een y1
with
pq q q
n Ž 1,2 . s 1 y cos n Ž s 1 y s 2 . s 1 y cos n Ž t 12 q s 1 y s 2 . . Ž 3.22 .
.2
The result for Ž D Py is obtained by changing s
ys 1 q s 2 ..
q
™ y
s in Eq. Ž3.22. Ži.e. s 1 y s 2 ™
We can estimate the values of the right-hand sides of Eqs. Ž3.19. and Ž3.21. by using
the following ‘‘statistical’’ approximation. In the parameter range discussed in Section 2
the basic sums Ý n " 1 Ž e e n y 1.y1 , appearing in Eqs. Ž3.19., Ž3.21., see their values
dominated by a large interval, D n 4 1, around some n 0 4 1, of values of n, so that
one can, with a good approximation, replace the discrete sum over n by a formal
continuous integral over a real parameter. In such a continuous approximation one can
integrate by parts to show that any ‘‘oscillatory’’ integral of the type Hdn f Ž n. cos n s s
w ny1 f Ž n. sin n s x y Hdn ny1 f X Ž n. sin n s is, because of the factors ny1 , numerically
much smaller than the non-oscillatory one Hdn f Ž n.. ŽHere s denotes some combination
of s 1 and s 2 , like 2 s 1 , 2 s 2 , s 1 " s 2 .. Alternatively, we can say that, for generic
values of s 1 and s 2 , one can treat in Eqs. Ž3.20. or Ž3.22. cos n s 1 and cos n s 2 as
statistically independent random variables with zero average. Within such an approxima-
tion one can estimate Ž3.19. by replacing x nŽ1,2. by 1 Žbecause cos 2 n s 1 q cos 2 n s 2 s
1 q 12 Žcos 2 n s 1 q cos 2 n s 2 ... Similarly, one can estimate Ž3.21. by replacing pn" 1.
The resulting estimates of Eqs. Ž3.19. and Ž3.21. introduce exactly the grand canonical
™
averages of the quantities : R: and N:
2
1
2
² : Ž k P D X˜ . : :b ,g , k 2 ² R 2 :b ,g , Ž 3.23 .
2 2 X
² : Ž D Pq . : :b ,g , ² : Ž D Py . : :b ,g , 4 ² N :b ,g , 4 a M 2 . Ž 3.24 .
108 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
equal to 1r 'p in d s 3.
The result Ž3.26. was expected in order of magnitude, but we found useful to show
how it approximately comes out of a detailed calculation of the mass shift which
incorporates both relativistic and quantum effects. It shows clearly that perturbation
theory breaks down, even at arbitrarily small coupling, for sufficiently heavy and
compact strings. Let us also point out that one can give a simple statistical interpretation
of the calculation Ž3.16. of the normal-ordered vertex operator W Ž k,1,2., with the basic
contractions Ž3.18.. The result of the calculation would have been the same if we had
simply assumed that the oscillators a in were classical, complex random variables with a
Gaussian probability distribution A exp y 12 Ž e e n y 1 . < a in < 2 . This equivalence under-
lies the success of the classical random walk model of a generic excited string state. The
fact that the random walk must be made of MrM s independent steps is linked to the
fact that the Planckian distribution of mean occupation numbers, Nn s ŽexpŽ b n q grn.
y 1.y1 is sharply cut off when n R by1, i.e. from Eq. Ž2.28., when n R MrM s . More
precisely, using the same ‘‘statistical’’ approximation as above, one finds that the slope
correlator ² : Es X˜ i Žt , s 1 . Es X˜ j Žt , s 2 .: :b ,g decays quite fast when < s 2 y s 1 < R M srM.
Finally, let us mention that, by using the same tools as above, one can compute the
imaginary part of the mass shift d M s d Mreal y i Gr2, i.e. the total decay rate G in
massless quanta. The quantity G is, in fact, easier to define rigorously in string theory
because, using Ž k 2 y i ´ .y1 s PP Ž k 2 .y1 q ip d Ž k 2 . in Ž3.8., it is given by an integral
where the massless quanta are all on-shell. When g s 0 Ža consistent approximation for
12
The sign d M - 0 was classically clear Ževen when taking into account relativistic effects., say in ds 3,
from the starting formulas Ž3.3., Ž3.4. where Gsym Ž x . s Ž4p .y1 d Ž x 2 . ) 0 and 4Ž Eq X1PEq X 2 . Ž Ey X1PEy X 2 .
s Ž Eq D X . 2 Ž Ey D X . 2 ) 0 because E " D X m is purely spacelike in the center-of-mass frame. The same
conclusion would hold in the light-cone gauge.
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 109
a result dominated by n ; by1 . one can use the covariant formalism with D s d q 1
oscillators to find,after replacing a discrete sum over n by an integral over v ,
GN n 2
G s cXd 2 Hd v v dy2
ž bn / , Ž 3.28 .
M l s
e y1
where cXd is a numerical constant, and where n s M l s2 vr2. The spectral decomposi-
tion of the total power radiated by the b-ensemble of strings is then simply deduced
from Ž3.28. by adding a factor " v in the integrand,
GN n 2
P s cXd 2 Hd v v dy1
ž bn / . Ž 3.29 .
M l s
e y1
The results Ž3.28., Ž3.29. agree with corresponding results Žfor closed strings. in Ref.
w10x and in w34x Žnote, however, that the factor M 2 in Eq. Ž3.2. of Ref. w34x should be M
and that the constant contains GN and powers of l s .. The integrals Ž3.28., Ž3.29. are
dominated by n ; by1 , i.e. v ; M s . This gives for the integrated quantities:
G;g2 M, P ; g 2 M Ms . Ž 3.30 .
The second equation Ž3.30. means that the mass of a highly excited string decays
exponentially, with half-evaporation time
string
ls
tevap ' MrP ; . Ž 3.31 .
g2
Let us anticipate on Section 3 and note that, at the transition l ' g 2 MrMs ; 1 between
string states and black hole states, not only the mass and the entropy are Žin order of
magnitude at least. continuous, but also the various radiative quantities: total luminosity
P, half-evaporation time tevap , and peak of emission spectrum. Indeed, for a black hole
decaying under Hawking radiation the temperature is TBH ; Ry1 BH and
y2 y2 rŽ dy2. BH
P BH ; Ry2
BH ; l s l , tevap ; R BH S BH ; l s gy2 l d rŽ dy2. . Ž 3.32 .
In the present section we shall combine the main results of the previous sections, Eqs.
Ž2.32. and Ž3.26., and heuristically extend them at the limit of their domain of validity.
We consider a narrow band of string states that we follow when increasing adiabatically
™
the string coupling g, starting from g s 0 13 . Let M0 , R 0 denote the ‘‘bare’’ values Ži.e.
for g 0. of the mass and size of this band of states. Under the adiabatic variation of
g, the mass and size, M, R, of this band of states will vary. However, the entropySŽ M, R .
remains constant under this adiabatic process: SŽ M, R . s SŽ M0 , R 0 .. We assume, as
usual, that the variation of g is sufficiently slow to be reversible, but sufficiently fast to
13
Alternatively, we can consider the coupling as an adjustable parameter Žit is so in perturbation theory. and
just follow how different physical quantities change as g is varied, without pretending that the change takes
place in physical time.
110 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
be able to neglect the decay of the states. We consider states with sizes l s < R 0 < M0
for which the correction factor,
2 2
c Ž R 0 . , Ž 1 y c1 Ry2
0 . Ž 1 y c 2 R 0rM 0 . , Ž 4.1 .
in the entropy
S Ž M0 , R 0 . s c Ž R 0 . a 0 M0 , Ž 4.2 .
™
is near unity. ŽWe use Eq. Ž2.32. in the limit g 0, for which it was derived.. Because
of this reduced sensitivity of c Ž R 0 . on a possible direct effect of g on R Ži.e.
RŽ g . s R 0 q d g R ., the main effect of self-gravity on the entropy Žconsidered as a
function of the actual values M, R when g / 0. will come from replacing M0 as a
function of M and R. The mass-shift result Ž3.26. gives d M s M y M0 to first order in
g 2 . To the same accuracy 14 , Eq. Ž3.26. gives M0 as a function of M and R,
M2 g2 M
M0 , M q c 3 g 2
R dy2
s M 1 q c3
ž R dy2 / , Ž 4.3 .
1 R2 g2 M
S Ž M , R . , a0 M 1 yž R2 /ž 1y
M2 /ž 1q
R dy2 / . Ž 4.4 .
For notational simplicity, we henceforth set to unity the coefficients c1 , c 2 and c 3 . There
is no loss of generality in doing so because we can redefine l s , R and g to that effect,
and use the corresponding Žnew. string units. The main point of the present paper is to
emphasize that, for a given value of the total energy M Žand for some fixed value of g .,
the entropy SŽ M, R . has a non-trivial dependence on the radius R of the considered
string state. Eq. Ž4.3. exhibits two effects varying in opposite directions: Ži. self-gravity
favors small values of R Žbecause they correspond to larger values of M0 , i.e. of the
‘‘bare’’ entropy., and Žii. the constraint of being of some fixed size R disfavors both
small Ž R < 'M . and large Ž R 4 'M . values of R. For given values of M and g, the
most numerous Žand therefore most probable. string states will have a size R ) Ž M; g .
which maximizes the entropy SŽ M, R .. Said differently, the total degeneracy of the
complete ensemble of self-gravitating string states with total energy M Žand no a priori
size restriction. will be given by an integral Žwhere D R is the rms fluctuation of R
given by Eq. Ž2.17..
dR SŽ M , R.
DŽ M . ; HDR e ; e SŽ M , R ) . , Ž 4.5 .
which will be dominated by the saddle point R ) which maximizes the exponent.
14
Actually, Eq. Ž4.3. is probably a more accurate version of the mass-shift formula because it exhibits the
real mass M Žrather than the bare mass M0 . as the source of self-gravity.
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 111
The value of the most probable size R ) is a function of M, g and the space
dimension d. To better see the dependence on d, let us first consider the case Žwhich we
generically assume. where the correction factors in Eq. Ž4.4. Žparentheses on the
right-hand-side. are very close to unity so that
S Ž M , R . , a0 M Ž 1 y V Ž R . . , Ž 4.6 .
where
1 R2 g2 M
V Ž R. s q y . Ž 4.7 .
R2 M2 R dy2
One can think of V Ž R . as an effective potential for R. The most probable size R )
must minimize V Ž R .. This effective potential can be thought of as the superposition of:
Ži. a centrifugal barrier near R s 0 Žcoming from the result Ž2.30.., Žii. an harmonic
potential Žforbidding the large values of R ., and Žiii. an attractive Žgravitational.
potential. When g 2 is small the minimum of V Ž R . will come from the competition
between the centrifugal barrier and the harmonic potential and will be located around the
2 2
value Ry2 '
) , R ) rM , i.e. R ) , M s R rw . This random walk value will remain
Žmodulo small corrections. a local minimum of V Ž R . Ži.e. a local maximum of SŽ M, R ..
as long as g 2 MrR )dy 2 < Ry2 2 2
) , i.e. for g < g 0 with
dy2 g2
R ) , 'M 1 y
ž 8 g 02 / . Ž 4.9 .
Note that, when g 2 < g 02 , the value of V Ž R . at this local minimum is of order
Vmin , q2 Ry2 ) , q2 M
y1
, i.e. that it corresponds to a saddle-point entropy SŽ M, R ) .
, a0 M Ž1 y Vmin . , a 0 M y O Ž1. which differs essentially negligibly from the ‘‘bare’’
entropy a0 M Ž4 1.. To study what happens when g 2 further increases let us consider
separately the various dimensions d 0 3. We shall see that the special value g 02 , Eq.
Ž4.8., is significant Žas marking a pre-transition, before the transition to the black hole
state. only for d s 3. For d 0 4, the only special value of g 2 is the critical value
g c2 ; My1 , Ž 4.10 .
around which takes place a transition toward a state more compact than the usual
random walk one.
4.1. d s 3
Let us first consider the Žphysical. case d s 3, for which g 02 ; My3r2 < g c2 ; My1.
In that case, when g 2 becomes larger than g 02 , the Žunique. local minimum of V Ž R .
slowly shifts towards values of R lower than R rw and determined by the competition
between the centrifugal barrier 1rR 2 and the gravitational potentialy g 2 MrR.
112 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
In the approximation where we use the linearized form Ž4.6., Ž4.7., and where Žfor
g 2 4 g 02 . we neglect the term R 2rM 2 , the most probable size R ) is
2
R Ž)ds3. , 2 , when My3r2 < g 2 < My1 . Ž 4.11 .
g M
Note that as g 2 increases between My3 r2 and My1 , the most probable size R Ž)ds3.
smoothly interpolates between R rw and a value of order unity, i.e. of order the string
length. Note also that Vmin , yg 2 MrŽ2 R ) . , yg 4 M 2r4 remains smaller than one
when g 2 Q My1 so that the saddle-point entropy SŽ M, R ) . , a 0 M Ž1 y Vmin . never
differs much from the ‘‘bare’’ value a 0 M.
When g 2 , in its increase, becomes comparable to My1 , the radius becomes of order
one and it is important to take into account the Žsupposedly. more exact expression Ž4.4.
Žin which the factor Ž1 y Ry2 . plays the crucial role of cutting off any size R ( 1.. If we
neglect, as above, the term R 2rM 2 Žwhich is indeed even more negligible in the region
R ; 1. but maximize the factored expression Ž4.4., we find that the most probable size
R ) reads
1 q '1 q 3 l2
R Ž)ds3. , , when g 2 4 My3r2 , Ž 4.12 .
l
where we recall the definition
l'g2 M . Ž 4.13 .
When l < 1, the result Ž4.12. reproduces the simple linearized estimate Ž4.11..
When l R 1, Eq. Ž4.12. says that the most probable size, when g 2 increases, tends to a
limiting size Ž R` s '3 . slightly larger than the minimal one Ž R min s 1. corresponding
to zero entropy. ŽNote that even for the formal asymptotic value R` s '3 , the reduction
in entropy due to the factor 1 y Ry2 is only 2r3.. On the other hand, the fractional
self-gravity GN MrR ) Žwhich measures the gravitational deformation away from flat
space., or the corresponding term in Eq. Ž4.4., continues to increase with g 2 as
l l2
s . Ž 4.14 .
R) 1 q '1 q 3 l2
The right-hand side of Eq. Ž4.14. becomes unity for l s '5 s 2.236. The picture
suggested by these results is that the string smoothly contracts, as g increases, from its
initial random walk size down to a limiting compact state of size slightly larger than l s .
For some value of l of order unity Žmay be between 1 and 2; indeed, even for l s 1 the
size R ) s 2 and the self-gravity lrR ) s 0.5 suggest one may still trust a compact
string description. the self-gravity of this compact string state will become so strong that
one expects it to collapse to a black hole state. We recall that, as emphasized in Refs.
w5,7–11,20x, when l ; 1, the mass of the string state matches Žin order of magnitude.
that of a ŽSchwarzschild. black hole with Bekenstein–Hawking entropy equal to the
string entropy S.
4.2. d s 4
When d s 4, the argument above Eq. Ž4.9. suggests that the random-walk size
remains the most probable size up to g 2 Q g 02 ; My1 , i.e. up to l Q 1. A more accurate
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 113
4.3. d 0 5
When d 0 5, the argument around Eq. Ž4.8. shows that the random walk size R rw
, 'M is a consistent local maximum of the entropy in the whole domain g 2 < g 02 , i.e.
for l ' g 2 M < M Ž dy4.r2 , which allows values l 4 1. However, a second, discon-
nected maximum of the entropy, as function of R, could exist. To investigate this we
consider again Ž4.4., when neglecting the R 2rM 2 term Žbecause we are interested in
other possible solutions with small sizes.,
SŽ M, R.
, Ž1 yx . Ž1 ql xn . 'sŽ x . , Ž 4.19 .
a0 M
where we have defined x ' Ry2 and n ' Ž d y 2.r2. By studying analytically the
maxima and inflection points of sŽ x ., one finds that, in the present case where
15
The transition takes place in the range < l y1 < ; My2 r3 corresponding to R ) ; M 1r3 .
114 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
Ž 4.20 .
corresponds to the birth Žthrough an inflection point. of a maximum and a minimum of
the function sŽ R .. Because s1 - 1 is strictly lower than the usual random walk
maximum with sŽ R rw . , 1 y 2rM , 1, the local maximum near R ; 1 of the entropy,
which starts to exist when l ) l1 , is, at first, only metastable with respect to R rw .
However, there is a second critical value of l, l2 ) l1 , defined by
n ny1 n 1r2
l2 s n žn / , R 2 s xy1
2
r2
s ž
) 1, /
s2 s 1 . Ž 4.21 .
y1 ny1
When l ) l2 the local maximum near R ; 1 of the entropy has sŽ R . ) 1, i.e. it has
become the global maximum of the entropy, making the usual random walk local
maximum only metastable. Therefore,when l ) l2 the most probable string state is a
™
very compact state of size comparable to l s . Formally, this new global maximum exists
for any l R 1 and tends, when l `, toward the limiting location R` s ŽŽ n q 1.rn .1r2
) 1, i.e. slightly Žbut finitely. above the minimum size R s 1. However, as in the cases
d ( 4, the self-gravity of the stable compact string state will become strong when l R 1,
so that it is expected to collapse Žfor some l c ) l 2 . to a black hole state. As in the cases
d ( 4, the mass, size and entropy of this compact string state match those of a black
hole. The big difference with the cases d ( 4 is that the transition between the Žstable.
random walk typical configuration and the Žstable. compact one is discontinuous. Our
present model suggests that Žwhen n ' Ž d y 2.r2 ) 1. a highly excited single string
system can exist, when l ) l 2 , in two different stable typical states: Ži. a dilute state of
typical size R rw , 'M and typical mean density r ; MrR rw d
; My n < 1, and Žii. a
condensed state of typical size R ; 1 and typical mean density using l ; 1.: r ; M ;
Ž
gy2 4 1. We shall comment further below on the value r ; gy2 of the dense state of
string matter.
5. Discussion
Technically, the main new result of the present work is the Ždimension independent.
estimate16 cŽ R . s 1 y c1rR 2 , with c1 , Ž3r4. l s2 s 3 a Xr2, of the factor giving the
decrease in the entropy Ž2p ŽŽ d y 1. a Xr6.1r2 M . of a narrow band of very massive
Žopen17 . string states M 4 Ž a X .y1 r2 , when considering only string states of size R
16
In spite of our efforts in Section 2, this result remains non-rigorous and open to O Ž1r d . fractional
corrections because of the difficulty to define a good quantum operator representing the mean radius of a
string state.
17
For technical simplicity, we have restricted our attention to open bosonic strings. We could have dealt
with closed bosonic strings by doubling the oscillators, but the level matching condition, NL s NR , would have
complicated the definition of the grand canonical ensemble we used. We expect that our results Žwhich are
semi-classical. apply Žwith some numerical changes. to open or closed superstrings.
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 115
18
We consider states of a single string because, for large values of the mass, the single-string entropy
approximates the total entropy up to subleading terms.
19
Our conclusions are not rigorously established because they rely on assuming the validity of the result
Ž4.4. beyond the domain Ž Ry2 <1, g 2 Mr R dy2 <1. where it was derived. However, we find heuristically
convincing to believe in the presence of a reduction factor of the type 1y Ry2 down to sizes very near the
string scale. Our heuristic dealing with self-gravity is less compelling because we do not have a clear signal of
when strong gravitational field effects become essential.
116 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
blending between the two evolutions around < l y 1 < ; My2 r3. In d 0 5, the transition is
discontinuous Žlike a first-order phase transition between, say, gas and liquid states..
Barring the consideration of metastable Žsupercooled. states, on expects that when
l s l2 , n nrŽ n y 1. ny1 Žwith n s Ž d y 2.r2., the most probable size of a string state
will jump from R rw Žwhen l - l2 . to a size of order unity Žwhen l ) l2 ..
Let us, for definiteness, write down in more detail what happens in d s 3. After
maximization over R, the entropy of a self-gravitating string is given, when My3 r2 <
g 2 < My1 , by
2
S Ž M . s S Ž M , R ) Ž M . . , a 0 M 1 q 14 Ž g 2 M . . Ž 5.1 .
By differentiating S with respect to M, one finds the temperature of the ensemble of
highly excited single string states of mass M:
2
T , THag 1 y 34 Ž g 2 M . ,
ž / Ž 5.2 .
with THag ' ay1 0 . Eq. 5.2 explicitly exhibits the modification of the Hagedorn tempera-
Ž .
ture due to self-gravity Žin agreement with results of w20x obtained by a completely
different approach.. Note that, both in Eqs. Ž5.1. and Ž5.2., the self-gravity modifications
are fractionally of order unity at the transition g 2 M ; 1.
One can think of the ‘‘condensed’’ state of Žsingle. string matter, reached Žin any d .
when l ; 1, as an analog of a neutron star with respect to an ordinary star Žor a white
dwarf.. It is very compact Žbecause of self-gravity. but it is stable Žin some range for g .
under gravitational collapse. However, if one further increases g or M Žin fact,
l s g 2 M ., the condensed string state is expected Žwhen l reaches some l3 ) l2 ,
l3 s O Ž1.. to collapse down to a black hole state Žanalogously to a neutron star
collapsing to a black hole when its mass exceeds the Landau–Oppenheimer–Volkoff
critical mass.. Still in analogy with neutron stars, one notes that general relativistic
strong gravitational field effects are crucial for determining the onset of gravitational
collapse; indeed, under the ‘‘Newtonian’’ approximation Ž4.4., the condensed string
state could continue to exist for arbitrary large values of l.
It is interesting to note that the value of the mass density at the formation of the
condensed string state is r ; gy2 . This is reminiscent of the prediction by Atick and
Witten w35x of a first-order phase transition of a self-gravitating thermal gas of strings,
near the Hagedorn temperature 20 , towards a dense state with energy density r ; gy2
Žtypical of a genus-zero contribution to the free energy.. Ref. w35x suggested that this
transition is first-order because of the coupling to the dilaton. This suggestion agrees
with our finding of a discontinuous transition to the single string condensed state in
dimensions 0 5 ŽRef. w35x work in higher dimensions, d s 25 for the bosonic case.. It
would be interesting to deepen these links between self-gravitating single string states
and multi-string states.
Assuming the existence Žconfirmed by the present work. of a dense state of
self-gravitating string matter with energy density r ; gy2 , it would be fascinating to be
able to explore in detail Žwith appropriate, strong gravity tools. its gravitational
20
Note that, by definition, in our single string system, the formal temperature T s Ž E SrE M .y1 is always
near the Hagedorn temperature.
T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119 117
dynamics, both in the present context of a single, isolated object Ž‘‘collapse problem’’.,
and in the cosmological context Žproblem of the origin of the expansion of the universe..
Let us come back to the consequences of the picture brought by the present work for
the problem of the end point of the evaporation of a Schwarzschild black hole and the
interpretation of black hole entropy. In that case one fixes the value of g Žassumed to be
< 1. and considers a black hole which slowly looses its mass via Hawking radiation.
When the mass gets as low as a value 21 M ; gy2 , for which the radius of the black hole
is of order one Žin string units., one expects the black hole to transform Žin all
dimensions. into a typical string 22 state corresponding to l s g 2 M ; 1, which is a
dense state Žstill of radius R ; 1.. This string state will further decay and loose mass,
predominantly via the emission of massless quanta, with a quasi thermal spectrum with
temperature T ; THagedorn s ay1 0
Žsee Eq. Ž3.29. and Refs. w9,10,34x. which smoothly
matches the previous black hole Hawking temperature. This mass loss will further
decrease the product l s g 2 M, and this decrease will, either gradually or suddenly,
cause the initially compact string state to inflate to much larger sizes. For instance, if
d 0 4, the string state will quickly inflate to a size R ; 'M . Later, with continued mass
loss, the string size will slowly shrink again toward R ; 1 until a remaining string of
mass M ; 1 finally decays into stable massless quanta. In this picture, the black hole
entropy acquires a somewhat clear statistical significance Žas the degeneracy of a
corresponding typical string state. only when M and g are related by g 2 M ; 1. If we
allow ourselves to vary Žin a Gedanken experiment. the value of g this gives a potential
statistical significance to any black hole entropy value S BH Žby choosing g 2 ; Sy1
BH . We
.
do not claim, however, to have a clear idea of the direct statistical meaning of S BH when
g 2 S BH 4 1. Neither do we clearly understand the fate of the very large space Žwhich
could be excited in many ways. which resides inside very large classical black holes of
radius R BH ; Ž g 2 S BH .1rŽ dy1. 4 1. The fact that the interior of a black hole of given
mass could be arbitrarily large 23 , and therefore arbitrarily complex, suggests that black
hole physics is not exhausted by the idea Žconfirmed in the present paper. of a reversible
transition between string-length-size black holes and string states.
On the string side, we also do not clearly understand how one could follow in detail
Žin the present non-BPS framework. the ‘‘transformation’’ of a strongly self-gravitating
string state into a black hole state.
Finally, let us note that we expect that self-gravity will lift nearly completely the
degeneracy of string states. ŽThe degeneracy linked to the rotational symmetry, i.e.
2 J q 1 in d s 3, is probably the only one to remain, and it is negligible compared to the
string entropy.. Therefore we expect that the separation d E between subsequent Žstring
and black hole. energy levels will be exponentially small: d E ; D M expŽySŽ M ..,
21
Note that the mass at the black hole™ string transition is larger than the Planck mass MP ; Ž GN .y1 r2 ;
™
y1 y1
g by a factor g 41.
22
A check on the single-string dominance of the transition black hole string is to note that the single
string entropy ; Mr M s is much larger than the entropy of a ball of radiation Srad ; Ž RM . d rŽ dq1. with size
R; R BH ; l s at the transition.
23
For example, in the Oppenheimer–Snyder model, one can join an arbitrarily large closed Friedmann dust
universe, with hyperspherical opening angle 0 ( x 0 (p arbitrarily near p , onto an exterior Schwarzschild
space-time of given mass M.
118 T. Damour, G. Venezianor Nuclear Physics B 568 (2000) 93–119
Acknowledgements
This work has been clarified by useful suggestions from M. Douglas, K. Gawedzki,
M. Green, I. Kogan, G. Parisi, A. Polyakov, and Žlast but not least. M. Vergassola. We
wish also to thank A. Buonanno for collaboration at an early stage and D. Gross, J.
Polchinski, A. Schwarz, L. Susskind and A. Vilenkin for discussions. T.D. thanks the
Theory Division of CERN, Gravity Probe B ŽStanford University., and the Institute for
Theoretical Physics ŽSanta Barbara. for hospitality. Partial support from NASA grant
NAS8-39225 is acknowledged. G.V. thanks the IHES for hospitality during the early,
crucial stages of this work.
References
B ™X l l s
q y
decays in supersymmetry
E. Lunghi a , A. Masiero, a , I. Scimemi b, L. Silvestrini c
a
SISSA-ISAS, Via Beirut 2-4, Trieste, Italy and INFN, Sezione di Trieste, Trieste, Italy
b
Dep. de Fisica Teorica, UniÕ. de Valencia, c. Dr. Moliner 50, E-46100, Burjassot, Valencia, Spain
c
Physik Department, Technische UniÕersitat ¨ Munchen,
¨ D-85748 Garching, Germany
Received 15 June 1999; received in revised form 4 October 1999; accepted 27 October 1999
Abstract
™ ™
We study the semileptonic decays B X s eqey, B X s mqmy in generic supersymmetric
extensions of the Standard Model. SUSY effects are parameterized using the mass insertion
™
approximation formalism and differences with the Constrained MSSM results are pointed out.
Constraints on SUSY contributions coming from other processes Že.g. b sg . are taken into
account. Chargino and gluino contributions to photon and Z-mediated decays are computed and
non-perturbative corrections are considered. We find that the integrated branching ratios and the
asymmetries can be strongly modified. Moreover, the behavior of the differential forward–back-
ward asymmetry remarkably changes with respect to the Standard Model expectation. q 2000
Elsevier Science B.V. All rights reserved.
1. Introduction
One of the features of a general low energy supersymmetric ŽSUSY. extension of the
Standard Model ŽSM. is the presence of a huge number of new parameters. FCNC and
CP-violating phenomena constrain strongly a big part of the new parameter space.
However, there is still room for significant departures from the SM expectations in this
interesting class of physical processes. It is worthwhile to check all these possibilities on
the available data and on those processes that are going to be studied in the future. In
this way it is possible to indicate where new physics effects can be revealed as well as to
establish criteria for model building.
™
In this work we want to investigate the relevance of new physics effects in the
semileptonic inclusive decay B X s l q l y. This decay is quite suppressed in the
Standard Model; however, new B-factories should reach the precision requested by the
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 9 2 - 6
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 121
SM prediction w1x and an estimate of all possible new contributions to this process is
compelling.
Semileptonic charmless B decays have been deeply studied. The dominant perturba-
tive SM contribution has been evaluated in Ref. w2x and later two-loop QCD corrections
have been provided w3–7x. The contribution due to cc resonances to these results are
included in the papers listed in Refs. w8–11x. Long distance corrections can have a
different origin according to the value of the dilepton invariant mass one considers.
O Ž1rm2b . corrections have been first calculated in Ref. w12x and recently corrected in
Refs. w13–16x. Near the peaks, non-perturbative contributions generated by cc reso-
nances by means of resonance-exchange models have been provided in Refs. w13–
15,17–20x. Far from the resonance region, instead, Ref. w21x Žsee also Ref. w22x. estimate
cc long-distance effects using a heavy quark expansion in inverse powers of the
charm-quark mass Ž O Ž1rm2c . corrections..
An analysis of the SUSY contributions has been presented in Refs. w23–28x where the
authors estimate the contribution of the Minimal Supersymmetric Standard Model
ŽMSSM.. They consider first a universal soft supersymmetry breaking sector at the
Grand Unification scale ŽConstrained MSSM. and then partly relax this universality
condition. In the latter case they find that there can be a substantial difference between
the SM and the SUSY results in the Branching Ratios and in the forward–backward
asymmetries. One of the reasons of this enhancement is that the Wilson coefficient
™
in some region of the parameter space while respecting constraints coming from b sg .
The recent measurements of b sg w29x have narrowed the window of the possible
™
C 7 Ž MW . Žsee Section 2 for a precise definition. can change sign with respect to the SM
™
in the Constrained MSSM framework. Hence, on one hand it is worthwhile considering
B X s l q l y in a more general SUSY framework then just the Constrained MSSM,
and, on the other hand, the above-mentioned new results prompt us to a reconsideration
of the process. In Ref. w30x the possibility of new-physics effects coming from
gluino-mediated FCNC is studied. Effects of SUSY phases in models with heavy first
and second generations sfermions have been recently discussed in Ref. w31x.
We consider all possible contributions to charmless semileptonic B decays coming
from chargino–quark–squark and gluino–quark–squark interactions and we analyze
both Z-boson and photon mediated decays. Contributions coming from penguin and box
diagrams are taken into account; moreover, corrections to the MIA results due to a light
t̃ R are considered. A direct comparison between the SUSY and the SM contributions to
the Wilson coefficients is performed. Once the constraints on mass insertions are
established, we find that in generic SUSY models there is still enough room to see large
deviations from the SM expectations for branching ratios and asymmetries. For our final
computation of physical observables we consider NLO QCD evolution of the coeffi-
cients and non-perturbative corrections Ž O Ž1rm2b ., O Ž1rm2c ., . . . ., each in its proper
range of the dilepton invariant mass.
Because of the presence of so many unknown parameters Žin particular in the scalar
mass matrices. which enter in a quite complicated way in the determination of the mass
eigenstates and of the various mixing matrices it is very useful to adopt the so-called
‘‘Mass Insertion Approximation’’ ŽMIA. w32x. In this framework one chooses a basis for
fermion and sfermion states in which all the couplings of these particles to neutral
122 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
gauginos are flavor diagonal. Flavor changes in the squark sector are provided by the
non-diagonality of the sfermion propagators. The pattern of flavor change is then given
by the ratios
2
d i fj s
žm ˜/
f
ij AB
, Ž 1.
Ž . AB Msq2
˜
where Ž m if j . A2 B are the off-diagonal elements of the f˜s u,d
˜ ˜ mass squared matrix that
mixes flavor i, j for both left- and right-handed scalars Ž A, B s left,right. and Msq is
the average squark mass Žsee e.g. Ref. w33x.. The sfermion propagators are expanded in
terms of the d s and the contribution of the first two terms of this expansion are
considered. The genuine SUSY contributions to the Wilson coefficients will be simply
proportional to the various d s and a keen analysis of the different Feynman diagrams
involved will allow us to isolate the few insertions really relevant for a given process. In
this way we see that only a small number of the new parameters is involved and a
general SUSY analysis is made possible. The hypothesis regarding the smallness of the
d s and so the reliability of the approximation can then be checked a posteriori.
Many of these d ’s are strongly constrained by FCNC effects w33–39x or by vacuum
stability arguments w40x. Nevertheless it may happen that such limits are not strong
enough to prevent large contributions to some rare processes. For instance it has been
recently found in Ref. w41x that the off-diagonal squark mass matrix elements can
enhance rare kaon decays by roughly an order of magnitude with respect to the SM
result.
The paper is organized as follows. In Section 2 we define the operator basis, the basic
formulae for the BR, the forward–backward asymmetry and the non-perturbative
corrections. Section 3 and Section 4 treat chargino and gluino contributions in the mass
insertion approximation. The light t˜R corrections are presented in Section 5. Constraints
on d ’s are discussed in Section 6 and final results and conclusions are drawn in Sections
7 and 8.
™
The effective Hamiltonian for the decay B X s l q l y in the SM and in the MSSM
is given by Žneglecting the small contribution proportional to K u)s K u b .
4GF 8 a 10
Heff s y
'2 K t)s K t b Ý Ci Ž m . Qi q 4p Ý C˜i Ž m . Q i , Ž 2.
is1 is9
where
Q1 s sL a gm bL a c L b g m c L b ,
Q2 s sL a gm bL b c L b g m c L a ,
Q3 s sL a gm bL a Ý q L b g mq L b ,
qsu , . . . , b
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 123
Q4 s sL a gm bL b Ý q L b g mq L a ,
qsu , . . . , b
Q5 s sL a gm bL a Ý q R b g mq R b ,
qsu , . . . , b
Q6 s sL a gm bL b Ý q R b g mq R a ,
qsu , . . . , b
e
Q7 s m b sL s mn bR Fmn ,
16p 2
gs
Q8 s m b sL T as mn bR Gmn
a
,
16p 2
Q9 s Ž sL gm bL . lg m l,
Q10 s Ž sL gm bL . lg m g 5 l, Ž 3.
K is the CKM matrix and q LŽ R. s 12 Ž1 . g 5 . q. This Hamiltonian is known at next-to-
leading order both in the SM w3–7x and in the MSSM w26–28x. We find that the most
general low-energy SUSY Hamiltonian also contains the operators
e
QX7 s m s s mn bL Fmn ,
8p 2 b R
QX9 s Ž sR gm bR . lg m l,
QX10 s Ž sR gm bR . lg m g 5 l. Ž 4.
However, it is shown in the following sections that the contribution of these operators is
negligible and so they are not considered in the final discussion of physical quantities.
SUSY contributions to other operators are negligible because they influence our
observables at an higher perturbative order.
With these definitions the differential branching ratio and the forward–backward
asymmetry can be written as
d G ŽB ™X l q y
™ X en .
1 s l .
RŽ s. '
GŽB c ds
2
a2 K t)s Ž1 y s.
s 2 Ž 1 q 2 s . ž < C˜9eff Ž s . < 2 q < C˜10 < 2 / q 4 Ž 1 q 2rs . < C 7 < 2
4p K cb f Ž z. kŽ z.
q12Re C 7) C˜9eff Ž s . , Ž 5.
1
Hy1 d cos u
™ X l l . SgnŽcosu .
d 2G Ž B s
q y
d G Ž B™X l l .
d cos u d s
A FB Ž s . ' 2 q y
1 s
Hy1 d cos u
d cos u d s
3Re w C˜10 )
Ž s C˜9eff Ž s . q 2C7 . x
sy
2
, Ž 6.
Ž 1 q 2 s . Ž < C˜9eff Ž s . < 2 q < C˜10 < 2 . q 4 1 q ž / < C 7 < 2 q 12Re w C 7) C˜9eff Ž s . x
s
124 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
where s s Ž p l qq p l y . 2rm2b , u is the angle between the positively charged lepton and
the B flight direction in the rest frame of the dilepton system, f Ž z . and k Ž z . are the
phase space factor and the QCD correction factor Ž z s m crm b . that enter G Ž B X c en .
and can be found in Refs. w7,42,43x. C˜9eff Ž s . includes all the contributions of the
™
operators Q1 y Q6 and Q8 and its complete definition for the SM and MSSM can be
found again in Refs. w3–7,26x.
In the literature the energy asymmetry is also considered w26x but it is easy to show
that these two kinds of asymmetries are completely equivalent; in fact a configuration in
the dilepton c.m.s. in which l q is scattered in the forward direction kinematically
implies E l q- E l y in the B rest frame Žsee for instance Refs. w13–15x..
It is worth underlying that integrating the differential asymmetry given in Eq. Ž6. we
do not obtain the global forward–backward asymmetry which is by definition
1 ™ X l l . SgnŽ cosu .
d 2G Ž B s
q y
N Ž l™
q
. y N Ž l§
q Hy1 d cos uHd s
d G Ž B™X l l .
. d cos u d s
s Ž 7.
N Ž l™
q
. q N Ž l§
q
. 1
2
s
q y
H d cos uHd s
y1 d cos u d s
where l ™q
and l § q
stand respectively for leptons scattered in the forward and
backward directions.
To this extent it is useful to introduce the following quantity:
Hy11 d cos u
d 2G Ž B ™X l s
q y
l .
Sgn Ž cos u .
™
d cos u d s
A FB Ž s . '
1 d 2G Ž B Xs l q l y .
H d cos u d s
H
y1 d cos u d s
y3Re w C˜10 )
Ž s C˜9eff Ž s . q2C7 . x Ž1y s . 2
s ,
2
Hd s Ž1y s .2 Ž 1q2 s . Ž < C˜9eff Ž s . < 2 q < C˜10 < 2 . q4 1q ž < C 7 < 2 q12Re w C 7) C˜9eff Ž s . x
/
s
Ž 8.
whose integrated value is given by Eq. Ž7..
Eqs. Ž5. and Ž6. have been corrected in order to include several non-perturbative
effects. First of all O Ž1rm2b . effects have been estimated by w13–15x
d 1r m 2b R Ž s .
2
3 l2 a2 K t)s 1
s
ž Ž 1 y 15s 2 q 10 s 3 . ž < C˜9eff Ž s . < 2 q < C˜10 < 2 /
2 m2b 4p 2
K cb f Ž z. kŽ z.
< C7 < 2 gŽ z.
y Ž 5 q 6 s y 7s 2 . Re C 7) C˜9eff Ž s .
y4 Ž 6 q 3s y 5s . 3
s / q
f Ž z.
RŽ s. ,
Ž 9.
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 125
d 1r m 2b A FB Ž s .
3 l2 Re C˜10
)
s Ž 9 q 14 s y 15s 2 . C˜9eff Ž s . q 2 Ž 7 q 10 s y 9 s 2 . C 7
s
2 m2b Ž 1 q 2 s . < C˜9eff Ž s . < 2 q < C˜10 < 2 q 4 Ž 1 q 2rs . < C 7 < 2 q 12Re C 7) C˜9eff Ž s .
ž /
3 l2 g Ž z . 4l1 s d 1r m 2b R Ž s .
q A FB Ž s .
ž 2 m2b f Ž z .
q
3m2b Ž 1 y s . 2
y
RŽ s. / , Ž 10 .
where
g Ž z . s 3 y 8 z 2 q 24 z 4 y 24 z 6 q 5 z 8 q 24 z 4 log w z x .
l1 and l2 are the two parameters that appear in the Heavy Quark Expansion ŽHQE..
While the value of l2 is quite well established Ž l2 s Ž MB2) y MB2 .r4., l1 is not yet
well known. In Refs. w44–46x l1 is estimated as l1 s y0.52 " 0.12 GeV 2 and in Ref.
w47x l1 s y0.10 " 0.05 GeV 2 . In what follows we consider the weighted average of the
two results l1 s y0.16 GeV 2 . As was pointed out by the authors of Refs. w13–15x these
corrections are no more valid near the endpoint region, s 1, where they diverge
because of the breaking down of the HQE. Following some recent analyses we have
™
stopped the BR’s corrections given in Eq. Ž9. at s s 0.78 Žsee Ref. w16x. and the ones in
Eq. Ž10. at s s 0.7 Žsee Refs. w13–15x..
In order to account for the corrections to the parton model approximation in the high
s region Refs. w13–15x and Ref. w16x adopt two different approaches. The former
considers a Fermi-motion model and the latter invokes the Heavy Hadron Chiral
Perturbation Theory ŽHHChPT.. A discussion about the usefulness of the Fermi-motion
model for semileptonic charmless B-decays is beyond the scope of this paper. In order
to have a model-independent description of the high energy region of the spectrum we
have considered the HHChPT corrections.
™ ™
For 0.88 - s - smax s Ž MB y MK . 2rm2b s 0.99 the branching ratio is dominated by
the exclusive decays B K l q l y and B Kp l q l y; in Ref. w16x it is shown that the
contribution of the latter is completely negligible. In the following we report the
expression of the branching ratio, valid in the interval of the spectrum given above,
computed in the HHChPT framework
and the definitions of f 1 , fq, C˜9EP , Õ P pK can be found in Ref. w16x. Moreover, we have
put g s 0.5 according to the theoretical estimate given in Ref. w48x. In the intermediate
126 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
region 0.78 - s - 0.88 we have interpolated the obtained results. The form factors fq
and aT can be computed also using other methods ŽQCD sum rules w49–51x, light-cone
QCD sum rules w52,53x, QCD relativistic potential model. w54x but the HHChPT
approach is preferable in the endpoint region of the spectrum.
The asymmetries receive no contribution from the single kaon mode B K l q l y
and the endpoint of their spectrum is fixed, instead, by B Kp l q l y at s s Ž MB y MK ™ ™
y Mp . 2rm2b s 0.93. In the region 0.7 - s - 0.93 we use the parton model result because
these asymmetries have not been computed yet in the HHChPT framework.
Finally, the following O Ž1rm2c . corrections occur for s - 0.2 Žsee e.g. Ref. w21x.
2 2
8 l2 a K t)s Ž1 ys.
d 1r m 2c R Ž s . s y C2
9m 2c 4p 2 K cb f Ž z. kŽ z.
1 q 6 s y s2
q C˜9eff ) Ž 2 q s .
=Re F Ž s . C 7)
ž s / , Ž 13 .
d 1r m 2c A FB Ž s .
3 l2
s C2 Ž 1 q 3s .
9m2c
Re F Ž s . C˜10
)
= .
Ž 1 q 2 s . ž < C˜9eff Ž s . < 2 q < C˜10 < 2 / q 4 Ž 1 q 2rs . < C 7 < 2 q 12Re C 7) C˜9eff
Ž 14 .
The O Ž1rm2b . and O Ž1rm2c . corrections to A FB can be easily computed because of
the following relation:
RŽ s.
A FB s A FB . Ž 15 .
H d s RŽ s.
All the effects coming from the mass insertion approximation can be included in
formulae Ž5. – Ž14. writing the coefficients C 7 , C˜9eff Ž s ., C˜10 as
C 7 s C 7SM q C 7diag q C 7MI ,
SM diag MI
C˜9eff Ž s . s Ž C˜9eff Ž s . . q Ž C˜9eff . q Ž C˜9eff . ,
C˜10 s C˜10
SM
q C˜10
diag
q C˜10
MI
Ž 16 .
where all the contributions are evaluated at the MB scale and the various Cidiag
summarize all the contributions coming from graphs including SUSY Higgs bosons and
sparticles in the limit in which we neglect all the mass insertion contributions Žthey
would be the only SUSY diagrams if the scalar mass matrices were diagonalized by the
same rotations as those needed by the fermions.. The explicit expressions for Cidiag can
be found in Ref. w26x.
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 127
™
Fig. 1. Some of the relevant penguin diagrams for b s l q l y. The bubble indicates a mass insertion.
Diagrams A,B are based on chargino interaction. Diagrams Žc. and Žd. consider gluino interactions.
™
The Feynman diagrams with MI relevant for b s l q l y are drawn in Figs. 1 and 2.
We have considered gluino-like and chargino-like contributions with both single and
double mass insertions.
Both photons and Z-bosons can mediate the decay. Usually one finds that Z-boson
contributions are dominant in those graphs where an ‘‘explicit SUŽ2.L breaking’’ is
provided, i.e. both left and right squarks are present in the same loop. In the latter cases
the photon cannot feel any gauge-symmetry-breaking and its contribution to the Wilson
coefficients is suppressed by a factor m2ZrMsq2 with respect to the Z-boson one. For
Msq ; 300 GeV, this factor amounts to about an order of magnitude. On the other hand
if the graph does not give any SUŽ2.L explicit breaking we are in the opposite situation
and the Z-boson contribution is suppressed by a factor m2brm2Z ; 3 = 10y3 . Moreover, a
general feature of g-mediated four fermion contributions is that, for high average squark
masses, they decouple much faster than in the Z-boson case. This can be understood
simply using dimensional arguments. While Wilson coefficients for Z-boson mediated
four-fermion interactions are proportional to DrŽ m2Z Msq2 ., the same coefficients must be
proportional to DrMsq4 for the g Ž D here is a generic off-diagonal element of the
sparticle squared mass matrices and it cannot rise as fast as Msq2 for high values of Msq ..
Thus photon graphs can compete with Z-boson graphs if the sparticle spectrum is not
too heavy.
Table 1
Central values of physical constants used in the phenomenological analysis
mt 173.8 GeV
mb 4.8 GeV
mc 1.4 GeV
ms 125 MeV
MB 5.27 GeV
a sŽ mZ . 0.119
1r a e l Ž m Z . 128.9
sin2u W 0.2334
3. Chargino interactions
In the weak eigenstate basis the chargino mass matrix is given by
M2 '2 MW sin b
Mx s
ž '2 MW cos b m / ,
where the index 1 of rows and columns refers to the wino state, the index 2 to the
Ž 17 .
higgsino state, m is the Higgs quadratic coupling and M2 the soft SUSY breaking wino
mass. In order to define the mass eigenstates the unitary matrices U and V which
diagonalize Mx are introduced,
diag Ž Mx 1 , Mx 2 . s U ) Mx Vq. Ž 18 .
After the rotation to mass eigenstates it is always possible to speak of wino–quark–squark
or higgsino–quark–squark interactions. In order to identify the wino and higgsino states
from the chargino ones it is sufficient to pick up the right elements from the U and V
matrices. To be clear we write them explicitly for the cases of interest in the super-CKM
basis. The wino–quark–squark, d LWu, ˜ ˜ vertex is Ž d and u˜ are a generic down-quark and
up-squark .
ig 2 Ý K k)j Ý Va1 d j L x a u˜ k L q h.c., Ž 19 .
j, ks1,2,3 as1,2
˜ ˜ vertex is
and the higgsino–quark–squark, dHu,
i Ý K k)j Ý Ž lu . k Va2 d j L xa u˜ k R y Ž ld . k Ua2) d j R xa u˜ k L q h.c., Ž 20 .
j, ks1,2,3 as1,2
where K is the CKM matrix,
u
'2 M u M u g2
g 2 s ersin u W , l s s
Õsin b '2 sin b MW
and
'2 M d
ld s
Õcos b
are the Yukawa matrices for the up- and down-quarks.
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 129
Chargino graphs can contribute to the decay via both single and double insertions
Žsee Figs. 1 and 2.. The double insertion is particularly convenient if the corresponding
d ’s are not very constrained w41x. In the following subsections we examine both cases.
In the case of a single insertion approximation both g- and Z-mediated decays are
considered.
In all what follows our results for the integrals are written in terms of the functions
j
1 1 yiŽ1yy.
Pi jk Ž a,b . ' H0 H0dx dy k
. Ž 21 .
Ž 1 y y q axy q b Ž 1 y x . y .
To get a feeling with numbers it is sufficient to say that for a s b s 1,
Pi jk Ž 1,1 . s bE w 1 q i ,1 q j x , Ž 22 .
where bE is the Euler b-function.
The Z-boson mediated decay can proceed in two ways depending on the type of
chargino–quark–squark vertices we consider. If an explicit SUŽ2.L breaking on the
squark line of Fig. 1a is required we must take both an higgsino and a wino vertex. In
this way we get a contribution to the Wilson coefficients
C9 u
l t K c)s 1
y 2
s C10 s Ž d 23 . LR
1 y 4sin u W g 2 K t)s 4sin2u W
= Ý
i , js1,2
½ (
Vi1Vj2) Ui1) Uj1 x i x j P112 Ž x i , x j . q Vi1) Vj1 P111 Ž x i , x j .
1
y d i j P021 Ž x i , x j . ,
2 5 Ž 23 .
where x i s Mx2irMsq2 . This diagram, however, is exactly null in the limit in which U, V
approximate the identity matrix and so it is negligible for high M2 .
With two wino–quark–squark vertices we obtain
C9
y s C10
1 y 4sin2u W
u
K c)s 1
s yŽ d 23 . LL
K t)s 4sin2u W
Ý
i , js1,2
½ (
Vi1Vj1) Ui1) Uj1 x i x j P112 Ž x i , x j .
where we have retained only the contribution which arises because of the explicit
130 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
SUŽ2.L breaking Žwith a double wino–higgsino mixing in the wino line.; in fact Eq. Ž24.
is null in the limit of a diagonal chargino mass matrix.
Graphs with two higgsino–quark–squark vertices are suppressed with respect to these
ones by Yukawa or CKM factors.
MW2 1 K c)s
u
C 7 s y Ž d 23 . LL Ý Vi1Vi1) 32 P222 Ž x i , x i . q P132 Ž x i , x i . 4 ,
Msq2 3 K t)s is1,2
u
MW2 2 K c)s
C9 s y Ž d 23 . LL Ý Vi1Vi1) P312 Ž x i , x i .
Msq2 3 K t)s is1,2
q 13 P042 Ž x i , x i . q x i P313 Ž x i , x i . 4 ,
MW2 1 K c)s m s
CX7 s y Ž d 23
u
. LL Ý Vi1Vi1) 32 P222 Ž x i , x i . q P132 Ž x i , x i . 4 . Ž 25 .
Msq2 3 K t)s m b is1,2
The contributions of the g penguin with an higgsino and a wino vertices are
MW2 K c)s lt
C7 s Ý Vi2) Vi1 12 P222 Ž x i , x i . q 13 P132 Ž x i , x i . 4 Ž d 23u . L R
Msq2 K t)s is1,2 g2
Mx i l b
qUi2) Vi1 P212 Ž x i , x i . q 23 P122 Ž x i , x i . 4 Ž d 23u . L L ,
mb g2
u
MW2 2 l t K c)s
C9 s Ž d 23 . LR Ý Vi2) Vi1 P312 Ž x i , x i .
Msq2 3 g 2 K t)s is1,2
q 13 P042 Ž x i , x i . q x i P313 Ž x i , x i . 4 ,
MW2 1 l t K c)s m s
CX7 s Ž d 23
u
. LR Ý Vi2) Vi1 32 P222 Ž x i , x i . q P132 Ž x i , x i . 4 .
Msq2 3 g 2 K t)s m b is1,2
Ž 26 .
Finally we compute the contributions which come from chargino box diagrams of
Fig. 2.
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 131
u
K c)s MW2 1
C9 s yC10 s Ž d 23 . LL Ý Ž Vi1) Vj1Vi1Vj1) . f Ž x i , x j , xn˜ . , Ž 27.
K t)s Msq2 sin2u W i , js1,2
where
f Ž x i , x j , xñ .
2
1 1 1 1 yz Ž 1 y z .
s
2
H0 dxH0 dyH0 dz 2
y Ž 1 y z . q xn˜ Ž 1 y y . Ž 1 y z . q z Ž x i x q x j Ž 1 y x . .
Ž 28 .
and xn˜ s Mn˜2rMsq2 .
If the wino–bottom–stop vertex is replaced by a higgsino–bottom–stop one we
obtain
u
K c)s MW2 lt
C9 s yC10 s y Ž d 23 . LR Ý Ž Vi1) Vj1Vi1Vj2) . f Ž x i , x j , xn˜ . .
K t)s Msq2 g 2 sin2u W i , js1,2
Ž 29 .
It was recently pointed out w41x that a double mass insertion can provide a great
enhancement of the SUSY contribution to the decay width, at least in the K-system case,
if the d ’s are not very constrained.
For B decay we obtain contributions from this graph to C9 and C10 ,
C9
y s C10
1 y 4sin2u W
di j
q 12 Vi1) Vj1 P122 Ž x i , x j . y
3 5
P032 Ž x i , x j . . Ž 30 .
4. Gluino interactions
The main contribution of this kind of interactions comes from the graphs drawn in
Fig. 1c,d. In what follows we analyze the single and double mass insertion cases.
132 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
The corrections to the coefficients in the photon mediated decay case are
'2 1 Nc2 y 1 pa s ms 1
C7 s
Msq2 GF 3 2 Nc K t)s K t b ž Ž d 23d . L L q Ž d 23d . R R m
b
/ 4 P132 Ž x , x .
d
Mgl
q Ž d 23 . R L P122 Ž x , x . ,
mb
'2 1 Nc2 y 1 pa s ms 1
C 7X s
Msq2 GF 3 2 Nc K t)s K t b ž Ž d 23d . R R q Ž d 23d . L L m
b
/ 4 P132 Ž x , x .
d
Mgl
q Ž d 23 . L R P122 Ž x , x . ,
mb
'2 1 Nc2 y 1 pa s 1 d
C9 s y P042 Ž x , x . Ž d 23 . LL,
Msq2 GF 3 2 Nc K t)s K t b 3
'2 1 Nc2 y 1 pa s 1 d
C9X s y P042 Ž x , x . Ž d 23 . RR. Ž 31 .
Msq2 GF 3 2 Nc K t)s K t b 3
The term proportional to the gluino mass in Eq. Ž31. seems to be strongly enhanced
™
with respect to the others. However, the mass insertion which enters the diagram is also
strongly constrained from b sg w33x.
The only relevant contributions to the Z-boson mediated decay width come from
diagrams in which the Z feels directly the breaking of SUŽ2.L . According to the
argument of Section 2 all the diagrams that do not respect this condition are suppressed
with respect to the photon mediated ones and can be neglected. However, for penguins
containing a gluino, an explicit SUŽ2.L breaking can be provided only with a double MI.
If only one MI is considered, Z-mediated decays are completely negligible with respect
to the g-mediated ones.
For completeness we report here also the result obtained performing a double mass
insertion in the gluino penguin,
C9 Ž d 33d . L R Ž d 23d . R L Nc2 y 1 a s
y s C10 s P032 Ž x , x . ,
1 y 4sin2u W K t b K t)s 2 Nc 12 a
In the mass insertion approximation framework we assume that all the diagonal
entries of the scalar mass matrices are degenerate and that the off diagonal ones are
sufficiently small. In this context we expect all the squark masses to lie in a small region
around an average mass which we have chosen not smaller than 250 GeV. Actually
there is the possibility for the t˜R to be much lighter; in fact the lower bound on its mass
is about 70 GeV. For this reason it is natural to wonder how good the MIA is when a t˜R
explicitly runs in a loop.
Among those we have computed, the diagrams interesting in this effect are the
u .
chargino penguins and box with the Ž d 23 L R insertion. To compute the light-t˜R contribu-
tion we adopt the approach presented in Ref. w55x. There the authors consider an
expansion valid for unequal diagonal entries which gives exactly the MIA in the limit of
complete degeneration.
The new expressions for the contributions to the coefficients C9 and C10 are the
following.
= Ý
i , js1,2
½ (
Vi1Vj2) yUi1) Uj1 x i x j j Ž x i , x j , x t˜R .
u
MW2 2 l t K c)s
C9 s Ž d 23 . LR Ý Vi1Vi2) g 7 Ž x i , x t˜R . ,
Msq2 3 g 2 K t)s is1,2
u
MW2 1 l t K c)s
C 7 s Ž d 23 . V V)
Ý g 1 Ž x i , x t˜R . , Ž 34 .
LR
Msq2 6 g 2 K t)s is1,2 i1 i2
where
1
fi Ž x . y f Ž x y.
y i
gi Ž x , y . s ,
1yy
52 y 153 x q 144 x 2 y 43 x 3 q Ž 36 y 54 x q 12 x 3 . log Ž x .
f7 Ž x . s 4
,
6 Ž y1 q x .
y8 q 3 x q 12 x 2 y 7x 3 q 6 x Ž y3 q 2 x . log Ž x .
f 1Ž x . s 4
. Ž 35 .
6 Ž y1 q x .
134 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
C9 s yC10
1 K c)s MW2 lt
s Ž d 23u . L R Ý Ž Vi1) Vj1Vi1Vj2) . k Ž x i , x j , xn˜ , x t˜ . ,
4 K t)s Msq2 g 2 sin2u W i , js1,2
R
Ž 36 .
where k Ž x, y, z,t . is defined in Ref. w55x.
All the above formulas reduce exactly to those presented in Section 3 in the limit
x t̃ R s 1.
In order to establish how large the SUSY contribution to B X s l q l y can be, one
can compare, coefficient per coefficient, the MI results with the SM ones taking into
™
account possible constraints on the d ’s coming from other processes.
The most relevant d ’s interested in the determination of the Wilson coefficients C 7 ,
u . Ž u. Ž u. Ž d. Ž d.
C9 and C10 are Ž d 23 L L , d 23 L R , d 33 R L , d 23 L L and d 23 L R .
Ø Vacuum stability arguments regarding the absence in the potential of color and
charge breaking minima and of directions unbounded from below w40x give
Ž di3u . L R ( m t
(2 M 2 2
u˜ q 2 M l˜
,2
mt
. Ž 37 .
Msq2 Msq
For Msq ( 300 GeV this is not an effective constraint on the mass insertions.
d,u .
Ø A constraint on Ž d 23 L L can come from the possible measure of D M B s.
d .2 Ž
In fact the gluino–box contribution to D MB s w39x is proportional to Ž d 23 L L see for
w x.
instance Ref. 39 . A possible experimental determination of D MB s, say
D MB s - 30 psy1 Ž 38 .
would imply that
Ž d 23d . L L - 0.5 Ž 39 .
for squark masses about 250 GeV. Moreover the LL up- and down-squark soft
breaking mass matrices are related by a Cabibbo–Kobayashi–Maskawa rotation
2 2
Ž Msqd . L L s K † Ž Msqu . L L K Ž 40 .
so that the limit Ž39. would be valid for the up sector too:
Ž d 23u . L L - 0.5 . Ž 41 .
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 135
™
Ø Some constraints come from the measure of B X sg . The branching ratio of this
process depends almost completely on the Wilson coefficients C 7 and C 7X which are
™
d . Ž u.
proportional respectively to Ž d 23 L R or R L and d 23 L L . The most recent CLEO esti-
mate of the branching ratio for B X s g is w29x
B exp Ž B ™ sg . s Ž 3.15 " 0.35 " 0.32 " 0.26. = 10 y4
. Ž 42 .
where the first error is statistical, the second is systematic and the third comes from
the model dependence of the signal. The limits given at 95% C.L. are w29x
2.0 = 10y4 - B exp Ž B ™sg . - 4.5 = 10y4 . Ž 43 .
We can define a C 7eff Ž MB . as
2
C 7eff Ž MB . s
Bexp Ž B ™ sg . , Ž 44 .
2
Ž K t)s K t brK cb . Ž 6 a Frp g Ž z .
where F can be found for instance in Refs. w57,58x. Considering the experimental
limits we find at 95% C.L.
0.28 - < C 7eff Ž MB . < - 0.41. Ž 45 .
2
Actually < C 7eff Ž MB .< s < C 7 Ž MB .< 2
q < CX7 Ž MB .< 2
and the constraint given in Eq. Ž45.
should be shared between the two coefficients. However, in order to get the
maximum SUSY contribution, we observe that in physical observables CX7 does not
interfere with C 7 , the CX7 C9 term is suppressed by a factor m srm b with respect to the
C 7 C9 one and CX7 C9X is numerically negligible Žin fact C9X is much smaller than C9 ..
For these reasons we choose to fulfill the constraint of Eq. Ž45. with C 7 Ž MB . alone.
The bounds Ž45. are referred to the coefficient evaluated at the MB scale while we
are interested to the limits at the much higher matching scale. After the RG evolution
has been performed we find that for an average squark mass lower than 1 TeV, the
MIA contribution alone with a suitable choice of d ’s, can always fit the experimental
constraints.
Thus, since we are interested in computing the maximum enhancement Žsuppression.
SUSY can provide, we can choose the total C 7eff Ž MB . anywhere inside the allowed
region given in Eq. Ž45. still remaining consistent with the MIA.
d . y2
The limit we get for Ž d 23 L R is of order 10 and this rules out Z-mediated gluino
penguins contributions to C9 and C10 .
u .
For what concerns Ž d 23 L L we find that the constraint changes significantly according
eff Ž
to the sign of C 7 MB .. In this case it is important to consider both the positive and
negative region as this delta can give a non-negligible contribution to C9 and C10 .
The limits depend on the choice of the parameters in the chargino sector; the
numerical results given below are computed for Msq , 250 GeV, m , y160 GeV,
Mñ , 50 GeV, tan b , 2 Žin Section 7 we will show that these are the conditions
under which we find the best SUSY contributions.. Considering the positive interval
u . <Ž u . <
we find y0.7 - Ž d 23 L L - y0.5 while in the negative one d 23 L L - 0.1.
Ø Finally a comment on the d ’s coming in graphs with a double MI is in order.
d .
Given the constraints on Ž d 23 L R one can see that the gluino-penguins with a double
d .
MI give negligible contributions to the final results even if Ž d 33 R L is of order O 1 .
Ž .
u
A Ž d 33 .R L of order O Ž1., can give rise to light or negative squark mass eigenstates. In
136 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
particular a light t˜L would contribute too much to the r W -parameter. Eventual
model-dependent cancelation can provide an escape to these constraints. In any case
the numerical value of these contributions is not particularly important for the
determination of physical observables. Since we want to provide a model-indepen-
dent analysis we prefer not to consider in our final computation these double
insertion graphs and we present them only for completeness.
Contributions with three mass insertions are suppressed due to small loop integrals
and to the various constraints on the deltas.
7. Results
The results of the calculations of Sections 3 and 4 are presented in Figs. 3 and 4 and
in Tables 2 and 3. While the gluino sector of the theory is essentially determined by the
knowledge of the gluino mass Ži.e. Mgl ., the chargino one needs two more parameters
Ži.e. M2 , m and tan b .. Moreover it is a general feature of the models we are studying
the decoupling of the SUSY contributions in the limit of high sparticle masses: we
expect the biggest SUSY contributions to appear for such masses chosen at the lower
bound of the experimentally allowed region. On the other hand, these considerations
u . Ž u. Ž
Fig. 3. Ž d 23 L R above and d 23 L L below contributions to C 9 coming from chargino diagrams as functions
Ž . .
of m expressed in GeV . Msq is fixed to 250 GeV while tan b varies between 2 and 30.
Ž .
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 137
u . Ž u. Ž
Fig. 4. Ž d 23 L R above and d 23 L L below contributions to C10 coming from chargino diagrams as functions
Ž . .
of m Žexpressed in GeV.. Msq is fixed to 250 GeV while tan b varies between 2 and 30.
suggest us to constrain the three parameters of the chargino sector by the requirement of
the lighter eigenstate not to have a mass lower than the experimental bound of about 70
Table 2
Contributions to the coefficients C 7 , C9 and C10 from diagrams involving gluino loops. Mgl and Msq both
vary between 250 GeV and 1000 GeV. Exchanging L with R in the mass insertions we get the contributions of
X X X
gluino diagrams to C 7 , C9 and C10 . For further explanations see the caption in Table 3
Diagram Msq Mgl C7 C9
d . Ž d. d .
g˜g y1 ins 250 250 y0.192Ž d 23 L L y33.4 d 23 L R y0.513Ž d 23 LL
d . d . d .
250 500 y0.125 d 23 L L y31.2Ž d 23
Ž LR y0.189 d 23
Ž LL
d . d . d
500 500 y0.0449Ž d 23 LL y15.6 Ž d 23 L R y0.12Ž d 23 .L L
d . Ž d. d .
250 1000 y0.0344Ž d 23 L L y10.3 d 23 L R y0.0463Ž d 23 LL
d . d . d .
500 1000 y0.0291Ž d 23 LL y14.5 Ž d 23 LR y0.0439Ž d 23 LL
d . d . d .
1000 1000 y0.0105 d 23 L L y7.26 d 23
Ž Ž LR y0.0279Ž d 23 LL
C10 C9
d . Ž d . d . Ž d .
˜
gZy2 ins 250 250 y10.2Ž d 23 L R d 33 R L 0.763Ž d 23 L R d 33 R L
d . Ž d . d . Ž d .
250 500 y17.3Ž d 23 L R d 33 R L 1.29Ž d 23 L R d 33 R L
d . Ž d . d . Ž d .
500 500 y9.49Ž d 23 L R d 33 R L 0.712Ž d 23 L R d 33 R L
d . Ž d . d . Ž d .
250 1000 y17.6Ž d 23 L R d 33 R L 1.32Ž d 23 L R d 33 R L
d . Ž d . d . Ž d .
500 1000 y16.1Ž d 23 L R d 33 R L 1.21Ž d 23 L R d 33 R L
d . Ž d . d . Ž d .
1000 1000 y8.85Ž d 23 L R d 33 R L 0.664Ž d 23 L R d 33 R L
138 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
Table 3
Contributions to the coefficients C 7 , C9 and C10 from diagrams involving chargino loops. We assume
m sy160 GeV, M2 s 50 GeV, tan b s 2, mn˜ s 50 GeV, Mt˜R s90 GeV while Msq varies between 250 GeV
and 1000 GeV. In the first column we indicate the number of mass insertions present in each squark line,
which charginos are present at the vertexes and the kind of graph computed Žg-penguin, Z-penguin or box
diagram.
Diagram Msq C7 C9
˜ ˜g y1 ins
WW 250 u .
0.35Ž d 23 LL
u .
1.4Ž d 23 LL
u . u .
500 0.12Ž d 23 LL 0.76Ž d 23 LL
u . u .
1000 0.033Ž d 23 LL 0.32Ž d 23 LL
˜ ˜g y1 ins
HW 250 u .
y2.1Ž d 23 Ž u.
L L y0.25 d 23 L R
u .
y0.71Ž d 23 LR
u . u . u .
500 y1.1Ž d 23 LL y0.27 Ž d 23 LR y0.87Ž d 23 LR
u . Ž u. u .
1000 y0.45Ž d 23 L L y0.27 d 23 L R y0.93Ž d 23 LR
C10 C9
˜˜
WWZy2 ins 250 u . Ž u .
1.4Ž d 23 L R d 33 R L y0.092Ž d 23u . Ž u .
L R d 33 R L
u . Ž u . u . Ž u .
500 1.8Ž d 23 L R d 33 R L y0.12Ž d 23 L R d 33 R L
u . Ž u . u . Ž u .
1000 2.1Ž d 23 L R d 33 R L y0.14Ž d 23 L R d 33 R L
˜˜
WHZy1 ins 250 y8.4Ž d 23u .
LR
u .
0.56Ž d 23 LR
u . u .
500 y11.Ž d 23 LR 0.74Ž d 23 LR
u . u .
1000 y13.Ž d 23 LR 0.84Ž d 23 LR
˜˜
WWZy1 ins 250 u .
y0.91Ž d 23 LL
u .
0.06Ž d 23 LL
u . u .
500 y0.47Ž d 23 LL 0.031Ž d 23 LL
u . u .
1000 y0.19Ž d 23 LL 0.013Ž d 23 LL
box W˜ y1 ins 250 u .
2.7Ž d 23 LL
u .
y2.7Ž d 23 LL
u . u .
500 1.3Ž d 23 LL y1.3Ž d 23 LL
u . u .
1000 0.55Ž d 23 LL y0.55Ž d 23 LL
˜ ˜ y1 ins
box HW 250 u .
y0.97Ž d 23 LR
u .
0.97Ž d 23 LR
u . u .
500 y1.1Ž d 23 LR 1.1 d 23 L R
Ž
u u
1000 y1.2Ž d 23 .L R 1.2Ž d 23 .L R
GeV w56x. The remaining two-dimensional space has yet no constraint. For these reasons
we scan the chargino parameter space by means of scatter plots for which Msq s
250 GeV, Mñ s 50 GeV, 80 GeV ( < m < ( 300 GeV and 2 ( tan b ( 30; for every choice
of these two parameters, M2 is determined imposing to the lighter eigenstate a mass of
about 70 GeV. In the plots we sum all contributions coming from different graphs
proportional to a common mass insertion Žthe actual values of the coefficients are
obtained multiplying the points in the plots by the MI..
In the tables we report the contribution of each diagram and the explicit dependence
on the mass insertion parameters. We evaluate the coefficients varying Msq and Mgl
between 250 GeV and 1 TeV. The other parameters in Table 3 are fixed from the scatter
plots in order to give the best SUSY contributions to C9 and C10 .
Thus, with m , y160, Mgl , Msq , 250 GeV, Mn˜ , 50 GeV, Mt˜R s 90 GeV, tan b ,
2 one gets
C9MI Ž MB . s y1.2 Ž d 23
u
. L L q 0.69 Ž d 23u . L R y 0.51 Ž d 23d . L L
½ MI
C10 Ž MB . s 1.75 Ž d 23u . L L y 8.25 Ž d 23u . L R .
Ž 46 .
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 139
In order to numerically compare Eq. Ž46. with the respective SM results we note that
the minimum value of Ž C9eff Ž s .. SM Ž MB . is about 4 while C10 SM
s y4.6. Thus one
deduces that SM expectations for the observables are enhanced when C9MI Ž Mb . is
MI Ž
positive. Moreover the big value of C10 MB . implies that the final total coefficient
C10 Ž MB . can have a different sign with respect to the SM estimate. As a consequence of
this, the sign of asymmetries can be the opposite of the one calculated in the SM.
The diagonal contributions to C9 , C10 introduced in Section 2, and computed in the
same range of the parameters are
C9diag Ž MB . s y0.35
½ diag
C10 Ž MB . s y0.27
Ž 47 .
The sign and the value of the coefficient C 7 has a great importance. In fact the
integral of the BR Žsee Eq. Ž5.. is dominated by the < C 7 < 2rs and C 7 C9 term for low
values of s. In the SM the interference between O 7 and O 9 is destructive and this
behavior can be easily modified in the general class of models we are dealing with.
In the following, according to the discussion of Section 6, we give the configurations
of the various d ’s for which we find the best enhancements and suppressions of the SM
expectations.
Ø Best enhancement
C 7eff Ž MB . C9MI MI
C10 u, d .
Ž d 23 LL
u .
Ž d 23 LR
It is important to note that with such choices the behavior of the asymmetries in the low
s region of the spectrum is greatly modified: the coefficients of the operators Q 7 and Q9
sum up instead of cancel each other in such a way that the asymmetries are never
negative. It is also important to stress that the asymmetries get their extremal value with
u .
a rather small Ž d 23 L R : the enhancement given here will survive possible future
constraints on this insertion.
Ø Best enhancement with C 7 - 0
C 7eff Ž MB . C9MI MI
C10 u, d .
Ž d 23 LL
u .
Ž d 23 LR
Ø Best depression
C 7eff Ž MB . C9MI MI
C10 u, d .
Ž d 23 LL
u .
Ž d 23 LR
Fig. 5. Differential branching ratio for the decay B ™ X s l q l y. The solid line corresponds to the SM
expectation; the dashed and dotted–dashed lines correspond respectively to the SUSY best enhancement
Ž C 7eff s 0.41, C9MI s1.5, C10
MI
sy8.3. and depression Ž C 7eff sy0.28, C9MI sy1.3, C10 MI
s 5.8.; the dotted line
is the maximum enhancement obtained without changing the sign of C 7 C 7 sy0.41, C9MI s1.5, C10
Ž eff MI
s
y8.3..
The plots of BRŽ s ., A FB Ž s . and A FB Ž s . are drawn in Figs. 5–8. Here both SM and
SUSY results are shown. The discontinuity in the A FB plot at s s 0.7 corresponds to the
point at which we have stopped the corrections O Ž1rm2b .. In fact, a model-independent
description of the differential asymmetry in the region 0.7 - s - 0.93 beyond the parton
model is still lacking. Further, the peak, which occurs at s s Ž2 m crm b . 2 , 0.3, is due to
the perturbative remnant of the cc resonance.
™
The integrated BR’s and asymmetries for the decays B X s eqey and B X s mqmy
in the SM case and in the SUSY one Žwith the above choices of the parameters. are
™
summarized in Table 4. There we computed the total perturbative contributions neglect-
ing the resonances; these occur in the intermediate range of the spectrum Ž Jrc at 3.1
GeV Ž s s 0.42. and c X at 3.7 GeV Ž s s 0.59. plus others at higher energies.. However,
it is possible to exclude the resonant regions from the experimental analysis by
opportune cuts and to correct the effects of their tails in the remaining part of the
spectrum.
Fig. 7. Forward–backward asymmetry Ž A FB . for the decay B ™ X s mqmy. The solid line corresponds to the
SM expectation; the dashed and dotted–dashed lines correspond respectively to the SUSY best enhancement
Ž C 7eff s 0.28, C9MI s 0.96, C10
MI
sy2.1. and depression Ž C 7eff s 0.28, C9MI sy1.5, C10
MI
s8.3.; the dotted line
is the maximum enhancement obtained without changing the sign of C 7 Ž C 7eff sy0.28, C9MI s 0.75,
MI
C10 s 0.36..
The results of Table 4 must be compared with the experimental best limit which reads
w59x
BR exp - 5.8 = 10y5 . Ž 48 .
A comment on the CMSSM ŽConstrained MSSM. prediction for the observables we
have computed is now necessary. An analysis on the subject is presented in Ref. w26x. In
this paper the authors show that the effect of CMSSM on the integrated BR’s,
considering only contributions to C9 and C10 , varies between a depression up to 10%
and an enhancement of few percents relative to the corresponding SM values. The
asymmetries get even smaller corrections. On the other hand, a direct computation of
C 7MS SM Ž MW . yields w26x
y0.59 - C 7MS SM Ž MW . - q0.49 in the large tan b regime,
y0.26 - C 7MS SM Ž MW . - y0.20 in the low tan b regime. Ž 49 .
It is worth noting that comparing the above intervals with the experimentally allowed
region obtained via RG evolution at the MW scale of the limits in Eq. Ž45. Žwe use only
Table 4
™ ™
Integrated BR, A FB and A FB in the SM and in a general SUSY extension of the SM for the decays
B X s eq ey and B X s mqmy. The second and third columns are the extremal values we obtain with a
positive C 7eff while the fourth one is the C 7eff - 0 case. The actual numerical inputs for the various coefficients
™
can be found in the text. The BR is just the integral of RŽ s . multiplied by the BR of the semileptonic
dominant B decay ŽBRŽ B X c en . s 0.105.
Observable SM SUSY SUSYr SUSY SUSYrSM SUSY SUSYrSM
maximal SM minimal Ž C 7 - 0.
BRŽ e . 9.6=10y6 4.4=10y5 4.6 3.9=10y6 0.41 3.9=10y5 4.0
A FB Ž e . 0.23 0.33 1.5 y0.18 y0.78 0.31 1.4
A FB Ž e . 0.071 0.24 3.3 y0.19 y2.7 0.11 1.5
BRŽ m . 6.3=10y6 4.0=10y5 6.3 1.6=10y6 0.26 3.4=10y5 5.4
A FB Ž m . 0.23 0.33 1.5 y0.18 y0.78 0.31 1.4
A FB Ž m . 0.11 0.27 2.5 y0.27 y2.4 0.15 1.3
the SM contribution to C8 ; the inclusion of the MSSM corrections does not change
significantly the result.
y0.39 - C 7 Ž MW . - y0.099 and 0.66 - C 7 Ž MW . - 0.95 Ž 50 .
it is excluded that the CMSSM could drive a positive value for C 7eff Ž MB .. For what
concerns the negative interval of values of C 7eff Ž MB . we see that it can be accommo-
dated both in the CMSSM and in our framework.
Looking at Figs. 5–8. and Table 4 we see that the differences between SM and SUSY
predictions can be remarkable. Moreover a sufficiently precise measure of BR’s, A FB ’s
and A FB ’s can either discriminate between the CMSSM and more general SUSY models
or give new constraints on mass insertions. Both these kind of information can be very
useful for model building.
8. Conclusions
™ ™™ ™
In this paper an extensive discussion about SUSY contributions to semileptonic
decays B X s eqey, B X s mqmy is provided. We see that the interplay between
b sg and B X s l q l y is fundamental in order to give an estimate of the SUSY
relevance in these decays. The two kinds of decays are in fact strongly correlated.
Given the constraints coming from the recent measure of b sg and estimating all
possible SUSY effects in the MIA framework we see that SUSY has a chance to
™
strongly enhance or depress semileptonic charmless B-decays. The expected direct
measure will give very interesting information about the SM and its possible extensions.
Acknowledgements
We thank S. Bertolini and E. Nardi for fruitful discussions. I.S. wants to thank
SISSA, for support and kind hospitality during the elaboration of the first part of this
work and Della Riccia Foundation ŽFlorence, Italy. for partial support. The work of L.S.
E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144 143
References
w1x See, for instance, The BaBar Physics Book, SLAC-R-504 and references therein.
w2x B. Grinstein, M.J. Savage, M.B. Wise, Nucl. Phys. B 319 Ž1989. 271.
w3x G. Buchalla, A.J. Buras, Nucl. Phys. B 400 Ž1993. 225.
w4x M. Misiak, Nucl. Phys. B 393 Ž1993. 23.
w5x M. Misiak, Nucl. Phys. B 439 Ž1995. 461, Erratum.
w6x ¨ Phys. Rev. D 52 Ž1995. 186.
A.J. Buras, M. Munz,
w7x G. Buchalla, A.J. Buras, M.E. Lautenbacher, Rev. Mod. Phys. 68 Ž1996. 1125.
w8x N.G. Deshpande, J. Trampetic, K. Panrose, Phys. Rev. D 39 Ž1989. 1461.
w9x C.S. Lim, T. Morozumi, A.I. Sanda, Phys. Lett. B 218 Ž1989. 343.
w10x A.I. Vainshtein et al., Yad. Fiz. 24 Ž1976. 820, Sov. J. Nucl. Phys. 24 Ž1976. 427.
w11x P.J. O’Donnell, H.K. Tung, Phys. Rev. D 43 Ž1991. 2076.
w12x A.F. Falk, M. Luke, M.J. Savage, Phys. Rev. D 49 Ž1994. 3367.
w13x A. Ali, G. Hiller, L.T. Handoko, T. Morozumi, Phys. Rev. D 55 Ž1997. 4105.
w14x A. Ali, G. Hiller, Phys. Rev. D 58 Ž1998. 071501.
w15x A. Ali, G. Hiller, Phys. Rev. D 58 Ž1998. 074001.
w16x G. Buchalla, G. Isidori, Nucl. Phys. B 525 Ž1998. 333.
w17x ¨
F. Kruger, L.M. Sehgal, Phys. Lett. B 380 Ž1996. 199.
w18x A. Ali, T. Mannel, T. Morozumi, Phys. Rev. B 273 Ž1991. 505.
w19x M.R. Ahmadi, Phys. Rev. D 53 Ž1996. 2843.
w20x C.D. Lu,¨ D.X. Zhang, Phys. Lett. B 397 Ž1997. 279.
w21x G. Buchalla, G. Isidori, S.J. Rey, Nucl. Phys. B 511 Ž1998. 594.
w22x J.W. Chen, G. Rupak, M.J. Savage, Phys. Rev. B 410 Ž1997. 285.
w23x S. Bertolini, F. Borzumati, A. Masiero, G. Ridolfi, Nucl. Phys. B 353 Ž1991. 591.
w24x A. Ali, G.F. Giudice, T. Mannel, Z. Phys. C 67 Ž1995. 417.
w25x J. Hewet, J.D. Wells, Phys. Rev. D 55 Ž1997. 5549.
w26x P. Cho, M. Misiak, D. Wyler, Phys. Rev. D 54 Ž1996. 3329.
w27x T. Goto, Y. Okada, Y. Shimizu, M. Tanaka, Phys. Rev. D 55 Ž1997. 4273.
w28x T. Goto, Y. Okada, Y. Shimizu, Phys. Rev. D 58 Ž1998. 094006.
w29x CLEO Collaboration, CONF98-17, ICHEP98, 1011.
w30x Y.G. Kim, P. Ko, J.S. Lee, Nucl. Phys. B 544 Ž1999. 64.
w31x S. Back, P. Ko, hep-phr9904283.
w32x L.J. Hall, V.A. Kostolecki, S. Raby, Nucl. Phys. B 267 Ž1986. 415.
w33x F. Gabbiani, A. Masiero, Nucl. Phys. B 322 Ž1989. 235.
w34x J.S. Hagelin, S. Kelley, T. Tanaka, Nucl. Phys. B 415 Ž1994. 293.
w35x E. Gabrielli, A. Masiero, L. Silvestrini, Phys. Lett. B 374 Ž1996. 80.
w36x J.A. Bagger, K.T. Matchev, R. Zhang, Phys. Lett. B 412 Ž1997. 77.
w37x M. Ciuchini et al., J. High Energy Phys. 10 Ž1998. 008.
w38x R. Contino, I. Scimemi, hep-phr9809437.
w39x F. Gabbiani, E. Gabrielli, A. Masiero, L. Silvestrini, Nucl. Phys. B 477 Ž1996. 321.
w40x J.A. Casas, S. Dimopoulos, Phys. Lett. B 387 Ž1996. 107.
w41x G. Colangelo, G. Isidori, J. High Energy Phys. 09 Ž1998. 009.
w42x N. Cabibbo, L. Maiani, Phys. Lett. B 79 Ž1978. 109.
w43x C.S. Kim, A.D. Martin, Phys. Lett. B 225 Ž1989. 186.
w44x P. Ball, V.M. Braun, Phys. Rev. D 49 Ž1994. 2472.
w45x V. Eletsky, E. Shuryak, Phys. Lett. B 206 Ž1992. 191.
w46x M. Neubert, Phys. Lett. B 322 Ž1994. 419.
144 E. Lunghi et al.r Nuclear Physics B 568 (2000) 120–144
Abstract
1. Introduction
fermions c " M w
1,6–9x: A T-duality with respect to X 9 sends Eq X 9 yEq X 9 and ™
straightforward way from its action on the world-sheet currents E " X M and world-sheet
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 8 4 - 7
146 S.F. Hassanr Nuclear Physics B 568 (2000) 145–161
cq9 ™ ycq9 , keeping all other variables unchanged. Hence, on the left-moving half of
the world-sheet theory, it can be regarded as a parity reflection along X 9 , while the
right-moving sector remains invariant. The action of such a transformation on the
left-moving Ramond ground state is represented by V 0 s G 11 G 9 as this operator sends
G 9 of the left-moving sector to Vy1 9 9
0 G V 0 s yG , consistent with the fact that G
9
is
9
the zero-mode of cq in the Ramond sector. The action of V 0 can now be absorbed in
the space-time spinors. For example, for the parameters of space-time supersymmetry
™
e " Žwhere the subscripts ‘‘"’’ refer to the world-sheet sectors in which the supersym-
metry acts., this leads to eq V 0 eq, while ey remains unchanged. One can also obtain
the action of T-duality on the gravitinos C " M from the invariance of their vertex
™ ™ ™
operators under T-duality. CqM contains the left-moving R ground state and hence
transforms as CqM V 0CqM while CyM contains a left-moving NS field cyM and
hence Cy9 yCy9 , Cyi Cyi .
In the general case of non-flat backgrounds with an isometry, say, along X 9 ,
T-duality no longer reduces to a parity transformation acting on left-moving Žor
right-moving. world-sheet variables alone. In fact, in general, it acts as a canonical
transformation affecting both left- and right-moving sectors of the world-sheet theory
w2,10,11x. Furthermore, in curved backgrounds, the relationship between the world-sheet
fermions and space-time Dirac algebra is not as straightforward as in flat space.
Therefore, it does not seem possible to obtain the T-duality action on space-time
fermions, or equivalently, on the Ramond sector, from world-sheet considerations alone.
In this paper, we study the action of T-duality on space-time spinors in type-II string
theories in the presence of NS–NS and R–R background fields. The spinors we consider
are the space-time supersymmetry transformation parameters e ", the two gravitinos
CM " and the two dilatinos l ". These results are then used to derive the T-duality rules
for the R–R fields, including the massive IIA case. Both backgrounds and spinors are
assumed to be independent of the coordinate X 9 along which T-duality is performed
Žwith the exception of type-IIB potentials dual to massive type-IIA..
The paper is organized as follows: In Section 2, the transformation of e " is obtained
by identifying a T-duality action on the local Lorentz frame associated with the
left-moving sector of the world-sheet theory. We also describe a set of variables in terms
of which the curved-space duality resembles the flat-space case. In Section 3, we
consider space-time supersymmetry transformations in type-II theories in NS–NS
backgrounds and determine the gravitino and dilatino T-duality transformations. These
are shown to be independent of R–R backgrounds. In Section 4, we use these
transformations to derive the T-duality rules for R–R field strengths and potentials,
emphasizing the locality of potentials in the massive type-IIA case. We then use
T-duality to give a simple derivation of the ‘‘mass’’-dependent terms in the Wess–
Zumino action for D-branes in massive IIA theory. Section 5 contains the conclusions.
Many of the formulas used in this paper are given in the appendices for convenience and
to insure consistency of conventions.
In this section we describe how T-duality acts on the spinorial index of space-time
fermions in type-II theories with background fields Žthe extension to other string theories
S.F. Hassanr Nuclear Physics B 568 (2000) 145–161 147
G˜ 9 i s yGy1
99 B9 i ,
B˜9 i s yGy1
99 G 9 i ,
G˜i j s Gi j y Gy1
99 Ž G 9 i G 9 j y B9 i B9 j . ,
B˜i j s Bi j y Gy1
99 Ž G 9 i B9 j y B9 i G 9 j . ,
2 f˜ s 2 f y lnG 99 . Ž 1.
Here, M, N are space-time indices in 10 dimensions. The backgrounds are assumed to be
independent of the X 9 coordinate along which T-duality is performed, but may depend
on the remaining coordinates which we label by X i with i s 0,1, . . . ,8. Throughout this
paper, a tilde denotes a field in the T-dual theory.
Let us decompose the 10 dimensional metric of type-II theories in terms of the
vielbeins, GM N s e M aha b e b N , where a,b are Lorentz frame indices. It is known that the
M M
T-dual theory contains two possible vielbeins that we denote by e˜Žy. a and e˜Žq. a , both
giving rise to the same T-dual metric G˜ M N w2,4,12x. Explicitly,
M M N M M N
e˜Žy. a s QyN e a , e˜Žq. a s QqN e a . Ž 2.
The matrices Q " that implement T-duality on the vielbeins are given Žalong with their
inverses. by
.G 99 .Ž G . B . 9 i .Gy1
99 yGy1
99 Ž G . B . 9 i
Q "s
0 19 0 , Qy1
" s
0 19
where 1 9 denotes the identity matrix in nine dimensions. The two vielbeins in the dual
0 , Ž 3.
geometry which does not directly see the string world-sheet. However, T-duality acts
differently on the two world-sheet sectors and one may expect it to transform the
corresponding vielbeins in different ways.1 That this is the origin of the difference
between e˜Žq. and e˜Žy. can be argued as follows: The left-moving and right-moving
world-sheet sectors are interchanged under the world-sheet parity transformation s
ys which also interchanges Qq and Qy w4x and, hence, the two vielbeins in the dual
™
M
theory. This suggests that e˜Žq. a is T-dual to the vielbein associated with the left-moving
M
sector of the original world-sheet theory, while e˜Žy. a is T-dual to the one associated
with the right-moving sector. This identification also gives a heuristic understanding of
the T-duality action Ž2. on the vielbeins: Note that in flat space, e Ma ’s appear as
M
wavefunctions for states created by Fourier modes of the world-sheet fermions c " . In
curved backgrounds, T-duality transforms these fermions to c˜ " s Q " N c " w2x, which is
M M N
consistent with the mapping of their associated wavefunctions to e˜Ž " . , as given by Ž2.,
depending on the world-sheet sector they come from.
The necessity of retaining both e˜Žq. and e˜Žy. in the dual theory is not evident if we
are dealing with bosonic fields alone. However, their presence is essential to insure the
consistency of the dual theory in the presence of space-time fermions, as will be seen in
the next section.2 This implies that we have to keep track of how the vielbeins
transform, depending on the world-sheet sector they originate in. Then, to reconcile the
results with the standard formulation of gravity with one set of vielbeins, we should use
Ž4. to re-express one of the vielbeins, say e˜Žq. in terms of the other one, i.e. e˜Žy. . In
™
other words, we have to augment the T-duality action on the left-moving vielbein by a
local Lorentz transformation, e Qq e Ly1 , so that it transforms to e˜y, rather than to
e˜q. This translates to the T-duality action on the spinor index that the left-moving
Ramond sector contributes to the space-time fields. Formulating the dual theory in terms
of e˜Žy. is natural since for self-dual backgrounds, Qy in Ž3. reduces to the identity
matrix and e˜Žy. s e without further field redefinitions ŽThough this is not the case with
e˜Žq. , choosing it will also lead to a physically equivalent description..
Consider the space-time supersymmetry transformation parameters e " and the Dirac
matrices G M s e M a G a in either IIA or IIB theory. The Majorana–Weyl spinors e "
are taken to be independent of X 9 and the subscripts ‘‘"’’ refer to their world-sheet
origin and not their space-time chirality which will depend on the theory and will be
specified later. After T-duality, we will have two possible sets of G-matrices,
G˜Žq.
M M
s e˜Žq. aG
a
, G˜Žy.
M M
s e˜Žy. a
aG . Ž 6.
Keeping track of their world-sheet origin, the spinors e " in the dual theory are
1
That T-duality could transform the vielbeins associated with the left- and right-moving world-sheet sectors
in different ways, is not evident from the transformation of the metric. This is because the T-duality action on
the metric is determined by the invariance of the energy–momentum tensor and not that of the world-sheet
Lagrangian.
2
That both vielbeins necessarily appear in the dual theory also follows from the T-duality action on
complex structures associated with extended world-sheet supersymmetry, in cases where the complex
structures could be constructed in terms of target-space Killing spinors Žfor example, in non-compact
Calabi–Yau in four dimensions w17x. as discussed in w2x.
S.F. Hassanr Nuclear Physics B 568 (2000) 145–161 149
in which case, the T-duality operation that takes IIA to IIB is the inverse of the one that
takes IIB to IIA, and the transformation squares to q1 on the spectrum. In the
following, we use the latter convention whenever a convention is explicitly specified.
The correctness of Eqs. Ž10. will be checked in the next section when we examine the
supersymmetry variations of gravitinos and dilatinos to extract their T-duality transfor-
mations.
Unlike the flat-space case, in non-self-dual backgrounds the canonical transformation
that implements T-duality acts on both world-sheet sectors. Explicitly w2x,
c˜ "Ms Q "
M N
Nc" ,
E " X˜ M s Q "
M N i M N
N E "X q c "Ei Q " N c " . Ž 12 .
These equations are non-trivial only for E " X˜ and ˜ 9
reducing to X˜ s X and
9
c" , i i
c˜ "
i
sc" i
for the rest. However, the invariance of ey may tempt one to search for
variables in terms of which T-duality in curved space has the same form as that in
flat-space, affecting only the left-moving sector. To find such variables, note that the
matrices Q ", which have a very simple upper triangular form, may be decomposed as
y1 0
Qqs A˜q Qys A˜y
y1
ž 0 /
1 9 Aq ,
y1
Ay .
Here A˜" are the same matrices as A ", but in the dual theory. These equations admit
many solutions, all with Ai" N s d Ni while A9" M are not uniquely determined Žfor
example, A9yM s A9qM s G 9 M r G 99 .. If we define new world-sheet fermionic and
(
bosonic variables,
M
S" s AM N
" Nc" ,
M
J" s AM N j M N
" N E "X q c " Ej A " N c " ,
then the canonical transformations Ž12. implementing T-duality take the flat-background
form
J˜q9 s yJq9 , S˜q9 s ySq9 ,
with Jy9 and Sy9 unchanged Ž J " i
s E " X i and S "
i
sc" i
are trivially invariant..
However, the Lagrangian in terms of the new variables does not look any simpler which
shows the basic difference between the self-dual Ž Qys 1., and the more general
non-self-dual cases, even though the transformations can be written is a similar form.
In this section we will derive the transformations of the type-II superstring gravitinos
C " M Žnot to be confused with the world-sheet spinors c "M . and dilatinos l " under
T-duality, by demanding compatibility between T-duality and space-time supersymme-
try. Again, the ‘‘"’’ subscripts refer to the world-sheet sector in which the spinor index
of the fermion, i.e. its Ramond component, originates and not to its space-time chirality.
All spinors are assumed to be independent of the coordinate X 9 along which T-duality
is performed. The T-duality action on these spinors is independent of the R–R fields,
S.F. Hassanr Nuclear Physics B 568 (2000) 145–161 151
which we set to zero in this section for convenience. The case of non-zero R–R fields
will be considered in the next section.
Let us first consider the supersymmetry variations of the gravitinos C " M . With e "
as the supersymmetry transformation parameters and in the absence of R–R fields, these
are given by
1
ž
d "C " M s EM q WM"a b G a b e "q . . . ,
/ Ž 13.
4
d " C. M s 0 q . . . Ž 14.
Here, ‘‘ . . . ’’ indicates the presence of 3-spinor terms that we do not write down
explicitly, but which will be automatically accounted for in our final result. WM"ab are the
torsionful spin-connections given by
1
WM"a b s wM a b . HM a b . Ž 15 .
2
The above transformations hold in both IIA and IIB theories, depending on the chirality
of the spinors. In our conventions, in IIB, e " and hence C " M have positive chirality
while in IIA, ey, CyM have positive chirality and eq, CqM have negative chirality.
The supersymmetry transformation generated by eq Ž ey . acts on the left-moving
Žright-moving. world-sheet sector by interchanging R and NS boundary conditions.
Therefore, the supersymmetry variations d " C " M convert R-NS states into NS–NS
states and do not get modified if R–R fields are switched on. Therefore, we expect that
the gravitino T-duality rules obtained from Eq. Ž13. are independent of R–R fields. The
same argument applies to dilatino T-duality rules.
Let us now consider the gravitino supersymmetry variations in the T-dual theory.
First, note that the dual theory contains two sets of torsionful spin-connections,
a a
corresponding to the two vielbeins e˜Žy. M and e˜Žq. M given by 2 . We denote these by
Ž .
W˜Žy. M a b and W˜Žq. M a b , respectively. One can verify that
" "
y y y1 N
W̃Žy. M a b s WNa b Ž Qq . M , Ž 16 .
q q y1 N
W̃Žq. M a b s WNa b Ž Qy . M . Ž 17 .
Since we have chosen to express the T-dual theory in terms of e˜Žy. , the supersymmetry
variations d " C˜ " M in the T-dual theory should be expressed in terms of W˜Žy. "
M ab
alone,
1
d " C˜ " M s E M q W˜ Ž"y . M a b G a b e˜ "q . . . .
ž / Ž 18.
4
To determine C˜ qM in terms of CqM , note that using the relation Ž4. between e˜Žq. and
e˜Žy. , we can write W˜Žq.
q ˜q
M a b in terms of WŽy. M a b as
a a
W˜Žq.
q a b ˜q c
M b Ga s WŽy. M d Ž L
y1
. c L d b Ga b q Ž Ly1 . c EM L c b Ga b
s W˜Žy.
q a y1 b y1
M b V Ga V q 4 V E M V . Ž 19 .
152 S.F. Hassanr Nuclear Physics B 568 (2000) 145–161
i i
Now, using Eqs. Ž10. and Ž16. – Ž19., along with the fact that Q " jsd j
Ž3. and
E 9 eys 0, it is easy to see that the variation Ž13. implies the one in the dual theory Ž18.
provided
N
dy C˜ yM s dy CyN Ž Qy1
q . Mq... , Ž 20.
N
dq C˜ qM s aŽ oyf . V dq CqN Ž Qy1
y . Mq... Ž 21.
Again, ‘‘ . . . ’’ denotes 3-spinor terms.
Let us now consider the supersymmetry variations of the dilatinos l " in the absence
of R–R fields,
1 1
d " l "s ž G ME M f . G M N K HM N K e "q . . . ,
/ Ž 22.
2 12
d " l .s 0 q . . . Ž 23.
These are again valid in both IIA and IIB theories. In IIB, both dilatinos have negative
chirality, while in IIA, ly has negative chirality and lq has positive chirality.
Switching on R–R fields does not affect Eq. Ž22.. In the T-dual theory, written in terms
a
of the vielbein eŽy. ˜
M , the variations d " l " are given by
1 1
d " l˜ "s ž G˜ ŽMy . E M f˜ . G˜ MŽ yN.K H˜M N K e˜ "q . . .
/ Ž 24.
2 12
Using f˜ s f y 12 lnG 99 and
G˜Ž .
MNK ˜
. HM N K s G
MNK
HM N K . 6 Gy1 .
99 G 9 Ž W9 a b G
ab
. " 6 Gy1 i
99 G E i G 99 , Ž 25 .
along with Eq. Ž10., one can see that the supersymmetry variations Ž22. and Ž24. are
compatible provided
dy l˜ ys dy lyy Gy1
99 G 9 dy Cy9 q . . . , Ž 26.
dq l˜ qs aŽ oyf . V Ž dq lqy Gy1
99 G 9 dq Cq9 . q . . . Ž 27.
Eqs. Ž20., Ž21. and Ž26., Ž27. give the T-duality transformations of the supersymme-
try variations dC " M and dl " to linear order in spinors and receive corrections cubic in
the spinors whose presence is indicated by ‘‘ . . . ’’. From these we can read off the
T-duality transformations of the gravitinos and dilatinos, in principle, only to linear
order in the spinors. However, as we will show, the linear order result is exact and in
fact, it dictates the form of the 3-spinor corrections to the T-duality maps for the
supersymmetry variations above. Thus, for the gravitinos C " M , we have the T-duality
transformations
y1 N
C̃yM s CyN Ž Qq . M ,
N
C̃qM s aŽ oyf . VCqN Ž Qy1
y . M , Ž 28 .
S.F. Hassanr Nuclear Physics B 568 (2000) 145–161 153
l˜ ys lyy Gy1
99 G 9 Cy9 ,
As shown above, the T-duality rules for space-time fermions do not depend on the
R–R fields. In this section we use these rules, along with the requirement of compatibil-
ity of T-duality with space-time supersymmetry, to determine the T-duality rules for
R–R fields and discuss some related issues. Most of the results in this section are not
new but are re-derived here in a unified and more convenient way. The IIArIIB
T-duality rules for R–R fields were derived in w18,19x by studying the supergravity
action and equations of motion Žalso see Ref. w20x. and in w21–23x by dimensional
reduction of the Wess–Zumino term in the D-brane world-volume action, both consider-
ing the bosonic sector alone. Our derivation of these rules here emphasizes the
compatibility of the T-duality conventions used for the R–R fields with those used for
the spinors. The T-duality rules relating IIB to the massive IIA theory were obtained in
w19,21,22x. Here we re-derive these rules for generic configurations, emphasizing how
154 S.F. Hassanr Nuclear Physics B 568 (2000) 145–161
potential non-localities in the T-duality rules for R–R potentials are avoided. We also
present a simple derivation of the ‘‘mass’’-dependent terms in the Wess–Zumino action
for ‘‘massive’’ IIA branes using T-duality.
In the presence of R–R backgrounds, the supersymmetry variations dq CqM and
dy CyM are still given by Ž13., while dy CqM and dq CyM are no longer zero and
receive contributions from R–R fields. The same is true for the dilatino variations
d " l .. The T-duality rules for the R–R fields can be obtained by considering any one of
these variations, say dy CqM . In type IIA theory, this variation is given by w24x Žsee
Appendix C for details.
1 1 1
dy CqM s e f F Ž 0. q G M 1 M 2 FMŽ2.1 M 2 q G M 1 M 2 M 3 M 4 FMŽ4.1 M 2 M 3 M 4 GM eyq . . . ,
8 2! 4!
Ž 30 .
where ‘‘ . . . ’’ denote 3-spinor terms as usual. F Ž0. s m is the mass parameter of massive
type-IIA theory and the field strengths F Ž n. for the massive theory are given by ŽC.3. in
the appendices. The usual massless IIA equations are obtained by setting m s 0. In
type-IIB theory the corresponding variation is given by w25x Žsee Appendix B for details.
1 1
dy CqM s y e f G M 1 F MŽ 11. q G M 1 M 2 M 3 F MŽ 31.M 2 M 3
8 3!
1
q G M 1 M 2 M 3 M 4 M 5 F MŽ 51.M 2 M 3 M 4 M 5 GM eyq . . . Ž 31.
2 Ž 5! .
It is convenient to write these two equations in the generic form
n
1 Ž y1.
dy CqM s ef Ý G M 1 . . . M n F MŽ n1.. . . M n GM eyq . . . Ž 32.
2 Ž 8. n n!
In exactly the same way as for the R–R vertex operator in flat space Žsee, for example,
Refs. w6–9x., the actual content of the above equation is determined by the chirality of
the space-time spinors: In type-IIB theory, both ey and CqM have positive chirality
and therefore the right-hand side contains only terms with even number of G-matrices
Žcorresponding to n s 1,3,5,7,9., whereas in IIA, ey and CqM have positive and
negative chiralities respectively and hence only terms with even n Ž n s 0,2,4,6,8,10.
enter the summation. Furthermore, using the G-matrix identity ŽA.2., the positive
chirality of ey implies that F Ž n. s yŽy1. nŽ ny1.r2 ) F Ž10yn.. This allows us to write the
summation in terms of F Ž n. with n ( 5 alone, recovering Ž30. and Ž31..
Let us now consider the above equation in the T-dual theory expressed in terms of the
a
vielbein e˜Žy. M,
n
1 ˜ Ž y1.
dy C˜ qM s ef Ý G˜ŽMy1.. . . M n F˜ MŽ n1.. . . M n G˜Žy. M e˜yq . . . Ž 33.
2 Ž 8. n n!
Using Eqs. Ž2., Ž6., Ž10. and Ž28., one can readily obtain the T-duality transformation for
the R–R field strengths as
n N1 Nn
F̃MŽ n.1 . . . M n s Ž y1 . aŽ oyf . Ž F9Ž nq1 . Ž ny1. y1
N1 . . . N n q nG 9w N1 FN 2 . . . N n x . Ž Qy . M1
y1
. . . Ž Qy . Mn .
Ž 34 .
S.F. Hassanr Nuclear Physics B 568 (2000) 145–161 155
where, aŽ oyf . denotes a convention-dependent sign as explained below Eq. Ž10.. Let us
now choose the convention Ž11. so that T-duality squares to 1 on R–R fields. Then,
using the form of Qy1y given in 3 , the above equation reduces to the component form
Ž .
F̃9Ž in.2 . . . i n s yaŽ AyB . F Ži 2ny1 . y1 Ž ny1.
. . . i n y Ž n y 1 . G 99 G 9w i 2 F9 i 3 . . . i n x , Ž 35 .
F˜iŽ1n. Ž nq1. ˜ Ž n.
i 2 . . . i n s yaŽ AyB . F9 i 1 . . . i n y nB9w i 1 F9 i 2 . . . i n x . Ž 36 .
aŽ AyB . is still arbitrary and could be chosen as either q1 or y1. The antisymmetriza-
tion denoted by the square bracket affects the indices i n and not the index 9. Since the
spinors were assumed to be independent of X 9 , Eq. Ž32. implies that F Ž n. should also
be independent of this coordinate.
The above T-duality rules for F Ž n. are valid for both massless and massive type-IIA
theories and can be iteratively integrated to yield the corresponding transformations for
the R–R potentials C Ž n.. Let us first consider duality between IIB and massless IIA. In
this case, F Ž0. s m s 0 and the field strengths are given by ŽB.10.. C Ž n. can be chosen to
be X 9-independent and under T-duality transform as w18–20,22,23x
C̃9Ž n. Ž ny1 . y1 Ž ny1.
i 2 . . . i n s aŽ AyB . C i 2 . . . i n y Ž n y 1 . G 99 G 9w i 2 C 9 i 3 . . . i n x , Ž 37 .
C˜iŽ1n.i 2 . . . i n s aŽ AyB . C9Ž nq1. ˜Ž n.
i 1 . . . i n y nB9w i 1C 9 i 2 . . . i n x . Ž 38 .
Let us now consider the massive-IIA case. For n s 0, Eq. Ž36. reduces to F˜ s Ž0.
yaŽ AyB . F Ž1. s yaŽ AyB . E 9 C Ž0.. As noticed in w19,21,22x, this implies that type-IIB
theory dualizes to the massive IIA theory with F˜ Ž0. s m, provided the IIB 0-form has an
X 9 dependence given by C Ž0. s yaŽ AyB . mX 9 q Cˆ Ž0., where the last term is indepen-
dent of X 9. Naively, one may expect that this X 9-dependence could lead to a similar
dependence for the IIA potentials, which should not be the case: Consider an X 9-depen-
dent function C Ž X 9 ., say, in the IIB theory leading to an X 9-dependent T-dual C˜Ž X 9 .
in IIA. Since the natural variable in the T-dual theory is X˜ 9 , which is related to X 9
through the canonical transformation Ž12., C˜ has to be expressed in terms of X˜ 9.
However, the relationship between X 9 and X˜ 9 is non-local, involving an integration
over the string world-sheet, and hence C˜ is a non-local function of X˜ 9. This problem
can be avoided if we arrange things such that the X 9-dependent C dualizes to an
X 9-independent C,˜ or Õice Õersa. Let us define
Cˆ Ž0. s C Ž0. q aŽ AyB . mX 9 ,
CˆMŽ2.1 M 2 s CMŽ2.1 M 2 q aŽ AyB . mX 9 BM 1 M 2 ,
CˆMŽ4.1 . . . M 4 s CMŽ4.1 . . . M 4 q 3aŽ AyB . mX 9 Bw M 1 M 2 BM 3 M 4 x ,
or more generally, using the notation of w22x, with C s Ý9ns 0 C Ž n.,
Cˆ s C q aŽ AyB . m X 9 e B . Ž 39 .
We give the C Ž2 p. in type-IIB a dependence on X 9 in such a way that Cˆ Ž2 p. are X 9
independent, while in type-IIA, Cˆ Ž2 pq1. s C Ž2 pq1. and are X 9 independent. Then, using
the T-duality rules for the field strengths Ž35., Ž36., along with Eqs. ŽB.10. for type-IIB
and ŽC.3. for the massive type-IIA, one can obtain the T-duality rules for the potentials.
These are still given by Ž37. and Ž38. but now with all C Ž2 p. replaced by Cˆ Ž2 p.. The X 9
independence of Cˆ Ž2 p. guarantees the X 9 independence of the IIA potentials C Ž2 pq1.,
156 S.F. Hassanr Nuclear Physics B 568 (2000) 145–161
preventing the appearance of non-localities. Note that while the massive T-duality rules,
written in terms of Cˆ Ž2 p., have the same form as the usual massless IIArIIB rules, the
two differ by m-dependent terms when written in terms of the actual R–R potentials
C Ž2 p..
The special X 9 dependence of C Ž2 p. can be easily understood when massive-IIArIIB
duality is regarded as a Scherk–Schwarz compactification to nine dimensions w19x: The
UŽ1. ; SLŽ2, R . transformation in IIB theory that gives the right X 9 dependence to C Ž0.,
by shifting it to C Ž0. y aŽ AyB . mX 9 Žcorresponding to p s s s 1, r s 0 and q s
yaŽ AyB . mX 9 in ŽB.5.., also produces the correct X 9 dependences in C Ž2. and C Ž4..
For m / 0, the Wess–Zumino terms in the IIA D-brane world-volume actions contain
m-dependent terms the forms of which were studied in w21,22x. We will now derive
these terms in a very straightforward way using T-duality: Let us start with the WZ
terms in the D-brane world-volume actions in type-IIB theory and express the potentials
C Ž2 p. in terms of Cˆ Ž2 p. as defined in Ž39.,
ŽIIB.
I WZ s Hv C e FyB s Hv Cˆ e Fy B y aŽ AyB . m
Hv X 9 eF . Ž 40 .
2p 2p 2p
When C Ž2 p. are chosen such that Cˆ Ž2 p. are X 9 independent, the dual theory is massive
ŽIIB.
type-IIA. Therefore, on dimensional reduction, I WZ should reduce to the corresponding
action for massive IIA theory, including the m-dependent terms. As mentioned earlier,
the massive T-duality rules relating Cˆ Ž2 p. and C Ž2 pq1. have exactly the same form as the
massless T-duality rules relating C Ž2 p. and C Ž2 pq1.. Therefore, the analysis for the
massless case, for example, as presented in Ref. w23x or Ref. w22x, implies that the first
term on the right-hand side of Ž40. dualizes to the standard WZ term in type-IIA which
is common between the massive and massless theories. The m-dependent terms are
contained in the second integral on the right-hand side of Ž40.. Let us identify X 9 with a
world-volume direction, say s , along which the theory is reduced. Taking F to be
Abelian Ž F s dV ., we write e F s Ý p p!1 dŽ V n F py1 . so that
1
H X 9 eFs Ý X 9 Es Ž V n F py1 . n d s q Ea Ž V n F py1 . n d x a ,
H
v2 p p p! v 2p
Ž 41 .
a 9
where x are the world-volume directions transverse to s . Since X does not depend
on x a , the second term in the integrand leads to a surface term and can be dropped. The
Ž2 p y 1.-form V n F py 1 in the first term now only has non-zero components along x a ,
and not along s . Finally, remembering that X 9 s s and dropping a surface term, the
integration over s leads to
1 py 1
ŽIIA .
I WZ s H C e FyB q aŽ AyB . m Ý V n Ž dV .
H , Ž 42 .
v 2 py1 p p! v 2 py1
which reproduces the m-dependent terms of Refs. w21,22x Žwe have ignored the D-brane
tension that can be easily inserted into the equations..
5. Conclusions
We have shown that, besides acting on the space-time indices of fields, T-duality also
has an action on the local Lorentz frame associated with the left-moving sector of the
S.F. Hassanr Nuclear Physics B 568 (2000) 145–161 157
world-sheet theory by twisting it with respect to the one associated with the right-mov-
ing sector. This twist translates to the T-duality action on the spinor index originating in
the left-moving Ramond sector, and fixes the T-duality action on the space-time
supersymmetry parameters. The gravitinos and dilatinos also contain an NS sector
contribution to their T-duality transformations which is obtained by demanding consis-
tency between T-duality and space-time supersymmetry. It is also shown that the
T-duality action on the spinors is independent of the R–R backgrounds. The result is
then used to re-derive the R–R T-duality rules. We discuss the case of the massive IIA
theory in more detail, showing that there exist variables in terms of which the massive
T-duality rules for the R–R potentials have the same form as the massless ones,
manifestly avoiding non-local relations between potentials. Using this, we give a simple
derivation of the ‘‘mass’’-dependent terms in the WZ actions for the associated
D-branes based on T-duality. In most part, we have explicitly retained the convention
dependence of the T-duality action on the Ramond sector. In one convention, T-duality
squares to 1, while in the other, it squares to Žy1. FL on the spectrum, where FL is the
left-moving space-time fermion number.
There are certain similarities between T-duality in flat and curved backgrounds. At
the world-sheet level, as we have shown, there exist variables in terms of which the
canonical transformation that implements T-duality in curved space, has the flat-space
form. One can also check that the T-duality rules for gravitinos and R–R fields in
curved backgrounds easily follow from their flat-space vertex operators, provided we
interpret these operators as curved space objects Žwhich, of course, is not really the
s
case.. For example, consider the gravitino emission operator ; SqsCqM cyM in flat
space. To interpret this as a curved-space expression, we define the spin-filed Sqs as an
s
operator that generates space-time supersymmetry transformations of CqM with param-
s
eter eq , but now in curved-space. Sq and eq will have opposite space-time chiralities
and Sq eq is invariant under T-duality. Then using the curved-space T-duality rules for
eq Ž10. and cyM Ž12. in the flat-space vertex operator, we recover the T-duality action
Ž28. on CqM . Similarly, the R–R T-duality rules can be obtained from the correspond-
X X
ing flat-space vertex operator, e f Sqs F s s SysX , where F s s is the R–R bi-spinor.
Note added
In a recent paper w26x, which appeared after this paper was completed, the authors
consider the SO Ž d,d,Z . action on R–R fields from the point of view of low-energy
effective action. One should be able to obtain the same results in our approach, after
determining the SO Ž d,d,Z . action on gravitinos, and then using space-time supersym-
metry. The results are expected to look the same as the single T-duality case with Q "
and V appropriately generalized to SO Ž d,d ..
Acknowledgements
¨
I would like to thank C. Angelantonj, I. Antoniadis, A. Dhar, K. Forger, D. Ghoshal,
D. Jatkar, B. Pioline, A. Sagnotti, and A. Sen for many useful discussions during the
course of this work.
158 S.F. Hassanr Nuclear Physics B 568 (2000) 145–161
ydCŽ 2 . Re Ž F 1 . F 1r'2 1 1 1
ž dB
s
/ ž1
Im Ž F .
sh
F 2r'2 / ž / , with h s
'2 ž yi qi
. /
The dilaton and the R–R scalar C 0 are identified by parameterizing the matrix V such
that
Vy1 Vq1 1 yt e i u yt eyi u
U s hV ' h
ž Vy2 Vq2 / s
(2t 2
ž e iu eyi u
,
/ Ž B.4 .
S.F. Hassanr Nuclear Physics B 568 (2000) 145–161 159
with t s C Ž0. q i ey f . We can set u s 0 by fixing the UŽ1.. In these conventions, the
SLŽ2, R . action takes the form
t ™ prtt qqsq , ž CŽ 2 .
B / ™ ž pr qs / ž CB Ž 2.
/ . Ž B.5 .
C Ž4. also transforms such that F Ž5. is invariant Žsee ŽB.10... Having identified the
dilaton, we define the string frame metric and associated spinors as
i
CM s e f r8 CMX q ž GMX lX ) . /
4
1 1
d " l "s ž G ME M f . G M 1 M 2 M 3 HM 1 M 2 M 3 e "q . . . ,
/ Ž B.6 .
2 12
1 1
d. l "s e f "G M F MŽ 1 . q
ž G M 1 M 2 M 3 F MŽ 31.M 2 M 3 e .q . . . ,
/ Ž B.7 .
2 12
1 1
d "C " M s EM q ž 4 ž wM a b .
2 / /
HM a b G a b e "q . . . , Ž B.8 .
1 1
d. C " M s
8 ž
e f .G M 1 F MŽ 11. y
3!
G M 1 M 2 M 3 F MŽ 31.M 2 M 3
1
.
2 Ž 5! . /
G M 1 . . . M 5 F MŽ 51. . . . M 5 GM e .q . . . Ž B.9 .
n!
FMŽ n.1 . . . M n s n E w M 1CMŽ ny1.
2 . . . Mnx
y Hw M 1 M 2 M 3 CMŽ ny3.
4 . . . Mnx
. Ž B.10 .
3! Ž n y 3 . !
The gravitino and dilatino supersymmetry variations in type-IIA theory are given in
w24x for massive IIA. When written in terms of appropriate variables, they lead to the
160 S.F. Hassanr Nuclear Physics B 568 (2000) 145–161
usual massless IIA equations when the mass parameter is set to zero. In the standard
string theory normalizations for the fields, these equations take the form
1 1
dlX s G X ME M f y G X M N P HM N P G 11 e X
2 12
1 3
q 5 e 5f r4 F Ž 0 . y e 3 f r4G X M NF MŽ 2N. G 11
8 2!
1
q e f r4G X M N P Q F MŽ 4N. P Q e X q . . . , Ž C.1 .
4!
1 X 1 y f r2
dCMX s E M q wM ab G
ab
q e Ž GMX NP Q
y 9d w NM G X P Q x . HN P Q G 11 e X
4 96
1 1 1
q y e 5f r4GMX F Ž 0 . y e 3 f r4 Ž GMX NP
y 14d w NM G X P x . FNŽ2.P G 11
32 2 2
1 20
q e f r4 GMX
ž NP Q R
y d w NM G X P Q R x FNŽ4.P Q R e X q . . .
/ Ž C.2 .
4 3
Here, a prime indicates the use of the Einstein metric, ‘‘ . . . ’’ denote 3-spinor terms and
the field strengths F Ž n. are given by
F Ž0. s m ,
FMŽ2.N s 2 E w M CNŽ1.x q mBM N ,
1 1 1
d. C " M s e f F Ž 0. " G M 1 M 2 FMŽ2.1 M 2 q G M 1 M 2 M 3 M 4 FMŽ4.1 M 2 M 3 M 4 GM e .q . . .
8 2! 4!
Ž C.5 .
The variations d " l " and d " C " M are still given by Eqs. ŽB.6. and ŽB.8. though now,
ey, CyM and lq have positive chirality and eq, CqM and ly have negative
space-time chirality.
d " GM N s 2 e " GŽ MC " N . , d " BM N s "2 C " w M GN x e " , d "f s l"e " .
Ž C.6 .
Here, GM N is the string metric and Ž . denotes symmetrization with unit weight. These
equations are valid in both IIA and IIB.
References
Abstract
Starting from the Ginsparg–Wilson relation, a general construction of chiral gauge theories on
the lattice is described. Local and global anomalies are easily discussed in this framework and a
closed expression for the effective action can be obtained. Particular attention is paid to the
non-abelian gauge anomaly, which is shown to be related to a local topological field on the lattice
representing the Chern character in 4 q 2 dimensions. q 2000 Elsevier Science B.V. All rights
reserved.
1. Introduction
In abelian chiral gauge theories the gauge anomaly is proportional to the topological
charge density and its topological significance is hence relatively easy to understand. As
has recently become clear w2–7x, the same is true on the lattice if the lattice Dirac
operator D satisfies the Ginsparg–Wilson relation w1x
g 5 D q Dg 5 s aDg 5 D. Ž 1.1 .
For any value of the lattice spacing a, this identity implies an exact symmetry of the
fermion action, which may be regarded as a lattice version of the usual chiral rotations.
Moreover the axial anomaly Žwhich coincides with the gauge anomaly in the abelian
case. arises from the non-invariance of the fermion integration measure under these
transformations and can be shown to be a topological field, i.e. the associated charge
does not change under local deformations of the gauge field.
¨
E-mail address: luscher@mail.desy.de ŽM. Luscher..
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 3 1 - 2
¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179 163
This has now led to a construction of abelian chiral gauge theories on the lattice,
which complies with all the basic requirements including exact gauge invariance w8x. The
fermion multiplet has to be anomaly free for this to work out, but otherwise there are no
restrictions and the anomaly cancellation can be proved on the basis of the topological
nature of the anomaly alone w9x.
If the gauge group is not abelian, the gauge anomaly assumes a more complicated
form and its topological interpretation is not immediately clear. An important clue is
provided by the Stora-Zumino descent equations w10–12x, which allow one to pass from
the Chern character in 4 q 2 dimensions Žan expression proportional to the third power
of the gauge field tensor. via the Chern–Simons term in 4 q 1 dimensions to the
anomaly in 4 dimensions. It is then possible to show w13x that the anomaly is related to
the index theorem in 4 q 2 dimensions and to the existence of certain non-contractible
two-spheres in the space of gauge orbits Žfor a review and an extensive list of references
see Refs. w14,15x..
In this paper a general formulation of chiral gauge theories on the lattice is proposed.
The basic ansatz is the same as in the case of the abelian theories considered in Ref. w8x,
but there are some new elements which make the approach more transparent. In
particular, a direct connection between the gauge anomaly and a local topological field
representing the Chern character in 4 q 2 dimensions will be established. Apart from
providing an interesting link to the earlier work on the gauge anomaly in the continuum
limit, the significance of this result is that the exact cancellation of the anomaly on the
lattice is reduced to a local cohomology problem which appears to be quite tractable.
In the lattice theories studied in this paper the gauge field couples to a multiplet of
left-handed fermions, which transform according to some unitary representation R of the
gauge group G. We do not impose any restrictions on R or G at this point except that G
should be a compact connected Lie group. As usual the gauge field is represented by
link variables UŽ x, m . g G, where x runs over all lattice points and m s 0, . . . ,3 labels
the lattice axes. The lattice is assumed to be finite with periodic boundary conditions in
all directions.
As already mentioned, the use of a lattice Dirac operator D satisfying the Ginsparg–
Wilson relation is a key element of the present approach to chiral gauge theories. While
the details of the definition of D are largely irrelevant, Neuberger’s operator w5,6x is an
obvious choice in this context, since it is relatively simple and has all the required
technical properties. In particular, the locality of the operator and the differentiability
with respect to the gauge field is rigorously guaranteed if the gauge field satisfies the
bound
51 y R U Ž p . 5 - e for all plaquettes p, Ž 2.1 .
where UŽ p . denotes the product of the link variables around p and e any fixed positive
number less than 301 w16x.
In the following we shall take it for granted that the gauge field action restricts the
functional integral to this set of fields. This can be achieved through a modified
164 ¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179
plaquette action, for example w8x. As far as the continuum limit in the weak coupling
phase is concerned, lattice actions of this type should be in the same universality class as
the standard Wilson action, because the bound Ž2.1. constrains the gauge field fluctua-
tions at the scale of the cutoff only and does not violate any fundamental principle such
as the locality or the gauge invariance of the theory.
Chiral fields may now be defined in a natural way following the steps previously
described in Refs. w8,17–19x. One first observes that the operator gˆ 5 s g 5 Ž1 y aD .
satisfies the relations
† 2
Ž gˆ 5 . s gˆ 5 , Ž gˆ 5 . s 1, Dgˆ 5 s yg 5 D. Ž 2.2 .
The fermion action
SF s a4 Ý c Ž x . Dc Ž x . Ž 2.3 .
x
thus splits into left- and right-handed parts if the chiral projectors for fermion and
anti-fermion fields are defined through
1 1
P̂ "s Ž 1 " gˆ 5 . , P "s Ž 1 " g 5 . , Ž 2.4 .
2 2
respectively. In particular, by imposing the constraints
P̂y c s c , c Pqs c , Ž 2.5 .
the right-handed components are eliminated and one obtains a classical lattice theory
where a multiplet of left-handed Weyl fermions couples to the gauge field in a
consistent way.
An interesting point to note here is that the space of gauge fields satisfying the bound
Ž2.1. decomposes into disconnected topological sectors w20–22x. In the non-trivial
sectors the index of the lattice Dirac operator w4,7x is in general different from zero and
it turns out that the dimensions of the spaces of left-handed fermion and anti-fermion
fields are then not the same. Fermion number violating processes can thus take place,
exactly as expected from the semi-classical approximation in continuum chiral gauge
theories w8,18x.
To complete the definition of the lattice theory, the functional integration measure for
left-handed fermions needs to be specified. The principal difficulty here is that the
constraint Ž2.5. depends on the gauge field. This leads to a non-trivial phase ambiguity
in the measure and eventually gives rise to the gauge anomaly.
To make this clearer let us suppose that Õj Ž x ., j s 1,2,3, . . . , is a basis of complex-
valued lattice Dirac fields such that
P̂y Õj s Õj , Ž Õk ,Õj . s d k j , Ž 3.1 .
the bracket being the obvious scalar product for such fields. The quantum field may then
be expanded according to
c Ž x . s Ý Õj Ž x . c j , Ž 3.2 .
j
¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179 165
where the coefficients c j generate a Grassmann algebra. They represent the independent
degrees of freedom of the field and an integration measure for left-handed fermion fields
is thus given by
Dw c x s Ł d cj . Ž 3.3 .
j
the measure changes by the factor det Q which is a pure phase factor since the
transformation matrix Q is unitary.
In the following two sets of basis vectors Õj and Õ˜ j are considered to be equivalent if
they are related to each other through Eq. Ž3.4. with det Q s 1. Choosing a fermion
integration measure amounts to specifying an equivalence class of bases. A given basis
thus represents the associated measure, but it should not be confused with the measure
which is a much simpler object. In particular, any two fermion measures coincide up to
a gauge field dependent phase factor. The question of how to fix this phase will occupy
us throughout the rest of this paper. For the time being we assume that some particular
choice has been made and proceed with the definition of the theory.
In the case of the anti-fermion fields the subspace of left-handed fields is independent
of the gauge field and one can take the same orthonormal basis Õ k Ž x . for all gauge
fields. The ambiguity in the integration measure
D c s Ł d ck , c Ž x . s Ý c k Õk Ž x . Ž 3.5 .
k k
is then only a constant phase factor. Fermion expectation values of any product O of the
basic fields may now be defined through
² O : F s D w c x D c O eyS F .
H Ž 3.6 .
In the non-trivial topological sectors a constant weight factor should be included in this
formula w8x, but for brevity this factor is omitted here since we shall almost exclusively
be concerned with the vacuum sector. The fermion partition function in this sector is
²1: F s det M , Mk j s a4 Ý Õ k Ž x . DÕj Ž x . , Ž 3.7 .
x
where SG denotes the gauge field action, Z the partition function and DwU x the standard
integration measure for gauge fields on the lattice.
166 ¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179
4. Locality condition
We are now left with the problem to fix the phase of the fermion integration measure.
Evidently this should be done in such a way that the locality of the theory is preserved
and it also seems reasonable to demand that the measure is smoothly dependent on the
gauge field. In this section these conditions are given a precise meaning and a few key
formulae are derived which will later prove useful when we discuss the gauge anomaly.
The dependence of the fermion measure on the gauge field is best studied by
considering variations1
dh U Ž x , m . s ahm Ž x . U Ž x , m . , hm Ž x . s hma Ž x . T a , Ž 4.1 .
of the link field. Requiring the measure to be smooth means that in the neighbourhood
of any given gauge field there exists a differentiable basis Õj of left-handed fields which
represents the measure in the way explained above. The change dh Õj of the basis vectors
and the linear functional
L h s i Ý Ž Õj , dh Õj . . Ž 4.2 .
j
1
Without loss the gauge group G may be assumed to be a subgroup of UŽ n. for some value of n. The
generators T a are then anti-hermitian matrices and the field components hma Ž x . are real.
¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179 167
is not a product of local measures. An important point to note however is that the
effective action is the only place where the non-trivial structure of the measure shows
up. In particular, in the vacuum sector the fermion integrals ² O : F are equal to the
partition function ²1: F times a factor which is independent of the measure.
We are thus led to require that the current jmŽ x . which is induced by the fermion
measure is a local expression in the gauge field. The measure term L h then assumes the
form of a local counterterm, i.e. the interaction vertices which arise from the gauge field
dependence of the fermion measure are local. The locality of the theory is thus preserved
and the arbitrariness in the phase of the measure is greatly reduced.
5. Gauge anomaly
Another fundamental requirement on the fermion measure is that it should not break
the gauge symmetry. In particular, the effective action should be gauge-invariant. On the
lattice the group of gauge transformations is connected and it thus suffices to consider
infinitesimal gauge transformations. One should not conclude from this that there are no
global anomalies, but as will become clear later they arise in slightly different ways than
expected from the semi-classical analysis.
Infinitesimal gauge transformations are generated by lattice fields v Ž x . with values
in the Lie algebra of the gauge group. The corresponding variations of the link field are
obtained by substituting
hm Ž x . s y=m v Ž x . Ž 5.1 .
in Eq. Ž4.1., where the gauge-covariant forward difference operator =m is given by
1 y1
=m v Ž x . s U Ž x ,m . v Ž x q a m
ˆ . U Ž x ,m . yv Ž x. Ž 5.2 .
a
Žm
ˆ denotes the unit vector in direction m .. Taking the transformation behaviour of the
Dirac operator into account,
dh D s R Ž v . , D , Ž 5.3 .
the terms on the right-hand side of Eq. Ž4.4. are easily worked out and for the gauge
variation of the effective action the result
a
dh ln det M s ia4 Ý v a Ž x . A a Ž x . y =m ) jm
½ 5
Ž x. , Ž 5.4 .
x
ia
AaŽ x. s tr g 5 R Ž T a . D Ž x , x . 4 , Ž 5.5 .
2
is thus obtained. In these equations =m ) denotes the gauge-covariant backward differ-
ence operator and DŽ x, y . the kernel representing the Dirac operator in position space.
The trace is taken over Dirac and flavour indices only.
We now show that AŽ x . converges to the covariant gauge anomaly in the classical
continuum limit. The calculation is practically the same as in the case of the axial
168 ¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179
anomaly which has been studied in detail in Refs. w23–25x. One begins by representing
the link field through
1
U Ž x , m . s Pexp a ½H 0
d t Am Ž x q Ž 1 y t . a m
ˆ. ,5 Ž 5.6 .
The fields Ok Ž x . which occur in this series are traces of RŽT a . times a polynomial of
dimension k in Rw AmŽ x .x and its derivatives. Moreover they must have the proper
transformation behaviour under the symmetries of the lattice theory.
From Eq. Ž5.5. it follows that AŽ x . is a gauge-covariant pseudo-scalar field which
changes sign when the fermion representation R is replaced by its complex conjugate.
Taking this into account, it is easy to convince oneself that all terms Ok Ž x . with
dimension k ( 3 have to be equal to zero. In the continuum limit we are then left with
the term
d Ra b c s 2 itr R Ž T a . R Ž T b . R Ž T c . qR Ž T c . R Ž T b . 4. Ž 5.9 .
The constant c1 s y1r128p 2 does not depend on the gauge group and can be
calculated in the UŽ1. theory with a single fermion in the fundamental representation
w23–27x. Since the gauge anomaly coincides with the axial anomaly in this case, the
number may also be inferred from the index theorem w4,7,9x.
Returning to the question posed at the beginning of this section, the results obtained
above show that the effective action is gauge-invariant if Žand only if.
a
=m ) jm Ž x . sAaŽ x . . Ž 5.10 .
In other words, the phase of the fermion measure should be chosen so that the associated
current satisfies this equation. Together with Eq. Ž4.4. the gauge invariance of the
effective action moreover implies that jmŽ x . has to be gauge-covariant.
As is well-known one cannot have both, locality and gauge invariance, unless the
anomaly cancellation condition
d Ra b c s 0 Ž 5.11 .
is fulfilled. There is more than one way to prove this in the present framework, a quick
argument being that an expansion similar to Eq. Ž5.7. must exist in the case of the
current jmŽ x . too, since it is required to be local and smoothly dependent on the gauge
field. In the continuum limit the anomaly is hence equal to the divergence of a covariant
local current which is a polynomial of dimension 3 in the gauge potential and its
¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179 169
derivatives. This is only possible if the anomaly vanishes at a s 0, i.e. if Eq. Ž5.11.
holds.
6. Integrability condition
So far we have assumed that the current jmŽ x . is obtained from a given fermion
measure through Eqs. Ž4.2. and Ž4.5.. We now show that any prescribed current
satisfying a certain integrability condition arises from a measure in this way. The
relation between the measure and the current is thus invertible and one may adopt the
point of view that the latter is the fundamental object.
To derive the integrability condition we need to study the change of phase of the
fermion measure along smooth curves
Ut Ž x , m . , 0 ( t ( 1, Ž 6.1 .
in the space of gauge fields. As discussed in Section 4, the measure term L h tells us
how the phase varies when the gauge field is deformed in a particular direction. The
total change of phase along any given curve is thus given by the Wilson line
1 y1
W s exp i ½H 0
d t Lh ,5 ahm Ž x . s E t Ut Ž x , m . Ut Ž x , m . . Ž 6.2 .
In general this phase is non-integrable, i.e. the Wilson lines around closed curves are not
necessarily equal to 1. To work this out we introduce the projector
Q1w1Wy1 if j s 1,
Õj s
½ Q 1 wj otherwise,
Ž 6.7 .
where wj denotes a fixed basis at the reference point and W the Wilson line Ž6.2.
computed from the given current. This basis is path-dependent, but the associated
measure is not, because any two curves Ut and U˜t form a closed loop and the
integrability condition Ž6.6. then implies that the unitary transformation relating the
basis vectors Õj and Õ˜ j has determinant 1. Taking this into account, it is easy to show
that the fermion measure defined by the basis Ž6.7. has all the properties mentioned
above.
The construction of chiral gauge theories on the lattice is thus reduced to the problem
of finding a local current which fulfils the integrability condition Ž6.6. and the
requirement of gauge invariance. Once this is achieved, the theory is completely
specified up to a constant phase factor in each topological sector. Note that it suffices to
define the current for all gauge fields satisfying the bound Ž2.1. since only these
contribute to the functional integral.
7. Fermion determinant
Using the results obtained in the preceding section, we are now in a position to derive
a closed expression for the fermion determinant in terms of the Dirac operator and the
current jmŽ x .. We shall then be able to make contact with Kaplan’s approach to chiral
gauge theories w28,29x and the earlier work of Alvarez-Gaume´ et al. w31–33x and Ball
and Osborn w34–37x on the effective action in the continuum theory.
Suppose U0 is an arbitrary reference field in the vacuum sector and let wj be a basis
of left-handed fermion fields at this point. As explained above, the basis Ž6.7. then
represents the fermion measure at any other point U in the vacuum sector, up to a
constant phase factor. If we insert this basis in Eq. Ž3.7., the formula
det M det M0† sdet 1yPq qPq DQ1 D†0 4 Wy1 Ž 7.1 .
is obtained, where D 0 and M0 denote the Dirac operator and fermion matrix at the
reference point 2 . All other notations are as in Eq. Ž6.7.. To fully understand this result
the following remarks may be helpful.
Ža. The integrability condition guarantees that the right-hand side of the equation does
not depend on the curve Ut which has been chosen to connect U s U1 with U0 . In other
words, the path-dependence of the determinant and the Wilson line W precisely cancel
each other. Note incidentally that the constant phase ambiguity of the measure drops out
in the product of determinants.
2
Another expression for the effective action, involving the Dirac operator and the current jmŽ x . only, may
be obtained by integrating Eq. Ž4.4. along any particular path. The idea has recently been pursued by Suzuki
w39x in abelian chiral gauge theories.
¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179 171
Žb. Using equations Ž2.2. and Ž6.5. it is easy to check that DQ1 D†0 commutes with
g 5 . The determinant of this operator in the subspace of left-handed anti-fermion fields
coincides with the determinant on the right-hand side of Eq. Ž7.1. and the chiral nature
of the expression is thus evident.
Žc. In Section 8 we shall show that the current jmŽ x . vanishes in the classical
continuum limit if the fermion multiplet is anomaly free. The Wilson line W conse-
quently does not contribute to the fermion determinant in this limit.
Žd. Concerning the operator Q t we note that the differential equation Ž6.4. may be
rewritten in the form
E t Q t s 12 a w g 5 E t Dt , Pt x Q t , Dt sD Us U t . Ž 7.2 .
Close to the classical continuum limit, and when acting on fermion fields with
frequencies far below the lattice cutoff, the operator is hence equal to 1 up to terms of
order a. In particular, in Eq. Ž7.1. the operator Q1 only affects the contribution of the
high-frequency modes and it may, therefore, be regarded as part of the lattice regulariza-
tion prescription for the chiral determinant.
Že. So far the reference field U0 has been assumed to be fixed and the factor det M0†
on the left-hand side of Eq. Ž7.1. is then just a constant. Since U0 and U1 are
interchangeable in this equation, another option is to interpret U0 as a second gauge field
and det M0† as the determinant arising from a multiplet of right-handed fermions. In the
present framework the formulation of such left–right symmetric chiral gauge theories
thus appears to be particularly natural.
Having clarified the structure of Eq. Ž7.1., we now briefly discuss how the formula
relates to Kaplan’s approach to chiral gauge theories w28,29x. In the version proposed by
Shamir w30x, this approach starts from a gauge theory in 4 q 1 dimensions with a
multiplet of massive Dirac fermions, where the additional coordinate is assumed to
range between 0 and T with Dirichlet boundary conditions on the gauge and fermion
fields. The reduction to 4 dimensions is then achieved by noting that the Dirac operator
admits chiral surface modes whose interactions at large T are described by an effective
chiral gauge theory.
Contact with the present framework can now be made if we identify the fifth
coordinate, scaled to the range w0,1x, with the parameter t of the path Ut . The path thus
becomes a gauge field in 4 q 1 dimensions with boundary values U0 and U1. In
particular, we can compare the fermion determinant in 4 q 1 dimensions with the
determinant on the right-hand side of Eq. Ž7.1. and it is then conceivable that they agree
in the limit where the lattice spacing in the fifth dimension is sent to 0 and T to infinity.
Preliminary studies suggest that this is indeed what happens if the lattice Dirac operator
in 4 q 1 dimensions is chosen appropriately, but the details are complicated and will not
be presented here. It is interesting to note, however, that from the point of view of the
higher-dimensional theory, the Wilson line W in Eq. Ž7.1. amounts to adding a local
counterterm to the gauge field action. The term cancels the dependence of the fermion
determinant on the gauge field in the interior of the space-time volume and thus allows
one to reduce the theory to 4 dimensions, the dynamically relevant degrees of freedom
being the boundary values U0 and U1.
In the continuum limit the phase of the fermion determinant is known to be
proportional to the h-invariant of the Dirac operator in 4 q 1 dimensions w31–37x. One
172 ¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179
considers the massless Dirac operator in this case and uses Pauli–Villars regularization
or analytic continuation methods to define the determinant, but otherwise the setup is the
same as the one described above. The h-invariant, Kaplan’s approach and the results
obtained here are hence closely related to each other. As discussed by Kaplan and
Schmaltz w38x, the formula for the effective action of Refs. w31–37x may in fact be
directly derived from the fermion integral in 4 q 1 dimensions.
To complete the construction of the lattice theory we still need to prove that there
exists a local current jmŽ x . satisfying the requirement of gauge invariance and the
integrability condition. The aim in the following lines is to determine the general
solution of this problem in the classical continuum limit. Along the way an important
simplification is achieved by considering the integrability condition in its differential
form. Global anomalies and the relation between this equation and the gauge anomaly
are further topics which will be addressed.
The differential form of the integrability condition,
3
In equations Ž8.1. and Ž8.2. the variations h and z are assumed to be independent of the gauge field.
Further terms proportional to dh z and dz h have to be included if this is not the case.
¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179 173
hmŽ x . and zmŽ x . are restrictions to the lattice of some differentiable vector fields. The
locality properties of the Dirac operator w16x then imply
`
iTr Pˆy dh Pˆy , dz Pˆy
½ 5 ;™ Ý a ky 4 Hd 4 x Ok Ž x . , Ž 8.4 .
a 0 ks0
where the fields Ok Ž x . are traces of polynomials in Rw AmŽ x .x, RwhmŽ x .x, Rw zmŽ x .x and
their derivatives. Taking the symmetries of the expression into account, this leads to the
result
and after substituting the asymptotic forms, Eqs. Ž5.8. and Ž8.5., the relation
c 2 s y4c1 s 1r32p 2 Ž 8.8 .
is thus obtained. In passing we remark that the anomalous conservation law Ž5.10. is
consistent with the integrability condition Ž8.1. in the sense that the combination of these
equations does not lead to further constraints on the current jmŽ x . apart from the fact
that it should transform covariantly under gauge transformations.
An important conclusion which can be drawn at this point is that jmŽ x . s 0 is an
acceptable choice of the current in the classical continuum limit if the fermion multiplet
is anomaly free. Both, the requirement of gauge invariance and the integrability
condition in its differential form, are then satisfied. There is in fact no other sensible
solution since it is impossible to construct a gauge-covariant polynomial of dimension 3
in the gauge potential AmŽ x . and its derivatives which transforms as an axial vector
current. As far as the classical continuum limit is concerned, the theory is thus
completely specified up to a constant phase factor in each topological sector.
Most gauge field configurations which contribute to the functional integral are not as
smooth as those considered in the classical continuum limit and a general strategy to
determine the current jmŽ x . thus needs to be developed if one is interested in
constructing the complete theory. As a first step in this direction, we here show that the
anomalous conservation law Ž5.10. and the integrability condition Ž8.1. can be mapped
to a local cohomology problem whose solution is known to all orders in the lattice
spacing.
174 ¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179
To explain which type of cohomology problem we are heading to, let us consider the
pure gauge theory on R n with gauge group G and suppose q Ž z . is a gauge-invariant
polynomial in the gauge potential Aa Ž z . and its derivatives. Such fields are called
topological if
n
Hd z d q Ž z . s 0 Ž 9.1 .
for any variation d Aa Ž z . of the gauge potential with compact support. Using the
Stora-Zumino descent equations w10–12x, it is possible to prove that all topological fields
are of the form
q Ž z . s c Ž z . q Ea k a Ž z . , Ž 9.2 .
where cŽ z . is a linear combinations of Chern monomials
ca 1 . . . a 2 m t a1 . . . a m Faa11a 2Ž z . . . . Faa2mmy 1a 2 mŽ z . Ž 9.3 .
and k a Ž z . a gauge-invariant local current w44–49x. The tensor ca 1 . . . a 2 m in this expres-
sion has to be totally anti-symmetric and t a1 . . . a m should be invariant under the adjoint
action of the gauge group.
The classification of topological fields modulo divergence terms is a particular case
of a local cohomology problem in which the gauge symmetry plays an important role. ˆ
From this point of view the Chern monomials represent the non-trivial cohomology
classes. Depending on the gauge group, a basis of linearly independent Chern monomi-
als is usually not difficult to find Žsee Ref. w50x for example..
Returning to the lattice, our aim in the following paragraphs is to construct a
topological field in 4 q 2 dimensions whose cohomology class is trivial if Žand only if.
there exists a local current jmŽ x . with the required properties. The added dimensions are
continuous, i.e. we are concerned with lattice gauge fields
U Ž z , m . g G, z s Ž x ,t , s . , m s 0, . . . ,3, Ž 9.4 .
which depend on two additional real coordinates t and s. We also introduce gauge
potentials A t Ž z ., A s Ž z . along these directions and define the associated field tensor
through
Ft s Ž z . s E t A s Ž z . y Es A t Ž z . q A t Ž z . , A s Ž z . . Ž 9.5 .
Under arbitrary gauge transformations in 4 q 2 dimensions, the covariant derivative
DrA U Ž z , m . s Er U Ž z , m . q A r Ž z . U Ž z , m . y U Ž z , m . A r Ž z q a m
ˆ. Ž 9.6 .
then transforms in the same way as UŽ z, m . and a similar statement also applies to the
derivatives
DrA Pˆys Er Pˆy q R Ž A r . , Pˆy Ž 9.7 .
of the projector Pˆy Žhere and below the index r stands for t or s ..
We now consider the field
1 1
q Ž z . s yitr ½ 4
gˆ 5 DtA Pˆy , DsA Pˆy q
4
DtA Pˆy , DsA Pˆy gˆ 5
1
q R Ž Ft s . gˆ 5 Ž x , x . ,
2 5 Ž 9.8 .
¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179 175
where w . . . x Ž x, y . denotes the kernel representing the operator enclosed in the square
bracket, at fixed t and s, in the same way as DŽ x, y . represents the Dirac operator. The
trace is taken over the Dirac and flavour indices only and q Ž z . is thus a gauge-invariant
local field in 4 q 2 dimensions. It is also not difficult to check ŽAppendix B. that q Ž z .
satisfies
a4 Ý d t d s d q Ž z . s 0
H Ž 9.9 .
x
for all local variations of the link variables UŽ z, m . and the potential A r Ž z ., i.e. it is a
topological field.
By definition q Ž z . is in the trivial cohomology class if it is equal to the divergence of
a gauge-invariant local current. We now show that this implies the existence of a local
current jmŽ x . in 4 dimensions satisfying the anomalous conservation law Ž5.10. and the
integrability condition Ž8.1.. The converse is also true, but we shall not prove this here.
So let us suppose that
q Ž z . s Em) km Ž z . q E t k s Ž z . y Es k t Ž z . , Ž 9.10 .
where kmŽ z ., k t Ž z . and k s Ž z . are gauge-invariant polynomials in
E tnEsm U Ž z , m . , E tnEsmA r Ž z . , n q m 0 1, Ž 9.11 .
with coefficients that are local fields on the lattice depending on UŽ z, m . and A r Ž z .. In
Eq. Ž9.10. the symbol Em) denotes the backward difference operator and the last two
terms have been written in the form of a curl for reasons to become clear below.
Under scale transformations of the coordinates t and s, the monomials contributing
to kmŽ z ., k t Ž z . and k s Ž z . transform homogeneously if A r Ž z . is transformed in the usual
way. It is then immediately clear that all terms on the right-hand side of Eq. Ž9.10. with
scale dimensions different from those of the left-hand side have to cancel. Without loss
we may, therefore, assume that the Ž t, s .-dimensions of kmŽ z ., k t Ž z . and k s Ž z . are Ž1,1.,
Ž1,0. and Ž0,1. respectively. Taking the gauge symmetry into account, this implies
k r Ž z . s a4 Ý l ar , m Ž w . K ra, m Ž w, z . , w s Ž y,t , s . , Ž 9.12 .
y
y1
a l r , m Ž w . s DrA U Ž w, m . U Ž w, m . , Ž 9.13 .
where K r, mŽ w, z . is a gauge-covariant local expression in the link variables.
After summing over all lattice points, Eq. Ž9.10. thus assumes the form
1
s iTr Pˆy DtA Pˆy , DsA Pˆy y
½ 5
R Ž Ft s . gˆ 5 , Ž 9.14 .
2
jra, m Ž w . s a 4 Ý K ra, m Ž w, z . . Ž 9.15 .
x
When studying chiral gauge theories one is often led to consider gauge and fermion
fields in higher dimensions. It may well be that this is just a matter of mathematical
convenience. On the other hand, the experience should perhaps be taken as an indication
that chiral gauge theories are merely effective descriptions of the low-energy modes of a
more fundamental theory in 4 q 1 or 4 q 2 dimensions. The approach of Kaplan and
Shamir w28–30x provides a concrete model for this and it would be important to work
out its relation to the framework presented in this paper in full detail, following the lines
sketched in Section 7.
¨
M. Luscherr Nuclear Physics B 568 (2000) 162–179 177
In the continuum limit the gauge anomaly cancels if the tensor d Ra b c vanishes. The
same is presumably true on the lattice, but a complete proof of this has only been given
in abelian theories so far w8x. For non-abelian gauge groups the current status is that the
anomaly cancellation has been established to all orders of an expansion in powers of the
lattice spacing. Moreover, as explained in Section 9, the problem has been reduced to
classifying the topological fields in 4 q 2 dimensions, which does not seem to be an
impossible task.
Global anomalies are a separate issue which requires control over the first homotopy
group of the space of lattice gauge fields satisfying the bound Ž2.1.. One may be able to
achieve this by noting that such fields are continuous on the scale of the lattice spacing
up to gauge transformations. The topology of the space of gauge orbits is hence
expected to be essentially the same as in the continuum theory.
I am grateful to Raymond Stora for guiding me through the literature on local
cohomology in gauge theories. Thanks also go to Oliver Bar ¨ and Isabel Campos for
helpful discussions on global anomalies and to Peter Weisz for a critical reading of a
first draft of this paper.
Appendix A
where S is some unitary transformation matrix satisfying S < ts0 s 1. When inserted in
Eq. ŽA.1. this yields
L h s yi E tln det S Ž A.3 .
and the Wilson loop W is thus equal to det S < ts1. At t s 1 the matrix S represents the
action of Q1 in the subspace of left-handed fields. In particular, its determinant
coincides with the determinant of Q1 in this subspace, i.e. with the right-hand side of
Eq. Ž6.6..
Appendix B
As a consequence we have
for any local deformation of the gauge field. To evaluate the right-hand side of this
equation, we make use of the identity
References
Abstract
PACS: 11.25.-w
1. Introduction
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 2 1 - X
J. Troostr Nuclear Physics B 568 (2000) 180–194 181
2. Reduced action
Van Baal studied a constant diagonal field strength configuration for UŽ N . Yang–
Mills on a four-torus w3,4x. We will later look at UŽ N . Yang–Mills on a general even
dimensional torus, but we will first show how this fits in the Yang–Mills approximation
to D-brane actions. We will closely follow the analysis and notation of w3,4x and w8,9x in
the next few sections. The bosonic part of the D s 9 q 1 N s 1 UŽ N . Yang–Mills
action can be written in terms of the ten-dimensional field strength Gmn Ž m s 0, . . . ,9.,
Gmn s Em Bn y En Bm q i Bm , Bn
as
1
S9q 1 s y d 10 x Tr Gmn
H 2
.
4
Reduced to p q 1 dimensions the action becomes Ž a s 0, . . . , p and m s p q 1, . . . ,9.
1 2 2
S pq 1 s y d pq1 x Tr Ga2b y w fm ,f n x q 2 Ž Ea fm q i w Ba ,f m x . ,
H ž / Ž 1.
4
where all fields depend only on the coordinates x b and the scalars and gauge fields are
f m s Bm Ž x b . , Ba s Ba Ž x b . . Ž 2.
The dimensionally reduced gauge transformations read
B ™VB V a
y1
y i VEa Vy1 , f ™ Vf V
m
y1
, Ž 3.
with V g UŽ N ..
In this section we determine the action for the fluctuations around a general diagonal
and constant background field strength G 0 . We fix notations as follows:
Ba s Ba0 q Aa ,
0
Gab s Ea Bb0 y Eb Ba0 q i Ba0 , Bb0 ,
182 J. Troostr Nuclear Physics B 568 (2000) 180–194
Da s Ea q i Ba0 ,. ,
Gab s Ga0b q Fa b ,
Fab s Da Ab y Db Aa q i Aa , Ab , Ž 4.
where the gauge field fluctuations are denoted A, with corresponding field strength F,
and we have defined a background covariant derivative D. We choose the background
gauge fixing condition Da A a s 0. When the background field strength is diagonal and
constant Žsolving the classical equations of motion., we find the following action for the
fluctuations:
1 pq 1 0 2
Ssy Hd x Tr Ž Gab y 2 Aa D 2Aa y 4 iAa Ga0b , Ab y 2 fm D 2fm
4
2
q 2 i Ž Da Ab y Db Aa . Aa , Ab q 4 i fm Da w fm , Aa x y Aa , Ab
2 2
y 2 w Aa , f m x y w f m , f n x . Ž 5.
It will be convenient to expand the fluctuations in a Lie algebra basis for UŽ n., namely,
ŽTi .a b s d i a d i b and Ž e i j .a b s d i a d jb for i / j:
Aa s aai Ti q bai j e i j ,
fm s c mi Ti q d mi j e i j . Ž 6.
From the reality properties of the gauge fields and the scalars we find that a and c are
real, and that bai j s baji ) and d mi j s d mji ) are complex.
4. Compactification
Now we will focus on our main interest. Consider 2 n spatial dimensions of the
D-brane to be wrapped on a torus of dimension 2 n with radii R aˆ s Laˆ r2p , where
Ž aˆ s 1, . . . ,2 n.. In what follows, we only consider non-trivial field strengths in these
directions. Magnetic flux quantization2 and the fact that the background field strength is
i
diagonal implies then that we can write our background in terms of the integers nab ˆ ˆ,
i
0
nab
ˆˆ
ˆ ˆ s 2p
Gab Ti . Ž 7.
Laˆ Lbˆ
We can choose the background gauge field to be
i b ˆ
nab
ˆˆx
Baˆ0 s yp Ti . Ž 8.
Laˆ Lbˆ
Next, we substitute this form of the background into the action Ž5. and concentrate on
the terms quadratic in the fluctuations. Since we will analyse the spectrum of small
2
We restrict to configurations where the branes are wrapped only once around, for example, the odd cycles.
Generalizing this to wrapping more than once these cycles changes the quantization condition. See, for
instance, Refs. w7–9x.
J. Troostr Nuclear Physics B 568 (2000) 180–194 183
fluctuations around the background, we neglect the interactions between the fluctuations
from now on. The quadratic action is
1
S Ž2. s y d pq1 x aai M0 aai q c mi M0 c mi q 2 Ý bai j) Ž Mi j da b y 4p iJai bj . bbi j
H ž ž
2 i-j
qd mi j) Mi j d mi j //, Ž 9.
where we have defined J, a measure for the difference in field strength in sector i and j
Žon brane i and brane j ., and the mass operators M0 and Mi j ,
i j
ij
nab
ˆ ˆ y naˆ bˆ
J, ab
ˆ ˆs
Laˆ Lbˆ
M0 s yEa2 ,
2 2 2 2
E0 E 2 nq1 Ep Eâ ˆ
Mi j s ž / ž y / y... y ž / ž y yp Jaˆi bjˆ x b / . Ž 10 .
i i i i
0
0
0
0
0
0
where f 1 0 f 2 0 . . . 0 f n 0 0.
.
.
.
0
0
0
0
yf n
0 ,
fn
0
0
Next, we introduce a complex structure on the torus as follows:
Ž 11 .
1
z s Ž z1 , z 2 , . . . , z n . s
'2 Ž x 1 y ix 2 , . . . , x 2 ny1 y ix 2 n . ,
1
A s Ž A z1 , A z 2 , . . . , A z n . s
'2 Ž A1 q iA 2 , . . . , A 2 ny1 q iA 2 n . ,
1
Ž Ez1 , . . . , Ez n . s ' Ž Ex 1 q i Ex 2 , . . . ,Ex 2 ny 1 q i Ex 2 n . . Ž 12 .
2
184 J. Troostr Nuclear Physics B 568 (2000) 180–194
where Ž k s 1, . . . ,n.. Thus we can write a nice expression for the relevant non-trivial
mass operator Mi j Ž10.,
M s a k ,a†k 4 . Ž 16 .
Before we can diagonalize the mass operator, it is crucial to discuss the boundary
conditions the eigenfunctions have to satisfy. They encode the topological data of the
background gauge field.
6. Boundary conditions
We recall that the background field strength and gauge fields were given by
0
2p
Gab
ˆ ˆs naiˆ bˆ Ti ,
Laˆ Lbˆ
p ˆ
Baˆ0 s y naiˆ bˆ x b Ti . Ž 17 .
Laˆ Lbˆ
The transition functions V of the gauge bundle over the torus have to satisfy
ˆ ˆ bˆ
p i
Baˆ0 x b q Lbˆ s Vbˆ Ba0 Ž x b . Vby1
ž / y1 0
ˆ y i Vbˆ Ea Vbˆ s Baˆ Ž x . y n ˆT . Ž 18 .
Lâ aˆ b i
We choose them to be
Vaˆ sexp yp inaiˆ bˆ x bˆ TirLbˆ .
ž / Ž 19 .
The boundary conditions following from the background gauge field induced transition
functions read, using Eqs. Ž3. and Ž6.,
ž ˆ /
aai x b q Lbˆ s aai Ž x b . ,
ˆ
ij
bai j ž x ˆ q L ˆ / sexp ž yp in ˆ ˆ x ˆrL ˆ / b
b
b bg
g
g
ij
a Ž xb.,
c mi ž x ˆqL ˆ / sc Ž x ˆ . ,
b
b
i
m
b
ij
d mi j ž x ˆ q L ˆ / sexp ž yp in ˆ ˆ x ˆrL ˆ / d
b
b bg
g
g
ij
m Ž x bˆ . . Ž 20 .
J. Troostr Nuclear Physics B 568 (2000) 180–194 185
After the preliminary work of writing the non-trivial mass operator M in a harmonic
oscillator form in terms of complex coordinates Ž16., and discussing the boundary
conditions the fluctuations have to satisfy Ž20., we determine the spectrum and the
eigenfunctions. We only discuss in detail the non-trivial case of off-diagonal modes.
Moreover, the difference between gauge field off-diagonal modes and scalar off-diago-
nal modes is a mere constant in the eigenvalues, so we can treat them in one go. We
follow the analysis of Refs. w3,4x and w8,9x. ŽSee Ref. w10x for an early mathematical
treatment.. The ground state x 0 of the mass operator in the off-diagonal sector we take
to satisfy the usual conditions
1 E p
0 s ak x 0 s
i ž Ez k
q
2
hk l zl x0 ,
/ Ž 21 .
and it has to obey the boundary conditions for the off-diagonal sector
ij
x 0 x bˆ q Lbˆ s ey p i nb̂g x x 0 Ž x bˆ . .
g
r Lg
ž / Ž 22 .
The differential equation is immediately solved in terms of the hermitian form H Ž13.
and a general holomorphic function f,
p
x 0 Ž z . s ey 2 H Ž z , z . f Ž z . , Ž 23 .
but the treatment of the boundary conditions is more involved. The boundary condition
for the fluctuation x 0 implies a non-trivial boundary condition for the holomorphic
function f Ž z .. It will be convenient to introduce some extra machinery to write these
boundary conditions in terms of objects well known in the mathematical literature w11x
on theta functions w3,4x. In terms of the hermitian form H Ž z,w ., we define an
antisymmetric form EŽ z,w .,
1
baˆ aˆ b 2 2 ˆ ˆ
H Ž z Ž x . ,w Ž y . . s x Ž yJ . ab
ˆ ˆ y q ix Jaˆ bˆ y ,
1 bˆ
E Ž z ,w . s Im H Ž z ,w . s Ž H Ž z ,w . y H Ž w, z . . s x aˆ Jab
ˆˆy . Ž 24 .
2i
We introduce the notation
1
q s Ž q1 , . . . ,qn . '
'2 Ž m1 L1 y im 2 L2 , . . . ,m 2 ny1 L2 ny1 y im 2 n L2 n . , Ž 25 .
a Ž q . sexp p i ž Ý
aˆ - bˆ
maˆ naˆ bˆ mbˆ . / Ž 26 .
These objects make it easy to write down the boundary condition for the fluctuations x
for windings around the torus for any number of times in different directions,
x 0 Ž z q q . s x 0 Ž z . eyp i EŽ q , z .a Ž q . . Ž 27 .
186 J. Troostr Nuclear Physics B 568 (2000) 180–194
The boundary conditions the holomorphic function f Ž23. satisfies can then finally be
written in terms of the hermitian form H and the second degree bicharacter a ,
p
f Ž z q q . s f Ž z . a Ž q . ep H Ž z , q .q 2 H Ž q , q . . Ž 28 .
Now comes the pay-off for introducing the appropriate mathematical machinery. These
holomorphic functions f are theta functions on T 2 n w11x. They span a vector space of
dimension < Pf Ž nab
ˆ ˆ . This space of theta functions is the space of ground state
.<
fluctuations around the given gauge field background. They can be written down
explicitly and we do so in the appendix.
The higher modes are given by acting with the creation operators on the ground state.
They automatically satisfy the boundary conditions. It is clear from Ž16. then that the
spectrum of off-diagonal scalar field fluctuations is given by the harmonic oscillator
formula
n
l s 2p Ý Ž 2 m iq1. f i Ž 29 .
is1
and after a further trivial diagonalization Žcompare Ž9.. for the off-diagonal gauge fields
we get the shifted spectra
n
l"
k s 2p Ý Ž 2 m iq1. f i " 2 f k . Ž 30 .
is1
8. Summary
We summarize the spectrum and eigenfunctions for diagonal, off-diagonal and gauge
field and scalar fluctuations in Table 1. We use the notations e Ž z k . s 1 Ž0,0, . . . ,1,y
'2
i, . . . ,0,0. for the eigenvectors of J Ž11., V for the volume of the torus, i, j g
1, . . . , N 4 ;ij g 1, . . . , N Ž N y 1.4 ;k g 1, . . . ,n4 ; p g Z 4 ;m g N n ;ri g 0, . . . ,e i y 14 . All
of these notations are straightforward except for the components ri and e i for which we
Table 1
Eigenfunctions and eigenvalues
Scalar Fluctuations Eigenvalues
Diagonal c li Ti 2
2p pâ
f lp,i Ý â ž /
Lâ
Off-diagonal d li j e i j
fi / 0 f lm, r,i j 2p w Ý nis1 Ž 2 m i q1 . f i x
Gauge field Fluctuations Eigenvalues
Diagonal aaiˆ Ti 2
2p pâ
fâp,i Ý â ž /
Lâ
Off-diagonal baˆi j e i j
fi / 0 e z kf m, r,i j 2p w Ý nis1 Ž 2 m i q1 . f i y2 f k x
e z kf m, r,i j 2p w Ý nis 1 Ž 2 m i q1 . f i q2 f k x
J. Troostr Nuclear Physics B 568 (2000) 180–194 187
refer to the appendix. Suffice it to remark that the degeneracy of the off-diagonal
ij .
fluctuations is given by Pf Ž nab ˆ ˆ s Ł i e i in sector ij, and the space of theta functions is
indexed by r. We have, moreover,
1 2p i p x r L
f p ,i s
'V e
aˆ aˆ aˆ T ,
i
m1 mn
f m , r ,i j s Ž a†1 . . . . Ž a†n . r m1 ! . . . m n ! xr e i j .
( Ž 31 .
ij
We only catalogued the case where all f are different from zero and H is
non-degenerate. The eigenvalues and eigenfunctions can also be classified easily in the
other cases.
3
We do not consider the case where one of the field strengths vanishes, although it is easily incorporated in
our framework.
188 J. Troostr Nuclear Physics B 568 (2000) 180–194
the four-torus the condition for stability, namely self-duality, coincides with the condi-
tion for preservation of supersymmetry. For higher tori this is not the case. The space of
stable gauge field configurations is for higher tori much larger than the space of
supersymmetric gauge field configurations.
c s c i Ti q c k l e k l , Ž 36 .
where c k l s c l k ) and k / l. The relevant Dirac equation for the fermionic modes is
easily derived from the fermionic part of the Yang–Mills action quadratic in the
fluctuations. As for the bosonic case, we can analyse the spectrum of the Dirac operator
on T 2 n to find the mass spectrum in the non-compact directions. To that end, we
analyse the equation4
Eâ ˆ
Ž i ng 1 . . . 2 n . gaˆ ž y p Jaˆkblˆ x b c k l s mc k l .
/ Ž 37 .
i
We concentrate on the non-trivial off-diagonal components in sectors kl. The standard
trick to find the spectrum of the fermionic mass operator is to square it,
2
Eâ ˆ
gaˆ ž y p Jaˆkblˆ x b / s Ý a k ,a†k 4 y 2p if 1 g 12 y . . . y 2p if n g 2 ny1,2 n ,
i k
where we used Ž11.. It is then easy to determine the spectrum of the fermionic mass
operator by projecting onto eigenspinors of g 2 ky1,2 k ,
Implicitly, we have made use of the fact that the off-diagonal fermions satisfy the same
boundary conditions as the bosons 5. Drawing on the results in the previous sections, the
eigenfunctions can then also easily be determined. Remark that the fermions all have a
4
The appearance of the chirality operator is to ensure that the non-compact and the compact part of the
Dirac operator commute. It does not play a crucial role in our analysis since it does not change the Dirac
algebra w12x.
5
We do not want to break more supersymmetry by adding non-trivial monodromies on top of the ones
induced by the background.
J. Troostr Nuclear Physics B 568 (2000) 180–194 189
certain helicity associated to the magnetic fields in the directions 12,34, . . . and each
component of the magnetic field is responsible for a Zeeman splitting of the energy
levels.
In this section we take a closer look at the bosonic and fermionic zero modes that
often play a crucial role in applications. We start by describing the fermionic zero modes
in greater detail. It is clear from the analysis in the previous section that only the
following projected spinor has zero modes:
kl
c kl
cqq . . . qs Ž 1 q ig 1 g 2 . Ž 1 q ig 3 g4 . . . . Ž 1 q ig 2 ny1 g 2 n . . Ž 39 .
2n
The signs of the projection operators reverse for sector lk, since the field strengths are
opposite in that sector. Explicitly, the differential equations and boundary conditions in
sectors kl and lk read
a i cqk l. . . qs 0, Ž 40 .
a )i cyl k . . . ys 0, Ž 41 .
kl
c k l x bˆ q Lbˆ s ey p
ž / i nb̂g x g r Lg
c k l Ž x bˆ . . Ž 42 .
From these equations we determine the total number of massless fermionic modes,
namely 16 Ž kl .
2 n Ý k - l Pf naˆ bˆ complex zero modes. In certain cases, there is a shorter route to
get the number of fermionic zero modes, via index theory. As a preliminary, we indicate
the 2 n-dimensional chirality of the spinors. It is clear that
Ž 1 y i ng 1 . . . g 2 n . cqk l. . . qs 0, Ž 43 .
n
Ž 1 y Ž yi . g 1 . . . g 2 n . cyl k . . . ys 0, Ž 44 .
such that, for n s 2 k even, the zero modes have the same chirality in the two sectors,
while for n s 2 k q 1 odd, they have opposite chirality. Note that this strokes with Ž40.,
Ž41. and the fact that the complex conjugate representation of the Weyl representation is
the original one for SO Ž4 k ., and of different chirality for SO Ž4 k q 2..
Using index theory, we can learn the difference in number of massless fermionic
modes of positive and negative chirality. If the fermionic zero modes all have the same
chirality, index theory predicts the total number of fermionic zero modes. We will use
the index theorem for the twisted spin complex on a flat manifold for the adjoint
representation w17,18x
where
j
Fadj
ch Ž Fadj . s Ý Tr j
j j! Ž 2p .
is the Chern class evaluated in the adjoint representation and n " is the number of
positive, negative chirality zero modes. In the background Ž17. the integral is easily
evaluated on T 2 n ,
nqy nys Ý Pf nakˆ lbˆ .
ž / Ž 46 .
kl
For d s 4 k q 2 the sum over all sectors is zero, the contributions from sector Ž kl .
Žstrings going one way. cancelling the contribution from sector Ž lk . Žstrings going the
other way.. Indeed, from the analysis in the previous section we know that the number
of zero modes of positive chirality equals the number of zero modes of negative chirality
in this case. For d s 4 k we find, in our conventions, only zero modes of positive
chirality, and index theory counts Ý k l Pf Ž nakˆ lbˆ . complex fermionic zero modes in d s 4 k.
Taking into account the multiplicity of the zero modes originating in ten dimensions, we
find 16 Ž k l . non-constant complex spinor zero mode components, as before.
2 n Ý k - l Pf naˆ bˆ
This straightforwardly extends the well-known results in four dimensions w19,20x.
The number of massless fermionic and massless bosonic modes differs in supersym-
metric configurations, and at first sight it is difficult to see how they form a representa-
tion of supersymmetry when it is partially unbroken. Nevertheless, they do. We discuss
this slightly puzzling feature in this subsection. The unbroken supersymmetry transfor-
mations rules are given by the dimensionally reduced formulae of N s 1 SYM in ten
dimensions. In ten dimensions the formulae read
i
d Ama s egm c a , Ž 47 .
2
dc a s y 14 Fmna g mne . Ž 48 .
Consider first the case where the unbroken supersymmetry is given by
yg 12 e s g 34 e s . . . s g 2 ny12 n e . Ž 49 .
Starting out with a fermionic zero mode satisfying
kl
c kl
cqq . . . qs Ž 1 q ig 1 g 2 . Ž 1 q ig 3 g4 . . . . Ž 1 q ig 2 ny1 g 2 n . , Ž 50 .
2n
we easily see from Eqs. Ž47., Ž49., Ž50. and m s m that it will never transform into a
massless scalar. That is consistent with the spectrum we found earlier. When n 0 3, we
find only non-trivial gauge field components in the 1 and 2 direction. For gauge field
components in the other directions, the projection condition on the parameter e and the
zero mode c make sure that the variation vanishes. This is again consistent with what
we found earlier. It can easily be checked that on T 4 a similar analysis yields complex
bosonic zero modes in directions 1,2 and 3,4. Moreover, for the cases where the
J. Troostr Nuclear Physics B 568 (2000) 180–194 191
The theory we have been discussing is the low-energy theory of D-branes in a trivial
background and with only small Ž‘non-relativistic’. gauge fields excited. Indeed, string
theory should be seen as the high-energy completion of the theories we studied.
Nevertheless, the approximation to string theory we discussed was often used to
understand results in string theory. We will briefly review some of the applications of
the results that we obtained that have already been made in the literature and that can be
coherently presented and extended in our framework.
In w7x the fluctuation spectrum on T 4 in the Yang–Mills approximation was
compared to the string theory fluctuation spectrum and the role of the Žnon-abelian.
Dirac–Born–Infeld action in resolving the discrepancy was clarified. The precise form
of the non-abelian Dirac–Born–Infeld action remained unclear. Our analysis could be
useful for further studying fluctuation spectra from the different points of view along the
lines of w7x, for higher branes on higher tori. Note also that the explicit form for a
tachyonic fluctuation was used in w7x to discuss tachyon condensation intuitively. That
discussion is now easily extended to higher branes.
The condition for the background gauge fields to preserve supersymmetry is familiar
in D-brane physics, especially in its T-dual form. To repeat this well-known point, it is
sufficient to give an archetypical example. Consider the following configuration: a pair
of D8-branes compactified on T 8 with a constant field strength on the first D8-brane
F12 0 F34 0 F56 0 F78 0 0. When F12 s F34 q F56 q F78 , supersymmetry is conserved.
T-dualize this configuration over directions 2,4,6,8 to obtain a pair of D4-branes at
angles. The angles are given by the following formulae6 : F12 s tan f 1 , F34 s tan f 2 ,
F56 s tan f 3 , F78 s tan f4 . Working at small angle, or taking into account the modifica-
tions the Born–Infeld action induces in the Yang–Mills theory in the spirit of w7x, we
find that the condition coincides with the well-known one for rotated branes w14–16x.
Note though that for the case n s 4 we saw a physical distinction between the case
where f 1 s f 2 q f 3 q f 4 and f 1 q f 4 s f 2 q f 3 . The difference between the two cases is
probably related to the mechanism of the creation of a string in the D0–D8 system Žor
D4–D4 system., but we do not pursue this here.
As noted before, on higher tori, the space of stable gauge field configurations is much
larger than the space of supersymmetric configurations. The same applies therefore to all
kinds of D-brane constructions in gauge theories. The results on T 4 for the Yang–Mills
theory were extensively discussed in for instance w5–7x. Stability of some special
6
We ignore some constant factors.
192 J. Troostr Nuclear Physics B 568 (2000) 180–194
Acknowledgements
It is a pleasure to thank Ben Craps, Frederik Denef, Marc Massar, Frederik Roose,
Alex Sevrin and especially Walter Troost for useful discussions, and Wati Taylor and
Pierre Van Baal for friendly correspondence.
To find the explicit form of the theta functions we need some more machinery w11x.
We go to a canonical ŽFrobenius. basis of the lattice using an SLŽ2 n,Z. transformation,
°0 0 e1
.
0
.
¶
0 0 . .
. .
0 0 0 en
ns , Ž A.1 .
ye1 0 0 0
. .
. . 0 0
. .
¢0 ye n 0 0 ß
J. Troostr Nuclear Physics B 568 (2000) 180–194 193
where the e i are positive integers, each e i dividing the next e iq1. We choose a C-basis
as follows:
j Ž nq1. j Ž2 n.
z s z˜1 q . . . qz˜n s Uz˜ , Ž A.2 .
e1 en
where j â denotes the canonical basis of the lattice. By X we denote the real vector
space generated by j nq 1 , . . . , j 2 n. Since Im H s E Ž24. is zero on X = X, we find that
h˜ s U † hU is real and symmetric. We define then the symmetric C-bilinear form S
uniquely associated with H
˜˜
S Ž z ,w . s yw˜ T hz Ž A.3 .
and the quasi-hermitian form Q
Q Ž z ,w . s H Ž z ,w . q S Ž z ,w . . Ž A.4 .
Moreover, we need the period matrix t defined in terms of the quasi-hermitian form Q:
i
t i j ' Q Ž j Ž i. , j Ž j. . . Ž A.5 .
2
Further definitions are required, namely the bicharacter ep i BŽ q, q . with respect to the
canonical basis j :
q s q˜â j â ,
n
B Ž q,q . s Ý e i q˜i q˜iqn Ž A.6 .
is1
b Ž q . s e 2 p i Ý Ž m k q̃ kq ny l k q˜ k . . Ž A.7 .
is 1
y1
Ž mqr ..t Ž pqe y1 Ž mqr ..q 2 p iŽ pqe y1 Ž mqr ..PŽ z̃ql .
ur Ž z . s Ý ep iŽpq e , Ž A.8 .
pgZ n
References
Abstract
We investigate the consequence of the energy–momentum conservation law for the holo-
graphic S-matrix from AdSrCFT correspondence. It is shown that the conservation law is not a
natural consequence of conformal invariance in the large N limit. We predict a new singularity for
the four-point correlation function of a marginal operator. Only the two-point scattering amplitude
is explicitly calculated, and the result agrees with what is expected. q 2000 Elsevier Science B.V.
All rights reserved.
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 5 6 - 2
196 M. Li r Nuclear Physics B 568 (2000) 195–207
Of course, for the time being we are not yet that powerful. On the contrary, the
Maldacena conjecture w8x has been used to make predictions about the SYM theory.
Although we do not know much about the non-perturbative S-matrix, a few principles
are certainly applicable. Traditionally, very general principles such as Lorentz invari-
ance, unitarity and analyticity constitute strong constraints on the S-matrix. We expect
that these constraints transform into the ones on a subset of correlation functions in
SYM via the holographic S-matrix ansatz. Our purpose in this paper is to point out that
the simplest consequence of Lorentz invariance, the energy–momentum conservation
law, is not a bona fide consequence of conformal invariance. Notice that the isometry of
the anti-de Sitter space is the conformal group. Taking the large radius limit, the
conformal group contracts to the Poincare´ group. Rather surprisingly, already at the
level of the four-point amplitude, implementation of energy–momentum conservation
requires the existence of a new type of singularity in the four-point correlation functions
in the large N limit. This singularity, to our knowledge, is not dictated by conformal
symmetry.
In order to extract information about physical process happening in the center of the
anti-de Sitter space, well-focused wave packets must be prepared. A precise ansatz for
an incoming particle or an outgoing particle is given in w3x. We will focus on massless
particles, for we will work with the type IIB string theory, and the only stable states are
those of the supergraviton. Denote the creation operator of an incoming particle by
av ey, where v s RE is the dimensionless energy, R is the radius of AdS 5 , and e is a
unit four-vector. This particle carries a momentum v e tangent to AdS 5 in its center. The
state is smeared over S 5, or it carries a zero momentum in the internal space. Similarly,
denote the annihilation operator of an outgoing particle by av eq. For a scalar particle,
the ansatz of w3x is
v 2
av eys vy3r2 dt d V exp y
H ½ Ž t q pr2. q < x q e < 2
2
5
yi v Ž t q pr2 . O Ž t ,x . ,
v 2
av eqs vy3r2 dt d V exp y
H ½ Ž t y pr2. q < x y e < 2
2
5
qi v Ž t y pr2 . O Ž t ,x . , Ž 1.1 .
where O is the appropriate operator corresponding to the scalar field w9–11x. For the
dilaton, it is proportional to tr F 2 , for the axion, it is proportional to tr F n F. We
assumed that O is properly normalized, so some v independent numerical factors in w3x
were dropped. The integral Hd V is over the unit S 3 which is parameterized by x.
The above ansatz clearly indicates that the incoming Žoutgoing. particle originates
Žends up. at time ypr2 Žpr2.. In the large R limit, the proper time goes to "`. The
Gaussian factor helps to focus the beam in the direction e. A S-matrix element is given
by
N ™`
S s lim Fy1 ¦Ł ai
v e iy Ł av e q
j
j ;, Ž 1.2 .
where F is a normalization factor. For a fixed string coupling constant, due to the
relation R 4 s 4p g s Na X 2 , the large R limit is achieved by taking the large N limit.
M. Li r Nuclear Physics B 568 (2000) 195–207 197
In a conformal field theory, both the two-point functions and three-point functions are
fixed up to a numerical coefficient by conformal symmetry. One would expect that the
calculation of the two-point amplitudes and three-point amplitudes using Eqs. Ž1.1.,
Ž1.2. is a simple matter. Actually, as shown in Section 2, an exact form of the
three-point amplitude can be obtained only after some tedious calculations. In this
section we will be content with a qualitative examination of these amplitudes.
The geometry S 3 = R is conformal to R 4 , so correlation functions on S 3 = R can be
obtained from those on R 4 using the conformal transformation. For instance, the
Euclidean distance between the two points x and y, r 2 s < x y y < 2 is mapped to
exp Ž t x q t y . Ž cosh Ž t x y t y . y x P y . , Ž 1.3 .
where we parametrize R 4 by the radial coordinates r s expt and the unit sphere S 3. x
and y are unit four-vectors. Now t and x parametrize S 3 = R.
In a correlation function, the extra factors such as expŽt 1 q t 2 . are removed by a
conformal factor. If F Ž ri2j . is a correlation function on R 4 , then the corresponding
correlation function on S 3 = R is obtained by simply replacing ri2j with coshŽt i y t j . y
x i P x j . To obtain the correlation function on a Minkowskian S 3 = R, we wick-rotate t
to it, and add a term i e :
ri2j scos Ž t i y t j . y x i P x j q i e , Ž 1.4 .
where the i e prescription is introduced to ensure causality in the boundary conformal
theory. To see this, assume t i y t j small, and the angle f i j between x i and x j small, we
obtain
2
y 12 Ž t i y t j . q 12 f i2j q i e , Ž 1.5 .
we see that if one uses ry2
i j as the propagator, the signal will propagate along the future
light cone for a positive energy mode of the form expŽyi v t ..
The scaling dimension of an operator corresponding to a massless scalar field is
D s 4. The two-point function is therefore
² O Ž t 1 ,x 1 . O Ž t 2 ,x 2 . : s Ž cos Ž t 1yt 2 . yx 1 P x 2qi e . y4 Ž 1.6 .
up to a normalization constant.
The two-point scattering amplitude is obtained using Ž1.2.. Without taking the large
N and high energy limit, there is no energy–momentum conservation, since as shown in
the appendix of w3x, the incoming an well as outgoing waves have finite width in both
energy and momentum. The width of v is proportional to 'v . Using v s RE, the width
'
of E is proportional to ErR and goes to zero in the large R limit. So energy–
momentum conservation has to be recovered in this limit. We shall show in the next
section that the energy conservation is always guaranteed in this limit.
However, momentum conservation rather imposes strong constraints on the behavior
of conformal correlators in the large N limit. As we will see in Section 2, the
convolution of Ž1.2. using ansatz Ž1.1. is rather subtle. To obtain the exact numerical
answer, one cannot simply replace the Gaussian distribution of Ž1.1. by a simpler one,
say a delta function. To see the momentum conservation, though, we will do this in this
section.
198 M. Li r Nuclear Physics B 568 (2000) 195–207
In the case of the two-point amplitude, replacing the Gaussian wave packets by delta
functions, we obtain
y4
Ž 1 y e1 P e 2 . . Ž 1.7 .
Together with a factor depending on v i , we will have a null result if e 1 P e 2 / 1. The
above expression is singular if e 1 P e 2 s 1 or equivalently e 1 s e 2 . Thus we hope that a
more careful calculation will result a delta function d 3 Že 1 y e 2 . for the two-point
amplitude. Here we define the delta function by
3
Hd Ž e 1 y e 2 . d V 2 s 1.
this puzzle is avoided. A related feature is that the conformal dimension of a stringy
state is divergent in the large N limit w9,10x, thus although the convolution Ž1.2. exists
for finite N, its large N limit does not exist.
It is more interesting to see what happens to the four-point correlation function when
the momentum conservation is imposed. The four-point correlation function of operator
O of scaling dimension 4 can not be fixed by conformal symmetry alone. Up to a
scaling factor, it is a function of two independent cross-ratios. Defining the cross-ratios
2 2 2 2
r 12 r 34 r 12 r 34
as 2 2
, bs 2 2
, Ž 1.10 .
r 13 r 24 r 23 r 14
2. Explicit calculations
We will see that an explicit calculation based on the ansatz of w3x is quite difficult.
We will be able to obtain a closed form for the two-point amplitude, we will not be able
to complete the calculation of the three-point amplitude. However, we will show that
energy–momentum conservation is ensured in the large R limit. We discuss the
calculation of the four-point amplitude. A lot of work is left for the future.
f Ž t , . . . . s d v f˜Ž v , . . . . eyi v t ,
H
M. Li r Nuclear Physics B 568 (2000) 195–207 201
we have
2p
Hdt e y
v
2
t 2 qi v t
f Ž t, . . . . s ( v
Hd v X
ey Ž v y vX .2 v f˜Ž v X , . . . . . Ž 2.2 .
The integral over v X centers around v with width dv X ; 'v , compared with the
principle value v , this deviation tends to zero in the large v limit. Thus the above
integral is approximately
2p f˜Ž v , . . . . s dt e i v t f Ž t , . . . . .
H Ž 2.3 .
Compared with this approximate value, the deviation is about
df˜Ž v , . . . .
dv , Ž 2.4 .
dv
so it can be ignored if
f˜X Ž v , . . . . rf˜Ž v , . . . . < 1r'v .
This condition is generally satisfied. Alternatively, as we already saw, the function
f Ž t, . . . . has infinitely many poles. Integral over t will pick up these poles, the Gaussian
factor expŽyŽ vr2. t 2 . helps to suppress all these poles except the one closest to zero. In
this case, the other factor expŽ i v t . is more important than the Gaussian factor, since it
oscillates fast around t ; 1r 'v .
To demonstrate the above result, let us see how the energy conservation is derived in
the large v limit. For a general scattering amplitude, we have
1 2
H Łi dt exp y Ý v t
i 2 i i qi Ý ai vi ti f Ž ti , . . . . , Ž 2.5 .
i i
where a i s 1 for an outgoing state, and it is y1 for an incoming state. Using a new set
of times: t 1 , t i s t i q t 1 , i s 2, . . . ,n, the function f Ž t i , . . . . is a function of t i only,
since the correlation function in SYM is invariant under a time translation. Performing
the integral over t 1 first, we obtain the integrand for t i ,
2p 1 1
( v
exp y
2v
Ž Ý ai vi .
2
y
2v
Ý v it i
2
y 12 Ý v it i2
qi Ý a i v it i f Ž t i , . . . . , Ž 2.6 .
which is just the energy conservation law. This factor can be obtained without including
the Gaussian factors. The remaining Gaussian factor in Ž2.6. is positive definite in t i ,
and as we argued before, can be ignored so long if we pick up poles closest to zero in t i .
The integral over variables on S 3 is much more complicated, and since the Gaussian
factor expŽyŽ v ir2.<x i " e i < 2 . is the only non-trivial factor in the convolution, one has
to treat the convolution carefully. In the following, we will examine the two-, three- and
four-point amplitudes separately. Before doing that, let us remark that this Gaussian
factor can be replaced by
vi
< x i"e i < 2
d4 ki 2 v i.qi k i Ž e i"x i .
ey 2 s H Ž 2pv . 2
eyŽk 2 i
i
d4 ki 2 .qi v i k iP Ž e i"x i .
s H Ž 2p . 2
eyŽk 2 i ' . Ž 2.8 .
As shown above, the integration over the ‘‘center of times’’ results in a factor
2p
d Ž E1 y E2 . . Ž 2.9 .
R
The remaining part is
y3
d 4 k1 d4 k2
F y1
N2 Ž RE1 . H Ž 2p . 2 2
exp yk 12 y 12 k 22 q i v 1 ( Ý k Pe i i F2 Ž k i . ,
Ž 2p .
Ž 2.10 .
with
F2 Ž k i . s dt d V 1 d V 2exp yi v 1 t y i v 1
H ( Ý a k Px i i i
y4
= Ž cost y x 1 P x 2 y i e . , Ž 2.11 .
where in Eq. Ž2.10. we introduced a normalization factor N2 depending on the
normalization of the operator O .
The integral over t in Ž2.11. can be performed first. It picks up a pole at
t s f 12 y i e , where x 1 P x 2 s cos f 12 . Other poles are suppressed by a Gaussian factor
we have omitted. The leading contribution is
p 1
F2 Ž k i . s v 13 d V 1 d V 2 exp yi Vf 12 y i v 1
H ( Ý a k Px i i i 4
,
3 sin Ž f 12 y i e .
Ž 2.12 .
M. Li r Nuclear Physics B 568 (2000) 195–207 203
other terms are suppressed by powers of v 1. Performing the integration over V 2 first,
we have
8p 3 32p 4 p J1 v 1 < k 1 y k 2 <
ž( /.
F2 s v 14 d V 1 e i'v 1 Ž k 2yk 1 .Px 1 s
H v 14 Ž 2.13 .
3 3 (v < k y k <
1 1 2
We will see momentarily that the fact that the above result is a function of only k 1 y k 2
ensures the momentum conservation in the large R limit.
Plugging back the above result into Ž2.10., ignoring the prefactor in Ž2.10. for the
moment, we have, after changing variables k 1 y k 2 s k 2r2, k 1 q k 2 s k 1r2
2 5p y k 12 qi v 1 k 1 P Ž e 1 ye 2 .
d 4k 2 2
v 14 4
Hd k e 1
' H 2(v ey k 2 qi'v 1 k 2PŽ e 1qe 2 . J1 2 v 1 < k 2 < .
ž( /
3 <
1 k2
Ž 2.14 .
The integral over k 1 is separated from that over k 2 . The first integral results in
2
p 2 eyv 1 < e 1ye 2 <
which in the large v limit tends to
3r2
p
p2 ž / v1
d 3 Ž e1 y e 2 . . Ž 2.15 .
Due to the delta function, the second integral in Ž2.14. is simplified. The integral over
k 2 can be separated into the radial part and the angular part, and the latter can be easily
performed. In the end, we obtain
2 8p 5 3r2 2
Ž pv 1 . d 3 Ž e 1 y e 2 . dk k J12 2 v 1 k eyk .
H ž( / Ž 2.16 .
3
In the large v 1 limit, the Bessel function J1 can be replaced by its asymptotic form,
namely
`
2
H0 dk k J ž 2(v k / e
1 1
yk 2
™ p(1v 1
2
Hdk cos ž 2(v k / e 1
yk 2
,
and since for a large v 1 , the cos factor can be replaced by its average value 1r2, the
(
value of the above integral is Ž1r4.Ž1r pv 1 .. We have checked this result by using a
formula for the integral in Ž2.16. involving the Bessel function. Substitute this into
Ž2.16., we find
6
Ž 2p .
v 1 d 3 Ž e1 y e 2 . . Ž 2.17 .
3
Together with the prefactor in Ž2.10. and the delta function in Ž2.9., the end result is
7
Ž 2p .
² av a
1 e 1y v 2 e 2q
: s Fy1 N2 E1y2d Ž E1 y E2 . d 3 Ž e 1 y e 2 . . Ž 2.18 .
3 R3
204 M. Li r Nuclear Physics B 568 (2000) 195–207
ˆ t FŽ t , x . q . . . ,
a s dV fE
H
so close to the center of the AdS space we have, roughly
d4 k
f̂ Ž t , x . s H E
Ž aq Ž k . eyi E tqi kP x q a Ž k . e i E tykP x . , Ž 2.21 .
where all the coordinates are the proper ones, unlike the ones in Ž2.20.. Thus we expect
that the scattering amplitude, up to a numerical factor, must be
² av
1 e 1y
av 2 e 2q : s R 5 E1 d 4 Ž E1 e 1 y E2 e 2 . , Ž 2.22 .
where the volume factor R 5 comes from the internal space S 5, since the particles have
zero momenta in the internal space. The kinetic factor is precisely the same as in Ž2.19..
Therefore, it appears that the normalization F is order 1, and N2 ; R 8 ; N 2 g s2 .
Consider the three-point amplitude with two incoming particles. It is more convenient
to integrate out t 3 first, with t 1 s t 1 q t 3 , t 2 s t 2 q t 3 . As before a delta function
ensuring energy conservation results. The amplitude is
2p d4 ki
A 3 s N3 d Ž E1 q E2 y E3 . Ł vy3r2
i H Łi Ž 2p . 2
R i
= exp y 12 Ý k i2 q i Ý v i k i e i F3 Ž k i . ,
( Ž 2.23 .
M. Li r Nuclear Physics B 568 (2000) 195–207 205
with
y2
F3 Ž k i . s dt 1 dt 2 Ł d V i eyi v it iqi'v i k iPx icos Ž t 1 y t 2 . y x 1 P x 2 q i e .
H i
y2 y2
= cos Ž t 1 y x 1 P x 3 y i e . cos Ž t 2 y x 2 P x 3 y i e . , Ž 2.24 .
™
where we have reflected x 3 yx 3 . Denote x i P x j by cos f i j . To perform the integral
over t i first, we use the following formula:
y2
1 `
i vt " i < v <Ž f " i e .
Ž cost y cos f " i e . sy 2 H dv < v < e
sin Ž f " i e . y`
, Ž 2.25 .
y1
i i vt " i < v <Ž f " i e .
Ž cost y cos f " i e . s. Hd v e Ž 2.26 .
sin Ž f " i e .
by taking derivative with respect to f once. With the Fourier transform Ž2.25., the
integral over t i in Ž2.24. is readily performed, with the result
ž(
exp i v i k i P x i /
F3 Ž k i . s y H Łi d V sin Ž f i 2
12 q i e . sin Ž f 13 y i e . sin2 Ž f 23 y i e .
2
= d v < v = Ž v 1 y v . Ž v 2 q v . < e i < v < f 12 yi < v 1yv < f 13 yi < v 2qv < f 23 < .
H
Ž 2.27 .
We do not know how to carry out the calculation of the above integral. One thing is
certain, though: due to the singular behavior of the integrand, the integral is peaked
around f 12 s f 13 s f 23 s 0. We thus expect that F3 will be a function of Ý v i k i only. (
We now argue that this ensures the momentum conservation in the large R limit.
Introduce new vectors
1
l1 s
(2 v Ý (v k ,
3
i i
l s y(v rv k q (v v k ,
2 2 3 1 1 3 2
l s (v r Ž 2 v . k q (v r Ž 2 v
3 1 3 1 2 3 . k 2 y 1r'2 k 3 . Ž 2.28 .
With the above relations, it is easy to see that Ýk i2 s Ýl i2 , using the fact v 3 s v 1 q v 2 .
Now F3 is a function of l 1 only. We can perform the integral in Ž2.23. over l 2 and l 3
first. We note in particular that in the exponential in Ž2.23., the l 3 dependent part is
l3
y 12 l 32 q i
(2 v Ž Ý a v e . .
3
i i i Ž 2.29 .
It is seen that the integral over l 3 results in a delta function ensuring momentum
conservation, in the large R limit.
206 M. Li r Nuclear Physics B 568 (2000) 195–207
Ž 2.30 .
We have much less to say about the four-point amplitude. Although we trust that the
condition for energy–momentum conservation derived in the previous section is neces-
sary, we are not able to prove that it is also sufficient. As in the two- and three-point
amplitude cases, we can always write the four-point amplitude as
4 d4 ki
A 4 s N4 H is1
Ł 2
exp y 12 Ý k i2 q i Ý v i k i P e i F4 Ž k i . ,
( Ž 2.31 .
Ž 2p .
where F4 is given by a similar formula as Ž2.24.. We omitted a factor conserving
energy.
To see whether the momentum conservation is true in the large R limit, as in
Subsection 2.2, we introduce a set of new vectors
li s Ý Vi j k j , Ž 2.32 .
j
(
with V i j 4 being an orthogonal matrix. We can choose V 1 i s a i v irv , where v s Ý i v i .
If, in the large R limit, F4Ž l i . is less dependent on l 1 than on other l i , the integral over
l 1 in Ž2.31. can be performed first, thus resulting in a delta function associated with
momentum conservation.
It can be shown that the condition that 1r 'a q 1r 'b s 1 is a singularity of f Ž a,b .
is a necessary one, where f Ž a,b . is a function introduced in Ž1.11.. This can be seen
using a formula similar to Ž2.24.. Independence of various components can be used to
derive relations in Ž1.13. and Ž1.14., thus the above condition. This is just a single
condition, so it is far from clear whether it is also a sufficient condition for conserving
four-momentum.
Finally, we want to show that if 1r 'a q 1r 'b s 1 is a singularity of f Ž a,b ., then
both 1r 'a y 1r 'b s 1 and 1r 'a y 1'b s y1 are also singularities of f Ž a,b ..
f Ž a,b . is a symmetric function of a and b. To see this, we go to the Euclidean space.
™ ™
Exchange point 1 and point 2, a and b are exchanged. Exchange point 2 and point 3 we
are led to a 1ra, b bra, thus
3. Conclusion
We have only scratched the surface of the problem of investigating the consequence
of Lorentz invariance for the holographic S-matrix and associated correlation functions
in the large N limit. For instance, more constraints can be derived from the requirement
that A 4 is a function of only s and t, apart from a kinetic factor. Even more interesting,
is the consequence of causality in the flat space limit. We leave these problems for
future investigations.
Our main result in this paper is the identification of a new singularity in the
four-point amplitude, in the large N limit. This means that the dominant contribution to
the scattering amplitude comes from around this ‘‘saddle point’’. This reminds us the
problem of sensitive initial conditions raised in w7x. It is observed there that if the two
beams aimed at the center of the AdS are emitted with time difference greater than 1rR,
then the beams will miss each other. In the large N limit, this time difference can be
arbitrarily small. It appears that a kind of sharp saddle point may help to understand this
puzzle. Presumably this is a consequence of locality in the bulk space. It remains to see
whether bulk locality together boundary conformal invariance guarantee bulk Poincaré
invariance in the large N limit.
Acknowledgements
This work was supported by DOE grant DE-FG02-90ER-40560 and NSF grant PHY
91-23780. We are grateful to R. Siebelink for collaboration during the initial stage of
this project. Useful discussions with P. Kraus have helped me going through some
difficult phases. Correspondence with J. Polchinski is acknowledged. We are also
grateful to P-M. Ho for helpful discussions. This work was completed during a visit to
National Taiwan University. We thank P-M. Ho, W-Y. P. Hwang and Y-C. Kao for
hospitality.
References
Abstract
In the high-energy limit, we compute the gauge-invariant three-parton forward clusters, which
in the BFKL theory constitute the tree parts of the NNLO impact factors. In the triple collinear
limit, we obtain the polarized double-splitting functions. For the unpolarized and the spin-corre-
lated double-splitting functions, our results agree with the ones obtained by Campbell–Glover and
Catani–Grazzini, respectively. In addition, we compute the four-gluon forward cluster, which in
the BFKL theory forms the tree part of the NNNLO gluonic impact factor. In the quadruple
collinear limit we obtain the unpolarized triple-splitting functions, while in the limit of a
three-parton central cluster we derive the Lipatov vertex for the production of three gluons,
relevant for the calculation of a BFKL ladder at NNLL accuracy. Finally, motivated by the
reorganization of the color in the high-energy limit, we introduce a color decomposition of the
purely gluonic tree amplitudes in terms of the linearly independent subamplitudes only. q 2000
Elsevier Science B.V. All rights reserved.
1. Introduction
QCD calculations of multijet rates beyond the leading order ŽLO. in the strong
coupling constant a s are generally quite involved. However, in recent years it has
become clear how to construct general-purpose algorithms for the calculation of multijet
rates at next-to-leading order ŽNLO. accuracy w1–11x. The crucial point is to organize
the cancellation of the infrared Ži.e. collinear and soft. singularities in a universal, i.e.
process-independent, way. The universal pieces in a NLO calculation are given by the
tree-level splitting w12–15x and eikonal w16–18x functions, and by the universal structure
of the poles of the one-loop amplitudes w1,4,19x.
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 5 7 - 4
212 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
paper we compute their tree components, i.e. the emission of three partons in the
forward-rapidity region. By taking then the triple collinear limit of the tree NNLO
impact factors, we obtain the polarized double-splitting functions. Summing over the
parton polarizations, we obtain the unpolarized and the spin-correlated double-splitting
functions, previously computed in Ref. w20x and w21x, respectively, in the conventional
dimensional regularization ŽCDR. scheme. Since we sum over two helicity states of the
external partons, as it is done in the dimensional reduction ŽDR. scheme w56–58x, our
results agree with the ones in the CDR scheme by setting there the dimensional
regularization scheme ŽRS. parameter e s 0.
For a scattering with production of m partons, we define the n-parton cluster, with
m ) n, as the set of n final-state partons where the distance in rapidity between any two
partons in the cluster is much smaller than the rapidity distance between a parton inside
the cluster and a parton outside. In the BFKL theory, Ž n q 1.-parton forward clusters
provide the tree parts of N n LO impact factors, while Ž n q 1.-parton central clusters
provide the tree parts of the N n LO Lipatov vertex. n-parton clusters were given also a
field-theoretical basis in terms of an effective action describing the interaction between
physical partons grouped into gauge-invariant clusters and the gluons exchanged in the
cross channel w59x. In addition to computing the three-parton forward clusters and the
three-gluon central cluster mentioned above, we compute the four-gluon forward cluster,
i.e. the purely gluonic tree part of the NNNLO impact factor. By taking then the
quadruple collinear limit, we obtain the polarized triple-splitting functions. They could
be used in a gauge-invariant evaluation of the Altarelli–Parisi evolution at three loops
w60x.
The outline of the paper is: in Section 2 we review the standard color decompositions
of the n-parton tree amplitudes, and we present a color decomposition of the gluon
amplitudes in terms of the linearly independent subamplitudes only. In Section 3 we
review the elastic scattering of two partons in the high-energy limit, which allows for the
extraction of the LO impact factors. In Section 4 we review the amplitudes for the
production of three partons, with a gauge-invariant two-parton forward cluster; from
these, we can extract the tree parts of the NLO impact factors; by taking the collinear
limit, we obtain the LO splitting functions. In Section 5 we compute the amplitudes for
the production of four partons, with a three-parton forward cluster; then we extract the
tree parts of the NNLO impact factors, and by taking the triple collinear limit we obtain
the polarized and unpolarized double-splitting functions. In Section 6.1 we compute the
amplitude for the production of five gluons, with a four-gluon forward cluster. We
extract the tree part of the gluonic NNNLO impact factor, and by taking the quadruple
collinear limit we obtain the polarized triple-splitting functions. In addition, by taking
the limit in which three gluons are emitted in the central-rapidity region, we obtain the
gauge-invariant three-gluon central cluster, i.e. the tree part of the NNLO Lipatov
vertex. In Section 7 we draw our conclusions.
2. Tree amplitudes
In this section we review the color decomposition of purely gluonic and quark–gluon
tree amplitudes. For the purely gluonic tree amplitudes, we introduce a color decomposi-
tion in terms of the linearly independent subamplitudes, Eq. Ž2.9..
214 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
For an amplitude with n gluons the usual color decomposition at tree level reads
w61–69x
i A Ž g 1 , . . . , g n . s ig ny 2 Ý tr Ž ls1 . . . ls n . A Ž gs 1 , . . . , gs n . , Ž 2.1 .
sgS nrZ n
where SnrZ n are the non-cyclic permutations of n elements. The dependence on the
particle helicities and momenta in the subamplitude, and on the gluon colors in the trace,
is implicit in labeling each leg with the index i. Helicities and momenta are defined as if
all particles were outgoing.
The gauge invariant subamplitudes A satisfy the relations w68,69x, proven for
arbitrary n in Ref. w70x,
1
< A Ž 1, s 2 , . . . , sn . < 2 q O
s Cn Ž Nc . Ý
sgS ny1
ž /Nc2
Ž 2.5 .
Ž ny2 . !
s Ý c˜i j A i A)j , Ž 2.6 .
i , js1
with Pi the ith permutation in SnrZ n . In Eq. Ž2.5., the coefficient CnŽ Nc . is
ny 2
Ž g 2 Nc .
Cn Ž Nc . s
2n
Ž Nc2 y 1 . . Ž 2.8 .
The first term in Eq. Ž2.5. constitutes the Leading Color Approximation ŽLCA.. Up to
n s 5, the 1rNc2 corrections in Eq. Ž2.5. vanish and LCA is exact. The reduced color
matrix c˜i j in Eq. Ž2.6., has been obtained from c i j applying the linear transformations of
Eq. Ž2.3., thus the labels i, j in Eq. Ž2.6. run only on the permutations of the linearly
independent subamplitudes.
Motivated by the reorganization of the color in the high-energy limit w39,54,73,74x,
and using Eqs. Ž2.2. and Ž2.3. we rewrite Eq. Ž2.1. as
i A Ž g1 , . . . , gn .
ny 2
Ž ig .
si Ý f a1 a 2 x 1 f x 1 a 3 x 2 . . . f x ny 3 a ny 1 a n A Ž g 1 , gs 2 , . . . , gs ny 1 , g n .
2 sgS ny2
g ny 2
si
2
Ý Ž Fa 2 . . . F a ny 1 . a1 a n A Ž g 1 , gs 2 , . . . , gs ny 1 , g n . , Ž 2.9 .
sgS ny2
where Ž F a . b c ' if b ac. We have checked Eq. Ž2.9. up to n s 7. Eq. Ž2.9. enjoys several
remarkable properties. Firstly, it shows explicitly which is the color decomposition that
allows us to write the full amplitude i A in terms of the Ž n y 2.! linearly independent
subamplitudes only. In the following we shall refer to it as to a color ladder. Hence the
color matrix obtained squaring Eq. Ž2.9. yields directly the c˜i j matrix in Eq. Ž2.6.. We
have checked it against the explicit results of Ref. w75x, up to n s 5. Moreover, it is quite
suggestive to note the formal correspondence with the amplitudes with a quark–anti-
quark pair and Ž n y 2. gluons, Eq. Ž2.11., where the only difference between the two1 is
the appropriate representation for the color matrices, namely the adjoint for the n-gluon
amplitude and the fundamental for the one with the qq pair. Finally, the most relevant
applications of Eq. Ž2.9. for this work are to the study of the multi-gluon amplitudes in
the high-energy limit. As discussed in the following, the color ladder naturally arises
w73,74x in the configurations where the gluons are strongly ordered in rapidity, i.e. in the
multi-Regge kinematics. Indeed in the strong-rapidity ordering only the subamplitude
with the corresponding order in the color coefficient contributes to Eq. Ž2.9.. At NLO,
where the strong ordering is relaxed for two adjacent gluons, the leading subamplitudes
are the two which differ just by the exchange of the gluon labels in the color ladders
w54x. As we shall see this result generalizes at NNLO and beyond. Nonetheless, in the
1
The factor 1r2 in front of Eq. Ž2.9. is due to our choice for the normalization of the fundamental
representation matrices, i.e. trŽ l al b . s d a b r2.
216 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
following we have chosen to derive our results starting from Eq. Ž2.1. instead of using
directly Eq. Ž2.9.. The former, though more laborious, shows explicitly how the color
traces must be recombined to obtain the color ladder and, more importantly, allows us to
find the relations necessary to prove the factorization in the multi-collinear limits.
For the maximally helicity-Õiolating configurations, Žy,y,q, . . . ,q ., in Eq. Ž2.1. or
Eq. Ž2.9., there is only one independent colorrhelicity subamplitude, the Parke–Taylor
ŽPT. subamplitude
² i j :4
A Ž g 1 , . . . , g n . s 2 n r2 , Ž 2.10 .
²1 2: . . . ² Ž n y 1 . n:² n 1:
where the ith and the jth gluons have negative helicity. All other colorrhelicity
amplitudes can be obtained by relabelling and by use of reflection symmetry, Eq. Ž2.2.,
l
and parity inversion. Parity inversion flips the helicities of all particles, and it is
accomplished by the substitution ² i j : w j i x . Subamplitudes of non-PT type, i.e. with
three or more gluons of y helicity have a more complicated structure.
For an amplitude with two quarks and Ž n y 2. gluons the color decomposition at tree
level is w61–69x,
ı
i A Ž q,q ; g 1 , . . . , gŽ ny2. . s ig ny2 Ý Ž ls 1 . . . ls ny 2 . j A Ž q,q ; gs 1 , . . . , gs ny 2 . ,
sgS ny2
Ž 2.11 .
² qi :² qi :3
A Ž qq,qy ; g 1 , . . . , gŽ ny2. . s 2 Ž ny2.r2 , Ž 2.12 .
² qq :² q1: . . . ² Ž n y 2 . q :
where gluon g i has negative helicity. Helicity is conserved along the massless-fermion
line. All other colorrhelicity amplitudes can be obtained by relabelling and by use of
parity inversion, reflection symmetry and charge conjugation. In performing parity
inversion, there is a factor of y1 for each pair of quarks participating in the amplitude.
Reflection symmetry is like in Eq. Ž2.2., for gluons andror quarks alike. Charge
conjugation swaps quarks and antiquarks without inverting helicities. In particular, using
reflection symmetry and charge conjugation on Eq. Ž2.12. we obtain
² qi :3² qi :
A Ž qy,qq ; g 1 , . . . , gŽ ny2. . s 2 Ž ny2.r2 , Ž 2.13 .
² qq :² q1: . . . ² Ž n y 2 . q :
For an amplitude with four quarks and Ž n y 4. gluons the color decomposition at tree
level is w61x
helicity for all of the gluons, the A and B subamplitudes factorize into distinct
contributions for the two quark antennae w61–66,68,69x. However, as we shall see in
Section 5.4, we need the helicity configurations with two gluons of opposite helicity. For
these the above mentioned factorization does not occur.
™
an amplitude A a vertical bar to separate clusters of particles which are close in rapidity
from other clusters. The LO impact factors g ) g g, with g ) an off-shell gluon, are
p b)X H
C g ; g Ž py q
a ; pa . s 1
X C g ; g Ž py qX
b ; pb . s . Ž 3.2 .
p bX H
They conserve helicity along the on-shell gluon line and transform under parity into
their complex conjugates,
)
C g;g Ž kn 4 . s C g ; g Ž ky n 4 . . Ž 3.3 .
2
In Eq. Ž3.1. four helicity configurations are leading, two for each impact factor . The
helicity-flip impact factor C g ; g Ž pq; pXq . is subleading in the high-energy limit.
From Eqs. Ž2.11., Ž2.12., we obtain the quark–gluon q g q g scattering amplitude
in the high-energy limit w54x,
™
A q g ™ q g Ž pan a , panXaX < p bnXbX , p bn b .
n aX
1 X
s 2 s g lcaX a C q ; q Ž py
a ; panXaX . ig f b b c C g ; g Ž p bn b ; p bnXbX . , Ž 3.4 .
t
™
momentum, e.g. the antiquark is pa in Eq. Ž3.4. ŽFig. 1b., and p b in Eq. Ž3.5. ŽFig. 1c..
The LO impact factors g ) q q are
1r2
p b)X H
C q ; q Ž py q
a ; pa . s yi ;
X C q ; q Ž py qX
b ; pb . s i
ž / p bX H
. Ž 3.6 .
™
and in general an impact factor acquires a coefficient S for each pair of quarks Žsee
Section 2.. Analogously, the antiquark–gluon q g q g amplitude is
n aX
1 X
s 2 s g lca aX C q ; q Ž py
a ; panXaX . ig f b b c C g ; g Ž p bn b ; p bnXbX . , Ž 3.8 .
t
2
All throughout this paper, we shall always write only half of the helicity configurations contributing to an
impact factor, the other half being obtained by parity.
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 219
™
where the antiquark is paX in Eq. Ž3.8. and p bX in Eq. Ž3.9., and the LO impact factors
g ) q q are
1r2
p b)X H
C q ;q
Ž py q
a ; pa
X . si ; C q ;q
Ž py qX
b , pb . s yi ž /
p bX H
. Ž 3.10 .
In the amplitudes Ž3.1., Ž3.4., Ž3.5., Ž3.8., Ž3.9., the leading contributions from all the
Feynman diagrams have been included. However, the amplitudes have the effective
form of a gluon exchange in the t-channel ŽFig. 1., and differ only for the relative color
strength in the production vertices w76x. This allows us to replace an incoming gluon
with a quark, for instance on the upper line, via the simple substitution
X
ig f a a c C g ; g Ž pan a ; panXaX . lg l cX
aa C q ; q Ž py
a
n aX
; panXaX . , Ž 3.11 .
™
and similar ones for an antiquark andror for the lower line. For example, the
quark-quark q q q q scattering amplitude in the high-energy limit is
n aX
1
s 2 s g lcaX a C q ; q Ž py
a ; panXaX . g lcbX b C q ; q Ž py
b
n bX
; p bnXbX . . Ž 3.12 .
t
Let three partons be produced with momenta k 1 , k 2 and p bX in the scattering between
two partons of momenta pa and p b , and to be specific, we shall take partons k 1 and k 2
in the forward-rapidity region of parton pa , the analysis for k 1 and k 2 in the
forward-rapidity region of p b being similar. Parametrizing the momenta as in Eq. ŽA.1.,
we have
y 1 , y 2 4 y bX ; < k 1 H < , < k 2 H < , < p bX H < . Ž 4.1 .
Fig. 2. Amplitudes for the production of three partons, with partons k 1 and k 2 in the forward-rapidity region of
parton pa .
with the sum over the permutations of the two gluons 1 and 2, the LO impact factor,
C g ; g Ž p bn b ; p bnXbX ., as in Eq. Ž3.2., and
A g ; g g Ž pan a ;k 1n 1 ,k 2n 2 . s C g ; g g Ž pan a ;k 1n 1 ,k 2n 2 . An Ž k 1 ,k 2 . ,
B g ; g g Ž pan a ;k 1n 1 ,k 2n 2 . s C g ; g g Ž pan a ;k 1n 1 ,k 2n 2 . B n Ž k 1 ,k 2 . , Ž 4.3 .
with n s signŽ na q n 1 q n 2 ., and
C g ; g g Ž py q q
a ;k 1 ,k 2 . s 1,
C g ; g g Ž pq y q 2
a ;k 1 ,k 2 . s x 1 ,
C g ; g g Ž pq q y 2
a ;k 1 ,k 2 . s x 2 . Ž 4.4 .
The momentum fractions are defined as
kq
i
xi s q q i s 1,2 Ž x 1 q x 2 s 1 . , Ž 4.5 .
k1 q k 2
and the function Aq as follows:
qH x1 1
Aq Ž k 1 ,k 2 . s y'2
k1H x 2 12:
²
, ( Ž 4.6 .
with ²12: a shorthand for ² k 1 k 2 :. Using the dual Ward identity w61x, or UŽ1.
decoupling equations w18,67x, the function B n in Eq. Ž4.3., and thus the function B g ; g g ,
can be written as
B n Ž k 1 ,k 2 . s y An Ž k 1 ,k 2 . q An Ž k 2 ,k 1 . . Ž 4.7 .
™
is subleading to the required accuracy. The function An
g; g gŽ
The function C pq q q.
a ;k 1 ,k 2
has a collinear divergence as 2 k 1 P k 2 0, but the divergence cancels out in the function
B n where gluons 1 and 2 are not adjacent in color ordering w39x.
Using Eq. Ž4.7., and fixing t , y< q H < 2 , the amplitude Ž4.2. may be rewritten as
½
s 2 s Ž ig . Ý
sgS 2
f a ds 1c f c ds 2 cA g ; g g Ž pan a ;ksns1 1 ,ksns2 2 . 5
1 X X
= ig f b b c C g ; g Ž p bn b ; p bnXbX . , Ž 4.8 .
™ g g is enclosed in curly brackets, and includes
t
where the NLO impact factor for g ) g
six helicity configurations.
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 221
with k 1 the antiquark, the NLO impact factor g ) g ™ qq in curly brackets, and with
A g ; q q Ž pan a ;k 1n 1 ,ky
2
n1
. s C g ; q q Ž pan a ;k 1n 1 ,ky2 n 1 . An Ž k 1 ,k 2 . ,
C g q q Ž pq q y 3
a ;k 1 ,k 2 . s x 1 x 2 ,(
C g ; q q Ž pq y q
a ;k 1 ,k 2 . s (x 3
1 x2 , Ž 4.13 .
™
with momentum fractions as in Eq. Ž4.5., An in Eq. Ž4.6. and n s na . The NLO impact
factor g ) g qq allows for four helicity configurations.
In the multi-Regge limit kq q )
™
1 4 k 2 , the NLO impact factor g g qq vanishes, since
quark production along the multi-Regge ladder is suppressed.
222 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
™
the substitution Ž3.11. for the upper line, and the LO impact factor C q; q in Eq. Ž3.6..
The treatment of the amplitude q g q g g for the production of a q g pair in the
™
forward-rapidity region of antiquark a is identical to the former, thus the NLO impact
factor q g ) q g is the same as in Eq. Ž4.14. up to inverting the color flow on the
quark line w54x. The corresponding functions A and B are the same as in Eq. Ž4.15..
4.4. NLO impact factors in the collinear limit
The collinear factorization for a generic amplitude occurs both on the subamplitude
and on the full amplitude w61x, since in Eqs. Ž2.1., Ž2.11. and Ž2.14. color orderings
where the collinear partons are not adjacent do not have a collinear divergence. Hence in
the collinear limit for partons i and j, with k i s zP and k j s Ž1 y z . P, a generic
amplitude Ž2.1. can be written as
lim A . . . d i d j . . . Ž . . . ,k in i ,k nj j , . . . . s Ý A . . . c . . . Ž . . . , P n , . . . . Splityf ™
n
fi fj
Ž k in i ,k nj j . ,
ki<< kj n
Ž 4.16 .
with f denoting the parton species. Accordingly, for k 1 s zP and k 2 s Ž1 y z . P, we can
write the amplitudes Ž4.8., Ž4.12. and Ž4.14. as
lim A f g ™ f 1 f 2 g Ž pan a ,k 1n 1 ,k 2n 2 < py
b
n n
X b,p b.
b
k1< < k 2
3
In this context, Eq. Ž4.7. is only a bookkeeping, since the UŽ1. decoupling equation is valid only for the
gluino-gluon subamplitudes corresponding to the quark–gluon subamplitudes used in Eq. Ž4.14..
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 223
helicity conservation in the s channel ŽSection 3.. For the collinear factors, Splityf ™
n
f1 f 2
,
we obtain
Splityg ™
n
gg
Ž k 1n 1 ,k 2n 2 . s ig f c d1 d 2 splityg ™
n
gg
Ž k 1n 1 ,k 2n 2 . ,
Splityg ™
n
qq
Ž k 1n 1 ,k 2n 2 . s g Ž lc . d 2 d 1 splityg ™
n
qq
Ž k 1n 1 ,k 2n 2 . ,
q™ q g
Splity n Ž k 1n 1 ,k 2n 2 . s g Ž ld 2 . d1 c splityq ™
n
qg
Ž k 1n 1 ,k 2n 2 . ,
q™ q g
Splity n Ž k 1n 1 ,k 2n 2 . s g Ž ld 2 . c d 1 splityq ™
n
qg
Ž k 1n 1 ,k 2n 2 . , Ž 4.18 .
splityg ™ g g Ž kq q
1
'
1 ,k 2 . s 2
(z Ž 1 y z . ²12: ,
splitqg ™ g g Ž ky q
z2
'
1 ,k 2 . s 2
(z Ž 1 y z . ²12: ,
™ g g Ž kq1 ,ky2 . s '2
2
Ž1yz .
splitqg
(z Ž 1 y z . ²12: ,
splitqg ™ q q Ž kq y
1yz
'
1 ,k 2 . s 2 ,
²12:
splitqg ™ q q Ž ky q
z
'
1 ,k 2 . s 2 ,
²12:
q™ q g
Ž kq1 ,kq2 . s splityq ™ q g Ž kq1 ,kq2 . s '2
1
splity
'1 y z ²12: ,
q™ q g
Ž ky1 ,kq2 . s splitqq ™ q g Ž ky1 ,kq2 . s '2
z
splitq Ž 4.19 .
'1 y z ²12:
and splitnf ™ f 1 f 2 Ž ky
1
n 1 yn 2 .
,k 2 obtained from splityf ™
n
f1 f 2 Ž n 1 n 2 .
k 1 ,k 2 by exchanging ² k 1 k 2 :
with w k 2 k 1 x, and multiplying by the coefficient S, Eq. Ž3.7., if the splitting factor
includes a quark pair.
Summing over the two helicity states of partons 1 and 2, we obtain a two-dimen-
sional matrix, whose entries are the Altarelli–Parisi splitting functions at fixed color and
helicity of the parent parton w12–15x
f ™ f1 f 2
with e iŽ flyf r . a phase, where e iŽ f qyf y. s w21xr²12:, and where by definition Pqq
f ™ f1 f 2 f ™ f1 f 2 f ™ f1 f 2
s Pyy , and Pqy s Pyq , and
g™ gg
z 1yz
Pqq s 2CA q qz Ž1yz . ,
1yz z
g™ gg
Pqy s 2CA z Ž 1 y z . ,
g ™ qq
1 2 2
Pqq s z q Ž1yz . ,
2
g ™ qq
Pqy sz Ž1yz . ,
™ ™
2
q qg q qg
1qz
Pqq s Pqq s CF ,
1yz
q™ q g q™ q g
Pqy s Pqy s0 . Ž 4.21 .
q™ q g
over two helicity states of the external partons, Eq. Ž4.21. is valid in the-dimensional
reduction ŽDR. scheme w56–58x. Eq. Ž4.21. agrees with the corresponding spin-corre-
lated splitting functions of Ref. w77x in the DR scheme, after contracting the ones of type
P g ™ f 1 f 2 with a parent-gluon polarization as in Appendix E. The connection of Eq.
Ž4.21. with other regularization schemes ŽRS. is also given in Ref. w77x.
Averaging over the trace of P f ™ f 1 f 2 in Eq. Ž4.20., i.e. over color and helicity of the
parent parton on the left-hand side of Eq. Ž4.20., we obtain the unpolarized Altarelli–
Parisi splitting functions 4
<Splityf ™ ² P f ™ f1 f 2 : ,
1 2 g2
Ý n
f1 f 2
Ž k 1n 1 ,k 2n 2 . < 2 s Ž 4.22 .
2C nn 1n 2 s12
with C s Nc2 y 1 for a parent gluon and C s Nc for a parent quark, and where the
averaged trace of P f ™ f 1 f 2 is ² P f ™ f 1 f 2 : s tr P f ™ f 1 f 2r2 s Pqq
f ™ f1 f 2
.
Fig. 3. Amplitudes for the production of four partons, with partons k 1 , k 2 and k 3 in the forward-rapidity
region of parton pa .
type, with three gluons of q helicity and three gluons of y helicity w61x, and Appendix
D, we obtain
X X X
=tr Ž l al ds 1 l ds 2 l ds 3 l bl b y l al ds 1 l ds 2 l ds 3 l bl b q l bl bl ds 3 l ds 2 l ds 1 l a
X X
yl bl bl ds 3 l ds 2 l ds 1 l a . q B g ;3 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 . tr Ž l al ds 1 l ds 2 l bl bl ds 3
X X X
yl al ds 1 l ds 2 l bl bl ds 3 q l bl bl ds 2 l ds 1 l al ds 3 y l bl bl ds 2 l ds 1 l al ds 3 . , Ž 5.2 .
with the sum over the permutations of the three gluons 1, 2 and 3, and the LO impact
factor, C g ; g Ž p bn b ; p bnXbX ., as in Eq. Ž3.2.. From the PT subamplitudes Ž2.10. we obtain the
function of Žyqqq . helicities
qH x1 1
Aq Ž k 1 ,k 2 ,k 3 . s y2
k1H ( x 3 ²12:²23:
Ž 5.4 .
and
kq
i
xi s i s 1,2,3 Ž x 1 q x 2 q x 3 s 1. . Ž 5.5 .
kq q q
1 q k2 q k3
1 na s y
C g ;3 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 . s
½ x i2 ni s y i s 1,2,3
with n s q, Ž 5.6 .
226 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
From the non-PT subamplitudes w61x we obtain the function of Žyyqq . helicities
A g ;3 g Ž py q q y
a ,k 1 ,k 2 ,k 3 .
2 s a12 s123
s
s12 < k 1 H < 2
y
s23 ž b Ž k 1 ,k 2 ,k 3 . x 1
g Ž k 1 ,k 2 ,k 3 . Ž x 1 x 2 q b Ž k 1 ,k 2 ,k 3 . Ž x 2 q x 3 . .
q
x2 x3 /
2 2
b Ž k 1 ,k 2 ,k 3 . s a12 g Ž k 1 ,k 2 ,k 3 . s123 < k 1H < 2 s12 x 12 x 2 < q H < 2
y q q ,
x2 s23 x 1 x 2 x 3 s23 Ž x 2 q x 3 .
A g ;3 g Ž py q y q
a ,k 1 ,k 2 ,k 3 .
2 s a12 s123
s
s12 < k 1 H < 2
y
s23 ž ya Ž k 1 ,k 3 ,k 2 . x 1
g Ž k 1 ,k 3 ,k 2 . Ž x 1 x 2 y a Ž k 1 ,k 3 ,k 2 . Ž x 2 q x 3 . .
q
x2 x3 /
2 2
a Ž k 1 ,k 3 ,k 2 . s a12 g Ž k 1 ,k 3 ,k 2 . s123 < k 1 H < 2 s12 x 12 x 2 < q H < 2
y q q ,
x2 s23 x 1 x 2 x 3 s23 Ž x 2 q x 3 .
2 2
2 g Ž k 2 ,k 3 ,k 1 . s123 a Ž k 2 ,k 3 ,k 1 . s a12
A g ;3 g Ž py y q q
a ,k 1 ,k 2 ,k 3 . s y
s12 s23 x 1 x 2 x 3 x2 < k3H < 2
a Ž k 2 ,k 3 ,k 1 . g Ž k 2 ,k 3 ,k 1 . s a12 s123 Ž x 2 q x 3 .
q ,
s23 x 2 x 3 < k 3 H < 2
Ž 5.7 .
with si jk s Ž pi q pj q p k . 2 the three-particle invariant, and
(x 1 k3H ž (x 1
)
qH (
q x 2 w1 2x /,
a Ž k 1 ,k 2 ,k 3 . '
s a13
Ž k 1 H qk 2 H . w 1 2 x (x 1 x 2
b Ž k 1 ,k 2 ,k 3 . ' ,
s a12
(x 1 ž(
x 2 x 3 w 1 2 x x 1 ²1 3: q x 2 ²2 3: ( /.
g Ž k 1 ,k 2 ,k 3 . ' Ž 5.8 .
s123
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 227
Using the UŽ1. decoupling equations w18,67x, the function B in Eq. Ž5.2. can be
written as
B g ;3 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 . s y A g ;3 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 .
qA g ;3 g Ž pan a ;k 1n 1 ,k 3n 3 ,k 2n 2 .
qA g ;3 g Ž pan a ;k 3n 3 ,k 1n 1 ,k 2n 2 . . Ž 5.9 .
In the triple collinear limit, k 1 < < k 2 < < k 3 , Section 5.6, the function A has a double collinear
divergence, while the function B, whose gluon 3 is not color adjacent to gluons 1 and 2,
has only a single collinear divergence.
Using Eq. Ž5.9., we can rewrite Eq. Ž5.2. as
3 X X XX
½
s 2 s Ž ig . Ý
sgS 3
f a ds 1c f c ds 2 c f c ds 3 c A g ;3 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 . 5
1 X XX
= ig f b b c C g ; g Ž p bn b ; p bnXbX . , Ž 5.10 .
t
where the NNLO impact factor g ) g
14 helicity configurations.
™ g g g is enclosed in curly brackets, and includes
5.2. The NNLO impact factor gg ) ™ gqq
We consider the amplitude for the scattering g g g q q g ŽFig. 3b., in the kinemat-
ics Ž5.1.. Using Eqs. Ž2.11. – Ž2.13. and the subamplitudes of non-PT type, with two
™
gluons of q helicity and two gluons of y helicity w61x, we obtain
A g g ™ g q q g Ž pan a ,k 1n 1 ,k 2n 2 ,ky
3
n 2 < nXbX
p b , p bn b .
1 X X
s 2 s I g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2
. ig f b b c C g ; g Ž p bn b ; p bnXbX . , Ž 5.11 .
t
with k 3 the quark, and with NNLO impact factor g g ) ™ g q q,
X
I g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2
. s g 3 Ž lclald1 . d 3 d 2 A1g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky3 n 2 .
X
q Ž l alcl d1 . d 3 d 2 A 2g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
X
q Ž l d1lcl a . d 3 d 2 A 3g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
X
q Ž l d1l alc . d 3 d 2 A 4g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
X
q Ž l al d1lc . d 3 d 2 B1g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
X
q Ž lcl d1l a . d 3 d 2 B2g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2
. .
Ž 5.12 .
228 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
The NNLO impact factor allows for eight helicity configurations. From the PT subam-
plitudes Ž2.12., Ž2.13. we obtain
qH 1
A1g ; g q q Ž pq q y q
a ;k 1 ,k 2 ,k 3 . s y2
k1H
x 1 x 23
²1 2 ²2 3:
:
, (
x 23 x 32
A 2g ; g q q Ž pq q y q
a ;k 1 ,k 2 ,k 3 . s 2
qH
k3H
) x1 ²1 2: ²2 3:
1
,
qH x3 1
A 3g ; g q q Ž pq q y q
a ;k 1 ,k 2 ,k 3 . s y2
k2 H
x 22 ( x 1 ²1 3: ²3 2:
,
qH 1
A 4g ; g q q Ž pq q y q
a ;k 1 ,k 2 ,k 3 . s 2
k1H
(x x 1
2
2 x3
²1 3: ²3 2:
,
qH 1
B1g ; g q q Ž pq q y q
a ;k 1 ,k 2 ,k 3 . s y2 x2 x3 ,
k3H k 1 H ²2 3:
qH 1
B2g ; g q q Ž pq q y q
a ;k 1 ,k 2 ,k 3 . s y2 x 22 , Ž 5.13 .
k2 H k 1 H ²2 3:
with momentum fractions as in Eq. Ž5.5.. The impact factors from the non-PT
subamplitudes w61x are,
A1g ; g q q Ž py q y q
a ;k 1 ,k 2 ,k 3 .
2
g Ž k 1 ,k 3 ,k 2 . s123 (x g Ž k 1 ,k 3 ,k 2 . s123 a Ž k 1 ,k 3 ,k 2 . s3 b bX
s2
½ w 1 3 x²1 2 :s23 x 1 x 2 x 3
2
q
1
)
s12²2 3 :k 1H x 3 x 3 w1 3x k 2 H (
Ž k 3 H qq H . a Ž k 1 ,k 3 ,k 2 . s3 b bX < q H < 2 x 12 x 23r2
y y
(x 2 x 3 s12 < k 1 H < 2 k 2 H (x 3 s23 < k 1 H < 2 Ž x 2 q x 3 .
x 1 s 3 b bX g Ž k 1 ,k 3 ,k 2 . s123
y
s12 s23 < k 1 H < x 3 2 (
s123 x 2 x 3 a Ž k 1 ,k 3 ,k 2 . q
(x x3
2
a Ž k 1 ,k 3 ,k 2 . Ž x 2 q x 3 .
ž
= yx 2 q
x1 /5 , Ž 5.14 .
A 2g ; g q q Ž py q y q
a ;k 1 ,k 2 ,k 3 .
° g Ž k 1 ,k 3 ,k 2 . s123
2
x2 x3 ž y w2 3x (x q w1 3x x 2 ( /
s 2~y ( 1
¢ w 1 3 x²1 2 :s23 x 1 x 2 x 3
y
x1 s12 w 2 3 x
k 2 H k 3)H y w 2 3 x x 2 x 3
ž ( /
2
(x 2 x3 (x 2 x 3 g Ž k 1 ,k 3 ,k 2 . s123 ¶•
q
(
s23 k 3)H s1 b bX x 2 x 3
q
s23
q
x 1²1 2 :²2 3 :w 1 3 x k 3)H ß,
Ž 5.15 .
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 229
A 3g ; g q q Ž py q y q
a ;k 1 ,k 2 ,k 3 .
2 2
(
k 3 H x 2 a Ž k 3 ,k 1 ,k 2 . s1 b bX g Ž k 1 ,k 3 ,k 2 . s123 w 1 2 x
s2
½ k 2 H < k 2 H < 2 s23 x 3
(
y
x 1 x 2 x 3 w 1 3 x s13 s23
A 4g ; g q q Ž py q y q
a ;k 1 ,k 2 ,k 3 .
2 ) 2
Ž k 2 H qq H . (x 1 a Ž k 1 ,k 2 ,k 3 . Ž k 2 H qq H . q H x 1 g Ž k 1 ,k 3 ,k 2 . s123
s2
½ 2
²1 3 :< k 1 H < k 3 H (x 2
y
²2 3 :w 1 3 x < k 1 H < 2 s13(x 1 x 3 x 2
w 1 2 x g Ž k 1 ,k 3 ,k 2 . 2 s123 x 12 < q H < 2(x 2 x 3
q q
w 1 3 x s13 s23 x 1 x 2 x 3 < k 1 H < 2 s23 Ž x 2 q x 3 .
w 1 2 x x 1 s123 y Ž k 2 H qq H . x 2(x 1 x 3 w 1 3 x q s2 b bX g Ž k 1 ,k 3 ,k 2 .
y
x 2 < k 1 H < 2 s13 s23 w 1 3 x
x 1 Ž k 2 H qq H . w 1 2 x g Ž k 1 ,k 3 ,k 2 . s123 Ž x 2 q x 3 .
q
s13 < k 1H < 2 s23 x 1 x 3 x 2
( 5 , Ž 5.17 .
B1g ; g q q Ž py q y q
a ;k 1 ,k 2 ,k 3 .
y a Ž k 3 ,k 1 ,k 2 . 2 s1 b bX Ž k 2 H q q H . 2 w 1 3 x x 1 x 3 a Ž k 1 ,k 2 ,k 3 .
'
s2
½ k 3)H s23 k 2 H 'x 2 x3
q
< k1 H < 2 < k 3 H < 2 k 3 H x 2
'
w 1 3 x x 1 x 3 ² 2 3 : x 2 < k 3 H < 2 y Ž k 2 H q q H . s23 Ž x 1 q x 2 . x 3 q k 1 H k 2 H w2 3 x x 3 x 2
' ' ' '
y 2 2
'x 2
< k 1 H < < k 3 H < s23
'x 1 qH x2
y 2
< k 1 H < s23 x 2
'
)
y w1 3x k 1 H x 2 q q H x1 x 3 ' ž x2 q x3
q k2 H / 5 , Ž 5.18 .
B2g ; g q q Ž py q y q
a ;k 1 ,k 2 ,k 3 .
°yŽ k 1 H qk 2 H . ž k 1)H q w 1 2 x (x 1 x 2 / (x 2
2
s 2~
¢ < k 1 H < 2 k 2)H s3 b bX x 3(
2
( ž
k 3 H x 2 k 3)H y w 2 3 x x 2 x 3 ( /
q
k 2)H s23 s1 b bX x 33r2
230 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
x2
y
< k 1 H < < k 2 H < 2 s23 x 3
2
yk 3 H w 2 3 x s3 b bX x 1 x 2 q k 1)H q w 1 2 x x 1 x 2
ž ( /
ž (
= ²2 3 :q H k 3)H y w 2 3 x x 2 x 3 y k 3 H w 2 3 x k 2 H Ž x 2 q x 3 .
ž / /
(x 2 x 1 x 32
y < k 3 H < 2 s 3 b bX x 1 x 2 q < q H < 2 < k 2 H < 2
< k 1 H < 2 < k 2 H < 2 s23 x 33r2 x2 q x3
¶•
q k 1)H q w 1 2 x x 1 x 2 ( /Žk < <2
3 H s1 b b x 2 x 3 q k 2 H k 3 H Ž x 2 q x 3 . .
ž X
ß. Ž 5.19 .
The functions A and B for the remaining helicity configurations are derived using the
relations
A ig ; g q q Ž pan a ;kq q y g ; g qq na q y q
1 ,k 2 ,k 3 . s yA 5yi Ž pa ;k 1 ,k 3 ,k 2 . , i s 1,2,3,4,
Ž 5.20 .
Big ; g q q Ž pan a ;kq q y
1 ,k 2 ,k 3 . g ; g qq
s yB3yi Ž pan a ;kq y q
1 ,k 3 ,k 2 ., i s 1,2.
A q g ™ q 3 g Ž py a
n1
,k 1n 1 ,k 2n 2 ,k 3n 3 < p bnXbX , p bn b .
n1
1 X X
s 2 s I q ; q g g Ž py a ;k 1n 1 ,k 2n 2 ,k 3n 3 . ig f b b c C g ; g Ž p bn b ; p bnXbX . , Ž 5.21 .
™
t
with k 1 the final-state quark, and the NNLO impact factor q g ) q g g,
n1
I q ;q g g
Ž py
a ;k 1n 1 ,k 2n 2 ,k 3n 3 .
X
n1
sg 3
Ý Žl ds 2
l ds 3
lc . d1 a A q ; q g g Ž py
a ;k 1n 1 ,ksns2 2 ,ksns3 3 .
sgS 2
X
n1
q Ž lcl ds 2 l ds 3 . d1 a B1q ; q g g Ž py
a ;k 1n 1 ,ksns2 2 ,ksns3 3 .
X
n1
q Ž l ds 2 lcl ds 3 . d1 a B2q ; q g g Ž py
a ;k 1n 1 ,ksns2 2 ,ksns3 3 . . Ž 5.22 .
The NNLO impact factor allows for eight helicity configurations. From the PT subam-
plitudes Ž2.12., Ž2.13. we obtain
qH x1 1
A q ; q g g Ž py q q q
a ;k 1 ,k 2 ,k 3 . s 2 i ,
k 1 H x 3 ²1 2: ²2 3: (
qH x1 x 3 1
B1q ; q g g Ž py q q q
a ;k 1 ,k 2 ,k 3 . s 2 i
k3H ( x2 k 1 H ²2 3:
,
qH x1 1
B2q ; q g g Ž py q q q
a ;k 1 ,k 2 ,k 3 . s 2 i Ž 5.23 .
k1H (x 2 k 3 H ²1 2:
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 231
and
A q ; q g g Ž pq y q q
a ,k 1 ,k 2 ,k 3 . s yx 1 A
q ;q g g
Ž py q q q
a ;k 1 ,k 2 ,k 3 . ,
Biq ; q g g Ž pq y q q
a ;k 1 ,k 2 ,k 3 . s yx 1 Bi
q ;q g g
Ž py q q q
a ;k 1 ,k 2 ,k 3 . , i s 1,2. Ž 5.24 .
The impact factors from the non-PT subamplitudes w61x are
A q ; q g g Ž py q y q
a ;k 1 ,k 2 ,k 3 .
°k ž k )
1H q w 1 2 x (x 1 x 2 / ) 2 2 2
g Ž k 1 ,k 2 ,k 3 . g Ž k 1 ,k 3 ,k 2 . s123
s 2 i~
2H
q
¢ < k 1 H < 2 w 1 2 x s3 b bX(x 2 3r2
Ž x 1 x 2 x 3 . s12 s23²2 1 :w 1 3 x
w 1 3 x²1 2 :x 1 ž (x 3 ²2 3 :y k 1 H (x 2 / q k 1 H (x 3 g Ž k 1 ,k 3 ,k 2 . s123
q
s12 s23 k 1H x 1 x 3 (
(x 1 x1 x 2 x 3
q 2
k 1)H x 3 Ž q H x 2 q k 2 H Ž x 2 q x 3 . . y < q H < 2
s23 < k 1 H < x 3 x2 q x3
¶
qx 1 yq H k 3)H x 2 q k 2 H x 3 k 1)H x 3 y w 2 3 x x 2
ž ( ž ( ( / / ß•, Ž 5.25 .
A q ; q g g Ž py q q y
a ; k1 , k 2 , k 3 .
b Ž k 1 , k 2 , k 3 . 2 s 3 b bX k 2)H '
b Ž k 1 , k 2 , k 3 . s 3 b bX k 2)H x 1 Ž x 3 q x 2 .
s2 i
½ w 1 2 x s 12 < k 1 H < 2
x 23r2
q
s 12 < k 1 H < 2 x 3 x 2 w 3 2 x
'
)
k 2)H x 13r2 q H g Ž k 1 , k 2 , k 3 . s 132
y
s 12 < k 1 H < 2 w 3 2 x ' x1 x 3 ² 1 2:
'
q s 13 2 x 3 x 2
y
'x 2
)
b Ž k 1 , k 2 , k 3 . s 3 b bX q H
q
g Ž k 1 , k 3 , k 2 . g Ž k 1 , k 2 , k 3 . 2 s 132
) 2
3r 2
x1 ' x 3 k 2)H Ž x1 x 3 x 2 . s 12 s 32² 3 1 : w 1 2 x
2
w
x 1 ys 12 < q H < x 22 x 1 x 3 q ² 1 3 : b Ž k 1 , k 2 , k 3 . s 3 b bX k 2)H Ž x 3 q x 2 . 2
' x
q
s 12 s 32 < k 1 H < 2 x 2 x 3 Ž x 3 q x 2 .
'
'x 1 s 3 b bX w w 1 2 x² 1 3 : k 2)H ' x 13 x 2 '
q b Ž k 1 , k 2 , k 3 . k 2)H s 132 x 1 x 3 y ² 1 3 : k 1)H q H
Ž )
x2 .x
q
w 1 2 x s 12 s 32 < k 1 H < 2 x 2 x 3
' 5 Ž 5.26 .
B1q ; q g g Ž py q y q
a ;k 1 ,k 2 ,k 3 .
2
Ž k 2 H qq H . w 1 3 x x 1(x 3 k 2 H w1 3x x 1 x 3
s2i
½ 2
< k 1 H < k 3 H s 2 b bX
)
q
2
²2 3 :k 1)H < k 3 H < 2(x 2
a Ž k 2 ,k 1 ,k 3 . a Ž k 3 ,k 1 ,k 2 . s12b bX w 1 3 x (x 3
y 2
q
< k3H < k 3)H k 2 H s23 (x 1 x2 < k 1 H < 2 < k 3 H < 2 s23 x 2
x3
q k 2 H k 1)H < k 3 H < 2 x 1 q k 2 H < k 3 H < 2 w 1 2 x x 13r2 x 2 (
< k 1 H < < k 3 H < 2 s23 x 2 x 1
2
(
q H < k 3 H < 2 x 12 x 2
)
yq H
ž x2 q x3
q < k1H < 2 k 2 H Ž x1 q x 2 . x 3
/5 , Ž 5.27 .
B1q ; q g g Ž py q q y
a ;k 1 ,k 2 ,k 3 .
) 2
a Ž k 3 ,k 1 ,k 2 . a Ž k 2 ,k 1 ,k 3 . s12b bX
s2i y
½ k 3 H k 2)H < k 3 H < 2 s32 x 1 x 2 (
²1 3 :(x 3 a Ž k 1 ,k 2 ,k 3 . 2 s2 b bX (x 3
y 2 2
q
< k1H < < k 3H < k 3H < k 1 H < < k 3 H < 2 s23
2
(x x1 3
= w 1 2 x ²1 3 : Ž s2 b bX x 1 y s1 b bX x 2 . q (x 1 ž yk 1 H q H x 33r2
(x 2
2
x1 y x 2
(
q²1 3 :k 3 H x 1 Ž x 3 q x 2 . / ž )2
qH y w1 2x y
(x 1 x2
)
qH w1 2x /
< k 3H < 2qH x1 (
)
yq H x1 (x 3
ž x3 q x2
q ²1 3 :Ž s1 b bX q s2 b bX . x 3 ( /5 , Ž 5.28 .
2i ° ~Žk 2 H qq H . x 1 ž k 1)H q w 1 2 x (x 1 x 2 /
B2q ; q g g Ž py q y q
a ;k 1 ,k 2 ,k 3 . s
< k1H < ¢
2
k 3 H w 1 2 x (x 2
2
k 2 H k 1)H q w 1 2 x x 1 x 2
ž ( /
y
w 1 2 x s3 b bX(x 2
2
Ž k 2 H qq H . w 1 3 x x 1(x 3 • ¶
y
k 3 H s 2 b bX ß, Ž 5.29 .
B2q ; q g g Ž py q q y
a ;k 1 ,k 2 ,k 3 .
2i ²1 3 :(x 3 a Ž k 1 ,k 2 ,k 3 . 2 s2 b bX 2
b Ž k 1 ,k 2 ,k 3 . s3 b bX k 2)H
s
< k1H < 2 ½ k3H < k3H < 2
q
s12 w 2 1 x x 23r2
x1 ²1 2 :x 3 a Ž k 1 ,k 2 ,k 3 . s2 b bX
y
²1 2 : k 3)H (x 1 k3H
)
qH x 1 y k 3)H x 2 b Ž k 1 ,k 2 ,k 3 . s3 b bX
q 1q
ž (x 1 x2 w1 2x / (x 2
5 . Ž 5.30 .
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 233
A q g ™ q Q Q g Ž py
a
n1
,k 1n 1 ,k 2n 2 ,ky
3
n 2 < nXbX
p b , p bn b .
n1
1 X
s 2 s I q ; q Q Q Ž py
a ;k 1n 1 ,k 2n 2 ,ky
3
n2
. ig f b b c C g ; g Ž p bn b ; p bnXbX . , Ž 5.31 .
t
n1
s g 3 lcd 3 a d d1 d 2 A1q ; q Q Q Ž py
a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
1 n1
y lcd1 a d d 3 d 2 A q2 ; q Q Q Ž py
a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
Nc
n1
qlcd1 d 2 d d 3 a B1q ; q Q Q Ž py
a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
y
1
Nc
lcd 3 d 2 d d1 a B2q ; q Q Q Ž py
a
n1
;k 1n 1 ,k 2n 2 ,ky
3
n2
. y dq Q Ž 1 l 3. . Ž 5.32 .
The term proportional to d q Q is due to the interference of identical quarks Ži.e. with the
same flavor and helicity . in the final state. The NNLO impact factor allows for four
helicity configurations. From the non-PT subamplitudes w78x we obtain
A1q ; q Q Q Ž py q y q
a ;k 1 ,k 2 ,k 3 .
x 1 x 3 a Ž k 1 ,k 3 ,k 2 . (x x 3
x3 < qH < 2 g Ž k 1 ,k 3 ,k 2 .
si
½( x2 < k1H < 2
q
1 2
x1 g Ž k 1 ,k 3 ,k 2 . s123
q
(x 2 x 3 s23 < k 1 H < 2 (x 1 a Ž k 3 ,k 1 ,k 2 . s a23 y
(x 1
5 ,
i g Ž k 1 ,k 3 ,k 2 .
A q2 ; q Q Q Ž py q y q
a ;k 1 ,k 2 ,k 3 . s
(x 2 x 3 s23 ž (x 1
(
q x 1 a Ž k 3 ,k 1 ,k 2 . ,
/
234 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
x 1 x 2 b Ž k 1 ,k 3 ,k 2 . (x x3
x3 < qH < 2
B1q ; q Q Q Ž py q y q
a ;k 1 ,k 2 ,k 3 . s i
½( x3 < k1H < 2
y
1 2
x1 a Ž k 3 ,k 1 ,k 2 . x1
q ( x2 x3 s23
y
(x 2 x 3 s23 < k 1 H < 2
g Ž k 1 ,k 3 ,k 2 . s123
(
= x 1 a Ž k 3 ,k 1 ,k 2 . s a23 y
(x 1
5 ,
(
i x1 x2 x3
B2q ; q Q Q Ž py q y q
a ;k 1 ,k 2 ,k 3 . s
< k1H < 2 ž( x3
b Ž k 1 ,k 3 ,k 2 . q ( x2
a Ž k 1 ,k 3 ,k 2 .
/
Ž 5.33 .
with a , b ,g defined in Eq. Ž5.8. and
A qi ; q Q Q Ž pq y y q q ;qQQ
a ;k 1 ,k 2 ,k 3 . s A i Ž py q y q
a ;k 1 ,k 3 ,k 2 . , i s 1,2,
Biq ; q Q Q Ž pq y y q
a ;k 1 ,k 2 ,k 3 . s Biq ; q Q Q Ž py q y q
a ;k 1 ,k 3 ,k 2 . , i s 1,2 . Ž 5.34 .
Note that for each helicity configuration, we have the following relation between the
functions A and B:
A1q ; q Q Q q B1q ; q Q Q s A q2 ; q Q Q q B2q ; q Q Q , Ž 5.35 .
5.5. NNLO impact factors in the high-energy limit
The amplitudes Ž5.10., Ž5.11., Ž5.21. and Ž5.31. have been computed in the kinematic
limit Ž5.1., in which they factorize into an effective amplitude with a ladder structure,
made of a three-parton forward cluster and a LO impact factor connected by a gluon
exchanged in the crossed channel ŽFig. 3.. In the limits y 1 , y 2 4 y 3 or y 1 4 y 2 , y 3 ,
the amplitudes must factorize further into NLO impact factors or into NLO Lipatov
vertices for the production of two partons along the ladder. Such limits constitute then
necessary consistency checks, and we display them in this section.
Fig. 4. Limits of the amplitude for the production of three gluons in the forward-rapidity region of gluon pa ,
for y1 4 y 2 , y 3 Ža. and y1 , y 2 4 y 3 Žb..
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 235
Ž 5.36 .
with the NLO Lipatov vertex, g g
given by w38–40x
) )
™
g g, for the production of two gluons k 2 and k 3
q1)H q2 H x2 1
A g g Ž q1 ,kq
2 ,kq
3 ,q 2 . s 2
k2 H x 3 ²23:
, (
A g g Ž q1 ,kq y
2 ,k 3 ,q 2 .
Ž q2 H qk 3 H . q2 H qk 3 H ky y
2 q k3 kq q
2 q k3
q
s 3 b bX
y
s23 ky
2
k2 H y
kq
3
k 3H
5 Ž 5.37 .
™
In the limit y 1 , y 2 4 y 3 , the NNLO impact factor in Eq. Ž5.10. factorizes into a
NLO impact factor, g ) g g g, Eq. Ž4.8., convoluted with a multi-Regge ladder ŽFig.
4b.
3 X X XX
lim
y1,y 24y 3
½Ž ig . Ý
sgS 3
f a ds 1c f c ds 2 c f c ds 3 c A g ;3 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 . 5
2 X
½
s Ž ig . Ý
sgS 2
f a ds 1c f c ds 2 cA g ;2 g Ž pan a ;ksns1 1 ,ksns2 2 . 5
1 X XX
= ig f c d 3 c C g Ž q1 ,k 3n 3 ,q2 . , Ž 5.38 .
t1
with q1 s yŽ pa q k 1 q k 2 ., and with LO Lipatov vertex C g Ž q1 ,k 3n 3 ,q2 ., Eq. Ž4.10..
236 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
In the limit y 1 4 y 2 , y 3 , the functions A and B in Eq. Ž5.13. – Ž5.20. fulfill the
1 X
s ig f a d1 c C g ; g Ž pan a ;k 1n 1 . g2 Ž lclc . d 3 d 2 A q q Ž q1 ,k 2n 2 ,ky3 n 2 ,q2 .
t1
X
n2
q Ž lclc . d 3 d 2 A q q Ž q1 ,ky ,k 2n 2 ,q2 . 4, Ž 5.39 .
™ q q, for the production of a qq pair w40,41x
3
A q q Ž q1 ,kq y
2 ,k 3 ,q 2 .
kq kq <
3 q2 H
<2 ky <
3 k 3 H q1 H
<2 kq )
3 k 2 H Ž q 2 H qk 3 H .
s y2 ( ½ 2
kq
3 Ž kq2 q kq3 . s23
q
k 2 H Ž ky y
2 q k 3 . s 23
q
kq
2 s3 b b
X
Ž 5.40 .
with q1 , q2 , and s3 b bX as in Eq. Ž5.37..
In the collinear limit, k 2 s zP and k 3 s Ž1 y z . P, the NLO Lipatov vertex Ž5.40.
reduces to the splitting factor Ž4.19., and amplitude Ž5.11. factorizes into a multi-Regge
amplitude Ž4.11. times a collinear factor Ž4.18.
lim A g g ™ g q q g Ž pan a ,k 1n 1 < k 2n 2 ,ky
3
n 2 < nXbX
p b , p bn b .
k2<< k3
In the limit y 2 , y 3 4 y 1 , the functions A and B in Eq. Ž5.13. – Ž5.20. fulfill the
relations A1g ; g q q s A 4g ; g q q s 0, B2g ; g q q s yA 3g ; g q q, and B1g ; g q q s yA 2g ; g q q thus the
Fig. 5. Same as Fig. 4 for the production of a quark–antiquark pair and a gluon in the forward-rapidity region
of gluon pa .
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 237
Fig. 6. Same as Fig. 4 for the production of a quark and two gluons in the forward-rapidity region of quark pa .
NNLO impact factor, Eq. Ž5.12., factorizes into a NLO impact factor, g ) g
Ž4.12., convoluted with a multi-Regge ladder ŽFig. 5b.,
™ q q, Eq.
lim I g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
y 2,y 34y 1
n1
s g lcd1 a C q ; q Ž py
a ;k 1n 1 .
1 2 X X XX
=
t1 ½Ž ig . Ý
sgS 2
5
f c ds 2 c f c ds 3 c A g g Ž q1 ,ksns2 2 ,ksns3 3 ,q2 . . Ž 5.42 .
In the limit y 1 , y 2 4 y 3 , the functions A and B in Ž5.23. – Ž5.30. fulfill the relations
n1
A q ; q g g Ž py
a ;k 1n 1 ,k 3n 3 ,k 2n 2 . s B1q ; q g g Ž py
a
n1
;k 1n 1 ,k 2n 2 ,k 3n 3 . s 0,
n1
B2q ; q g g Ž py
a ;k 1n 1 ,k 2n 2 ,k 3n 3 . s yA q ; q g g Ž py
a
n1
;k 1n 1 ,k 2n 2 ,k 3n 3 . ,
n1
B2q ; q g g Ž py
a ;k 1n 1 ,k 3n 3 ,k 2n 2 . s B1q ; q g g Ž py
a
n1
;k 1n 1 ,k 3n 3 ,k 2n 2 . ,
238 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
Fig. 7. Limit of the amplitude for the production of a quark and a quark–antiquark pair in the forward-rapidity
region of quark pa , for y1 4 y 2 , y 3 .
™
thus the NNLO impact factor, Eq. Ž5.22., factorizes into a NLO impact factor,
q g ) q g, Eq. Ž4.14., convoluted with a multi-Regge ladder ŽFig. 6b.
X
n1
lim I q ; q g g Ž py
a ;k 1n 1 ,k 2n 2 ,k 3n 3 . s g 2 Ž l d 2lc . d1 a A q ; q g Ž py
a
n1
;k 1n 1 ,k 2n 2 .
y1,y 24y 3
X
n1
q Ž lcl d 2 . d1 a B q ; q g Ž py
a ;k 1n 1 ,k 2n 2 . 4
1 X XX
= ig f c d 3 c C g Ž q1 ,k 3n 3 ,q2 . , Ž 5.43 .
t1
with q1 s yŽ pa q k 1 q k 2 ..
In the limit y 1 4 y 2 , y 3 , the function A 2 in Eq. Ž5.33. vanishes, A q;
™
qQQ
2 s 0, and
)
using Eqs. Ž5.33.-Ž5.35. the NNLO impact factor, q g qQ Q, Eq. Ž5.32., factorizes
into a NLO Lipatov vertex for the production of a qq pair Ž5.40. convoluted with a
multi-Regge ladder ŽFig. 7.
n1
lim I q ; q Q Q Ž py
a ;k 1n 1 ,k 2n 2 ,ky
3
n2
.
y14y 2,y 3
n1
1 X
s g lcd1 a C q ; q Ž py
a ;k 1n 1 . g2 Ž lclc . d 3 d 2 A q q Ž q1 ,k 2n 2 ,ky3 n 2 ,q2 .
t1
X
n2
q Ž lclc . d 3 d 2 A q q Ž q1 ,ky
3 ,k 2n 2 ,q2 . 4. Ž 5.44 .
s Ý A . . . c . . . Ž . . . , P n , . . . . P Splityf ™
n
f1 f 2 f 3
Ž k 1n 1 ,k 2n 2 ,k 3n 3 . . Ž 5.45 .
n
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 239
Accordingly, we must show that taking the triple collinear limit of the NNLO impact
factors, we can write the amplitudes Ž5.10., Ž5.11., Ž5.21. and Ž5.31. as
™ ™
In the triple collinear limit, the functions A of Sections 5.1, 5.2, 5.3 and 5.4 yield a
quadratic divergence as s123 0 or si j 0 with i, j s 1,2,3. In the same limit, the
functions B have a single collinear divergence since only two out of the three partons
are color adjacent. However, terms with a single divergence when integrated over the
triple collinear region of phase space yield a negligible contribution w20x, thus we ignore
them.
It is easy to show that a function A g ;3 g , Eqs. Ž5.3. – Ž5.7., differs from its reflection
by a term which contains only a single divergence. Using this property and Eq. Ž5.9., we
obtain a dual Ward identity and a reflection identity for the functions A g ;3 g , up to singly
divergent terms,
A g ;3 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 .
q A g ;3 g Ž pan a ;k 1n 1 ,k 3n 3 ,k 2n 2 . q A g ;3 g Ž pan a ;k 3n 3 ,k 1n 1 ,k 2n 2 . s 0,
3 X X XX
Ž ig . Ý f a ds 1c f c ds 2 c f c ds 3 c A g ;3 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 .
sgS 3
3
Ž ig . XX X X
s f acc Ý f c ds 1c f c ds 2 ds 3 A g ;3 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 .
3 sgS 3
XX
s igf acc ½ g2 Ý Ž Fd
sgS 2
s1
5
F ds 2 . c d 3 A g ;3 g Ž pan a ;ksns1 1 ,ksns2 2 ,k 3n 3 . , Ž 5.48 .
where Ž F a . b c ' if b ac. Thus amplitude Ž5.10. can be put in the form of Eq. Ž5.46. with
collinear factor
Splityg ™
n
3g
Ž k 1n 1 ,k 2n 2 ,k 3n 3 . s g 2 Ý Ž Fd s1 F ds 2 . c d 3 splityg ™
n
3g
Ž ksns1 1 ,ksns2 2 ,k 3n 3 . .
sgS 2
Ž 5.49 .
The splitting factors splityg ™
n
3g
are the functions A, Eqs. Ž5.3. – Ž5.7., in the triple
collinear limit, up to singly divergent terms, and thus they fulfill the identities, Eq.
240 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
Ž5.47.. The splitting factors of PT type can be soon read off from Eqs. Ž5.3. – Ž5.6., while
for the ones of non-PT type we note that the coefficients of Eq. Ž5.8. reduce to
a Ž k 1 ,k 2 ,k 3 . ™
z1 z 3
z1 q z 3
,
b Ž k 1 ,k 2 ,k 3 . ™ y (zP z 1 2
)
H
w1 2x ,
g Ž k 1 ,k 2 ,k 3 . ™ (z sz z
1 2
123
3
d Ž 1,2,3 . Ž 5.50 .
with
ž(
d Ž 1,2,3 . ' w 1 2 x z 1 ²1 3: q z 2 ²2 3: . ( / Ž 5.51 .
Thus we obtain
splityg ™ 3 g Ž kq q q
1 1
1 ,k 2 ,k 3 . s 2 ,
(z 1 z3
²1 2:²2 3:
splitqg ™ 3 g Ž kq y q
z 22 1
1 ,k 2 ,k 3 . s 2 ,
(z 1 z3
²1 2:²2 3:
splitqg ™ 3 g Ž kq q y
z 32 1
1 ,k 2 ,k 3 . s 2 ,
(z 1 z3
²1 2:²2 3:
splityg ™ 3 g Ž kq q y
1 ,k 2 ,k 3 .
2
2 s12 z 2 d Ž 1,2,3 . z2
s
s12 s23 Ž 1 y z1 .
q
s123
q ( z1 z 3
Ž 1 y z 3 . d Ž 1,2,3 . ,
splityg ™ 3 g Ž ky q q g™3g
1 ,k 2 ,k 3 . s splity Ž kq3 ,kq2 ,ky1 . ,
splityg ™ 3 g Ž kq y q g™3g
1 ,k 2 ,k 3 . s ysplity Ž ky2 ,kq1 ,kq3 . y splityg ™ 3 g Ž kq1 ,kq3 ,ky2 . .
Ž 5.52 .
A
In the triple collinear limit of the NNLO impact factor g g ) q q g, the functions
g g qq
™
, Eqs. Ž5.13. – Ž5.20., fulfill the relations A 2g ; g q q s yA1g ; g q q and A 4g ; g q q s yA 3g ; g q q,
and A 3g ; g q q Ž pan a ;k 1n 1 ,k 2n 2 ,ky
3
n2.
s A1g ; g q q Ž pan a ;k 1n 1 ,ky
3
n2 n2.
,k 2 . Thus amplitude Ž5.11. can
be put in the form of Eq. Ž5.46. with collinear factor
Splityg ™
n
g qq
Ž k 1n 1 ,k 2n 2 ,ky3 n 2 . s g 2 Ž lcld1 . d 3 d 2 splityg ™
n
g qq
Ž k 1n 1 ,k 2n 2 ,ky3 n 2 .
q Ž l d1lc . d 3 d 2 splityg ™
n
g qq
Ž k 1n 1 ,ky3 n 2 ,k 2n 2 . , Ž 5.53 .
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 241
with
splitqg ™ g q q Ž kq
z 23
y q
1 ,k 2 ,k 3 . s 2 ) z1
1
²1 2:²2 3:
,
splitqg ™ g q q Ž kq
z2 1
q y
1 ,k 2 ,k 3 . s 2 z 3 ( z 1 ²1 2:²2 3:
,
splityg ™ g q q Ž kq y q
1 ,k 2 ,k 3 .
2
2 d Ž 1,3,2 . w 1 2 x d Ž 1,3,2 . z 2 Ž z1 y z 3 . (
z1 z 2 w2 3x
sy
s12 s23 w 1 3 x s123
q
z3 ž (z 1
y
w1 3x z 3 /
(z 2 Ž yz 2 s13 q z 3 s23 q z1 z 2 s123 .
q ,
(z3 Ž 1 y z1 .
splityg ™ g q q Ž kq q y
1 ,k 2 ,k 3 .
2
2 d Ž 1,2,3 . w 1 3 x d Ž 1,2,3 . Ž 1 y z 3 . (z 2 z 3 s12
sy q q . Ž 5.54 .
s12 s23 w 1 2 x s123 (z 1 Ž 1 y z1 .
™
Writing the functions A, Eqs. Ž5.23. – Ž5.30., in the triple collinear limit of the NNLO
impact factor q g ) q g g, the amplitude Ž5.21. can be put in the form of Eq. Ž5.46.
with collinear factor
q™ qgg q™ qgg
Splity n Ž k 1n ,k 2n 2 ,k 3n 3 . s g 2 Ý Ž ld s2 l ds 3 . d1 c splity n Ž k 1n ,ksns2 2 ,ksns3 3 . ,
sgS 2
Ž 5.55 .
with
q™ qgg
2i 1
splity Ž kq1 ,kq2 ,kq3 . s y ,
(z 3
²1 2:²2 3:
q™ qgg
2 iz 1 1
splitq Ž ky1 ,kq2 ,kq3 . s ,
(z 3
²1 2:²2 3:
q™ qgg
splity Ž kq1 ,ky2 ,kq3 .
2
s
2i d Ž 1,3,2 . ž (z w 1 3 x q (z 2 w 2 3 x /
1
q
(z2 Ž 1 y z3 . d Ž 1,3,2.
s12 s23 w 1 3 x s123 (z3
(z 1 z 2 s12
q
1 y z1
(
q z 2 ²2 3 :w 1 3 x ,
242 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
2
q
splity ™ q g g Ž kq1 ,kq2 ,ky3 . s 2i d Ž 1,2,3 . ž (z w 1 2 x y (z 3 w 2 3 x /
1
In the triple collinear limit of the NNLO impact factor q g ) q Q Q, the functions
A Ž5.33. fulfill the relation A1q; q Q Q s A q;
2
qQQ
. Thus the amplitude Ž5.31 . can be put in the
form of Eq. Ž5.46., with collinear factor
q ™ qQQ
Splity n Ž k 1n ,k 2n 2 ,k 3n 3 . s 2 g 2 lad1 c lad 3 d 2 P splityq ™
½ n
qQQ
Ž k 1n ,k 2n 2 ,k 3n 3 .
q ™ qQQ
yd q Q l ad 3 c l ad1 d 2 P splity n Ž k 3n 3 ,k 2n 2 ,k 1n 1 . , Ž 5.57 . 5
where the second term occurs for the case of identical quarks, and c is the color index of
the parent quark. The splitting factors are
q ™ qQQ
i (z 1 z2 z3 d Ž 1,3,2 .
splity
q ™ qQQ
Ž kq1 ,ky2 ,kq3 . s
s23 ž™ 1 y z1
q
s123 / ,
Splitlf ™ f 1 f 2 f 3 Ž k 1n 1 ,k 2n 2 ,k 3n 3 . Split rf ™ f 1 f 2 f 3 Ž k 1n 1 ,k 2n 2 ,k 3n 3 .
)
Ý
n 1n 2 n 3
Plrf ™ f 1 f 2 f 3 ,
X 4g4
s d cc 2 Ž 5.59 .
s123
f ™ f1 f 2 f 3 f ™ f1 f 2 f 3 f ™ f1 f 2 f 3 f ™ f1 f 2 f 3 . )
q ™ q f2 f3 q™ q g g q™ q Q Q
and P q1 ™ q1 q 2 q 2 , where the last splitting
where Pqq s Pyy , and Pyq s Ž Pqy . For splitting functions of
type P , namely for P , P
function is for identical quarks, helicity conservation on the quark line sets the
off-diagonal elements equal to zero.
Averaging over the trace of matrix Ž5.59., i.e. over color and helicity of the parent
parton, we obtain the unpolarized Altarelli–Parisi splitting functions w20x
<Splityf ™ ² P f ™ f1 f 2 f 3 : ,
1 4g4
Ý n
f1 f 2 f 3
Ž k 1n 1 ,k 2n 2 ,k 3n 3 . < 2 s 2 Ž 5.60 .
2C nn 1n 2 n 3 s123
with C defined below Eq. Ž4.22.. For ² P g ™ g 1 g 2 g 3 :, the sum over colors can be
immediately done using Eq. Ž2.5., and it yields
<Splityg ™
n
g1 g 2 g 3
Ž k 1n 1 ,k 2n 2 ,k 3n 3 . < 2 s 4C
C4 Ž Nc . Ý <splityg ™
n
3g
Ž ksns 1 ,ksns 2 ,ksns 3 . < 2 ,
1 2 3
sgS 3
Ž 5.61 .
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 243
with CnŽ Nc . as in Eq. Ž2.8.. Eq. Ž5.61. shows that for the purely gluonic unpolarized
(z s 2 zs 3 2 Ž 1 y zs 2 . zs 2
y
ss1s 3
Ds 2 Ds 3 y3 q
ž Ž 1 y zs . zs
3 3
/
zs1 zs 2 Ds22Ž 1 y 2 zs 3 . zs 1 zs 3 Ds23Ž 1 y 2 zs 2 .
with
y
Ž 1 y zs . zs 3 3
y
Ž 1 y zs . zs
2 2
5 , Ž 5.62 .
Di s w i j x z j q w i k x z k
( ( with i , j,k s 1,2,3 and j,k / i Ž 5.63 .
and
g ™ g1 q2 q3
Ž C P g ™ g 1 q 2 q 3 Ž ab . q CA Pqy
g ™ g 1 q 2 q 3 Ž nab .
1
Pqy s ., Ž 5.64 .
2 F qy
where the abelian and non-abelian terms are
g ™ g 1 q 2 q 3 Žab.
2 s123 2
Pqy s
s12 s13
½ z D y 2(z
1 1 2 z3 D2 D3 , 5
g ™ g 1 q 2 q 3 Žnab.
s123 1
Pqy s Ý
sgS 2 s1 s 2 s1 s 3
2
½ y z 1 D 12 q zs 2 zs 3 Ds 2 Ds 3
2
(
2 w s 2 s 3 x zs 2 zs 3 s1 s 2 s1 s 3 s1 s 3 2 Ds23 zs2 2 zs 3
q q yzs 2 Ds22 q
ss2 2 s 3Ž 1 y z 1 . z 1 ss 2 s 3 Ž 1 y z1 . z1
2 zs 2Ž zs 3 y z 1 .
(
y zs 2 zs 3 Ds 2 Ds 3 1 q
ž Ž 1 y z1 . z1 /5 . Ž 5.65 .
We have checked that Eqs. Ž5.62. – Ž5.65. agree with the corresponding spin-corre-
lated splitting functions of Ref. w21x after contracting them with a parent-gluon polariza-
tion as in Appendix E, and after setting the RS parameter e s 0.
purely gluonic clusters there are 2 nq 1 helicity configurations. However, in the high-en-
ergy limit two of these are subleading, thus an n-gluon forward cluster contains
2Ž2 n y 1. helicity configurations. For n-parton forward clusters including q q pairs, all
the helicity configurations are leading; then an easy counting yields 2 n helicity configu-
rations for the one including a q q pair, 2 ny 1 for the one including two q q pairs, and so
on. For n s 3, we obtain the helicity configurations dealt with in Section 5.
Fig. 8. Amplitude for the production of five gluons, with gluons k 1 , k 2 , k 3 and k 4 in the forward-rapidity
region of gluon pa .
1 na s y
C g ;4 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 . s
½ x i2 n i s y i s 1,2,3,4
with n s q, Ž 6.5 .
From the non-PT subamplitudes w75x we have obtained the functions of Žyyqqq .
helicities. We do not reproduce them here because they are quite lengthy. They are
available from the authors upon request.
Using the UŽ1. decoupling equations for one and two photons, the functions B and D
in Eq. Ž6.2. can be written as
B g ;4 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 .
s y A g ;4 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 . q A g ;4 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 4n 4 ,k 3n 3 .
D g ;4 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 .
s A g ;4 g Ž pan a ;k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 . q A g ;4 g Ž pan a ;k 1n 1 ,k 3n 3 ,k 2n 2 ,k 4n 4 .
qA g ;4 g Ž pan a ;k 3n 3 ,k 1n 1 ,k 2n 2 ,k 4n 4 . q A g ;4 g Ž pan a ;k 1n 1 ,k 3n 3 ,k 4n 4 ,k 2n 2 .
1, 2 and 3, has only a double collinear divergence; the function D, where gluon 1 is
adjacent to 2 and gluon 3 is adjacent to 4 but the pairs are not adjacent one to another,
has two single collinear divergences.
Using Eqs. Ž6.6. and Ž6.7., we can rewrite Eq. Ž6.2. as
4 X X XX XX XXX
s 2 s Ž ig . ½ sgS 4
Ý f a ds 1c f c ds 2 c f c ds 3 c f c ds 4 c
A g ;4 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 ,ksns4 4 . 5
1 X XXX
= ig f b b c C g ; g Ž p bn b ; p bnXbX . , Ž 6.8 .
t
The amplitude Ž6.8. has been computed in the kinematic limit Ž6.1., in which it
factorizes into a four-gluon cluster and a LO impact factor connected by a gluon
exchanged in the cross channel. In the limits y 1 , y 2 , y 3 4 y4 or y 1 , y 2 4 y 3 , y4 ,
or y 1 4 y 2 , y 3 , y4 , Eq. Ž6.8. must factorize further into a NNLO impact factor or into
a NLO impact factor times a NLO Lipatov vertex, or into a NNLO Lipatov vertex ŽFig.
8., respectively. While the first two limits constitute necessary consistency checks, the
last one allows us to derive the so far unknown NNLO Lipatov vertex for the production
of three gluons along the ladder.
In the limit y 1 , y 2 , y 3 4 y4 , the NNNLO impact factor, g ) g g g g g, in Eq.
Ž6.8. factorizes into a NNLO impact factor, g ) g g g g, Eq. Ž5.10., convoluted with a ™ ™
multi-Regge ladder ŽFig. 8a.
lim
y1,y 2,y 34y4
4 X X XX XX XXX
½
= Ž ig .
sgS 4
Ý f a ds 1c f c ds 2 c f c ds 3 c f c ds 4 c
A g ;4 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 ,ksns4 4 . 5
3 X X XX
½
s Ž ig . Ý
sgS 3
f a ds 1c f c ds 2 c f c ds 3 c A g ;3 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 . 5
1 XX XXX
= ig f c d4 c
C g Ž q1 ,k 4n 4 ,q2 . , Ž 6.9 .
t1
™ ™
In the limit y 1 , y 2 4 y 3 , y4 , the NNNLO impact factor in Eq. Ž6.8. factorizes into
a NLO impact factor, g ) g g g, Eq. Ž4.8., times a NLO Lipatov vertex for production
of two gluons g ) g ) g g Ž5.37., convoluted with a multi-Regge ladder ŽFig. 8b.
lim
y1,y 24y 3,y4
4 X X XX XX XXX
½
= Ž ig . Ý
sgS 4
f a ds 1c f c ds 2 c f c ds 3 c f c ds 4 c
A g ;4 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 ,ksns4 4 . 5
2 X
½
s Ž ig . Ý
sgS 2
f a ds 1c f c ds 2 cA g ; g g Ž pan a ;ksns1 1 ,ksns2 2 . 5
1 2 X XX XX XXX
=
t1 ½Ž ig . Ý
sgS 2
f c ds 3 c f c ds 4 c
A g g Ž q1 ,ksns3 3 ,ksns4 4 ,q2 . , 5 Ž 6.10 .
with q1 s yŽ pa q k 1 q k 2 ..
In the limit y 1 4 y 2 , y 3 , y4 , the NNNLO impact factor in Eq. Ž6.8. factorizes into
a NNLO Lipatov vertex convoluted with a multi-Regge ladder ŽFig. 8c.
lim
y14y 2,y 3,y4
4 X X XX XX XXX
½
= Ž ig . Ý
sgS 4
f a ds 1c f c ds 2 c f c ds 3 c f c ds 4 c
A g ;4 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 ,ksns4 4 . 5
s ig f a d1 c C g ; g Ž pan a ;k 1n 1 .
1 3 X X XX XX XXX
=
t1 ½Ž ig . Ý f c ds 2 c f c ds 3 c f c ds 4 c
A3 g Ž q1 ,ksns2 2 ,ksns3 3 ,ksns4 4 ,q2 . 5 Ž 6.11 .
™
sgS 3
with the NNLO Lipatov vertex, g ) g ) g g g, for the production of three gluons k 2 ,
k 3 and k 4 enclosed in curly brackets in the right-hand side, with
x2 1 q1)H q2 H
A3 g Ž q1 ,kq q q '
2 ,k 3 ,k 4 ,q 2 . s y2 2 ( x 4 ²2 3:²3 4: k2 H
, Ž 6.12 .
A3 g Ž q1 ,ky q q
2 ,k 3 ,k 4 ,q 2 .
( ž
q x 2 yq1H Ž q1)H y k 2)H . k 3)H x 2 (
yq ž yk )
1H
)
2 H k3H (x 2 (
q ²2 3: k 3)H x 3 q k 2 H w 2 3 x x 3 (
( /
q²2 4: k 3)H x 4 q k 2)H Ž ²2 3: k 3)H q k 2 H w 2 3 x . x 3
ž (
< q2 H < 2 x 2 ²2 3: x 3 q ²2 4: x 4
ž ( ( /
q²2 4: k 3)H x 4 ( /// q
(
²2 3:²3 4: s234 x 4
2
q2 H k 2 H Ž q1)H y k 2)H . ž q (x q w 3 4 x (x /
)
2H 3 4
q
s2 a aX s4 b bX ²3 4: k (x )
2H 4
2 2
< q1 H < Ž k . k k (x
2H
) )
3H 4 H 3
q
²3 4: s23 Ž < q1 H yq2 H < 2 q s234 .Ž < k 3 H < 2 x 2 q < k 2 H < 2 x 3 . x 4 (
< q1 H < 2 k 2 H w 3 4 x ²2 4: k 4)H x 3 q ²2 3: k 3)H x 4
ž ( ( /
q 2
(x 2 x 3 x 4 ²3 4: s23 s234 Ž < q1 H yq2 H < q s234 .
w3 4x
q
²3 4: s23 s234 x 2 x 3 x 4 ( ž yq )
1H k2 H x3 x 4 ²2 3: x 3
ž (
( / ( ž
q²2 4: x 4 q q2 H x 2 k 4 H Ž q2)H q k 4)H . x 2 x 3
A 3 g Ž q1 , k q y q
2 ,k 3 ,k 4 ,q 2 .
y
< q2 H <2 'x 2 x 33
y
< q 1 H < 2 Ž k 3 H . 2 k 2)H Ž q 2)H q k 4)H . x 2
²2 3 : s 34 Ž 1 y x 2 . s 4 b bX k 2 H s 23 Ž < k 3 H < 2 x 2 q < k 2 H < 2 x 3 .
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 249
q 1)H k 3 H w 2 4 x 2 ²2 3 : Ž q 1 H yk 2 H . x 2 q ²3 4 : k 2 H x 4
Ž ' ' .
q
k 2 H s 23 s 34 s 234 x 3 '
y
'
k 3 H k 2)H q 2)H k 2 H x 2 y q 1 H Ž q 2)H q k 4)H . x 2 q k 2 H w 2 4 x x 4
Ž ' ' .
s 2 a aX s4 b bX k 2 H x 2 '
y
' Ž ' '
k 2)H x 3 yq 1 H q 2 H k 4)H x 2 x 3 y Ž q 1 H yk 2 H . k 3 H w 3 4 x x 2 x 4 q q 2 H k 2 H w 2 4 x x 3 x 4 ' .
'
s 2 a aX k 2 H s 34 x 2 x 4
2
q2 H w 2 4 x 'x Ž ²3 4 : Ž q
3 q k . 'x y ²2 3 : k 'x .
)
2H
)
4H 2
)
2H 4
q
s s s 'x x23 34 234 2 4
Ž '
qq 1)H k 3 H x 2 q 2 H w 2 4 x x 3 q k 3 H w 3 4 x x 2 x 3 q k 3 H w 2 4 x x 4 .. , Ž 6.14 .
A3 g Ž q1 ,kq q y
2 ,k 3 ,k 4 ,q 2 .
k 2)H
s 2'2
s2 a aX ²2 3: k 2 H s34 x 2 ' Žq 1 H q2 H
²2 3: k 3)H x 2 '
'
y²2 4:w 3 4 x x 2 Ž q1 H y k 2 H .Ž q2 H q k 4 H . q k 2 H x 3 Ž q2 H s23 q Ž q2 H q k 4 H . s34 . . '
q1)H Ž q2 H q k 4 H . 2 Ž q1 H k 3)H 'x 2 y k 2 H w2 3x x 3 ' .
q
s2 a aX s4 b bX ²2 3: k 2 H x 3 '
< q2 H < 2 k 3 H x 2 x 3 ' < q1 H < 2 Ž k 4 H . 2 Ž ²3 4: k 3)H x 2 q²2 4: k 2)H x 3
' ' .
q q
²2 3: k 2 H s34 Ž 1y x 2 . x 4 x 3 ²2 3:²3 4: k 2 H s234 Ž < q1 H y q2 H < 2 q s234 .
'
< q2 H < 2 Ž q1 H y k 2 H . k 2)H x 3
q
s2 a aX k 2 H s34 Ž 1y x 2 .
q
s4 b bX k 4 H x 2 ' 'x 3 '
q Ž q2 H q k 4 H . Ž Ž ²3 4: k 3)H q k 4 H w 3 4 x . x 2 q²2 4: k 2)H x 3 ' . 'x 4
²2 3: k 2 H s34 x 4
250 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
y q1)H k 2 H k 4 H Ž k 4 H q q2 H x 4 .
/ , Ž 6.15 .
½
s Ž ig .
sgS 2
Ý f c ds 2 c f c ds 3 c A g g Ž q1 ,ksns2 2 ,ksns3 3 ,q12 . 5
1 XX XXX
= ig f c d4 c
C g Ž q12 ,k 4n 4 ,q2 . , Ž 6.18 .
t 12
with q12 s q2 q k 4 .
In the triple collinear limit, k 2 s z 2 P, k 3 s z 3 P and k 4 s z 4 P, with z 2 q z 3 q z 4 s 1,
the coefficients of the NNLO Lipatov vertex Ž6.12. – Ž6.15. reduce to the splitting
functions Ž5.52., and amplitude Ž6.11. factorizes into a multi-Regge amplitude Ž4.11.
times a double-collinear factor Ž5.49.
lim A g g ™ 5 g Ž pan a ,k 1n 1 < k 2n 2 ,k 3n 3 ,k 4n 4 < p bnXbX , p bn b .
k2 < < k3< < k4
s Ý A . . . c . . . Ž . . . , P n , . . . . P Splityf ™
n
f1 f 2 f 3 f4
Ž k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 . . Ž 6.19 .
n
q A g ;4 g Ž pan ;k 3n ,k 1n ,k 2n ,k 4n . q A g ;4 g Ž pan ;k 1n ,k 3n ,k 4n ,k 2n .
a 3 1 2 4 a 1 3 4 2
q A g ;4 g Ž pan ;k 3n ,k 1n ,k 4n ,k 2n . q A g ;4 g Ž pan ;k 3n ,k 4n ,k 1n ,k 2n . s 0 .
a 3 1 4 2 a 3 4 1 2
Ž 6.23 .
We note, however that the last identity is not independent from the first two. Using the
identities Ž6.21. – Ž6.23. in Eq. Ž6.8., we can factorize the color structure on a leg
4 X X XX XX XXX
Ž ig . Ý f a ds 1c f c ds 2 c f c ds 3 c f c ds 4 c
A g ;4 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 ,ksns4 4 .
sgS 4
XXX
s igf acc ½ g3 Ý Ž Fd
sgS 3
s1 F ds 2 F ds 3 . c d 4 A g ;4 g Ž pan a ;ksns1 1 ,ksns2 2 ,ksns3 3 ,k 4n 4 . 5 Ž 6.24 .
thus amplitude Ž6.8. can be put in the form of Eq. Ž6.20. with collinear factor
Splityg ™
n
4g
Ž k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 .
sg3 Ý Ž Fd s1 F ds 2 F ds 3 . c d 4 splityg ™
n
4g
Ž ksns1 1 ,ksns2 2 ,ksns3 3 ,k 4n 4 . . Ž 6.25 .
sgS 3
Ž6.3. – Ž6.5., while the ones of non-PT type can be given in terms of three functions of
the collinear momenta,
splityg ™ 4 g Ž kq q q q
1 1
'
1 ,k 2 ,k 3 ,k 4 . s 2 2 ,
(
z 1 z 4 1 2 2 3:²3 4:
² :²
splitqg ™ 4 g Ž ky q q q
z 12 1
'
1 ,k 2 ,k 3 ,k 4 . s 2 2 ,
(z 1 z4
²1 2:²2 3:²3 4:
splitqg ™ 4 g Ž kq y q q
z 22 1
'
1 ,k 2 ,k 3 ,k 4 . s 2 2 ,
(z 1 z4
²1 2:²2 3:²3 4:
splitqg ™ 4 g Ž kq q y q
z 32 1
'
1 ,k 2 ,k 3 ,k 4 . s 2 2 ,
(z 1 z4
²1 2:²2 3:²3 4:
splitqg ™ 4 g Ž kq q q y
z 42 1
'
1 ,k 2 ,k 3 ,k 4 . s 2 2 ,
(z 1 z4
²1 2:²2 3:²3 4:
splityg ™ 4 g Ž ky q q q
1 ,k 2 ,k 3 ,k 4 . s yB
B1 Ž 4,3,2,1 . ,
splityg ™ 4 g Ž kq y q q
1 ,k 2 ,k 3 ,k 4 . s B1 Ž 4,3,1,2 . q B1 Ž 4,1,3,2 . q B1 Ž 1,4,3,2 . ,
splityg ™ 4 g Ž kq q y q
1 ,k 2 ,k 3 ,k 4 . s yB
B1 Ž 1,2,4,3 . y B1 Ž 1,4,2,3 . y B1 Ž 4,1,2,3 . ,
splityg ™ 4 g Ž kq q q y
1 ,k 2 ,k 3 ,k 4 . s B1 Ž 1,2,3,4 . ,
splityg ™ 4 g Ž ky y q q
1 ,k 2 ,k 3 ,k 4 . s yB
B2 Ž 4,3,2,1 . ,
splityg ™ 4 g Ž ky q y q
1 ,k 2 ,k 3 ,k 4 . s B3 Ž 1,2,3,4 . ,
splityg ™ 4 g Ž ky q q y
1 ,k 2 ,k 3 ,k 4 . s yB
B3 Ž 1,2,4,3 . q B2 Ž 3,2,4,1 . q B2 Ž 3,2,1,4 . ,
splityg ™ 4 g Ž kq y y q
1 ,k 2 ,k 3 ,k 4 . s yB
B2 Ž 1,4,2,3 . q B3 Ž 3,4,2,1 . y B2 Ž 4,1,2,3 . ,
splityg ™ 4 g Ž kq y q y
1 ,k 2 ,k 3 ,k 4 . s yB
B3 Ž 4,3,2,1 . ,
splityg ™ 4 g Ž kq q y y
1 ,k 2 ,k 3 ,k 4 . s B2 Ž 1,2,3,4 . Ž 6.26 .
with
2 '2 z 3 d Ž 1,2,3 . (z 2 z 3 z 4 ²1 2:
B1 Ž 1,2,3,4 . s y y
²1 2: ²2 3: s34 (z 1 Ž z 3 q z 4 . w1 2x Ž 1 y z1 . Ž z 3 q z 4 .
z3
q ( z1 z 4
Ž 1 y z 4 . e Ž 1,2,3,4 .
2
e Ž 1,2,3,4 . Ž ²2 4: w 1 2 x q ²3 4: w 1 3 x . w 3 4 x
y
s1234 s234
2
z 2 e Ž 1,2,3,4 . w 2 3 x
q ( z1 s234
(z 4 ž(
²1 2: w 2 3 x z 1 ²1 4: y Ž 2 y z 1 . e Ž 1,2,3,4 . /
q , Ž 6.27 .
Ž 1 y z1 . s234
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 253
2
d Ž 1,2,3 . e Ž 1,2,3,4 . w 3 4 x
y
(
s1234 z 4 s12 s123
d Ž 1,2,3 . ž (z 1 ( / (
d Ž 3,4,1 . q z 2 d Ž 3,4,2 . q z 1 z 2 z 4 s12 s34
q
(
s1234 z 4 ²1 2: w 3 4 x
(
z 1 z 2 d ) Ž 3,4,2 . z 1²3 4: w 1 2 x Ž ²1 3: w 2 3 x q ²1 4: w 2 4 x .
q y ,
Ž 1 y z1 . s234 s1234 s234
Ž 6.28 .
B3 Ž 1,2,3,4 .
'
2'2 z2 z2 z3
ž' z1 ²2 3: s34 'z 3
²1 2: s23
s
s12 s23 s34 'z 1 Ž z1 q z 2 . 1y z 4
y
z3 q z4 /
z2 '
z 2 Ž z 3 e Ž 1,2,4,3 . s23 y²2 3:²3 4:w 2 4 x .
y ( z1 z 4
Ž 'z 2 z 3 ²1 2:y z 4²1 3: . d Ž2,4,3. y Ž z1 q z 2 .
Ž 'z 2 ²1 2:d Ž2,4,3. 2q'z1 ²1 3: Ž z 2 ²1 2:²2 3:q'z 3 'z 4 ²1 3:²3 4:q z 4²1 4:²3 4: . w2 4x 2 .
y
'z 1 s1234
s34
q 'z 4
²1 3:2 w 2 4 x Ž e Ž 1,2,4,3 . w 2 3 x y e Ž 2,3,4,1 . w 1 2 x .
s1234 s123
z 2 ²1 3:2 s34
y ( z1 s123
Ž y'z 2 e ) Ž2,3,4,1. q'z1 z 4 w2 4x . Ž 6.29 .
(
e Ž 1,2,3,4 . s z 1 ²1 4: q z 2 ²2 4: q z 3 ²3 4: . ( ( Ž 6.30 .
As in Section 5.6, summing over the helicities of gluons 1, 2, 3 and 4, one can obtain
the two-dimensional polarization matrix,
Splityg ™ Ž k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 . Splityg ™
)
Ý n
4g
n
4g
Ž k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 .
n 1n 2 n 3 n4
Plrg ™ 4 g ,
X 8 g6
s d cc 3 Ž 6.31 .
s1234
254 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
<Splityg ™ ²P g™4g: ,
1 8 g6
Ý n
4g
Ž k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 . < 2 s 3 Ž 6.32 .
2 Ž Nc2 y 1 . nn 1n 2 n 3 n4 s1234
with ² P g ™ 4 g : s Pqq
g™4g
. As in Section 5.6, the sum over colors can be done using Eq.
Ž2.5., and we obtain
<Splityg ™
n
4g
Ž k 1n 1 ,k 2n 2 ,k 3n 3 ,k 4n 4 . < 2
s 4C
C5 Ž Nc . Ý <splityg ™
n
4g
Ž ksns 1 ,ksns 2 ,ksns 3 ,ksns4 . < 2 , Ž 6.33 .
1 2 3 4
sgS 4
with C5 Ž Nc . as in Eq. Ž2.8.. It is then clear that for the splitting functions P g ™ n g , with
n ) 4, the color will not factorize since LCA, Eq. Ž2.5., is not exact any more. We do
g™4g g™4g
not compute here Pqq and Pqy , all the information about them being already
contained in Eqs. Ž6.26. – Ž6.30..
7. Conclusions
In this paper, the structure of QCD amplitudes in the high-energy limit and in the
collinear limit has been explored beyond NLO. We have computed forward clusters of
three partons and four gluons, which in the BFKL theory constitute the tree parts of
NNLO and NNNLO impact factors for jet production. In the BFKL theory the NNLO
impact factors could be used to compute jet rates at NNLL accuracy. In Sections 5.1,
5.2, 5.3 and 5.4, we have computed the tree parts of the NNLO impact factors for all the
parton flavors. On these we have performed in Section 5.5 a set of consistency checks in
the high-energy limit, and we have obtained in the triple collinear limit ŽSection 5.6. the
polarized, the spin-correlated and the unpolarized double-splitting functions. The last
two agree with previous calculations by Catani–Grazzini and Campbell–Glover, respec-
tively. They can be used to set up general algorithms to compute jet rates at NNLO.
From the four-gluon forward cluster we have obtained in Section 6.1 the tree part of
the purely gluonic NNNLO impact factor. In the quadruple collinear limit, this yields
ŽSection 6.4. the purely gluonic unpolarized triple-splitting functions. They could be
used to compute the three-loop Altarelli–Parisi evolution, or to compute jet rates at
NNNLO. In addition, by separating a central cluster of three gluons out of the
four-gluon forward cluster, we have computed the emission of three gluons along the
ladder, Eqs. Ž6.11. – Ž6.15., which contributes to the NNLO Lipatov vertex. This
constitutes one of the universal building blocks in an eventual construction of a BFKL
resummation at NNLL accuracy.
Finally, inspired by the color structure in the high-energy limit, we have found a
compact color decomposition of the tree multigluon amplitudes in terms of the linearly
independent subamplitudes only, Eq. Ž2.9.. It would be interesting to analyse whether
this structure generalizes to multigluon amplitudes at one loop, and beyond.
The decomposition in rapidity of amplitudes in terms of gauge-invariant parton
clusters performed in this work suggests naturally a modular decomposition of a generic
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 255
Acknowledgements
We would like to thank Stefano Catani, Walter Giele, David Kosower and Zoltan
Trocsanyi for discussions. We are particularly grateful to Lance Dixon for his valuable
insight.
p b s Ž py y y
b r2,0,0,y p b r2 . ' Ž 0, p b ;0,0 . ,
pi s Ž Ž pq y q y
i q pi . r2,Re w pi H x ,Im w pi H x , Ž pi y pi . r2 .
' Ž < pi H < e y i , < pi H < eyy i ; < pi H <cos f i , < pi H <sin f i . , Ž A.1 .
where y is the rapidity. The first notation in Eq. ŽA.1. is the standard representation
p m s Ž p 0 , p x , p y , p z ., while in the second we have the q and - components on the left
of the semicolon, and on the right the transverse components. In the following, if not
differently stated, pi and pj are always understood for 1 ( i, j ( n. From momentum
conservation,
n
0s Ý pi H ,
is1
n
pq
a sy Ý pqi ,
is1
n
py
b sy Ý pyi , Ž A.2 .
is1
5
By convention we consider the scattering in the unphysical region where all momenta are taken as
outgoing, and then we analytically continue to the physical region where pa0 - 0 and p b0 - 0. Thus partons are
ingoing or outgoing depending on the sign of their energy. Since the helicity of a positive-energy Žnegative-en-
ergy. massless spinor has the same Žopposite. sign as its chirality, the helicities assigned to the partons depend
on whether they are incoming or outgoing. Our convention is to label outgoing Žpositive-energy. particles with
their helicity; so if they are incoming the actual helicity and charge is reversed.
256 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
so that
n
s s 2 pa P p b s Ý pq y
i pj ,
i , js1
n
s ai s 2 pa P pi s y Ý pyi pqj ,
js1
n
s b i s 2 p b P pi s y Ý pqi pyj . Ž A.3 .
js1
Massless Dirac spinors c " Ž p . of fixed helicity are defined by the projection
1 " g5
c "Ž p. s c Ž p. , Ž A.4 .
2
with the shorthand notation
c " Ž p . s < p " :, c "Ž p. s² p "< ,
² pk : s ² p y < k q : s cy Ž p . cq Ž k . ,
w pk x s ² p q < k y : s cq Ž p . cy Ž k . . Ž A.5 .
Using the chiral representation of the g-matrices,
0 I 0 ys i
g 0s ž I 0/, g is
ž si 0
,
/ Ž A.6 .
and the normalization condition
² p " < gm < p " : s 2 pm , Ž A.7 .
if 6
and the complex notation p H s < p H < e , the spinors for the momenta ŽA.1. are
(p q
i
0
0
cq Ž pi . s
0
(p
(y p
y
i
0
0
e if i ,
q
a
cy Ž pi . s
0
( y yi f i ,
pi e
y pq
0
i(
0
cq Ž pa . s i
0 0
0
0
0
, cy Ž pa . s i
0
0
y y pq
(
a
0
,
0
cq Ž p b . s yi
0 ( y py
b
0
0
,
0
cy Ž p b . s yi
y py
( b
0
. Ž A.8 .
6
The spinors of the incoming partons must be continued to negative energy after the complex conjugation.
'
For instance, cq Ž pa . s iŽ y pq
a ,0,0,0 .
.
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 257
Using the above spinor representation, the spinor products for the momenta ŽA.1. are
pq pq
² pi p j : s pi H ( j
pq
i
y pj H ) i
pq
j
,
ypq
² pa pi : s yi ( a
pq
i
pi H ,
² pi p b : s i y py
( q
b pi ,
² pa p b : s y'sˆ , Ž A.9 .
2
where we have used the mass-shell condition < pi H < s pq y
i pi .
Note that in the present
convention the spinors ŽA.8. and the spinor products ŽA.9. differ by phases with respect
to the same in Ref. w74x.
We consider also the spinor products ² pi q < g P p k < pj q :, which in the spinor
representation ŽA.8. take the form
² pi q < g P p k < p j q :
1
s Ž pqi pqj pyk y pqi pj H pk)H y pi)H pqj pk H qpi)H pj H pqk . , ;k ,
(pq q
i pj
ypq
² pi q < g P p j < p a q : s i ( a
pq
i
Ž pqi pyj y pi)H pj H . , ; j,
ypy
² pi q < g P pj < p b q : s yi ( b
pq
i
Ž ypqi pj)H q pi)H pqj . , ; j. Ž A.10 .
Throughout the paper the following representation for the gluon polarization is used:
² p " < gm < k " :
em" Ž p,k . s "
'2 ² k . < p " : , Ž A.13 .
In the multi-Regge kinematics, we require that the gluons are strongly ordered in
rapidity and have comparable transverse momentum,
y 1 4 . . . 4 yn ; < p 1 H < , . . . , < pn H < . Ž B.1 .
Momentum conservation ŽA.2. then becomes
n
0s Ý pi H ,
is1
pq q
a , yp 1 ,
py y
b , ypn . Ž B.2 .
The Mandelstam invariants ŽA.3. are reduced to,
s s 2 pa P p b , pq y
1 pn ,
s ai s 2 pa P pi , ypq y
1 pi ,
sb i s 2 p b P pi , ypq y
i pn ,
pq
j
pj H for yi ) yj ,
pq
² pa pi : , yi ( a
pq
i
pi H ,
² pi p b : , i pq( y
i pn ,
² pa p b : , y pq y
(
1 pn . Ž B.4 .
V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262 259
pq q q
a , y p1 q p 2
Ž .,
py y
b , ypn . Ž C.2 .
The spinor products ŽA.9. become
² pa p b : s y's , y (Ž p q q
1 q p2 . py
n ,
ypq pn H
² pa pn : s yi ( a
pq
n
pnH , i
< pn H <
² pa p b : ,
ypq pq q
1 q p2
² pa p k : s yi ( a
pq
k
p k H , yi ( pq
k
pk H , k s 1, . . . ,n y 1,
² p k p b : s i y py
( q q y
(
b p k , i p k pn , k s 1, . . . ,n y 1,
² pn p : s i(y p
b
y q
b pn , i < pn H < ,
pq pq pq
² p k pn : s p k H ( n
pq
k
y pn H ( k
pq
n
, ypn H ( k
pq
n
, k s 1, . . . ,n y 1,
pq pq
² p1 p 2 : s p1 H ( 2
pq
1
y p2 H ( 1
pq
2
,
pq pq pq
² p k pi : s p k H ( i
pq
k
y pi H ( k
pq
i
, ypi H ( k
pq
i
, k s 1,2;i s 3, . . . ,n y 1 .
Ž C.3 .
which differ by phases with respect to the same spinor products in Ref. w39x because of
the convention for the spinor representation we use in Section A.
pq q q
Ž q
a , y p1 q p 2 q p 3 .,
py y
b , ypn . Ž D.2 .
The spinor products ŽA.9. become
² pa p b : s y's , y (Ž p q q q
1 q p2 q p3 . py
n ,
ypq pn H
² pa pn : s yi ( a
pq
n
pnH , i
< pn H <
² pa p b : ,
ypq pq q q
1 q p2 q p3
² pa p k : s yi ( a
pq
k
p k H , yi ( pq
k
pk H , k s 1, . . . ,n y 1,
² p k p b : s i y py
( q q y
b p k , i p k pn , ( k s 1, . . . ,n y 1,
² pn p : s i(y p
b
y q
b pn , i < pn H < ,
pq pq pq
² p k pn : s p k H ( n
pq
k
y pn H ( k
pq
n
, ypn H ( k
pq
n
, k s 1, . . . ,n y 1,
pq pq pq
² p k pi : s p k H ( i
pq
k
y pi H ( k
pq
i
, ypi H ( k
pq
i
,
k s 1,2,3; i s 4, . . . ,n y 1 , Ž D.3 .
while the others spinor products remain unchanged. The spinor products ŽD.3. general-
ize straightforwardly to the kinematics Ž6.1..
For the off-diagonal terms, P g ™ g f 2 f 3 , we find a relative minus sign between the
complex conjugation.
results of Ref. w21x and ours, which, however, has no physical relevance.
References
w1x W.T. Giele, E.W.N. Glover, Phys. Rev. D 46 Ž1992. 1980.
w2x W.T. Giele, E.W.N. Glover, D.A. Kosower, Nucl. Phys. B 403 Ž1993. 633. hep-phr9302225.
w3x S. Keller, E. Laenen, Phys. Rev. D 59 Ž1999. 114004. hep-phr9812415.
w4x Z. Kunszt, D.E. Soper, Phys. Rev. D 46 Ž1992. 192.
w5x S. Frixione, Z. Kunszt, A. Signer, Nucl. Phys. B 467 Ž1996. 399. hep-phr9512328.
w6x Z. Nagi, Z. Trocsanyi, Nucl. Phys. B 486 Ž1997. 189. hep-phr9610498.
w7x S. Frixione, Nucl. Phys. B 507 Ž1997. 295. hep-phr9706545.
w8x S. Catani, M.H. Seymour, Phys. Lett. B 378 Ž1996. 287. hep-phr9602277.
w9x S. Catani, M.H. Seymour, Nucl. Phys. B 485 Ž1997. 291. hep-phr9605323.
w10x S. Catani, M.H. Seymour, Nucl. Phys. B 510 Ž1998. 503.
w11x D.A. Kosower, Phys. Rev. D 57 Ž1998. 5410. hep-phr9710213.
w12x V.N. Gribov, L.N. Lipatov, Yad. Fiz. 15 Ž1972. 781, Sov. J. Nucl. Phys. 46 Ž1972. 438.
w13x L.N. Lipatov, Yad. Fiz. 20 Ž1974. 181, Sov. J. Nucl. Phys. 20 Ž1975. 95.
w14x G. Altarelli, G. Parisi, Nucl. Phys. 126 Ž1977. 298.
w15x Yu.L. Dokshitzer, Zh. Eksp. Teor. Fiz. 73 Ž1977. 1216, Sov. Phys. JETP 46 Ž1977. 641.
w16x D.R. Yennie, S.C. Frautschi, H. Suura, Ann. Phys. 13 Ž1961. 379.
w17x A. Bassetto, M. Ciafaloni, G. Marchesini, Phys. Rep. 100 Ž1983. 201.
w18x F.A. Berends, W.A. Giele, Nucl. Phys. B 313 Ž1989. 595.
w19x Z. Kunszt, A. Signer, Z. Trocsanyi, Nucl. Phys. B 420 Ž1994. 550. hep-phr9401294.
w20x J.M. Campbell, E.W.N. Glover, Nucl. Phys. B 527 Ž1998. 264. hep-phr9710255.
w21x S. Catani, M. Grazzini, Phys. Lett. B 446 Ž1999. 143. hep-phr9810389.
w22x S. Catani, M. Grazzini, preprint, hep-phr9908523.
w23x Z. Bern, G. Chalmers, Nucl. Phys. B 447 Ž1995. 465. hep-phr9503236.
w24x Z. Bern, V. Del Duca, C.R. Schmidt, Phys. Lett. B 445 Ž1998. 168. hep-phr9810409.
w25x Z. Bern, V. Del Duca, W.B. Kilgore, C.R. Schmidt, Phys. Rev. D 60 Ž1999. 116001. hep-phr9903516.
w26x D.A. Kosower, Nucl. Phys. B 552 Ž1999. 319. hep-phr9901201.
w27x D.A. Kosower, P. Uwer, preprint, hep-phr9903515.
w28x S. Catani, Phys. Lett. B 427 Ž1998. 161. hep-phr9802439.
w29x V.S. Fadin, E.A. Kuraev, L.N. Lipatov, Zh. Eksp. Teor. Fiz. 71 Ž1976. 840, Sov. Phys. JETP 44 Ž1976.
443.
262 V. Del Duca et al.r Nuclear Physics B 568 (2000) 211–262
w30x V.S. Fadin, E.A. Kuraev, L.N. Lipatov, Zh. Eksp. Teor. Fiz. 72 Ž1977. 377, Sov. Phys. JETP 45 Ž1977.
199.
w31x Ya.Ya. Balitsky, L.N. Lipatov, Yad. Fiz. 28 Ž1978. 1597, Sov. J. Nucl. Phys. 28 Ž1978. 822.
w32x L.N. Lipatov, Yad. Fiz. 23 Ž1976. 642, Sov. J. Nucl. Phys. 23 Ž1976. 338.
w33x L.N. Lipatov, Nucl. Phys. B 365 Ž1991. 614.
w34x V.S. Fadin, L.N. Lipatov, Phys. Lett. B 429 Ž1998. 127. hep-phr9802290.
w35x G. Camici, M. Ciafaloni, Phys. Lett. B 412 Ž1997. 396. hep-phr9707390.
w36x G. Camici, M. Ciafaloni, Phys. Lett. B 417 Ž1998. 390, Erratum.
w37x G. Camici, M. Ciafaloni, Phys. Lett. B 430 Ž1998. 349. hep-phr9803389.
w38x V.S. Fadin, L.N. Lipatov, Yad. Fiz. 50 Ž1989. 1141, Sov. J. Nucl. Phys. 50 Ž1989. 712.
w39x V. Del Duca, Phys. Rev. D 54 Ž1996. 989. hep-phr9601211.
w40x V.S. Fadin, L.N. Lipatov, Nucl. Phys. B 477 Ž1996. 767. hep-phr9602287.
w41x V. Del Duca, Phys. Rev. D 54 Ž1996. 4474. hep-phr9604250.
w42x V.S. Fadin, L.N. Lipatov, Nucl. Phys. B 406 Ž1993. 259.
w43x V.S. Fadin, R. Fiore, A. Quartarolo, Phys. Rev. D 50 Ž1994. 5893. hep-thr9405127.
w44x V.S. Fadin, R. Fiore, M.I. Kotsky, Phys. Lett. B 389 Ž1996. 737. hep-phr9608229.
w45x V.S. Fadin, Proc. LAFEX International School on High-Energy Physics ŽLISHEP 98., Rio de Janeiro,
Brazil, hep-phr9807528.
w46x V. Del Duca, C.R. Schmidt, Phys. Rev. D 59 Ž1999. 074004. hep-phr9810215.
w47x V.S. Fadin, R. Fiore, M.I. Kotsky, Phys. Lett. B 359 Ž1995. 181.
w48x V.S. Fadin, R. Fiore, M.I. Kotsky, Phys. Lett. B 387 Ž1996. 593. hep-phr9605357.
w49x V.S. Fadin, R. Fiore, A. Quartarolo, Phys. Rev. D 53 Ž1996. 2729. hep-phr9506432.
w50x J. Blumlein,
¨ V. Ravindran, W.L. van Neerven, Phys. Rev. D 58 Ž1998. 091502. hep-phr9806357.
w51x V. Del Duca, C.R. Schmidt, Phys. Rev. D 57 Ž1998. 4069. hep-phr9711309.
w52x M. Ciafaloni, Phys. Lett. B 429 Ž1998. 363. hep-phr9801322.
w53x M. Ciafaloni, D. Colferai, Nucl. Phys. B 538 Ž1999. 187. hep-phr9806350.
w54x V. Del Duca, Proc. of Les Rencontres de Physique de la Vallee d’Aoste, La Thuile, ed. M. Greco ŽINFN
Press, Italy, 1996. hep-phr9605404.
w55x V. Del Duca, Proc. LAFEX International School on High-Energy Physics ŽLISHEP98., Rio de Janeiro,
Brazil, hep-phr9902211.
w56x Z. Bern, D.A. Kosower, Nucl. Phys. B 379 Ž1992. 451.
w57x W. Siegel, Phys. Lett. B 84 Ž1979. 193.
w58x D.M. Capper, D.R.T. Jones, P. van Nieuwenhuizen, Nucl. Phys. B 167 Ž1980. 479.
w59x L.N. Lipatov, Nucl. Phys. B 452 Ž1995. 369.
w60x P. Uwer, talk at the Europhysics Conference on High Energy Physics ŽEPS-HEP99., Tampere, Finland.
w61x M. Mangano, S.J. Parke, Phys. Rep. 200 Ž1991. 301.
w62x J.E. Paton, H.-M. Chan, Nucl. Phys. B 10 Ž1969. 516.
w63x P. Cvitanovic, P.G. Lauwers, P.N. Scharbach, Nucl. Phys. B 186 Ž1981. 165.
w64x D.A. Kosower, B.-H. Lee, V.P. Nair, Phys. Lett. B 201 Ž1988. 85.
w65x M. Mangano, Nucl. Phys. B 309 Ž1988. 461.
w66x D. Zeppenfeld, Int. J. Mod. Phys. A 3 Ž1988. 2175.
w67x Z. Bern, D.A. Kosower, Nucl. Phys. B 362 Ž1991. 389.
w68x F.A. Berends, W.T. Giele, Nucl. Phys. B 294 Ž1987. 700.
w69x M. Mangano, S.J. Parke, Z. Xu, Nucl. Phys. B 298 Ž1988. 653.
w70x F.A. Berends, W.T. Giele, Nucl. Phys. B 306 Ž1988. 759.
w71x R. Kleiss, H. Kuijf, Nucl. Phys. B 312 Ž1989. 616.
w72x Z. Bern, L. Dixon, D.C. Dunbar, D.A. Kosower, Nucl. Phys. B 425 Ž1994. 217. hep-phr9403226.
w73x V. Del Duca, Phys. Rev. D 48 Ž1993. 5133.
w74x V. Del Duca, Phys. Rev. D 52 Ž1995. 1527. hep-phr9503340.
w75x F.A. Berends, W.T. Giele, H. Kuijf, Nucl. Phys. B 333 Ž1990. 120.
w76x B.L. Combridge, C.J. Maxwell, Nucl. Phys. B 239 Ž1984. 429.
w77x S. Catani, M.H. Seymour, Z. Trocsanyi, Phys. Rev. D 55 Ž1997. 6819. hep-phr9610553.
w78x H. Kuijf, Ph.D. Thesis Žunpublished..
w79x Z. Kunszt, W.J. Stirling, Phys. Rev. D 37 Ž1988. 2439.
w80x C.J. Maxwell, Phys. Lett. B 192 Ž1987. 190.
Nuclear Physics B 568 Ž2000. 263–286
www.elsevier.nlrlocaternpe
Abstract
1. Introduction
More than thirty years after the pioneering experiments at SLAC w1–3x, structure
functions in deep-inelastic lepton–hadron scattering ŽDIS. remain among the most
important probes of perturbative QCD and of the partonic structure of hadrons. Indeed,
experiments have proceeded towards very high accuracy and a greatly extended kine-
matic coverage during the past two decades w4x. Moreover, the forthcoming luminosity
upgrade of the electron–proton collider HERA at DESY will allow for accurate
measurements up to very high resolution scales Q 2 , 10 4 GeV 2 , thus considerably
increasing the lever arm for precise determination of the scaling violations, i.e. the
Q 2-dependence, of the structure functions. An accurate knowledge of the parton
densities will also be indispensable for interpreting many results at the future Large
Hadron Collider at CERN.
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 6 8 - 9
264 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
We set up our notations by recalling the NNLO evolution equations for non-singlet
parton densities and structure functions. The number distributions of quarks and
antiquarks in a hadron are denoted by qi Ž x, m2f , m2r . and qi Ž x, m2f , m 2r ., respectively,
where x represents the fraction of the hadron’s momentum carried by the parton. m r and
m f stand for the renormalization and mass-factorization scales, and the subscript i
indicates the flavour of the Žanti-.quark, with i s 1, . . . , Nf for Nf flavours of effectively
massless quarks.
The scale dependence of non-singlet combinations of these quark densities is
governed by the Žanti-.quark Žanti-.quark splitting functions. Suppressing the depen-
dence on x, m r and m f for the moment, the general structure of these functions,
constrained by charge conjugation invariance and flavour symmetry, is given by
Pq i q k s Pq i q k s d i k PqVq q PqSq ,
evolves with
V
P NS s PqVq y PqVq q Nf PqSq y PqSq .
ž / Ž 2.5 .
V
The first moments of Py
NS and P NS vanish,
1 Ny 1
1 Ny1 V
H0 dx x Py
NS s H0 dx x P NS s 0 for N s 1, Ž 2.6 .
V
since the first moments of qy NS and q NS reflect conserved additive quantum numbers.
The difference PqSq y PqSq is unknown except for the first moment, which vanishes by
virtue of Eqs. Ž2.3., Ž2.5. and Ž2.6.. However, the size of the two-loop contributions to
266 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
PqVq and PqSq relative to the corresponding term of PqVq suggests that this difference is
negligibly small at moderate and large x. Hence we shall use the approximation
V
P NS s Py
NS Ž 2.7 .
V
for the rest of this article, i.e. we henceforth treat q NS as a ‘–’-quantity.
Restoring the dependence on the fractional momentum x and the renormalization and
mass-factorization scales m r and m f , our evolution equations thus read
d m2f
dln m2f
"
q NS "
Ž x , m2f , m2r . s P NS ž
a s Ž m2r . ,
m2r / "
m q NS Ž m2f , m2r . Ž x . . Ž 2.8 .
m2f
"
P NS
ž
x , a s Ž m2r . ,
m2r /
m2f
s a s Ž m2r . P NS
Ž0. Ž1."
Ž x . q a2s Ž m2r . PNS
ž Ž 0.
Ž x . y b 0 PNS Ž x . ln
m2r /
m2f
ž
Ž 2."
qa3s Ž m2r . PNS Ž 0.
Ž x . y b 1 PNS Ž1."
Ž x . q 2 b 0 PNS Ž x . 4 ln
m2r
m2f
q b 02 PNS
Ž 0.
Ž x . ln2
m2r / q... Ž 2.10 .
b 1 s 102 y 383 Nf ,
"
The non-singlet structure functions Fa,NS, a s 1,2,3, are in Bjorken-x space obtained
by convoluting the solution of Eq. Ž2.8. with the corresponding coefficient functions:
Q 2 m2f
ha Fa",NS Ž x ,Q 2 . s Ca",NS a s Ž m2r . ,
ž ,
m2f m2r
"
m q NS
/Ž m2f , m2r . Ž x . Ž 2.13 .
Ž1." Ž 0. 1.
Q2 Q2
q P NS Ž x. q P NS m c aŽ,NS Ž x . 4 ln y b 0 c Ž1.
a ,NS Ž x . ln
m2f m2r
Q2
q 12 PNS
Ž 0. Ž 0.
m PNS Ž 0.
Ž x . y b 0 PNS Ž x . 4 ln2
m2f
Q2 m2f
yb 0 PNS
Ž0.
Ž x . ln
mf 2
ln
m2r / q...
"
Ž 2.14 .
Here an overall electroweak charge factor has been absorbed into q NS . The first-order
coefficients c a,NS x can be found in Ref. w5x; the two-loop quantities c Ž2."
Ž1. Ž .
a,NS x
Ž .
computed in Refs. w8–11x are discussed in Section 3.
It is often convenient, especially in the non-singlet sector considered here, to express
the scaling violations of the structure functions in terms of these structure functions
"
themselves. The expansion coefficients of the corresponding kernels Ka,NS in
d Q2
dlnQ 2
ha Fa",NS Ž x ,Q . s Ka",NS a s Ž m2r . ,
2
ž m2r / m ha Fa",NS Ž Q 2 . Ž x . Ž 2.15 .
1. 1. 2 Ž0. 2
Q2
qb 0 c aŽ,NS m c aŽ,NS Ž x . y b 1 c Ž1.
a ,NS Ž x . q b 0 P NS Ž x . ln
m2r
Q2
y Ž1."
2 b 0 P NS Ž x. 1.
y b 0 c aŽ,NS Ž x.4 Ž 0.
q b 1 PNS Ž x . 4 ln
m2r / q... Ž 2.16 .
268 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
This approach removes the dependence of the finite-order predictions on the factoriza-
tion scheme and the scale m f , thus allowing for an easier control of the theoretical
uncertainties.
with
L1 ' ln Ž1yx . , L0 ' ln x.
Ž2.y Ž .
For c 2, NS x ,
relevant for the charged-current case, the second and third line of this
expression have to be replaced by
y17.19 L31 q 71.08 L21 y 663.0 L1 q L1 L0 Ž y192.4 L1 q 80.41 L0 .
y3.748 L30 y 19.56 L20 y 1.235 L0 y 1010 x y 84.18 y 337.994 d Ž 1 y x . . Ž 3.3 .
The corresponding parametrizations for FL read
2 2
c LŽ2.q
,NS Ž x . s 13.62 L1 y 55.79 L1 y 150.5 L1 L 0 q Ž 26.56 x y 0.031 . L 0
Only partial results are presently available for the O Ž a s3 . terms P Ž2. Ž x . of the
splitting functions. In the non-singlet sector, the current information comprises
™ ™
1
for the Nf-independent Ž i s 0. and NF Ž i s 1. terms in Eq. Ž4.1.. Here f 1 and f 0
represent contributions which, while being integrable, peak at x 1 and x 0,
respectively. f m stands for a part with a rather flat x-dependence. As for the illustrations
in Refs. w14–16x, these contributions are build up of powers of lnŽ1 y x ., x, and ln x.
Finally fas allows to account for known leading small-x terms. Equating the second to
tenth even moments of Eq. Ž4.2. to the results of Refs. w14–16x yields five linear
equations which can be solved for the coefficients Aq Ž2.y
i, j . The case of P NS is treated
afterwards by taking over A i,1 and A i,2 from the ‘q’-combinations, as already indicated
in Eq. Ž4.2., and adjusting the remaining coefficients as discussed below.
Ž2." Ž .
Before addressing PNS x we demonstrate our procedure by applying it to a known
Ž1.q Ž . w
result, the Nf s 0 part P0Ž1.q of the NLO splitting function P NS x 42x. In this case
2
the leading small-x contributions are ln x and ln x, while the integrable terms most
peaked at large-x read x 2 and lnŽ1 y x .. Disregarding small-x constraints in this
example, we thus choose
f 1Ž x . s x 2 or ln Ž 1 y x . ,
fm Ž x . s 1 or x ,
f 0 Ž x . s ln x or ln2 x ,
fas Ž x . ' 0. Ž 4.3 .
W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286 271
The resulting eight approximations are compared to the exact result in Fig. 1 for x - 1.
The latter curve runs inside the uncertainty band over the full x-range. The moments
tightly constrain P0Ž1.q Ž x . for x R 0.15, the total spread in our approach being about 5%
at x , 0.3. The coefficients of the common leading large-x terms are
AŽ1.
0,1 s 65.13 . . . 68.74 Ž exact 66.47 .
AŽ1.
0,2 s 61.85 . . . 79.64 Ž exact 69.00 . , Ž 4.4 .
and the first moments read
P0Ž1.q Ž N s 1 . s y2.404 . . . 0.400 Ž exact y1.127 . . Ž 4.5 .
‘Unreasonable’ combinations in the sense of Eq. Ž4.2., like lnŽ1 y x ., x 2 , and 1 Ži.e. no
f 0 . or 1, ln x, and ln2 x Ži.e. f 1 missing., can lead to considerably worse approximations.
Now we turn to P0Ž2.q Ž x .. The additional loop or emission may, besides adding two
powers of ln x, lead to two additional large-x logarithms with respect to P0Ž1.q Ž x . Žthe
transition from one-loop to two-loop yields however only a term ln1 Ž1 y x . .. Hence we
put
f 1 Ž x . s x 2 or ln Ž 1 y x . or ln2 Ž 1 y x . or ln3 Ž 1 y x . ,
f m Ž x . s 1 or x ,
f 0 Ž x . s ln x or ln2 x ,
fas Ž x . s 23 CF3 Ž ln4 x q lln3 x . . Ž 4.6 .
Besides l s 0 we also include l s y4 and l s 8 for f 0 s ln x. Subleading small-x
Ž0.q Ž1.q
terms of this order of magnitude are suggested by the expansion of P NS and P NS in
moment-space around N s 0 w44x. Thus we consider 32 combinations, 8 of which are
rejected as they fail to fulfill the further ad hoc, but mild constraint
100 P0Ž1.q Ž N s 1 . ( P0Ž2.q Ž N s 1 . ( y40 P0Ž1.q Ž N s 1 . Ž 4.7 .
on the perturbative expansion of the first moment. The x - 1 behaviour of the remaining
24 function is displayed in Fig. 2; their 1rŽ1 y x .q coefficients span the range
1138 ( AŽ2. 0,1 ( 1625 Ž 1347 . . Ž 4.8 .
The bracketed number applies if combinations with f 1Ž x . s ln3 Ž1 y x . are disregarded.
Ž1.q
Fig. 1. Approximations for the N f -independent part of PNS , derived from the lowest even-integer moments
by means of Eqs. 4.2 and 4.3 , compared to the exact result.
Ž . Ž .
272 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
Ž2.q
Fig. 2. Approximations for the N f -independent part of P NS , denoted by P0Ž2.q in Eq. Ž4.1., as derived from
the five lowest even-integer moments by means of Eqs. Ž4.2., Ž4.6. and Ž4.7.. The full lines represent those
functions selected for further consideration.
Due to the larger function pool of Eq. Ž4.6., the large-x uncertainty band of Fig. 2 is
some factor of three wider than that for P0Ž1.q Ž x . in Fig. 1, reaching a total spread of
about 15% at x , 0.3. Moreover P0Ž2.q Ž x . is rather unconstrained at small x Q 10y2 by
present information, as the leading small-x term w33x does not dominate over less
singular contributions at practically relevant values of x. However, physical quantities
are only affected by the splitting functions via convolutions with smooth non-perturba-
tive initial distributions which ‘wash out’ the oscillating large-x differences of Fig. 2 to
a large extent. Furthermore the convolutions receive important contributions from the
Žwell-constrained. large-x region of P NS Ž x . even at very small x. The above ‘bare’
uncertainty is thus considerably reduced over the full x-range. This effect is illustrated
in Fig. 3, where four representative approximate results for P0Ž2.q are convoluted with a
simple, but typical input shape. The total spread after this convolution is as small as
0.3% for 0.2 Q x Q 0.9, and becomes large only at x Q 0.02.
Fig. 3. The convolution of the approximations ‘A’ – ‘D’ of P0Ž2.q selected in Fig. 2 with a shape typical of
hadronic non-singlet initial distributions.
W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286 273
The uncertainty band of Fig. 3 is rather completely covered by the results ‘A’ and
‘B’. Hence our final estimates for P0Ž2.q Ž x . and its remaining uncertainty are given by
1
P0,Ž2.q
A Ž x . s 1347.207 q 2283.011 d Ž 1 y x . y 722.137 ln2 Ž 1 y x .
Ž1yx .q
y 1236.264 y 332.254 ln x q 1.580 Ž ln4 x y 4 ln3 x . , Ž 4.9 .
1
P0,Ž2.q
B Ž x . s 1137.897 q 1099.754d Ž 1 y x . y 2975.371 x 2
Ž1yx .q
y 125.243 y 64.105 ln2 x q 1.580 ln4 x. Ž 4.10 .
1
The average 2
w Ž2.q Ž . Ž2.q Ž . x
P0, A x q P0, B x represents our central result.
The Nf1-term P1Ž2.q is the leading radiative correction to P1Ž1.q Ž x ., which is in turn
only slightly more complicated than the one-loop non-singlet splitting function w42x.
Hence it is natural to adopt here the ansatz Ž4.3. employed for the two-loop Nf0-piece in
our above illustration. The resulting eight approximations for P1Ž2.q Ž x . are displayed for
x - 1 in the left part of Fig. 4 Ždashed curves.. Their spread at large x is similar to that
obtained for P0Ž1.q in Fig. 1. The leading large-x coefficients fall into the range
y190 ( AŽ2. 1,1 ( y180. Ž 4.11 .
Ž2.q Ž .
The uncertainty of the complete result for P NS x is dominated by the spread of the
above Nf-independent contribution, as estimated by the difference between Eqs. Ž4.9.
and Ž4.10.. This is also true at small x, despite the fact that the band in Fig. 4 is
presumably an underestimate in this region, as a possible term ; ln3 x has been
disregarded. Hence it is sufficient, at the present stage, to keep only the Nf0-contribution
Ž2.q
to the error band of P NS and to employ just one representative for P1Ž2.q. Our choice,
an average of two typical results with and without a lnŽ1 y x . term, reads
1
P1Ž2.q Ž x . s y184.4098 y 180.6971 d Ž 1 y x . y 98.5885 ln Ž 1 y x .
Ž1yx .q
q 205.7690 x 2 q 6.1618 q5.0439 ln2 x Ž 4.12 .
and is also shown in the left part of Fig. 4 Žsolid curve..
As mentioned before the Nf2-piece in Eq. Ž4.1. is exactly known from Ref. w30x. After
transformation to x-space, this contribution reads
1 64
P2Ž2. Ž x . s
81 ž y
Ž1yx .q
y 204 q 192 z Ž 3 . y 320 z Ž 2 . d Ž 1 y x . q 64
x ln x
q
1yx /
Ž 96 ln x q 320. q Ž 1 y x . Ž 48 ln2 x q 352 ln x q 384. ,
Ž 4.13 .
where z Ž l . denotes Riemann’s z-function.
Ž2.y Ž .
Finally we consider PNS x . Here our treatment is inevitably more approximate.
According to the expectations given at the beginning of this section, we take over the
1rŽ1 y x .q and d Ž1 y x . terms of the ‘q’-combinations in Eqs. Ž4.9., Ž4.10., and
Ž4.12.. The remaining coefficients are Žafter inserting the appropriate Nf0 leading
small-x piece w33x. determined by the first, eighth and tenth moments of Refs. w14–16x,
Ž2.q Ž .
assuming that the difference to PNS x is negligible for the latter two, entirely large-x
dominated quantities. The results are shown in the right half of Fig. 4. The uncertainty
band for P0Ž2.y Ž x . is about 50% wider than that for P0Ž2.q Ž x . around x s 0.3, reflecting
the lack of precise knowledge of the intermediate-N moments, but smaller at small-x, as
this region plays a much greater role for the first moment known here from Eq. Ž2.6.,
than for the second moment in the ‘q’-case. Our parametrizations spanning the present
uncertainty are given by
1
P0,Ž2.y
A Ž x . s 1347.207 q 2283.011 d Ž 1 y x . y 722.238 ln2 Ž 1 y x .
Ž1yx .q
y 1234.756 y 327.479 ln x q 1.432 Ž ln4 x y 4 ln3 x . , Ž 4.14 .
1
P0,Ž2.y
B Ž x . s 1137.897 q 1099.754 d Ž 1 y x . y 2954.684 x 2 y 143.709
Ž1yx .q
y 2.761 ln2 x q 1.432 ln4 x Ž 4.15 .
Žalso here the average represents the central result., supplemented by
1
P1Ž2.y Ž x . s y184.4098 y 180.6971 d Ž 1 y x . y 98.5722 ln Ž 1 y x .
Ž1yx .q
q 205.3670 x 2 q 6.5740 q3.5474 ln2 x. Ž 4.16 .
For the latter expression an average has been calculated in the same manner as for
P1Ž2.q .
5. Numerical results
We are now ready to consider the numerical impact of the NNLO terms on the
evolution of the non-singlet parton densities and structure functions. Before doing so,
however, it is worthwhile to look at the perturbative running of a s underlying these
considerations. In the left part of Fig. 5 the a s-expansion Ž2.11. of the b-function is
W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286 275
Fig. 5. Left: The perturbative expansion of the QCD b-function up to order a s5 , for four flavours in the MS
renormalization scheme. Right: Illustration of the resulting scale dependence of a s , using a variable Nf as
detailed in the text. m2r is given in GeV 2 .
shown for Nf s 4 flavours. Besides the contributions of Eq. Ž2.12. relevant for NNLO
calculations, also the contribution yb 3 a s5 of Ref. w45x has been included. If one uses
the effect of this four-loop ŽN 3 LO. term as an estimate of the residual error of the
expansion, the resulting uncertainty amounts to 0.08%, 0.35%, 1.1% and 2.5% for a s s
0.12, 0.20, 0.30 and 0.40, respectively. The effects are somewhat larger Žsmaller. for
Nf s 3 Ž Nf s 5.. The consequences of this expansion on the scale dependence of a s are
illustrated in the right part of Fig. 5. For this illustration we have used Eq. Ž2.11. with
Nf s 3 at m r ( m c s 1.5 GeV, Nf s 4 between m c and m b s 4.5 GeV, and Nf s 5 for
m r ) m b , assuming that a s Ž m2r . is continuous at these thresholds. If a s is fixed to 0.115
at m r s MZ , then the four-loop effect reaches 0.1% Ž1%. only at m2r s 20 GeV 2 Ž1.5
GeV 2 ., respectively. Clearly the truncation of the series Ž2.11. after three terms does not
introduce a significant theoretical uncertainty in the kinematic regime of deep-inelastic
scattering.
For illustrations of the scale dependence of the parton densities and structure
functions, initial distributions have to be chosen at some reference scales, in the
following denoted by m2f ,0 and Q02 , respectively, in Eqs. Ž2.8. and Ž2.15.. We will
employ the function
3
f s x 0.5 Ž 1 y x . Ž 5.1 .
for all six quantities
" " "
f s xq NS Ž x , m2f ,0 . , F2,NS Ž x ,Q02 . and xF3,NS Ž x ,Q02 . . Ž 5.2 .
Eq. Ž5.1. represents a simple model shape which incorporates the most important
features of non-singlet x-distributions of nucleons. The same input is used in all cases,
as this allows for a direct comparison of the effects of the various kernels in Eqs. Ž2.10.
and Ž2.16.. The overall normalization of f is irrelevant for the logarithmic derivatives
considered below. Our initial scales are specified via
a s Ž m2r s m2f ,0 . s a s Ž m2r s Q02 . s 0.2, Ž 5.3 .
276 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
irrespective of the order of the expansion. For a s Ž MZ2 . s 0.114 . . . 0.120 this choice
corresponds to m2f ,0 s Q02 , 25–50 GeV 2 , a Q 2-region typical for fixed-target DIS. If
not explicitly indicated otherwise, the results will be given for Nf s 4 massless flavours.
2.
The evolution of qq NS x, m f is illustrated in Fig. 6 for the standard choice m r s m f of
Ž
the renormalization scale. In this case the perturbative expansion appears to be very well
convergent: Except for the region around x , 0.07 where the scale derivative is very
2
small, the NNLO corrections for q˙q q
NS ' dln q NS rdln m f are as small as about 2%, while
the NLO contributions typically amount to 10–20%. The residual uncertainty of the
three-loop splitting functions of Section 4 leads to a noticeable effect only for x Q 0.02,
and even at x , 10y3 this effect does not exceed "2% with respect to the central result
1
2 NNLOA q NNLOB
Ž . not shown in the figure. Over the full x-range the NNLO
corrections are comparable to the dependence on the number of active flavours: If Nf is
increased Ždecreased. to Nf s 5 Ž Nf s 3., q˙q NS is decreased increased by about 2%,
Ž .
respectively.
Another way to assess the reliability of perturbative calculations is to investigate the
stability of the results under variations of the renormalization scale m r . In Fig. 7 the
consequences of varying m r over the rather wide range 18 m 2f ( m2r ( 8 m2f are displayed
for six representative values of x. The relative scale uncertainties of the average results,
estimated by
D q˙q
NS
2 1 2 2 2 1 2 2
max q˙q ˙q
NS Ž x , m r s 4 m f . . . 4m f . y min q NS Ž x , m r s 4 m f . . . 4m f .
' 2 1 2 2
2 < average q˙q
NS Ž x , m r s 4 m f . . . 4m f .
<
Ž 5.4 .
are shown in the left part of Fig. 8. Also this estimate leads to about 2% for the NNLO
uncertainty, an improvement by more than a factor of three with respect to the
2
Fig. 6. The perturbative expansion of the scale derivative, q˙q q
NS ' dln q NS r dln m f , for a non-singlet ‘q’-combi-
nation of quark densities at m r s m f . The initial conditions are as specified in Eqs. Ž5.1. – Ž5.3.. Here and in
what follows the subscripts A and B indicate the approximations for the three-loop splitting functions derived
in Section 4.
W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286 277
2
Fig. 7. The dependence of the NLO and NNLO predictions for dln qq
NS r dln m f on the renormalization scale
m r for six typical values of x.
corresponding NLO result. Even as low as x , 10y3 the NNLO calculation, despite its
approximation uncertainty increasing towards small x, is superior to the NLO.
Finally the evolution of ‘y’-combinations qyNS is illustrated in the right part of Fig.
8. For x ) 0.1 the difference to the ‘q’-case discussed so far is negligible at NLO as
Fig. 8. Left: The renormalization scale uncertainty of the NLO and NNLO predictions for the scale derivative
of qq ˙q
NS, as obtained from the quantity D q NS defined in Eq. 5.4 . Right: The NNLO effects on the evolution of
Ž .
qy
NS for the standard scale choice m r s m f , together with a comparison of the NLO partonic NSq and NSy
evolutions.
278 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
Fig. 9. The perturbative expansion of the scale derivative, F˙2,qNS' dln F2,NS
q
r dlnQ 2 , for a non-singlet structure
function at m r s Q. The initial conditions are as specified in Eqs. Ž5.1. – Ž5.3., the leading-order curve is
identical to that of Fig. 6. Also shown Žright part. is the effect of omitting the contribution from the three-loop
Ž2.q
splitting function P NS .
well as at NNLO. At small x the NLO predictions differ by up to 2%. As expected from
the discussion in Section 4, the residual uncertainty of the NNLO result is considerably
less pronounced at small x in the ‘y’-case, but somewhat larger for 0.01 Q x Q 0.1.
We now turn to the evolution of the non-singlet structure functions. The physical
scale derivative F˙2,qNS' dln F2,NS
q
rdlnQ 2 is shown in the left part of Fig. 9 for m r s Q.
Ž1,2.q Ž .
Besides the splitting functions P NS x the effect of which has been illustrated in Fig.
Ž2.q Ž .
6, here also the coefficient functions c 2,Ž1.NS Ž x . and c 2,NS x enter the NLO and NNLO
evolution kernels as detailed in Eq. 2.16 . These additional terms considerably increase
Ž .
the Q 2-dependence at large x, as can be seen by comparing Figs. 6 and 9. For example,
the NNLO corrections rise from 4% at x s 0.5 to about 7, 11 and 21% at x s 0.65, 0.8
and x s 0.95, respectively. The corresponding NLO contributions amount to 24, 30, 37
and 51% of the LO results. Unlike for the parton densities, the NNLO corrections to the
structure functions are larger than the Nf -dependence at large x: If Nf is increased
Ždecreased. to Nf s 5 Ž Nf s 3., F˙2qNS is decreased Žincreased. between 3.5% and 7% for
0.5 ( x ( 0.95, respectively.
The worse convergence of the expansion at large x is due to the large soft-gluon
contributions wln k Ž1 y x .rŽ1 y x .xq, k s 1, . . . ,2 l y 1, to c 2,NSŽ l ." Ž .
x which are conjec-
Ž l .q Ž .
tured to be absent w43x in the MS splitting functions PNS x . Consequently, as shown
in the right part of Fig. 9, keeping only the coefficient-function contributions in the
O Ž a s3 . term of Eq. Ž2.16. yields a very good approximation at large x. In fact P NS Ž2.q
contributes less than 2% to the total NNLO derivative F˙2, NS at x ) 0.2. The residual
q
Ž2.q
uncertainty of PNS , given by the difference NNLOA – NNLOB , is thus completely
negligible in this region.
The dependence of F˙2,qNS on the renormalization scale m r is presented in Figs. 10 and
11 Žleft part., analogously to the partonic case Žsee Eq. Ž5.4.. in Figs. 7 and 8 using
Fig. 10. The dependence of the NLO and NNLO predictions for dln F2,qNS r dlnQ 2 on the renormalization scale
m r for six typical values of x.
q
Fig. 11. Left: The m r-uncertainty of the scale derivative of F2,NS, as estimated by D F˙2,NS
q
defined in Eq. Ž5.5..
Note that the absolute values of F˙2,qNS are very small for 0.05- x - 0.15. Right: The NNLO effects on the
evolution of F2,yNS for m r s Q, together with a comparison of the NSq and NSy evolutions for F2,NS at NLO.
280 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
As for the parton densities shown in Fig. 8, the evolution of F2,yNS illustrated in the
right part of Fig. 11 is indistinguishable from that of F2,qNS at x ) 0.1, while being better
constrained at NNLO at very small x. For x , 10y3 , the 2.5%–3.5% positive effect of
Ž2.y
P NS in Fig. 8 is overcompensated by the coefficient-function contributions. This effect
also occurs for F˙2,qNS not displayed at small x. In both cases the NLO corrections are
"
smaller than for q˙NS , resulting in a better small-x NLO renormalization-scale stability of
F˙2, NS as can be seen by comparing the left parts of Figs. 11 and 8.
"
The scaling violations of F3,yNS are presented in Fig. 12 for the medium- to large-x
" Žl.
region. Since the soft-gluon terms wln k Ž1 y x .rŽ1 y x .xq are identical in c 2,NS and
™
c 3, NS , the results for F˙3,NS and F˙2,NS agree Žfor identical initial distributions as assumed
" Žl.
The resulting upper and lower limits for a s Ž Q 02 . are shown in the left part in Fig. 13.
Due to the increase of the higher-order corrections towards large x discussed above, the
uncertainty Da s rises with increasing x. As available experimental DIS results are
restricted to x ( 0.85 w4x, we choose a value x , 0.55 for estimating the x-averaged
uncertainties given by the differences to the reference result a s Ž Q02 . s 0.2. This
procedure yields
q0.020
Da s Ž Q02 , 25–50 GeV 2 . NLO s , Ž 5.6 .
y0.012
q0.008
Da s Ž Q02 , 25–50 GeV 2 . NNLO s . Ž 5.7 .
y0.004
Often results and uncertainties for a s from different processes and observables are
compared after evolution to a common reference scale, conventionally chosen as the
Z-boson mass MZ . Adopting Q 02 s 30 GeV 2 Žand Nf s 5 for Q 0 ( m r ( MZ . for
definiteness, one obtains the error bands displayed in the right part of Fig. 13 and
q0.006 q0.0025
Da s Ž MZ2 . NLO s , Da s Ž MZ2 . NNLO s . Ž 5.8 .
y0.004 y0.0015
As expected from our previous discussions below Eq. Ž5.5., the NNLO calculation
reduces the theoretical uncertainty under consideration by a factor of about 2.5.
In a data analysis, also the NLO and NNLO central values for a s Ž Q 02 . will be
different, since the NNLO scaling violations are stronger over most of the large-x region
as shown in Fig. 9. A simple estimate analogous to that for Da s yields
a s Ž MZ2 . NNLO y a s Ž MZ2 . NLO , y0.002. Ž 5.9 .
Due to the strong x-dependence of the NNLOrNLO ratio, this estimate is less reliable
than Eq. Ž5.8., its uncertainty amounts to about "0.001. Nevertheless it is interesting to
note that Eq. Ž5.9. agrees with the findings of Refs. w26–28x from analyses of data on
Fig. 13. Right: The x-dependent theoretical uncertainty of the determination of a s from the scale derivative of
F2, NS at Q02 s 25– 50 GeV 2 , estimated by the m r-variation 14 Q02 ( m2r ( 4Q 02 . The scales leading to a maximal
Žminimal. < F˙2 NS < are denoted by m r,max Ž m r,min .. Left: The resulting error band for a s Ž MZ2 . using Q 02 s 30
GeV 2 and Nf s 5.
282 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
Ž2.
F3 . The three-loop splitting function P NS contribute only about y0.0007 to the shift
Ž5.9. of the NNLO result.
6. Summary
We have investigated the effect of the NNLO perturbative QCD corrections on the
scale dependence of flavour non-singlet quark densities and structure functions. For this
purpose, and for application in further analyses, we have derived compact parametriza-
Ž2."
tions of the corresponding three-loop splitting functions PNS and the two-loop
coefficient functions c a,NS, a s 2,3, L. The latter quantities are exactly known w8–13x;
Ž2."
their analytic x-dependent expressions are however rather cumbersome and not readily
transformed to moment space w39,40x. Our parametrizations of c Ž2." a,NS x and their Mellin
Ž .
transforms thus provide a convenient technical tool. They agree to the exact results up to
a few permille or less over the full x-range, thus introducing a negligible error of well
below 0.1% after insertion into the perturbative expansions.
As only partial results are presently available for the three-loop splitting functions
w14–16,30,33x, our parametrizations of P NS Ž2." Ž .
x serve the additional purpose of provid-
ing quantitative estimates of their x-dependent residual uncertainties. The function
x , relevant to the evolution of flavour asymmetries like u q u y w d q d x, is well
Ž2.q Ž .
PNS
constrained at large x by the lowest even-integer moments of Refs. w14–16x, the spread
reaching about "7% at x , 0.3. On the other hand, PNS Ž2.q Ž .
x is very weakly con-
y2
strained for x Q 10 so far, despite the fact that the leading small-x term is known
w33x. The quantity P NSŽ2.y Ž .
x , entering the evolution of the quark–antiquark differences,
is somewhat better Žworse. constrained at small x Žmedium x ., respectively, than
Ž2.q Ž .
PNS x . As the splitting functions enter parton densities and structure functions only
via convolutions with smooth non-perturbative initial distributions, these ‘bare’ uncer-
tainties are very much reduced for physical quantities over the whole x-range. For
example, the spread of P NS Ž2.q
leads to effects of less than "0.2% at x R 0.2 after
convolution with typical nucleonic input shapes. In this region the present uncertainties
Ž2."
of PNS are thus rendered absolutely negligible, leading to effects even below 0.01%
after insertion into the perturbation series. Their impact becomes significant only for
x Q 10y2 , without seriously impairing the NNLO calculations even down to x , 10y3 .
"Ž
The perturbative expansion for the scale dependence dln q NS x, m2f .rdln m2f of the
non-singlet combinations of quark densities appears to be very well convergent. For
a s s 0.2, corresponding to scales of about 25–50 GeV 2 , the NNLO effects of PNS Ž2."
are
on the level of 2% rather uniformly in x. This result is to be compared to the NLO
corrections which amount to 10–20%. Also the variation of the renormalization scale
leads to effects of about "2% at NNLO. Corrections of this size are comparable to the
dependence of the predictions on the number of quark flavours, rendering a proper
treatment of charm effects w46,47x rather important even for large-x non-singlet quanti-
ties.
Especially at x ) 0.5, the higher-order corrections are much larger for the scale
" Ž
derivative dln Fa,NS x,Q 2 .rdlnQ 2 , a s 2,3, of the non-singlet structure functions. This
enhancement is an effect of the coefficient functions containing large wln k Ž1 y x .r
Ž1 y x .xq soft-gluon terms, which are conjectured to be absent in the MS splitting
W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286 283
functions w43x. For example, the NLO and NNLO effects reach about 37% and 11% of
the respective lower-order results at x s 0.8 for a s s 0.2 and four flavours. The NNLO
calculations thus represent a distinct improvement, reducing also the renormalization-
scale dependence of the predictions by a factor of two to three, e.g., to about "7% at
x s 0.8. Accordingly the inclusion of the NNLO corrections into fits of data on
non-singlet scaling violations is expected to yield, besides a slight lowering of the
central values for a s Ž MZ2 . by roughly 0.002, a considerable reduction of the Žso far
dominant. theoretical error due to the truncation of the perturbation series,
q0.0025
Da s Ž MZ2 . NNLO s .
y0.0015 mr
These estimates are compatible with the results of the fits of F3, NS-data performed in
Refs. w26–28x, where an alternative, integer-moment based approach to the calculation
of the scaling violations has been pursued.
FORTRAN subroutines of our parametrizations of c Ž2." Ž2." Ž .
a,NS x , a s 2,3, L, and P NS
Ž . x
can by obtained via email to neerven@lorentz.leidenuniv.nl or avogt@lorentz.
leidenuniv.nl.
Acknowledgements
This work has been supported by the European Community TMR research network
‘Quantum Chromodynamics and the Deep Structure of Elementary Particles’ under
contract No. FMRX–CT98–0194.
N l
1 Ž y1.
Sl ' Sl Ž N . s Ý l
sz Ž l. y c Ž ly1. Ž N q 1 . . Ž A.1 .
ks1 k Ž l y 1. !
Here z Ž1. stands for the Euler–Mascheroni constant, and z Ž l ) 1. for Riemann’s
z-function. The l th logarithmic derivative c Ž ly1. of the G-function can be readily
evaluated using the asymptotic expansion for Re N ) 10 together with the functional
equation.
Due to the simplicity of our parametrizations for the three-loop splitting functions,
only the most simple Mellin transforms occur for these quantities. Therefore we are able
to dispense with details here. The Mellin-N dependence of the exactly known Nf2-piece
can be found in Ref. w30x.
284 W.L. Õan NeerÕen, A. Vogt r Nuclear Physics B 568 (2000) 263–286
The moments the non-singlet ‘q’-coefficient function Ž3.2. entering F2e.m . are given
by
Ž2.q
c 2,NS Ž N . s q3.55555 Ž 6 S4 q 8 S3 S1 q 3 S22 q 6 S2 S12 q S14 .
½
q Nf y0.59259 Ž 2 S3 q 3 S2 S1 q S13 . y 4.2963 Ž S2 q S12 .
For the charged current ‘y’-combination the third to fifth line of this result are,
according to Eq. Ž3.3., replaced by
229.916 31.58 9.7467 393.703 2.9678
q
N
S3 q
Nž 2
q
N / S2 q
N
S2 S1 q
N
S13
and
128.4 13.30 59.12 141.7 0.086
c LŽ2.y
,NS Ž N . s y S2 q S12 q ž y 2 / S1 y
N N N N N3
16 6 6 6 25
q Nf
27 ½ y
Nq1
S1 q
N
q
Ž N q 1.
2
y
Nq1 5 . Ž A.5 .
References
Abstract
The interaction of the partonic fluctuation of the virtual photon in deep inelastic scattering with
soft color fields describing the hadron is treated in an eikonal approximation. It is known that, in
this approach, the small-x limit of the leading-order gluon distribution xg Ž x,Q 2 . is a constant
characterizing the averaged local field strength in the target. Matching the next-to-leading order
calculation in this semiclassical framework with the one-loop parton model result, we obtain the
next-to-leading order contribution to xg Ž x,Q 2 .. It shows a lnŽ1rx . enhancement at small x and is
sensitive to the large distance structure of the target. The final expression is a simple integral over
non-abelian eikonal factors measuring the target color field. We derive a quantitative relation
between the short-distance cutoff of this integral and the scale of the gluon distribution function in
the MS scheme. Our calculation demonstrates that higher order contributions can be systematically
included in the semiclassical approach. q 2000 Elsevier Science B.V. All rights reserved.
1. Introduction
The small-x limit of deep inelastic scattering ŽDIS. remains one of the most
challenging problems on the interface of perturbative and non-perturbative QCD. It can
be expected that both the perturbative physics described by QCD resummation tech-
niques w1–11x and the non-perturbative soft dynamics underlying the growth of total
hadronic cross sections w12–17x are important for a complete picture of the small-x limit
of structure functions. In this paper, we systematically relate models for the soft color
fields, which we consider to be a promising tool for the characterization of the hadron,
to the hadronic gluon distribution, which is the basic object of the perturbative
treatment.
We are interested in a region of x and Q 2 where, although x is very small, the
concept of parton distributions and the conventional DGLAP evolution equations w1–4x
are applicable. For the description of the DIS process, the gluon distribution g Ž x, m2 .,
which dominates in the small-x region, has to be given at some scale m2 - Q 2 . Thus, the
problem of the non-perturbative structure of the target hadron has to be addressed.
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 2 2 - 1
288 H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304
Our basic starting point is the idea of describing high-energy processes in QCD by
studying the eikonalized interaction of energetic partons with soft color fields w18,19x. In
this approach, which has been used for the description of DIS in w20x, soft color fields in
the center-of-mass frame of the collision mediate the interaction between the energetic
projectile and target quarks. Progress towards the description of the energy dependence
of cross sections in the framework of soft color field dynamics was reported, e.g., in
w21,22x.
A closely related approach to both inclusive and diffractive DIS, which treats the
target proton as a superposition of soft color fields, was developed in w23–25x. In fact,
this semiclassical approach reproduces the treatment of w18–20x if the dynamics of the
underlying color field is modelled on the basis of the stochastic vacuum w26,27x and a
phenomenological ansatz for the constituent-quark wave function of the proton is made.
For different approaches to the description of the target color fields see, e.g., Refs.
w28,29x. However, in the following we are completely general and use no model-specific
features of the target color field configurations or, equivalently, the wave functional of
the proton.
In the present paper, the gluon distribution g Ž x, m 2 . at x < 1 and m2 4 L2 Žwhere
L is a soft hadronic scale. is calculated for a target given by soft color field
configurations. Here ‘soft’ means that all momentum components of the field are O Ž L..
Following w30x, a scalar ‘photon’, coupled directly to the gluon field, is used as a
convenient theoretical tool for extracting the gluon distribution. The leading order
calculation gives a constant for x g Ž x, m2 ., which is a measure for the averaged gluon
field strength in the target. This is in agreement with the seminal paper of Mueller w30x
and with w25x, the spirit of which we follow closely.
In our opinion, it is crucial, both from a theoretical and a phenomenological
perspective, to demonstrate the viability of the approach at higher orders. However,
already at next-to-leading order the gluon distribution is scheme dependent and a careful
matching of the partonic calculation Žwe use the MS scheme. and the semiclassical
calculation is necessary to obtain an unambiguous result. With this result, we treat
problems that were not addressed in the one-loop calculations of w30,31x, where the
scheme dependence was not discussed. In fact, the problems of regularization and
scheme dependence arise immediately if one attempts to translate the one-loop, uninte-
grated gluon distribution of w31x into a correction to the leading term in the spirit of w25x
Žsee Ref. w32x for a comparison of the results of w25x and w31x in the case of the quark
distribution.. As emphasized in w25x, where diffractive and inclusive quark and gluon
distributions were calculated in the semiclassical approach, the inclusive gluon distribu-
tion dominates the small-x region. Therefore, we expect that our next-to-leading order
result is the dominant correction relevant as input for the next-to-leading order DGLAP
evolution.
In our approach, the most intricate part is the semiclassical calculation at next-to-
leading order. Working in Feynman gauge, we employ the optical theorem and calculate
the forward scattering amplitude. In the high-energy limit, certain diagrams can be
dropped. The remaining contribution is given in the form of a two-gluon production
cross section. In this way, the identification with the parton model result becomes simple
since the dangerous high-mass region, where the semiclassical approximation fails,
cancels explicitly.
H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304 289
Let us note that, for a soft hadron governed by the single scale L, the perturbative
expansion of the gluon distribution makes no sense. However, following ideas of w33x
Žsee also the recent calculations of Refs. w34–36x., we can always assume that we are
dealing with a very large target, in which case the gluon distribution becomes calculable
without losing the interest of being genuinely non-perturbative in its origin Žsee Ref. w37x
for a discussion of the new hard scale in a framework close to the present paper.. It
remains to be seen in how far this large-target approach will allow for a description of
the qualitative features in the realistic proton case.
The paper is organized as follows. In Section 2 the scattering of a scalar photon off
the target color field is calculated in the parton model and the semiclassical approach.
The comparison of the two results gives rise to the leading-order semiclassical expres-
sion for the gluon distribution. In Sections 3 and 4, the semiclassical and parton model
calculations, respectively, are carried out at next-to-leading order. The extraction of the
next-to-leading order contribution to the gluon distribution from the comparison of the
semiclassical and the parton model results is the subject of Section 5. Section 6 contains
our conclusions, and a number of technical details of the calculations are outlined in
Appendices A–D.
In the following analysis we use a scalar ‘photon’ coupled directly to the gluon field
as a convenient theoretical tool for extracting the gluon distribution w30x. To be precise,
the real ‘photon’ field x couples to the field strength tensor Fmn via the interaction
Lagrangian
l
LI s y x trFmn F mn . Ž 1.
2
The leading-order amplitude for the scattering of the ‘photon’ off a classical color
field is given by the diagram in Fig. 1.
Let q and k be the momenta of the incoming virtual ‘photon’ Ž q 2 s yQ 2 . and the
outgoing gluon respectively. We define the light-cone components of a vector p by
p "s p 0 " p 3 and work in a frame where the plus components of q and k are large. In
the high-energy limit, the amplitude T a corresponding to Fig. 1 is given in the rest
frame of the proton by
i 2p d Ž k 0 y q0 . T a Ž D H . s yi l Ž 12 kq A˜y
a
Ž D. . Ž e H
)
DH . . Ž 2.
Here D s k y q, the field A˜ is the Fourier transform of the external color field A, and e
is the polarization vector of the produced gluon. The evaluation of the r.h.s. of Eq. Ž2. in
Fig. 1. The process x ™ g in an external color field with one gluon exchange.
290 H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304
Fig. 2. The process x ™ g in an external color field. The dot symbolizes the initial x gg vertex.
the high-energy limit shows that T a does not depend on Dq and Dy due to the
softness of the external field. It is convenient to consider the impact parameter space
amplitude
i l q0
T˜ a Ž x H . s Hdx q tr T aŽ eH
) A
E H . Ay Ž xq , x H . Ž 3.
2CA
Žwith E H ' ErE x H ., which is related to the amplitude in Eq. Ž2. by a Fourier
transformation in transverse space. The xy dependence of A A is irrelevant in the
high-energy limit. Here A A s A b T b and T b are the generators of SUŽ Nc . in the adjoint
representation, which we use throughout this paper; CA s Nc .
Resumming the gluon exchange to all orders means that the fast gluon created at the
initial x gg vertex acquires a non-abelian eikonal factor while travelling through the rest
of the external field. This is illustrated in Fig. 2.
The transition to the resummed amplitude of Fig. 2 is realized by the substitution
A
E H Ay ™U A
Ž` , x q . Ž x H . E H AyA UŽ`A,†xq . Ž x H . Ž 4.
in Eq. Ž3., where
ig `
UŽ`A, xq . Ž x H . s Pexp y H A
dxq Ay Ž xq , x H . . Ž 5.
2 x q
sscŽ0. Ž x ,Q 2 .
2
l2 E l2
s Hd 2
xH WxAH Ž yH. y H s0 ' Hd 2
x H < E H WxAH Ž 0 H . < 2 ,
4 g 2 CA E yH 4 g 2 CA
Ž 7.
where the index Ž0. stands for ‘leading order’. Here
WxAH Ž y H . s U A Ž x H . U A † Ž x H qy H . y 1 Ž 8.
and
H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304 291
2
pl2
spŽ0.
m Ž x ,Q . s xg Ž0. Ž x ,Q 2 . . Ž 10 .
4
Identifying the cross sections of Eqs. Ž7. and Ž10., one obtains
1 1
xg Ž0. Ž x ,Q 2 . s 2 Hd 2
x H < E H WxAH Ž 0 H . < 2 . Ž 11 .
2p a s 2CA
This result has been derived in w25x identifying the scaling violations of F2 with the
gluon distribution. Note the color factor 1r2CA in Eq. Ž11. due to the adjoint
™
representation, which we use throughout the paper. As expected, this leading-order
gluon distribution xg Ž0. Ž x,Q 2 . is constant for x 0 and shows no scaling violations
beyond those induced by the explicit a s factor on the r.h.s. of Eq. Ž11.. In Ref. w20x the
value of lim x ™ 0 xg Ž x,Q 2 . has been given in the stochastic vacuum model.
™
The leading ln Q 2 calculation of the process x gg in the semiclassical approach
reproduces the conventional gluon–gluon splitting function, as shown in w25x. Here, we
need the complete next-to-leading order total cross section for the scattering of a x
particle off an external color field. This calculation can be simplified significantly if one
starts with a discussion of all diagrams contributing to the forward amplitude. The total
cross section follows from the imaginary part of this amplitude. Furthermore, it is
convenient to begin by considering the two-gluon exchange approximation.
Clearly, the leading order diagram for the forward amplitude is simply the square of
Fig. 1. At next-to-leading order, all the diagrams in Fig. 4 have to be considered.
In the high-energy limit, the diagrams Žd., Že. Žwith their mirrored analogues. and
diagram Žf. do not contribute. This can be understood intuitively by recalling that the
gluon field is localized in a given region of space. Therefore, in the limit of infinite plus
momentum, the right-moving gluonic degrees of freedom have no time for a virtual
fluctuation between their first and second interaction with the external field Žsee
Appendix A for a more technical argument.. A related discussion in the case of particle
radiation in high-energy scattering of external fields can be found in w38x.
292 H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304
Fig. 4. Two-gluon exchange diagrams for the forward scattering of the scalar ‘photon’ x off a color field. The
mirrored analogues of Žc., Žd. and Že. and the analogue of graph Ža. with the two t channel gluons attached to
different horizontal lines are not shown.
What remains to be calculated is the imaginary part of diagram Ža. Žand the
corresponding diagram where the two t channel gluons are attached to different lines.,
™
diagram Žb. and diagram Žc. Žwith its mirrored analogue.. This corresponds to calculat-
ing the cross section for x gg from the amplitude defined by the three diagrams in
Fig. 5. Note that this is a significant simplification since the amplitude in Fig. 5 has no
contributions arising from cutting diagrams Žd., Že. and Žf. of Fig. 4.
The next step is to resum the interaction with the external field to all orders, i.e. to
repeat the step leading from Fig. 1 to Fig. 2. The result is illustrated in Fig. 6. Here, the
first diagram corresponds to the fluctuation of the incoming ‘photon’ into a gg pair
before the target and the subsequent scattering of the gluons off the color field, treated in
the eikonal approximation. In an expansion in powers of the external field, the leading
term reproduces the first two diagrams of Fig. 5. The second diagram of Fig. 6
corresponds to the creation of the two gluons in the space-time region of the external
field, via a x ggg vertex. The two fast gluons then acquire non-abelian eikonal phases
while travelling through the rest of the color field. An expansion in powers of the
external field generates the third diagram of Fig. 5.
The amplitude of Fig. 6 can be calculated using the methods of w23,24x Žsee also Ref.
w39x.. Some details relevant to this particular process are discussed in Appendix B. The
result can be given in the form
l2 dkXH2
sscŽ1. Ž x ,Q 2 . s 6
da
H H a Ž1ya .
32 Ž 2p .
2
N 2d i j q 2 k i k j
= d xHH 2
Hd 2
kH ˜ Ž
WxAH kXH y k H . , Ž 12 .
N 2qkH
2
where a and 1 y a are the longitudinal momentum fractions carried by the two gluons,
N 2 s a Ž1 y a . Q 2 , k H and kXH are the transverse momenta of one of the two gluons
Fig. 5. The simplified amplitude for x ™ gg, which can be used for the calculation of the total cross section.
H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304 293
Fig. 6. The relevant contributions to the amplitude in the next-to-leading order semiclassical calculation. The
dot in the second diagram symbolizes the initial x ggg vertex.
before and after the interaction with the external field respectively, and W˜ A is the
Fourier transform of the function defined in Eq. Ž8.. Summation over the indices i, j and
over the color indices of W˜ A is implicit.
The full next-to-leading order semiclassical result is given by the sum of Eqs. Ž7. and
Ž12.:
Here l d s lmye r2 and g d s g my e r2 are the d-dimensional couplings, and s, ˆ t,ˆ uˆ are
the usual Mandelstam variables of a 2-to-2 scattering process.
™
From this squared amplitude, the total partonic cross section sˆ Ž1. Ž z,Q 2 . for the
process x g gg is obtained by standard methods. The variable z is defined by
z s Q 2rŽ Q 2 q sˆ.. Alternatively, it is given by z s xry, where y is the fraction of the
target momentum carried by the struck gluon. Combining this with the leading order
result, one can write
1 dz
sp m Ž x ,Q 2 . s Hx Ž s 0 d 4 d Ž 1 y z . q sˆ Ž 1 . Ž z ,Q 2 . . yg b Ž y . , Ž 15 .
z
where s 0 d4 s pl2dr4 Žcf. Eq. Ž10... Note that in the following we only calculate terms
enhanced by powers of lnŽ1rx .. Therefore virtual corrections, which affect only the
endpoint z s 1 and do not produce such terms, do not contribute.
1 dz as Q2
sp m Ž x ,Q 2 . s s 0 ds44 x Hx z ž d Ž1yz . q
2p
Pg g Ž z . ln
m2
q CMSg Ž z .
/
=g Ž y, m2 . , Ž 17 .
where the integrand is only complete in the region z - 1. In Eq. Ž17. Pg g Ž z . Žwith
z - 1. is the usual gluon–gluon splitting function. The coefficient function C MS g Ž z .,
characteristic of the process under consideration, has been derived to be
1yz 11CA
CMSg Ž z . s Pg g Ž z . ln y . Ž 18 .
z 6 z Ž1yz .
Inserting Eq. Ž19. into Eq. Ž17., both g Ž0. and g Ž1. are kept in the d Ž1 y z . term, but
only the leading-order distribution g Ž0. is kept in the a s contribution. Now, identifying
ssc and sp m , the leading order parts of both cross sections cancel and one finds
xg Ž1. Ž x , m2 .
2
1 dkXH2 N 2d i j q 2 k i k j
s 7 Hd aH a Ž 1 y a . Hd 2
xH Hd 2
kH ˜ Ž
WxAH kXH y k H .
4 Ž 2p . N 2qkH
2
as 1 Q2
y Hx dz Pg g Ž z . ln q CMSg Ž z . yg Ž0. Ž y, m2 . . Ž 20 .
2p m2
When evaluating the r.h.s. of this equation, all terms that are not enhanced by lnŽ1rx .
will be dropped. In particular, we can use
2CA 2CA 1
Pg g Ž z . , and C MS
g Ž z. , ž ln y 11
12 . / Ž 21 .
z z z
H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304 295
Note that in Eq. Ž20. we are only interested in the leading twist contribution of the
semiclassical cross section which will be determined in Eq. Ž22..
The integral involving the function W˜ A is conveniently rewritten using the integra-
tion variable z s Q 2rŽ Q 2 q M 2 ., where M 2 s sˆ s kXH2 rŽ a Ž1 y a .. is the invariant
mass of the two gluons in the final state. We introduce a parameter mX such that, for a
soft hadronic scale L governing the behaviour of W˜ A , one has L2 < mX 2 Q Q 2 . This
allows us to decompose the integral according to Žsee Appendix C for details.
2
1 dkXH2 N 2d i j q 2 k i k j
7
da
H H a Ž1ya . H d2 xH H d2kH W˜xAH Ž kXH y k H .
4 Ž 2p . N 2qkH
2
1 1 dz Q2 2 1 dz X2
s
4p 3 H z
ln
zm X2 Hd 2
x H < E H WxAH Ž 0 H . < 2 q
p
H Hm dk f Ž k . ,
z 0
X2
H
X2
H
Ž 22 .
where the non-perturbative W A dependence is encoded in the function f :
d2 yH d2 zH X
f Ž kXH2 . s H Ž 2p . 2 2
H Ž 2p . 2 2
Hd 2
x H tr WxAH Ž y H . WxAH† Ž z H . e i k H Ž y H yz H .
yH zH
2
2Ž y H z H .
=
ž 2 2
yH zH
y1 .
/ Ž 23 .
Here and in the following we disregard all terms suppressed by powers of L2rmX 2 . The
motivation for writing the integral in the form given in Eq. Ž22. is the explicit separation
of the ln Q 2 term, which is multiplied by the short-distance specific function E H WxAH
Ž0 H ..
Note that we have not specified the lower bound of the z integrations in Eq. Ž22..
Clearly, the kinematical limit is z s x since the invariant mass of the produced gg pair
can not be larger then the total center-of-mass energy available. However, no such
bound appears explicitly in the semiclassical treatment since the classical color field
behaves like an infinitely heavy target. The physical cutoff is provided by the break-
down of the semiclassical approximation for z ; x, i.e. y ; 1.
Therefore, in order to obtain the leading logarithm, the second term on the r.h.s. of
Eq. Ž22. has to be treated by applying the substitution
H
1 dz
z
™ 1
ln .
x
Ž 24 .
For the first term on the r.h.s. of Eq. Ž22., only the short distance structure of the
external field matters. According to Eq. Ž11., this short distance behaviour is character-
ized by the leading order gluon distribution yg Ž0. Ž y, m2 .. Thus, we substitute
1 1 dz Q2
H ln Hd 2
x H < E H WxAH Ž 0 H . < 2
2CA z z mX 2
™H x
1 dz
z
ln
Q2
z mX 2
2p 2a s Ž m2 . yg Ž0. Ž y, m2 . Ž 25 .
296 H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304
™
and assume that the behaviour of the phenomenological gluon distribution at z x, i.e.
y s xrz 1, correctly accounts for the region where the semiclassical treatment is no
™
longer valid. In fact, when inserting Eq. Ž22. with the substitutions Eq. Ž24. and Ž25.
into Eq. Ž20., the details of the large-y behaviour of yg Ž0. Ž y, m2 . do not matter since the
ln2 Ž1rx . enhanced contributions from the semiclassical and the partonic calculations
cancel. Thus, we obtain
1 as m2 2 X2
xg Ž1. Ž x , m2 . sln
x p
CA ln
ž m X2
q 11
12 xg
/Ž0.
Ž x , m2 . q Hm dk f Ž k .
p 0
X2
H
X2
H , Ž 26 .
where the mX 2 dependence cancels between the two terms. Therefore, we can set
mX 2 s m2 expw11r12x and write
2 1 m 2 exp w11r12 x
xg Ž1. Ž x , m2 . s ž /Hln dkXH2 f Ž kXH2 . . Ž 27 .
p x 0
Note that the correction xg Ž1. Ž x, m2 . shows scaling violations consistent with the
Altarelli–Parisi evolution at small x and a logarithmic small-x enhancement that is
sensitive to the non-perturbative, large-size structure of the target.
It would be interesting to evaluate Eq. Ž27. in the framework of the model of the
stochastic vacuum of w19x or of the large hadron model employed in w25x. Note in
particular that, following w25x, the large-Nc expression
2 2 2
2
Hd x H tr Ž WxAH Ž y H . WxAH† Ž z H . . s V Nc2 1yeya y H yeya z H qeya Ž y H yz H .
Ž 28 .
can be used in Eq. Ž23.. Here V is the geometrical cross section of the target, the impact
parameter dependence of the target thickness Žwhich would be reflected in an impact
parameter dependence of the parameter a. is neglected, and averaging over all relevant
color field configurations is assumed. With this model, the unintegrated, i.e. kXH
dependent, version of xg Ž1. Ž x, m2 . reproduces the recent result of Mueller Žcf. Eq. Ž44.
of Ref. w31x.. The result presented in Eq. Ž27. extends the discussion given in w31x by
carefully matching the semiclassical and the parton model calculations. This gives rise to
a precise definition of the cutoff of the kXH integration in Eq. Ž27. and to the
interpretation of the result as a correction to the leading order gluon distribution of Eq.
Ž11..
Eqs. Ž23. and Ž27. can be significantly simplified Žcf. Appendix D for details. leading
to the main result of our paper:
2
1 1 ` dy H
xg Ž1. Ž x , m2 . s
p3 ž /H
ln
x r Ž m .2
4
yH ½ y d 2 x H trWxAH Ž y H . ,
H 5
1
y2 g E
2
4e 12
r Ž m. s . Ž 29 .
m2
It is easy to see how such a formula comes about: if the cutoff m2 in Eq. Ž27. could be
taken to infinity, the integration dkXH2 s d 2 kXH rp would give rise to the d-function
d 2 Ž y H yz H . Žcf. Eq. Ž23.., and either the y H or the z H integration could be trivially
performed. The remaining integration, say the y H integration, is now divergent at small
H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304 297
y H , showing that the cutoff in Eq. Ž27. can, in fact, not be removed. The integrand of
this divergent y H integration is precisely the one of Eq. Ž29. and, as can be seen by
closer inspection of the relevant integrals Žcf. Appendix D., it is possible to translate the
upper cutoff of the kXH integration into a lower cutoff of the y H integration. This is the
origin of the compact formula in Eq. Ž29..
Given a specific model for the gluon fields of the target that allows for the calculation
of the fundamental quantity WxAH Ž y H ., Eq. Ž29. can be used to improve the leading
order semiclassical result of Eq. Ž11..
With the large-Nc expression of w25x,
™
parton model approach. At leading order, we have to equate the cross section for the
™
transition x g in a soft external field with the cross section of the process x g g as
™
given in the parton model. At next-to-leading order, the cross section for the transition
™
x gg in a soft external field has to be equated with the parton model cross section of
the process x g gg. The a s correction of the gluon distribution shows a logarithmic
enhancement, i.e. x g Ž1. Ž x, m2 . A lnŽ1rx ..
In our approach we can not go beyond the leading-lnŽ1rx . approximation. This is a
fundamental limitation of the semiclassical treatment and many related approaches,
which has its origin in the artificial separation of the QCD dynamics into the soft
degrees of freedom of the target and the high-energy modes of the projectile. Integrating
298 H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304
over the fluctuations of the projectile, one has to drop the soft modes since their
interaction can not be treated in the eikonal approximation.
Parton distributions at next-to-leading order are scheme dependent. Our final result in
Eq. Ž29. Žtogether with the leading order contribution of Eq. Ž11.. provides the gluon
density in the MS scheme. The short-distance cutoff in Eq. Ž29. is quantitatively related
to the scale of the gluon distribution. This enables us to obtain numerical predictions for
the gluon distribution at next-to-leading order in any non-perturbative approach describ-
ing the soft color field of the proton.
Acknowledgements
One of us ŽA.H.. would like to thank W. Buchmuller ¨ and T. Gehrmann for the
fruitful collaboration on a previous paper, during which essential ideas underlying this
present investigation took shape.
To illustrate the vanishing of diagrams Žd., Že. and Žf. of Fig. 4 in the high-energy
limit, consider first the simpler case where the horizontal gluon lines are replaced by
scalar lines. The scalar version of diagram Fig. 4d. with the naming of momenta used in
this appendix is shown in Fig. 8.
In the high-energy limit, the soft external field can not change the momentum plus
component of the fast right-moving particles essentially, kq, qq. Furthermore, the
dependence of the external field vertex on the minus component of the transferred
momentum can be neglected. Thus, the minus component integrations in the diagram
Fig. 8 take the form
I s dky dly
H H
1
= 2 2 2
. Ž A.1 .
w kq ky y k H q ie xw lq ly y l H q ie x w Ž k y l . q Ž k y l . yy Ž k y l . H q ie x
Note first that this expression vanishes unless 0 ( lq( kq, in which case the two poles
in the complex ly plane lie on different sides of the integration contour. However, this
means that both kq and Ž k y l .q have to be positive, so that now both poles in the
complex ky plane are on the same side of the integration contour. This demonstrates
that the integral I vanishes in any case.
The same argument applies to the scalar analogues of diagrams Figs. 4e,f, since there
the momentum k also flows through more than one propagator between the two external
field vertices. This results in the presence of several poles in ky, all on the same side of
the integration contour, and therefore in the vanishing of the integral.
All that has been said above applies to the gluonic case as well, as long as there is no
additional minus component dependence introduced by the numerator factors. Such a
dependence would prevent us from closing the integration contour. Working in Feynman
gauge and applying the decomposition of the gluon propagator given in Appendix B to
the lines with momentum k, one can easily convince oneself that, in the high-energy
limit, there is indeed no dependence on the minus components.
Thus, we are justified in disregarding diagrams Figs. 4d–f when calculating the
forward scattering amplitude at next-to-leading order.
The left-hand diagram of Fig. 6 contains three contributions: two of them describe the
interaction of a single outgoing gluon with the color field, whereas in the third case both
gluons interact with the field. Here we consider the latter case as an example, and
outline some crucial steps of the evaluation of the corresponding amplitude which we
call T g g .
Working in Feynman gauge and denoting the momenta of the gluons before Žafter.
the interaction by k and p Ž kX and pX ., respectively, the amplitude reads
d4 k yig mr
i2pd Ž kX0 q pX0 y q0 . Ti gj g s H Ž 2p . e a)
4 Ž i.
X X
Ž p . Va m Ž p , p . Hrs Ž p,k .
p2
yig sn
Vnb† Ž kX ,k . eŽbj.) Ž kX . ,
= Ž B.1 .
k2
where the indices i, j characterize the polarization of the gluons in the final state. The
quantity H represents the x gg vertex,
Hrs Ž p,k . s i l Ž Ž kp . g rs y kr ps . , Ž B.2 .
while V Ž pX , p . and V Ž kX ,k ., which will be explicitly given below, are the effective
vertices for the scattering of the fast gluons off the external field.
To simplify the calculation we exploit an alternative representation of the metric
tensor appearing in the gluon propagators. For instance, the tensor g mr of the propaga-
tor yig mrrp 2 is decomposed according to
2 n mp r p mn r nmn r
g mr s y Ý eŽmi. Ž p . eŽrj. Ž p . q q y 2
p2 , Ž B.3 .
is1 np np Ž np .
with the light like vector n s Ž nq,ny,n H . s Ž0,2,0 H .. A possible choice of the polariza-
tion vectors eŽ i.Ž p . in Eq. ŽB.3., which in particular satisfy eŽ i.Ž p . p s 0, is given by
2 p H eŽ i .H
ž
eŽ i. Ž p . s 0,
pq /
,e Ž i . H , Ž B.4 .
In the high energy limit, the t channel exchange of gluons leads to an amplitude
which is proportional to qq. As a consequence, in order to obtain the leading part of the
amplitude, in Eq. ŽB.3. only the term containing the polarization vectors has to be
considered. Because of gauge invariance of the x gg vertex the second term on the r.h.s.
of Eq. ŽB.3. vanishes. The contribution of the remaining two terms is at most
proportional to Ž qq . 0 and hence negligible.
The effective vertices are now multiplied by the polarization vectors of two on-shell
gluons; these vertices are governed by the non-abelian eikonal factor defined in Eqs. Ž9.
and Ž5.. One finds
2
= Ũ A Ž pXH y p H . y Ž 2p . d 2 Ž pXH y p H .
2
= Ũ A † Ž kXH y k H . y Ž 2p . d 2 Ž kXH y k H . . Ž B.6 .
The diagram on the r.h.s. of Fig. 6 cancels the k 2 and p 2 term in the expression
Ž Q 2 q k 2 q p 2 . of Eq. ŽB.6.. Now, the ky integration can be performed by closing the
integration contour in the lower half of the complex ky plane.
The resulting expression contains terms proportional to U˜ A U˜ A †, U˜ A , and U˜ A †, as
well as a constant term. The two additional contributions, where only a single gluon
interacts with the external field and which belong to the left-hand side of Fig. 6, contain
terms proportional to U˜ A and U˜ A † and a constant term. If one adds these contributions,
the color field dependence of the total amplitude Ti j turns out to be
4
U˜ A Ž pXH y p H . U˜ A † Ž kXH y k H . y Ž 2p . d 2 Ž pXH y p H . d 2 Ž kXH y k H .
X X
s d 2 x H eyi x H Ž k H qp H . W˜xAH Ž kXH yk H . .
H Ž B.7 .
This provides us with the final result of the total amplitude,
yi l qq N 2d i j q 2 k i k j X X
Ti j s 2 Hd 2
kH 2 2 Hd 2
x H eyi x H Ž k H qp H . W˜xAH Ž kXH yk H . .
2 Ž 2p . N qkH
Ž B.8 .
H.G. Dosch et al.r Nuclear Physics B 568 (2000) 287–304 301
The cross section in Eq. Ž12. is obtained by means of the standard formula for
scattering off an external field,
1 d 3 kX d 3 pX 1
sscŽ1. Ž x ,Q 2 . s H 3 X 3
2pd Ž kX0 q pX0 y q0 . < Ti j < 2 , Ž B.9 .
2 Ž 2p . 2 k 0 Ž 2p . 2 pX0 2 q0
where the two identical particles in the final state require the factor 1r2 in front of the
integral.
In this appendix we present some details of the derivation of Eq. Ž22.. To introduce
the integration variable z we exploit the relation kXH2 rŽ a Ž1 y a .. s M 2 s Q 2 Ž1 y z .rz.
Therefore the kXH integration appearing on the l.h.s. of Eq. Ž22. can be replaced
according to
H a Ž1ya . ™Q H
dkXH2 2
1 dz
, Ž C.1 .
z2
where just the leading contribution at small z is kept. Concerning the lower bound of
the z integration we refer to the discussion given in Section 5.
Subsequently, it is convenient to divide the a integration of Eq. Ž22. in two parts by
introducing an arbitrary parameter mX which fulfills the condition L2 < mX 2 Q Q 2 . To be
specific we separate the symmetric and asymmetric gluon configurations in the integra-
tion by means of
X2 2 X2 2
Hd a sHm1yrMm rM d a q 2H0m rM d a ,
X2 2
Ž C.2 .
where the symmetric Žasymmetric. configurations give rise to the first Žsecond. term on
the r.h.s. of Eq. Ž22.. For a given value of M 2 the symmetric contribution contains large
transverse momenta kXH while the asymmetric term is entirely soft.
To extract the leading twist of the symmetric hard part we use
N 2d i j q 2 k i k j
Hd 2
kH W˜xAH Ž kXH y k H . s Ci j, a d 2 k H k a W˜xAH Ž yk H .
H
N 2qkH
2
and
2
4 2
Hd 2
k H k aW˜xAH Ž yk H . s Ž 2p . E H WxAH Ž 0 H . Ž C.5 .
are important for the further evaluation of the hard contribution. In writing Eq. ŽC.4. we
have anticipated the symmetric kXH integration and employed the substitution kXa kXb
kXH2 d a br2. Finally, performing the a integration and keeping only the contribution that
™
is dominant at small z, the first term on the r.h.s. of Eq. Ž22. is obtained.
In the case of the asymmetric soft contribution, which is independent of Q 2 and
therefore leading twist, the a integration is replaced by an integral over kXH . Since a is
small we have Q 2a , kXH2 z, leading to the replacement
X2 2 z X2
H0m rM d a , Q H0m dk 2
X2
H . Ž C.6 .
The remaining step to obtain the second term on the r.h.s. in Eq. Ž22. is the k H
integration. Keeping in mind that for the soft term we can neglect N 2 this integration
can easily be done with the aid of
ki kj 2p 2 yi y j
Hdk 2
H 2
kH
e i k H y Hs 2
yH ž d i jy 2
yH / . Ž C.7 .
In this appendix, the details of the derivation of Eq. Ž29., which represents a
particularly simple form of our final result, are described.
It is convenient to introduce a real parameter e ) 0 and to write the gluon distribution
given by Eqs. Ž23. and Ž27. in the form
2 1
xg Ž1. Ž x , m2 . s 2 ž /
ln I0 Ž 0, mX . , Ž D.1 .
p x
where
Ie Ž a,b . ' Ha -k X2
d 2 kXH fe Ž kXH2 . , Ž D.2 .
2 2
H-b
d2 yH d2 zH X
fe Ž kXH2 . ' H Ž 2p . 2 2y e
H Ž 2p . 2 2y e
h Ž y H , z H . e i k H Ž y H yz H .
yH zH
2
2Ž y H z H .
=
ž 2 2
yH zH
y1 ,
/ Ž D.3 .
and
Furthermore, we have
I0 Ž 0, mX . s lim Ie Ž 0,` . y Ie Ž mX ,` . .
e™0 Ž D.5 .
The kXH integration in the definition of Ie Ž0,`. is easily performed giving a d-function
of y H yz H . The result is
d2 yH
Ie Ž 0,` . s H Ž 2p . 2 4y2 e
hŽ y H , y H . . Ž D.6 .
yH
Since
C r 2e 1 2
dy H
Ie Ž 0,` . s q H 4
hŽ y H , y H . q OŽ e . . Ž D.8 .
2p 2 e 4p r -y 2 2
H
yH
The parameter r has been introduced to separate the small-distance from the large-dis-
tance part of the y H integration.
In the kXH integration defining Ie Ž mX ,`., the momentum variable kXH is always large.
Therefore, the result is only sensitive to the small distance structure of hŽ y H , z H ., and
we can write
d2 yH d2 zH X
Ie Ž mX ,` . s Hm -kX2 X2
d 2 kXH H Ž 2p . 2 2y e
H Ž 2p . 2 2y e
C y H Pz H e i k H Ž y H yz H .
H yH zH
2
2Ž y H z H .
=
ž 2 2
yH zH
y1 .
/ Ž D.9 .
When inserting Eqs. ŽD.8. and ŽD.10. in Eq. ŽD.5., the poles in e cancel. Setting the
parameter r, which so far was arbitrary, to
1
y2 g E
1y2 g E 12
2
4e 4e
r Ž m. s s , Ž D.11 .
mX 2 m2
References
w1x V.N. Gribov, L.N. Lipatov, Sov. J. Nucl. Phys. 15 Ž1972. 438.
w2x V.N. Gribov, L.N. Lipatov, Sov. J. Nucl. Phys. 15 Ž1972. 675.
w3x G. Altarelli, G. Parisi, Nucl. Phys. B 126 Ž1977. 298.
w4x Yu.L. Dokshitzer, Sov. Phys. JETP 46 Ž1977. 641.
w5x E.A. Kuraev, L.N. Lipatov, V.S. Fadin, Sov. Phys. JETP 44 Ž1976. 443.
w6x E.A. Kuraev, L.N. Lipatov, V.S. Fadin, Sov. Phys. JETP 45 Ž1977. 199.
w7x Y.Y. Balitsky, L.N. Lipatov, Sov. J. Nucl. Phys. 28 Ž1978. 822.
w8x J. Bartels, Nucl. Phys. B 151 Ž1979. 293.
w9x J. Bartels, Nucl. Phys. B 175 Ž1980. 365.
w10x A.H. Mueller, Nucl. Phys. B 415 Ž1994. 373.
w11x N.N. Nikolaev, B.G. Zakharov, Z. Phys. C 64 Ž1994. 631.
w12x A. Donnachie, P.V. Landshoff, Nucl. Phys. B 244 Ž1984. 322.
w13x A. Donnachie, P.V. Landshoff, Phys. Lett. B 296 Ž1992. 227.
w14x A. Donnachie, P.V. Landshoff, Phys. Lett. B 437 Ž1998. 408.
w15x J.R. Cudell, A. Donnachie, P.V. Landshoff, Phys. Lett. B 448 Ž1999. 281.
w16x ¨
K. Golec-Biernat, M. Wusthoff, Phys. Rev. D 59 Ž1999. 014017.
w17x U. D’Alesio, A. Metz, H.J. Pirner, Eur. Phys. J. C 9 Ž1999. 601.
w18x O. Nachtmann, Ann. Phys. 209 Ž1991. 436.
w19x ¨
H.G. Dosch, A. Kramer, Phys. Lett. B 252 Ž1990. 669.
w20x H.G. Dosch, T. Gousset, H.J. Pirner, Phys. Rev. D 57 Ž1998. 1666.
w21x I. Balitsky, Phys. Rev. D 60 Ž1999. 014020.
w22x J. Jalilian-Marian, A. Kovner, H. Weigert, Phys. Rev. D 59 Ž1999. 014015.
w23x ¨
W. Buchmuller, M.F. McDermott, A. Hebecker, Nucl. Phys. B 487 Ž1997. 283.
w24x ¨
W. Buchmuller, M.F. McDermott, A. Hebecker, Nucl. Phys. B 500 Ž1997. 621, ŽE..
w25x ¨
W. Buchmuller, T. Gehrmann, A. Hebecker, Nucl. Phys. B 537 Ž1999. 477.
w26x H.G. Dosch, Phys. Lett. B 190 Ž1987. 177.
w27x H.G. Dosch, Yu.A. Simonov, Phys. Lett. B 205 Ž1988. 339.
w28x F. Hautmann, Z. Kunszt, D.E. Soper, Phys. Rev. Lett. 81 Ž1998. 3333.
w29x Yu.V. Kovchegov, L. McLerran, Phys. Rev. D 60 Ž1999. 054025.
w30x A.H. Mueller, Nucl. Phys. B 335 Ž1990. 115.
w31x A.H. Mueller, preprint CU-TP-937, hep-phr9904404.
w32x ¨
W. Buchmuller, talk at New Trends in HERA Physics, Ringberg Workshop 1999, hep-phr9906546.
w33x L. McLerran, R. Venugopalan, Phys. Rev. D 49 Ž1994. 2233.
w34x Yu.V. Kovchegov, Phys. Rev. D 54 Ž1996. 5463.
w35x Yu.V. Kovchegov, Phys. Rev. D D 55 Ž1997. 5445.
w36x Yu.V. Kovchegov, A.H. Mueller, Nucl. Phys. B 529 Ž1998. 451.
w37x A. Hebecker, H. Weigert, Phys. Lett. B 432 Ž1998. 215.
w38x S.J. Brodsky, P. Hoyer, Phys. Lett. B 298 Ž1993. 165.
w39x A. Hebecker, preprint HD-THEP-99-12, hep-phr9905226.
Nuclear Physics B 568 Ž2000. 305–318
www.elsevier.nlrlocaternpe
Abstract
We study the pair production of charged Higgs bosons at the CERN Large Hadron Collider in
the context of the minimal supersymmetric extension of the standard model. We compare the
contributions due to qq annihilation at the tree level and gg fusion, which proceeds at one loop.
At small or large values of tan b , HqHy production proceeds dominantly via bb annihilation,
due to Feynman diagrams involving neutral CP-even Higgs bosons and top quarks, which come in
addition to the usually considered Drell–Yan diagrams. In the case of gg fusion, the squark loop
contributions may considerably enhance the well-known quark loop contributions. q 2000 Else-
vier Science B.V. All rights reserved.
1. Introduction
One of the prime objectives of the CERN Large Hadron Collider ŽLHC. is the search
for spin-zero particles which remain in the physical spectrum after the elementary-par-
ticle masses have been generated through the Higgs mechanism of electroweak symme-
try breaking w1x. Should the world be supersymmetric, then the Higgs sector is more
complicated than in the standard model ŽSM., which predicts just one scalar Higgs
boson. The Higgs sector of the minimal supersymmetric extension of the SM ŽMSSM.
consists of a two-Higgs-doublet model Ž2HDM. and accommodates five physical Higgs
bosons: the neutral CP-even h 0 and H 0 bosons, the neutral CP-odd A0 boson, and the
charged H "-boson pair. At the tree level, the MSSM Higgs sector has two free
parameters, which are usually taken to be the mass m A of the A0 boson and the ratio
tan b s Õ 2rÕ1 of the vacuum expectation values of the two Higgs doublets.
1
¨ Theoretische Physik, Universitat
Permanent address: II. Institut fur ¨ Hamburg, Luruper Chaussee 149,
22761 Hamburg, Germany.
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 3 2 - 4
306 ´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu,
The discovery of the H " bosons would prove wrong the SM and, at the same time,
give strong support to the MSSM. The logistics of the H "-boson search at the LHC
™ ™ ™
may be summarized as follows. For H "-boson masses m H - m t y m b , the dominant
production mechanisms are gg,qq tt followed by t bHq w1x.2 The dominant decay
mode of H " bosons in this mass range is Hq tnt unless tan b - m crmt f 1 w1x. In (
contrast to the SM top-quark events, this signature violates lepton universality, a
criterion which is routinely applied in ongoing H "-boson searches at the Fermilab pp
™ ™
collider Tevatron w2,3x. For larger values of m H , the most copious sources of H "
™
bosons are provided by gb tHq w4–7x, gg tbHq w8x, and qb q X bHq w9x. The
preferred decay channel is then Hq tb, independently of tan b w1x. An interesting
™
alternative is to produce H " bosons in association with W . bosons, so that the
leptonic decays of the latter may serve as a trigger for the H "-boson search. The
™
dominant subprocesses of W " H . associated production are bb W " H . at the tree
level and gg W "H . at one loop, which were investigated for m b s 0 and small
™
values of tan b Ž0.3 ( tan b ( 2.3. in Ref. w10x and recently, without these restrictions, in
Refs. w11,12x. A careful signal-versus-background analysis, based on the analytic results
of Ref. w11x, was recently reported in Ref. w13x.
™
In this paper, we investigate HqHy pair production in the MSSM. At the tree level,
this proceeds via qq annihilation, qq Hq Hy, where q s u,d, s,c,b. The Drell–Yan
process, where a photon and a Z-boson are exchanged in the s channel Žsee upper
Feynman diagram in Fig. 1. has been studied by a number of authors w14–16x. As
pointed out in Ref. w11x, in the case q s b, there are additional Feynman diagrams
involving the h 0 and H 0 bosons in the s channel and the top quark in the t channel Žsee
middle and lower diagrams in Fig. 1.. As we shall see later on, for small or large values
of tan b , their contribution greatly exceeds the one due to the Drell–Yan process, which
is independent of tan b . To our knowledge, these additional diagrams have not been
2
Here and in the following, the charge-conjugate processes will not be explicitly mentioned.
´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu, 307
™
considered elsewhere in the literature. At one loop, HqHy pair production receives an
additional contribution from gg fusion, gg HqHy. Although the cross section of gg
fusion is suppressed by two powers of a s relative to the one of qq annihilation, it is
expected to yield a comparable contribution at multi-TeV hadron colliders, due to the
overwhelming gluon luminosity. In the 2HDM, gg fusion is mediated by heavy-quark
loops Žsee upper two rows in Fig. 2. w17–19x. We calculated these QCD contributions
and found full agreement with the analytical and numerical results presented in Ref.
w19x. In the MSSM, there are additional QCD contributions induced by squark loops Žsee
lower three rows in Fig. 2., which we shall present here.
As for bb annihilation, it should be noted that the treatment of bottom as an active
™ ™
flavour inside the colliding hadrons leads to an effective description, which comprises
™
contributions from the higher-order subprocesses gb Hq Hy b, gb Hq Hy b, and
™
gg Hq Hy bb. If all these subprocesses are to be explicitly included along with
bb Hq Hy, then it is necessary to employ a judiciously subtracted parton density
™
function ŽPDF. for the bottom quark in order to avoid double counting w4–7,20,21x. The
evaluation of bb Hq Hy with an unsubtracted bottom PDF is expected to slightly
overestimate the true cross section w4–7,20,21x. For simplicity, we shall nevertheless
adopt this effective approach in our analysis, keeping in mind that a QCD-correction
factor below unity is to be applied.
The circumstance that the spectrum of states is more than doubled if one passes from
the SM to the MSSM gives rise to a proliferation of parameters, which weakens the
predictability of the theory. A canonical method to reduce the number of parameters is
to embed the MSSM into a grand unified theory ŽGUT., e.g., a suitable supergravity
ŽSUGRA. model, in such a way that it is recovered in the low-energy limit. The MSSM
thus constrained is described by the following parameters at the GUT scale, which come
in addition to tan b and m A : the universal scalar mass, m 0 ; the universal gaugino mass,
m1r2 ; the trilinear Higgs-sfermion coupling, A; the bilinear Higgs coupling, B; and the
Higgs-higgsino mass parameter, m. Notice that m A is then not an independent parameter
anymore, but it is fixed through the renormalization group evolution. The number of
parameters can be further reduced by making additional assumptions. Unification of the
t-lepton and b-quark Yukawa couplings at the GUT scale leads to a correlation between
m t and tan b . Furthermore, if the electroweak symmetry is broken radiatively, then B
and m are determined up to the sign of m. Finally, it turns out that the MSSM
parameters are nearly independent of the value of A, as long as < A < Q 500 GeV at the
GUT scale. Further details on the SUGRA-inspired MSSM scenario may be found in
Ref. w22x and the references cited therein.
™
This paper is organized as follows. In Section 2, we shall list analytic results for the
™
tree-level cross section of qq Hq Hy, including the Yukawa-enhanced contributions
for q s b, and the squark loop contribution to the gg HqHy amplitude in the MSSM.
The relevant Higgs-squark coupling constants and the squark loop form factors are
™
relegated to Appendices A and B, respectively. In Section 3, we shall present quantita-
tive predictions for the inclusive cross section of pp HqHyq X at the LHC adopting
the SUGRA-inspired MSSM. Section 4 contains our conclusions.
2. Analytic results
™
u s m 2H y x b'S mT expŽ y ., respectively. Notice that sp T2 s tu y m 4H . In the parton model,
the differential cross section of AB HqHyq X is given by
d 2s
dy dpT2
Ž AB ™H H qX .
q y
sÝ
a, b
1
Hx dx
a
a Fa r A Ž x a , M a . Fb r B Ž x b , Mb .
xb s
m2H y t
ds
dt
Ž ab ™H H
q y
., Ž 1.
c w2 y sw2 Ib y 2 sw2 Q b Ib
g Hq Hy Z s y , ÕZ b b s y , aZ b b s y ,
2 cw 2 cw 2 cw
g H q H y g s ysw , Õg b b s ysw Q b , Ž 5.
310 ´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu,
™
recover the well-known Drell–Yan cross section of qq Hq Hy w14–16x from Eq. Ž3.
by putting S s T "s 0 and substituting b q. The approximation S s T "s 0 is justi-
™
is the propagator function of particle X, with mass m X and total decay width G X . We
fied for the quarks of the first and second generations, q s u,d, s,c, because S and T "
are then suppressed by the smallness of the corresponding Yukawa couplings. However,
the S and T " terms give rise to sizeable contributions in the case of q s b, especially at
small or large values of tan b . The full cross section of qq annihilation is obtained by
complementing the bb-initiated cross section of Eq. Ž3. with the Drell–Yan cross
sections for q s u,d, s,c.
™
The one-loop diagrams for gg HqHy in the MSSM are displayed in Fig. 2. As for
the quark loops, our analytical results fully agree with those listed in Ref. w19x, and there
is no need to repeat them here. In the squark case, the T-matrix elements corresponding
to the triangle and box diagrams are found to be
2
GF m W a s Ž mr .
T˜^ s ´mc Ž pa . ´nc Ž p b . A1mn F˜^ ,
'2 p
2
GF m W a s Ž mr .
T˜I s ´mc Ž pa . ´nc Ž p b . A1mn F˜I q A 2mn G˜ I ,
ž / Ž 7.
'2 p
respectively, where a s Ž m r . is the strong coupling constant at renormalization scale m r ,
´mc Ž pa . is the polarization four-vector of gluon a and similarly for gluon b, it is summed
over the colour index c s 1, . . . ,8,
2
A1mn s g mn y pan p bm ,
s
2 m 2H u y m 2H t y m2H
A 2mn s g mn q
p T2 ž s
pan p bm q
s
pan pHmq q
s
p bm pHn q q pHmq pHn q ,
/
Ž 8.
and the form factors F˜^ , F˜I , and G˜ I are listed in Appendix B. Due to Bose
l
symmetry, T˜^ and T˜I are invariant under the simultaneous replacements m n and l
™
pa p b . Consequently, F˜^ , F˜I , and G˜ I are symmetric in t and u.
The parton-level cross section of bb Hq Hy including both quark and squark
contributions is then given by
™H H
2
ds GF2 a s2 Ž m r . 2
2 mW
dt
Ž gg q y
.s
256 Ž 2p .
3 Ý Q
C^ Q
F^ q FI y
s
ž F˜ ˜
^ q FI /
Qst , b
2 2
2 mW 2
q GI y G˜ I q H I , Ž 9.
s
´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu, 311
Q Q
where the generalized coupling C ^ and the form factors F^ , FI , G I , and H I may
be found in Eq. Ž8. and Appendix A of Ref. w19x, respectively.
3. Phenomenological implications
useful to replace m A by m H , the mass of the H " bosons to be produced, in the set of
MSSM input parameters. We vary tan b and m H in the ranges 1 - tan b - 40 f m trm b
and 120 GeV - m H - 550 GeV, respectively. As for the GUT parameters, we choose
™
Fig. 3. Total cross sections s Žin fb. of pp Hq Hy q X via qq annihilation Ždashed line. and gg fusion
Žsolid line. at the LHC as functions of tan b for m H s 200 GeV. For comparison, also the Drell–Yan
contribution to qq annihilation Ždotted line. and the quark loop contribution to gg fusion Ždot-dashed line. are
shown.
312 ´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu,
m1r2 s 100 GeV, A s 0, and m - 0, and tune m 0 so as to be consistent with the desired
value of m H . All other MSSM parameters are then determined according to the
SUGRA-inspired scenario as implemented in the program package SUSPECT w30x. We
do not impose the unification of the t-lepton and b-quark Yukawa couplings at the GUT
scale, which would just constrain the allowed tan b range without any visible effect on
the results for these values of tan b .
™
We now study the total cross section of pp HqHyq X at the LHC, with c.m.
energy 'S s 14 TeV. In Fig. 3 the full contributions due to qq annihilation Ždashed
line. and gg fusion Žsolid line. are displayed as functions of tan b for m H s 200 GeV.
For comparison, also the Drell–Yan contributions to qq annihilation for q s u,d, s,c,b
Ždotted line. and the quark loop contribution to gg fusion Ždot-dashed line., which is the
full one-loop result for gg-fusion in the 2HDM, are shown. In the case of qq
annihilation, as anticipated in Section 1, the Yukawa-enhanced contribution for q s b
greatly enhances the conventional Drell–Yan cross section for large values of tan b , by
more than a factor of three for tan b s 40. The expected enhancement for small tan b is
invisible, since solutions with tan b Q 2 are excluded in the SUGRA-inspired MSSM for
our choice of input parameters. As for gg fusion, the dot-dashed line nicely agrees with
Fig. 6 of Ref. w19x. The quark loop contribution exhibits a minimum at tan b
™
Fig. 4. Total cross sections s Žin fb. of pp Hq Hy q X via qq annihilation Ždashed lines. and gg fusion
Žsolid lines. at the LHC as functions of m H for tan b s1.5, 6, and 30.
´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu, 313
(
f m trm b f 6. This may be understood by observing that the average strength of the
(
Hy bt coupling in Eq. Ž2., proportional to m 2t cot 2b q m 2b tan2b , is then minimal w19x.
Passing from the 2HDM to the MSSM, we need to coherently add the squark loop
contribution according to Eq. Ž9.. We observe that this leads to a significant rise in cross
section, by up to 50%, unless tan b is close to 10. Nevertheless, the full tree-level cross
section is dominant for all values of tan b .
In Fig. 4 the m H dependence of the full qq-annihilation Ždashed lines. and gg-fusion
cross sections Žsolid lines. is studied for tan b s 1.5, 6, and 30. As we have already seen
in Fig. 3, qq annihilation always dominates. Its contribution modestly exceeds the one
due to gg fusion, by a factor of three or less, if tan b f 1.5 or 30 and m H R 200 GeV,
but it is more than one order of magnitude larger if m H Q m t . The gg-fusion contribu-
tion is greatly suppressed if tan b f 6, independently of m H . For all values of tan b , the
latter exhibits a dip located about m H s m t , which arises from resonating top-quark
propagators in the quark box form factors. ŽIn the case of tan b s 1.5, this dip lies in the
excluded m H range.. This feature may also be seen in Fig. 5 of Ref. w19x, where the
quark loop contribution is shown separately. We note in passing that we also find good
agreement with that figure.
For a comparison with future experimental data, the qq-annihilation and gg-fusion
™
channels should be combined. From Fig. 4 we read off that the total cross section of
pp Hq Hyq X at the LHC is predicted to be 180 fb Ž0.9 fb. in the considered MSSM
scenario if tan b s 30 and m H s 120 GeV Ž500 GeV.. If we assume the integrated
luminosity per year to be at its design value of L s 100 fby1 for each of the two LHC
experiments, ATLAS and CMS, then this translates into about 36 000 Ž180. signal
events per year.
4. Conclusions
We studied the hadroproduction of HqHy pairs within the MSSM, adopting a
SUGRA-inspired scenario. We included the contributions from qq annihilation and gg
fusion to lowest order and provided full analytic results. Our analysis reaches beyond
previous studies w14–19x in two important respects. In the case of qq annihilation, we
demonstrated that previously neglected Yukawa-type contributions in the bb channel
lead to a substantial increase in cross section if tan b is large, by more than a factor of
three for tan b s 40. In the case of gg fusion, we upgraded a previous result w19x, which
we confirmed, from the 2HDM to the MSSM by including the contributions induced by
virtual squarks. As a result, the gg-fusion cross section may be significantly enhanced,
by up to 50%, depending on tan b . Should the MSSM be realized in nature, then HqHy
pair production will provide a copious source of charged Higgs bosons at the LHC, with
an annual yield of up to 36 000 signal events, which amounts to 72 000 charged Higgs
bosons per year.
™
After this paper was submitted, a preprint w31x appeared, in which the cross section of
pp HqHyq X via gluon fusion was studied for the LHC. The analytic results for the
squark loop contributions presented there agree with our results.
314 ´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu,
Acknowledgements
B.A.K. thanks the Theory Group of the Werner-Heisenberg-Institut for the hospitality
extended to him during a visit when this paper was finalized. The work of A.A.B.B. was
¨
supported by the Friedrich-Ebert-Stiftung through Grant No. 219747. The II. Institut fur
Theoretische Physik is supported by the Bundesministerium fur ¨ Bildung und Forschung
under Contract No. 05 HT9GUA 3, and by the European Commission through the
Research Training Network Quantum Chromodynamics and the Deep Structure of
Elementary Particles under Contract No. ERBFMRXCT980194.
In this appendix, we collect the couplings of the h 0 , H 0 , and H " bosons to the
squarks q˜i , with q s t,b and i s 1,2, which are relevant for our analysis. Defining the
mixing matrix which rotates the left- and right-handed squark fields, q˜L and q˜R , into the
mass eigenstates q˜i as
we have w23–26x
g h 0 b˜ 1 b˜ 1 g h 0 b˜ 1 b˜ 2
ž g h 0 b˜ 2 b˜ 1 g h 0 b˜ 2 b˜ 2 /
m Z sq Ž Ib3 y sw2 Q b . m2b sa m b Ž m ca q A b sa .
q
sM b
˜
cw
m b Ž m ca q A b sa .
2 mW cb
mW cb 2 mW cb
m Z sq sw2 Q b
cw
q
m 2b sa
mW cb
0 T
Ž M b˜ . ,
´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu, 315
y
2 mW sb
m Z cq sw2 Q t
cw
y
m 2t sa
mW sb
0 Ž M t˜ . ,
T
g H 0 b˜ 1 b˜ 1 g H 0 b˜ 1 b˜ 2
ž g H 0 b˜ 2 b˜ 1 g H 0 b˜ 2 b˜ 2 /
m Z cq Ž Ib3 y sw2 Q b . m2b ca m b Ž m sa y A b ca .
y y
sM b
˜
cw
m b Ž m sa y A b ca .
2 mW cb
mW cb
y
2 mW cb
m Z cq sw2 Q b
cw
y
m2b ca
mW cb
0 Ž M b˜ . ,
T
g H q t˜1 b˜ 1 g H q t˜1 b˜ 2
ž g H q t˜2 b˜ 1 g H q t˜2 b˜ 2 /
2
ymW s2 b q m2t cot b q m2b tan b m b Ž m q A b tan b .
s M t̃
'2 mW
m t Ž m q A t cot b .
'2 mW
'2 mW
m t m b Ž tan b q cot b .
'2 mW 0 Ž M b˜ . ,
T
= Ž M t̃ . ,
T
2 c w2
0
2
2 mW
c 2 b sw2 Q t
2 c w2
y
m2t cot 2b
2
2 mW
0
316 ´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu,
g H q H y b˜ 1 b˜ 1 g H q H y b˜ 1 b˜ 2
ž g H q H y b˜ 2 b˜ 1 g H q H y b˜ 2 b˜ 2 /
c 2 b Ib3 Ž 1 y 2 c w2 . y sw2 Q b m2t cot 2b
y 0
sM b̃
= Ž M b̃ . .
T
2 c w2
0
2
2 mW
c 2 b sw2 Q b
2 c w2
y
m2b tan2b
2
2 mW
0
Ž A.2 .
The corresponding Feynman rules emerge for the trilinear Žquartic. couplings by
multiplication with ig Ž ig 2 ., where g is the SUŽ2. coupling constant. Similar relations
apply for the squarks of the first and second generations, which are also included in our
analysis. However, in these cases, we neglect terms which are suppressed by the
smallness of the corresponding light-quark masses.
In this appendix, we express the squark triangle and box form factors, F˜^ , F˜I , and
G̃ I , in terms of the standard scalar three- and four-point functions,
2
C0 Ž p 12 , Ž p 2 y p 1 . , p 22 ,m20 ,m12 ,m22 .
d4q 1
s H ip Ž q 2 2 2 2
,
y m20 q i e . Ž q q p1 . y m12 q i e Ž q q p 2 . y m22 q i e
d4q 1
s H ip 2 ,
Ž q 2 y m20 q i e . w Ž q q p1 . 2y m12 q i e xw Ž q q p 2 . 2y m22 q i e xw Ž q q p 3 . 2y m23 q i e x
Ž B.1 .
which we evaluate numerically with the aid of the program package FF w32x. We have
F˜^ s g H q H y h 0 Ph 0 Ž s . g h 0 t˜1 t˜1 F1 Ž s,m t˜1 . q g h 0 t˜2 t˜2 F1 Ž s,m t˜2 . q g h 0 b˜ 1 b˜ 1 F1 Ž s,m b˜ 1 .
y g H q H y b˜ 2 b˜ 2 F1 Ž s,m b˜ 2 . ,
´ B.A. Kniehl r Nuclear Physics B 568 (2000) 305–318
A.A. Barrientos Bendezu, 317
lm .
F˜I s g H2 q t˜1 b˜ 1 F2 Ž h, s,t ,u,m t˜1 ,m b˜ 1 . q Ž m t˜1 b˜ 1
qg 2
F Ž h, s,t ,u,m ,m . q Ž m l m
H q t˜1 b˜ 2 2 t˜1 b˜ 2 t˜1 b˜ 2 .
qg 2
F Ž h, s,t ,u,m ,m . q Ž m l m
H q t˜2 b˜ 1 2 t˜2 b˜ 1 t˜2 b˜ 1 . q g H2 q t˜2 b˜ 2
= F Ž h, s,t ,u,m ,m . q Ž m l m . ,
t˜2 b˜ 2 t˜2 b˜ 2 Ž B.2 .
™ F . Here, we have
2
= Ž 2 D 10 q D 02 q D 03 q D 04 q D 05 . 4 , Ž B.3 .
where t 1 s t y h, u1 s u y h, and
References
w1x Z. Kunszt, F. Zwirner, Nucl. Phys. B 385 Ž1992. 3, and references cited therein.
w2x CDF Collaboration, F. Abe et al., Phys. Rev. D 54 Ž1996. 735.
w3x B. Bevensee Žfor the CDF and D0 Collaborations., Report No. FERMILAB–Conf–98r155–E ŽMay
1998., to be published in the proceedings of 33rd Rencontres de Moriond: QCD and High Energy
Hadronic Interactions, Les Arcs, France, 21–28 March 1998.
w4x J.F. Gunion, H.E. Haber, F.E. Paige, W.-K. Tung, S.S.D. Willenbrock, Nucl. Phys. B 294 Ž1987. 621.
w5x R.M. Barnett, H.E. Haber, D.E. Soper, Nucl. Phys. B 306 Ž1988. 697.
w6x F.I. Olness, W.-K. Tung, Nucl. Phys. B 308 Ž1988. 813.
w7x V. Barger, R.J.N. Phillips, D.P. Roy, Phys. Lett. B 324 Ž1994. 236.
w8x J.L. Diaz-Cruz, O.A. Sampayo, Phys. Rev. D 50 Ž1994. 6820.
w9x S. Moretti, K. Odagiri, Phys. Rev. D 55 Ž1997. 5627.
w10x D.A. Dicus, J.L. Hewett, C. Kao, T.G. Rizzo, Phys. Rev. D 40 Ž1989. 787.
w11x A.A. Barrientos Bendezu, ´ B.A. Kniehl, Phys. Rev. D 59 Ž1998. 015009.
w12x A.A. Barrientos Bendezu, ´ B.A. Kniehl, Preprint No. DESY 99-121, KEK-TH-638 and hep-phr9909502
ŽSeptember 1999., to appear in Phys. Rev. D.
w13x S. Moretti, K. Odagiri, Phys. Rev. D 59 Ž1999. 055008.
w14x E. Eichten, I. Hinchliffe, K. Lane, C. Quigg, Rev. Mod. Phys. 56 Ž1984. 579.
w15x E. Eichten, I. Hinchliffe, K. Lane, C. Quigg, Rev. Mod. Phys. 58 Ž1986. 1065, Erratum.
w16x N.G. Deshpande, X. Tata, D.A. Dicus, Phys. Rev. D 29 Ž1984. 1527.
w17x S.S.D. Willenbrock, Phys. Rev. D 35 Ž1987. 173.
w18x J. Yi, M. Wen-Gan, H. Liang, H. Meng, Y. Zeng-Hui, J. Phys. G 24 Ž1998. 83.
w19x A. Krause, T. Plehn, M. Spira, P.M. Zerwas, Nucl. Phys. B 519 Ž1998. 85.
w20x D.A. Dicus, S. Willenbrock, Phys. Rev. D 39 Ž1989. 751.
w21x D.A. Dicus, C. Kao, Phys. Rev. D 41 Ž1990. 832.
w22x A. Djouadi, J. Kalinowski, P. Ohmann, P.M. Zerwas, Z. Phys. C 74 Ž1997. 93.
w23x J.F. Gunion, H.E. Haber, Nucl. Phys. B 272 Ž1986. 1.
w24x J.F. Gunion, H.E. Haber, Nucl. Phys. B 402 Ž1993. 567, Erratum.
w25x J.F. Gunion, H.E. Haber, Nucl. Phys. B 278 Ž1986. 449.
w26x J.F. Gunion, H.E. Haber, G. Kane, S. Dawson, The Higgs Hunter’s Guide ŽAddison-Wesley, Redwood
City, 1990..
w27x C. Caso et al. ŽParticle Data Group., Eur. Phys. J. C 3 Ž1998. 1.
w28x D. Abbaneo et al. ŽThe LEP Collaborations ALEPH, DELPHI, L3, OPAL, the LEP Electroweak Working
Group and the SLD Heavy Flavour and Electroweak Groups., Report No. CERN-EPr99-15 ŽFebruary
1999..
w29x H.L. Lai et al., CTEQ Collaboration, Report No. MSU-HEP-903100 and hep-phr9903282 ŽMarch 1999..
w30x A. Djouadi, J.-L. Kneur, G. Moultaka, Report No. PMr98-27 and GDR-S-017 Ž1998..
w31x O. Brein, W. Hollik, Preprint No. KA-TP-11-1999 and hep-phr9908529 ŽAugust 1999., to appear in Eur.
Phys. J. C.
w32x G.J. van Oldenborgh, Comput. Phys. Commun. 66 Ž1991. 1.
Nuclear Physics B 568 Ž2000. 319–363
www.elsevier.nlrlocaternpe
a
¨
Department of Theoretical Physics 2, Lund UniÕersity, SolÕegatan 14A, S22362 Lund, Sweden
b
Department of Physics, P.O. Box 9, FIN-00014 UniÕersity of Helsinki, Helsinki, Finland
Received 8 July 1999; accepted 22 October 1999
Abstract
The vector and axial-vector two-point functions are calculated to next-to-next-to-leading order
in Chiral Perturbation Theory for three light flavours. We also obtain expressions at the same
order for the masses, mp2 , m 2K and mh2 , and the decay constants, Fp , FK and Fh . We present some
numerical results after a simple resonance estimate of some of the new O Ž p 6 . constants. q 2000
Elsevier Science B.V. All rights reserved.
1. Introduction
With the new collider facilities, upcoming experiments will bring higher statistics
data samples into the low energy regime. Due to their accuracy higher order calculations
are needed to update the theoretical prediction for the measurements. In this frame the
calculation of the two-point functions at next-to-next-to-leading order ŽNNLO. at low
energies has become necessary. These provide us with the pseudoscalar masses and the
decay constants, which are needed input for most other quantities. The two-point Green
functions are also basic tools in the study of the strong interaction. They form the basis
for a series of very useful sum rules starting with the Weinberg w1x and DMO w2x sum
rules Žwe refer to Refs. w3,4x and references therein for a more complete discussion about
sum rules..
In this work we are concerned with the low energy regime of QCD. We will study
the two-point functions with Chiral Perturbation Theory ŽCHPT., valid for energies
below the first resonance Ž E ; mr . and describing the strong interactions using the
pseudoscalar octet as the basic fields. This is by now a fairly developed field. We refer
to Ref. w5x for reviews and various abstracts on recent works.
q
Work supported in part by TMR, EC-Contract No. ERBFMRX-CT980169 ŽEURODAFNE..
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 7 4 - 4
320 ´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros
For a future study of various sum rules, we present the vector and axial-vector
two-point functions at NNLO in three flavour CHPT, in the limit of unbroken isospin.
Four of the six basic correlators have been calculated earlier w6,7x and we have fully
confirmed their results in the vector–vector two-point function and partially 2 in the
axial-vector case. The other two are new and complete the three flavour basis. As a
byproduct we also give the masses and decay constants to NNLO.
The interest in the NNLO calculation is beyond the precise measurement of the
couplings and masses of the effective theory, it allows to test the convergence of the
theory and provides a more stringent check on the principle of resonance saturation of
the constants in the low energy chiral Lagrangian. While this principle worked well at
O Ž p 4 . w8,9x only a few tests at O Ž p 6 . have been done. In this paper we estimate some
of the constants appearing to two loops and check their effect on the full O Ž p 6 . results
including the loop contributions. We use the MS-subtraction scheme and the recent
classification for the O Ž p 6 . Lagrangian w10x.
Some applications to chiral sum rules for the isospin and hypercharge cases can be
found in Refs. w11,12x. We intend to return to that subject in a future publication.
The paper is organized as follows. In Section 2 the two-point Green functions are
defined, followed by a short overview of CHPT in Section 3. The vector two-point
function is relatively easy since it only involves products of one-loop integrals. Its
calculation is described and results given in Section 4. The masses and decay constants
can be calculated in two ways. The masses can be obtained from the zero of the inverse
pseudoscalar propagator or from the pole in the axial-vector two-point function. The
decay constants can be directly determined from the residue of the poles in the
axial-vector two-point function or through the definition with the axial-vector–pseudo-
scalar function. We have checked that both methods agree and the first is described in
Section 5. Finally the axial-vector two-point functions are presented in Section 6. The
new O Ž p 6 . constants appearing are estimated in Section 7 on the basis of Resonance
Dominance. In Section 8 some results are presented, postponing a more detailed analysis
w13x. And finally in Section 9 we discuss our main results.
We refer the lengthiest expressions and the more technical discussion of the loop
integrals and renormalization to appendices. In Appendix A.1 and Appendix A.2 we
give the full expressions for the masses and the decay constants. In Appendix B we
display the the axial-vector two-point function components. The loop integrals are
collected in Appendix C.
2
The authors of Refs. w6,7x use a different method to perform the sunset integrals making comparison of
those parts difficult. We agree on all the parts we could check without converting their sunset functions to
ours.
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 321
where the indices i and j run over the three light quark flavours, u, d and s. Working in
the isospin limit all SUŽ3. currents can be constructed using isospin relations from
1
Vmp Ž x . s 11 22
'2 Ž Vm Ž x . y Vm Ž x . . ,
1
Vmh Ž x . s 11 22 33
'6 Ž Vm Ž x . q Vm Ž x . y 2Vm Ž x . . ,
VmK Ž x . s Vm31 Ž x . . Ž 2.
We refer to these as isospin, hypercharge and kaon respectively. E.g. the electromag-
netic current corresponds to
e e
Vmem s Vmp Ž x . q h
'2 '6 Vm Ž x . . Ž 3.
should share the same symmetries as the basic underlying theory w17,18x, in particular
Lorentz invariance, local chiral symmetry, parity and charge conjugation. Thereby in the
following we use that the generating functional of both theories, QCD and CHPT, are
the same at low energies w18x
½H
exp iZ 4 s Dq Dq D Gm exp i d 4 x LQCD s DUexp i d 4 x Leff .
H 5 H ½H 5 Ž 7.
F02
L2 s ² Dm UD m U † q U †x q x †U : , Ž 8.
4
where
2
U Ž f . s u Ž f . s exp Ž i'2 FrF0 . , Ž 9.
parametrizes the pseudo-Goldstone bosons and
x s 2 B0 Ž s q ip . Ž 10 .
is given in terms of the scalar and pseudoscalar sources s and p. ² X : s trflavour Ž X . and
U, s, p are matrices in flavour space. Both B0 and F0 are constants not restricted by
symmetry. They are related with the quark condensate and the meson decay constant
respectively. To respect local invariance the external sources are incorporated through
the covariant derivatives
DmU s Em U y irm U q iUlm , Ž 11 .
and the field strength tensors
FLmn s E m l n y E n l m y i w l m ,l n x , FRmn s E m r n y E n r m y i w r m ,r n x . Ž 12 .
As mentioned above the purpose of this paper is to compute two-point Green
functions, the most straightforward way is to incorporate classical sources in the
effective action. This reduces the calculation of any n-point Green function to the
evaluation of functional derivatives acting on the generating functional
dn
G Ž n. Ž x 1 , . . . , x n . s Zw J x . Ž 13 .
d j Ž x 1 . PPP d j Ž x n . Js0
This allows for instance, to relate the chiral condensate with the constant B0 by taking
the derivative with respect to the scalar sources. This formalism is not only suitable for
an easy calculation but also allows to incorporate the electromagnetic, weak interactions
and the symmetry breaking through the quark masses via the following identifications:
rm s Õm q am , lm s Õm y am , ssMq . . . , Ž 14.
where M stands for the diagonal quark mass matrix, M s diag Ž m u ,m d ,m s . .
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 323
To get the desired chiral order – O Ž p 6 . – in our calculation we will deal with three
kind of contributions: Ži. tree, one- and two-loop diagrams involving vertices of L 2 , Žii.
tree and 1-loop graphs with vertices from L 2 and from L4 given by
L4 s L1² =m U † = m U :2 q L2 ² =m U † =n U :² = mU † =n U : q L3 ² =m U † = m U =n U † =n U :
q L 4 ² =m U † = m U :² x †U q x U † : q L5 ² =m U † = m U Ž x †U q x U † . :
q L6 ² x †U q x U † :2 q L 7 ² x †U y x U † :2 q L8 ² x †Ux †U q x U †x U † :
a mn s Ž FRmn y FLmn . r2 ,
um s i u† Ž Em y irm . u y u Ž Em y ilm . u† 4 ,
Gm s 12 u† Ž Em y irm . u q u Ž Em y ilm . u† 4 ,
xqs u†x u† q u x † u ,
=m X s Em X q Gm X y X Gm . Ž 16 .
Within the framework of previous sections we can start to calculate the vector–vector
two-point functions. The first contribution appears at O Ž p 4 .. It has been calculated for
the isospin case in Ref. w21x in SUŽ2. CHPT. The extension to O Ž p 6 . in SUŽ3. CHPT
has been done in Ref. w6x for the isospin and the hypercharge case. We have reproduced
their results and in addition we present the kaon vector two-point function as well here.
The calculation to O Ž p 6 . presents no new difficulties besides being rather tedious
since only products of one-loop integrals appear.
The O Ž p 4 . contributions come from diagrams Ža. – Žc. in Fig. 1. Diagrams Žd., Že.,
Žm. and Žn. can be calculated directly or using wave-function renormalization and mass
corrections. We have checked that both approaches give the same result. As a conse-
324 ´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros
Fig. 1. Diagrams for the vector two-point function. The crossed circle stands for the O Ž p 4 . vertex insertion.
Wiggly lines are the external vector currents. Dots are O Ž p 2 . vertices and a square is a O Ž p 6 . vertex. The
solid lines are meson propagators.
quence the final result contains no three-propagator integrals. They cancel when the
O Ž p 4 . result is expressed in terms of the physical masses using
E
f Ž m2i 0 ;q 2 . s f Ž m 2i ;q 2 . q Ý Ž m2j0 y m 2j . f Ž m2i 0 ;q 2 . , Ž 17 .
j E m2j 0 m 2i 0 sm 2i
where the m2i 0 are the bare masses and the m 2i the next-to-leading order masses. In
addition we replace F0 by Fp and all masses by their physical ones in the O Ž p 6 .
expression.
There are no one-particle reducible contributions to the vector two-point functions.
We have performed the following checks:
™
1. In the isospin and hypercharge case the longitudinal part vanishes.
2. In the SUŽ3. limit, i.e. mp2 s m 2K s mh2 m2chiral , all two-point functions are equal.
3. The SUŽ3. breaking effect in the form-factors appears only in second order in the
quark masses, i.e. order Ž m s y m ˆ . 2 , as required by the Ademollo–Gatto theorem
w22,23x.
4. All divergences with a non-analytical dependence on masses or q 2 cancel and the
lnŽ4p . and g terms can be absorbed in the counter-terms as well. Both of these
follow from general renormalizability theorems.
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 325
5. The remaining divergences are in agreement with those from the general calculation
using heat-kernel methods w24x and with the double logarithms of Ref. w25x.
2 r r pp 2 KK 2
P VŽ1.
p Ž q . s y4 Ž 2 H1 q L10 . y 8 BV Ž q . y 4 BV Ž q .
1 2 pp
q ½ y16 Ž2 q B V Ž q 2 . q q 2 BVK K Ž q 2 . . Lr9
Fp2
2
q16 q 2 Ž 2 mp q m K . Ž Lr9 q Lr10 . q 4 q 2 Ž 2 BVp p Ž q 2 . q BVK K Ž q 2 . . 5
y 32 mp2 C61
r
y 32 Ž mp2 q 2 m2K . C62
r
y 8 q 2 C93
r
. Ž 19 .
The hypercharge transverse part is given by
2 r r KK 2
1 2 KK
P VŽ1.
h Ž q . s y4 Ž 2 H1 q L10 . y 12 BV Ž q . q ½ y48q B V Ž q 2 . Lr9
Fp2
2
q48 m K Ž Lr9 q Lr10 . q 36 q 2 Ž BVK K Ž q 2 . . 5 y 32 m C 2
h
r
61
y 32 Ž mp2 q 2 m 2K . C62
r
y 8 q 2 C93
r
. Ž 20 .
The longitudinal part vanishes for the above two. These results agree with those
obtained in Ref. w6x when the differences in subtraction schemes are taken into account.
The expressions for the kaon two-point functions are new and are somewhat longer.
The transverse part is given by
2
y6 mp mh q 12 m K mh . q 9 q 2 Ž BVp K Ž q 2 . q BVh K Ž q 2 . . 5
32 2
y
q2
Ž mp2 y m2K . r
C38 y 32 m2K C61
r
y 32 Ž mp2 q 2 m 2K . C62
r
8 2
y
q2
Ž mp2 y m2K . C91r y 8 q 2 C93r . Ž 21 .
This two-point function has also a longitudinal part
3 2 2
P VŽ0.K Ž q 2 . s 4 Ž m2K y mp2 . J p K Ž q 2 . q Ž m2K y mh2 . J h K Ž q 2 .
ž /
2q
1
q Fp ½ž
q 2 2 8 Ž m 2K y mp2 . Ž 3 mp y 2 m K y mh .
2
2
q12 Ž m2K y mp2 . J p K Ž q 2 . q 12 Ž m 2K y mh2 . J h K Ž q 2 . Lr5 /
3
q 4Ž 5mp mp y 4mp m K y 6 mp mh y 4m K m K q 12 m K mh y 3 mh mh .
y Ž y5mp q 2 m K q 3 mh . Ž m2K y mp2 . J p K Ž q 2 .
3
4
5.1. Masses
The definition of mass is the position of the pole in a two-point Green function
GŽ p 2 ,m2 . that contains the relevant particle as a possible intermediate state. The
axial-vector two-point function is a suitable candidate to obtain the masses for the
pseudoscalar mesons. The general structure of this two-point function is shown in terms
of one-particle-irreducible Ž1PI. diagrams in Fig. 2. However the meson-propagator or
the two-point function of meson fields itself is a simpler set with the same pole. We
denote the sum of 1PI graphs by i P Ž p 2 ,m2i0 , F0 .. The set of diagrams contributing to
P Ž p 2 ,m2i0 , F0 . is depicted in Fig. 3c. The propagator is
n
i ` i
GŽ p 2
,m 2i0 , F0 . s p 2 y m2 Ý
0 ns0
ž iP Ž p 2
,m2i0 , F0 . p 2 y m2
0
/
i
s , Ž 24 .
p 2 y m 20 q P Ž p 2 ,m2i0 , F0 .
where m20 stands for the lowest order mass and m2i0 collectively denotes the various
lowest order masses. The physical mass is given by the zero of the denominator once the
external legs are on mass-shell
Fig. 2. The diagrams contributing to the axial-vector two-point function. The filled circles indicate the
one-particle-irreducible diagrams. Solid lines are pseudoscalar meson propagators and the wiggly lines indicate
insertions of an axial-vector current. For the inverse propagator the wiggly lines are meson legs and for the
decay constant the right wiggly line is a meson leg while the left remains an axial current.
328 ´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros
Fig. 3. The set of diagrams contributing to the 1PI quantities. Ža. axial-vector–axial-vector, Žb. axial-vector–
pseudoscalar, Žc. pseudoscalar–pseudoscalar. Žd. – Žl. the respective diagrams when the dashed lines are
replaced with the external legs of Ža., Žb. or Žc.. A line is a meson propagator, a wiggly line an external source,
a dot a vertex of O Ž p 2 ., a crossed circle a vertex of O Ž p 4 . and a crossed box a vertex of O Ž p 6 ..
formulae only appear in the preprint version. in the way described earlier, see Footnote
2, while the kaon result is new.
Notice that the precise expression for the O Ž p 6 . is dependent on the choice of the
O p 4 . expression3 , using the Gell-Mann–Okubo relation at O Ž p 4 ., produces differ-
Ž
ences at O Ž p 6 ..
The masses depend on seven combinations of the O Ž p 6 . constants. All the relevant
checks described in Section 4 were done.
3
This is why our expressions have some differences with those of Ref. w7x even after correcting for the
renormalization scheme.
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 329
The explicit formulae are rather long and can be found in Appendix A.2. The relevant
checks described in Section 4 were performed.
The axial-vector two-point functions to lowest order are quite simple and all three
reduce to
2 F02
PmnA a Ž q . s 2 F02 gmn y qm qn . Ž 29 .
q 2 y m20 a
The NLO corrections only introduce minor changes. The decay constants change to Fa ,
the masses to the physical ones and there is an additional contribution from the O Ž p 4 .
constants to P AŽ1. Ž 2. r r Ž 6 . corrections
a q s 4 L10 y 8 H1 . In fact a very large part of the O p
is of a similar nature. We thus define
2 Fa2
PmnA a Ž q . s 2 Fa2 gmn y qm qn q Ž qm qn y gmn q 2 . Pˆ AŽ1. 2
a Žq .
q 2 y m2a
q qm qn Pˆ AŽ0. 2
a Žq . . Ž 30 .
The function ˆ P AŽ1.
a q
Ž 2. can be fully calculated from diagram Ža. of Fig. 2. These are
depicted in more detail in Fig. 3d–l and discussed in Appendix B.1.
All the diagrams in Fig. 2 contribute to Pˆ AŽ0. Ž 2 . even though most of their contents
a q
actually go into the redefinitions of the respective decay constants and masses. The full
result is given in Appendix B.2. The results fulfill the same checks as in Section 4. We
call Pˆ AŽ0.
a q
Ž 2 . and Pˆ AŽ1.
a q
Ž 2 . the longitudinal and transverse remainder respectively.
In this section we estimate some of the O Ž p 6 . constants that appear in the results.
We assume saturation by the lightest vector, axial-vector and scalar mesons, extending
the formalism used in Refs. w8,9x to the present case.
For the spin-1 mesons we use the realization where the vector contribution to the
chiral Lagrangian starts at O Ž p 6 .. Keeping only the relevant terms for our calculation
we have
fV
L V s y 14 ² Vmn V mn y 2 M V2 Vm V m : y ² Vmn fqmn : q . . . , Ž 31.
2'2
where
mn
Vmn s =m Vn y =n Vm , f" s u Ž Õmn y amn . u† " u† Ž Õmn q amn . u ,
and the same holds for the axial-vector with the label change V A and fq fy. Vm
and Am are three-by-three matrices in flavour space and describe the full vector and
l l
axial-vector nonets, thus we assume a nonet symmetry throughout the rest of this
section. The rest of the notations was already presented in Section 3.
330 ´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros
After integrating out the vectors the terms contributing to the two-point Green
functions at O Ž p 6 . are
f V2 f A2
LV , A s 2
² fqmn =m=l fqln : q ² fymn =m=l fyln : . Ž 32 .
4 MV 4 MA2
LS s 12 ² = mS=m S y MS2 S 2 : q c d ² Su m um : q c m² S xq :
dm
q ² S 2xq : q cg ² Sfq mn fqmn : q cgX ² Sfy mn fymn : . Ž 33 .
2
After integrating out the scalars, the O Ž p 6 . contribution we are interested in comes from
the terms
yc m2 d m c m2 c d cm d m
LS s 4
² xq = 2xq : q ² xq3 : q ² xq2 um u m :
2 MS 2 MS4 MS4
c m cg c m cgX
q ² xq fq mn fqmn : q ² xq fy mn fymn : , Ž 34 .
MS2 MS2
obtained after the shift of the vacuum expectation value and using the equation of
motion for the scalars. Note that only the relevant terms are written and, as in the vector
and axial case, a full nonet of scalars is assumed in S.
As input parameters we use
M V s Mr s 0.77 GeV, f V s 0.20, MA s M a1 s 1.23 GeV,
™ ™
MS and d m are obtained from the masses of the scalars K 0) Ž1430. and a 0 Ž980.. The
value cg is obtained from G Ž f 0 gg . s 0.56 " 0.11 keV and is compatible also with
G Ž a0 gg . s 0.30 " 0.10 keV. The values of c m and c d are obtained forcing the
™
saturation of some of the O Ž p 4 . constants by the scalars w8,9x and are compatible with
those obtained in G Ž a 0 ph .. cgX value cannot be determined from data at present, we
assume a value similar to cg .
Using the notation of Ref. w24x for the O Ž p 6 . terms, the spin-1 Lagrangian yields
f V2 f A2
r
C87 ; 18
ž M V2
y
MA2 / ; 7.6 P 10y3 GeVy2 , Ž 36 .
r
f V2
C93 ;y ; y17 P 10y3 GeVy2 , Ž 37 .
4 M V2
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 331
r
cd cm d m
C14 ; ; y4.3 P 10y3 GeVy2 , Ž 38 .
MS4
r
d m c m2
C19 ; ; y2.8 P 10y3 GeVy2 , Ž 39 .
2 MS4
r
c m2
C38 ; ; 1.2 P 10y3 GeVy2 , Ž 40 .
2 MS4
r
c m cg
C61 ; ; 1.9 P 10y3 GeVy2 , Ž 41 .
MS2
r
c m cgX
C80 ; ; 1.9 P 10y3 GeVy2 . Ž 42 .
MS2
We stress that the goal of this section is to roughly estimate the values of the O Ž p 6 .
constants. A real determination would imply the use of chiral sum rules or other
processes to fix them. It is worth mentioning that the result in Eq. Ž37. is the same if we
use an antisymmetric formalism for the vector Lagrangian, and is in agreement with the
result extracted from the experimental data using sum rules for the vector–vector
two-point functions w11x. The generalization to the three flavours introduces a new
relation of O Ž p 6 . constants, 4C38
r r
q C91 , due to the explicit chiral symmetry breaking in
the kaon Green function.
Finally, we remark that the precise value of the constants can have an important
variation depending of the input values in Eq. Ž35.. Consequently, although the values
cited in this section are used for the numerical results, with the understanding that the
other counter-terms are set to zero, we have to keep in mind that these values could
overestimate the physical ones. The latter is especially true for d m since the K 0) and a 0
mass difference appears unnaturally large.
We defer a more accurate comparison with experimental data to Ref. w13x, but we
would like to present some results using our explicit expressions. We use the values for
the Cir obtained in the previous section and two sets of the O Ž p 4 . constants. They only
differ in the values of Lr1 , Lr2 and Lr3 . Set A is obtained from the fit of the unitarized K l4
calculation while set B refers to K l4 and pp data at one loop accuracy w26x. We give
both sets to show an example of the variation with the O Ž p 4 . constants. We do not
show results for varying the other Lri but this results in a similar variation in size of the
O Ž p 6 . results. The explicit values we use, at m s 0.77 GeV, are
Lr1 s 0.37 P 10y3 , Lr2 s 1.35 P 10y3 , Lr3 s y3.5 P 10y3 , set A ,
Lr1 s 0.60 P 10y3 , Lr2 s 1.50 P 10y3 , Lr3 s y3.3 P 10y3 , set B ,
332 ´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros
In Figs. 4 and 5 we plot the real part of the three vector–vector two-point functions
choosing set A inputs.
In all the three cases the slopes are given mainly by the O Ž p 6 . constants as estimated
above. We have shown these contributions in the curves labelled ŽCT.. Essentially – and
r
with exception of C93 – the main effect of varying the input parameters is to shift the
plots vertically. We see that the loop effects are larger in the isospin case and smaller for
both, the hypercharge and kaon. In the chiral limit all three cases reduce to the same,
Fig. 4. Different contributions to the real part of the vector–vector two-point function in the isospin and the
hypercharge case. The label CT indicates the contribution from only the counter-terms Lri at O Ž p 4 . and Cir at
O Ž p 6 .. p 4 gives the O Ž p 4 . result. VMD indicates the vector model of Eq. Ž45..
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 333
Fig. 5. Different contributions to the real part of the kaon vector–vector two-point function. The superscripts L
ŽT. refers to the longitudinal Žtransverse. part. For comparison we also plot the isospin case. CT as in Fig. 4.
and differences are related to the breaking of the SUŽ3. symmetry. For the isospin, the
two-pion channel produces the notable difference with the counter-term contribution,
while for the hypercharge and kaon the smaller difference is explained by the larger
masses in the loops and some explicit breaking of the symmetry through the quark
masses in the counter-term contributions.
In Fig. 4 we also plotted the case with a complete saturation by the vector meson
ŽVMD.
2 f V2 Ž 1 y q 2rM V2 .
Re Ž P V . ; 2
, Ž 45 .
Ž 1 y q 2rMV2 . q G 2rM V2
with G s 0.150 GeV. The conclusion is that models with only vectors explain the main
part of the two-point function, however an important contribution coming from the
two-pion intermediate states is present. The curve including only the counter-terms –
isospin ŽCT. – can also be obtained with the first two terms of the expansion in the
previous formula with G s 0 and considering the tiny modification due to the scalars.
We continue our discussion with the masses and decay constants. We have summa-
rized our numerical results in Table 1 using the values for the O Ž p 4 . constants quoted
above. As one sees in columns three to six, both masses and decay constants have
substantial O Ž p 6 . loop contributions. In addition the pure polynomial piece at O Ž p 6 .
tends to have the opposite sign and is very large using our model dependent estimates.
This only reinforces the statement in Section 7 of the lack of knowledge in the scalar
334 ´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros
Table 1
O Ž p 4 . and O Ž p 6 . contributions to the masses and decay constants. The numbers in parentheses are the
extended double log approximations of Ref. w25x. Columns three to six are the O Ž p 6 . loop contributions only.
The contribution from the Cir is listed separately in the last column.
OŽ p4 . set A set A set A set B Cir
m ŽGeV. 0.77 0.77 0.5 1.0 0.77 –
Fp r F0 0.068 y0.101 y0.066 y0.100 y0.172 y0.001
Ž Fp r F0 . Ž0.013. Žy0.050.
FK r Fp 0.216 0.055 y0.023 0.100 0.035 y0.450
Ž FK r Fp . Ž0.08. Ž0.06.
Fh r Fp 0.312 0.092 y0.011 0.150 0.065 y0.600
mp2 r mp2 phy y0.039 0.214 0.132 0.238 0.355 y0.003
m2K r m2K phy y0.003 0.241 0.246 0.194 0.423 y0.873
mh2 r mh2 phy y0.045 0.312 0.234 0.273 0.521 y2.428
sector. The terms containing d m are the only ones contributing in this subsection and
seem severely overestimated even though they are of a size expected by naiÕe
dimensional analysis.
In order to have a full presentation of the O Ž p 6 . contributions a refit of all O Ž p 4 .
coefficients using the full O Ž p 6 . expressions would be needed. We postpone this till
after the main other processes are also calculated to this order given the dependence of
the O Ž p 6 . contributions on Lr1 –Lr3 . As an example using set A at m s 0.77 GeV
otherwise but shifting Lr5 to 1.0 P 10y3 reproduces the experimental value of FK rFp
when setting d m s 0.
For the decay constants the O Ž p 6 . contributions to the ratios are smaller than the
O Ž p 4 ., not including the estimates from scalar exchange to the O Ž p 6 . constants.
To judge the effect of the O Ž p 6 . contributions on determining the quark mass ratios
we use the lowest order, O Ž p 4 . and O Ž p 6 . formulae in terms of physical quantities to
obtain the lowest order masses using Eq. Ž26.. This leads to
2 ms 2 m 20 K y m 20p
°25.9 Ž p . ,2
s ~
s 24.9 Ž p 4 . , and
m20p
mu q md
¢24.1 Ž p . ,6
2 ms 3m20h y m20p
°24.2 Ž p . ,2
s ~
s 24.4 Ž p 4 . , Ž 46 .
2 m 20p
mu q md
¢23.3 Ž p . ,6
using the results from set A at m s 0.77 GeV and Cir s 0. These ratios can be compared
with 2 m srŽ m u q m d . s 25.5 w27,28x, 22.8 w29x; extracted from QCD sum rules and
lattice calculations respectively.
The emerging conclusions about the convergence of the chiral series should be very
cautious since a full study includes also the effect of the O Ž p 6 . constants. However,
while the corrections calculated are significant they do not show evidence of a
breakdown of the chiral expansion for the quantities presented here.
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 335
Fig. 6. Different contributions to the real part of the remainder axial–axial two-point function. The superscripts
L ŽT. refers to the remainder longitudinal Žtransverse. part. CT as in Fig. 4.
In Fig. 6 we plotted the dependence on momenta of the real part for the remainders of
the axial-vector two-point functions for the three cases under study. Because both the
longitudinal and transverse remainders have poles at q 2 s 0, we show the combination
Pˆ AŽ1. q Pˆ AŽ0.. A priori we would expect a different behaviour for the isospin case due to
the three pion channel. However there is virtually no effect because the imaginary part is
very small in the energy region we are considering in agreement with the dominance of
the a1 axial meson. For the other two cases even the three pseudoscalar channel is far.
The curves are thus very linear. The vertical shifts are due to the explicit breaking from
the quark masses. The O Ž p 6 . contributions are rather small, the scale in the plot should
be compared with <2 Fp2rq 2 < which is larger than 0.07 for the entire region plotted.
In this paper we have calculated to NNLO in CHPT the vector and axial-vector
two-point functions in the isospin limit and in the complete three flavour basis.
In the vector–vector case, we confirm previous results for the isospin and hyper-
charge w6x.
For the axial-vector case, besides the cancellation of the non-analytic poles, we obtain
the same double and simple poles that appear with the use of the heat kernel expansion
w24x. We also agree with the double logarithms, appearing in previous work w7x, for the
isospin and hypercharge cases. All these checks give us confidence about our result.
336 ´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros
We have also given expressions to NNLO for the masses and decay constants. The
Lagrangian at O Ž p 6 . contains a rather large number of free constants. We have
estimated some of them using a simple resonance estimate and used this to present some
first numerical results for the two-point functions, masses and decay constants. We also
studied somewhat the m-dependence of the final result.
Although the corrections are significant they do not show evidence of a breakdown of
the chiral expansion. For instance, our estimates of the quark mass ratios are in
agreement with previous determinations. However, the sensitivity to the input values,
indicate that the O Ž p 4 . constants need to be refitted using the full O Ž p 6 . expressions
and that better estimates of the O Ž p 6 . constants are necessary.
Acknowledgements
We thank Ll. Ametller for a careful reading of the manuscript. The work of P.T. was
supported by the Swedish Research Council ŽNFR..
A.1. Masses
y 16 mp6 C17
r
q 48 mp6 C19
r
q 80 mp6 C20
r
q 48 mp6 C21
r
q 32 mp6 C31
r
q 32 mp6 C32
r
y 64 mp4 m2K C13
r
y 32 mp4 m2K C15
r
q 64 mp4 m2K C16
r
Ž6 . 1
Fp4 Ž mp2 . Loops s 2 Ž y527r1296mp6 p 2 y 3217r1728mp6
Ž 16p 2 .
y37r324mp4 m2K p 2 y 139r216mp4 m 2K y 11r36mp2 m4K p 2
1
y15r16mp2 m 4K . q Ž Ž y2 mp4 y 2 mp2 m2K . mp y 2 mp2 m2K mK
16p 2
q4mp6 Lr1 q Ž 74r9mp6 y 16r9mp4 m 2K q 104r9mp2 m4K . Lr2
q Ž 56r27mp6 y 16r27mp4 m 2K q 86r27mp2 m4K . Lr3 .
y Ž 589r36mp2 q 2 m2K . mp2 y 4 mp2 mp m K q 5r3mp2 mp mh
y 112 mp4 mp Lr1 y 64 mp4 mp Lr2 y 56 mp4 mp Lr3
q Ž 144mp4 q 80 mp2 m2K . mp Lr4 q 96 mp4 mp Lr5
q Ž y256mp4 y 160mp2 m 2K . mp Lr6 y 176mp4 mp Lr8
q Ž y2r3mp4 my2 2 2 2 2 2 r
K y 7mp . m K y 4r3mp m K mh y 128mp m K m K L1
Ž A.5 .
in agreement with Ref. w19x. The O Ž p 6 . contributions are
Ž6 .
Fp2 Ž m2K . CT s y16mp4 m2K C14
r
y 48 mp4 m 2K C16
r
q 16mp4 m 2K C17
r
q 48 mp4 m 2K C19
r
q 48mp4 m2K C20
r
q 48 mp4 m2K C21
r
y 32 mp2 m4K C13
r
q 32 mp2 m 4K C14
r
y 16 mp2 m4K C15
r
q 64 mp2 m4K C16
r
y 32 mp2 m4K C17
r
y 96 mp2 m 4K C19
r
y 32 mp2 m4K C20
r
q 192 mp2 m4K C21
r
q 32 mp2 m 4K C32
r
y 32 m6K C12
r
y 64 m6K C13
r
y 32 m6K C14
r
y 32 m6K C15
r
y 64 m6K C16
r
q 96 m6K C19
r
q 128m6K C20
r
q 192 m6K C21
r
q 32 m6K C31
r
q 64m6K C32
r
,
Ž A.6 .
Ž6 . 1
Fp4 Ž m2K . Loops s 2 Ž y1r8mp4 m2K p 2 y 13r24mp4 m2K y 73r648mp2 p 2
Ž 16p 2 .
q 3r8 H F Ž mp2 ,mp2 ,m2K ;m2K . m4K q 1r4H F Ž mp2 ,m2K ,mh2 ;m2K . m4K
y 3r32 H F Ž m2K ,mp2 ,mp2 ;m2K . m 4K q 9r16 H F Ž m2K ,mp2 ;mh2 ;m2K .
y 3r2 H1F Ž mp2 ,mp2 ,m2K ;m2K . m4K y 3r2 H1F Ž m2K ,mp2 ,mh2 ;m2K . m 4K
y 163 Ž mp4 y 2 mp2 m 2K y 8 m 4K . Lr6 q 163 Ž 3mp4 y 8 mp2 m2K q 8 m4K . Lr8
q 16 mp6 C16
r
q 16r3mp6 C17
r
q 128r9mp6 C18
r
y 16 mp6 C19
r
q 16 mp6 C20
r
y 16 mp6 C21
r
y 32r3mp6 C31
r
q 32 mp6 C32
r
y 128r9mp4 m2K C12
r
y 512r9m6K C14
r
y 512r9m6K C15
r
y 256r3m6K C16
r
y 512r9m6K C17
r
y 512r9m6K C18
r
q 256m6K C19
r
q 256m6K C20
r
q 256m6K C21
r
q 512r3m6K C31
r
q 512r3m6K C32
r
q 512r3m6K C33
r
, Ž A.9 .
Ž6 . 1
Fp4 Ž mh2 . Loops s 2 Ž y91r11664mp6 p 2 y 1781r15552 mp6
Ž 16p 2 .
y269r486mp4 m2K p 2 y 4133r1296mp4 m2K q 133r108mp2 m 4K p 2
y 64r9H F Ž m2K ,m2K ,mh2 ;mh2 . m 4K q 1r8 H F Ž mh2 ,m2K ,m2K ;mh2 . mp4
y H F Ž mh2 ,m2K ,m2K ;mh2 . mp2 m2K q 2 H F Ž mh2 ,m2K ,m2K ;mh2 . m4K
q 128r243H F Ž mh2 ,mh2 ,mh2 ;mh2 . m 4K q H1F Ž mp2 ,m2K ,m2K ;mh2 . mp4
y4H1F Ž mp2 ,m 2K ,m2K ;mh2 . mp2 m2K q 2 H1F Ž m2K ,m 2K ,mh2 ;mh2 . mp4
1
Fp4 Ž FpŽ6. . loops s 2 Ž 35r288mp4 p 2 q 41r128mp4 q 1r144mp2 m2K p 2
Ž 16p 2 .
y 32 mp2 m 2K C16
r
q 16mp2 m 2K C17
r
q 16 m4K C14
r
q 16 m4K C15
r
q 32 m4K C16
r
, Ž A.16 .
1
Fp4 Ž FKŽ6. . Loops s Ž 1r16mp4 p 2 q 13r48mp4 q 1r32 mp2 m2K p 2
Ž 16p 2 .
y 64r3mp2 m 2K C14
r
q 16r3mp2 m2K C15
r
y 32 mp2 m2K C16
r
y 64r3mp2 m 2K C17
r
y 128r3mp2 m 2K C18
r
q 64r3m 4K C14
r
q 64r3m 4K C15
r
q 32 m 4K C16
r
q 64r3m4K C17
r
q 64r3m 4K C18
r
,
Ž A.19 .
1
Fp4 FhŽ6.
ž / s Ž 5r96mp4 p 2 q 91r384mp4 q 1r48mp2 m2K p 2
Loops
Ž 16p 2 .
X
y 1r4H F Ž m2K ,m 2K ,mh2 ;mh2 . mp4
X
q 2 H F Ž m2K ,m 2K ,mh2 ;mh2 . mp2 m2K
X
y 32r9H F Ž m2K ,m2K ,mh2 ;mh2 . m 4K
X
q 1r16 H F Ž mh2 ,m 2K ,m 2K ;mh2 . mp4
X X
y 1r2 H F Ž mh2 ,m2K ,m2K ;mh2 . mp2 m2K q H F Ž mh2 ,m2K ,m2K ;mh2 . m4K
q 3r16 H F Ž mh2 ,m 2K ,m2K ;mh2 . mp2 y 3r4H F Ž mh2 ,m2K ,m2K ;mh2 . m2K
X
q49r972 H F Ž mh2 ,mh2 ,mh2 ;mh2 . mp4
X
y 56r243H F Ž mh2 ,mh2 ,mh2 ;mh2 . mp2 m 2K
X
q 64r243H F Ž mh2 ,mh2 ,mh2 ;mh2 . m4K
X
q 1r2 H1F Ž mp2 ,m2K ,m2K ;mh2 . mp4
X
y 2 H1F Ž mp2 ,m2K ,m 2K ;mh2 . mp2 m2K
X
q H1F Ž m 2K ,m 2K ,mh2 ;mh2 . mp4
X
y 20r3H1F Ž m 2K ,m2K ,mh2 ;mh2 . mp2 m 2K
X
q 32r3H1F Ž m 2K ,m2K ,mh2 ;mh2 . m 4K
X
F
q 3r16 H21 Ž mp2 ,m2K ,m2K ;mh2 . mp4
X
F
y 3 fsr2 H21 Ž mp2 ,m2K ,m2K ;mh2 . mp2 m2K
X X
F
q 3 H21 Ž mp2 ,m2K ,m2K ;mh2 . m4K q 3r16 H21F Ž mh2 ,m2K ,m2K ;mh2 . mp4
X
F
y 3r2 H21 Ž mh2 ,m2K ,m2K ;mh2 . mp2 m2K
X
F
q 3 H21 Ž mh2 ,m2K ,m2K ;mh2 . m4K . Ž A.20 .
Ž6 . 1
Ž q 2 Fp2 Pˆ AŽK1. Ž q 2 . . loops s 2 Ž 1r24mp2 q 2p 2 y 5r48mp2 q 2 q 3r32 m4K
Ž 16p 2 .
q 32r3mp2 C80
r
q 2 y 32 mp2 C81
r
q 2 q 256r3m 4K C37
r
y 64r3m 4K C91
r
y 128r3m 2K C12
r
q 2 y 64m2K C13
r
q2
y 128r3m2K C80
r
q 2 y 64 m2K C81
r
q 2 y 16C87
r
q 4 y 8C93
r
q4 ,
Ž B.8 .
Ž6 . 1
2
žq F 2 ˆ
Ž1.
p P Ah Ž q 2 . / loops s Ž 1r96mp4 y 1r12 mp2 m2K y 1r4mp2 q 2
2 2
Ž 16p .
The expansion of the resummed self-energy around the relevant pseudoscalar mass
leads in general to rather high derivatives and produces naturally the combinations
1
HiL Ž m12 ,m22 ,m23 ;q 2 ;m24 . s 2 Ž
q 2 HiF Ž m12 ,m 22 ,m 23 ;q 2 .
Ž q 2 y m24 .
yq 2 HiF Ž m12 ,m22 ,m23 ;m24 . y m24 Ž q 2 y m 24 .
X
=HiF Ž m12 ,m22 ,m23 ;m24 . . Ž B.11 .
for HiF s H F, H1F, H21
F4
and
1
H M Ž m12 ,m22 ,m23 ;q 2 ;m 24 . s 2 Ž H F Ž m12 ,m22 ,m23 ;q 2 .
2
Žq y m24 .
yH F Ž m12 ,m22 ,m23 ;m 24 . y Ž q 2 y m24 .
X
=H F Ž m12 ,m22 ,m23 ;m24 . . . Ž B.12 .
2
All of these functions are regular at q s m24 .
The longitudinal isospin remainder is
Ž6 .
Ž q 2Pˆ AŽp0 . Ž q 2 . . CT s 8 mp4 C91r , Ž B.13 .
Ž6 . y3r32 mp4
Ž q 2 Fp2 Pˆ AŽp0. Ž q 2 . . loops s 2 2
q 1r3H L Ž mp2 ,mp2 ,mp2 ;q 2 ;mp2 . mp4
Ž 16p .
y 2 H M Ž mp2 ,mp2 ,mp2 ;q 2 ;mp2 . mp6
q H L Ž mp2 ,m2K ,m 2K ;q 2 ;mp2 . mp4
We need integrals with one, two and three propagators in principle. These we define
by
1 d dq 1
A Ž m2 . s H d
. Ž C.1 .
i Ž 2p . q y m2
2
qO Ž e . ,
1 1 m12 x q m22 Ž 1 y x . y x Ž 1 y x . p 2
J Ž m12 ,m 22 , p 2 . s y
16p 2
H0 dx ln ž m12 x q m22 Ž 1 y x . / , Ž C.4 .
Ž C.5 .
The two-propagator integrals can all be reduced to B and A via
1
B1 Ž m12 ,m 22 , p 2 . s y Ž A Ž m12 . y A Ž m22 . q Ž m22 y m12 y p 2 . B Ž m12 ,m22 , p 2 . . ,
2 p2
1
B22 Ž m12 ,m 22 , p 2 . s Ž A Ž m22 . q 2 m12 B Ž m12 ,m22 , p 2 .
2 Ž d y 1.
q Ž m22 y m12 y p 2 . B1 Ž m12 ,m22 , p 2 . . ,
1
B21 Ž m12 ,m22 , p 2 . s Ž A Ž m22 . q m12 B Ž m12 ,m22 , p 2 . y dB22 Ž m12 ,m22 , p 2 . . . Ž C.6 .
p2
The basic method used here is the one from Passarino and Veltman w30x.
In this appendix we discuss the nontrivial two-loop integrals that show up in this
calculation. They have been treated in several places already, in general and for various
special cases. We use here a method that is a hybrid of various other approaches. We
only cite the literature actually used. We define
1 d dq ddr X
²² X :: s 2 H Ž 2p . d d
,
i Ž 2p . Žq 2
y m12 .Ž r 2
y m 22 . Ž Ž q q r y p . 2 y m23 .
Ž C.7 .
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 355
²² qm rn :: s ²² rm qn :: ,
2 H24 Ž m12 ,m22 ,m23 ; p 2 . s yH22 Ž m12 ,m 22 ,m23 ; p 2 . y H22 Ž m22 ,m12 ,m23 ; p 2 .
We do not explicitly evaluate the integrals analytically. H, H1 and H21 are all finite
after two subtractions. We therefore evaluate them as follows Ž Hi stands for H, H1 and
H21 .
E
Hi Ž m12 ,m22 ,m23 ; p 2 . s Hi Ž m12 ,m 22 ,m 23 ;0 . q p 2 Hi Ž m12 ,m 22 ,m 23 ;0 .
E p2
I Ž n1 ,n 2 ,n 3 .
1 d dq ddr 1
s 2 H Ž 2p . d d n1 n2 n3 , Ž C.15 .
i Ž 2p . Ž q 2 y m12 . Ž r 2 y m22 . Ž Ž q q r . 2 y m23 .
which show up in the momentum expansion of Hi . The I Ž n1 ,n 2 ,n 3 . with one of the
n i s 0 are separable and are e.g.
I Ž 1,1,1 .
y1 G 2 Ž1qe . 2e
1
s
Ž 16p . 2 2 2Ž 1 y e . Ž 1 y 2 e .
Ž 4p . ½ y
e2
Ž m12 q m22 q m23 .
2
q
e
Ž m12 ln 1 q m22 ln 2 q m23 ln 3 . q m12 Ž ln 2 ln 3 y ln 1Ž ln 1 q ln 2 q ln 3 . .
qm22 Ž ln 3 ln 1 y ln 2 Ž ln 1 q ln 2 q ln 3 . . q m23 Ž ln 1 ln 2 y ln 3 Ž ln 1 q ln 2 q ln 3 . .
In ŽC.17. we used ln i s lnŽ m2i . and the function C Ž m12 ,m22 ,m 23 .. The expression for C is
somewhat dependent on the relation between the various masses. Using
2
l m s l Ž m12 ,m22 ,m23 . s Ž m12 y m22 y m23 . y 4 m22 m23 , Ž C.18 .
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 357
™
The integrals can then be done using q˜m q˜n gmn q 2rd and equivalent identities for the
higher orders. We have run this procedure to higher orders then necessary to check the
cancellations of infinities there. This results in
H Ž m12 ,m22 ,m23 ;0 . s I Ž 1,1,1 . ,
E 4yd 4
2
H Ž m12 ,m 22 ,m 23 ;0 . s I Ž 2,1,1 . q m12 I Ž 3,1,1 . ,
Ep d d
d y 2 2
H1 Ž m12 ,m22 ,m23 ;0 . s I Ž 1,1,1 . y m12 I Ž 2,1,1 . . ,
d d
E 1
H Ž m2 ,m 2 ,m 2 ;0 . s Ž Ž 2 y d . Ž d y 4. I Ž 2,1,1 .
E p2 1 1 2 3 d Ž d q 2.
q8 Ž d y 4 . m12 I Ž 3,1,1 . y 24 m14 I Ž 4,1,1 . . ,
1
H21 Ž m12 ,m 22 ,m 23 ;0 . s Ž d Ž d y 2. I Ž 1,1,1 .
d Ž d q 2.
y4 Ž d y 2 . m12 I Ž 2,1,1 . q 8 m14 I Ž 3,1,1 . . ,
E
H21 Ž m12 ,m22 ,m23 ;0 .
E p2
1
s
d Ž d q 2. ½Ž yd 2 q 10 d y 48 . I Ž 2,1,1 .
2 m22 q m23
Ž 16p 2 . H1 Ž m12 ,m22 ,m23 ;0 . s l2 q 18 l1 Ž 2 m12 q m22 Ž 1 y 4ln 2 .
4
m12 dm
qm23 Ž 1 y 4ln 3 . . q 14 y1 q
ž lm /
=C Ž m12 ,m 22 ,m 23 . q m12 Ž 38 y 12 ln 1 .
p2
q m 22
ž 24
q 169 y 14 ln 2 q 14 Ž ln 2 ln 4 y ln 1 ln 3 .
/
p2
q m 23
ž 24
q 169 y 14 ln 3 q 14 Ž ln 3 ln 4 y ln 1 ln 2 .
/
qO Ž e . , Ž C.30 .
2 E m14 m22 m23 dm
Ž 16p 2 . 2
H1 Ž m12 ,m22 ,m23 ;0 . s y 121 l1 q C Ž m12 ,m22 ,m23 .
Ep l3m
6 m12 m22
y2 m12 y
lm
Ž dm q 2 m23 . / q 727 q O Ž e . ,
Ž C.31 .
2 2
m22 q m23
Ž 16p . H21 m12 ,m22 ,m23 ;0
Ž .s l2 q 361 l1 Ž 3m12 q m22 Ž 2 y 12ln 2 .
6
1
qm23 Ž 2 y 12ln 3 . . q Ž yl2m q l m m12 dm
6 l2m
m12 ln 1 dm
q m12
ž 17
72 y 13 ln 1 q
6 lm /
p2 m12
q m22
ž 36
q 19 1
54 y 9 ln 2 y
6 lm
Ž dm q 2 m23 . ln 2
360 ´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros
p2
q 16 Ž ln 2 ln 4 y ln 1 ln 3 . q m23
/ ž 36
q 19 1
54 y 9 ln 3
m12
y
6 lm
Ž dm q 2 m22 . ln 3 q 16 Ž ln 3 ln 4 y ln 1 ln 2 . /
qO Ž e . , Ž C.32 .
2 E m12
Ž 16p 2 . H21 Ž m12 ,m22 ,m23 ;0 . s y 241 l1 q 288
17
y Ž dm q 2 m12 .
E p2 24l m
5m14 m 22 m23 m16 m22 m23 5m22 m23
y
6 l2m
q
l3m ž 1q
lm /
m16 dm m22 m23
=C Ž m12 ,m 22 ,m 23 . q
12 l2m ž
ln 1 1 q 30
lm /
m14 m 22 m43
q ln 3 52 Ž m23 y m12 y m22 .
l3m
m23 m12 m23
q ln 3 Ž m23 y m22 . q ln 3 Ž y3m14
12 l m 12 l2m
2 2 2 2
½ H , H , H 5 Ž m ,m ,m ; p .
1 21 1 2 3
m 22 m23
s
`
HŽ m qm .
2 3
2
ds (ž
l 1,
s s
,
/ =
1
H0 dx K Ž x , s , p
2
2
. 1, x , x 2 4 , Ž C.35 .
´ et al.r Nuclear Physics B 568 (2000) 319–363
G. Amoros 361
with
K2 Ž x , s , p 2 .
1 m12 Ž 1 y x . q s x y x Ž 1 y x . p 2 p2 x Ž1 y x .
s
Ž 16p 2 .
2 ž ln
m12 Ž 1 y x . q s x
q
m12 Ž 1 y x . q s x /
Ž C.36 .
and
2
lŽ x , y, z . s Ž x y y y z . y 4 yz , Ž C.37 .
¨ ´ function.
the Kallen
The dispersive representation
B Ž m22 ,m23 ,t .
Ž dy3 .r2
p Ž1yd .r2 2 3y2 d ` m22 m23 1
s
G Ž Ž d y 1 . r2 .
H Ž m 2qm 3 . 2 žž
d s l 1,
s s
,
// s Ž dy4.r2
syt
,
Ž C.38 .
has been used here instead of the simpler case with equal masses used in Ref. w32x.
Above threshold, s G Ž m1 q m 2 q m 3 . 2 the functions Hi develop imaginary parts and
they can then be evaluated from their dispersive representation
s2 ` dz Im Hi Ž z .
Hi Ž m12 ,m22 ,m23 ; s . s H . Ž C.39 .
p Ž m qm qm .1 2 3
2 z2 zys
The imaginary parts are given by Žin d s 4.
2 2 2
½
Im H , H1 , H21 5 Ž m ,m ,m ; s .
1 2 3
with
1
E1max s
2's
Ž s q m12 y Ž m 2 q m 3 . 2 . ,
m223 s s q m12 y 2's E1 ,
1
E2max y E2min s
m223 's (lŽ s,m ,m . (lŽ m
2
1
2
23
2 2 2
23 ,m 2 ,m 3 . . Ž C.41 .
In this paper we have employed the version of Modified Minimal Subtraction ŽMS.
that is customary in CHPT. The precise procedure has been discussed in great detail in
Ref. w34x.
The procedure used in Ref. w6x corresponds to subtracting only the l0 terms present
in all the integrals, including those in l1 and l 2 .
As mentioned in Ref. w34x a Taylor expansion of the p 4 coefficients introduces in
principle new parameters d i via the Laurent-expansion of the L i s a ire q LM S
i q ed i
q . . . We have checked that the terms involving d i take the form of a local action for
the quantities considered in this manuscript, thus they can be absorbed in the p 6 LECs
as proven in general in Ref. w24x.
We have defined
y2 e
y1 y2 e
y1
Li ' Ž m c . ž 2
32p e
Gi q Lri Ž m . s Ž m .
/ ž 32p 2
Gi l0 q Lri Ž m . q O Ž e . .
/
Ž D.1 .
In the main text we have suppressed the explicit m-dependence of the Lri . The
coefficients Gi are given in Ref. w19x and ln c s y1r2ŽlnŽ4p . y g q 1.. The order e
term in the last part of Eq. ŽD.1. has been used as well to check the explicit
cancellations of lnŽ4p . and g in all expressions.
Similarly the coefficients in the p 6 Lagrangian are used to absorb the remaining
infinities via
y4 e
g2i g1i
Ci ' Ž m c . 2 ž
q q Cir Ž m . /
e e
s my4 e Ž g 2 i l 2 q g 1 i l1 q Cir Ž m . q O Ž e . . . Ž D.2 .
Dropping the terms with l0 , l1 , l 2 , replacing the Ci by Cir
in the main text and
subtracting the terms proportional to C, l0 , l1 and l2 in the expressions for the
integrals given in the preceding appendices, gives the results in the MS scheme.
References
¨
The sphaleron rate: Bodeker’s leading log
Guy D. Moore
Department of Physics, UniÕersity of Washington, Seattle WA 98195-1560, USA
Abstract
¨
Bodeker has recently shown that the high temperature sphaleron rate, which measures baryon
number violation in the hot standard model, receives logarithmic corrections to its leading
parametric behavior; G s k X wlogŽ m Drg 2 T . q O Ž1.xŽ g 2 T 2rm2D . a W
5 4
T . After discussing the physi-
cal origin of these corrections, I compute the leading log coefficient numerically: k X s 10.8 " 0.7.
The log is fairly small relative to the O Ž1. ‘‘correction;’’ so non-logarithmic contributions
dominate at realistic values of the coupling. q 2000 Elsevier Science B.V. All rights reserved.
1. Introduction
It has been known for some time now that baryon number is not a conserved quantity
in the minimal standard model w1x. It is violated non-perturbatively because of the
anomaly, the chiral couplings of fermions to SUŽ2. weak, and the topologically
non-trivial vacuum structure of SUŽ2.. However, as is characteristic of a non-perturba-
tive process in a weakly coupled theory, the rate of violation is so tiny that it is
completely irrelevant phenomenologically. Certainly, if baryon number is also violated
due to high dimension operators descended from some GUT, baryon number violating
decay rates due to the GUT mechanism will greatly exceed the electroweak rate, of
order m1W expŽy4pra W . - 10y1 70 GeV even before accounting for additional suppres-
sion from powers of small CKM matrix elements and a high power of Ž m protonrmW ..
However, as first realized by Kuzmin, Rubakov, and Shaposhnikov in 1985, the
efficiency of standard model baryon number violation is very much higher at finite
temperature w2x. A perturbative estimate w3x based on a saddle-point expansion about
Klinkhamer and Manton’s sphaleron w4x indicated that the rate is more than enough to
erase any relic abundance of baryon number left over from the GUT scale, unless the
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 4 6 - 4
368 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
² Ž NCS Ž t . y NCS Ž 0 . . 2 :
V™ ` t™ `
G ' lim lim
Vt
, Ž 1.
where the expectation values refer to a trace over the equilibrium thermal density matrix.
The quantity G is the topological susceptibility of the electroweak sector at finite
temperature, in Minkowski time.
We know that, when the electroweak phase transition is first-order, the value of G
jumps discontinuously between the phases; it is quite small in the broken phase and
much larger in the symmetric phase. An old power counting argument says that the
symmetric phase rate should be of order G ; a W4 T 4 with an order unity coefficient. This
argument relied, correctly, on the natural non-perturbative length scale in the hot plasma
being 1rŽ a W T .. One then assumes that the natural time scale is the same; on-dimen-
sional grounds the space-time rate of NCS diffusion must then be of order Ž a W T . 3a W T.
However, Arnold, Son, and Yaffe have shown that at leading parametric order, the
natural time scale is not 1rŽ a W T ., but 1ra W2 T w8x, up to possible logarithmic
corrections, which the authors did not consider. More recently, Bodeker ¨ has demon-
strated that logarithmic corrections to their argument do occur w9x.
This says nothing about the numerical value of the sphaleron rate; it could be
parametrically a W5 T 4 but numerically irrelevantly small. Ambjørn and Krasnitz pre-
sented numerical evidence that it was large, by considering classical, thermal Yang–Mills
theory on the lattice w10x. Their definition of NCS was not topological and could
therefore suffer from potentially severe lattice artifacts, but Turok and I studied the same
system with a topological definition of NCS and verified that G is substantial w11x. The
sphaleron rate we found, expressed in physical units, was lattice spacing dependent,
which turns out to be a prediction of the arguments of Arnold, Son, and Yaffe; the extra
power of a W arises from the interaction between the infrared fields and ultraviolet
excitations, as I will discuss more below, and on the lattice it becomes one power of the
lattice spacing a.
In fact the lattice spacing dependence of G for pure classical lattice Yang–Mills
theory only fits G A a if there are substantial corrections to scaling, which has led some
to call into question whether the Arnold, Son, and Yaffe’s analysis is correct. In this
paper I will assume that it is, as seems justified on theoretical grounds and numerical
evidence from classical Yang–Mills theory ‘‘enhanced’’ with added degrees of freedom
which reproduce the hard thermal loop effects w13x. I will return to the large corrections
to scaling in the classical lattice theory in Subsection 5.3.
Because the interactions between infrared and ultraviolet excitations are important to
setting the sphaleron rate, it is somewhat difficult to actually extract G at leading order
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 369
in a W for the continuum quantum theory, at the physical value of a W or even in the
parametric small a W limit. Hu and Muller ¨ proposed a technique based on including the
UV physics, lost to the lattice regulation, by introducing new degrees of freedom which
influence the IR fields in the same way w12x. They implemented and applied the
technique jointly with me w13x. As I will discuss later, this technique still suffers from
some poorly controlled systematics, which are related to the logarithmic corrections
discovered by Bodeker;¨ in fact, beyond the leading log the infrared physics the
technique will simulate is not rotationally invariant.
A good first step to answering the remaining questions about the sphaleron rate is to
determine it at leading logarithmic order in a W . Bodeker
¨ has demonstrated that this can
be done within an effective theory which is completely UV well behaved; in fact it is
nothing but the Langevin equation for 3-D Yang–Mills theory w9x. The leading log
behavior is probably not very useful by itself, for estimating G at the physical value of
a W . Leading log expansions often miss large constant corrections; we know for instance
that the O Ž g 2 T . contribution to the Debye mass has a much larger constant contribution
than the leading log might suggest w14,15x. However, it is still useful to know the leading
log; for instance, its size is related to the severity of the systematic problems with the
method of Hu and Muller, ¨ and it might in principle be useful for extrapolating lattice
results which correspond to an inappropriate value of a W back to the correct value.
The purpose of this paper is to determine the coefficient of the leading log behavior
of G ; namely, to find k X defined through
mD g 2T 2
G s k X log
ž qO Ž 1 . / a W5 T 4 q Ž higher order . . Ž 2.
g 2T m2D
ŽMy logs are always natural logs.. Neglecting the O Ž1. means that logŽ1rg . is treated as
much larger than any order unity constant, a rather extreme interpretation of the
perturbative expansion. Probably this expansion is completely unjustified, but as I said
there are still important things to be learned from making it. Until Section 5 I will not
worry about whether the expansion in logŽ1rg . 4 1 is justified; the goal is simply to
determine k X . I will also work in Yang–Mills theory, which is appropriate at leading
order only for temperatures well above the equilibrium temperature. I will mention how
to include the Higgs field in Section 5.
A summary of the paper is as follows. Section 2 will discuss at an intuitive, physical,
but non-technical level why the rate has the parametric form I show; where the extra a
comes from and particularly why there is a log. The section provides two apparently
different arguments, one in terms of conductivities and scattering processes for hard
particles and one in terms of hard thermal loops ŽHTL’s. and Wilson lines; the two are
of course equivalent. Some salient details about the Wilson line are in Section 3, which
is more technical. This section also shows how the numerical model of Hu, Muller ¨ and
¨
Moore fails beyond leading log order. Section 4 studies Bodeker’s effective theory for
extracting the leading log, numerically. Since the Langevin dynamics are simple but
numerically costly, the emphasis is on controlling systematic errors. Many of the details
appear either in previous papers or Appendix A. For the reader’s ease I present the
answer now: k X s 10.8 " 0.7. The dominant error here is statistical; systematic errors
are completely under control. In Section 5 I discuss the meaning of the result, and try to
estimate what the sphaleron rate is for the realistic values of m 2D and g 2 by using this
370 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
¨
result to extrapolate the results of Hu, Muller, and Moore to the right value of the log.
My estimate is G , 20 a W5 T 4 for m2D s Ž11r6. g 2 T 2 and g 2 s 0.4; however there are
uncontrolled systematics which may be as large as 30%. I also discuss corrections to the
approximation that g < 1; the largest of these will be of order 10%. The conclusion
concludes. There is also a technical appendix which discusses the match between lattice
and continuum Langevin time scales; the match between continuum and Langevin time
scales is computed at O Ž a. and the match between lattice Langevin and heat bath time
scales is determined by a measurement.
5
2. The physics behind a W log(m D r g 2 T)
Here I will give an argument for the a W5 logŽ1rg .T 4 law based on Lenz’s Law and
the conductivity of the plasma. The argument has very recently been made quantitative
w22x, but I will present it at the qualitative, intuitive level.
1
I should mention parenthetically that the classical theory also contains quadratic and cubic divergences in
the energy density, but these do not affect the IR dynamics responsible for baryon number violation.
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 371
To see how hard thermal loops influence the sphaleron rate, I first make the point that
the sphaleron rate is set by the evolution of very soft infrared fields, where by very soft I
mean fields with wave number k ; g 2 T. Parametrically shorter wave lengths do not
contribute appreciably because the probability for non-perturbative physics to occur at
such scales is exponentially suppressed. According to standard sphaleron type argu-
ments, the contribution from the scale k ; g 2y d T is suppressed by of order
expŽykrg 2 T . s expŽygyd .. Even the scale k ; g 2 T logŽ1rg . gives a contribution
suppressed by a power of g.
The second point is that it is physics in the transverse sector which matters, and in
particular, diffusion of NCS requires the evolution of magnetic fields. To see this, first
go back to the definition of NCS ,2
t2 3
g2
NCS Ž t 2 . y NCS Ž t 1 . s Ht dtHd x Eia Bia Ž x . . Ž 3.
1
8p 2
Now the B field is always transverse, meaning that D P B s 0, by the Bianchi identity;
and so only the transverse part of the E field contributes. The Bianchi identity also
states that D = E s yw Dt , B x. Since the relevant part of the electric field is transverse,
it will in general have non-zero covariant curl. For E to be non-zero and to remain the
same sign for long enough to give a non-trivial contribution to HEia Biadt, there must then
be time evolution of infrared magnetic fields.
At this point it is useful to recall how infrared magnetic fields evolve in the abelian
theory, on wave lengths longer than the Debye screening length. The answer is familiar
plasma physics; the plasma is very conducting, and a conducting medium resists changes
in magnetic fields by Lenz’s Law. In the limit of infinite conductivity the magnetic
fields are perfectly frozen; for finite conductivity the time scale for their evolution scales
with conductivity. A magnetic field of wave number k, with well more than its mean
thermal energy density but much less energy density than is contained in the bulk
plasma, decays according to
dA i 1
w D0 , A i x s
dt
sy
s Ž k,v<k .
Ž k 2di j y k i k j . A j , Ž 4.
in the parametric limit that the decay time is well longer than 1rk, which is satisfied at
all length scales parametrically longer than the Debye screening length. The characteris-
tic decay time of a magnetic field in the plasma is then t s s Ž k, v < k .rk 2 . Of course I
have only written the dissipative part of the magnetic field evolution equation; there
must also be a noise term which is uniquely specified by the requirement that the
thermodynamics of the IR magnetic fields are correct.
Note that the conductivity is wave number dependent. It has a good infrared limit
which is achieved for length scales larger than the mean scattering length l free of a
2
Although I set the speed of light cs1, I typically write expressions non-covariantly with a positive space
metric, which is convenient in the finite temperature context because the thermal bath establishes a preferred
frame.
372 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
current carried by hard particles. In the abelian theory a particle’s charge is preserved
when it undergoes a scattering, so this length scale is the mean length for large angle
scattering, parametrically l free ; 1ra 2 T times logarithmic corrections. If every particle
had the same free path and a scattering perfectly randomized its momentum, the
conductivity on scales longer than l free would be
m2D l free
ss , k < 1rl free . Ž 5.
3
When scattering processes are more complicated this formula defines an effective value
of l free . For length scales well between the Debye screening length and l free , where
scatterings of the charge carriers can be neglected, the conductivity is related to the
Debye length through
p m2D
ss , m D 4 k 4 1rl free . Ž 6.
4k
ŽIn both expressions the Debye length m D , which is O Ž gT ., is just keeping track of the
number density, charge, and ²1rE : of the particles.The derivations of each expression
assume ultrarelativistic dispersion relations for the charge carriers.. This expression
follows from the form of the transverse self-energy and the fact that the conductivity we
are discussing in this case is just a special case of the HTL self-energy, s Ž k, v < k . s
Im P T Ž k, v .rv . This is the connection between this ‘‘conductivity’’ picture and the
hard thermal loops.
The behavior of the electroweak gauge fields for scales parametrically between the
non-perturbative scale k s g 2 T and the Debye scale k s gT is the same as in the abelian
theory at leading parametric order.3 Since the mean free path of a hard excitation to
undergo any scattering is ; 1rŽ g 2 T logŽ1rg .. w24x, the conductivity for k s g 2y d T,
with 0 - d - 1, is s s p m2D r4 k ; g d T, and the time constant associated with the decay
of a magnetic field is t ; 1rŽ g 4y 3 d T .. Although the scale k ; g 2 T does not fit within
™
the range of validity of this argument it cannot be that the decay rate for a magnetic field
with k ; g 2 T differs from the d 0 limit by any non-zero power of g. Hence the
relevant time scale for the dynamics of non-perturbative infrared magnetic fields in
weakly coupled, hot Yang–Mills theory is O ŽŽ g 4 T .y1 ., up to corrections at most
logarithmic in g. This is a paraphrase of the argument of Arnold, Son, and Yaffe w8x,
who however neglected the possibility of logarithmic corrections.
Logarithmic corrections do in fact occur. While in the abelian theory the electrical
conductivity only reaches a long wave length limit at a scale set by a hard particle’s free
path for large angle scattering, in the non-abelian theory that limit is set by the free path
3
Some years ago Ambjørn and Olesen argued that non-abelian fields obey an anti-Lenz’s Law w23x,
apparently in contradiction to the argument presented here. Their work refers to the non-abelian interactions
between W and Z fields in the presence of strong Želectromagnetism. magnetic fields at zero temperature. It
may have some bearing on the mutual interactions of the fields at the g 2 T scale, but the response of the harder
modes to the very soft fields is at leading order the Lenz law type behavior seen in the abelian theory.
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 373
for any scattering. The reason is that when a colored particle undergoes a scattering,
however small the transfer momentum, its color is changed. This degrades the color
current even if it does not degrade the momentum carried by the particle substantially.
The total rate for any scattering to occur is twice the damping rate, which has been
computed for hard particles at leading log by Pisarski. The damping rate for an adjoint
charged particle of any spin is w24x
Ng 2 T mD
gs log q O Ž 1. , Ž 7.
4p g 2T
™
and for a fundamental representation particle it is the same with N Ž N 2 y 1.r2 N.
Note the log, which arises from an integral over exchange momenta, running from the
gT to the g 2 T scale; also note that the result is independent of the particle’s momentum,
provided it is harder than the gT scale. A collision largely but incompletely randomizes
a particle’s charge, and so on scales longer than ; 1rŽ g 2 T logŽ1rg .. the electrical
conductivity of the plasma is ; m2D r3g , up to non-logarithmic corrections. To find the
numerical constant one must determine how thoroughly a scattering randomizes a
particle’s charge, which depends on the representation of the particle. Also, the particle’s
charge is not destroyed, just transferred to another particle; one must check whether this
induces any important currents. ŽIt turns out not to, because a particle is as likely to
scatter from a charge carrier moving in one direction as in the exact opposite direction..
The calculation is quite non-trivial but it has been done recently by Arnold, Son, and
Yaffe w22x, who show that, at leading log, the conductivity is simply m 2D r3g , with g
given in Eq. Ž7.. This is independent of the group representation of the particles carrying
the current.
Performing an extreme parametric expansion, logŽ1rg . 4 1, the scale set by particle
mean free paths and the non-perturbative scale are well separated, and the fields with
k ; g 2 T see a k-independent conductivity. ŽSee Fig. 1.. Hence the relevant infrared
dynamics for transverse modes is, at leading logarithmic order,
3 Ng 2 T log Ž 1rg .
w D0 , A i x s Dj Fji q noise , Ž 8.
4p m2D
where the transverse part of the noise is fixed by the requirement that the thermodynam-
ics come out right. The longitudinal part of the noise generates time-dependent gauge
rotations of the A fields, which are irrelevant to Chern–Simons number; so we may
choose the amplitude of the longitudinal part of the noise to be whatever we want. It is
most convenient to choose it to be of the same magnitude as the transverse part, in
¨
which case we reproduce the effective theory of Bodeker, which is also the Langevin
equation for 3-D Yang–Mills theory.
While this derivation has presented the ideas in an intuitive way it is scarcely
rigorous, so I will now approach the problem a little more formally by looking at the
hard thermal loop effective theory.
¨
Now I will go through the argument for Bodeker’s effective theory also from the
point of view of the effective HTL theory for the infrared modes. The idea of the HTL
374 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
Fig. 1. ‘‘It’s quite simple, really . . . .’’ A scorecard of the scales involved in the problem and the
approximations which are valid in each. None of these scales are distinct if we do not take g <1.
effective infrared theory is that one can construct an effective theory for the modes with
k < T, valid at leading order in g, by integrating out all degrees of freedom with k ; T.
The procedure for separating the degrees of freedom and regulating the effective theory,
cut off at some scale between gT and T, is left unspecified and does not affect the result
at leading order.
The resulting effective theory is a classical theory for the remaining modes, but with
a non-local ‘‘HTL’’ effective action correction. The non-local effective action was first
derived in w25–27x. In the current context, since in the long term we have in mind a
position space regulation of the IR theory, it is most convenient to write the effective
action in real space, as was first done by Huet and Son w28x:
a a
y w D 0 , Ei x Ž x ,t . q Dj , Fji Ž x ,t .
m 2D yi y j
s j ia Ž x ,t . q Hd 3
y U a b Ž Ž x ,t . , Ž x q y,t y y . . Ejb Ž x q y,t y y . . Ž 9.
4p y4
Here U a b ŽŽ x,t .,Ž x q y,t y y .. is the adjoint parallel transporter along the straight,
lightlike path between the points Ž x,t . and Ž x q y,t y y .. Note that the electric field on
the right-hand side is at the retarded time t y y. The noise j ia is Gaussian with a
non-vanishing two-point function for lightlike separated points,
m 2D T yi y j
² j ia Ž x ,t . j jb Ž x q y,tX . : s d Ž < t y tX < y < y < . U a b Ž Ž x ,t . , Ž x q y,tX . . .
4p y4
Ž 10 .
For k < gT, the effective action simplifies somewhat. The coefficient of the term on
the right in Eq. Ž9., which contains one time derivative Žsince Ei s w D 0 , A i x., is large
enough that the time evolution is determined by this term and we can neglect the w D 0 , E x
term on the left. Further, the time scale associated with the fields’ evolution is longer
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 375
than the time retardation appearing in the non-local HTL action, and we are justified to
neglect that retardation. The result is
a m 2D yi y j
Dj , Fji Ž x. s j ia Ž x. q Hd 3
y U a b Ž x , x q y . Ejb Ž x q y . , Ž 11 .
4p y4
and the noise correlator is simplified by dropping the retardation there as well Žthough to
get its magnitude right we must remember that there are two contributions from
d Ž< t y tX < y < y <... If in addition k 4 g 2 T logŽ1rg . then it is possible to choose a gauge
such that the parallel transporter is close to the identity and at leading order it can be
ignored. In this case we can recover Eq. Ž6. by expanding w Dj , Fji x to leading order in A
and Fourier transforming.
Huet and Son argue that, to model the O Ž g 2 T . modes alone, we can use Eq. Ž11.,
interpreted as an expression for fields at the g 2 T scale alone with all shorter wave
length scales integrated out. The reasoning is that the higher modes only appear in the
equation in w Dj , Fji x. But w Dj , Fji x is a thermodynamical quantity depending only on the
transverse fields, and as already stated, the UV causes perturbatively small corrections to
this sector, which can be ignored at leading order. The power counting arguments are
laid out explicitly in w29x.
However, this misses one key issue. Can we neglect the influence of the modes with
g 2 T < k < gT on the adjoint parallel transporter U a b Ž x, x q y . when y ; 1rŽ g 2 T .?
The answer is, no. When evaluating an adjoint parallel transporter for a path of length
; 1rŽ g 2 T ., essential contributions arise from all scales intermediate between gT and
g 2 T. This problem has recently been addressed by Arnold and Yaffe, in the context of
studying O Ž g 2 T . corrections to the Debye screening length w30x. They show that, in
SUŽ N . pure gauge theory, any two point correlator of equal-time adjoint fields at
separation y, connected by a straight adjoint Wilson line, falls off at least as rapidly as
Ng 2 T m reg
exp Ž yyrl . , l y1
s
4p ž log
g 2T
qK , / Ž 12 .
average over the more UV scales. In particular the parallel transporter relevant for the
g 2 T modes’ evolution is the average of the parallel transporter over realizations of the
g 2 T < k < gT fields. As I demonstrate in Subsection 3.1, averaging over realizations
of k 4 g 2 T modes leads to exponential damping of the parallel transporter, for y of
order 1rŽ g 2 T logŽ m D rg 2 T ... In the Coulomb gauge,4
ab
UCoulomb Ž x , x q y . , d a bexp Ž yyrl. , Ž 13 .
with l the same as in Eq. Ž12.. If we are permitted to expand in logŽ m D rg 2 T . 4 1,
then the integrand on the right-hand side of Eq. Ž11. has already fallen away before y
comes on order 1rŽ g 2 T .; therefore the approximation which gives Eq. Ž13. is valid, up
to log corrections, throughout the range of y which dominates the contribution to the
integral. The effective theory for the g 2 T modes Žfixing the gauge freedom on scales
more UV than g 2 T to Coulomb gauge. is therefore
yi y j a
j ia q d 3 yeyŽ y r l.
H Eja Ž x q y . s Dj , Fji Ž x. . Ž 14 .
y4
Since the integral is dominated by y ; 1rŽ g 2 T logŽ m D rg 2 T .., and for the IR fields
of interest E varies only on the 1rg 2 T scale, it is permissible at leading log order to
pull the E field out of the integral. The integral is then quite simple; performing it gives
m2D l
Dj , Fji s Eia q j ia ,
3
2Tm2D l
² j ia Ž x ,t . j jb Ž y,tX . : s d Ž x y y . d Ž t y tX . d i j d a b , Ž 15 .
3
where the form for the noise correlator also follows from the approximation for the
parallel transporter. ŽAlternately, one can always recover the form of j by insisting that
¨
the thermodynamics come out correctly.. This is Bodeker’s effective theory, though it
remains to establish that m reg should be m D . I discuss this more in Section 3.
Now it is time to look more carefully at the Wilson line appearing in Section 2, first
to verify the claims there, second to show the connection to the argument involving
scatterings of the hard particles, and third because it is relevant to the analysis of the
¨
results of Hu, Muller, and Moore.
4
The use of Coulomb gauge becomes problematic when considering length scales l 01r g 2 T, and for
considering any unequal-time correlator if the total volume of space considered is V 4 Ž g 2 T .y3 ; however
this is not relevant because technically we are only applying Coulomb gauge to modes with k 4 g 2 T in order
to integrate them out and establish an effective theory. The gauge fixing of the IR effective theory, i.e. of the
problematic k ; g 2 T modes, has not been specified. Also, at leading log the same results would be obtained
in Landau gauge.
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 377
What we want to know about is the Wilson line between lightlike separated points a
spatial distance l apart, with l 4 1rm D but less than 1rŽ g 2 T . by at least a logarithmic
factor. Actually, we want to know the average of the Wilson line over realizations of the
modes with k R g 2 T logŽ1rg ., since these fields fluctuate faster than the O Ž g 2 T . fields,
which therefore see the average over realizations, up to corrections subdominant in
1rlogŽ1rg ..
The Wilson line is given by
l
U s Pexp H0 ig Ž A q A . Ž z ,t s z . dz ,
0 z Ž 16 .
with A in the adjoint representation. I will only consider the transverse contributions
here, the longitudinal ones are subdominant. I use Coulomb gauge, in which the A 0
contribution arises entirely from the longitudinal modes and the A z only arises from the
transverse modes. Further, I will work at leading parametric order, by which I mean that
higher point correlators and vertex insertions are ignored, and combinations of A fields
are evaluated assuming A is Gaussian by applying Wick’s theorem. Of course I will
include the hard thermal loop corrections to the A field propagators. These approxima-
tions are justified at leading log down to k ; g 2 T logŽ1rg ., which is all we need. ŽIf I
were interested in lengths l ; 1rg 2 T rather than logarithmically shorter, most of the
approximations I make would break down completely..
All terms with odd powers of A vanish on averaging over realizations, while even
terms look like Žapplying Wick’s theorem between lines 1 and 2.
n
` Ž y. g 2 n
²U : s Ý Hdz1 . . . dz2 n² A z Ž z1 . . . . A z Ž z2 n . :
ns0 Ž 2 n. !
n
` 1 yg 2 l
s Ý
ns0 n! ž 2
H0 dz dz ² A Ž z . A Ž z
1 2 z 1 z 2 .: /
yg 2 l
s exp
ž 2
H0 dz dz ² A Ž z . A Ž z
1 2 z 1 z 2 /
.: , Ž 17 .
so the average over realizations is the identity times the exponential of the two-point
contribution. ŽNote that for any given realization the Wilson line has unit modulus. But
the average over realizations does not, its modulus falls exponentially with distance..
To evaluate this we need the two-point correlator,
d3k dv T ki k j X
² A i Ž x ,t . A j Ž y,tX . : s e i v Ž t yt . eyi kPŽ yyx . .
H Ž 2p . 3
2p v ž
r Ž k , v . di j y
k2 /
Ž 18 .
Here r Ž k, v . is the spectral density, which is the magnitude of the discontinuity in the
propagator 1rŽ v 2 y k 2 y P T Ž k, v .. across the real v axis on analytic continuation
from Euclidean Žimaginary. v ,
2 y1
r Ž k , v . s 2Im Ž Ž v q i e . y k 2 y P T Ž k , v q i e . . , Ž 19 .
378 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
and Trv is the classical approximation for 1 q nŽ v ., with nŽ v . the Bose distribution
function.
So far I have suppressed group indices, but when they and the integrals over z are
evaluated we get
Tr ²U : Ng 2 d3k d v Tr Ž k , v . k 2x q k 2y 4sin2 Ž k z y v . lr2
log sy H Ž 2p . 3
.
Tr 1 2 2p v k2 Ž kzyv .
2
Ž 20 .
The equal-time correlator of the A field goes as Trk 2 plus subleading corrections,
for all k 4 g 2 T ; so
d v Tr Ž k , v . T
H 2p s . Ž 21 .
v k2
What matters now is where r is concentrated. In the regime k < m D , almost all the
contribution to Eq. Ž21. is from < v < < k, see Ref. w24x. This is just the statement that
these modes evolve on time scales slower than 1rk, as I have already discussed. Hence,
in evaluating the low k contribution to Eq. Ž20. I can set k z y v , k z , and then perform
the integral over v , giving
Ng 2 T < < d3k 1 k 2x q k 2y 4sin2 k z lr2
y
2
H k (m D
3
Ž 2p . k
2
k2 k z2
. Ž 22 .
Ng 2 lT mD
sy
4p ž log
l
q O Ž 1. . / Ž 23 .
Now the term with sin2 in it is forcing < k < s k z ; only modes propagating along the
direction of the Wilson line keep in phase, others destructively interfere. But the
polarization vector of such a mode is close to orthogonal to the Wilson line; because
2
k 4 1rl, for any k for which < k < y k z - 1rl, the factor k H rk 2 will be near zero. Thus,
the only modes which avoid destructive phase interference are polarized in the wrong
direction to contribute significantly. Continuing to carry out the integral by defining
x s k zrk, we get
Ng 2 T 1 2
4sin2 Ž lk Ž 1 y x . r2 .
y
2p 2
Hy1dxHm dk Ž 1 y x .
Ž1yx .
2
D
Ng 2 T
;y Ž log Ž lm D . q O Ž 1. . , Ž 26 .
2p 2 m D
which is O Ž g .. The hard modes do not contribute at leading order to the Wilson line
along a lightlike path.
I have not treated the modes with k ; m D , which are more complicated because this
is where r does not fit into either limiting category. But they turn out to give a result
smaller by a logarithm than that from the modes with k < m D . The final result is that,
on averaging over realizations of the modes with k greater than g 2 T by at least a
logarithm, the Wilson line in Coulomb gauge is
Ng 2 T mD
²U : s 1 exp Ž ylrl . , ly1 s
4p ž log
g 2T /
q O Ž 1. . Ž 27 .
Now look at Eq. Ž20. again. Remember that the Wilson line is representing the
trajectory of a hard particle, with p much greater than any momentum scale which gives
a leading order contribution to the integral. We want to interpret Eq. Ž20. as l times the
rate for the particle to undergo a scattering involving the transfer of a soft field, times a
Žrepresentation-dependent. group theory factor which tells how thoroughly the scattering
randomizes the particle charge. To see the relation, take the large l limit. Then
4sin2 ŽŽ k z y v . lr2.rŽ k z y v . 2 s 2p l d Ž k z y v ., and the decay rate per unit length of
the Wilson line is
Ng 2 d3k d v Tr Ž k , v . k 2x q k 2y
H Ž 2p . 3
2pd Ž k z y v . . Ž 28 .
2 2p v k2
Recall that the hard particle starts out with pz 4 < k <, p x s p y s 0. If it emitted a particle
of wave number k, its momentum would change to p y k and its energy would change
by yk P pˆ s yk z plus a correction of order k 2r< p <, which is negligible by assumption.
The delta function is just the energy conserving delta function appearing in the
expression for the rate of the process shown in Fig. 2a, the emission of a soft gluon by
an adjoint charged, hard mode.
The rate for this process has been considered by Pisarski w24x; his expression Ž4.4.
does not look quite the same as Eq. Ž28. for the following reasons. First, his definition of
380 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
Fig. 2. Ža. Emission of a gluon below the light cone; Žb. Self-energy diagram which, cut, describes the
emission; Žc. The cut must split an HTL insertion, so the physical process is scattering with a small exchange
momentum.
the spectral density differs from the one used here by a factor of 2p . Second, he has
allowed the hard particle to be a little off shell, which is important in the part of the
integral with k ; g 2 T logŽ1rg . but not for higher k. ŽNote however that my large l
approximation breaks down in exactly this regime.. For momenta k 4 g 2 T logŽ1rg . we
may integrate over v t in his expression and obtain Eq. Ž28.. The difference caused by
k ; g 2 T logŽ1rg . does not change the leading log, but would be important in investigat-
ing corrections to the leading log. Also note that Pisarski considers the contribution of
longitudinal momenta and finds that they do not give a log, only a constant times Ng 2 T.
The process considered involves the emission of a soft mode with < k < ) < v <. The
reason that such emission is possible at all is because of the hard thermal loop correction
to the gauge propagator. The rate is the imaginary part obtained by cutting the one-loop
self-energy diagram, Fig. 2b. The cut gives a non-zero result when it goes through a
HTL self-energy insertion, which can be interpreted as diagram Žc. in the figure. Hence
the Wilson loop calculation is proportional to the rate for scatterings of one hard particle
off all other hard particles by exchange of soft intermediaries.
Pisarski calculates the total rate for a particle to undergo a collision, and because of
the conventional definition of the damping rate, his damping rate is actually half of the
scattering rate. Also, his result does depend on the representation of the particle
undergoing the scattering. The rate the Wilson line calculation determines is actually the
rate of color randomization, not of collisions; there is a representation-dependent
correction between the two, which depends purely on color factors at the vertex where
the particle of interest interacts with the soft background field. The total collision rate of
a particle is proportional to the group factor
Tr T a T a
, Ž 29 .
Tr 1
where T a is in whatever representation the particle is in. The original disturbance of the
particle distribution from equilibrium is caused by an electric field, which is an adjoint
object; the disturbance to the single particle density matrix is proportional to E b T b. The
color randomization per collision is
Tr 1 T aT b T a
1y , Ž 30 .
Tr T a T a Tb
where the second term tells how much the color after the collision is aligned with the
color before. Multiplying by the total collision rate gives a color randomization rate
A Nr2, independent of representation.
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 381
¨
3.3. Subleading corrections in the method of Hu and Muller
Now I will discuss the relation between the quantum theory and the technique
proposed by Hu and Muller, ¨ refined and implemented jointly with myself, concentrating
on whether the behavior is the same at next to leading logarithmic order.
First I should explain why the technique is necessary. Traditionally people have tried
to determine the sphaleron rate by studying classical Yang–Mills Žor Yang–Mills Higgs.
theory regulated on a spatial lattice. However, if we just study the classical theory on the
lattice, using the lattice spacing as the UV cutoff, the hard lattice modes generate HTL
effects which do not look like Eq. Ž9. and in fact are not rotationally invariant w31x. Even
in the leading log approximation this is a problem, because it means that, where Eq. Ž15.
has Ei , we will get E rescaled by a rotationally non-invariant factor, determined in w32x.
A proposal by Arnold to fix this problem, staying within lattice classical theory, by
making the dispersion relations turn up very steeply w32x, does not work because the
hard modes are then Landau damped 5. The only alternatives I am aware of involve
adding new degrees of freedom which influence the IR classical fields in a way
equivalent to correct hard thermal loops. Two such proposals exist in the literature. One
¨
is due to Bodeker, McLerran, and Smilga w31x, more recently discussed by Iancu w33x. I
will not discuss it since no one has yet specified a complete discrete numerical
implementation. The other idea was proposed by Hu and Muller ¨ w12x; the details of the
implementation were worked out and applied jointly with me w13x.
We also simulated the classical IR physics by studying non-perturbatively the
classical system regulated on a spatial lattice, thereby treating the left-hand side of Eq.
Ž9. fully non-perturbatively. To include the HTL effective action, the right-hand side of
the expression, we added to the classical lattice system a large number of adjoint
charged classical particles. They take coordinate positions in the continuous space in
which the lattice fields sit, obey ultrarelativistic dispersion relations, and interact with
the lattice fields when they cross the dual planes to lattice links. There are two parts to
the interaction with the lattice fields.
First, the particles ‘‘kick’’ lattice electric fields, and their momenta receive a similar
‘‘kick’’. The kicks and the approximately random distribution of the charges performs
the noise, and a correlation between past fields and the ‘‘kick’’ the gauge field receives,
arising from a change in the particle trajectory from the ‘‘kick’’ it received, accounts for
the non-local term. The size of each kick is proportional to a charge Q which is made
small so the particles individually interact weakly.
Second, the particles’ adjoint charges are parallel transported by the gauge field
connection. Since the gauge fields are only defined on the lattice links, the Wilson line
used for the parallel transportation of a particle is the sequence of lattice links which
maintains the shortest distance to the actual path of the particle, as illustrated in Fig. 3.
The exact rule used to choose the sequence of links is that a link is used if the Wilson
™ ™
5
The problem with Arnold’s proposal is that it gives a Lorentz non-invariant hard mode dispersion relation
under which 1 2 and 2 1 processes are kinematically allowed. They are efficient, so the hard excitations
have a mean free path for hard scatterings ;1r g 2 T. I have Arnold’s agreement on this point.
382 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
Fig. 3. A simple 2-D example of how a Wilson line Ždiagonal. on a lattice Ždashed lines. is replaced by the
sequence of links which stay closest to it Žsolid, jagged line..
line penetrates the face of the dual lattice corresponding to that link. Parallel transporting
the particle’s charge ensures that we are including the parallel transporter in the
right-hand side of Eq. Ž9.; it is also absolutely necessary to make the update rule
manifestly gauge covariant. The full update is described in excruciating detail in w13x.
The Debye mass depends on the density ² n: and charge Q of the added particles as
m D A Q 2 ² n:, so by changing the number of particles one can tune the HTL strength to
2
be whatever is needed while keeping Q small. If there were only IR fields, meaning
™ ™
fields with k < 1ra Ž a the lattice spacing. then the behavior would correctly reproduce
Eq. Ž9., at least in the limit Q 0, ² n: ` with m2D fixed. This is also discussed in
w13x.
™ ™
The lattice theory involves two scales, a and m D ; to get the leading parametric
behavior we must seek the limits a 0, m D ` Žif we think of the length scale
1rŽ g 2 T . as remaining fixed.. There are two ways we could go about doing this. One
corresponds to using a as a cutoff between the gT and the T scales, meaning that we
maintain a < 1rm D but a 4 g 2 Trm2D . The latter condition is essential to make sure
that ‘‘bad’’ hard thermal loops arising from modes with k ; 1ra, which have the wrong
dependence on k, v and in particular are not rotationally invariant w31x, are subdominant
to ‘‘good’’ hard thermal loops due to the particles. The other way of taking the limits is
to make m D 4 1ra, so the wave number Ž1ra. falls between the gT and g 2 T scales.
In each approach, the model correctly generates the effective HTL dynamics up to
power corrections in Ž g 2 aT ., Ž Qrg 2 aT ., g 2 Trm D , and Ž g 2 T .rŽ am2D ., except perhaps
for the behavior of the parallel transporter. The contribution to the parallel transporter
from IR fields is correct because for a gauge field of wave number k < 1ra the
replacement of the straight line path with the ‘‘jagged’’ path actually used Žsee again
Fig. 3. gives the right behavior up to corrections suppressed by O Ž k 2 a 2 .. The question
is, how does the UV contribute to the parallel transporter?
In the case where we make m D 4 1ra, all of the lattice degrees of freedom have
overdamped evolution. None of them propagate and there are no problems from hard
thermal loops arising from classical lattice degrees of freedom with k ; 1ra. Except for
the parallel transporter, the systematics are then best under control. However, the log
appearing in the parallel transporter is obviously cut off by the inverse lattice spacing,
logŽ m D rg 2 T . becomes C q logŽ1rg 2 aT . with C a coefficient to be determined by
matching. Since we know that the lattice regulator is not rotationally invariant we expect
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 383
Tr ²U : Ng 2 pra d3k ` d v Tr Ž k , v .
log sy Hypra Ž 2p . Hy` 2p3
FŽ k,v . , Ž 31 .
Tr 1 2 v
FŽ k,v .
w Lp xqx x
1 nxyx
sÝ
e
H0 ½
dx dy dz a e x Ý
n xs1
expi v
ž px
y k x Ž n x y 0.5 .
2
nxyx nxyx
yk y y q
px
py y k z z q
px
pz
/ žÝ Ý /5
q
ny
,
nz
similar , Ž 32 .
and the values in other octants follow from cubic symmetry. Here the sum over e is
over a basis of the two unit vectors satisfying Ý i e i sinŽ k i ar2. s 0. The integral over
Ž x, y, z . appearing in the definition of F averages over starting positions for the Wilson
line within a lattice cell, and square brackets always mean the argument is rounded
down to an integer. The rounding down to an integer makes it very difficult to evaluate
the expression analytically. It is also tricky to find r Ž k, v . on the lattice, away from the
m D 4 1ra or m D < 1ra limits.
In the infrared, meaning k < 1ra, the sum in F is well approximated by an integral,
recovering Eq. Ž20.. In the UV, while Eq. Ž31. is manifestly cubic symmetric, it is not
spherically symmetric: the k integral is over a cubic range, and neither F Ž k, v . nor
r Ž k, v . are rotationally invariant. The integral has been computed by Arnold and Yaffe
in the large L limit for the special case that pi lies in a lattice direction and the spectral
density is concentrated at v s 0 w30x, although they did not present their calculation in
this context. I will now compute it in the opposite limit, m D < 1ra, but still along a
lattice direction. If L is large and pi lies in a lattice direction Žsay, the z direction. then
F Ž k, v . , 2p l d Ž k z y v mod Ž2pra... For m D < 1ra the dispersion relation is the free
384 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
lattice dispersion relation v 2 a 2 s Ý i Ž2 y 2cosŽ k i a.. and the spectral density r all lies
on shell, and the integral reduces to
Ng 2 lT pra d3k k˜ 2x q k˜ 2y 2p
Hypra Ž 2p . 3 2 ž (
2pd k z y k˜ 2 mod / , Ž 33 .
2 Ž k̃ 2 . a
4. Numerics
Now that I have discussed the establishment of the effective theory, Eq. Ž15., I will
discuss how to make a lattice model of that effective theory and how I compute the
leading log coefficient of the sphaleron rate by using that model. First, define a
Langevin time t related to the time appearing in Eq. Ž15. via
3 3 Ng 2 T log Ž m D rg 2 T .
dt ' dt s dt . Ž 34 .
m2D l 4p m2D
Note that t has dimensions of length squared, not length. The effective infrared theory
then has the familiar form
a a
w Dt , A i x Ž x ,t . s Dj , Fji Ž x ,t . q j ia Ž x ,t . ,
² j ia Ž x ,t . j jb Ž y,t X . : s 2Td Ž x y y . d Ž t y t X . , Ž 35 .
which is a Langevin equation. Studying it, I will find the diffusion constant for
Chern–Simons number
² Ž NCS Ž t . y NCS Ž 0 . . 2 :
G Langevin s lim lim
V ™ ` t™ ` Vt
, Ž 36 .
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 385
numerically. To do that I find the diffusion constant per lattice site per a 2 of Langevin
time, and multiply by ay5 . On-dimensional grounds G Langevin must be of order a W5 T 5.
Using Eq. Ž34., the relation between G Langevin and k X defined in Eq. Ž2. is
3 N G Langevin
kX s . Ž 37 .
4p a W5 T 5
Before getting further it is worth commenting that the IR behavior of Eq. Ž35. is
insensitive to the UV regulation and the limit in which that regulation is removed, exists.
¨
Though this statement appears banal, it makes Bodeker’s effective theory completely
different from the classical Hamiltonian dynamics. The essential difference is that there
are no long time scale correlations for the UV fields in the Langevin evolution; a mode
with wave number k gets randomized in Langevin time t ; 1rk 2 , which is much faster
than the natural time scale for the evolution of IR fields. Hence the IR fields see the
average over all excitations of the UV fields. The influence of the UV must be purely
thermodynamic, and we know from the super-renormalizability of 3-D Yang–Mills
theory that the thermodynamic influence of the UV on the transverse sector is well
behaved. For a more rigorous presentation of the argument see Ref. w22x, who show that
any purely dissipative update algorithm will give a good continuum limit. For the
Hamiltonian system, on the other hand, UV modes are propagating; the unequal-time
correlator behaves like cosŽ kt . rather than like expŽyk 2t .. The existence of long time
scale correlations of the UV fields is what makes the HTL effects important to the IR
dynamics. Because of this difference, we can expect a good small lattice spacing a limit
to exist for the Langevin time dynamics, and it is worth it to try to control systematic
errors.
I discuss the continuum t , spatial lattice implementation of Eq. Ž35. in Appendix A;
here I will just mention how I discretize the time update. I define a time step Dt s a2D,
D a pure number much less than 1. The fields will be well defined at times n Dt , n an
integer. The noise is constant in each interval w n Dt ,Ž n q 1. Dt .; its value at each point,
direction, and Lie algebra direction is drawn from the Gaussian distribution with mean
(
value 2Tra3Dt ; its value at each point, direction, and Lie algebra direction, and in each
time interval, is independent. To determine the fields at time Ž n q 1. Dt from the fields
at time n Dt I use the following second order algorithm: I compute dArdt Ž n Dt q 0.
and use it to predict AŽŽ n q 1. Dt .; then I average the values of dArdt at the starting
point and at the predicted end point, and use this average to update AŽ n Dt . to time
Ž n q 1. Dt . The step size errors are O Ž D 2 .. In particular, if I were studying the free
theory a mode with wave number k would be updated with step size errors , Ž k 2Dtr2. 2 .
The direct errors in the update of the IR fields are tiny. However, the most UV modes
are only updated correctly if D < 1. The UV fields influence the IR modes radiatively,
so we do need D to be small; but the radiative corrections are suppressed by O Ž g 2 aT .,
and it is not too difficult to get D small enough to make step size errors subdominant to
statistics.
Unfortunately the above update is quite inefficient. However, there is a much more
efficient algorithm for dissipatively updating the fields, the heat bath algorithm. Rather
than applying a very small step of Langevin update to each lattice link in parallel, the
idea is to go through the links of the lattice at random, performing a complete heat bath
update of each link. The relation between Langevin time and the number of links
386 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
Table 1
X
Results for k at two lattice spacings and two lattice volumes. The results show excellent spacing and volume
independence.
Lattice spacing a Volume Langevin time k X " Statistical error
2r3 g 2 T Ž8r g 2 T . 3 290000 a2 10.44"0.23
2r3 g 2 T Ž16r g 2 T . 3 49500 a2 10.30"0.21
2r5g 2 T Ž16r g 2 T . 3 21000 a2 10.70"0.67
2r7g 2 T Ž16r g 2 T . 3 42000 a2 10.26"0.79
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 387
As I noted already in Subsection 2.3, the log arises from the behavior of a Wilson
line, and the same Wilson line appears in the definition of the Debye mass beyond
leading order. In this context the value for the O Ž1. correction to the leading log
behavior has been found by Laine and Philipsen. In that case, wlogŽ m D rg 2 T . q O Ž1.x
has O Ž1. s 6.7. If the same number held for the sphaleron rate, then using the standard
model value of m D to evaluate the log, the leading behavior would be G s
89Ž g 2 T 2rm2D . a W5 T 4 s 48 a W5 T 4 . This is a crude way of estimating the non-leading
corrections, though, and I do not take it too seriously. In particular there is evidence that
the length scale relevant for baryon number violating processes is longer than the
, 1rg 2 T Laine and Philipsen find for the O Ž g 2 T . correction to Debye mass; the
baryon number violation rate on a cubic toroidal lattice 3rg 2 T across is over 1000 times
slower than for a large volume w36x, so physics must be going on involving lengths at
least half as long as 3rg 2 T.
¨
5.2. Estimate using Hu, Muller, and Moore’s results
Another way of trying to determine the subleading corrections is to use the value of
¨
the leading log coefficient to correct my results with Hu and Muller. There, we used a
technique which included hard thermal loops, but in a way which does not correctly
reproduce the subtleties of the Wilson line responsible for the logarithmic dependence of
G on m D . As discussed in Subsection 3.3, the log arises because excitations more UV
than the g 2 T scale make the Wilson line effectively randomize the charge of a
388 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
propagating particle, and there is a log in the reciprocal length for randomization. The
best approach would be to compute the angle averaged value of the reciprocal length, in
the quantum theory and in the lattice theory actually studied. Unfortunately, so far I can
only compute the reciprocal length in the lattice theory in one direction, and only in the
cases m D < 1ra or m D 4 1ra. The data in w13x are taken at m D ; 1ra, not 4 1ra,
so it is at least reasonable do the match using the small m D a approximation. I will make
do with the log evaluated in the one direction where I can do the integral, which gives a
difference in logs between the two theories of logŽ m D Žlattice.rm D Žcontinuum. q 0.59.
I will use the datapoint from that paper taken with the largest value of m D , because
the ‘‘wrong’’ lattice mode induced HTL’s really are strongly subdominant to the
‘‘right’’ particle induced HTL’s for this case. Redoing the match between the lattice and
continuum time scales, which was performed wrongly there because we did not have the
O Ž a. calculation performed in Appendix A of this paper, revises the result from
G s 53 " 5Ž g 2 T 2rm2D . a W5 T 4 down to 50 " 5Ž g 2 T 2rm2D . a W5 T 4 . This result was ob-
tained at m D , 4 g 2 T, so the difference of logs between the lattice theory where this
number was computed and the quantum theory at the physical value of m 2D is about 0.63
due to the values of m D plus 0.59 due to UV contributions present on the lattice but not
in the continuum theory. Using the determined coefficient of the leading log term, I
should correct the diffusion constant we found downwards by 1.22 = 10.8, giving
37 " 5, with only the statistical error bar shown. ŽSubstituting in m2D s 11 g 2 T 2r6 gives
G s 20 " 3 a W5 T 4 ..
Using k X to correct the old data in this way assumes that
mD d m 2D
G s k Xa W5 T 4 .
dm D ž 2
g T 2 / Ž 39 .
We really only know that this true in the large Ž m D rg 2 T . limit, where the leading log
expansion is valid. It may have quite non-negligible corrections at realistic values of
m D , which would appear in a systematic expansion in logŽ m D rg 2 T . as inverse powers
of the log; the O Ž1. correction to the leading log behavior would really be O Ž1. s C1 q
C2 ŽlogŽ m D rg 2 T ..y1 q . . . Intuitively I expect that the real m D dependence will be
weaker than the leading log suggests Žmeaning C2 ) 0., though I cannot give a cogent
argument to show this is so. In this case I have performed an overcorrection, and the real
rate would be higher. For now I will accept the corrected answer as the best current
guess, but I take a systematic error bar of order 30% to cover both the rotational
non-invariance not handled correctly in the correction, and the fact that the correction
may be an overestimate at realistic m D .
There is a further cause of systematic error in the determination of the sphaleron rate,
arising from corrections to the g < 1 approximation. I will mention the two such
corrections which I think are the largest; fortunately they have opposite sign and the
optimistic can hope that they largely cancel.
One problem is that the parametric argument for the importance of hard thermal loops
assumes m D 4 g 2 T, and it is not clear realistically whether this is obtained w14x. There
is evidence that it is not. In particular, while the sphaleron rate in classical Yang–Mills
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 389
theory depends on the lattice spacing in a way consistent with the Arnold-Son-Yaffe
prediction G A my2 w x
D A a, the corrections to the linear dependence are large 11 . Of
course, some dependence is expected, since we know now that the scaling behavior
should be not G A a but G A aŽlogŽ1ra. q O Ž1..; but we can determine the coefficient
of the logŽ1ra. term by using the results of this paper. According to Arnold w32x, we can
relate the classical lattice results to continuum ones, approximately, by replacing
m2D , 0.684 g 2 Tra. In this case the a dependence of G in classical, lattice Yang–Mills
theory should be
X 2 y1 r2 g 2 aT
Gon the lattice s 0.465k log ž Žg aT . q O Ž 1. / a W4 T 4 , Ž 40 .
4
with k X the same as the one we find but the O Ž1. correction different. This formula
makes it possible to correct the data in w11x to remove the logarithmic dependence on a,
for instance by adjusting the data so they all correspond to g 2 aT s 1r4, which is the
value for the finest lattice used there. I have done so, and the result is plotted in Fig. 4.
I fit the data to the form Gra s C1 q C2 a to find the corrections to the small spacing
limit which do not arise from the log. The fit is very good, see Fig. 4, but the coefficient
C2 is quite substantial. The physical value of HTL strength, using m D s Ž11r6. g 2 T 2
and g 2 s 0.4, corresponds to g 2 aTr4 s 0.037, where the correction C2 a accounts for
an 11% shift from the small a limit, with the actual value falling below the large m D
extrapolation. The correction reduces the sphaleron rate. Given the other systematics in
play it is probably not fair to call this a measurement of the correction to the m D 4 g 2 T
limit; rather I will call it an estimate to tell how large the systematic error is.
Another correction which is parametrically suppressed but not necessarily very small
arises from QCD scatterings of quarks. Quarks are responsible for almost half of m2D ,
and hence almost half of all HTL effects. But quarks scatter strongly. While a strong
scattering does not disturb a quark’s electroweak charge, it does change its direction of
flight, whereas the calculation of the HTL’s is made assuming particles maintain straight
line trajectories. If the free path for strong scattering were ( 1rg 2 T this would make an
O Ž1. correction to the quarks’ influence on IR physics. Actually the quark free path for
Fig. 4. Small lattice spacing extrapolation of data in pure classical lattice Yang–Mills theory, taken from Ref.
w11x. The data have been corrected to absorb the leading log dependence on m2D A1r a determined here. They
show a substantial linear correction to the predicted G A a behavior. This is evidence of non-negligible
corrections to the parametric m D 4 g 2 T limit.
390 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
large angle scattering is parametrically order Ž a s2 logŽ1rg s .T .y1 and the correction is
formally O Ž a s2ra W . ; g 2 ; but numerically this might not be small. The effect of this
correction is to increase the sphaleron rate. It might be possible to include this correction
within the context of the ‘‘particles’’ approach by calculating more precisely the size
and frequency of quark scatterings and adding them to the dynamics of the particle
degrees of freedom.
I should also mention that the evaluation of the leading log coefficient made here was
within pure SUŽ2. Yang–Mills theory, without a Higgs boson. In the context of
baryogenesis we actually need to know the sphaleron rate in the presence of at least one
Higgs boson, in the symmetric phase and roughly at the equilibrium temperature. The
Higgs field’s evolution is not overdamped, because the hard thermal loops for a scalar
field are nothing but a mass squared correction. Hence the k ; g 2 T modes of the Higgs
fields evolve on the time scale 1rg 2 T, which is parametrically faster than the gauge
fields. Therefore, on the 1rg 4 T time scale, the gauge fields only see the thermodynamic
average over all Higgs fields in the fixed gauge field background6 . This could be
simulated by including the Higgs field in the Hamiltonian, and evolving it with heat bath
dynamics, but updating the Higgs fields much more often than the gauge fields. Then we
would have to extract the limit as the Higgs field update is made infinitely faster than
the gauge field update. I have not yet attempted to do this, but I anticipate that for
parameters which make the phase transition strongly first order, the effect should be a
slight reduction of the leading log coefficient for the sphaleron rate.
6. Conclusions
6
Dam Son pointed this out to me.
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 391
large as the statistical ones. There are also corrections to the g < 1 approximation,
probably at around the 10% level.
Besides making it possible to correct previous results, the leading log approximation
¨
and Bodeker’s effective theory provide a clean test-bed for determining how good an
approximation it is to treat the symmetric phase of Yang–Mills Higgs theory using just
Yang–Mills theory. Checking how much difference the Higgs field makes is an
interesting project for the future. It would also be interesting to study the dependence of
k X on the number of colors, since the sphaleron rate in the strong sector is also
phenomenologically interesting for baryogenesis.
Acknowledgements
¨
I am grateful to Peter Arnold, Dietrich Bodeker, Dam Son, and Larry Yaffe for
illuminating discussions.
Ref. w34,35x explains how the lattice to continuum relation for thermodynamical
properties of 3-D gauge or gauge-Higgs theory can be studied as an expansion in g 2 aT,
where a is the lattice spacing and g 2 is the coupling in 4-D notation. The combination
g 2 T is the coupling constant of the 3-D theory. Since it is dimensionful, a perturbative
matching between continuum and lattice theories must be an expansion in g 2 Ta on
dimensional grounds. In Yang–Mills Higgs theory with fundamental or adjoint scalar
fields the leading terms can behave as 1ra because the theory contains dimension 2
operators, but in strict Yang–Mills theory the lowest order operator is dimension 4 and
the leading corrections are O Ž a.. Further, the only O Ž a. correction is a rescaling of the
coupling, equivalent to a rescaling of the physical length scale, and it only arises from
one-loop diagrams, and has been computed w34,35x.
Arnold, Son, and Yaffe have recently demonstrated that, because of super-renormal-
izability and general considerations worked out by Zinn-Justin and Zwanziger w37x, the
same applies to the dynamics under Langevin dynamics w22x. In particular, any high
dimension corrections which can appear in Eq. Ž15. would change the dynamics at
O Ž a2 ., simply because there are no gauge invariant, P-even dimension-5 operators in
Yang–Mills theory. The only possible O Ž a. correction, besides the thermodynamic one
392 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
already mentioned, is a rescaling of the Langevin time scale, which will arise exclu-
sively at one loop. This opens the possibility of computing the O Ž g 2 aT . corrections
between the continuum and lattice Langevin dynamics, by performing a one-loop match.
It also means the Langevin dynamics can be replaced by heat bath dynamics at the cost
of another O Ž g 2 aT . shift, which I will measure, rather than compute, in Appendix A.2.
In the continuum theory and in temporal gauge the Langevin evolution is
dA ai Ž x . EH
sy a q j ia Ž x . ,
dt E Ai Ž x .
² j ia Ž x ,t . j jb Ž y,t X . : s 2Td i j d a b d 3 Ž x y y . d Ž t y t X . . Ž A.1 .
The Langevin equation for the lattice theory must be written in terms of the unitary
parallel transporter matrices Ui Ž x . and their derivatives. By definition the matrix
Ui Ž x . g SUŽ N . is the matrix such that, if F is a fundamental representation object
transforming as an object at point x q ai,ˆ Ui Ž x .F is the parallel transport to point x;
thus Ui Ž x . ‘‘lives on’’ the link between the site x and the site x q ai.ˆ Writing DLa for
™
the left acting derivative,
DLaU s igaT aU , DLa F Ž U . s F Ž U U q DLaU . y F Ž U . , Ž A.2 .
the Langevin equation for U is
dUi Ž x .
s yDLaUi Ž x . Ž bL TDLa H KS q j ia Ž x . . ,
dt
8 di j da b d x y
² j ia Ž x . j jb Ž y . : s d Žtyt X . . Ž A.3 .
g 2 a 4bL
Here T a is a fundamental representation Lie algebra generator with the standard
normalization; H KS is the Kogut–Susskind Hamiltonian, the sum over elementary
plaquettes of the trace of the plaquette,
H KS s Ý 1 y 12 Tr UI ; Ž A.4 .
I
and bL is the inverse temperature in lattice units. At leading perturbative order
bL s 4rŽ g 2 aT ., but it receives radiative corrections, computed in w34,35x, which shift it
by a constant. What I write here as bL is bnaive in the notation of w34,35x, but in the
body of the paper I have always used the O Ž a. improved definitions when I refer to a or
bL . The combination bL H KS equals HrT of the continuum theory, up to radiative
corrections and high dimension operators which correct infrared physics at O Ž a 2 .. The
radiative corrections are absorbed up to errors which are O Ž a 2 . by the one-loop
radiative correction to bL , which I will present in due course.
Our task is to compute to O Ž a. the relation between the Langevin time scale t in the
continuum and lattice cases. The correction at loop order l will be O Ž g 2 aT . l , with the
g 2 T from loop counting and the a to balance dimensions; so we need only go to
one-loop order. Part of the correction is from the shift in b already mentioned, but there
are also corrections from the relation between U and A and from radiative differences
between the lattice D a and the continuum derivative. I will be satisfied to perform the
calculation in a particular gauge, strict Coulomb gauge, without checking for the gauge
independence of the result. By strict Coulomb gauge I mean that at every Langevin time
the 3-D configuration is in 3-D Landau gauge. I should also fix a global time-dependent
gauge ambiguity; but this is irrelevant at the level of perturbation theory.
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 393
In Coulomb gauge the U matrices are all close to the identity matrix, with a departure
of order by1L
r2
; so it makes sense to define a lattice gauge field A ai through
Ui Ž x . 'exp Ž igaT aA ai . . Ž A.5 .
The A field so defined has the same normalization as the continuum A field at tree
level, but there are radiative corrections. The value of the radiative correction to A is
obtained in the theory with an added scalar field by matching the one-loop values of the
gauge-scalar three-point vertex at small transfer momentum. There is a contribution
from one-loop vertex corrections and from one-loop scalar wave function corrections,
and neither depends on the number of scalar particles in the theory, so the answer is the
same in the pure gauge theory. The one-loop correction was computed as a byproduct in
w34,35x in a general gauge, for SUŽ2.. The result, in Coulomb gauge but for SUŽ N .
gauge theory, is
Ng 2 aT 1 S j
A ai Ž continuum. s 1q ž 4 /ž 18 4p
q3
4p / A ai Ž lattice. . Ž A.6 .
The numerical constants S and j were first defined in w38x, and their numerical values
are S s 3.175911536 and j s 0.152859325.
Next I need to find the relation between applying the lattice and continuum
derivatives. For this purpose it is actually more convenient to write the update rule in
terms of center acting derivatives,
DCa U s U 1r2 Ž igaT a . U 1r2 , Ž A.7 .
because the formulation will then be parity symmetric. Naively one would expect that if
we set dUirdt s Eia DCa U, that dA ai rdt s Eia, and indeed this is correct at leading order
in A. But beyond leading order there are corrections; expanding both sides gives
dUi d
s exp Ž igaT aA ai .
dt dt
s igaT a
dA ai
dt
y
g 2 a2 T b T c
2
A bi
dAci
dt ž
q Ž b c. l /
ig 3a3 T d T e T f dA if
y
6 ž A di A ei
dt /
q permutations q O Ž a 4 . , Ž A.8 .
g 2 a2
Eia D aU s 1 q igaT aA ai y T b T cA bi Aci q O Ž a3 .
2
g 2 a2
= igaT d Eid 1 q igaT eA ei y T f T hA if A hi q O Ž a3 . . Ž A.9 .
2
We must expand to O Ž a 2 . corrections because ² A2 : ; 1ra. Equating the expressions,
after some work we obtain
dA ai d
g 2 a2
Ž lattice . s E i d ad y f f A b AcqO Ž a 3 . . Ž A.10 .
dt 24 a b e c d e i i
To save some writing I left off marking that all A, E above are in position space and at
the same coordinate position. The above correction is strictly a lattice effect and the
equivalent continuum relation is dA ai rdt s Eia.
394 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
Naively I should now equate Eia with ybL DCa H KS q j , evaluate the ŽUV dominated.
mean value of the correction between the lattice and continuum relations, and thereby
determine the rescaling of the Langevin time. This would not be strictly correct, though,
since the Langevin equation, Eq. ŽA.3., gives time evolution in temporal gauge. The
temporal gauge evolution breaks the Coulomb gauge condition, and if we are to do the
calculation in Coulomb gauge we must make a time-dependent gauge change to
maintain the Coulomb condition at all Langevin times. There is some danger that the
gauge changing will also lead to a radiative correction to the time relation; it turns out it
does not, but I will go through the calculation anyway to show that it does not, since the
cancellations may be special to strict Coulomb gauge.
The Coulomb gauge condition is DL P A a Ž x . s 0, which means
This does not look cubic invariant because our labeling associates Ui , and hence A i , on
the link between x and x q aiˆ with the site x; it might better be associated with the
ˆ which would make the cubic invariance more obvious.
point x q air2,
To maintain this Coulomb condition, we must apply a time-dependent gauge transfor-
mation at each site. The difference between Coulomb gauge and temporal gauge
satisfying the Coulomb gauge condition at t s 0 will be a gauge transformation by
L s T expHigT a G a Žt . dt ., where the T means the exponential should be time ordered
with respect to t . The value of G is fixed by the requirement that the Coulomb
condition remain true; we must choose G so the departure from the gauge condition due
to evolution of the fields and due to the action of G cancel. In an infinitesimal time
interval dt the gauge change alters U through
d Ui Ž x . s Ž 1 y igT a G a Ž x . dt . Ui Ž x . Ž 1 q igT b G b Ž x q aiˆ . dt . y Ui Ž x . .
Ž A.12 .
Expanding U on each side, the G contribution to the time evolution of A is
dA ai G d Ž x q aiˆ . y G d Ž x . g 2 a2
dt
Ž from G . s
a ž da d q
12
f a b e f c d e A bi Ž x . Aci Ž x .
/
G c Ž x q aiˆ . q G c Ž x .
y g f a b c A bi Ž x . . Ž A.13 .
2
Now we need to substitute this expression, and the relation between E and dArdt ,
into Eq. ŽA.11. to determine the relation between E and G. The result, Fourier
transformed to momentum space, is
ig 2 a2
0 s ik˜ i Eia Ž k . y k˜ i f a b e f c d e b c
Hlm A Ž l . A Ž m . E
i i i
d
Ž k y l y m . y k˜ 2 G Ž k .
24
Here summation over vector indices is implied in terms where the index appears at least
2 times. I have used the conventional shorthand k˜ i s Ž2ra.sinŽ k i ar2., k˜ 2 s Ý i k˜ i2 , and
Hl s Hd 3 lrŽ2p . 3, where the range of the integration is wypra,prax 3. Indices are
summed whenever the index appears an even number of times, which can be more than
twice in cubic symmetric but rotationally non-symmetric expressions; but if an index
appears an even number of times on each side of a q or y sign I am re-using it, it is an
independent index in each expression. To get the continuum version of this expression,
drop the O Ž a2 . terms and set k˜ s k, cosŽ kar2. s 1.
We can determine G perturbatively in g by expanding Eq. ŽA.14. in powers of g.
The result to O Ž g 2 . is
k˜ i Eia k˜ i l˜j
GaŽ k. si q gf a b c Hl k˜ cos Ž l˜i ar2 . A bi Ž k y l . Ejc Ž l .
k˜ 2
l˜
2 2
k˜ i l˜j m̃ k
y ig 2 f a b e f c d e Hlm ˜ cos Ž l˜i ar2 . cos Ž m
˜ j ar2 . A bi Ž k y l .
k l˜ m̃
2 2 2
ig 2 a 2 k˜ i
= Acj Ž l y m . Ekd Ž m . y fa b e fc d e b c
Hlm A Ž l . A Ž m . E
i i i
d
Ž k y l y m.
24 k˜ 2
ig 2 a 2 k˜ i k˜ i m̃ j
y fa b e fc d e Hlm A bi Ž l . Aci Ž k y l y m . Ejd Ž m . . Ž A.15 .
12 k˜ 2 m̃2
This in turn must be substituted into Eq. ŽA.13. to find the true value of dArdt in
Coulomb gauge,
dA ai Ž k .
dt
k˜ i k˜ j l˜k k˜ i k˜ j
ž
s di j y
k˜ 2 / Eja q igf a b c Hl l˜ 2 ž di j y
k˜ 2 /
cos Ž l j ar2 . A bj Ž k y l . Ekc Ž l .
y1 k˜ i k˜ j
q g 2 a2 f a b e f c d e Hlm 24 ž di j y
k˜ 2 / A bj Ž l . Acj Ž m . Ejd Ž k y l y m .
1 ˜ i m˜ k d i j
m k˜ i k˜ j k˜ j m
˜k l˜k m
˜l
y
12 ž ˜
m 2
y
k˜ 2 m
˜2 / A bj Ž k y l y m . Acj Ž l . Ekd Ž m . y
l˜2 m
˜2
k˜ i k˜ j
ž
= di j y
k̃ 2 / cos Ž l j ar2 . cos Ž m k ar2 . A bj Ž k y l . Ack Ž l y m . Eld Ž m . .
Ž A.16 .
396 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
Below I will be interested in the case where E is uncorrelated with A, in which case
the mean value of the term linear in A vanishes and we can substitute the leading order
Landau gauge A field correlator,
k˜ i k˜ j y1
Ž k˜ 2 .
² A ai Ž k . A bj Ž l . : s d a b d Ž k q l . d i j y
ž k̃ 2 / , Ž A.17 .
into the remaining terms. The contributions from the terms with Ž1r12. and Ž1. in front
vanish, only the term with the leading Ž1r24. coefficient contributes. It requires that we
perform the integral
1 S
a2 Hl l˜ 2
sa , Ž A.18 .
4p
which is the definition of the constant S which appeared earlier. We also need the
integral
l˜12 a S
a2 Hl 2
s . Ž A.19 .
Ž l˜ 2 . 3 4p
The result is
dA ai Ž k . k˜ i k˜ j 1 Ng 2 aT S
dt ž
Ž lattice. s d i j y
k˜ 2 / 1q
9 4 4p
Eja Ž k . , Ž A.20 .
valid when E is uncorrelated with gauge fields. The first factor projects out the
transverse component of E and is responsible for maintaining Coulomb gauge. The
continuum expression is the same but with a set to zero.
Using the previously established relation between lattice and continuum gauge field
normalization, the relation for the continuum normalized gauge field is
dA aa Ž k . Ž contin. Ng 2 aT 1 S j k˜ i k˜ j
dt Ž latt .
s 1q
4 ž 6 4p
q3
4p /ž di j y
k˜ 2 / Eja Ž k . , Ž A.21 .
valid for the gauge field response to the random force part of the Langevin equation. I
will not attempt to study the response of A from the Hamiltonian gradient part of the
Langevin equation, since it would involve understanding the radiative corrections to
D a H and would lead to an E which might be correlated with A fields. Since we know
that the Langevin equation correctly thermalizes the lattice system when we use the
radiatively corrected value of bL , it is sufficient to study the response to the random
force alone to determine the rescaling of the Langevin time scale.
Note that the correction appearing in Eq. ŽA.21. is precisely one quarter of the
radiative wave function correction for an adjoint scalar field in 3-D lattice gauge theory
when the scalar self-coupling vanishes, see Refs. w34,35x. This is perhaps not too
surprising. In the real time theory, the time evolution is generated by the electric fields,
which appear in the thermodynamics as the A 0 field, an adjoint scalar with zero
self-coupling. In w11x Turok and I speculated incorrectly that the time scale correction
would not contain any large tadpole corrections; but Eq. ŽA.21. is one quarter the adjoint
scalar wave function correction, which does contain tadpoles. Our incorrect guess was
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 397
based on analyzing the abelian theory, where the A 0 field turns out to be a free field.
The correction found here vanishes in the abelian case, for compact or non-compact
implementations.
Now I will finish the relation between time scales. I have just shown that the
response of the A field to the random force, shifting its normalization to correspond to
the continuum theory normalization, is
dA ai Ž x .
s Ž 1 q corr . j , Ž A.22 .
dt Ž latt .
where Ž1 q corr. is the quantity in brackets in Eq. ŽA.21.. But the autocorrelator of j is
not the same as it would be in the continuum, because of the radiative corrections to bL .
The mean square change in A ai over a Langevin time dt Žlatt. is, using Eq. ŽA.3.,
2Tdt Ž latt . 4
² Ž d A ai Ž x . . 2 : s Ž 1qcorr. 2 3 2
. Ž A.23 .
a g aTbL
However, bL s 4rg 2 aT only at leading order in bL . Beyond leading order, in the pure
gauge theory, it is w34,35x
bL g 2 aT g 2 aT N 2y2 37N j
4
y 1 ' Zy1
g y1s
4 ž 3N
q
3 4p / . Ž A.24 .
The continuum theory Langevin equation would cause a mean square change to A of
2Tdt Ž contin.
² Ž d A ai Ž x . . 2 : s , Ž A.25 .
a3
dt Ž contin. 2
g 2 aT N S 19 j N 2y2
dt Ž latt .
s Ž 1 q corr . Z g s 1 q
4 ž 3 4p
y
3 4p
y
3N / .
Ž A.26 .
The effect of the heat bath algorithm on the infrared degrees of freedom is equivalent
to Langevin evolution. I will show this at tree level, which gives a tree relation between
Langevin time and the number of heat bath updates applied. The only O Ž a. correction to
this relation possible is an O Ž a. shift in time scales between the algorithms; to find the
magnitude of the shift I make a direct numerical measurement. I do not attempt a
perturbative calculation of the relation between continuum Langevin time and amount of
heat bath applied, beyond leading order.
First I will show that the effect of the heat bath algorithm on the IR degrees of
freedom is equivalent to the Langevin algorithm, and I find the relation between the
Langevin time scale to the number of links updated by heat bath, at leading order. The
way the heat bath algorithm works is as follows:
This is a heat bath update; for a more precise description see Ref. w39x. The first part,
the quench of the link, serves to perform the yE HrE A part of the Langevin update, and
the multiplication by a random SUŽ2. element reproduces the noise part of the Langevin
update.
To see the relation between the heat bath and the Langevin update at leading order it
is sufficient to consider the linearized theory, that is, to expand the Hamiltonian to
quadratic order in the gauge field A. On an N = N = N toroidal lattice the relation
between the connections U, the gauge field A i Ž x ., and the Fourier transform of the
gauge field AŽ k, s . Ž s a polarization index. is Žwriting all Lorentz and group index sums
explicitly, there is no implicit summation convention in what follows.
g 2 a2
Ui Ž x . s exp žÝ igaT aA ai /
Ž x. ,1q Ý igaT aA ai Ž x. y Ý Aai Ž x . Aai Ž x . ,
a a 8 a
Ž A.27 .
ˆ .. ,
A a Ž k , s . s Ny3r2 Ý e i Ž s,k . A ai Ž x . exp Ž yik P Ž x q air2 Ž A.28 .
x ,i
ˆ .. ,
A ai Ž x . s Ž longitudinal piece . q Ny3r2 Ý A a Ž k , s . e i Ž s,k . exp Ž ik P Ž x q air2
k,s
Ž A.29 .
where e i Ž s,k . is a transverse polarization vector, satisfying
Ý e i Ž s,k . e i Ž sX ,k . s ds, s X Ž A.30 .
i
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 399
and
Ý e i Ž s,k . k˜ i s 0 ; Ž A.31 .
i
there are two such states for each k. The sum over k includes all k of form Ž2praN . n,
with n a triple of integers each in the range 0 ( n i - N. Only the transverse fields are of
interest here, at the order we are working the longitudinal part is pure gauge. Its
behavior depends on our choice of gauge fixing, and has no influence on the transverse
fields Žor on physics.. ŽThis is not true at higher order in g, where the longitudinal terms
are responsible for the Faddeev–Popov determinant.. At the level we are working the
value in thermal equilibrium of A is given by
) T
² Ž A a Ž k , s . . A b Ž kX , sX . : s d k , k X d a b d s, sX . Ž A.32 .
k̃ 2
Now let us analyze how the fields change when the heat bath algorithm is applied to
a link Ž x,i .. The terms in the Hamiltonian containing the link x,i are
H 1 2
T
> 2
a T
Ý
a, j/i
1
2 ž A Ž x . q A Ž x q ajˆ . y A Ž x q aiˆ . y A Ž x . /
a
j
a
i
a
j
a
i
2
q 12 yA aj Ž x y ajˆ . q A ai Ž x y ajˆ . q A aj x y a Ž iˆq jˆ . y A ai Ž x .
ž ž / / ,
Ž A.33 .
and ‘‘quench’’ part of the heat bath algorithm will replace A ai Ž x . with the value which
minimizes this expression,
1
A ai Ž x ,after . s
4
Ý ž Aaj Ž x . q Aai Ž x q ajˆ . y Aaj Ž x q aiˆ . y Aaj Ž x y ajˆ .
j/i
qA ai Ž x y ajˆ . q A aj x y a Ž iˆq jˆ .
ž // . Ž A.34 .
Using equation ŽA.29. and Eq. ŽA.31., and adding a term j a to represent the noise
which will be added, this is
A ai Ž x . Ž after .
a2 k˜ 2
s A ai Ž x ,before . q j a y Ý ˆ .. .
e i Ž s,k . A a Ž k , s . exp Ž ik P Ž x q iar2
k,s N 3r2
Ž A.35 .
Now we should compute the size of j a. Because the Hamiltonian is expanded only to
quadratic order, the noise is Gaussian, of amplitude set by the size of the quadratic in
A i Ž x . term in H, which from Eq. ŽA.33. is Ž2ra2 T .Ý a A ai Ž x . A ai Ž x .. The amplitude of
the noise j a is then Žno sum on a. ² j aj a : s a 2 Tr4.
400 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
Next we will see what impact this update has had on the Fourier mode AŽ k, s ..
Combining Eqs. ŽA.28. and ŽA.35., we find
e i2 Ž s,k . a2 k˜ 2
A a Ž k , s,after . s A a Ž k , s,before . 1 y
ž 4N 3 /
e i Ž s,k .
q ˆ ..j a
exp Ž yik P Ž x q air2
N 3r2
a 2 l˜ 2e i Ž s,k . e i Ž sX ,l .
y Ý A a Ž l, sX ,before .
X
Ž s ,l ./ Ž s, k . 4N 3
ˆ .. .
= exp Ž i Ž k y l . P Ž x q air2 Ž A.36 .
The Ž k, s . term in the sum is removed and included instead in the first term. It is
responsible for the damping term in the Langevin equation. Both the noise term j and
the final term, which I will call the ‘‘cross-talk’’ term, are responsible for the noise term
in the Langevin equation.
To measure the magnitude of the Langevin damping term, we must compute the
correlator ² A) Ž k, s,before. AŽ k, s,after.:. Because ² AŽ k, s . AŽ l, sX .: s 0 s ² AŽ l, sX . j :,
we get
a2 k˜ 2e i2 Ž s,k .
² A) Ž k , s,before . A Ž k , s,after . : s ² A Ž k , s,before . 2 : 1
ž 4N 3 / . Ž A.37 .
It is also important to make sure that the mean square value of A a Ž k, s . is unchanged
by the update, which is the requirement that the noise have the right amplitude. Here we
get a little surprise; squaring Eq. ŽA.36.,
² Ž A a Ž k , s,after . . 2 :
2
a2 k˜ 2e i2 Ž s,k . e i2 Ž s,k .
² Ž A a Ž k , s,before . 2 : q ²Ž j a . 2 :
s 1y
ž 4N 3 / N3
2
1 a 2 l˜ 2
² A a Ž l, sX . A a Ž l, sX . :e i2 Ž s,k . e i2 Ž sX ,l .
q
N6
Ý
Ž l , sX ./ Ž k , s .
ž / 4
. Ž A.38 .
² Ž A a Ž k , s,after . . 2 :
2
a2 Te i2 Ž s,k . 1 1 1
s ² Ž A a Ž k , s,before . :
N 3 ž y
2
q
4
q
16 N 3
Ý e i2 Ž sX ,l . a2 l˜ 2
X / .
l, s
Ž A.39 .
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 401
At leading order in large N, the sum gives 4 N 3. Therefore the mean size of A a Ž k, s . is
unchanged, which means that we have the correct amount of noise. We see that fully
half of the noise actually arises from ‘‘cross-talk’’ between the mode of interest and
extremely UV modes, with the other half arising from the noise explicitly appearing in
the algorithm.
To be Langevin the noise must have zero unequal-time correlation. This is the case
for j by explicit construction, but we need to check it for the ‘‘cross-talk’’ noise. For
the heat bath algorithm to act like a Langevin algorithm, the ‘‘cross-talk’’ contribution
to AŽ k, s . from updating the Ž x,i . link must be independent of that from the Ž y, j . link,
at least on averaging over the choice of Ž y, j . Žwhich is indeed chosen randomly in the
algorithm I use.. This is the case; from Eq. ŽA.36., the correlation between the
‘‘cross-talks’’ is
a4 l˜2 m̃2
e i Ž s,k . e j Ž s,k . e i Ž sX ,l . e j Ž sXX ,m . ² A a Ž l, sX .
X
Ý Ý
XX
Ž l , s ./ Ž k , s . Ž m , s ./ Ž k , s .
ž 16 N 6 /
= A a Ž m, sXX . :exp ik P x y y q a Ž iˆy jˆ. r2 exp Ž yil P Ž x q air2
ž ž ˆ . //
a 4 l˜2
ˆ .. s
qim P Ž y q ajr2 Ý e i Ž s,k . e j Ž s,k . e i Ž sX ,l . e j Ž sX ,l .
Ž l , sX . / Ž k , s . 16 N 6
™
applied., the only remaining correction would be a rescaling of the time scale, which
must be at worst O Ž a. since it vanishes as g 2 0 Žin which limit the calculation just
presented is exact., and on-dimensional grounds any O Ž g 2 . correction must be O Ž g 2 aT ..
We could in principle determine this O Ž a. correction by an analytic computation,
extending the one just presented to second order in g. Instead I compute the subleading
effects by the following strategy. I choose some infrared measurable O , and measure it
at each lattice point at a closely spaced series of Langevin times. I do the same using the
heat bath algorithm. Then I compare the unequal-time correlator or autocorrelator
C Žt y t X . s ² O Ž x,t . O Ž x,t X .:, where the average is over the ensemble of Langevin
trajectories, or in practice over coordinates and times in a single very long Langevin
trajectory. To match the time scales, we see what rescaling of the heat bath time scale is
needed to make the autocorrelator match the autocorrelator for the Langevin case. Any
IR measurable will do because we know that the algorithms both behave as Langevin
algorithms on the IR degrees of freedom, so the only O Ž a. difference would be a time
rescaling which will be of the same amplitude for any unequal-time observable.
I should explain that the reason this is worth doing at all is that, first, the heat bath
algorithm is much faster and does not suffer from step size errors like the Langevin
algorithm, and second, there are infrared measurables other than the topological density,
for which the autocorrelation statistics improve much more quickly. If the latter were not
true we would spend as much computer time making the match between techniques as it
would take to do the measurement of G by the Langevin method.
The measurable I choose is a fundamental representation Wilson loop after some
amount of cooling, specifically a 4 = 4 square Wilson loop after t s 3.125a 2 of gradient
flow cooling under H KS . This is an infrared measurable because such a large Wilson
loop samples mostly the infrared gauge fields, and because the cooling removes most of
the UV fluctuations anyway.
Incidentally, it is not too hard to compute the leading order perturbative prediction for
this quantity. The mean trace of an l = l Wilson loop in SUŽ N . gauge theory after a
length t of gradient flow cooling, neglecting lattice artifacts, is
= exp Ž y2 k 2t . q O Ž g 4 . . Ž A.41 .
™
In the t 0 limit the integral has logarithmic UV divergences but for finite t it has a
well defined value and is dominated by the infrared, k Q ty1 r2 . Of course, for the
Wilson loops under consideration here, perturbation theory will be unreliable because
the length scales involved are close to the scale where perturbation theory breaks down
completely. It might be interesting to see whether the infrared fields are stronger or
weaker than at leading order in perturbation theory, though.
I measured the same site, unequal-time Wilson loop correlator by measuring each
4 = 4 Wilson loop in an even sublattice every a 2r2 of Langevin time, for a series of
Langevin trajectories each about 200 a2 long, with 50 a 2 Langevin time between
trajectories to eliminate correlations between trajectories. For each trajectory I deter-
mined the autocorrelator averaged over volume and time. The autocorrelation function
G.D. Moorer Nuclear Physics B 568 (2000) 367–404 403
C Ž Dt . looks something like an exponential tail but is not well fit by one; there is some
small amount of much longer time scale correlation caused by the slow evolution of the
most infrared gauge fields. To compare the Langevin and heat bath time scales I
averaged C Ž Dt . over data sets for each update method and determined the rescaling of
the Langevin time which minimized the difference between the results,
Dtmax 2
x 2s H0 Ž C Langevin Ž Dt . y rC C heat Ž rt Dt . . d Dt , Ž A.42 .
where I allow a rescaling both of the autocorrelation time and of the overall magnitude
of C. I chose Dtmax to be enough that the autocorrelator has fallen about 1 y ey1 of the
way to its large Dt limit, but the result is quite insensitive to the specific choice. The
coefficient rt at the extremum of x 2 gives the rescaling of the Langevin time scale. The
multiplicative rescaling of C is necessary because there are small, very long time scale
correlations in the measurable which can effectively shift one data set somewhat with
respect to the other. As a check I compared the first half of the Langevin data I took
with the second half. I find that rt s 1 within a small tolerance the same size as the
jackknife error bars, but rC differs from 1 by a few percent.
For Ž4rg 2 aT . s 6 I ran each update procedure on a 24 3 lattice for t , 9000 a2 . The
rescaling of the time scales was
Dt heat s 1.098 " 0.007Dt Langevin , Ž A.43 .
with the error bar determined by the jackknife method.
The Langevin step size used here was D s 0.05. The definition of D and the
second-order algorithm I used are in Section 4 in the body of the paper. To check for
step size errors I evolved a trajectory for half as much Langevin time, using D s 0.025.
The rescaling between this trajectory and the heat bath was 1.099 " 0.010 and the
rescaling between Langevin evolutions with the two step sizes was 1.001 " 0.009, so
Langevin step size errors are negligible at D s 0.05.
I also measured NCS during the D s 0.05 Langevin trajectory; the ratio of the
diffusion constants for the heat bath and Langevin algorithms was
G heat
Ž unrescaled. s 1.114 " 0.058 , Ž A.44 .
G Langevin
which is compatible with the measured difference in time scales, but with much larger
error bars. The autocorrelator of the Wilson loop develops good statistics more quickly
than the diffusion constant for NCS .
Incidentally the mean value of the Wilson loop trace was ²2 y Tr U4= 4 : s 0.261,
while the leading order perturbative prediction is 0.096. The infrared of Yang–Mills
theory has more excitation than leading order perturbation theory predicts, by quite a bit
on the scale of a square Wilson loop 8rŽ3 g 2 T . on a side.
For Ž4rg 2 aT . s 10 I used a 40 3 lattice but only t s 2000 a 2 . The rescaling of the
time scales was smaller as expected, 1.065 " 0.016. Assuming the subleading correction
to be purely O Ž a., we would have guessed from the bL s 6 result that the rescaling
factor would be 1.059 " 0.004, which is within error. The rescaling at both lattice
spacings are within error of being t Langevinrt heat s Z g , and I speculate that this is the
correct analytic relation at O Ž a.. For the finest lattice spacing, I have simply extrapo-
lated the medium spacing data assuming a pure O Ž a. form for the correction.
404 G.D. Moorer Nuclear Physics B 568 (2000) 367–404
The deviation of the mean value of the Wilson loop measurable from perturbation
theory is also smaller on the finer lattice; the value is 0.110, while the perturbative
estimate is 0.058. This is also expected, since a 4 = 4 Wilson loop is smaller in physical
units in this case, and perturbation theory works better as the length scale becomes
smaller.
References
a
´
Dpto. de Fısica ´
Teorica, UniÕersidad del Paıs´ Vasco, Apdo. 644, E-48080 Bilbao, Spain
b
Instituut Õoor Theoretische Fysica, UniÕersiteit Õan Amsterdam, Valckenierstraat 65, 1018 XE Amsterdam,
The Netherlands
c
Spinoza Instituut, UniÕersiteit Utrecht, LeuÕenlaan 4, 3584 CE Utrecht, The Netherlands
Received 9 June 1999; received in revised form 18 August 1999; accepted 1 November 1999
Abstract
It is well known that string physics has changed the way we now look at Cosmology.
In particular, string theory leads to consider multidimensional cosmological scenarios in
a natural way, since superstrings can only be consistently quantized in ten dimensions.
M-theory has confirmed this trend extending cosmology to the realm of the eleventh
dimension Žsee for example w1–8x..
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 9 7 - 5
406 ´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420
Higher dimensional cosmology is not new. Before the advent of string theory the
Kaluza–Klein paradigm had already put forward the idea that four-dimensional space-
time should not be something taken for granted in cosmology Ža collection of the most
relevant articles on the subject can be found in w9x; see also Ref. w10x.. In its original
version, the Kaluza–Klein program aimed to describe all four-dimensional matter fields
as purely gravitational Žor supergravitational. degrees of freedom in 4 q N dimensions.
Although this idea met serious obstacles when trying to account for the existence of
chiral matter in four dimensions w11,12x, it has been partially incorporated in string
theory where the presence of ten-dimensional chiral matter fermions solves the problem.
When looking at the four-dimensional physics, the extra internal dimensions leave
their blueprints in the form of a plethora of massless scalar and vector fields. On general
grounds, the presence of these massless scalars poses a serious problem in trying to
extract realistic cosmological models from string theory. The situation is complicated by
the fact that most of these fields represent flat directions of the superpotential that are
not lifted by quantum corrections as long as enough supersymmetry is preserved. In
many occasions, however, the study is restricted to the simplest cosmological models by
looking just at the effects of the dilaton and tensor fields, for example, and ignoring the
remaining moduli fields altogether 1 w13,44–50x.
The ultimate motivation of stringrM-theory cosmology is of course to solve the
problem of cosmological singularities, explain the initial conditions in cosmology and
the dimensionality of space-time. Close to t s 0, quantum gravity effects become
dominant and hopefully will smear the semiclassical singularity, thus opening a new
window to the study of the early universe. Although a full quantum cosmology
description of the early universe is still missing, the semiclassical analysis supported by
the use of stringy symmetries has been useful in getting a flavor of the physics close to
the Big-Bang singularity w14,22–31x. On the other hand, from a Kaluza–Klein perspec-
tive it is possible that the existence of the initial singularity might just be the result of
integrating out the physics associated with higher dimensions w15–17x and that it could
be removed already in the semiclassical approximation.
In this paper we will combine these two ideas and investigate the effect of extra
dimensions on the initial cosmological singularity by studying a family of Friedmann-
Robertson-Walker ŽFRW. cosmologies coupled to a number of scalar fields which we
associate with compactification moduli. We find that in the particular case of open or
flat Ž k s y1,0. universes in which only one of the extra dimensions has a non-trivial
dynamics, the Big-Bang singularity might be just an artifact of the Kaluza–Klein
reduction which may be removed when going to higher dimensions. On the other hand,
in the case of spatially closed solutions Ž k s 1. the Big-Bang or the Big-Crunch
singularity is postponed in the higher-dimensional model, again when only one of the
extra dimensions is time-varying. It is interesting that by switching on more than one
dynamical internal dimension the regularization of curvature singularities is definitively
spoiled in all cases. When interpreting, in the open and flat cases, this higher dimen-
sional cosmologies as solutions in M-theory we find that the U-duality group G 7 acts on
1
From the M-theory perspective, on the other hand, the dilaton itself is just a moduli associated with the
compactification of the eleventh dimension. The reason to separate it from other fields is that in the weakly
coupled string limit of M-theory its compactification scale g s l s is much smaller than the string scale l s .
´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420 407
the moduli space of metrics by relating singular geometries with non-singular ones. This
seems to indicate that a certain class of cosmological singularities in M-theory can be
physically resolved in terms of an equivalent dual regular background.
The plan of the article is the following. In Section 2 we will review the properties of
moduli fields arising from compactifications of Ž4 q N .-dimensional Einstein gravity on
a N-dimensional straight torus and provide a general algorithm to construct Gowdy-type
cosmologies in the presence of moduli fields. We will apply in Section 3 this algorithm
to generate modular FRW cosmologies and, after undoing the Kaluza–Klein reduction,
will study the structure of singularities of the higher dimensional ‘‘parent’’ metric. In
Section 4 the analysis will be focused on M-theory compactifications to four dimensions
and the action of U-duality on the moduli space of solutions. Finally, conclusions will be
summarized in Section 5.
Scalar fields appear naturally in the old Kaluza–Klein program or its stringrM-the-
ory versions. Here, we will consider the dimensional reduction of a Ž4 q N .-dimensional
metric on an N-dimensional straight torus T N s ŽS 1 . N using the ansatz 2
4
N N ci
2
2 ci '3 2
ds4qN s ey 3 ' Ý ds42 q Ý e Ž dw i . . Ž 1.
is 1
is1
N N
H (
S s d 4 x y g Ž4. R Ž 4 . y 2
ž Ý Em ci E m ci q 23 Ý Em ci E m cj
is1 i-j
/ , Ž 2.
leading to the result that the breathing modes of the higher dimensional metric appear as
mixed scalar in the four-dimensional action. To eliminate this mixing we may perform a
diagonalization in field space by defining the new fields w i through the relation
c i s Di j w j , Ž 3.
2
For a generic analysis of the structure of the dimensionally reduced action, see Refs. w37,38x.
408 ´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420
ymy1r2
1 my1r2
2 my1r2
3 ... my1r2
Ny1 my1r2
N
0 y2 my1r2
2 my1r2
3 ... my1r2
Ny1 my1r2
N
Ds .
.
.
.
.
.
.
.
.
.
Ž 4.
. . . . .
0 0 0 ... my1 r2
Ny 1 my1r2
N
¢ 0 0 0 ... y Ž N y 1 . my1r2
Ny 1 N
ß
my1r2
with
m n s 23 n Ž n q 1 . , n s 1, . . . , N y 1,
m N s 31 N Ž N q 2 . ,
which is not an O Ž N . transformation. The new scalar fields w i will be propagation
eigenstates and there will be no classical mixing among those. Their dynamics will be
governed by the action
N
H (
S s d 4 x y g Ž4. R Ž 4 . y 2 Ý Em wi E m wi . Ž 5.
is1
In our analysis we have assumed that all scalar fields have a geometric origin as
moduli of dimensional reduction. This point of view is very much appropriate for
M-theory where the dilaton is on the same footing with all other scalar as compactifica-
tion moduli. Nonetheless, in those regimes of M-theory that can be described in terms of
a weakly coupled string theory the dilaton field plays a privileged role as the field whose
vacuum expectation value determines the string coupling constant. In this case, dimen-
sional reduction from ten dimensions will produce again a number of scalar fields in the
lower-dimensional theory. The dynamics of those moduli can be extracted again from
the Kaluza–Klein ansatz
6
2
2
ds10 s ds42 q Ý e 2'2 s Ž dw i .
i , Ž 6.
is1
where now the ten-dimensional metric is no longer a vacuum solution of Einstein
equation but rather a solution of dilaton gravity instead. The resulting four-dimensional
action in string frame is w21x
6
H (
Sstring s d 4 x y g Ž4. ey2 f R Ž 4 . q 4Em f E mf y 2 Ý Em si E msi , Ž 7.
is1
where the four-dimensional dilaton f is defined in terms of the ten-dimensional one as
1 6
f s fŽ10. y Ý si . Ž 8.
'2 is1
We see from Ž7. that the kinetic term for the fields si is diagonalized in the
four-dimensional action, although it is conformally coupled to the dilaton field. This
conformal coupling can be removed as usual by a conformal transformation of the
metric. It is however when we re-express the four-dimensional dilaton in terms of the
ten-dimensional one via Ž8. that the mixing between the different si appears. In order to
recover the result from the compactification of a vacuum solution of M-theory ŽEq. Ž2.
´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420 409
with N s 7. we would need to lift the solution Ž6. of ten-dimensional dilaton gravity to
a vacuum solution in eleven dimensions.
In the following we will be interested in finding exact solutions to the field equations
derived from the four-dimensional action Ž5., that we can write in the manifestly
O Ž N .-invariant form as
S s d 4 x y g R y 2 EmF T E mF
H ' Ž 9.
where we have defined
w1
.
wN 0
F ' .. .
The matter energy–momentum tensor for Ž9. can be written as a sum of the correspond-
ing stress-energy tensors for each scalar field 3
N
Tmn s Ý TmnŽ i. s 2 Ž EmF T EnF y 12 gmn Es F T E sF . . Ž 10 .
is1
Let us concentrate our attention on Gowdy-type cosmologies with line element
p Žt , z .
ds 2 s e f Ž t , z . Ž ydt 2 q dz 2 . q K Ž t , z . e dx 2 q eypŽ t , z . dy 2 . Ž 11 .
At first sight, the Gowdy-type coordinates seem an unnecessary complication, since we
will be mostly dealing with cases where both geometries and scalar fields are homoge-
neous. The telling point to use them, nevertheless, is that: Ži. The scalar field equation
are linear in the metric functions. Žii. The evolution of the transversal metric functions
K Ž t, z . and pŽ t, z . is decoupled from scalar field dynamics, the longitudinal function
f Ž t, z . being the only one influenced by the presence of the scalar fields. And Žiii. that
due to the presence of a six-parameter isometry group G6 which includes the three-di-
mensional group G 3 acting simply transitively on the three-dimensional surfaces of
constant curvature in FRW models, the above line element, which has a G 2 isometry
group, naturally includes all three FRW geometries.
Let f Ž t, z . s f Ž t, z . vac such that Ž11. is a solution of the Einstein vacuum equations.
In this case the functions pŽ t, z . and K Ž t, z . satisfy the following conditions:
E E
K Ž t , z . p˙ Ž t , z . y K Ž t , z . pX Ž t , z . s 0
Et Ez
and
K¨ Ž t , z . y K XX Ž t , z . s 0.
The idea now is to solve Einstein equations with the energy–momentum tensor Ž10.. As
it happens with a single scalar field Žsee for example w18x., the transverse part
3
We have normalized the energy–momentum tensor in such a way that the Einstein equations are
Gmn sTmn .
410 ´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420
f Ž t , z . vac ™f Ž t, z. vac q f Ž t , z . sc
and the equations for f Ž t, z . sc can be written from the Einstein equations as
2K
f˙Ž t , z . sc s Ž K X Tt z y K˙ Tt t . ,
K y K˙ 2
X2
2K
f X Ž t , z . sc s Ž K X Tt t y K˙ Tt z . ,
K y K˙ 2
X2
where Tmn are the components of the energy–momentum tensor. Substituting Ž10. we
finally get
2K N N N
f˙Ž t , z . sc s 2 KX Ý w˙ i wiX y K˙ Ý w˙ i2 q Ý wiX 2
K X 2 y K˙ 2 is1
ž is1 is1
/ ,
2K N N N
f X Ž t , z . sc s KX Ý w˙ i2 q Ý wiX 2 y 2 K˙ Ý w˙ i w iX .
K X 2 y K˙ 2 ž is1 is1
/ is1
Ž 12 .
The structure of this expression Žsum over each scalar field. is the result of the fact that
Einstein equations are linear in the energy–momentum tensor and the energy–momen-
tum tensor itself is a sum of the contributions from each scalar field. Expressions Ž12.
are invariant under the global O Ž N . symmetry of Ž5., as one expects, since a rotation of
the fields by an element of this group does not modify the geometry. In addition, the
scalar fields w i must satisfy the wave equation
E E
K Ž t , z . w˙ i Ž t , z . y K Ž t , z . w iX Ž t , z . s 0, i s 1, . . . , N. Ž 13 .
Et Ez
Eqs. Ž12. can be used to generalize the algorithm of generation of string cosmologies
given in Ref. w13x. Taking w i s si Ž i s 1, . . . ,6. and w 7 s f , the four dimensional
dilaton, we generate exact solutions of the Einstein equations with seven scalar fields
that, after the conformal transformation by e 2 f will provide us with solutions to the
equations derived from the string theory action Ž7.. After this, we can further transform
the resulting four-dimensional metric by O Ž2,2;R. w13x or SLŽ2,R. w51x to generate other
fields, the moduli si remaining invariant under these transformations.
In addition to this, we can use this generation technique to directly construct
four-dimensional modular cosmologies representing toroidal compactifications of cos-
mological solutions of M-theory. In the following, we will particularize our analysis to a
certain class of these cosmological solutions that render homogeneous cosmologies in
four dimensions and use them to study how four-dimensional physics can be regularized
in eleven dimensions. This family of solutions will also be useful to study the moduli
space of M-theory on T 7.
´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420 411
We will now apply what we have learnt to the construction of rolling moduli
cosmologies in four dimensions w32–35x. We start with a vacuum solution and apply the
algorithm Ž12. with a collection of N time-dependent scalar fields of the form
w i Ž t . s qi w 0 Ž t . , i s 1, . . . , N Ž 14 .
with qi a N-tuple of real numbers and w 0 Ž t . a particular homogeneous solution to the
wave Eq. Ž13.. The numbers qi 4 play now the role of coordinates in the moduli space of
solutions. From Ž12. we get the following equations for f Ž t, z . sc
N 2 KK˙
f˙Ž t , z . sc s y
ž /
Ý qi2
is1 K X 2 y K˙ 2
w˙ 02 ,
N 2 KK X
f X Ž t , z . sc s
žÝ /
is1
qi2
K X 2 y K˙ 2
w˙ 02 . Ž 15 .
Notice that in the family of solutions under consideration, the O Ž N . global symmetry of
the Einstein-Klein–Gordon action Ž5. acts linearly on the qi ’s and that the numerical
prefactor in Ž15. is just the O Ž N .-invariant metric on the moduli space.
In the generic case, the resulting family of metrics will always have strong curvature
singularities at some values of the time coordinate t. In what follows we will study what
happens to these singularities from a higher-dimensional point of view, when we undo
the Kaluza–Klein reduction.
Let us start with the following solution to the vacuum Einstein equations in four
dimensions
y1 r2
2
ds vac s Ž sinh2 t . Ž cosh4t y cosh4 z . 3 4 Ž ydt 2 q dz 2 .
q 12 sinh2 tsinh2 z Ž tanh z dx 2 q cotanh z dy 2 . Ž 16 .
and ‘‘dress’’ it with the homogeneous scalar fields Ž14. taking
'3
w0 Ž t . s log tanht.
2
The solution coupled to the N scalar fields is
1 3
Ž3 l y1 . Ž1y l .
ds 2 s Ž sinh2 t . 2
Ž cosh4t y cosh4 z . 4
Ž ydt 2 q dz 2 .
q 12 sinh2 tsinh2 z Ž tanh z dx 2 q cotanh z dy 2 . , Ž 17 .
where we have defined l as the O Ž N .-invariant quadratic form
N
l' Ý qi2 . Ž 18 .
is1
The interesting feature of these solutions is that the dynamics of the scalar fields is
only relevant at early times, saturating to a constant when t `. One would expect this
sort of behavior in ‘‘realistic’’ models for the modular fields in stringrM-theory
™
412 ´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420
cosmology. The scalar fields are supposed to play an important role in the evolution
during the early epoch, but are expected to be frozen at a certain vacuum expectation
value after supersymmetry breaking. From that time on, the dynamics of the universe is
dominated by matter or radiation.
The case with l s 1 is especially interesting, since we recover an open FRW
universe
N
2
q Ý tanh2 p i t Ž dw i . , Ž 20 .
is1
where we have introduced the new constants pi s Di j q j , with Di j the matrix Ž4.. We
can now rewrite the condition l s 1 in terms of the new moduli space coordinates pi 4
as
N N
l' Ý pi2 q 23 Ý pi pj s 1. Ž 21 .
is1 i-j
™
this, we can evaluate the square of the Riemann curvature tensor Ra b c d R a b c d, which is
non-vanishing, in the t 0 limit
Ra b c d R a b c d ; C Ž pi . t 2Ž Sy3. q O w t 2Ž Sy2. x , Ž 22 .
´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420 413
N
where we have written S s Ý is1 pi and C Ž pi . is defined by
N
2
C Ž pi . s 163 Ž S y 1 . Ž S 2 y 2 S q 5 . q Ý pi2 3 q pi2 q Ž S y 3 . Ž pi q S .
is1
N
q Ý pi p j .
i-j
™
First, let us notice that S ( '3 l - 3 so the leading term in Ž22. will always diverge at
t 0 limit. Thus, in general, we will have curvature singularities in 4 q N dimensions
in this limit. However, we find the surprising result that whenever pi s 1, pj s 0 Ž j / i .
C Ž pi . s 0 and actually the curvature invariant is regular for all times and given by 4
18
Ra b c d R a b c d s . Ž 23 .
cosh8 t
This corresponds to having only one dynamically non-trivial cycle whose scale factor
evolves with time. In the four-dimensional metric, on the other hand, due to the
non-diagonal relation between pi 4 and qi 4 , in general all scalar fields will have a
non-trivial dynamics. Note, however, that the dimensionally reduced model Ž17. is
generally inhomogeneous for generic l, the regular higher-dimensional solution falling
into a homogeneous and isotropic class. One might wonder whether higher-dimensional
regularity and lower-dimensional isotropy are physically linked somehow.
We now pass to discuss a four-dimensional metric with closed spatial sections. The
vacuum seed metric is obtained from Ž16. by replacing hyperbolic functions by their
trigonometric counterparts. We again couple the space-time to N scalar fields of the
form Ž14. now with
'3
w0 Ž t . s log tant.
2
Imposing as before l s 1, we are left with the family of four-dimensional metrics
dsl2s1 s sin2 t Ž ydt 2 q dz 2 . q 12 sin2 t sin2 z Ž tan z dx 2 q cotan z dy 2 . ,
'3 N
wi Ž t . s
2
qi log tan t , with Ý qi2 s 1. Ž 24 .
is1
The coordinate transformation from the standard closed FRW metric to Ž24. are given in
the Appendix of Ref. w18x, the coordinates z, x and y used here being just Euler angles
for S 3. In spite of the similarities with the flat case, the structure of cosmological
singularities is now richer. Studying the scalar curvature R we find that the metric Ž24.
has curvature singularities at t s l p2 with l being an integer. We have a bouncing
universe that evolves from a Big-Bang singularity at t s l p into a Big-Crunch at
t s Ž2 l q 1. p2 .
4
Actually, it is easy to realize that, provided this curvature invariant is finite, so are all other scalars
™
constructed from contractions of any number of Riemann tensors. In our case this follows from the fact that
R 1a;1 b is finite when t 0, where the index 1 denotes the dynamical internal dimension w 1.
414 ´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420
where the same definition for the pi as in Subsection 3.1, satisfying the condition Ž21.,
has been used. From inspection of Ž25. we might be tempted to infer that the higher
™
dimensional theory will be singular again when t s l p2 . Computing the curvature
invariant when t 0q we get Eq. Ž22., as in the previous case. However, if we now
compute the same invariant close to the Big-Crunch singularity Ž t py
. we find ™
2
abcd y2Ž Sq3. y2 Ž Sy2.
Ra b c d R ; C Ž ypi . t qOw t x.
The situation is somewhat different as compared to the open case. There we found that
when the condition l s 1 is saturated by a single pi s 1, the initial Big-Bang singularity
disappears altogether by going to higher dimensions. Here we find that we do not get rid
of all curvature singularities, but only of ‘‘half’’ of them. The resulting geometry in this
case in only singular when t s p2 Ž2 l q 1., whereas those at t s p l are smeared in
4 q N dimensions. The curvature invariant now is given by Ž23. with the hyperbolic
cosine replaced by a trigonometric one. The reverse situation happens when we take
pi s y1 and pj s 0 Ž j / 0.: we avoid singularities located at t s p2 Ž2 l q 1., the
geometry being the same as for the previous case but now with t t q p2 .
In Ref. w17x a similar situation was noticed for a closed FRW cosmology that could
™
be resolved into a five-dimensional black hole interior with just one curvature singular-
ity in the past or in the future. The difference with the model analyzed here is that in the
case at hand our higher-dimensional universe still has a finite life, although it is doubled
with respect to the one of the four-dimensional geometry.
Finally, let us briefly analyze the case of cosmological models with flat spatial
sections. The four-dimensional metric and scalar fields system is given by
ds 2 s 2 t Ž ydt 2 q dz 2 q dx 2 q dy 2 . ,
'3 N
wi Ž t . s
2
qi log t , with Ý qi2 s 1, Ž 26 .
is1
We can transform this solution into a standard Kasner form by rewriting it in co-moving
time coordinates Žand re-scaling spatial coordinates. to give
N
2a0 2
2
ds4q 2
N s ydT q T Ž dX 2 q dY 2 q dZ 2 . q Ý T 2 a Ž dW i .
i , Ž 28 .
is1
´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420 415
where
N
1y Ý pj
js1 2 pi
a0 s N , ai s N Ž i s 1, . . . , N . , Ž 29 .
3y Ý pj 3y Ý pj
js1 js1
Using the definition of the Kasner exponents in terms of the original pi it is possible to
check that the only case in which we will have a regular geometry at t s 0 will again
occur if pi s 1 with all other pj Ž j / i . vanishing. In this case the resulting metric is
flat. Actually it can be seen to be Ž4 q N .-dimensional Minkowski space-time in Rindler
coordinates 5. The non-trivial topology of the internal dimensions will hinder global
identification of the manifold with static Minkowski solution.
4. M-theory connections
Let us particularize the study of the cosmological models of the previous section to
the case with N s 7 in which they can be interpreted as four-dimensional cosmologies
arising from a compactification of M-theory on T 7 s ŽS 1 . 7 with vanishing three-form. In
describing the moduli space of cosmological solutions coupled to scalar fields of the
form Ž14. we have two different possibilities. In the four-dimensional theory it seems
natural to take the coordinates qi Ž i s 1, . . . ,7. which characterize the seven independent
scalar fields coupled to gravity. Due to the global O Ž7. symmetry of the low-energy
action, there is a natural choice for the moduli space metric
7
I4 s Ý qi2 .
is1
The resulting geometry will only depend on qi through the quadratic form I4 as seen in
Eq. Ž15..
5
In Refs. w15,16x an inverse procedure was evoked by dimensionally reducing from a five-dimensional flat
model to four dimensions to argue that the cosmological singularity could be an artifact of dimensional
reduction.
416 ´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420
Incidentally, this moduli space metric is the same as the one obtained in w14x Žsee also
Refs. w19,20x. for the compactifications of M-theory on a seven-torus with A M N P s 0
using group theoretical considerations. The O Ž7. global symmetry will act linearly on
the pi ’s leaving I11 invariant. It is however important to notice that the eleven-dimen-
sional geometry do transform under O Ž7., in contrast to the four-dimensional one that
was a singlet under the action of this group. In eleven dimensions the only transforma-
tions that leave invariant the space-time metric are permutations of the pi ’s, which
generates the Weyl subgroup of the mapping class group of the seven-dimensional torus,
SLŽ7,Z..
We will consider the family of M-theory metrics obtained by taking N s 7 in Ž20..
These line elements are vacuum solution of eleven-dimensional supergravity everywhere
in the submanifold of the moduli space defined by I11 s 1. We can ask now about the
action on this metric of G 7 ' Weylw E7Ž7.ŽZ.x, the subgroup of U-duality transformations
preserving the straight torus T 7 s ŽS 1 . 7 and the vanishing of the three-form w14,19,20x,
which is generated by permutations of the pi ’s and the so-called 2r5 transformation
with s s p 1 q p 2 q p 3 . It is easy to show that G 7 leaves invariant the bilinear form I11
and it is thus a discrete finite subgroup of O Ž7. connecting physically equivalent
M-theory vacua.
In the following we will restrict our analysis to the open FRW metric of Subsection
3.1 although most of our results can be extended to the flat case as well. Acting with the
elements of G 7 on the different solutions characterized by pi 4 we can look how
sensitive M-theory physics is to the semiclassical geometry. The most striking fact we
find is that the U-duality group maps solutions with a Big-bang initial singularity into
geometries that are regular for all values of the cosmic time. For example, the metric
2
2
ds11 s 2 Ž sinht cosh2 t . Ž ydt 2 q dz 2 q sinh2 z dx 2 q cosh2 z dy 2 .
3
5 7
2 2 y 43 2
q Ž tanht . 3
Ý Ž dw i . q Ž tanht . Ý Ž dw i .
is1 is6
6
Since we are just performing a change of coordinates, I4 s I11. We will use however different notation to
indicate the coordinates used to write the quadratic form.
´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420 417
Ž 13 , 13 , 13 , 13 , 13 ,y 23 ,y 23 . ™ Ž ,y
1
3
2
3 ,y 23 , 13 , 13 , 13 , 13 . ™ Ž 1,0,0,0,0,0,0. .
2r5
™
metrics at large times is to a great extend left invariant by the action of G 7 . The general
feature of all solutions Ž20. when t ` is that the open three dimensional space-time
inflates while the internal torus reach a constant volume, namely
Vol 3D
;t 3,
Vol T 7
with t the co-moving time. Physically, what we have is a cosmological solution that
evolves from an eleven-dimensional regime in which all scale factors are of the same
size into an asymptotic state with describes a ‘‘large’’ four-dimensional expanding
universe and a ‘‘small’’ static seven-dimensional torus Žcf. Refs. w36,42,43x..
In the case of the flat solution, on the other hand, the large-time behavior of the
metric is not universal. As we saw in Subsection 3.3, in this case we can rewrite the
metric in the Kasner form Ž28. with exponents given by Ž29.. The U-duality group G 7
acts on these exponents in a rather complicated way through the transformation of the
moduli space coordinates pi 4 Žcf. Ref. w14x.. Since the sum Ý7is1 pi is not invariant
under the 2r5 transformation, the asymptotic form of the metrics Ž27. or Ž28. will be
sensitive to rotations by elements of the U-duality group. In addition to this, the internal
manifold will not remain ‘‘small’’ in general. We will have the usual behavior of any
Kasner universe, with at least one contracting direction and a number of expanding ones.
The main characteristic of the non-singular M-theory cosmologies that we have
obtained is that in all cases there is only one time-varying coordinate, while the
remaining ones are constant during the evolution of the Universe. In trying to make a
string theory interpretation of such solutions it seems natural to identify the dynamical
coordinate with the eleventh dimension associated with the dilaton field. If we do so and
perform a dimensional reduction of the open Ž k s y1. regular eleven-dimensional
solution down to ten dimensions, the resulting metric and dilaton in string frame are7
6
2 2
dsstring s sinh2 t Ž ydt 2 q dz 2 q sinh2 z dx 2 q cosh2 z dy 2 . q tanht Ý Ž dw i . ,
is1
f Ž t . s 32 log tanht
which corresponds to a string background with three open spatial dimensions and
another six compactified on a torus whose dynamics is characterized by a global
breathing mode Ža situation extensively studied in the Kaluza–Klein and string cosmol-
2 4
7 2
The appropriate Kaluza–Klein ansatz in this case is ds11 s ey 3 f ds10
2
q e 3 f dw 2 .
418 ´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420
™
ogy literature w22–35,39–41x.. The geometry is singular when t 0 but, on the other, at
large times we find again an inflating three-dimensional space together with a frozen
internal torus, now also with a constant dilaton. Notice, however, that M-theory
U-duality maps these kind of singular string cosmologies into the regular ones that are
obtained by identifying the dilaton field with one of the static circles. These transforma-
tions involve the interchange of the dilaton with other moduli fields and thus are
intrinsically non-perturbative from the point of view of string theory.
Incidentally, let us remark that because of the invariance of I11 ' l under O Ž7., the
family of solutions Ž17. is stable under the U-duality group which thus maps solutions
into solutions Žcf. Ref. w14x.. This is what one should expect, since G 7 is a symmetry of
M-theory on a straight torus with a vanishing three-form.
5. Concluding remarks
In this paper we have been concerned with four-dimensional cosmologies that arise
from the compactification of stringrM-theory on a straight torus. We have given a
general algorithm to generate these modular cosmologies in four dimensions with two
commuting isometries. Using this technique we have also constructed FRW metrics and
observed that, although they are singular in four dimensions, in the open Ž k s y1. and
flat Ž k s 0. cases their lifted higher-dimensional vacuum images have a regular curva-
ture invariant Ra b c d R a b c d. In the case of closed FRW cosmologies Ž k s 1., singulari-
ties are not removed but the lifetime of the higher-dimensional universe Ži.e. the time
elapsed from the Big-Bang to the Big-Crunch. is twice that of their four-dimensional
versions.
It is important to stress, however, that in general the homogeneous character of the
scale factors of the extra dimensions derived from Ž14. does not ensure the homogeneity
of the reduced four-dimensional cosmology. Given the absence of a singularity-free
higher-dimensional ‘‘parent’’ cosmology for the four-dimensional inhomogeneous solu-
tions, it is remarkable that the regularity condition for the space-time in higher
dimensions translates itself into isotropy of the four-dimensional solution. Thus, the
initially isotropic universe could be naturally chosen by the requirement of regularity of
the higher-dimensional space-time.
We have also studied four-dimensional modular cosmologies related to toroidal
compactifications of stringrM-theory. In the case of M-theory on a straight seven-torus
with vanishing three-form we have found open and flat singular metrics that can be
rotated into regular ones by the elements of the U-duality group G 7 , thus indicating that
the physics of a certain class of singular universe can be described in terms of a dual
regular geometry. In the case of homogeneous open universes, however, the large time
asymptotic behavior of the metric is insensitive to the action of G 7 . In all cases we find
an inflating three-dimensional space with k s y1 and an internal torus whose size
asymptotically reaches a constant value. It is remarkable that U-duality is able to relate
geometries with such a different behavior near t s 0 without modifying the dynamics of
the universe at large times. Incidentally, this stabilization of the internal dimensions is
produced by the presence of the positive spatial curvature and it is absent in the case of
the flat solutions.
´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420 419
Because of the O Ž7. invariance of the dimensionally reduced cosmologies, all the
eleven-dimensional metrics labeled by pi 4 and the same value of I11 will produce
exactly the same geometry in four dimensions, although it will be coupled to scalar
fields with different amplitudes in each case. In particular, using the classical symmetry
of the four-dimensional action any solution in our family with I4 s 1 can be rotated into
one with a regular higher-dimensional ‘‘parent’’ metric. However, since M-theory in
this background is not invariant under the full group O Ž7., one expects that quantum
effects will break this global symmetry down to the U-duality group G 7 which preserves
the lattice of charges. The situation is completely analogous to the classical invariance of
the four-dimensional effective action under O Ž6,6;R. w21x, a symmetry that is broken
down to the T-duality group O Ž6,6;Z. by string effects.
Finally, it would be interesting to check whether our results can be extended to
compactifications of M-theory on a generic seven-torus with a non-vanishing three-form.
From our analysis it is clear at least that there will be solutions of this kind that can be
transformed into non-singular universes by U-duality, as can be seen by acting on the
regular metric with the Borel generators of E7Ž7.ŽZ. w38x.
Acknowledgements
References
w1x ¨ S. Mukherji, C.N. Pope, K.-W. Xu, Phys. Rev. D 55 Ž1997. 7926. hep-thr9610107.
H. Lu,
w2x A. Lukas, B.A. Ovrut, D. Waldram, Nucl. Phys. B 495 Ž1997. 365. hep-thr9610238.
w3x A. Lukas, B. Ovrut, Phys. Lett. B 437 Ž1998. 291. hep-thr9709030.
w4x N. Kaloper, I. Kogan, K.A. Olive, Phys. Rev. D 57 Ž1998. 7340. hep-thr9711027.
w5x K. Benakli, Int. J. Mod. Phys. D 8 Ž1999. 153. hep-thr9804096.
w6x A. Lukas, B.A. Ovrut, D. Waldram, Phys. Rev. D 60 Ž1999. 086001. hep-thr9806022.
w7x H.S. Reall, Phys. Rev. D 59 Ž1999. 103506. hep-thr9809195.
w8x M. Maggiore, A. Riotto, Nucl. Phys. B 548 Ž1999. 427. hep-thr9811089.
w9x T. Appelquist, A. Chodos, P.G.O. Freund, Modern Kaluza–Klein Theories ŽAddison-Wesley, New York,
1987..
w10x J.M. Overduin, P.S. Wesson, Phys. Rep. 283 Ž1997. 303. hep-thr9805018.
w11x E. Witten, Nucl. Phys. B 186 Ž1981. 412.
w12x E. Witten, Fermion quantum numbers in Kaluza–Klein theory, Proceedings of the Shelter Island II
Conference ŽM.I.T. Press, 1985..
w13x ´
A. Feinstein, R. Lazkoz, M.A. Vazquez-Mozo, Phys. Rev. D 56 Ž1997. 5166. hep-thr9704173.
w14x T. Banks, W. Fischler, L. Motl, J. High Energy Physics 01 Ž1999. 019. hep-thr9811194.
w15x G.W. Gibbons, P.K. Townsend, Nucl. Phys. B 282 Ž1987. 610.
w16x G.W. Gibbons, G.T. Horowitz, P.K. Townsend, Class. Quant. Grav. 12 Ž1995. 297. hep-thr9410073.
w17x F. Larsen, F. Wilczek, Phys. Rev. D 55 Ž1997. 4591. hep-thr9610252.
420 ´
A. Feinstein, M.A. Vazquez-Mozor Nuclear Physics B 568 (2000) 405–420
Abstract
We report the results of a new accurate evaluation of light nuclei yields in primordial
nucleosynthesis. All radiative effects, finite nucleon mass, thermal and plasma corrections are
included in the proton to neutron conversion rates. The relic densities of 4 He, D and 7 Li have been
numerically obtained Õia a new updated version of the standard BBN code. In particular the
theoretical uncertainty on 4 He is reduced to the order of 0.1%. q 2000 Elsevier Science B.V. All
rights reserved.
1. Introduction
Big Bang Nucleosynthesis ŽBBN. still represents one of the key subject of modern
cosmology even if its clear understanding traces back to over 25 years ago w1–3x. The
reason for this relies on the fact that BBN is one of the most powerful tools to study
fundamental interactions, since light nuclei abundances are crucially depending on many
elementary particle properties. As a well-known example, the 4 He abundance is strongly
affected by the number of effectiÕe neutrino degrees of freedom, but others fascinating
phenomena such as neutrino degeneracy or oscillation phenomena can be studied too,
using the universe few seconds after the bang as a laboratory.
In the recent years, the experimental accuracy of the light primordial nuclei abun-
dances, mainly the one of 4 He, underwent a sort of revolution. The qualitative results of
not too many years ago, suggesting that the 4 He mass fraction Y4 was of the order of
0.25, recently turned in measurements with accuracies of the order of one percent. A
similar good improvement has been obtained in both deuterium ŽD. and 7 Li abundances
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 2 5 - 7
422 S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444
Y2 ' DrH and Y7 '7 LirH. In particular for D, measurements in distant Quasars
Absorption line Systems ŽQAS. now represent a reliable estimate of the primordial value
for Y2 , which is only lowered by subsequent stellar processing. Paradoxically, the
refinement of these experimental techniques, due to the uncertainties in the models
describing stellar activity, is at the basis of large discrepancies between different set of
results. Such discrepancies are possibly of systematic origin, or may reveal new aspects
of cosmological evolution of the universe. The observations of Y4 from regression to
zero metallicity in blue compact galaxies in two independent surveys still produce two
incompatible results, a low value w4–6x,
™ e q p,
ne q n y
™ e q n,
q
e q p ™ n q n,
ne qp
n ™ e q n qp,
Ž a. Ž d.
y y
e q n ™ n qp, e q n qp ™ n,
Ž b. e Ž e. e Ž 1.7 .
q y
Ž c. e Ž f. e
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 423
which fix at the freeze out temperature ; 1 MeV the neutron to proton density ratio.
The Born rates, obtained in the tree level V y A limit and with infinite nucleon mass,
have been corrected to take into account basically three classes of relevant effects:
Ži. electromagnetic radiative corrections, which largely contribute to the rates of the
fundamental processes, in particular in the low temperature regime, T ( 0.1 MeV;
Žii. finite nucleon mass corrections, which are of the order of TrMN or m erMN ,
with m e , MN the electron and nucleon mass, respectively;
Žiii. plasma effects, proportional to the surrounding plasma temperature, which both
affect the microscopic process rates Ža. – Žf., as well as the neutrino to photon
temperature ratio through e ", g equations of state.
The other main source of theoretical uncertainty comes from the partial knowledge of
nuclear rates relevant for nuclei formation. Their numerical expressions, obtained by a
convolution of the experimental data with a Boltzmann distribution, are affected by
uncertainties of the order of 10% Žsee references quoted in w24x.. More crucially, in
many cases, these fits are known to well describe the data in a temperature interval
which is only partially overlapping the one relevant for BBN, 0.01 MeV ( T ( 10 MeV.
However, both a Monte Carlo analysis to sample the error distribution of the reaction
cross sections w25,26x, and a more recent method based on linear error propagation w27x,
show that, in particular for 4 He mass fraction, the effect is at most as large as the one
due to the uncertainty on neutron lifetime tn , and smaller than 1%. Therefore it is
theoretically justified to look, as in w23x, for all sources of theoretical uncertainty up to
this level of precision. The situation gets worse with D and 7 Li, where the uncertainties
due nuclear reactions can be as large as Ž10–30.% w27x.
This paper represents the natural companion to w23x. We have built a new updated
version of the standard BBN code, which is available since many years w1–3,24x, where
all corrections Ži. – Žiii. have been included. In particular we have also included the
l
modified e ", g equations of state due to electromagnetic mass renormalization. In
Section 2 we review the corrections to n p Born rates, while in Section 4 we discuss
the numerical method we have used to integrate the set of equations relevant for BBN,
which are described in Section 3. The numerical results for light nuclei abundances, as
functions of the final baryon to photon density ratio, h , the number of effective neutrino
degrees of freedom, Nn , and the neutron lifetime, tn , are reported in Section 5, where
they are discussed and compared with the experimental data. We have also performed a
fit of these abundances with a precision of the order of 0.1% in the interesting range for
the parameters h , Nn and tn . Finally in Section 6 we give our conclusions.
is necessary, though not sufficient, to have a reliable evaluation of the rates for the
processes Ž1.7.. An effort in this direction has been pursued in the last ten years by
many authors. Recently, the entire set of corrections to the Born rates v B at the level of
1% accuracy have been recalculated in w23,28x, with quite compatible results. In this
section we shortly summarize the main corrections Dvrv B coming from considering
radiative, finite nucleon mass, and thermal effects. This short review is here included for
the sake of completeness and to fix the notation. A detailed discussion of the subject can
be found in our paper w23x.
Let us consider as an example, the thermal averaged rate per nucleon for the neutron
decay process Že.. In the simple V y A tree level, and in the limit of infinite nucleon
mass, which we will refer to as Born approximation, one has
vB Ž n ™ e q n qp . s
y
e
GF2 Ž C V2 q 3CA2 .
2p 3
`
X X 2
H0 d < p < < p < q02 Q Ž q0 . 1 y Fn Ž q0 .
= 1 y Fe Ž pX0 . , Ž 2.1 .
where GF is the Fermi coupling constant, C V and CA the nucleon vector and axial
coupling. In our notation pX and pX0 are the electron momentum and energy, and q0 the
neutrino energy. The integration limits are imposed by the Q-function, q0 0 0. For
reaction Že. we have q0 s Mn y M p y pX0 ' D y pX0 . The Fermi statistical distributions
for e " and neutrinos in the comoÕing frame, neglecting chemical potentials, are
X y1 y1
Fe Ž pX0 . s w e b < p 0 < q 1 x , Fn Ž q0 . s w e bn < q 0 < q 1 x , Ž 2.2 .
with b s 1rT and bn s 1rTn .1 All other rates for processes Ža. – Žd., Žf. can be simply
obtained from Ž2.1. properly changing the statistical factors and the expression for q0
w23x.
The accuracy of Born approximation can be tested by comparing, for example, the
prediction for neutron lifetime with the experimental value tnex s Ž886.7 " 1.9. s w29x.
Using C V s 0.9751 " 0.0006 and CArC V s 1.2601 " 0.0025 w29x, Eq. Ž2.1. in the
vanishing density limit gives tn , 961 s. Therefore, to recover the experimental value, a
correction of about 8% is expected to come from radiative andror finite nucleon mass
effects. In the same way these corrections are also expected to contribute to all six
processes Ža. – Žf. relevant for BBN. In addition to these, microscopic n p reactions
taking place in the early universe, also feel the presence of the surrounding plasma of g
l
and e " pairs in thermodynamical equilibrium. Emission and absorption of real g or e "
from the thermal bath can be taken into account using the finite temperature field theory
in the real time formalism. This has been considered by several authors w30–45x, and
recently in w23x.
1
The ratio Tn r T is fixed by entropy conservation and using the neutrino decoupling temperature w23x Žsee
Section 2.4.1..
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 425
where the first term is the short-distance contribution and A g s y0.34 is a perturbative
QCD correction. The other two terms are related to the axial-induced contributions, with
MA s 400–1600 MeV a low energy cut-off in the short-distance part of the g W box
diagram, and C related to the remaining long distance term.
The global effect of these two kind of corrections, improved by resumming all
leading logarithmic corrections a n ln n Ž MZ . w48x, is Õia the multiplicative factor
a Mp a Ž Mp .
G Ž pX0 ,q0 . s 1 q g Ž pX0 ,q0 . qA g
2p ž ln
MA /
q2C q
2p
S Ž M p , MZ . ,
Ž 2.5 .
where a Ž m . is the QED running coupling constant defined in the MS scheme and
SŽ M p , MZ . a short distance rescaling factor, defined in w23x.
Another effect to be considered, which can be in fact as large as few percents of the
Born rates, is the so-called Coulomb correction, due to the rescattering of the electron in
the field of the proton and leading to the Fermi function for Coulomb scattering
pX0
F Ž pX0 . , 1 qap X .
ž / Ž 2.6 .
<p <
However, this effect is only present when both electron and proton are in either the
initial or final states, namely it only corrects the amplitudes of processes Ža., Žb., Že. and
Žf..
One may wonder if including the effects given by Ž2.5. and Ž2.6. the theoretical
prediction for neutron decay is now compatible with the experimental results. Evaluating
numerically the integral over the phase space one finds tnth s 893.9 s, still at variance
with the experiment. Even adding all known sub-leading effects the agreement does not
really improve w49x. As in Ref. w23x we take the point of view of rescaling all the rates
Ž1.7., after including finite nucleon mass corrections Žsee Section 2.3., by the constant
426 S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444
l
There are three additional contributions to the n p rates which appear when one
relaxes the approximation of infinitely massive nucleons. The leading effects are
proportional to m erMN or TrMN , which, in the temperature range relevant for BBN,
can be as large as the radiative corrections considered in Section 2.2. This has been first
pointed out in w50,51x and then also numerically evaluated in w23x. At order 1rMN there
are new couplings appearing in the expression of the weak hadronic current, the larger
one coming from the weak magnetic moments of nucleons
GF f 2 X n X
Jmwm s i
'2 MN u p Ž p . smn Ž p y q . u n Ž q . , Ž 2.7 .
where, from CVC, f 2 s Vu d Ž m p y m n .r2 s 1.81Vu d . Both scalar and pseudoscalar con-
tributions can be shown to be much smaller and negligible for the accuracy we are
interested in. At the same order in inverse nucleon mass power it is also necessary to
include the deformation of the allowed phase space for the relevant scattering and decay
Fig. 1. The total Born rates, v B , for n ™ p Žsolid line. and p ™ n transitions Ždashed line.. This notation is
adopted hereafter.
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 427
Fig. 3. The finite nucleon mass corrections to Born rates, Dv M r v B , for n l p transitions.
428 S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444
The third effect is due to the initial nucleon thermal distribution. In the infinite
nucleon mass limit, the average of weak rates over nucleon distribution is in fact trivial,
since the nucleon is at rest in any frame. For finite MN , by considering only 1rMN
terms, the effect of the thermal average over the thermal spreading of the nucleon
velocity produces a purely kinetic correction Dv K , whose expression can be reduced to
a one-dimensional integral over electron momentum which can be numerically evalu-
ated. The explicit expression, which we do not report for brevity can be found in
Subsection 4.2 and Appendix C of Ref. w23x. The ratios Dv K rv B for n p are
reported in Fig. 4. Their size is rapidly growing with temperature, since they are
l
proportional to the ratio TrMN .
l
The n p rates, calculated as the processes would occur in vacuum, get slight
corrections from the presence of the surrounding plasma of e " pairs and g . These are
the so-called thermal-radiatiÕe effects.
To compute these corrections one may use the standard real time formalism for finite
temperature field theory. According to this scheme, field propagators get additional
contributions proportional to the number density of that particular specie in the
surrounding medium. For g and e " we have
i d Ž k2 . i
i Dgmn Ž k . s y 2
q 2p b <k0<
g mn s y q 2p d Ž k 2 . B Ž k 0 . g mn ,
k e y1 k2
Ž 2.8 .
i
i S e Ž pX . s X y 2p d Ž pX 2 y m2e . Fe Ž pX0 . Ž pu X q m e . . Ž 2.9 .
pu y m e
The entire set of thermal corrections Dv T R , at first order in its typical scale factor, i.e.
a Trm e , have been computed by several authors w30–45x with quite different results. We
have recently performed this lengthy calculation in w23x, to which we refer for all details,
and we have found a good agreement with the original result of Ref. w30x, namely that
they contribute to correct the Born rates only for less than 1%.
Tn
T
s~ž¢ I Ž xgD . q 2 I Ž x eD . / T ( TD
, Ž 2.10 .
1 T ) TD
with
` 1r2 y1
IŽ x. s H0 Ž y 2 q 2 yx . Ž 4 y 2 q 8 yx q 3 x 2 . exp Ž x q y . " 1 dy. Ž 2.11 .
According to our notation, x a ' maRrT and x aD ' maRrTD with a s g ,e Žq or y in the
above integrand is for fermions or bosons, respectively.. Note that mgR and m eR are the
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 429
vŽ n ™ p . s t1 expŽ yq
n
ex np z.
13
Ý a l zyl ,
ls0
0.01 MeV ( T ( 10 MeV, Ž 2.12 .
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 431
°1
v Ž p ™ n . s~ t
13
ex
exp Ž yq p n z . Ý bl zyl 0.1 MeV ( T ( 10 MeV
Ž 2.13 .
¢ n
0
ls1
0.01 MeV ( T - 0.1 MeV
with
a0 s 1 a1 s 0.160615 a 2 s 0.456817 = 10 1
a3 s y0.401109 = 10 2 a 4 s 0.137254 = 10 3 a5 s y0.583644 = 10 2
a6 s 0.655938 = 10 2 a 7 s y0.162185 = 10 2 a8 s 0.371109 = 10 1
a9 s y0.378497 a10 s 0.223840 = 10y1 a11 s 0.723091 = 10y5
a12 s y0.462476 = 10y4 a13 s 0.186287 = 10y5 qn p s 0.340994,
Ž 2.14 .
The fit has been obtained requiring that the fitting functions differ by less than 0.1%
™™
from the numerical values, while it is also a good approximation to consider a vanishing
rate v Ž p n. for T ( 0.1 MeV, see Eq. Ž2.13., since it is a rapidly decreasing function
with T 0.
Let us consider Nnuc species of nuclides, whose number densities, X i s n irn B , are
normalized with respect to the total baryon density n B . The different nuclides are
ordered in the following way: n, H, D, 3 H, 3 He, 4 He, 6 Li, 7 Li, 7 Be, . . . Žfor the complete
list see Ref. w24x.. Denoting with RŽ t . the universe scale factor, the BBN set of
equations as functions of R, n B , T, X i , and of the electron chemical potential
f e ' m erT reads 2
Ṙ 8p
R
s ( 3 MP2
w rg q r e q rn q rB x 1r2 , Ž 3.1 .
2
We are using natural units " s cs k B s1.
432 S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444
n˙ B R˙ 24p
nB
s y3
R
sy ( MP2
w rg q r e q rn q rB x 1r2 , Ž 3.2 .
me p 2 nB
L ž T
, fe s / 2 T3
Ý Zj X j ,
j
Ž 3.3 .
Ṙ
T˙s y 3
½ R
rg qpg qr e qp e qQ Ž T y TD . Ž rn qpn . qp B
Er e Ef R˙ Ef e
q
Ef e ž j
e
Ý E X X˙j y 3 R n B
j E nB / q nB Ý Ž D M j q 32 T . X˙j
j
5
y1
d rg Er e Er e Ef e d rn
= ½ dT
q
ET
q
Ef e E T
q Q Ž T y TD .
dT
q nB 3
2 Ý Xj
j
5 , Ž 3.4 .
X lNl X kNk X iN i X jN j
Ni G k l ™ i j y Gi j ™ k l
Ẋ i s Ý
j, k ,l
ž Nl ! Nk ! Ni ! Nj ! / ' Gi Ž X j . . Ž 3.5 .
In the previous relations r and p denote the energy density and the pressure of an
arbitrary particle specie. The function LŽ z, y . in Ž3.3. is defined as
1 ` ey eyy
LŽ z , y . ' Hz dx x 'x 2 y z 2 ž y / , Ž 3.6 .
2 e xqe y e x q eyy
i, j,k,l s 1, . . . , Nnuc , and the ith nuclide, with charge and atomic number Ž Zi , A i ., has
mass Mi and mass excess Ž Mu is the atomic mass unit.
D Mi s Mi yA i Mu . Ž 3.7 .
Moreover, in Ž3.5. we are considering in the sum a reaction between Ni nuclides of type
i and Nj of type j which results in Nl nuclides of type l and Nk of type k, with its
reverse reaction. The energy density and the pressure of baryons take the form
r B s n B Mu q Ý Ž D M j q 32 T . X j , Ž 3.8 .
j
pB s nB T Ý Xj . Ž 3.9 .
j
Eq. Ž3.1. is easily recognized as the Friedmann equation where we have neglected for
simplicity the cosmological constant. Eq. Ž3.2. rules the scaling on n B , whereas Ž3.3.
states the neutrality of primordial plasma. From entropy conservation one gets Ž3.4., and
Ž3.5. are the Boltzmann equations for the Nnuc nuclide number densities. Note that the
presence of the Q-function in Ž3.4. is connected with neutrino decoupling at T s TD .3
3
We have assumed that all neutrinos decouple at the same temperature TD . Actually muon and tau
neutrinos decouple at a slightly larger temperature of 3.5 MeV, but nevertheless our approximation is largely
consistent with the required precision on 4 He yields.
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 433
In the set of equations Ž3.1. – Ž3.5. one can safely substitute Eq. Ž3.3. with an
analogous relation, obtained expanding the l.h.s. of Ž3.3. with respect to f e ,
` ex
L Ž z , y . , y H dx x 'x 2 y z 2 2
' y f y1 Ž z . . Ž 3.10 .
z Ž e x q 1.
In this case, Eq. Ž3.3. provides an explicit expression for f e s f e ŽT,n B , X j .,
p2 me nB
fe ,
2
f ž /
T T3
Ý Zj X j .
j
Ž 3.11 .
The consistency of this approach has been tested by means of an iterative check.
The set of Eqs. Ž3.1. – Ž3.5. can be transformed in a set of Nnuc q 1 differential
equations with the dimensionless variable z s m erT as the evolution parameter. For
numerical reasons, it is also better to turn the variable n B into the dimensionless quantity
hˆ ' n BrT 3 , which varies more slowly with z than n B . In terms of these new variables
the BBN set of equations becomes
dhˆ 3hˆ
s 1 y Hˆ z ,h,
ˆ X j G z ,h,
ž ˆ Xj / ž / , Ž 3.12 .
dz z
dX i Gˆi
ˆ Xj
s G z ,h,
ž / , Ž 3.13 .
dz z
where the function G is
ž ˆ Xj
G z ,h, /
° Erˆa 3
hˆ Ý X j
¶
s ~ a
ž
Ý 4 rˆa y z E z / q 4Q Ž z D y z . rˆn q
2 j
•.
4 3
¢3 Ý Ž rˆa q pˆ a . q 3 Q Ž z D y z . rˆn q hˆ Ý X j Hˆ q hˆ Ý ž z D Mˆ j q
2 / ˆß
Gj
a j j
Ž 3.14 .
In the previous equations z D s m erTD , a s e,g , and we have considered the dimension-
less Hubble parameter Hˆ s Hrm e ,
1r2
8p m e 1 3
Hˆ z ,h,
ž ˆ Xj s / ( 3 MP z 2
rˆg q rˆ e q rˆn q hˆ zMˆ u q Ý z D Mˆ j q
ž j
ž 2 / /Xj ,
Ž 3.15 .
and the quantities Mˆ u s Murm e , D Mˆ j s D M jrm e , Gˆj s Gjrm e . Energy densities and
pressures have been adimensionalized dividing by T 4 . In Eqs. Ž3.12. and Ž3.13. we have
neglected the terms containing the derivatives of chemical potential. In Appendix A we
report the expressions for pˆ a and rˆa evaluated taking into account the g and e "
434 S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444
Ž 3.17 .
where Bi denotes the binding energy.
4. Numerical method
The most critical part of the BBN code concerns the method of numerical resolution
of the set of differential Eqs. Ž3.12., Ž3.13.. In fact, since at high temperatures nuclear
reactions proceed in both forward and reverse directions with almost equal rapidity, the
r.h.s. of Ž3.13. results to be a small difference of large numbers. When this occurs the
numerical problem is said to be stiff. As a consequence, the step size is limited more
severely by the requirement of stability than by the accuracy of the numerical technique.
In other words, to preserve integration stability it is required to use a shorter step size
than what would be dictated by accuracy only. In order to manage the problem, we use a
NAG routine implementing a method belonging to the class of Backward Differentiation
Formulas ŽBDFs. w53x. This is quite a new approach for BBN codes. In fact the standard
code w1–3,24x uses instead the implicit differentiating method Žbackward Euler scheme.
w53x for writing the r.h.s. of Ž3.13. and a Runge–Kutta solver.
Few comments on the different numerical methods are in order. Let us consider the
differential equation
dy Ž t .
sf Ž t, yŽ t. . . Ž 4.1 .
dt
In the Runge–Kutta methods the solution at t iq1 is completely determined by its value
at t i Žone-step methods., namely the solver has no memory. A different approach is
provided by a wide class of numerical methods referred to as multistep methods like
BDFs. Here, the values of the solution at t k Ž k s i,i y 1, . . . ,i y p ., y Ž t k . ' y k , previ-
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 435
ously computed, and the unknown value y Ž t iq1 . ' yiq1 , are interpolated by a polyno-
mial, P Ž t; yiq1 , yi , . . . ., in order to approximate the solution and its derivative. Substi-
tuting in the differential equation,
dP
Ž t iq1 ; yiq1 , yi , . . . . , f Ž t iq1 , yiq1 . , Ž 4.2 .
dt
one obtains a family of BDFs,
Ž t iq1 y t i . f Ž t iq1 , yiq1 . , P Ž t iq1 ; yiq1 , yi , . . . .
y P Ž t i ; yiq1 , yi , . . . . s a 0 yiq1 q a 1 yi q . . . . Ž 4.3 .
Two methods can be used for solving the previous equation in the implicit case, a 0 / 0:
functional iteration and Newton’s method. In the former case some initial guess is taken
for yiq1 and refined by iteration. In the latter case, one linearizes Eq. Ž4.3. by expanding
f around yi . The new point, yiq1 , is then found by inverting a matrix, in a way similar
to the backward Euler scheme. The NAG routine implements both methods and
incorporates an error control test, which drives the step-size adjustment.
The nuclear reaction network used in the code includes all the 88 reactions between
the 26 nuclides present in the standard code w1–3,24x. We used the same nuclear rate
data of the standard code, which are collected and updated in w54x. In order to reduce the
computation time one can also choose a reduced network, made of the 25 reactions
Table 1
The reduced network of nuclear reactions
l
l
Ž1. n p
l
3
Ž2. T He
l
Ž3. pq n Dqg
l
Ž4. nq D T qg
l
Ž5. nq3 He 4
Heqg
l
Ž6. nq6 Li 7
Liqg
l
Ž7. nq3 He Tq p
l
Ž8. nq7 Be 7
Liq p
l
Ž9. nq6 Li 4
HeqT
l
Ž10. nq7 Be 4
Heq4 He
l
3
Ž11. pq D Heqg
l
4
Ž12. pqT Heqg
l
Ž13. pq6 Li 7
Beqg
l
Ž14. pq6 Li 4
Heq3 He
l
Ž15. pq7 Li 4
Heq4 He
l
Ž16. Dq4 He 6
Liqg
l
Ž17. T q4 He 7
Liqg
l
3
Ž18. Heq4 He 7
Beqg
l
3
Ž19. Dq D Heq n
l
Ž20. Dq D Tq p
l
4
Ž21. DqT Heq n
l
Ž22. Dq3 He 4
Heq p
l
3
Ž23. Heq3 He 4
Heq pq p
l
Ž24. Dq7 Li 4
Heq4 Heq n
Ž25. Dq7 Be 4
Heq4 Heq p
436 S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444
Table 2
The predictions on light element abundances obtained with the numerical code for h s 5=10y1 0 and Nn s 3.
In the lower rows the effect of the various corrections is reported
Y2 Y3 Y4 Y7
y4 y4
v Tot 0.3638=10 0.1175=10 0.2446 0.2814=10y9
vB 0.3727=10y4 0.1184=10y4 0.2550 0.2873=10y9
Dv R y2.3% y2.8% y3.8% y1.9%
Dv M 0.2% 0.1% 0.3% 0.2%
Dv K 0.2% 0.1% 0.3% 0.2%
Dv T R y0.6% y0.1% y0.7% y0.4%
dt v T y0.3% y0.1% y0.6% y0.3%
between 9 nuclides listed in Table 1. Using the complete network we have verified that
the reduced one affects the abundances for no more than 0.01 %, while it greatly reduces
the evaluation time.
l
The reliable numerical code just discussed can now be used to study the effect of the
different corrections to n p Born rates on light elements abundances. By definition
X3 X5 M6 X6 X8
Y2 s , Y3 s , Y4 s , Y7 s . Ž 5.1 .
X2 X2 Ý Mj X j X2
j
l
In the first two rows of Table 2 are shown the predictions for Y2 , Y3 , Y4 and Y7 at
h s 5 = 10y1 0 , corresponding to the complete n p rates, v Tot , and to the Born
™
approximation, v B .4 As is clear from Table 2, the main effect of the corrections, which
results into the enhancement of the n p conversion rate, is to allow a smaller number
of neutrons to survive till the onset of nucleosynthesis. This ends up in a smaller fraction
of elements which fix neutrons with respect to hydrogen.
The effects on light element yields due to the various corrections with respect to the
Born predictions are also reported in Table 2. For all nuclides the pure radiative
correction Dv R provides the dominant contribution, while the finite nucleon mass
effects, the kinetic and the thermal-radiative ones almost cancel each other. Finally the
last row reports the further contribution due to the additional term required to recover
the experimental neutron lifetime w23x.
If we make use of the results of w27x to quantify the uncertainties coming from
nuclear reaction processes, we observe that only for Y4 the radiative correction affects
the Born result by an amount larger than the theoretical uncertainties, including nuclear
reactions. For 4 He mass fraction in fact, the theoretical uncertainty due to nuclear
4
Note that, according to our notation, with v B we denote the pure Born predictions for n p rates l
without any constant rescaling of coupling to account for the experimental value of neutron lifetime Žsee
Section 2.2..
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 437
Fig. 7. The 4 He mass fraction, Y4 , versus h. The three solid lines are, from larger to lower values of Y4 , the
predictions corresponding to Nn s 3 and tnex s888.6 s, 886.7 s, 884.8 s, respectively. Analogously, the dashed
lines correspond to Nn s 4 and the dotted ones to Nn s 2. The dotted and dashed horizontal band are the
experimental values of Refs. w4–6x and w7,8x, respectively, with 1 s interval.
reaction rates is estimated to be of the order of 0.1% and thus comparable with the
uncertainty due to the experimental error on neutron lifetime. For D, 3 He and 7 Li the
uncertainty due to the poor knowledge of nuclear reaction rates is estimated to be of the
l
order of Ž10–30.% w27x, thus completely covering any radiativerthermal correction on
n p rates.
In Fig. 7 the predictions on Y4 are shown versus h for Nn s 2,3,4 and for a 1 s
variation of tnex . The two experimental estimates for the primordial 4 He mass fraction,
Y4Ž l . and Y4Ž h., as horizontal bands, are also reported. Fig. 8 and Fig. 9 show the
predictions for D and 7 Li abundances. Note that, due to the negligible variation of Y2
and Y7 on small tn changes, no splitting of predictions for 1 s variation of tnex is
present.
A fit, up to 1% accuracy, of the relevant observables Y2 , Y3 , Y4 and Y7 as a function
of x s log 10 Ž 10 10h . , Nn and tn has been performed. The following expressions have
been obtained:
4
10 3 P Y2 s Ý a i x i q a5 Ž Nn y 3. exp a6 x q a7 x 2 4 , Ž 5.2 .
is0
4
10 5 P Y3 s Ý a i x i q a5 Ž Nn y 3. exp a6 x 4 , Ž 5.3 .
is0
438 S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444
Fig. 8. The quantity Y2 versus h is reported. The notation used is the same of Fig. 8. Due to the negligible
dependence of Y2 on small variations of tnex no splitting of lines is present. The horizontal bands dashed and
dotted are the experimental values of Refs. w9–17x.
Fig. 9. The quantity Y7 versus h. The notation used is the same of Fig. 8. There is no splitting of lines related
to Dtn , due to the negligible dependence of Y7 on small variations of tnex . The horizontal bands dashed and
dotted are the experimental values of Refs. w18,19x and w21x, respectively.
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 439
Table 3
Values of the fit coefficients Ž5.2. – Ž5.5. for light element abundances
Coeff. 10 3PY2 10 5PY3 10PY4 10 9PY7
a0 0.4854 3.325 2.209 0.5419
a1 0.2919 0.1496 0.5548 y0.5981
a2 y0.3516 1.597 y0.6491 y1.914
a3 0.5048 y1.923 0.7661 4.521
a4 y0.4269 1.312 y0.5366 0.1587
a5 0.7772=10y1 0.1782 0.1614 y0.3256
a6 y4.397 y1.705 0.2059=10y2 y4.102
a7 0.5925 – 0.1300 5.072
a8 – – y0.4156=10y4 y1.209
a9 – – 0.7433=10y2 y0.6269
5
10 P Y4 s Ý a i x i q a6 Ž t y te x . q a7 Ž Nn y 3. q a8 x Ž t y te x . q a9 x Ž Nn y 3. ,
is0
Ž 5.4 .
3 4
10 9 P Y7 s Ý
is0
a i x i q a 4 Ž Nn y 3 . q a5 x Ž Nn y 3 . exp ½Ýis1
5
a5qi x i , Ž 5.5 .
where te x s 886.7 s and the values of the fit coefficients are reported in Table 3.
Neutrino decoupling has been shown by many authors w55–61x to be a process which
still takes place when e " pairs annihilate. This implies that neutrinos are in fact slightly
X
Fig. 10. The ratio Ž Y4 yY4 .r Y4 versus log 10 Ž10 10h . for Nn s 3 and tn s885.3 s w22x Žsee Section 5..
440 S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444
Fig. 11. The likelihood distributions for the light element yields Y2 , Y4 , Y7 are shown as functions of Nn and
log 10 Ž10 10h ., normalized to unity in correspondence of the experimental values. From left to right and from
top to bottom the following cases are considered: Ža. high D, low 4 He; Žb. high D, high 4 He; Žc. low D, low
4
He; Žd. low D, high 4 He. The plots for cases Žc. and Žd. are rescaled by a factor 25 and 100 times,
respectively, compared to the one of Ža. and Žb..
reheated during this annihilation process and their final distribution in momentum space
shows an interesting non-equilibrium shape. In Ref. w62x it is estimated that the effect on
Y4 due to the inclusion of this slight neutrino heating is very small, d Y4 ; 1.5 = 10y4 , in
the whole range 10y1 0 ( h ( 10y9 . We have included this constant correction to Y4
prediction.
From the fit reported in Eq. Ž5.4. it is easy to quantify the theoretical error on Y4 .
Since this is basically due to the uncertainty on tn we have
D Y4 Ž a6 q a8 x . Dtn
s ( 0.1%. Ž 5.6 .
Y4 10Y4
In Fig. 10, our prediction for Y4 of Eq. Ž5.4. with Nn s 3 and tn s 885.3 s, is compared
with an analogous fit, Y4X , performed in w22x. The agreement between the two expressions
obtained by independent codes is up to 1% in the relevant range for h.
S. Esposito et al.r Nuclear Physics B 568 (2000) 421–444 441
2
2 s42 5
y Ž Y7 Ž Nn , x . y Y7ex .
=exp
½ 2 s 72 5 . Ž 5.7 .
Notice that all functions have been normalized to unity in the maximum. As is clear
from the plots, the analysis prefers the high value of D Žplots Ža. and Žb... In both cases
the distributions are centered in the range x g 0.2–0.4, but at Nn ; 3 for low 4 He and
Nn ; 3.5 for high 4 He. For low D the compatibility with experimental data is worse.
Note that Žc. and Žd. distributions have been multiplied by a factor of 25 and 100
respectively and centered in the range x g 0.6–0.8, and at Nn ; 2 for low 4 He and
Nn ; 3 for high 4 He. The better agreement at 1 s of the data sets Ža. and Žb. with the
theoretical predictions is basically due to the effect of 7 Li data which corresponds to
values for h compatible with low D data of Žc. and Žd. at 2 s only. It should be
mentioned however that these results only take into account experimental errors, so that
the confidence level regions in the Nn –x plane would be broader by convoluting the
considered distributions with the ones containing the theoretical error.
6. Conclusions
In this paper a detailed study of the effects on light element yields of the radiative,
finite nucleon mass, thermal and plasma corrections to Born rates Ž1.7. has been carried
out. The aim of such an analysis was to reduce the error on, basically, Y4 to less than
1%, which is motivated by the most recent experimental determinations for 4 He
abundance. This accurate analysis has been performed using an update version of the
BBN standard code w1–3,24x. A different numerical approach, based on BDF techniques,
has been implemented to solve the stiff Boltzmann equations for nuclei densities. The
numerical results for 4 He mass fraction almost confirm the computation reported in Ref.
w22x, while the theoretical error, also including the propagation of uncertainties on
nuclear processes, as estimated in w27x, is of the order of 0.1%. Our analysis shows that
the preferred experimental values are high value for D and low one for 4 He, in which
case the distribution is centered at x ; 0.3 and Nn ; 3.
a p2 ` z2 dx
½
m eR Ž z , y . , m e 1 q
pz 2
3
q
ž
Hz 2'x 2
yz2 q
( 2
2 y yz 2
log L
/ 1qe x 5
Ž A.2 .
,
References
a
ˆ CH-2000 Neuchatel,
Institute of Physics, UniÕersity of Neuchatel, ˆ Switzerland
b
Department of Physics, National Taiwan UniÕersity, Taipei 10764, Taiwan, China
Received 8 July 1999; accepted 25 October 1999
Abstract
1. Introduction
It was shown by Connes, Douglas and Schwarz w1x that for the Matrix model
compactified on a torus, the three-form C field background should be incorporated in
the supersymmetric Yang–Mills theory ŽSYM. by deforming the base space into a
quantum space. Due to the correspondence between the DLCQ M-theory and the
D-brane world-volume field theory w2x, this means that the D-brane field theory lives on
a non-commutative space in the presence of a NS–NS B field background w3–5x. In a
previous paper we quantized an open string ending on a D-brane with a constant NS–NS
B field background, and we found that the space-time coordinates of the open string
endpoint are non-commutative. The non-commutativity precisely agrees with previous
results w1x, but in addition we showed that the compactness of the torus is not necessary
for the non-commutativity, and in general the deformation is parametrized by F s B y F
instead of B.
The quantization of an open string that we did in Ref. w6x was carried out explicitly in
terms of a mode expansion of the generic solution of the equations of motion for the
space-time coordinates X m. We used a heuristic argument in the derivation of commuta-
tion relations among the mode coefficients, and then check that it is a consistent
quantization of the classical theory.
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 8 5 - 9
448 C.-S. Chu, P.-M. Ho r Nuclear Physics B 568 (2000) 447–456
In this paper we carry out the quantization again by following the standard quantiza-
tion procedure of Dirac for systems with constraints. The constraints here are, of course,
the boundary conditions of the open string ending on a D-brane. We find that there is a
single ambiguous expression which needs to be regularized. We obtain our previous
results by choosing a natural regularization. We also find the commutation relations for
the fermionic fields using supersymmetry.
2. Necessity of non-commutativity
where A i , i s 0,1, . . . , p, is the UŽ1. gauge field living on the D p-brane. We use the
convention h ab s diagŽy1,1. and e 01 s 1 as in w6x. Here the string background is
Gmn s hmn , F s constant, H s dB s 0. Ž 2.
Adding the fermionic sector does not affect the result and will be considered later. With
slight modification, the considerations in this paper can also be applied to study open
string ending on a D-brane in type I string theory.
If both ends of a string are attached to the same D p-brane, the last term in Ž1. can be
written as
y1 2 ab
HSd se Fi j Ea X i Eb X j . Ž 3.
4pa X
Furthermore, consider the case B s Ý i,p js0 Bi j dX i dX j, then the action Ž1. can be written
as
1
SB s y dt L s
H d 2s g a bhmn Ea X m Eb X n q e a bFi j Ea X i Eb X j .
H Ž 4.
4pa X
Here
F s B y dA s B y F Ž 5.
is the modified Born–Infeld field strength and x 0a is the location of the D-brane. Indices
are raised and lowered by hi j s Žy,q, . . . ,q ..
One obtains the equations of motion
Ž Et2 y Es2 . X m s 0 Ž 6.
and the boundary conditions at s s 0,p :
Es X i q Et X j Fj i s 0, i , j s 0,1, . . . , p, Ž 7.
X a s x 0a , a s p q 1, . . . , D. Ž 8.
C.-S. Chu, P.-M. Ho r Nuclear Physics B 568 (2000) 447–456 449
The constraint Ž8. is standard. We will be mainly interested in the constraint Ž7.. As
demonstrated in w6x, the BC Ž7. implies that
2pa X P k Ž t ,0 . Fk i s yEs X j Ž t ,0 . M ji , Ž 9.
where P k is the canonical momentum
2pa X P k Ž t , s . s Et X k q Es X j Fj k , Ž 10 .
and Mi j s hi j y Fi kFk j . It follows that
2pa X P k Ž t ,0 . , P j Ž t , s X . Fk i s yEs X k Ž t , s . , P j Ž t , s X . s s0
Mk i , Ž 11 .
and
2pa X P k Ž t ,0 . , X j Ž t , s X . Fk i s yEs X i Ž t , s . , X j Ž t , s X . s s0
. Ž 12 .
These simple relations show that the standard canonical commutation relations for
F s 0,
X i Ž t , s . , Pj Ž t , s X . s i d jid Ž s , s X . , Ž 13 .
Pi Ž t , s . , Pj Ž t , s X . s 0, Ž 14 .
X i Ž t , s . , X j Ž t , s X . s 0, Ž 15 .
are no longer valid when F / 0. They are not compatible with the boundary condition
Ž7. when F / 0. In particular, without doing any further calculations, one can see
already from Ž12. that the string coordinates must necessarily be non-commutative
somewhere along the string. A consistent quantization was therefore proposed in w6x and
it was shown that the canonical commutation relations are modified to
P i Ž t , s . , P j Ž t , s X . s 0, Ž 16 .
X kl
y1
X k Ž t , s . , X l Ž t , s X . s "2p i a Ž M F . , s s s X s 0 or p ,
0,½ otherwise,
Ž 17 .
X i Ž t , s . , P j Ž t , s X . s ih i jd˜ Ž s , s X . , Ž 18 .
where d˜ Ž s , s X . is the delta function on w0,p x with vanishing derivative at the boundary,
e.g. Es d˜ Ž s , s X . s 0 for s s 0,p . See Ref. w6x for its explicit form. Thus we see that the
string becomes non-commutative at the endpoint, i.e. the D-brane becomes non-com-
mutative. It was further shown that for the case F s B, Ž17. agrees with the results
obtained from other considerations w9,10x. The result of our quantization was later
confirmed in w11x using standard string perturbation theory.
Although our quantization in w6x is entirely consistent and produces results in
agreement with other considerations, the argument there was mainly based on intuitive
guess and consistency argument. It may perhaps be more satisfactory to give a
450 C.-S. Chu, P.-M. Ho r Nuclear Physics B 568 (2000) 447–456
‘‘derivation’’ of that result based on more traditional method. In the next section, we
will show that the result in w6x can also be obtained from the standard constraint
quantization of Dirac. Consequences of the quantization Ž16. – Ž18. in the boundary state
formalism Žclosed string. and D-brane physics are under investigation w12x; as well as
the quantization of charged open string w13–15x and its possible consequences1.
In this section, we compute the Dirac bracket starting from the standard Poisson
brackets
Ž X i Ž s . , Pj Ž s X . . s d jid Ž s , s X . , Ž 19 .
Ž Pi Ž s . , Pj Ž s X . . s 0, Ž 20 .
Ž X i Ž s . , X j Ž s X . . s 0. Ž 21 .
See Refs. w19,20x for a review of the Dirac procedure of constrained quantization. Since
Es X i q Et X j Fj i s 2pa X P j Fj i q Es X j M ji , Ž 22 .
the boundary condition is a constraint in the phase space
F i Ž 0 . s F i Ž p . s 0, i s 0,1, . . . , p, Ž 23 .
where we have introduced the notation
F i Ž s . s 2pa X P j Fj i q Es X j M ji . Ž 24 .
Using the Hamiltonian
1 2 2
HB s
4pa X
Hd s Ž Ž E t X . q Ž Es X . . , Ž 25 .
1
See in particular the following papers for some recent developments: Ref. w16x studies the closed string
interaction in the presence of non-commutative D-branes; the papers w17x and w18x study the effects of the
B-field in the boundary state formalism.
2
We thank A. Bilal for a helpful discussion about the issue of equations of motion in determining the
secondary constraints.
C.-S. Chu, P.-M. Ho r Nuclear Physics B 568 (2000) 447–456 451
One can then compute the Poisson matrix C Ž a k n .Ž b l m . of the constraints. The basic
ones are the C Ž a k 0.Ž b l 0. ones:
Ž F k Ž s . ,F l Ž s X . . s 2pa X Ž F M . k l Es d Ž s , s X . q Es X d Ž s X , s . , Ž 28 .
Ž F k Ž s . , Es P l Ž s X . . s M k l Es Es d Ž s , s X . ,
X X Ž 29 .
Ž Es P k Ž s . , Es P l Ž s X . . s 0
X Ž 30 .
and in general
C Ž a k n .Ž b l m . Ž s , s X . s Ž f Ž a k n . Ž s . , f Ž b l m . Ž s X . . s Es2 n Es2Xm C Ž a k 0.Ž b l 0. Ž s , s X . .
Ž 31 .
A distinct feature of our case is that the constraints Ž27. are imposed only at the
boundary of the open string. As a result, the Dirac bracket should be given by
Ž AŽ s . , B Ž s X . . s Ž AŽ s . , B Ž s X . . y Ý Ž AŽ s . , f Ž a k n . Ž s . .
U XX
s XXs XXX
CŽ a k n .Ž b l m . Ž s XX , s XXX .
0 y Ž My1 . k l R n m Ž s XX , s XXX .
s
ž Ž My1 . k l R n m Ž s XX , s XXX . 2pa X Ž F My1 . k l Sn m Ž s XX , s XXX . / Ž 33 .
ij
Ž X iŽ s . , X jŽ s X . . U
s y2pa X Ž My1F .
= Ý Es2XXnd Ž s , s XX . R n m Ž s XX , s XXX . Es2XXXmq1d Ž s X , s XXX .
XX XXX
s s
ij
Ž X i Ž p . y X i Ž 0. , X j Ž s X . . U
s y2pa X Ž My1F . Ý
XX XXX
Es2XXnq1d
s s
ij
s y2pa X Ž My1F . P Ž ds X 0 q ds X p . , Ž 39 .
where we have used in the last step above
Ý Es2XXnq1d Ž s , s XX . R n m Ž s XX , s XXX . Es2XXXmd Ž s X , s XXX . < sss p
s0 s ds X 0 q ds X p . Ž 40 .
s XXs XXX
Formally, if one exchange the order of integration and the sum in the above, one gets
zero times a delta function. Therefore Ž41. calls for a better justification. One way to
justify it is to use a lattice regularization by replacing the interval w0,p x by a lattice of M
equidistant points with spacing e s prM,
d Ž s ,s X . ™ 1e d ss X , Hd s ™ e Ý .
s
Ž 42 .
Since we know w6x from the D-brane field theory considerations that
U U
Ž X i Ž 0. , X j Ž 0. . s yŽ X i Ž p . , X j Ž p . . , Ž 43 .
it follows from Ž39. that the non-trivial Dirac bracket must be
ij
s y Ž X i Ž p . , X j Ž p . . s y2pa X Ž My1F . .
U U
Ž X i Ž 0. , X j Ž 0. . Ž 44 .
C.-S. Chu, P.-M. Ho r Nuclear Physics B 568 (2000) 447–456 453
Ž X iŽ s . , P jŽ s X . . U
s h i j Ž d Ž s ,s X . y QŽ s ,s X . . , Ž 45 .
where
QŽ s ,s X . s Ý Es2XXmq1d Ž s , s XX . R m k Ž s XX , s XXX . Es2XXXkq1d Ž s XXX , s X . . Ž 46 .
s XXs XXX
4. Inclusion of fermions
In a generic background, the fermionic part of a RNS open string 3 gets additional
couplings, for example
yi i ab
HB i jc e ra Eb c j . Ž 48 .
4pa X
The situation is much simpler for our case with the flat background Ž2. and the complete
supersymmetric Lagrangian can be easily written down. The equation of motion is not
modified
Ecqi s Ecyi s 0, Ž 49 .
where E s Et q Es , E s Et y Es . The open string world-sheet in the type IIand type 0 string
theories has the supersymmetry
d X m s ec m , dc m s yi r a Ea X me Ž 50 .
with the preserved supersymmetry parametrized by
eqs ley , l s "1. Ž 51 .
3
We use the convention of two-dimensional spinor algebra in w 21 x . In particular,
cq 0 1 0 1
cs
cy ž / ž
, r0s
y1 0
, r1 s
1 0
. / ž /
454 C.-S. Chu, P.-M. Ho r Nuclear Physics B 568 (2000) 447–456
m n X
bracket Ž c Ž s ., X Ž s .. is zero and the constraints never mix the bosonic and
fermionic fields, so this Poisson bracket is not modified by the Dirac procedure.
Applying Ž50., we obtain
U U U
i Ž E X m , X n . q Ž cqm , cqn . y l Ž cqm , cyn . P ey
0sd Ž c m, Xn . s
U
ž P ley / . Ž 54 .
Now we will concentrate on the modified boundary condition Ž52.. The Dirac bracket
relations for the tranverse directions can be obtained simply by setting F s 0 in the
following. Substituting Ž52. into the above, we obtain
U
yi U
Ž cqi , cqj . s
2
Ž E X i, X k . Žh yF . k j,
U
yi U
Ž cyi , cyj . s
2
Ž E X i, X k . Žh q F . k j. Ž 55 .
At s s 0,p , the use of Ž9. allows one to express Es X i and Et X i in terms of P k
Es X i s y2pa X P k Ž F My1 . k i , Et X i s 2pa X P k Ž My1 . k i . Ž 56 .
Thus
ij
ŽE X iŽ s . , X jŽ s X . . s 2pa X Ž My1 Ž h q F . . d˜ Ž s , s X . ,
U
Ž 57 .
ij
ŽE X iŽ s . , X jŽ s X . . s 2pa X Ž My1 Ž h y F . . d˜ Ž s , s X . ,
U
Ž 58 .
and so Ž55. implies
4
We thank V. Schomerus for a useful email exchange about this boundary condition. A different boundary
condition was used in w6x which corresponds to a non-supersymmetric D-brane.
C.-S. Chu, P.-M. Ho r Nuclear Physics B 568 (2000) 447–456 455
with s and s X at the boundary. It is also easy to see that in the interior of the string, the
Dirac bracket is not modified and takes the same form as Ž59.. Therefore the fermion
commutator is not modified Žexcept for the modification of the delta function. by the
presence of F and takes the standard form.
5. Remarks
In the quantization procedure of Dirac, Eq. Ž41. is the only ambiguity. But as
mentioned in Section 2, it is inconsistent to say that Ž41. vanishes. Also note that the
ambiguity resides only in the commutation relations for the endpoint coordinates, the
quantization for the space-time coordinates in the interior of the opens string has no
ambiguity, and their commutation relations are the same as if F s 0. The lattice
regularization we choose there gives the same result we obtained in w6x. It leads to the
conclusion that a D-brane in the B field background has a non-commutative world-
volume in the sense that its world-volume theory is the SYM theory living on a
non-commutative space.
Upon quantization, the Dirac bracket becomes the Žanti.commutator of the operators.
From the point of view of the open string, the modifications to the commutation
relations is ‘‘not very much’’. But as discussed in w6x, since the endpoint of the open
string is living on the D-brane, the D-brane world-volume becomes a non-commutative
one. It is well known that a non-commutative manifold can be described in the dual
language of the algebra of functions living on it, one immediately arrives at the
conclusion that the D-brane world-volume field theory is a non-commutative one, i.e.
one with a modified multiplication. In fact it is easy to check that our modified
commutators Ž44. are equivalent to the following Moyal product w23,24x defined on the
D-brane world-volume w1x,
E E
f Ž j . w g Ž j X . s exp Q i j f Ž j . g Ž j X . < js j X ,
ž Ej Ej X j
i / Ž 60 .
Acknowledgements
References
w1x A. Connes, M.R. Douglas, A. Schwarz, J. High Energy Phys. 02 Ž1998. 003. hep-thr9711162.
w2x N. Seiberg, Phys. Rev. Lett. 79 Ž1997. 3577. hep-thr9710009.
w3x M.R. Douglas, C. Hull, J. High Energy Phys. 2 Ž1998. 8. hep-thr9711165.
w4x C. Hofman, E. Verlinde, U-duality of Born–Infeld on the non-commutative two-torus, hep-thr9810116.
w5x N. Seiberg, talk given at the Conference New Ideas in Particle Physics and Cosmology, Univ. Penn., May
19-22, 99; transpancies available at http:rrdept.physics.upenn.edurparticle – meetingrseiberg.html.
w6x C.-S. Chu, P.-M. Ho, Nucl. Phys. B 550 Ž1999. 151. hep-thr9812219.
w7x J. Dai, R.G. Leigh, J. Polchinski, Mod. Phys. Lett. A 4 Ž1989. 2073.
w8x R. Leigh, Mod. Phys. Lett. A 4 Ž1989. 2767.
w9x Y.-K.E. Cheung, M. Krogh, Nucl. Phys. B 528 Ž1998. 185.
w10x T. Kawano, K. Okuyama, Phys. Lett. B 433 Ž1998. 29.
w11x V. Schomerus, D-branes and deformation quantization, hep-thr9903205.
w12x C.-S. Chu, P.-M. Ho, work in progress.
w13x A. Abouelsaood, C.G. Callan, C.R. Nappi, S.A. Yost, Nucl. Phys. B 280 Ž1987. 599.
w14x C. Bachas, Phys. Lett. B 296 Ž1992. 77. hep-thr9209032.
w15x C. Bachas, Phys. Lett. B 374 Ž1996. 37. hep-thr9511043.
w16x S. Hyun, Y. Kiem, S. Lee, C.-Y. Lee, Closed string interacting with non-commutative D-branes,
hep-thr9909059.
w17x N. Ishibashi, A relation between commutative and non-commutative descriptions of D-branes, hep-
thr9909176.
w18x K. Okuyama, A path integral representation of the map between commutative and non-commutative
gauge fields, hep-thr9910138.
w19x P.A.M. Dirac, Lectures on Quantum Mechanics, Yeshiva University, 1964.
w20x A. Hanson, T. Regge, C. Teitelboim, Constrained Hamiltonian Systems, RX-748, 1976. Contribution to
the Lincei Interdisciplinary Centre for Mathematical Sciences and their Applications, no. 22.
w21x ˝ S. Theisen, Lectures on String Theory ŽSpringer, Berlin, 1989..
D. Lust,
w22x C.-S. Chu, P.-M. Ho, Y.-C. Kao, World-volume uncertainty relations for D-branes, hep-thr9904133.
w23x J.E. Moyal, Proc. Camb. Phil. Soc. 45 Ž1949. 99.
w24x F. Bayen, M. Flato, C. Fronsdal, A. Lichnerowicz, D. Sternheimer, Ann. Phys. 111 Ž1978. 61.
w25x F. Ardalan, H. Arfaei, M.M. Sheikh-Jabbari, Dirac quantization of open strings and non-commutativity in
branes, hep-thr9906161.
Nuclear Physics B 568 Ž2000. 457–471
www.elsevier.nlrlocaternpe
Abstract
Gauged WZNW models are integrable conformal field theories. We integrate the classical
SLŽ2,R.rUŽ1. theory with periodic boundary conditions, which describes closed strings moving in
a curved target-space geometry. We calculate its Poisson bracket structure by solving an initial
state problem. The results differ from previous field-theoretic calculations due to zero-modes. For
a future exact canonical quantization the physical fields are Žnon-locally. transformed onto
canonical free fields. q 2000 Elsevier Science B.V. All rights reserved.
1. Introduction
The SLŽ2,R.rUŽ1. gauged WZNW model has attracted much interest in the past
w1–8x before it was recognized that this non-linear theory is classically integrable w9x.
More generally, we could prove that integrability holds for any gauged WZNW theory
w10x. This was known for nilpotent gauging only, which yields Toda theories w7,11,12x.
So far we have completely integrated the non-linear equations of motion of the classical
SLŽ2,R.rUŽ1. model for a field theoretic case with asymptotic boundary conditions
w10x.
In this paper we solve the classical SLŽ2,R.rUŽ1. theory for periodic boundary
conditions. As a conformal field theory this model describes a closed string moving in
the background of a black hole target-space metric w3x. It is especially interesting for
quantization; quantum mechanical deformations of its metric and a correlated dilaton
w13x were obtained in some perturbative manner w4–6x. However, these calculations were
based on an incomplete effective action w10,14,15x.
Our intention is to provide a different understanding of such quantum mechanical
results. Starting with an entirely classical approach w1,10x, we expect that the exact
classical solution of this theory will facilitate its exact canonical quantization. We
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 2 4 - 5
458 ¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471
calculate, as in the field theoretic case, the Poisson bracket structure of the theory by
solving an initial state problem and look for a canonical transformation of the physical
fields onto canonical free fields. But the results for periodic boundary conditions cannot
simply be inferred from the field theoretic ones because additional zero-modes become
important.
To make this paper self-contained, we mention in Section 2 some of our earlier
results; more details are found in Ref. w10x. First we define the theory, give its general
solution and inspect the symmetry properties. Section 3 solves an initial state problem
which allows us to calculate in Section 4 the Poisson bracket structure of the model. A
free-field realization of these brackets is given in Section 5. The summary provides a
canonical transformation of the physical fields onto the free fields. Some technical
details are found in two appendices.
The exact action of the SLŽ2,R.rUŽ1. gauged WZNW theory written in light-cone
coordinates z s t q s , z s t y s
1 2
S w r ,t x s HM Ž E rE r q tanh r E tE t . dz dz
z z z z Ž 1.
g2
was derived entirely classically and in a gauge-invariant manner w1,10x. M has cylindri-
cal topology where the space-like submanifolds are topologically equivalent to a circle
M s R = S1 , i.e. 0 ( s ( 2p , y` - t - `, Ž 2.
The physical fields r Ž s ,t ., t Ž s ,t ., which represent the position of a closed bosonic
string in the target-space at proper time t , are subject to the boundary conditions
r Ž s q 2p ,t . s r Ž s ,t . , t Ž s q 2p ,t . s t Ž s ,t . q 2p w, w g Z. Ž 3.
The t coordinate is an angular variable given modulo 2p only, and the winding number
w tells us how often the string surrounds the coordinate origin. The string moves in the
curved metric of a Euclidean black hole w3x
ds 2 s dr 2 q tanh2 r dt 2 , Ž 4.
and the dynamics is given by the equations of motion
sinh r
Ez Ez r s Ez t Ez t ,
cosh3 r
1
Ez Ez t s y Ž E z rE z t q E z t E z r . . Ž 5.
sinh r cosh r
These equations are integrable because they have a Lax pair representation
Ez y C, Ez y C s Ez C y Ez C y w C,C x s 0. Ž 6.
C and C take values in the Lie algebra of the group SLŽ2,R. w10x
C s C aT a , C s C aT a Ž a s 1,2,3 . Ž 7.
¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471 459
with
0 1 1 0 0 y1
T1s ž ,/ T2s ž , / T 3 s T 1T 2 s ž /. Ž 8.
y1 0 0 y1 y1 0
One can check that
C1 s y 12 tanh2 r Ez t , C2 s C3 s 0, C1 s 12 tanh2 r Ez t ,
1 1
C2 s y Ez Ž sinh rcos t . , C3 s Ez Ž sinh r sin t . Ž 9.
cosh r cosh r
makes the flatness condition Ž6. equivalent to the equations of motion Ž5.. But unlike
Toda theories w16,17x there is at present no general method to integrate a Lax pair
following from a non-nilpotent gauged WZNW model like Ž6. directly. We found the
general solution of Ž5. in w9x by analysing non-abelian Toda theories w7,8x as
i X
sinh2 r s XX , t s iŽ B y B . q ln Ž 10 .
2 X
with the definitions
BX BX
XsAq Ž 1 q AA . , XsAq Ž 1 q AA . . Ž 11 .
AX AX
A s AŽ z ., B s B Ž z ., A s AŽ z . and B s B Ž z . are complex Žanti-.chiral parameter
functions and AX Ž z ., etc. derivatives. However, as we shall see we must restrict this
solution in order to render r and t real. Straightforward substitution shows that the
solution Ž10. fulfills the equations of motion Ž5.. But it will become obvious from the
investigation of initial-value problems of Section 3 that the solution Ž10. exhausts the
entire solution space Žexcluding singular solutions..
The theory is also characterized by conservation laws. The equations of motion Ž5.
guarantee, in particular, conservation and chirality of the energy–momentum tensor Žwe
shall omit the anti-chiral parts whenever possible.
1
T ' Tz z s
g2
Ž Ž Ez r . 2 q tanh2 r Ž Ez t . 2 . , Tz z s 0, Ž 12 .
A ™ T w A x s aAcA qy db ,
B ™ T w B x s B q ln Ž cA q d . ,
A ™T w Ax s
dA y c
,
bA q a
B ™ T w B x s B q ln Ž bA q a . ,
a yb
ž c d /
g GL Ž 2,C . , Ž 18 .
they are only given by the physical fields up to four complex constants. We shall fix this
arbitrariness in Section 3.
The monodromy properties of the functions A, B, A, B are, as well, determined by
GLŽ2,C. transformations
pA Ž z . y q
A Ž z q 2p . s T X A Ž z . s ,
rA Ž z . q s
B Ž z q 2p . s T X B Ž z . s B Ž z . q ln Ž rA Ž z . q s . ,
sA Ž z . y r
A Ž z y 2p . s T X A Ž z . s ,
qA Ž z . q p
B Ž z y 2p . s T X B Ž z . s B Ž z . q ln Ž qA Ž z . q p . ,
p yq
ž r s /
g GL Ž 2,C . . Ž 19 .
We should remark here that the GLŽ2,C. transformations act, indeed, in two different
manners. We also find that n is not periodic modulo 2p
n Ž s q 2p ,t . s n Ž s ,t . q 2p w q iln Ž ps q qr . . Ž 20 .
Therefore, the conserved quantities V " and V " are not periodically defined. We can
describe their periodicity behaviour by the conserved total momentum of the field t
1 2p 2p
Pt s 2 H0 tanh2 r t˙ d s X s H0 p t Ž s X ,t . d s X . Ž 21 .
g
¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471 461
Using Eqs. Ž14., instead of Ž20. we obtain for n and n the periodicity relations in terms
of Pt
sq2 p X
n Ž s q 2p ,t . y n Ž s ,t . s Hs n Ž s X ,t . d s X s 2p w q g 2 Pt ,
sq2 p X
n Ž s q 2p ,t . y n Ž s ,t . s Hs n Ž s X ,t . d s X s 2p w y g 2 Pt . Ž 22 .
So we have
2 2
V " Ž z q 2 p . s e " ig Pt
V" Ž z . , V " Ž z y 2 p . s e " ig Pt
V" Ž z . , Ž 23 .
and we can define, up to a constant normalization, new periodic conserved quantities
2 2
W "' e . ig P t z rŽ2 p .
V" , W "' e " ig P t z rŽ2 p .
V". Ž 24 .
Comparing Ž20. and Ž22., the real-valued momentum Pt becomes
g 2 Pt s iln Ž ps q qr . , Ž 25 .
so that the monodromy transformations Ž19. are restricted to those with unit determinant,
< ps q qr < s 1. Ž 26 .
ž / ™ž
y2
y1
a
c
yb
d /ž / y2
y1
,
ž / ™ž
y2
y1
d
b
yc
a /ž y2
y1/,
a yb
ž c d /
g GL Ž 2,C . . Ž 31 .
462 ¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471
These properties already restrict the possible GLŽ2,C. transformations Ž31. to scalings
with only two free parameters a and d
ž /™ž / ž / ™ž /
y2
y1
ay 2
dy 1
,
y2
y1
dy 2
ay 1
. Ž 40 .
This means that we have implicitly fixed two of the four integration constants. Any
function invariant under the scalings Ž40. can now be determined uniquely by Ž39., in
particular, the periodic quotients
yXk Ž z .
hk Ž z . ' , k s 1,2. Ž 41 .
yk Ž z .
They determine the solution of Ž34. by
2p z cQ
ln y 1 Ž z . s 12 H0 h1 Ž zX . h Ž z y zX . dzX q a1 q ,
2p 2
2p z cQ
ln y 2 Ž z . s 12 H0 h 2 Ž zX . h Ž z y zX . dzX q a 2 q Ql y . Ž 42 .
2p 2
Ql, cQ are integration constants defined by the Eqs. ŽA.10. of Appendix A, and hŽ z .
denotes the periodic saw-tooth function
z z
h Ž z . s e 2p Ž z . y s 2 n q 1 y for 2p n - z - 2p Ž n q 1 . , n g Z.
p p
Ž 43 .
Here e 2 p Ž z . is the stair-step function
e 2 p Ž z . s 2 n q 1 for 2p n - z - 2p Ž n q 1 . , n g Z, Ž 44 .
and
2p
ak s H0 hk Ž z . dz Ž 45 .
areX the zero-modes of the fields hk Ž z .. ŽIn passing we mention that Ž39., Ž42. imply
e a s e a 1 , and we define a X ' a 1.. But we have to stress here in particular that this
result does not deliver hk or y k explicitly as functions of u, u. However, it will be
sufficient in order to calculate their Poisson brackets.
q d Ck y k Ž z . ,
y 2 Ž zX . y1 Ž z . E Ž z , zX . y e 2 p Ž z y zX .
V 1 Ž z , zX . ' ,
y 1 Ž zX . yX2 Ž zX . y y 2 Ž zX . yX1 Ž zX . 2
y1 Ž zX . y 2 Ž z . E Ž zX , z . q e 2 p Ž z y zX .
V 2 Ž z , zX . ' ,
y 1 Ž zX . yX2 Ž zX . y y 2 Ž zX . yX1 Ž zX . 2
a1 y a2
exp ž e 2 p Ž z y zX . / y 2 Ž z . y1 Ž zX .
X 2
EŽ z , z . ' a1 y a2 . Ž 50 .
sinh y1 Ž z . y 2 Ž zX .
2
The variations d Ck correspond to the undetermined scalings Ž40.. They cannot simply
be set to zero because we are integrating Ž50. over non-periodic functions of zX , and
without the term d Ck y k Ž z . these integrals would depend on a shift of the integration
range.
Since the functions hk Ž z . do not depend on the scalings Ž40., their variations
yXk Ž z . d yXk Ž z . yXk Ž z . d y k Ž z .
dhk Ž z . s d s y 2 Ž 51 .
yk Ž z . yk Ž z . yk Ž z .
¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471 465
h1 Ž z . y h 2 Ž z .
v 1 Ž z , zX . ' 12 E Ž z , zX . ,
h1 Ž zX . y h 2 Ž zX .
h1 Ž z . y h 2 Ž z .
v 2 Ž z , zX . ' y 12 E Ž zX , z . . Ž 52 .
h1 Ž zX . y h 2 Ž zX .
The integrands of Ž52. are periodic functions of the integration variable, and Ž52.
determines the Poisson brackets of hk Ž z .. The non-vanishing ones are given by
X
g2
h1Ž z . ,h 2 Ž z . 4 s Ž h1 Ž z . y h 2 Ž z . . E Ž z , zX . Ž h1 Ž zX . y h 2 Ž zX . .
2
y g 2 Ž h1 Ž z . y h 2 Ž z . . d 2 p Ž z y zX . . Ž 53 .
The Poisson brackets of the functions ln y k result, finally, by means of Ž42.
ln y1Ž z . ,ln y1Ž zX . 4 s 0,
g2 z y zX g2
ln y1Ž z . ,ln y 2 Ž zX . 4 s ž e 2 p Ž z y zX . y / y E Ž z , zX .
2 2p 2
g2 2p
qq H dz E Ž z , zX . ,
8p 0
g2
ln y1Ž z . ,ln y1Ž zX . 4 s y Ž z y zX . ,
4p
g2 2p h2 Ž z .
ln y1Ž z . ,ln y 2 Ž zX . 4 s y H dz E Ž z , zX . ,
8p 0 h1 Ž z .
X
g2 2p h2 Ž z .
ln y 2 Ž z . ,ln y 2 Ž z . 4 s y H dz E Ž z , zX .
8p 0 h1 Ž z .
g2 2p h 2 Ž zX .
q H0 dzX E Ž zX , z . ,
8p h1 Ž zX .
g2 g2 2p
ln y 2 Ž z . ,ln y 2 Ž zX . 4 s y Ž z y zX . y H dzX E Ž zX , z .
4p 8p 0
g2 2p
q H d z E Ž z , zX . . Ž 54 .
8p 0
In distinction to the field theoretic results of Ref. w10x we observe here a structurally
changed non-local realization of the algebra. This is due to zero-modes which arise
additionally in the periodic case. It might be surprising that the algebra treats y 1 and y 2
Žas well as y 1 and y 2 . asymmetrically. A symmetric treatment of the functions y k and
y k is presented in Appendix B. But it turns out that the algebra Ž54. is more appropriate
for transforming the y k , y k onto canonical free fields.
466 ¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471
1 a yX1 q b yX2 C
fs
gC ž ln
y 1 yX2 y yX1 y2
qD ,
/ xs
g
ln Ž a y 1 q b y 2 . . Ž 62 .
™ ™
But the constants can be further restricted. Taking into consideration the invariance of
Ž59. under f e i df , x eyi dx , we can choose C real positive. Of course, the physics
should not depend on the choice of the branch of the logarithm. This implies C s 1.
¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471 467
Furthermore, f 2 is defined modulo 2prg only Ži.e. f 2 takes values on a circle with
radius 1rg .
2p 2p
f2 ' f2 q , f2 ' f2 q . Ž 63 .
g g
Up to winding contributions the free fields c k are assumed periodic
2 p wX
c 1 Ž s q 2p ,t . s c 1 Ž s ,t . , c 2 Ž s q 2p ,t . s c 2 Ž s ,t . q , wX g Z,
g
Ž 64 .
and we obtain
pk p wX
f k Ž z q 2p . y f Ž z . s q d k ,2 ,
2 g
pk p wX
f k Ž z q 2p . y f Ž z . s y d k ,2 , p k g R. Ž 65 .
2 g
But this is consistent with Ž62. and Ž39. only if one of the pairs Ž a , a . and Ž b , b . is
™
Ž0,0.. Choosing b s b s 0, the rescaling Ž40. now allows us to set a s a s 1, and
considering the invariance under c k c k q const., we can without loss of generality fix
D s D s 0. Thus the solution Ž62. simplifies finally to
1 yX1 1
f s ln X X , x s ln y 1 ,
g y1 y 2 y y1 y 2 g
1 yX1 1
fs ln , xs ln y 1 . Ž 66 .
g y 1 yX2 y yX1 y2 g
As expected, from the non-local Poisson bracket relations Ž54. we get for the fields f k ,
f k , indeed, the local free-field Poisson brackets
dk l sys X
fk Ž t q s . ,fl Ž t q s X . 4 s y e 2p Ž s y s X . y
ž , /
4 2p
X
dk l X
sys X
f
k Ž t y s . , f l Ž t y s . 4 4 2p
s e ž
Ž s y s . y
2p
, /
d
f k Ž t q s . , f l Ž t y s X . 4 s y 8pk l Ž s q s X . . Ž 67 .
Solving now Ž66. for y k , y k , their non-local free-field representation result
y 1 Ž z . s exp Ž gx Ž z . . ,
exp Ž gx Ž z . . 2p g p1
y2 Ž z . s y H dzXgx X Ž zX . exp y ž e 2 p Ž z y zX . y 2gf 1 Ž zX . ,
/
2sinh Ž g p r2 . 01 2
y 1 Ž z . s exp Ž gx Ž z . . ,
exp Ž gx Ž z . . 2p g p1
y2 Ž z . s y H d zXgx X Ž zX . exp y ž e 2 p Ž z y zX . y 2gf 1 Ž zX . ,
/
2sinh Ž g p r2 . 01 2
Ž 68 .
468 ¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471
6. Summary
In our opinion this may be a good starting point for a quantization. Therefore, it remains
a challenge to implement the exact canonical quantization of the SLŽ2,R.rUŽ1. model
on the basis of our results.
Acknowledgements
We would very much like to thank C. Ford and G. Jorjadze for reading the
manuscript and for useful discussions.
Since the two parts of ŽA.2. are the chiral and anti-chiral components of c , and
distributing the zero-mode cQ half and half to these components, by comparison we
obtain
cQ cQ cQ cQ
D1 s q Ql , D1 s q Ql , D2 s y , D2 s y . Ž A.9 .
2 2 2 2
This immediately implies Ž42.. We give, finally, the explicit expressions for the
constants cQ and Ql in terms of the hk
1 2p
cQ s H Ž c Ž s ,t . y cP Ž s ,t . . d s , with
2p 0
s 2p Ec Ž s X ,t . t 2p Ec Ž s X ,t .
c P Ž s ,t . s H ds X q H ds X ,
2p 0 Es X 2p 0 Et
1 2p Pl
Ql s H
2p 0 žŽ. l z y
2p /
z dz , with
2p X 2p h1 Ž z . hX2 Ž z . y h1X Ž z . h 2 Ž z .
Pl s H0 l Ž z . dz s H0 dz , Ž A.10 .
h1 Ž z . Ž h1 Ž z . y h 2 Ž z . .
1 2p t
kQ s H ln y 1 Ž t q s . y ln y 2 Ž t y s . y Ž a1 y a2 . y i s m ds ,
2p 0 2p
1 2p t
kQ s H ln y 1 Ž t y s . y ln y 2 Ž t q s . y Ž a1 y a2 . y i s m ds ,
2p 0 2p
Ž B.1 .
kQ cQ kQ cQ
ln y˜ 1 Ž z . s ln y 1 Ž z . q y , ln y˜ 2 Ž z . s ln y 2 Ž z . q y ,
2 2 2 2
kQ cQ kQ cQ
ln y˜ 1 Ž z . s ln y 1 Ž z . q y , ln y˜ 2 Ž z . s ln y 2 Ž z . q y . Ž B.2 .
2 2 2 2
¨
U. Muller, G. Weigtr Nuclear Physics B 568 (2000) 457–471 471
X
g2 X
z y zX g2
ln y˜ 1Ž z . ,ln y˜ 2 Ž z . 4 s ž e 2p Ž z y z . y / y E Ž z , zX .
2 2p 2
2 2
g 2p g 2p
qq H dz E Ž z , zX . q H dzX E Ž z , zX .
8p 0 8p 0
g2 2p 2p
y 2 H0 H0 dz dzX Ž E Ž z , zX . y E Ž zX , z . . ,
32p
y˜ 1Ž z . , y˜ 1Ž zX . 4 s y˜ 2 Ž z . , y˜ 2 Ž zX . 4 s y˜ 1Ž z . , y˜ 2 Ž zX . 4 s y˜ 1Ž z . , y˜ 2 Ž zX . 4 s 0,
ln y˜ 2 Ž z . ,ln y˜ 2 Ž zX . 4
g2 g2 2p g2 2p
sy Ž z y zX . q H dz E Ž z , zX . y H d zX E Ž zX , z .
4p 8p 0 8p 0
g2 2p 2p
y 2 H0 H0 dz dzX Ž E Ž z , zX . y E Ž zX , z . . . Ž B.3 .
32p
References
Superconformal hypermultiplets
Bernard de Wit a,b, Bas Kleijn a , Stefan Vandoren c
a
Institute for Theoretical Physics, Utrecht UniÕersity, 3508 TA Utrecht, The Netherlands
b
¨ GraÕitationsphysik, Albert Einstein Institut, Am Muhlenberg
Max Planck Institut fur ¨ 1,
D-14476 Golm, Germany
c
C.N. Yang Institute for Theoretical Physics, SUNY, Stony Brook, Stony Brook, NY 11794-3840, USA
Abstract
1. Introduction
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 2 6 - 9
476 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
1
¨
Note that hyper-Kahler manifolds that are in the image of the c-map are sometimes called special, because
¨
of the underlying special geometry features. We stress that the usage of the term special hyper-Kahler in this
paper has no relationship to special geometry.
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 477
only a restricted holonomy group contained in SpŽ n y 1.; locally they are a product of a
flat four-dimensional space and a Ž4 n y 4.-dimensional quaternionic space. After gaug-
ing away the degrees of freedom associated with the dilatations and the SUŽ2.
transformations, the quaternionic space remains when coupling to supergravity. We
present the full Lagrangian and transformation rules for the supersymmetric non-linear
sigma models based on special hyper-Kahler ¨ spaces, including the option of gauged
isometries. Furthermore we construct local SpŽ n. = SpŽ1. sections of the so-called
associated quaternionic bundle which is known to exist for any special hyper-Kahler ¨
manifold w9x. It turns out that the use of these sections greatly simplifies the formulation
of the transformation rules and the Lagrangian. In this way our general results remain
closely in line with the results of Ref. w6x; the formulae are identical up to modifications
by connections and covariant tensors. When the sections are trivial, so that the
connections can be put to zero and the tensors become constant, they can be identified
with the hypermultiplet scalar fields and one directly recovers the results of Ref. w6x.
Guided by supersymmetry we thus make contact with the mathematical results quoted
above and we construct the general action and transformation rules in a new form.
The last topic is to couple these supersymmetric non-linear sigma models to
supergravity, using the conformal multiplet calculus. In addition to presenting the
corresponding field theory, we exhibit how the quaternionic manifold emerges in the
coupling. This manifold can now be encoded in terms of SpŽ n. = SpŽ1. sections that are
projective with respect to quaternionic multiplication.
Our results could facilitate the study of type-II string compactifications on Calabi–Yau
three-folds. These lead to four-dimensional models with both vector multiplets and
hypermultiplets. While the moduli space of the vector multiplet scalars is described in
¨
terms of a special Kahler geometry and is well understood, much less is known about
the full quaternionic hypermultiplet moduli space. It is known that at string tree level the
quaternionic manifolds are obtained from a special Kahler ¨ manifold via the c-map w13x.
One would like to understand what the corrections are to the classical hypermultiplet
moduli space coming from both string perturbation theory and non-perturbative effects
w14–17x. With rigid conformal symmetry, the results of this paper could also be helpful
in the description of cone branes w11x.
This paper is organized as follows. In Section 2 we briefly summarize some essential
features of hypermultiplet Lagrangians with gauged target-space isometries. For hyper-
multiplets there exists no unconstrained off-shell formulation in terms of a finite number
of degrees of freedom, hence the supersymmetry algebra will only be realized up to the
field equations of the hypermultiplet fermions. This is in contrast with the vector
multiplets, introduced to gauge the isometries, and the superconformal theory itself, for
which off-shell formulations exist. As a result of the latter, the algebra of gauged
isometries and of the superconformal transformations, including certain field-dependent
structure constants, is completely fixed and not affected by the presence of hypermulti-
plets. Section 3 deals with rigidly superconformal hypermultiplets, where we find the
¨
constraints on the hyper-Kahler manifold imposed by superconformal invariance. Sub-
section 3.1 defines the superconformal transformation rules, the second one deals with
¨
the hyper-Kahler potential and the construction of local SpŽ n. = SpŽ1. sections, and the
third one gives the Lagrangian and the transformation rules. The geometry of special
¨
hyper-Kahler manifolds is explained in Section 4. We first discuss the cone structure of
478 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
¨
these hyper-Kahler manifolds which lead to a tri-Sasakian space. The latter is indeed an
SpŽ1. fibration over a smaller space, which we prove to be quaternionic. This quater-
nionic space couples to supergravity, as we then show in Section 5. Here we present the
action for the hypermultiplets associated with a special hyper-Kahler¨ target space
coupled to conformal supergravity and exhibit how the target-space metric becomes
quaternionic.
2. Preliminaries
™™
transformations, two of which are associated with the target space. One set consists of
the target-space diffeomorphisms f f X Ž f .. The other refers to reparametrizations of
the fermion ‘frame’ of the form z a S a b Ž f . z b, and corresponding redefinitions of
™
other quantities carrying indices a or a . For example, the fermionic metric transforms
as Gab w Sy1 xg a w Sy1 x d b Ggd . There are connections, GA a b , associated with these
fermionic redefinitions, which appear in the Lagrangian and supersymmetry transforma-
tion rules. Finally, there are chiral SUŽ2. ( SpŽ1. redefinitions of the supercharges,
which in the rigidly supersymmetric case must be constant and are therefore trivial. In
the locally supersymmetric case this will be different and in the latter part of this paper
we will have to deal with local SUŽ2..
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 479
d Q f A s 2 g i Aa e zi a q gaA i e i z a ,
ž /
d Q z a s VAai Euf Ae i y d Q f A GA a
b zb ,
d Q z a s VAi a Euf Ae i y d Q f A GA a
b zb . Ž 2.1 .
A
In principle g and VA each denote Ž4 n. = Ž4 n. complex quantities, but as we shall see
below, these quantities are related and satisfy a pseudoreality condition. As it turns out
they will play the role of the quaternionic Žinverse. vielbeine of the target space.
Observe that the supersymmetry variations are consistent with a UŽ1. chiral invariance
under which the scalars remain invariant, while the fermion fields and the supersymme-
try transformation parameters transform nontrivially. This group will be denoted by
UŽ1. R to indicate that it is a subgroup of the automorphism group of the supersymmetry
algebra. In Section 3 we will see that this UŽ1. will correspond to one of the conformal
gauge transformations. However, for generic g A and VA , the SUŽ2. R ( SpŽ1. part of the
automorphism group cannot be realized consistently on the fields. This would require
the presence of an SUŽ2. isometry in the target space. In the above, we merely used that
z a and z a are related by complex conjugation.
The Lagrangian takes the following form:
1 1
L s y g A B Em f AE mf B y Ga b z a Du z b q z b Du z a y Wa bgd z agm z b z gg mz d ,
ž /
2 4
Ž 2.2 .
where we employed the covariant derivatives
Dm z a s Em z a q Em f A GA a
b zb , Dm z a s Em z a q Em f A GA a
b zb . Ž 2.3 .
Besides the Riemann curvature R A B C D we will be dealing with another curvature
R A B a b associated with the connections GA a b , which takes its values in spŽ n. (
uspŽ2 n;C.. The tensor W is defined by
1
Wabgd s R A B e g g i Aa gbi B Ged s R A B C D g i Aa gbi B g jCg gd j D , Ž 2.4 .
2
and will be discussed shortly in more detail.
The target-space metric g A B , the tensors g A , VA and the fermionic hermitian metric
Gab Ži.e. satisfying Ž Ga b . ) s Gba . are all covariantly constant with respect to the
Christoffel connection and the connections GA a b . Furthermore we note the following
relations:
) 1
Ž Ji j . A B ' Ž JAi jB . s ´ i k ´ jl JAk Bl , J Ai jC JCk Bl s
2
´ iŽ k´ l . j g A B q ´ Ž iŽ k JAl .Bj. .
Ž 2.8 .
In addition we note the following useful identities:
1
gA i a VBj a s ´ i k JAk Bj q g A B dij , JAi jB g aBk s yd kŽ i´ j.l
gA l a . Ž 2.9 .
2
We also note the existence of covariantly constant antisymmetric tensors,
1 1
Vab s ´ i j g A B g i Aa g jBb , V a b s ´ i j g A B VAi a VBj b , Ž 2.10 .
2 2
satisfying Vag V gb s yda b . Their complex conjugates satisfy
)
Vab ' Ž Va b . s G ga V gd G db . Ž 2.11 .
The tensor V can be used to define a reality condition on V and g ,
where, as a result of the cyclicity property of the Riemann tensor, Wabgd is symmetric in
all four indices. This tensor is linearly related to the tensor Ž2.4. upon multiplication
with the tensors G and V . In terms of Wabgd the curvatures read
1
RA BCD s ´ i j ´ k l VAai VBbj VCgk VDd l Wa bgd ,
2
1
Vae R A B e b s y ´ i j VAgi VBdj Wa bgd . Ž 2.17 .
2
The above results are all derived from the requirement of supersymmetry. To
characterize the geometry of the target space, one could start from the non-singular VAi a
and a non-singular skew-symmetric tensor Vab that is covariantly constant with respect
to a symplectic connection GA a b . Subsequently one notes that ´ i j Va b VAj b and the
inverse of VA , denoted by g iBa , are linearly related by a symmetric matrix g A B .
Requiring that this matrix is real we can identify it with the target-space metric while the
ensuing reality constraint on the VA enables their identification as the corresponding
quaternionic vielbeine. This information is sufficient for deriving all the algebraic
identities listed above. The vielbeine and the symplectic connection then allow the
definition of an affine target-space connection, with respect to which the vielbeine are
covariantly constant thus leading to a generalized vielbein postulate. All of above results
then follow upon assuming that the target space has no torsion so that the affine
connection and the Christoffel connection coincide.
dG f A s g u I k IA Ž f . , Ž 2.18 .
where g is the coupling constant and the k IA Ž f . satisfy the Killing equation,
DA k IB q DB k I A s 0 . Ž 2.19 .
The isometries constitute an algebra with structure constants f I J K ,
k IBE B k JA y k JBE B k IA s yf I J K
k KA . Ž 2.20 .
482 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
Our definitions are such that the gauge fields that are needed once the u I become
space-time dependent, transform according to dG WmI s Em u I y gf JK I WmJ u K . The Killing
equation generally implies the following property:
DA DB k IC s R B C A E k IE . Ž 2.21 .
Quantities that carry SpŽ n. indices, such as VAai ,
are only required to be invariant
under isometries up to fermionic equivalence transformations. Thus yg Ž k IB E B VAai q
EA k IB VBai . must be cancelled by a suitable infinitesimal rotation on the index a . Here we
assume that the effect of the diffeomorphism is entirely compensated by a rotation that
affects the indices a . In principle, one can also allow for a compensating SpŽ1.
transformation acting on the indices i, j, . . . . However, the latter transformations must be
constant, so they will generically not appear here. This is equivalent to requiring that the
isometry group will commute with supersymmetry.
Let us parametrize the compensating transformation acting on the SpŽ n. indices by
dG z a s g w t I y k IA GA x a b z b , where the Ž f-dependent. matrices t I Ž f . remain to be
determined,
a
yk IB E B VAai y EA k IB VBai q Ž t I y k IB GB . b VAbi s 0 . Ž 2.22 .
Obviously similar equations apply to the other geometric quantities, but as those are not
independent we do not need to consider them. Using the covariant constancy of VA , we
derive from Ž2.22.,
a
Ž tI . b VAbi s DA k IB VBai , Ž 2.23 .
so that
a 1
Ž tI . bs VAai gbB i DB k IA . Ž 2.24 .
2
Target-space scalars will satisfy algebraic identities, such as
g g g
Ž tI . a Ggb q Ž t I . b Gag s Ž t I . wa Vb xg s 0 . Ž 2.25 .
This establishes that the field-dependent matrices t I take values in spŽ n.. From Ž2.19.
and Ž2.21., it easily follows that
a
DA t I b skI
B
RA B a b , Ž 2.26 .
for any infinitesimal isometry. From the group property of the isometries it follows that
the matrices t I satisfy the commutation relation
a a
w tI , t J x b s fI J
K
Ž tK . b qkI
A
k JB R A B a b , Ž 2.27 .
which takes values in spŽ n.. The apparent lack of closure represented by the presence of
the curvature term is related to the fact that the coordinates f A on which the matrices
depend, transform under the action of the group. One can show that this result is
consistent with the Jacobi identity.
Furthermore we derive from Ž2.22. that the complex structures JAi jB are invariant
under the isometries,
k ICEC JAi jB y 2 E w A k IC JBi jxC s 0 . Ž 2.28 .
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 483
This means that the isometries are tri-holomorphic. From Ž2.28. one shows that
EAŽ JBi Cj k IC . y E B Ž JAC
ij
k IC . s 0, so that, locally, one can associate three Killing potentials
Žor moment maps. PIi j to every Killing vector, according to
EA PIi j s JAi jB k IB . Ž 2.29 .
Observe that this condition determines the moment maps up to a constant. Up to
constants one can also derive the equivariance condition,
JAi jB k IA k JB s yf I J K
PKi j , Ž 2.30 .
which implies that the moment maps transform covariantly under the isometries,
dG PIi j s u J k JA EA PIi j s yf JI K
PKi j u J . Ž 2.31 .
Summarizing, the invariance group of the isometries acts as follows:
a
dG f s g u I k IA , dG z a s g Ž u I t I . b z b y dG f AGA a
b zb . Ž 2.32 .
When the parameters of these isometries become space-time dependent we introduce
corresponding gauge fields and fully covariant derivatives,
Dm f A s Em f A y gWmI k IA , Dm z a s Em z a q Em f A GAa b z b y gWm a
bz
b
,
Ž 2.33 .
where Wm a b s WmI Ž t I . a b . The covariance of Dm z a depends crucially on Ž2.26. and
Ž2.27.; after some calculation one finds
a
dG Dm z a s g Ž u I t I . b Dm z b y dG f AGA a
b Dm z b . Ž 2.34 .
The gauge fields WmI are accompanied by complex scalars X , spinors and auxiliary I
V iI
fields Yi Ij , constituting off-shell N s 2 vector multiplets. For our notation of vector
multiplets, the reader may consult w18x.
The minimal coupling to the gauge fields requires extra terms in the supersymmetry
transformation rules for the hypermultiplet spinors as well as in the Lagrangian, in order
to regain N s 2 supersymmetry. The extra terms in the transformation rules are
¨
In this section we determine the restrictions on the hyper-Kahler geometry that follow
from imposing invariance under rigid superconformal transformations. As we already
mentioned in Section 1, the corresponding spaces, called special hyper-Kahler ¨ mani-
folds, have an intriguing geometrical structure. In Section 5 we will obtain the coupling
of hypermultiplets to conformal supergravity. A crucial element in the construction of
this coupling is that the full superconformal theory is known in an off-shell form, so that
the superconformal algebra remains unaffected in the presence of matter fields. Our goal
is more modest in this section where we only consider rigid superconformal transforma-
tions. This aspect does not play a role for the derivation of the superconformal
transformations on the hypermultiplets and the results of this section describe the
situation that would arise when freezing all the fields of conformal supergravity to zero
in a flat space-time metric. In that case the superconformal transformations acquire an
explicit but fixed dependence on the space-time coordinates parametrized by a finite
number of space-time-independent parameters Žthis is explained, for instance, in w23x..
In Subsection 3.1 we impose the superconformal algebra on the fields and find the
transformation rules as well as a number of important results for the complex structures
and the moment maps associated with possible isometries. In Subsection 3.2 we derive
¨
the existence of a hyper-Kahler potential and reformulate the theory in terms of local
sections of an SpŽ n. = SpŽ1. bundle. Then, in Subsection 3.3 we present the Lagrangian
and the transformation rules in terms of these local sections.
d S Ž h . , d Q Ž e . f A s Ž e ihi q e ih i . k DA q 2 Ji k A
B ´ k j e ihj y e jh i k DB .
ž / Ž 3.3 .
This result can be confronted with the corresponding expression from the N s 2
superconformal algebra, which reads
k iAj s Ji j A
B k DB . Ž 3.5 .
Now we proceed to impose the same commutator on the fermions, where on the
right-hand side we find a Lorentz transformation, a UŽ1. transformation and a dilatation,
if and only if we assume the following condition on k DA :
DA k DB s dAB . Ž 3.6 .
The geometric significance of these results will be discussed in later subsections. Here
we note that Ž3.6. suffices to show that the kinetic term of the scalar fields is invariant
under dilatations, provided one includes a space-time metric or, in flat space-time,
includes corresponding scale transformations of the space-time coordinates. Neverthe-
less, observe that k DA is not a Killing vector of the hyper-Kahler
¨ space, although it still
satisfies Ž2.21., but an example of a conformal homothetic Killing vector. Another
consequence is that the SUŽ2. vectors k iAj , as expressed by Ž3.5., are themselves Killing
vectors, because their derivative is proportional to the corresponding antisymmetric
complex structure
DA k Bi j s yJAi jB . Ž 3.7 .
¨
From this it follows that the Kahler two-forms are exact, provided that the Killing
vectors are globally defined. The product rule of the SUŽ2. Killing vectors can now be
worked out and one finds
k B i j E B k A k l y k B k l E B k A i j s 2 k A Ž iŽ k ´ l . j. , Ž 3.8 .
which is indeed in accord with the SUŽ2. structure constants.
486 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
covariantly constant. Interchanging the order of the derivatives and extracting the
complex structure then gives
k iBj DB k IA s JiAj B k IB , Ž 3.14 .
which implies that the tri-holomorphic Killing vectors commute with SUŽ2.. From the
above equations one can derive the following result for the variation of the moment
maps under a dilatation:
k DA EA PIi j s JAi jB k DA k IB s yk Ai j k IA s 2 PIi j , Ž 3.15 .
i.e. they scale with conformal weight 2. Here we have adjusted an integration constant in
PIi j in the last equation. Combining the above equation with previous results, one
establishes that the moment maps transform under SUŽ2. according to
k kAl EA PIi j s 2 dŽŽki ´ l . m PIj. m . Ž 3.16 .
The latter expresses the fact that the moment maps form a triplet under SUŽ2.. It is then
easy to check that the action is invariant under dilatations, UŽ1. and SUŽ2..
¨
3.2. Hyper-Kahler potential and Sp(n) = Sp( 1 ) sections
The existence of the homothetic Killing vector satisfying Ž3.6. has important conse-
quences for the geometry. First of all Ž3.6. implies that k DA can Žlocally. be expressed in
terms of a potential x , according to k D A s EA x . Up to a suitable additive integration
constant, one can then show that w12x
1
x Žf. s g A B k DA k DB . Ž 3.17 .
2
Observe that x is positive for a space of positive signature. A second Žcovariant.
derivative acting on x yields the metric, and therefore a third derivative vanishes,
DA DB x s g A B , DA DB DC x s 0. Ž 3.18 .
The first condition expresses the fact that the metric is the second Žcovariant. derivative
of some function, somewhat analogous to the Kahler ¨ ¨
potential in Kahler metrics, but
now written in real coordinates. A Kahler¨ potential is guaranteed to exist for any
¨
hyper-Kahler space, but the potential x does not always exist. In the literature x is
¨
sometimes called the hyper-Kahler potential Žsee, e.g., Refs. w8,9x.. This means that x
¨
serves as a Kahler potential for each of the three complex structures, as follows from
1
2
Ž dAC q J L A C . Ž dBD y J L B D . DC DD x s JALB , Ž 3.19 .
where J L s Ž s 2 s L . i j J i j and L s 1,2,3 is kept fixed.
¨
The hyper-Kahler potential x is invariant under isometries, as follows directly from
Ž3.13.. In particular it is invariant under the SUŽ2. isometry; explicitly,
i i
dx s Ž u SUŽ2 . . k ´ jk k iBj E B x s Ž u SUŽ2 . . k ´ jk Ji j A B k DB k DA s 0 , Ž 3.20 .
where we made use of Ž3.5.. However, it is not invariant under dilatations,
dx s k DB E B x s 2 x . Ž 3.21 .
488 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
g A B DA A i a DB A j b s d i j G a b ,
R A B g a Vgb DC A i a DD A j b ´ i j s R A B C D . Ž 3.32 .
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 489
q 2 g X g a Vbg z az b q 2 g z a Va b V i b g A i g q h.c.
1
q 2 g 2 Gab Ai a X b g X g d A i d q g A i a Va b Y i j b g A j g , Ž 3.33 .
2
where the covariant derivatives are defined by
Dm A i a s Em A i a q Em f A GAa b A i b y g Wm a
b Ai b ,
Dm z a s Em z a q Em f A GAa b z b y g Wm a
b zb , Ž 3.34 .
and we have used Lie-algebra valued vector multiplet fields associated with gauged
isometries, Wm a b , X a b , Y i j a b and V i a b Žfor the precise definition, see below., In
addition to the equation in Subsection 3.2 we made use of the identities,
k IA VAai s k IA DA A i a s t Ia b A i b ,
1 1 b
PI i j s y k A i j k IA s y Va b A i a Ž t I . g A j g . Ž 3.35 .
2 2
The first relation follows from Ž2.23. and Ž3.12., and for the second equation we made
use of the last equality in Ž3.15..
The action may be compared to the one in w6x Žmore precisely, to the part that
pertains to the rigidly supersymmetric Lagrangian.. However, in that reference, the A i a
are identical to the coordinate fields, whereas in the present more general case they are
local sections as explained in Subsection 3.2. Because the target-space manifold is not
flat, we encounter a non-trivial metric in Ž3.33. as well as non-trivial SpŽ n. connections
in the covariant derivatives Ž3.34.. Furthermore, the generators t I Ž f . associated with the
isometries are not constant, but depend on the scalar fields as we indicated before. This
means that the Lie-algebra valued vector multiplet fields associated with the gauged
isometries depend also on the hypermultiplet scalars. Their definitions are
a I a
Wm b s Wm tI Ž f . b ,
a a
X a b s X I tI Ž f . b , X a b s X I tI Ž f . b ,
a
Y i ja b s Y I i j tI Ž f . b ,
a a
V i a b s V I i tI Ž f . b , Vi a
b s Vi
I
tI Ž f . b . Ž 3.36 .
490 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
Nevertheless, the correspondence with the formulation in w6x will be helpful later on
when evaluating the coupling to conformal supergravity.
In order to obtain the transformation rules of the SpŽ n. = SpŽ1. sections under
dilations, SUŽ2. and isometry transformations, we use the general relation
d A i a s df B E B A i a s df B VBai y df BGB a
b Ai b . Ž 3.37 .
Using Ž2.9., Ž2.23. and Ž3.12., we then find for a combined dilatation, chiral transforma-
tion and target-space isometry, that
d A i a s u D A i a q Ž u SUŽ2 . . i j A j a q gu I t I a
b A i b y df AGA a
b Ai b . Ž 3.38 .
This result should be combined with that for the fermions, derived in Section 2,
3 1
dz a s u D z a y i u UŽ1. z a q gu I t I a b z b y df AGA a b z b . Ž 3.39 .
2 2
Similarly we determine the transformations under Q- and S-supersymmetry,
d A i a s 2 e i z a q 2 ´ i j G a bVbg e jz g y d Q f BGB a
b Ai b ,
dz a s Du A i ae i y d Q f BGB a
b z b q 2 g X a b A i b ´ i je j q A i a h i ,
dz a s Du Ai ae i y d Q f BGB a
b z b q 2 g X a b Ai b ´ i j e j q Ai a hi . Ž 3.40 .
Again, we stress that, apart from the SpŽ n. connection Žand a slight change in
notation., these transformation rules are identical to the ones specified for a flat target
space w6x, where the local sections can be identified directly with the target-space
coordinates.
Finally, we recall that it is straightforward to write down actions for the vector
multiplets that are invariant under rigid N s 2 superconformal transformations. Those
are based on a holomorphic function that is homogeneous of degree two w24x.
¨
In this section we discuss the properties of the special hyper-Kahler space. We will
show how this space can be described as a cone over a tri-Sasakian manifold. The latter
spaces Žwhich are of dimension 4 n y 1. are characterized by the existence of three Ž1,1.
tensors and three Killing vectors that are subject to certain conditions. A manifold is
¨
tri-Sasakian if and only if its cone is hyper-Kahler. Tri-Sasakian spaces are Einstein and
take the form of an SpŽ1. fibration over a quaternionic space. This quaternionic space is
the one that appears in the coupling of hypermultiplets to supergravity Žfor more details,
see Ref. w10x, where the relation between special hyper-Kahler,
¨ tri-Sasakian and quater-
nionic spaces is reviewed from a more mathematical viewpoint..
We start by noting that the Riemann tensor vanishes upon contraction with any one of
the four vectors Ž k DA ,k iAj ., i.e.
R A B C E k DE s 0 , R A B C E k iEj s 0. Ž 4.1 .
The first equation Ž4.1. is derived by antisymmetrizing the second equation Ž3.18. in the
indices w AB x. The second Ž4.1. follows from inserting Ž3.7. into Ž2.21.. Incidentally,
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 491
Ž4.1. implies that the Ricci tensor has at least four null vectors. However, in the case at
¨
hand this poses no extra restrictions as hyper-Kahler spaces are Ricci-flat. The above
results can also derived from the fact that the SpŽ n. holonomy group is reduced to
SpŽ n y 1., c.f. Ž3.28.. This follows from applying Ž3.30..
We recall that these four vectors are orthogonal Žcf. Ž3.5., Ž3.17..,
k DA k D A s 2 x , k iAj k Ak l s dŽki d j.l x , k DA k Ai j s 0 . Ž 4.2 .
¨
This implies that the hyper-Kahler manifold is locally a product R 4 = Q 4 ny4 , where R 4
denotes a flat four-dimensional space. By decomposing R 4 as Rq= S 3 , we can write the
¨
hyper-Kahler manifold as a cone over a so-called tri-Sasakian manifold; the latter is then
a fibration of SpŽ1. over Q 4 ny4 . Hence the manifold can be written as 2 Rq=wSpŽ1. =
Q 4 ny4 . Spaces with a homothety can always be described as a cone. This becomes
manifest when decomposing the coordinates f A into coordinates tangential and orthog-
onal to the Ž4 n y 1.-dimensional hypersurface defined by setting x to a constant. The
line element can then be written in the form w12x,
dx 2
ds 2 s q 2 x h a b Ž x . dx a dx b , Ž 4.3 .
2x
where the x a are the coordinates associated with the hypersurface 3. In the present case
this hypersurface must be a tri-Sasakian space and the hyper-Kahler ¨ space is therefore a
cone over the tri-Sasakian space.
The purpose of the remainder of this section is to establish that Q 4 ny4 is a
quaternionic manifold. In Section 5 we show how Q 4 ny4 arises in the coupling of
hypermultiplets to supergravity. The tangent space of the hyper-Kahler ¨ space can be
decomposed into the four directions along Ž k DA ,k iAj ., and a Ž4 n y 4.-dimensional space
Q 4 ny4 that is locally orthogonal to that. Tensors that vanish upon contraction with
Ž k DA ,k iAj . will be called horizontal.
Let us introduce a vector field VA i j which will serve as a connection for SpŽ1. in a
way that will become clear shortly,
ki j A
VA i j s s Ji j A B E B ln x . Ž 4.4 .
x
This vector field is invariant under target-space dilatations and gauge isometries, i.e.
d D VA i j s k DB E B VA i j q EA k DB V B i j s 0 ,
dG VA i j s k IB E B VA i j q EA k IB V B i j s 0, Ž 4.5 .
and rotates under target-space SUŽ2., as follows from
d VAi j s k B k l E B VAi j q EA k B k l V Bi j s 2 ´ Ž iŽ k VAl . j. . Ž 4.6 .
2
Strictly speaking it is SpŽ1.rZ 2 where SpŽ1. is the group that acts on the quaternionic vielbeine and on
the sections introduced in the previous chapter.
3
In terms of a radial variable r 2 s 2 x , this yields the usual form of a cone metric
ds s dr 2 q r 2 h a b Ž x . dx adx b .
2
492 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
s xy1Dab DA A i a DB A j b q DA A j a DB A i b
, Ž 4.7 .
where we have used the definition
1
Dab s Va b y ´ k l Vag A k g
ž / žV bd Al d .
/ Ž 4.8 .
x
Observe that Dab is a projection operator, i.e. it satisfies Da b V bgDgd s yDa d , and it
projects onto the Ž2 n y 2.-dimensional subspace orthogonal to the A i a ,
Dab A i b s 0. Ž 4.9 .
Note that we have k DB DB Da b s k iBj DB Da b s 0, so that Da b is invariant under dilata-
tions and SUŽ2. transformations. One can also show that Dab DB A i b is horizontal, i.e.
k DB Da b DB A i b s k iBj Da b DB A i b s 0 . Ž 4.10 .
The identity Ž4.7. can be generalized to
1 1
xy1Dab DA A i a DB A j b s ´ i j GA B q R A B i j , Ž 4.11 .
2 2
where
GA B s xy1´ i j Da b DA A i a DB A j b . Ž 4.12 .
Observe that both GA B and R A B i j are of zero weight under the homothety and are
horizontal, i.e. they vanish upon contraction with any of the four vectors Ž k DA ,k iAj ., and
are thus orthogonal to the corresponding Žlocal. four-dimensional subspace.
The tensor GA B will provide a metric for Q 4 ny4 . The relation between GA B and the
¨
hyper-Kahler metric g A B is given by
1 1
gABs k DA k DB q k A i j k Bi j q x GA B
2x x
1
s k D A k D B q x VA i j V Bi j q GA B , Ž 4.13 .
2x
where we have used Ž3.30. and Ž3.31.. Observe that this relation reflects both the cone
¨
structure of the hyper-Kahler space and the SpŽ1. fibration of the tri-Sasakian space. It is
not possible to give an explicit expression for the inverse metric, at least not in general,
but this is not really needed in view of the horizontality of GA B . When acting on
horizontal tensors, x g A B acts as the inverse metric in view of the identity
GAC g C D GD B s xy1 GA B . Ž 4.14 .
b
We already showed that Dab DB A i was horizontal, and conversely, the horizontal
projection GA B g B C DC A i a is in the Ž2 n y 2.-dimensional eigenspace projected onto by
Dab . Therefore Da b DB A i b is a candidate for the quaternionic vielbein associated with
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 493
Q 4 ny4 and Dab projects onto the tangent space of Q 4 ny4 . More precisely, we introduce
the following related sets of 4 n y 4 vectors:
1 1
V̂Aai ' y V a b Dbg VAgi s y V a b Dbg DA A i g ,
x ' x '
1
gˆA i a ' GA B g iBa s ´ i j Da b Vˆ Aj b , Ž 4.15 .
'x
which satisfy algebraic relations that are completely analogous to those satisfied by the
¨
quaternionic vielbeine of the hyper-Kahler space. In particular we note that VˆA and gˆA
are each other’s inverse in the reduced 4 n y 4.-dimensional space,
Ž
1
D̂A k DB s xy1 y k D A k DB q k i j A k i j B ,
ž /
2
1 1 B
D̂A k iBj s xy1 y k D A k iBj q k i j A k DB y k Ak l ´ k Ž i k j . l .
ž / Ž 4.24 .
2 2
The above construction can be generalized to tensors H that carry also SUŽ2. indices,
indicating that they transform covariantly under target-space SUŽ2. transformations, e.g.
as in k kAl EA H i s dŽik ´ l . j H j in the simplest case. Then one can show that the derivatives
of these tensors are still horizontal, provided one covariantizes DˆA and includes an
SUŽ2. connection VA i j . The crucial identity for showing this is k iAj VAk l s dŽki d j.l .
With respect to the new connection, GA B is covariantly constant,
D̂C GA B s 0, Ž 4.25 .
so that the new connection must be just the Christoffel connection associated with GA B .
Likewise the tensors JA B i j are covariantly constant modulo a rotation that involves the
SpŽ1. connection,
¨
hyper-Kahler manifold is defined by lowering the upper index by means of the metric
g D E . Furthermore it satisfies the Bianchi identity Dˆ w A Rˆ B C x D E s 0.
Let us now calculate the Ricci tensor, which is symmetric by virtue of Ž4.25.,
Rˆ A B s x Rˆ AC B D g C D s y2 Ž n q 1 . GA B . Ž 4.29 .
¨
Observe that we used that the original hyper-Kahler manifold was Ricci flat and that
GA B g A B x s 4Ž n y 1.. We may also verify the expressions for the SpŽ1. holonomy
Rˆ A B C D g C E g D Fx 2 JEF
ij
s y4 Ž n y 1 . JAi Bj , Ž 4.30 .
where we used that the original hyper-Kahler¨ manifold has zero SpŽ1. holonomy. These
are the expected results w28,29x for a Ž4 n y 4.-dimensional quaternionic manifold with
SpŽ1. curvature given by Ž4.19..
This completes the discussion of target-space properties. We now return to aspects
¨
related to the SpŽ n. bundle over the special hyper-Kahler space. First of all we consider
a modification of the connection GA a b such that the modified derivative of a tensor
that is orthogonal to A i a remains orthogonal. This requires that this derivative acting on
A i a must be proportional to A i a itself. When combining this with a few other obvious
requirements 4 ,
a a
2
ĜA b s GA b y ´ i j A i Ž a DA A j g . q A i a A j g VAi j Vgb . Ž 4.31 .
x
With this modification, the tensors Vab and Ga b remain covariantly constant. The
presence of the term proportional to VAi j is required to preserve covariance with respect
to target-space SUŽ2. transformations. This term also ensures that the modification is
horizontal. With the modified connection we establish the required result,
1
D̂A A i a s EA ln x A i a q VA i k A l a ´ k l , Ž 4.32 .
2
where the last term can be interpreted as an SUŽ2. covariantization of the derivative on
the left-hand side. The result Ž4.32. suffices to show that the modified derivative of a
tensor that is orthogonal to A i a , will remain orthogonal. It is now obvious that the
projection operator Dab is covariantly constant under the modified derivative
D̂A Da b s 0. Ž 4.33 .
Including the modified connections GˆA B C and GˆA a b as well as the SUŽ2. connec-
tion VAi j, one can explicitly verify that DˆAVBai is equal to 12 EA ln x VBai , up to terms that
4
In determining the precise modifications of the various connections, we were also guided to some extent
by supersymmetry. However, this aspect is postponed to Section 5, where we outline the significance of the
results of this section in the context of the coupling of hypermultiplets to supergravity.
496 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
are proportional to A k a. This implies that the quaternionic vielbeine introduced in Ž4.15.
are covariantly constant with respect to the new connections, so that we have
Rˆ A B C D gˆa D i q Rˆ A B b
a gˆ Cib q R iAkB gˆ Cla ´ k l s 0 . Ž 4.35 .
Here Rˆ A B a b is the curvature associated with the new connection Ž4.31.. We can
explicitly evaluate this tensor,
2 1
R̂ A B a b s R A B a b q V agDgd ´ i j DA A i Ž d DB A j e . Deb y R Ai jB A i aA j g Vgb ,
x x
Ž 4.36 .
which indeed satisfies the first integrability relation. Note that all expressions appearing
in Ž4.36. are horizontal.
Now we recall that for a special hyper-Kahler¨ manifold the tensor Wabgd defined in
Ž2.16. satisfies the constraint
Wabgd A i d s 0 . Ž 4.37 .
With this in mind we write the new curvature tensors as follows:
1
Rˆ A B C D s ´ i j ´ k l VˆAai VˆBbj VˆCgk VˆDd l Wˆa bgd q GDw A GB xC
2
y 2 JAi Bj JC D i j q 2 JCwi j A JB x D i j ,
1
Va e Rˆ A B e b s y´ i j VˆAgi VˆBdj Wˆa bgd q 2 Da Ž g D d . b
2
q xy1 R iAjB A i gA j d Vga Vdb , Ž 4.38 .
where
Ŵabgd ' x Wa bgd . Ž 4.39 .
One can now verify that these curvatures satisfy also the second integrability condition
Ž4.35.. We will return to this and related issues in Section 5.
We close this section with a brief discussion of the isometries. For every tri-holomor-
¨
phic Killing vector of the special hyper-Kahler manifold we construct a corresponding
vector in the horizontal manifold Q 4 ny4 by the projection
kˆ I A s GA B k IB . Ž 4.40 .
By explicit calculation one can then show that DˆA kˆ IB q Dˆ B kˆ I A s 0, so that we have a
corresponding Killing vector in the horizontal space and thus an isometry. Observe that
¨
the SUŽ2. isometries of the special hyper-Kahler manifold do not generalize in this way,
because the corresponding kˆ I A would simply vanish. This is not so surprising, as the
SUŽ2. acts on the corresponding tri-Sasakian space through its SpŽ1. fibre.
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 497
kˆ IC EC JA B i j y 2 E w A kˆ IC JB xC i j s k IC EC JA B i j y 2 E w A k IC JB xC i j
k DB Dˆ B kˆ IA s k iBj Dˆ B kˆ IA s 0 . Ž 4.46 .
Finally the algebra of the isometries is governed by
kˆ IB E B kˆ JA y kˆ JB E B kˆ IA s yf I J K
kˆ KA q 2 JB C i j kˆ IB kˆ CJ k A i j . Ž 4.47 .
Hence the algebra of isometries is satisfied up to SUŽ2..
gauge fields associated with SUŽ2. = UŽ1., denoted by Vmi j Žantihermitian. and Am . The
fermionic gauge fields are the gravitino fields cmi and the fields fmi associated with
S-supersymmetry. Finally, the matter fields are Ta b i j Žantisymmetric and self-dual in
Lorentz indices and antisymmetric in SUŽ2. indices., a spinor x i and a real scalar D.
The fields vma b , fma and fmi are not independent and can be expressed in terms of the
other fields. We refer to w5,6x for more details on the notation and conventions.
The transformation rules have been given in previous sections, but will change in the
context of local supersymmetry. The most obvious change concerns the replacement of
the derivatives by derivatives that are covariant with respect to the additional gauge
symmetries. The derivatives covariant with respect to the bosonic gauge symmetries for
the scalar fields, the sections and the fermion fields, read
1
Dm f A s Em f A y bm k DA q Vmi k ´ jk k iAj y g WmI k IA ,
2
1
Dm A i a s Em A i a y bm A i a q Vm i j A j a y g Wm a b A i b q Em f A GAa b A i b ,
2
1 3 1
Dm z a s Em z a y vma b ga b z a y bm z a q iAm z a
4 2 2
a b A a b
y g Wm b z qEm f GA b z , Ž 5.1 .
where we have also included the terms related to possible gauged isometries. All
covariantizations follow straightforwardly from the formulae presented in Subsection 3.3
and from the gauge field conventions given in w5,6x. Observe that the derivative in Em f A
multiplying the connection GA a b does not require an additional covariantization.
The transformation rules under Q- and S-supersymmetry are now as follows:
df A s 2 g i Aa e zi a q gaA i e i z a ,
ž /
d A i a s 2 e i z a q 2 ´ i j G a bVbg e jz g y d Q f BGB a
b Ai b ,
dz a s Du A i ae i y d Q f BGB a
b z b q 2 g X a b A i b ´ i je j q A i a h i ,
dz a s Du Ai ae i y d Q f BGB a
b z b q 2 g X a b Ai b ´ i j e j q Ai a hi , Ž 5.2 .
where we have made use of the supercovariant derivatives Žwe also give the supercovari-
ant derivative of z a which is not needed above.,
Dm f A s Dm f A y g i Aa cmi z a y gaA i cm i z a ,
Dm A i a s Dm A i a y cm i z a y ´ i j G a bVbg cmj z g ,
1 1
Dm z a s Dm z a y Du A i acmi y
A af i . Ž 5.3 .
2 2 i m
We have verified that no further modifications of the fermionic transformation rules
beyond those given above are possible, assuming that the bosonic transformation rules
remain the same. One of the underlying reasons for the absence of additional terms may
be that the above rules are already consistent with rigid supersymmetry and with the
¨
case of a flat hyper-Kahler manifold which was taken as a starting point in w6x. All
additional modifications would thus have to vanish in the corresponding limits, while at
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 499
the same time one must preserve covariance under target-space diffeomorphisms and
fermionic frame reparametrizations. Therefore the possible modifications should be
proportional to the target-space curvature times the superconformal fields and, as it turns
out, it is difficult if not impossible to see how such terms could emerge. Given the fact
that the transformation rules take the same form, we expect the same situation for the
Lagrangian, where, again, it is difficult to construct suitable modifications that would
vanish in the appropriate limits.
Motivated by these considerations, we write down the Lagrangian by converting and
covariantizing the relevant equation Ž3.28. in Ref. w6x. Here we suppress the hypermulti-
plet auxiliary fields, as we no longer insist on off-shell supersymmetry for the hypermul-
tiplets. The result reads as follows, where the derivatives are all fully covariantized:
1 3
ey1 L s ´ i j Vab A i a D a Da q D A j bž /
2 2
1
q Vab 2 g 2 ´ i j A i a X b g X g d A j d q g A i a Y i j b g A j g
2
1
y ž z ay cmig mA i a /ž Gba Du z b
2
3 1
qVab ž ´ jkx j A k b y ´ jk Ta b jk g a bz b /
2 4
b
yg Vab V i ž g Ai g q 2 X b g z g / / q h.c.
1
q g Vab A i a V i b g z g q h.c. . Ž 5.4 .
2
After substituting the expressions for the dependent gauge fields fmi and fma in terms of
the other fields and dropping a total derivative, we write the Lagrangian as follows:
1 1 1
ey1 L s y Gab Dm A i b D mAi a q R Ga b A i bAi a q D Ga b A i bAi a
2 12 4
1
y Gab z a Duz b q z b Duz a y Wa bgd z agm z b z gg mz d
ž /
4
1 1
q Gab y žA i bAi a ey1´ mnrscm jgn Dr csj q A i bAi a c j m g mx j
12 8
1 1
y A k bA k a cmi cn j Timn ia b
j y A z xi q V ag Ggl z bg a b Ta b i j ´ i jz l
48 16
2 1
qz bg m Du Ai acm i y Ai a z b g mn Dm cn i q Ai a z bg a b Ta b i jg mcmj
3 24
1
y ey1´ mnrs cmign cr j A i b Ds A j a
4
1
y z bg mg ncm i cniz a q ´ i j V a r Grl cn j z l q h.c.
ž / / . Ž 5.5 .
2
500 B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502
Here we did not include the terms related to gauged isometries. To incorporate those one
includes the relevant terms into the covariant derivatives and adds the following
g-dependent terms to the Lagrangian:
1
ey1 Lg s 2 g 2 Ga b Ai a X b g X g d Adi q g A i a Va b Y i j b g A j g
2
1 1
y cmig mV k a b Vag A i g A k b y cmig mncnk A k b
Vag A i g X a b q h.c. .
2 2
Ž 5.6 .
As mentioned above, these results are in agreement with the action presented in
Subsection 2.2 as well as with the results of w6x in the appropriate limits. In addition we
performed a number of independent checks on Ž5.5. and Ž5.6.. For instance, because the
superalgebra closes only modulo the field equations for the fermion fields z a and z a ,
we have calculated these field equations from the supersymmetry transformation rules
Ž5.2.. As it turns out the result is in agreement with the field equations derived from the
action.
The above action is invariant under all superconformal symmetries. In particular the
scalar fields are subject to dilatations and to SUŽ2. transformations. Ignoring the
contributions from the vector multiplets, which are essential for obtaining the complete
and consistent action for Poincare´ supergravity coupled to vector multiplets and hyper-
multiplets, but which do not affect the target-space geometry of the hypermultiplets, we
express the bosonic terms in scale-invariant quantities, by introducing a normalized
section
Aˆi a s xy1r2 A i a , Ž 5.7 .
which satisfies Vab ˆ a ˆ b
A i A j s ´ i j . Similarly we redefine the various other fields, such
as the vierbeine, spin connection, etc., by a x-dependent scale transformation. The result
for the bosonic terms then takes the form
1 1
L s y e Vab ´ i j Dm Aˆi a D mAˆj b y R y D , Ž 5.8 .
2 3
where R is the Ricci scalar of the space-time. Suppressing possible gauged isometries
for convenience, this results in
1 1 1
L s y e Vab ´ i j Em f A DA Aˆi a q Vm i k Aˆk a E mf B DB Aˆj b q V m j lAˆl b
ž /ž /
2 2 2
1 1
q e Rq e D . Ž 5.9 .
6 2
The field equations for the SUŽ2. gauge fields Vm i j yield,
Vm i j s y2 Em f A VA i k ´ k j . Ž 5.10 .
This result can be substituted back into the Lagrangian, which then reads
1 1 1
L s y e GA B Em f A E mf B q e R q e D , Ž 5.11 .
2 6 2
B. de Wit et al.r Nuclear Physics B 568 [PM] (2000) 475–502 501
so that the target-space metric GA B corresponds indeed to the quaternionic space which
we constructed in Section 4. The terms with the Ricci scalar and the auxiliary field D
combine with similar terms from the Lagrangian of the vector multiplets to give the
Einstein–Hilbert action.
The material derived in Section 4 now fits in nicely with what is known about the
general coupling of hypermultiplets to supergravity w2x. First of all, the quantity Dab
projects out precisely the S-invariant hypermultiplet spinors which thus describe 2 n y 2
physical spinors after modding out the S-supersymmetry. Hence, the non-linear sigma
model comprises precisely the expected 4 n y 4 scalars and 2 n y 2 spinors. The relevant
quaternionic vielbeine have already been defined in Ž4.15., but can equally well be
obtained from working out the above Lagrangian after removing the appropriate gauge
degrees of freedom We will list a number of relevant identities, which all follow from
Section 4,
1 1
Dab VˆAai VˆBbj s ´ i j GA B q RA Bi j ,
2 2
GA B gˆaA i gˆbB j s ´ i j Da b ,
Acknowledgements
Theoretical Physics in Utrecht and the AEI for their hospitality. This work is supported
in part by the European Commission TMR programmes FMRX-CT96-0012, in which
the Albert Einstein Institut and the University of Wales in Swansea participate, and
ERBFMRX-CT96-0045, in which Utrecht University participates.
References
Abstract
r
The chiral WZNW symplectic form V chir is inverted in the general case. Thereby a precise
r
relationship between the arbitrary monodromy dependent 2-form appearing in V chir and the
exchange r-matrix that governs the Poisson brackets of the group valued chiral fields is
established. The exchange r-matrices are shown to satisfy a new dynamical generalization of the
classical modified Yang–Baxter ŽYB. equation and Poisson–Lie ŽPL. groupoids are constructed
that encode this equation analogously as PL groups encode the classical YB equation. For an
arbitrary simple Lie group G, exchange r-matrices are found that are in one-to-one correspon-
dence with the possible PL structures on G and admit them as PL symmetries. q 2000 Elsevier
Science B.V. All rights reserved.
1. Introduction
This paper contains a systematic analysis of the classical phase space that arises from
the chiral separation of the degrees of freedom in the Wess–Zumino–Novikov–Witten
ŽWZNW. model w1x. The WZNW model occupies a central position in conformal field
theory w2x. Various structures that emerged from its study play an increasingly important
ˆ in other areas of theoretical physics and in mathematics as well w3x. Among these
role
1
Corresponding author.
´ ..
E-mail address: lfeher@sol.cc.u-szeged.hu ŽL. Feher
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 7 3 8 - 5
504 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
structures are the quadratic exchange algebras that encode the Poisson brackets ŽPBs. of
the chiral group valued fields, g C Ž x C . for C s L, R, which yield the general solution of
the WZNW field equation as g Ž x L , x R . s g LŽ x L . gy1
R
Ž x R .. These exchange algebras
were investigated intensively at the beginning of the decade w4–15x motivated by the
idea to understand the quantum group properties of the WZNW model w16x directly by
means of canonical quantization w17–21x. In accordance with the general philosophy of
quantum groups w22x, the Poisson–Lie ŽPL. symmetries of the chiral fields should be the
most relevant in this respect.
The chiral WZNW Poisson structures found in the literature have the form
m 1
½g C 5
Ž x . , gC Ž y . s Ž gC Ž x . m gC Ž y . . Ž rˆ q 12 Iˆsign Ž y y x . . ,
kC
0 - x , y - 2p , Ž 1.1 .
ˆ
where I is given by the quadratic Casimir of the simple Lie algebra, G , of the WZNW
ˆ The choice of the PB is
group, G, and the interesting object is the ‘exchange r-matrix’ r.
¨
highly non-unique due to the fact that the g C are determined by the physical field g
only up to the gauge freedom g C g C p for any constant p g G. In general rˆ may
depend on the monodromy matrix M of the chiral fields, which satisfy g C Ž x q 2p . s
g C Ž x . M. There are two qualitatively different cases that correspond to building the
WZNW field out of chiral fields with diagonal monodromy Ž‘Bloch waves’. or out of
fields with generic monodromy.
For Bloch waves w8–10x, the Poisson structure is essentially unique and the associated
r-matrix is a solution of the so-called classical dynamical Yang–Baxter ŽCDYB.
equation, which has recently received a lot of attention Žsee e.g. the review in Ref. w23x..
For chiral fields with generic monodromy, it has been argued in w11,14x that the
possible exchange r-matrices should correspond to certain local differential 2-forms r
on open domains Gˇ ; G, whose exterior derivative is the 3-form that occurs in the
WZNW action. The precise connection between r and rˆ has not been elaborated, and in
most papers dealing with generic monodromy actually only those very special cases
were considered for which rˆ is a monodromy independent constant. In these cases rˆ is
necessarily a solution of the classical modified YB equation on the Lie algebra G with a
certain definite normalization ŽEq. Ž3.71. with Ž3.64... This is a nice situation since if
the same r-matrix is used to equip G with a PL structure, then the gauge action of G on
the chiral WZNW field defines a PL symmetry. However, this mechanism of PL
symmetry is not available in the physically most interesting case of a compact Lie
group, because the relevant normalization admits no constant r-matrix for a compact G .
Thus, in addition to the problem to understand the case of a general r , an interesting
question is whether the exchange r-matrix can be chosen for a compact group in such a
way to admit a PL symmetry on the chiral WZNW phase space.
In this paper we study the family of chiral exchange algebras Ž1.1. concentrating on
the case of generic monodromy Žfor a related investigation of Bloch waves, see Ref.
w24x.. Our main results are the following.
The above mentioned results have been announced in w26x without proofs. In addition
to their detailed account, several other technical results can be found in this paper. The
systematic exposition of the subject and the numerous examples that we present may be
useful as a starting point for future studies.
The organization of the rest of the paper is as follows. In Section 2 a necessary
review of the chiral separation of the WZNW phase space is presented. Section 3
contains a detailed account of the inversion of the possible symplectic structures on the
chiral WZNW phase space, leading to the exchange algebra Ž1.1.. Here many interesting
additional issues are considered as well. In Section 4 an alternative, shorter but less
rigorous, derivation of the general exchange algebra is given, and a quick derivation of
the exchange algebra of Bloch waves is also included. Section 5 is devoted to a general
outline of the PL symmetries of the exchange algebra, and in particular to the exchange
r-matrices for which the standard gauge action of G on the chiral WZNW field yields
such a symmetry. Section 6 deals with the interpretation of the chiral WZNW Poisson
structures in terms of PL groupoids. The paper ends with a discussion, and there are also
two appendices containing some examples and the details of a proof.
In this section we review the structure of the WZNW Hamiltonian system concentrat-
ing on the possible symplectic forms on the chiral extension of its solution space, which
are examined throughout the paper. The presentation closely follows the line of thought
found in w14x.
We consider a simple, real or complex, Lie algebra, G , with a corresponding
connected Lie group, G, and identify the phase space of the WZNW model associated
with the group G as
M s T ) G˜ s Ž g , JL . < g g G,
½ ˜ JL g G˜ , 5 Ž 2.1 .
where G˜ s C S ,G . is the loop group and G˜ s C S , G . is its Lie algebra. The
`Ž 1 `Ž 1
G-valued Žresp. G-valued. functions on the real line R. The phase space is equipped
with the symplectic form
2p k 2p X
V ksd H0 d s Tr Ž JL dg gy1 . q H0 d s Tr Ž dg gy1 . n Ž dg gy1 . Ž 2.2 .
2
with some constant k . Here prime denotes derivative with respect to the space variable,
s g R, and for any A, B g G Tr Ž AB . denotes a fixed multiple of the Cartan–Killing
scalar product. If g and the ‘left-current’ JL serve as coordinates on M , then the
‘right-current’ is given by
JR s ygy1 JL g q k gy1 g X , Ž 2.3 .
and in the alternative variables Ž g, JR . the symplectic form reads
2p k 2p X
V k s yd H0 d s Tr Ž JR gy1 dg . y H d s Tr Ž gy1dg . n Ž gy1 dg . . Ž 2.4 .
2 0
Although the expression of V k appears rather formal at first sight, it can be used to
unambiguously associate Hamiltonian vector fields and PBs with a set of admissible
functions, which include, for example, the Fourier components of g, JL and JR . We do
not elaborate the precise meaning of the symplectic form here, since this is a standard
matter in the context of the full WZNW model, but will face the analogous issue in the
chiral context later, where it is much less understood. The only point that we wish to
note is that in the case of a complex Lie algebra the admissible functions depend
holomorphically on the matrix elements of g, JL , JR in the finite-dimensional irre-
ducible representations of G, and G˜ = G˜ is then a model of the holomorphic cotangent
bundle.
The phase space M represents the initial data for the WZNW system, whose
dynamics is generated by the Hamiltonian
1 2p
H WZ NW s H0 d s Tr Ž JL2 q JR2 . . Ž 2.5 .
2k
Denoting time by t and introducing lightcone coordinates as
x L :s s q t , x R :s s y t ,
E E
EL s s 12 Ž Es q Et . , ER s s 12 Ž Es y Et . , Ž 2.6 .
E xL E xR
Hamilton’s equation can be written in the alternative forms w1x
kE L g s JL g , E R JL s 0 m kE R g s gJR , E L JR s 0. Ž 2.7 .
Let M sol be the space of solutions of the WZNW system. M sol consists of the
smooth G-valued functions g Ž s ,t . which are 2p-periodic in s and satisfy E R Ž E L g gy1 .
s 0. The general solution of this evolution equation can be written as
g Ž s ,t . s g LŽ x L . gy1
R Ž xR . , Ž 2.8 .
where Ž g L , g R . is any pair of G-valued, smooth, quasiperiodic function on R with equal
monodromies, i.e. for C s L, R one has g C Ž x C q 2p . s g C Ž x C . M with some C-inde-
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 507
™
Notice that Mˆ is a principal fibre bundle over M sol with respect to the above action of
G. The projection of this bundle, q : Mˆ M sol , is given by
q :Ž gL , gR . ¨ g s g L gy1
R i.e. g Ž s ,t . s g LŽ x L . gy1
R Ž xR . . Ž 2.11 .
We can identify M with M sol by associating the elements of the solution space with
their initial data at t s 0. Formally, this is described by the map
i : M sol ™M ,
i: M sol
2 g Ž s ,t . ¨ Ž g Ž s ,0. , J Ž s . s Ž kE g g
L L
y1
. Ž s ,0 . . g M . Ž 2.12 .
k.
Obviously, i )Ž
V is then the natural symplectic form on the solution space. Explicitly,
2p
Ž i )V k . Ž g . s yk d H d s Tr Ž gy1E R g gy1dg .
ž 0
1 2p
q
2
H0 d s Tr Ž gy1 dg . n Es Ž gy1 dg .
/ Ž 2.13 .
™M
ts0
y 12 Tr Ž Ž gy1 y1
C dg C . Ž 0 . n dMC MC . ,
MC s gy1
C Ž x . g C Ž x q 2p . . Ž 2.17 .
This crucial formula of Vˆ was first obtained by Gawedzki w11x.
k
It is clear from its definition that d Vˆ k s 0, but Vˆ k is not a symplectic form on Mˆ,
™
since it is degenerate. Of course, its restriction to any Žlocal. section of the bundle
q : Mˆ M sol is a symplectic form, since such sections yield Žlocal. models of M sol. On
the other hand, one can check that V chir has a non-vanishing exterior derivative w11x:
d V chir Ž g C . s y 16 Tr Ž My1 y1 y1
C dMC n MC dMC n MC dMC . . Ž 2.18 .
508 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
Although this cancels from d Vˆ k , since ML s MR for the elements of Mˆ, it makes the
chiral separation of the WZNW degrees of freedom a very non-trivial and interesting
problem.
The problem of the chiral separation can be described as follows w14x. First, recall
that the chiral currents JC Ž C s L, R . generate two commuting copies of the non-twisted
affine Kac–Moody ŽKM. algebra of G and the WZNW field Ž2.8. is a KM primary field
under the Poisson bracket defined by the symplectic form on M sol. In fact, by defining
Fourier components as
2p
JCa , n :s H0 dx C eyi n x C Tr Ž T a JC . Ž x C . Ž 2.19 .
using a basis 2 T a of G , it can be derived from Ž2.13. that the currents satisfy
JCa , m , JCb , n 4 s fgab JCg , mqn q 2 ipkC m dm ,y n I ab , JLa , m , JRb , n 4 s 0, Ž 2.20 .
and
g Ž x L , x R . , JLa , n 4 s eyi n x T a g Ž x L , x R . ,
L
g Ž x L , x R . , JRa , n 4 s yeyi n x g Ž x L , x R . T a .
R
Ž 2.21 .
Second, the currents almost completely determine the chiral WZNW fields g C , and thus
also g s g L gy1
R , by means of the differential equations
k C EC g C s JC g C for C s L, R. Ž 2.22 .
Thus it appears an interesting possibility to construct the WZNW model as a reduction
of a simpler model, in which the left and right-moving degrees of freedom would be
separated in terms of completely independent chiral fields g L and g R regarded as
fundamental variables. It is clear that the solution space of such a chirally extended
model must be a direct product of two identical but independent spaces, i.e. it must have
the form
Mˆ ext :s ML = MR Ž 2.23 .
with
MC :s g C < g C g C` Ž R,G . , g C Ž x q 2p . s g C Ž x . MC , MC g G 4 . Ž 2.24 .
Ideally, the space Mˆ should be endowed with such a symplectic structure, ˆ
ext
that k
V ext ,
reduces to Vˆ k on the submanifold Mˆ; Mˆ ext defined by the periodicity constraint
ML s MR . Ž 2.25 .
ˆ k
It is easy to see that these requirements force V ext to have the following form:
Vˆ ext
k r
Ž g L , g R . s k L V chir r
Ž g L . q k R V chir Ž gR . , Ž 2.26 .
where
r
V chir Ž g C . s V chir Ž g C . q r Ž MC . Ž 2.27 .
with some 2-form r depending only on the monodromy of g C . Since in the extended
2
We have I a b :s Tr ŽT a T b . and w T a ,T b x s fgab T g with summation over coinciding indices.
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 509
r .
model the factors Ž MC , k C V chir should be symplectic manifolds separately, we have to
satisfy the condition
r
d V chir s y 16 Tr Ž MCy1 dMC n MCy1dMC n MCy1 dMC . q d r Ž MC . s 0. Ž 2.28 .
The problem now arises from the well-known fact that no globally defined smooth
2-form exists on G that would satisfy this condition for all MC g G.
There are two rather different wayouts from the above difficulty w14x. The first is to
restrict the possible domain of the monodromy matrix MC to some open submanifold in
G on which an appropriate 2-form r may be found. We refer to a choice of such a
domain and a r as a chiral extension of the WZNW system, and will explore the
structure of the associated PB in the subsequent sections.
The second possibility is to restrict the domain of the allowed monodromy matrices
much more drastically from the beginning, in such a way that after the restriction d V chir
vanishes, whereby the difficulty disappears. For example, one may achieve this by
restricting the monodromy matrices to vary in a fixed maximal torus of G, which
amounts to constructing Ža subset of. the solutions of the WZNW field equation in terms
of chiral ‘Bloch waves’. This second possibility is especially natural in the case of
compact or complex Lie groups, for which there is only one maximal torus up to
™
conjugation. The restriction to Bloch waves is equivalent to a partial Žand local. gauge
fixing of the bundle q : Mˆ M sol. The resulting symplectic form is studied in detail in
w24x.
We here investigate the structure of the chiral WZNW phase space MC introduced in
Section 2. The analysis is the same for both chiralities, C s L, R, and we simplify our
notation by putting Mchir for MC and g, M, J, k for g C , MC , JC , k C , respectively. We
assume that the monodromy matrix M is restricted to some open submanifold Gˇ ; G on
which a smooth 2-form r is chosen in such a way that Ž2.28. holds. The domain in
Mchir that corresponds to M g Gˇ is denoted by Mˇchir . It turns out that kV chir
r
, defined by
ˇ
Ž2.27. with Ž2.17., is non-degenerate if G is appropriately chosen Žso that Eq. Ž3.34. has
a smooth, unique solution., and we shall describe the general features of the PBs on
Mˇchir associated with this symplectic form. We will then consider examples, in particular
the choices of r introduced in w14x that lead to Poisson–Lie symmetry on the full Mchir .
r
Before we can turn to the task of inverting kV chir , we need to set up some
conventions.
An element A g G has the components Aa s Tr Ž ATa . and A a s Tr Ž AT a . with
respect to dual bases Ta and T b of G :
Tr Ž Ta T b . s dab , I ab s Tr Ž T a T b . , Ia b s Tr Ž Ta Tb . . Ž 3.1 .
510 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
We will use I ab and Iab to raise and lower Lie algebra indices. Given a matrix Qa b ,
we can define an operator Q g EndŽ G . and an element Qˆ g G m G by
Q Ž A . s Ta Q abAb , Qˆ s Q ab Ta m Tb . Ž 3.2 .
The matrix I ab defines the identity operator I, and Iˆs Ta m T a is the ‘tensor Casimir’.
For any M g G, the matrix of the linear operator Ad M on G , which we write as
ŽAd M .Ž A. s MAMy1 , will be denoted as
To make sure that the curve passes through g g Mchir at t s 0, we require g Ž x,0. s g Ž x ..
A vector X w g x at g g Mchir is obtained as the velocity to the curve at t s 0, encoded by
the G-valued, smooth function
d
j Ž x . :s gy1 Ž x . g Ž x ,t . . Ž 3.13 .
dt ts0
It is useful to note that, due to the analogous property of g, the function j on R may be
reconstructed from its restriction to w0,2 p x. The monodromy properties of j Ž x . can be
derived by taking the derivative of Ž3.12.,
j X Ž x q 2p . s My1j X Ž x . M. Ž 3.14 .
This can be solved in terms of a smooth, 2p-periodic G-valued function, X J Ž x ., and a
constant Lie algebra element, j 0 , as follows:
x
y1
j Ž x. sj0q H0 dy g Ž y. XJ Ž y. g Ž y. . Ž 3.15 .
¨¨
A vector field X on Mchir is an assignment, g X w g x, of a vector to every point
g g Mchir . It acts on a differentiable function, g F w g x, on Mchir by the definition
d
XŽ F.w gxs F w gt x , g t Ž x . s g Ž x ,t . . Ž 3.16 .
dt ts0
¨
by the pair Ž j 0 , X J Ž x . . , we can specify a vector field by the assignments g j 0 w g x g G
and g X J w g x g G˜ . Of course, the evaluation functions F x w g x:s g Ž x . and F x w g x:s
¨
J Ž x . are differentiable with respect to any vector field, and their derivatives are given by
XŽ g Ž x . . s g Ž x . j Ž x . , XŽ J Ž x . . s k X J Ž x . , Ž 3.17 .
which clarifies the meaning of X J as well. It is also obvious from Ž3.11. that the
monodromy matrix yields a G-valued differentiable function on Mchir , g M s
gy1 Ž x . g Ž x q 2p ., whose derivative is characterized by the G-valued function
¨
X Ž M . My1 s Mj Ž x q 2p . My1 y j Ž x . . Ž 3.18 .
Having defined vector fields, one can now introduce differential forms as usual. We
only remark that by Ž3.17. evaluation 1-forms like dg Ž x ., dJ Ž x . or Ž gy1 dg .X Ž x . are
perfectly well defined:
dg Ž x . Ž X . s X Ž g Ž x . . s g Ž x . j Ž x . , dJ Ž x . Ž X . s X Ž J Ž x . . s k X J Ž x . ,
X
Ž gy1 Ž x . dg Ž x . . Ž X . s j Ž x . . X
Ž 3.19 .
Now we turn to the following problem. For a fixed Žscalar. function F on the phase
space Mˇchir , we are looking for a corresponding vector field, Y F, satisfying
r
X Ž F . s kV chir Ž X ,Y F . Ž 3.20 .
512 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
for all vector fields X. Notice that Y F does not necessarily exist for a given F. We say
that F is an element of the set of admissible Hamiltonians, denoted as H, if the
q Tr Ž j 0 Ž B Y Ž M . y MB Y Ž M . My1 . . Ž 3.24 .
with
B Y Ž M . :s q ab Ž M . TaTr Ž Tb My1 Y Ž M . . . Ž 3.25 .
Of course, all the expressions in the above formulae are functions of g g Mˇchir .
Let us now suppose that F g H and apply the above formula to Y:s Y F. Then the
form of the right-hand side of Ž3.20. implies that there must exist a smooth G-valued
function on R, A F Ž x ., and a constant Lie algebra element, a F, such that for any vector
field X
2p
X Ž F . sk H0 dx Tr Ž X J Ž x . A F Ž x . . q k Tr Ž j 0 a F . . Ž 3.26 .
¨ ¨
This means that F g H must have an exterior derivative parametrized by the assignments
g A F Ž x .w g x and g a F w g x. On the other hand, if F is such that Ž3.26. holds, then
we may try to solve Ž3.20. for the Hamiltonian vector field. Using the parametrization of
Y by h Ž x ., this leads to the following two equations:
h Ž x . y 12 My1 Y Ž M . q B Y Ž M . s gy1 Ž x . A F Ž x . g Ž x . , Ž 3.27 .
and
My1 Y Ž M . q Y Ž M . My1 y 2 B Y Ž M . q 2 MB Y Ž M . My1 s y2 a F . Ž 3.28 .
X
Now it is clear that h x can be directly read off from 3.27 . From the identity
Ž . Ž .
hX Ž x . s gy1 Ž x .YJ Ž x . g Ž x ., we then obtain that
X 1
YJ Ž x . s A F Ž x . q AF Ž x . , J Ž x . . Ž 3.29 .
k
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 513
This not only gives us the explicit formula of YJ , but also means that any F g H must be
such that the right-hand side of Ž3.29. defines a 2p-periodic smooth function of x.
Incidentally, this is equivalent to the monodromy condition Ž3.14. applied to the
Hamiltonian vector field Y. To proceed further, we use
Y Ž M . My1 s Mh Ž 2p . My1 y h Ž 0 . . Ž 3.30 .
This implies that Eqs. Ž3.27. and Ž3.28. are not linearly independent, and they can be
simultaneously solved for h Ž x . only if one has
a F s gy1 Ž 0 . A F Ž 0 . y A F Ž 2p . g Ž 0 . . Ž 3.31 .
We now also see that the pair Ž A F,a F . is uniquely determined for any F g H. Indeed,
the restriction of A F to w0,2p x is completely fixed by Ž3.26., and is uniquely extended to
a function on R on account of the periodicity of the expression in Ž3.29..
To summarize, we have shown that every element F g H must satisfy the three
conditions 3 expressed by Ž3.26., the periodicity of YJ Ž x . in Ž3.29., and Ž3.31.. Con-
versely, it turns out that these conditions characterize H. In fact, if these conditions are
satisfied then the solution of Ž3.27., Ž3.28. for h is given by
h Ž x . s gy1 Ž x . A F Ž x . g Ž x . y 12 a F q r Ž M . Ž a F . , Ž 3.32 .
where
r Ž M . Ž a F . s Ta r ab Ž M . abF Ž 3.33 .
ab
and the matrix r is defined as the solution of the linear equation
r ab q Ž Wg a y 2 qg a y 2 q au Wgu . r gb s 12 I a b y 12 W ba q q a b y qg a W bg . Ž 3.34 .
y1 Ž FŽ
This formula of h Ž x . s g x .Y g Ž x .. is one of the main results in this paper.
Some remarks are here in order. First, in Ž3.34. we suppressed the M-dependence of
the various matrices like qg a Ž M . s q ba Ž M . Ibg and Wg a s Tr ŽTg MT a My1 .. Second,
in terms of the notations given at the beginning of the section, r Ž M . is the linear
operator on G associated with the matrix r ab Ž M .. By introducing now the operators
r " Ž M . and q " Ž M . that correspond to the matrices
ab
r" Ž M . s r a b Ž M . " 12 I a b , ab
q" Ž M . s q a b Ž M . " 12 I a b , Ž 3.35 .
Eq. Ž3.34. can be rewritten, in fact, in the following equivalent form:
qq Ž M . ( ry Ž M . s qy Ž M . (Ad Ž My1 . ( rq Ž M . . Ž 3.36 .
The solution can be formally written as
y1
r Ž M . s 12 Ž qq Ž M . y qy Ž M . (Ad Ž My1 . .
( Ž qq Ž M . q qy Ž M . (Ad Ž My1 . . . Ž 3.37 .
This shows that one must define the domain Gˇ in such a way that the inverse operator
above exists, which is always possible since it becomes the identity operator at
M s e g G. Then it is easy to see that Ž3.37. yields a smooth, antisymmetric matrix
ˇ By choosing Gˇ ; G appropriately, hence we may indeed
function r ab Ž M . on G.
3
These conditions do not depend on the 2-form r , a reason for this is described at the end of Subsection
3.5.
514 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
¨
r
modified classical Yang–Baxter equation. Incidentally, kV chir is symplectic Ži.e. non-
degenerate. in the sense that it permits to unambiguously determine the map H2 F Y F,
as we just saw, and H will turn out to contain a ‘complete set of functions’ on Mˇchir .
Finally, as for the derivation of Eq. Ž3.34., note that one may arrive at the special
form of the integration constant in formula Ž3.32. of h Ž x . by the expectation of a
classical exchange algebra type PB for the field g Ž x ., or simply by inspecting the
equations that result if one writes h Ž x . s gy1 Ž x . A F Ž x . g Ž x . q constant. After introduc-
ing Ž3.32. as an ansatz, it is not difficult to verify that Ž3.27. and Ž3.28. reduce to Ž3.34..
Below we describe a large set of functions that are admissible Hamiltonians and
apply the result in Ž3.32. to find their Hamiltonian vector fields. We shall also discuss
the interpretation of these Hamiltonian vector fields in terms of PBs, in particular we
shall see that the field g Ž x . is subject to a quadratic exchange algebra.
Let us first study functions that depend on g only through the current J s k g X gy1 .
y
Of course, the evaluation functions Fay w g x s Ja Ž y . do not belong to H, since A Fa Ž x . in
Ž3.26. would not be a smooth function of x. Therefore we consider the ‘smeared out’
version
2p
Fm :s H0 dx Tr Ž m Ž x . J Ž x . . , Ž 3.39 .
where m Ž x . is a G-valued, smooth, 2p-periodic test function. In this case we find that
A Fm Ž x . s m Ž x . , a Fm s 0. Ž 3.40 .
The conditions expressed by Ž3.26., Ž3.29. and Ž3.31. are trivially satisfied and thus
Fm g H. The parameter h Ž x . of the Hamiltonian vector field Y Fm is h Ž x . s
gy1 Ž x . m Ž x . g Ž x ., whence
Y Fm Ž g Ž x . . s m Ž x . g Ž x . ,
Y Fm Ž J Ž x . . s m Ž x . , J Ž x . q kmX Ž x . , Y Fm Ž M . s 0. Ž 3.41 .
This shows in particular that Fm generates an infinitesimal action of the loop group on
the phase space with respect to which g Ž x . is an affine KM primary field, and the KM
current J Ž x . transforms according to the coadjoint action of the Žcentrally extended.
loop group, as expected. Naturally, the local functionals of J defined as the integral
over w0,2 p x of any differential polynomial in the components of J, with periodic,
smooth test function coefficients, also belong to H; the corresponding Hamiltonian
vector fields are easy to determine.
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 515
Now we study some non-local functionals of the current. Let E g G denote the path
ordered exponential integral of J Ž x . over w0,2p x. More precisely, we put E :s EŽ2p .,
where EŽ x . g G is defined as the solution of
k EX Ž x . s J Ž x . E Ž x . with E Ž 0 . :s e g G. Ž 3.42 .
Let w be an arbitrary smooth function on G. Introduce a corresponding function, F , on
the phase space by
F w g x :s w Ž E . . Ž 3.43 .
From the well-known formula of the variation of EŽ x ., we obtain that
1
AF Ž x . s Ž E Ž x . T a Ey1 Ž x . . Ž Ra w . Ž E . , aF s 0, Ž 3.44 .
k
where Ra is defined in Ž3.6.. It follows that the conditions imposed by Ž3.26. and Ž3.29.
are satisfied, actually from Ž3.29. we get Y F Ž J Ž x .. s 0. However, F does not belong
to H in general. By means of Ž3.44. we get that
1
AFa Ž 0 . y AFa Ž 2p . s Ž Day w . Ž E . , Ž Days Ra y La . . Ž 3.45 .
k
Because of the condition Ž3.31., this means that F g H precisely if w is an inÕariant
function on G with respect to the adjoint action of G on G. Examples of invariant
functions are furnished by the trace of E k Ž k s 1,2, . . . . in some representation. That
only the invariant functions of E are admissible is a well-known result in the context of
current algebras, where they provide the Casimir functions of J. In our context, we
obtain from the above that for an invariant function w
1
YF Ž g Ž x. . s g Ž x . T a Ž Ra w . Ž M . , Y F Ž J Ž x . . s 0, Y F Ž M . s 0.
k
Ž 3.46 .
To derive these, we used that, since g and E satisfy the same differential equation,
g Ž x . s EŽ x . g Ž0.. Hence M s gy1 Ž0. E g Ž0., and for an invariant function
gy1 Ž 0 . T a g Ž 0 . Ž Ra w . Ž E . s T a Ž Ra w . Ž M . . Ž 3.47 .
The monodromy matrix M is not a function of J alone, but its invariant functions
coincide with those of E , and we have just seen that these functions belong to H. Let us
now take an arbitrary smooth function, c , on G and associate with it a function, C , on
the chiral WZNW phase space by C w g x:s c Ž M .. Using Ž3.18. and the definition of the
parameter of a vector field, Eq. Ž3.15., one gets
1 1
AC Ž x . s Ž g Ž x . T a gy1 Ž x . . Ž Ra c . Ž M . , aC s T a Ž Day c . Ž M . .
k k
Ž 3.48 .
CŽ
It follows that C g H. For the Hamiltonian vector field we obtain Y J Ž x .. s 0 and
1 1
gy1 Ž x . Y C Ž g Ž x . . s T a Ž Daq c . Ž M . q Ta r a b Ž M . Ž Dby c . Ž M . .
2k k
Ž 3.49 .
516 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
Let us elaborate this for the functions defined by the matrix elements of M in some
representation L of G. We denote these matrix elements as MiLj and denote by g iLj Ž x .
the corresponding matrix element of g Ž x .. Now we shall use rˆŽ M . in Ž3.38. and
ˆ
rˆ" Ž M . s rˆ Ž M . " 12 I, M1 s M m 1, M2 s 1 m M. Ž 3.50 .
Then Ž3.49. can be rewritten in the tensorial form
L 1 L
Y M k l Ž g iLj Ž x . . s
k
Ž g Ž x . m M Qˆ Ž M . . i k , jl , Ž 3.51 .
where
Qˆ Ž M . :s rˆq Ž M . y My1
2 rˆy Ž M . M 2 Ž 3.52 .
is taken in the corresponding representation of G , and our notation is Ž K m L. i k, jl s
K i j L k l . Furthermore, we obtain
L 1 L
Y M k l Ž MiLj . s
k
Ž M m M Dˆ Ž M . . i k , jl Ž 3.53 .
with
Dˆ Ž M . :s Qˆ Ž M . y My1 ˆ
1 Q Ž M . M1 . Ž 3.54 .
We shall comment on the interpretation of these equations later on.
The PB of two smooth functions F1 and F2 on a finite-dimensional smooth
symplectic manifold is defined by the standard formula
F1 , F2 4 s Y F 2 Ž F1 . s yY F1 Ž F2 . s V Ž Y F 2 ,Y F1 . , Ž 3.55 .
where Y F i is the Hamiltonian vector field associated with Fi by the symplectic form V .
The so obtained Poisson algebra is closed under pointwise multiplication of the
functions as well as under the PB. One may formally apply the same formula in the
infinite-dimensional case to the admissible smooth functions that possess a Hamiltonian
vector field. However, it then may be a very non-trivial problem to fully specify the set
of functions that form a closed Poisson algebra, and are a complete set in the sense that
they separate the points of the phase space. In our case, it is clear from the foregoing
formulae that the products of local functionals of the current J and the smooth functions
of the monodromy matrix M form two subsets of H that are separately closed under the
PB. Moreover, these two subsets commute with each other under the PB Žthey should
clearly be each others centralizer in an appropriate Poisson algebra.. But they do not
form a complete set of functions on our phase space, since the fundamental field g Ž x .
cannot be completely reconstructed out of J Ž x . and M.
™
Let us again consider a representation L:G GLŽ V . of G. Since the evaluation
functions Fi xj w g x s g iLj Ž x . are not elements of H, we smear out the local field and define
2p
Ff w g x :s H0 dx Tr Ž f Ž x . g L Ž x . . , Ž 3.56 .
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 517
where f :R ¨ EndŽV . is a smooth test function. It is then easy to see from Ž3.26. that
1 2p
A Ff Ž x . s g Ž x . T a gy1 Ž x . Hx dy Tr Ž f Ž y . g L Ž y . TaL . for x g w 0,2 p x ,
k
1 2p
a Ff s Ta H0 dyTr Ž f Ž y . g L Ž y . TaL . . Ž 3.57 .
k
By inspecting the condition that Y Ff Ž J Ž x .. in Ž3.29. must be periodic, we find that
Ff g H for those f that satisfy
f Ž k . Ž 0 . s f Ž k . Ž 2p . s 0 , k s 0,1, . . . Ž 3.58 .
Ff
Assuming that this holds, the Hamiltonian vector field Y is obtained from Ž3.32. as
1 2p
gy1 Ž x . Y Ff Ž g Ž x . . s Ta Hx dy Tr Ž TaLf Ž y . g L Ž y . . y 12 a Ff q r Ž M . Ž a Ff . ,
k
x g w 0,2 p x . Ž 3.59 .
This permits the following interpretation. Let us define the ‘Poisson bracket’ of the
evaluation functions by the equality
2p 2p m
Y Ff Ž Fx . :s Fx , Ff 4 :s H0 H0 dx dy Tr 12 x Ž x . m f Ž y . g L Ž x . , g L Ž y .
ž ½ 5/,
Ž 3.60 .
where Tr 12 means the normalized trace over V m V and
L m
½g Ž x. , g LŽ y. 5 i k , jl
s g iLj Ž x . , g kLl Ž y . 4 . Ž 3.61 .
With these definitions, formula Ž3.59. of the Hamiltonian vector field is equivalent to
m 1 L
½g L
Ž x . , g L Ž y . s Ž g L Ž x . m g L Ž y . . Ž rˆ Ž M . q 12 Iˆsign Ž y y x . . ,
5 k
0 - x , y - 2p . Ž 3.62 .
Ff Ž
Indeed, upon integration the right-hand side of Ž3.60. equals Y Fx . for any functions
f and x subject to Ž3.58.. This equation has the form of a quadratic exchange algebra
type PB for the field g Ž x .. Such a classical exchange algebra is usually regarded as a
classical analogue of a quantum group symmetry in the chiral WZNW model, but
observe that in general our r-matrix is monodromy dependent.
The admissible Hamiltonians of type Fm , C and Ff that we studied in the above
should together generate a closed Poisson algebra. Although at present we cannot fully
characterize the set of elements that belong to this algebra, we wish to point out that the
Jacobi identity for three functions of type Ff , in any Poisson algebra that contains them,
is equivalent to the following equation for rˆŽ M .:
rˆ12 Ž M . ,rˆ23 Ž M . q Qa b Ž M . T1a R b rˆ23 Ž M . q cycl. perm.s y 14 fˆ. Ž 3.63 .
Here
fˆs fabg T a m T b m T g Ž 3.64 .
518 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
and the cyclic permutation is over the three tensorial factors with rˆ23 s r ab Ž1 m Ta m Tb .,
T1a s T a m 1 m 1 and so on. Furthermore, we use the components of Qˆ s Qa b T a m T b
given by Ž3.52., for which
2p X
V Bloch Ž b . s y 12 H0 dx Tr Ž by1 db . n Ž by1 db . y 12 Tr Ž Ž by1 db . Ž 0 . n d v . .
Ž 3.67 .
It is known w8–10x Žfor a proof in the spirit of the present paper, see Ref. w24x. that the
PBs associated with Ž3.67. are encoded by the following classical exchange algebra:
m 1
½ b Ž x . , b Ž y . 5 s k Ž b Ž x . m b Ž y . . Ž Rˆ Ž v . q 1
2 Iˆsign Ž y y x . . ,
0 - x , y - 2p , Ž 3.68 .
1
Rˆ Ž v . s Ý < a < 2 coth Ž 12 a Ž v . . Ea m Ey a . Ž 3.69 .
4 a
The domain A is chosen so that a Ž v . f i2p Z for any root a and the root vectors Ea
satisfy the normalization Tr Ž Ea Ey a . s 2 < a < 2. As was first pointed out in w8x, the
Jacobi identity of the PB in this case gives rise to the equation
E
Rˆ12 Ž v . , Rˆ23 Ž v . q Ý H1k Rˆ23 Ž v . q cycl. perm.s y 14 fˆ, Ž 3.70 .
k Ev k
4
In this context G is either a complex simple Lie algebra or its normal or compact real form.
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 519
r
We have seen that any symplectic structure kV chir on Mˇchir gives rise to a PB of the
form Ž3.62. governed by an ‘exchange r-matrix’ rˆŽ M ., which is a solution of Eq.
Ž3.63.. Those cases for which the exchange r-matrix is M-independent have already
been discussed by Falceto–Gawedzki w14x and others. The main point w14x here is that
one can construct an appropriate 2-form r out of any constant, antisymmetric solution r̂
of the modified classical YB equation,
rˆ12 ,rˆ23 q rˆ13 ,rˆ23 q rˆ12 ,rˆ13 s y 14 fˆ, Ž 3.71 .
r
and then the same rˆ appears in the exchange algebra determined by kV chir . We below
present the construction of w14x, showing that in our formalism it is easy to give a
complete proof as well.
All antisymmetric solutions of Ž3.71. are known. In fact, Belavin and Drinfeld w34x
classified the solutions in the case of a complex simple Lie algebra and their solutions
belong also to the normal real form. For other real forms very few solutions survive Žsee
Theorem 3.3 in w35x.. There is no solution for the compact real form, because of the
negative sign of the coefficient on the right-hand side. To explain the mechanism w14x
whereby constant exchange r-matrices appear in the chiral WZNW model, we first need
to recall a few standard results on Eq. Ž3.71., which can be found in the reviews Žsee,
e.g., Refs. w14,36x..
In association with a solution of Ž3.71., rˆ s r ab Ta m Tb g G n G , one has the
constant linear operators r and r "s r " 12 I. It follows from Ž3.71. that the formula
w A, B x r s r Ž A . , B q A,r Ž B . , A, B g G , Ž 3.72 .
™
defines a new Lie bracket on the linear space G ; the Lie algebra Ž G ,w , x r . is denoted as
Gr . Then r ": Gr G are Lie algebra homomorphisms,
r " Ž A . ,r " Ž B . s r " Ž w A, B x r . . Ž 3.73 .
Any A g G can be decomposed as
A s Aqy Ay with A " :s r " Ž A . . Ž 3.74 .
As a consequence of Ž3.73., one has the following equality of linear operators on G :
exp Ž ad A " . ( r "s r " (exp Ž ad r A . ;A g G . Ž 3.75 .
Here ad A and ad r A are defined by Žad A.Ž B . s w A, B x and Žad r A.Ž B . s w A, B x r for
any A, B g G . Note also that there exists a neighbourhood of the unit element in G, now
denoted as Gˇ ; G, whose elements, M g G, ˇ admit a unique decomposition in the form
y1
M s Mq My with M "s e G " , Ž 3.76 .
where G varies in a neighbourhood of zero in G . On Gˇ one has
My1 dM s My Ž Mq
y1 y1
dMqy My y1
dMy . My . Ž 3.77 .
ˇ By means of Ž3.73., Ž3.77. leads to
Let X be an arbitrary vector field on G.
My1 y1 y1
" dM " Ž X . s Ž My M dM Ž X . My . " . Ž 3.78 .
520 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
¨
algebra PB on Mchir and the Sklyanin bracket Ž3.85. on G. In the present case, the
meaning of Eqs. Ž3.51.-Ž3.54. is that Mchir 2 g M s gy1 Ž x . g Ž x q 2p . provides the
‘non-abelian’ momentum map w14,36x for this PL action. ŽOf course, an equivalent right
PL action is obtained by replacing hy1 with h and using the opposite of the PB on G..
The above mechanism cannot be used to define a PL symmetry on Mchir in the case
of a compact Lie group G, since Ž3.71. has no solution for a compact G . This is
somewhat puzzling since as a quantum field theory the WZNW model is usually
considered in the compact domain, where various manifestations of quantum group
symmetries were found in the literature w16x. Later we shall see that PL symmetries can
be defined on the chiral WZNW phase space by certain mechanisms different from the
one described above, and they work in the compact case too.
We below introduce local coordinates on the chiral WZNW phase space consisting of
a periodic G-valued field and the logarithm, G , of the monodromy matrix. This will
r
lead us to realize the existence of a special choice of r such that with respect to kV chir
G generates an infinitesimal symplectic action of G on Mˇchir , i.e. a classical G-symme-
try. The parametrization will also shed a new light on the notion of admissible
Hamiltonians.
If the monodromy matrix M is near to e g G, then the chiral WZNW field can be
uniquely parametrized as
g Ž x . s hŽ x . e x G , Ž 3.87 .
¨
where hŽ x . is a G-valued, smooth, 2p-periodic function and G varies in a neighbour-
hood of zero in G , Gˇ ; G , for which the map Gˇ 2 G M s e 2 p G g Gˇ ; G is a
diffeomorphism. We may identify a domain in Mchir with the corresponding space of
parameters,
Mˇchir s G˜ = Gˇ s Ž h, G . 4 . Ž 3.88 .
An easy computation gives the following formula for V chir Ž2.17. in this parametriza-
tion:
0
V chir Ž h, G . s V chir Ž h, G . y r 0 Ž G . , Ž 3.89 .
where
2p X 2p
0
V chir Ž h, G . s y 12 H0 dx Tr Ž hy1 dh n Ž hy1 dh . . q d
H0 dx Tr Ž G hy1 dh . ,
Ž 3.90 .
2p
r 0 Ž G . s y 12 H0 dx Tr Ž d G n de x G eyx G . . Ž 3.91 .
522 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
g Ž x . , G˜a 4 s yg Ž x . Ta , ½ G˜ , G˜ 5 s f
a b
g
ab G˜g for G˜a :s y2pkGa . Ž 3.96 .
0
This means that in the case of the symplectic form kV chir the logarithm of M generates
a classical G-symmetry on Mˇchir . Indeed, the momentum map corresponding to this
symmetry is just G˜ . A classical G-symmetry is sometimes called ‘abelian’ to contrast it
with a proper Ž‘non-abelian’. PL symmetry, for which the symmetry group itself is
endowed with a non-zero PB w36x.
The above construction is valid for any simple Lie group. Perhaps even more
surprisingly than the possibility to define a classical G-symmetry on Mˇchir for any G , in
Section 5 it will turn out that the symplectic structure on Mˇchir can be chosen so as to be
compatible with any prescribed PL structure on G.
Now we explain how the characterization of the admissible Hamiltonians found in
Subsection 3.2 appears in the coordinates Ž h, G .. For this, we first remark that on
account of Ž3.88. it is natural to represent a vector field X on Mˇchir as
X s Ž X h , XG . , ˜
X h g ThG, XG g G Ž 3.97 .
y1 ˜
with h X h g G . By regarding h and G as evaluation functions, we may write
X h s X Ž h. and XG s X Ž G .. Of course, the derivative X Ž F . of function F on Mˇchir with
5
The expressions in Eqs. Ž3.94., Ž3.95. are defined by the power series expansions of the corresponding
2 2 kB
complex analytic functions around zero. For instance w39x, 2 r 0 sÝ`ks1 Ž 2 k .2!k Ž 21 Y . 2 ky1.
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 523
2p X
s H0 dx Tr Ž Ž X Ž h . hy1 . Ž yk Ž A F . y A F , k g X gy1 ..
2p 2p
ž
q k Tr X Ž G . H0 dx Ž gy1A F g . q k/ H0 dx Tr Ž eyx G X Ž e x G . , G
2p
ž
s k Tr X Ž G . H0 dx Ž hy1A F h . Ž x . . / Ž 3.103 .
We conclude that a function F for which X Ž F . has the form of the first line in Ž3.100.
is periodically differentiable if and only if Ž3.101. is satisfied and the right-hand side of
Ž3.102. defines a smooth, 2p-periodic function. In particular, all admissible Hamiltoni-
ans are periodically differentiable. Conversely, every periodically differentiable function
is admissible in the sense of Subsection 3.2 since it is possible to uniquely determine
Ž A F,a F . in terms of Ž d h F,dG F . in such a way Ž3.98. is converted into the first line of
Ž3.100.. To achieve this, if Ž d h F,dG F . are given, one has to solve the differential
equation Ž3.102. together with the condition in Ž3.104. for A F Ž x ., and then one may
define a F by Ž3.101.. It is not difficult to show that, since G is restricted to Gˇ , Ž3.104.
has a unique solution for the initial value A F Ž0. of A F Ž x ., which completes the proof.
In this section we rederive the PBs of the chirally extended WZNW model using
purely ‘kinematical’ considerations. Instead of explicitly inverting the symplectic form,
we postulate the natural properties of the chiral extension Žwhich we have established in
the symplectic formalism. and this way we can reproduce the quadratic exchange
algebras Ž3.62. and Ž3.68. almost effortlessly. This is especially so in the case of Bloch
waves, where the dynamical r-matrix Ž3.69. is determined algebraically. The subsequent
considerations are complementary to the symplectic approach presented in Section 3
and, for Bloch waves, in w24x. The kinematical derivation sheds a new light on the origin
of the chiral exchange algebras, and some issues are also easier to discuss in this
approach.
We have seen in Section 2 that it is very natural to extend the WZNW phase space as
M f M sol ™M ext
s MˇL = MˇR , Ž 4.1 .
ˇ ˇ
where ML and MR are two identical copies of the chiral phase space characterized by
the smooth, quasiperiodic chiral fields g C Ž x q 2p . s g C Ž x . MC , C s L, R. Since the
separation of the chiral degrees of freedom is an essential feature of the WZNW model,
we shall assume that MˇL and MˇR are independent and they are equipped with the same
symplectic structure Žup to an overall sign difference, see Eq. Ž2.16... The corresponding
Poisson algebra will be supposed to contain the important Hamiltonians Fm , C and Ff
studied in Subsection 3.3. The further main assumptions of the kinematical approach are
that the constraints ML s MR are first class and the corresponding gauge transforma-
tions 6 operate according to Ž2.10. so that the WZNW solution g Ž s ,t . s g LŽ x L . gy1
R
Ž xR.
is gauge invariant. These assumptions, together with simple properties of the original
WZNW phase space M sol , allow us to reproduce the PBs Ž3.62. and Ž3.68..
From now on we mainly concentrate on the chiral half of the problem and, for
notational simplicity, omit the subscript C, wherever it is possible.
6
In the rest of the paper, when considering gauge transformations or G-symmetries, we shall always
assume that the domain of allowed monodromy matrices, G;G, ˇ consists of full conjugacy classes in G. If this
did not hold, everything would still be true for infinitesimal gauge transformations or G-symmetries.
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 525
We start by noting that because the Fourier components of the left KM current belong
to the space of admissible Hamiltonians, acting on the left chiral field they must
generate the transformation
2p
JTn , g Ž x . 4 s yT eyi n x g Ž x . , where JTn :s H0 dx eyi n x Tr Ž TJ Ž x . . , Ž 4.2 .
which means that the chiral field is a KM primary field. This crucial relation can be
obtained by first noting that on the submanifold of M ext defined by imposing the
constraint ML s MR Ž2.21. holds for the product Ž2.8. that gives the WZNW solution.
The gauge invariance of the solution field and the fact that the different chiral pieces
completely Poisson commute then allow us to derive Ž4.2.. Of course, an analogous
relation is valid for the right-moving fields.
An other quantity which, by assumption, belongs to the space of admissible Hamilto-
nians is the monodromy matrix M. From Ž4.2. it follows that
JTn , M 4 s 0 , Ž 4.3 .
which is obvious because M is invariant under the KM transformations. For later use we
note that a quantity which Poisson commutes with the Fourier components of the KM
current must be a function of the monodromy matrix M. ŽThis is most easily seen by
using the parametrization Ž3.87...
We wish to determine the PBs of the ‘smeared out’ field7
2p
Ff s H0 dx Tr Ž f Ž x . g Ž x . . , Ž 4.4 .
where the matrix valued test function f Ž x . satisfies Ž3.58.. Note that while in the
symplectic approach of Section 3 the fact that Ff is an admissible Hamiltonian follows
from the properties of the symplectic form, here it is an additional assumption. In order
to compute the PBs with Ff it is enough to find
Bf Ž x . :s Ff , g Ž x . 4 Ž 4.5 .
Ff Ž
for 0 ( x ( 2p Ž Bf Ž x . corresponds to yY g Ž x .. in Subsection 3.3.. To constrain
Bf Ž x . we apply Ž4.2. to Ff and then using the fact that Ff is an admissible Hamiltonian
we obtain the local form
Ff , J Ž x . 4 s Tr Ž f Ž x . T a g Ž x . . Ta . Ž 4.6 .
Comparing Ž4.5. and Ž4.6. leads to the differential equation
k BfX Ž x . y J Ž x . Bf Ž x . s Tr Ž f Ž x . T a g Ž x . . Ta g Ž x . , Ž 4.7 .
whose solution is
1 2p
Bf Ž x . s H0 dy sign Ž x y y . Tr Ž f Ž y . g Ž y . T a . g Ž x . Ta q g Ž x . Uf , Ž 4.8 .
2k
where
Uf s Ufa Ta Ž 4.9 .
is a constant element of the Lie algebra. If we consider Ff̃ , an other Hamiltonian of type
7
Here some representation L of G is used like in Ž3.56., but henceforth L is omitted from all notations.
526 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
and f˜a is defined analogously. Eq. Ž4.10. implies that Ufa is a linear combination of the
integrals fa of the form
1
Ufa s y r a bfb , Ž 4.12 .
k
where r ab s yr ba . By means of Ž4.12., Ž4.8. becomes equivalent to the classical
exchange algebra
m 1
½ g Ž x . , g Ž y . 5 s k g Ž x . m g Ž y . Ž rˆq 1
2 Iˆsign Ž y y x . . , 0 - x , y - 2p ,
Ž 4.13 .
where the r-matrix rˆ s r ab Ta m Tb is an x-independent constant. Of course, as before,
Ž4.13. has to be interpreted in the distributional sense.
Although x-independent, the r-matrix can still depend on the phase space. This latter
dependence can be restricted by Poisson commuting JTn with Ž4.13. and applying the
Jacobi identity. In this way we get
JTn ,r ab 4 s 0 , Ž 4.14 .
which implies, as explained earlier, that rˆ must be a function of M alone.
Next we consider the PBs of the monodromy matrix. Using Ž4.5., Ž4.8. and Ž4.12. we
get
Ff , M 4 s gy1 Ž x . Ž Bf Ž 2p . y Bf Ž 0. M .
1
s fa Ž 12 Ž MT a q T a M . q r a b Ž MTb y Tb M . . Ž 4.15 .
k
and because M is an admissible Hamiltonian this implies the local form
m 1
½ g Ž x . , M 5 s k g Ž x . m M Qˆ Ž M . , Ž 4.16 .
where Qˆ is given by Ž3.52.. We obtain in a similar way that
m 1
½ M , M 5 s k M m M Dˆ Ž M . Ž 4.17 .
with Dˆ defined in Ž3.54..
To ascertain that our construction is self-consistent, we now show that
C s ML y MR f 0 Ž 4.18 .
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 527
is a first class constraint on M ext and the WZNW solution is gauge invariant. In fact, C
is first class since
m m m
½C , C5 s ½ M L 5 ½
, ML q MR , MR 5
1 1
s Ž ML m ML . Dˆ Ž ML . y Ž MR m MR . Dˆ Ž MR . f 0 .
k k
Similarly, the gauge invariance of g Ž s ,t . in Ž2.8. can be shown as follows:
m
k g Ž s ,t . , C s Ž g LŽ x L . m ML . Qˆ Ž ML . Ž gy1
½ 5 R Ž x R . m 1.
y Ž g LŽ x L . m MR . Qˆ Ž MR . Ž gy1
R Ž x R . m 1.
f Ž g LŽ x L . m M . Ž Qˆ Ž ML . y Qˆ Ž MR . . Ž gy1
R Ž x R . m 1. f 0 .
Here the notation f indicates ‘weak’ equality, i.e. equality on the constrained
manifold, and we used the assumption that MˇL and MˇR are independent and carry
opposite PBs.
For later use we mention an additional consistency check. Consider the path ordered
integral E s EŽ2p . defined in Ž3.42.. Since E and M are related by conjugation, their
invariants coincide:
´ N s Tr Ž E N . s Tr Ž M N . s m N , Ž 4.19 .
and since the PBs of the Hamiltonians ´ N can be calculated unambiguously using the
KM algebra only, the following relation must hold:
N
g Ž x . ,m N 4 s g Ž x . , ´ N 4 s g Ž x . T a Tr Ž M N Ta . . Ž 4.20 .
k
It is easy to verify by using Ž3.52. in Ž4.16. that Ž4.20. is indeed satisfied.
To summarize, by postulating the ML s MR constraint to be first class, as well as the
admissible Hamiltonian nature of the Fourier components of the current, the smeared out
chiral field and the monodromy matrix, we have established that the possible extensions
of the WZNW phase space correspond to the quadratic exchange algebra Ž4.13. with
some monodromy dependent exchange r-matrix. Of course the classical exchange
algebra can only provide a valid PB if it satisfies the Jacobi identity. This is guaranteed,
by construction, if the r-matrix is obtained as a solution of Ž3.36.. In the present
approach, we have to impose the Jacobi condition and, as mentioned in Subsection 3.3,
this leads to the dynamical YB equation Ž3.63.. The chiral extensions of the WZNW
model are thus characterized by the solutions of this equation.
Most known solutions of Ž3.63. are local in that they are defined only on a proper,
open submanifold Gˇ ; G. This is obviously the case for the solutions obtained by
solving Ž3.36. for the r-matrix, starting from a q-matrix representing by Ž3.21. a local
solution of Ž2.28.. On the other hand, as the example of constant r-matrices shows, there
are global solutions as well. Since constant solutions exist for non-compact groups only,
an interesting open question is whether there exist globally defined exchange r-matrices
also for compact groups. We hope to return to this question in a future study.
528 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
™
of the chiral extension. It is clear that the gauge transformed chiral fields given by
gŽ x. Ž ¶g . Ž x . s g Ž x . p Ž M . Ž 4.21 .
with an arbitrary monodromy dependent group element p M correspond to the same
Ž .
point in the physical phase space after the projection Ž2.11., provided we apply the same
gauge transformation to both chiral fields. In other words, both the original fields, g Ž x .,
and the gauge transformed fields, g˜ Ž x .:s g Ž x . pŽ M ., are smooth quasiperiodic G-val-
ued ‘coordinates’ of the same point in the physical phase space.
The multiplication law for two elements, ¶1 ,¶2 , of this huge gauge group is given by
p 12 Ž M . s p 2 Ž M . p 1 Ž ¶2 M . , Ž 4.22 .
where p 1 , p 2 and p 12 correspond to ¶1 ,¶2 and ¶12 :s ¶1¶2 :s ¶1 (¶2 , respectively. Here
¶M :s py1 Ž M . Mp Ž M . Ž 4.23 .
is how the monodromy matrix itself is transformed under a gauge transformation. To
qualify as an element of the gauge group we must also require that the inverse
transformation exists. This is equivalent to requiring that the group valued function,
pŽ M ., corresponding to the inverse element, ¶y1 , exists and solves
p Ž M . p Ž ¶M . s e . Ž 4.24 .
In terms of the new ‘coordinates’ g˜ Ž x . defined by Ž4.21. the exchange algebra has
the same form as Ž4.13. with a gauge transformed exchange r-matrix, r. ˜ˆ On account of
the Leibniz rule of the PB, one finds that
r̂˜ s py1 Ž M . m py1 Ž M . Ž rˆ Ž M . q Q a b Ta m Ab y Ab m Ta
qD ab Aa m Ab . p Ž M . m p Ž M . , Ž 4.25 .
where Aa :s Ž Ra p . p and r˜ˆ should be expressed as a function of the gauge
y1
transformed monodromy M:s ˜ ¶M. Since the Jacobi identity of the exchange algebra
Ž4.13. is independent of the coordinates used, it is clear that the solutions of the
dynamical YB equation Ž3.63. are transformed into each other by the elements of the
gauge group and therefore can be classified up to gauge transformations.
Below we briefly outline a kinematical derivation of the PBs on the space of chiral
WZNW Bloch waves, M Bloch defined in Ž3.66.. In order to emphasize their diagonality,
we denote the monodromy matrices of the Bloch waves, bŽ x ., here by D,
k
D s e v Hk , Ž 4.26 .
where Hk are the basis elements of a splitting Cartan subalgebra of G . We will also use
the derivatives E k s E Ev k.
The assumptions and the main steps of the construction Žwith obvious modifications
for the diagonal case. are the same as discussed above for the general case. Now we
obtain a classical exchange algebra of the form
m 1
½b Ž x . , b Ž y . s b Ž x . m b Ž y . Ž rˆ Ž v . q 12 Iˆsign Ž y y x . . , 0 - x , y - 2p .
5 k
Ž 4.27 .
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 529
The main difference comes from requiring Ž4.20., because, unlike in the general case
where it was a consequence of the exchange algebra, here it completely determines the
PBs of the monodromy matrix:
m 1 k
½ bŽ x . , D5 s k Ž bŽ x . m D. Ž H m H k .. Ž 4.28 .
On the other hand, the analogue of Ž4.16. implied by Ž4.27. now reads explicitly as
m 1
½ b Ž x . , D 5 s k Ž b Ž x . m D . ž rˆ Ž v . q 1
2 Iˆy Ž 1 m Dy1 . Ž rˆ Ž v . y 12 Iˆ. Ž 1 m D . .
/
Ž 4.29 .
The comparison of the last two equations fixes the exchange r-matrix almost com-
pletely:
rˆ Ž v . s Rˆ Ž v . q Xˆ Ž v . , Ž 4.30 .
where RˆŽ v . is given by Ž3.69. and XˆŽ v . is an antisymmetric purely Cartan piece,
Xˆ Ž v . s X k l Ž v . Hk m Hl , X k l Ž v . s yX l k Ž v . . Ž 4.31 .
Thus we have determined the exchange r-matrix algebraically up to the Cartan piece.
The Jacobi identity takes the following form for the diagonal case:
We studied in Subsection 3.4 the Poisson–Lie action of the group G on the chiral
ˆ of the exchange
phase space for the special case of a constant r-matrix playing the role
r-matrix. As mentioned there, this PL action is restricted to the case of complex or real,
non-compact groups, since there is no constant solution of Ž3.63. for compact groups. In
this section we consider a set of more general PL actions which also work for the
physically most interesting case of compact groups.
It is clear from the examples studied so far that the PL action we are looking for is a
kind of ‘hidden’ symmetry, extending and centralizing the Kac–Moody symmetries in
the total symmetry group of the chiral WZNW model. More precisely, we require that
It is not difficult to see that because of the above two requirements the PL action as a
non-linear group action on the chiral phase space has to be a special case of the gauge
™ ¶ g Ž x . s g Ž x . p Ž M ,h.
transformations discussed in Subsection 4.1:
gŽ x. h ;h g G. Ž 5.1 .
The G-valued function pŽ M,h. is chosen so that the group multiplication law ¶h¶k s ¶h k
is satisfied:
p Ž M ,k . p Ž ¶k M ,h . s p Ž M ,hk . , Ž 5.2 .
y1 Ž
where the induced action on the monodromy matrix is ¶k M s p M,k . MpŽ M,k .. One
must also require that pŽ M,e . s e.
The simplest case is the ‘standard’ Žleft. action
§ h g Ž x . s g Ž x . hy1 Ž 5.3 .
corresponding to pŽ M,h. s hy1 . A family of ‘trivial’ actions is obtained by conjugating
the standard action in the gauge group by an arbitrary element ¶ Žsee Ž4.21..:
§˜ h s ¶y1 § h¶. Ž 5.4 .
In terms of the corresponding G-valued functions we have
s˜Ž M ,h . s p Ž M . hy1 p Ž h P ¶M P hy1 . . Ž 5.5 .
In practice it is useful to consider the infinitesimal version of Ž5.1.. For the
a
parametrization h s e u Ta we have
¶h g Ž x . s g Ž x . y u a Xa g Ž x . q O Ž u 2 . , Ž 5.6 .
where the infinitesimal generators are of the form
Xa g Ž x . s yz b a Ž M . g Ž x . Tb Ž 5.7 .
b
with some monodromy dependent coefficients z a Ž M ., and satisfy the commutation
relations w Xa , Xb x s fa b g Xg . This latter equation can be expressed as a requirement on
the coefficients z b a as
z l a Dlyz v b y z l b Dlyz v a q fk l vz k a z l b q fa b lz v l s 0 . Ž 5.8 .
b b
Clearly the simplest solution of Ž5.8. is the standard one, z a s yd a.
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 531
The infinitesimal generators are also useful in studying the question of trivialization.
The infinitesimal form of Ž5.5. is
Xa Ž F1 . , F2 4 q F1 , Xa Ž F2 . 4 y Xa Ž F1 , F2 4 . s yf˜bg a Xb Ž F1 . Xg Ž F2 . ,
Ž 5.10 .
where the pair of structure constants Ž fab g , f˜bg a . together define a Lie bi-algebra. Now
applying this to F1 s g Ž x ., F2 s g Ž y . and parametrizing the exchange r-matrix as
r ab s k klz a k z b l Ž 5.11 .
we find that the infinitesimal action Ž5.7. will be PL if
for which the structure constants of the induced dual Lie algebra are
1
f˜bg a s Ž R sb fsa g q R gs fsa b . , Ž 5.14 .
k
where Rˆ s R ab Ta m Tb g G n G is some constant, antisymmetric r-matrix. The r-matrix
that occurs here is an arbitrary solution of the Žmodified. classical YB equation,
k ab Ž M . s K ab Ž M . q R ab . Ž 5.16 .
532 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
Ž 5.17 .
This depends on how the infinitesimal generator Xa is parametrized in terms of z b a ,
but the explicit reference to the dual structure constants Ž5.14. has disappeared.
From now on we will concentrate on the simplest case corresponding to the standard
action z b a s yd b a . In this case
r ab Ž M . s K ab Ž M . q R ab Ž 5.18 .
and Ž5.17. reduces to
Day K s w K ,T
Ta x , where Ta :s ad Ta , Ž 5.19 .
l l
that is, Ž Ta .s s fsa . Eq. Ž5.19. is the infinitesimal form of
Kˆ Ž hMhy1 . s Ž h m h . Kˆ Ž M . Ž hy1 m hy1 . , Ž 5.20 .
which expresses the equivariance of Kˆ Ž M . s K ab Ž
M .Ta m Tb under the action Ž5.3..
One may also verify directly that the standard left action of G equipped with the PB
Ž5.13. is PL for the exchange r-matrix Ž5.18. if Kˆ Ž M . is equivariant.
So far we have established that the action Ž5.3. of G on Mˇchir is PL if r Ž M . is the
sum of a constant r-matrix R and an equivariant piece K Ž M .. Of course, the exchange
r-matrix Ž5.18. has to be a solution of the dynamical YB equation Ž3.63.. Using Ž5.19.,
Ž3.63. can be rewritten as
bg
yŽ K T aK . q 12 Dq a K bg q cycl. perm.s Ž n 2 y 14 . f a bg Ž 5.21 .
in this special case, where the cyclic permutations are over the upper indices a , b ,g .
The search for solutions of Ž5.21. is made feasible by the observation that any analytic
function of Y Ždefined in Ž3.94.. is equivariant. We show in Appendix B that a solution
of Ž5.21. is given by the analytic function
K s 12 coth Ž Yr2 . y n coth Ž n Y . . Ž 5.22 .
This formula is valid on a domain Gˇ around e g G where the exponential parametriza-
tion is applicable and the power series of the above expression converges. The
derivation of the exchange r-matrix given by Ž5.18., Ž5.22., which is compatible with
the standard action of the PL group G equipped with the PB Ž5.13., is one of our main
results.
We conclude this section with a couple of remarks related to the above r-matrices.
We first note that for n s 0 Ž5.22. is understood as the appropriate limit and therefore
for R s 0 we recover from Ž5.18. the exchange r-matrix r 0 Ž3.95. together with the
classical G-symmetry discussed in Subsection 3.5. We can also have n s 0 in corre-
spondence with any antisymmetric solution R / 0 of the classical YB equation. For
n s 1r2 we get K s 0 and the dynamical r-matrix Ž5.18. then reduces to one of the
constant r-matrices treated in Subsection 3.4. For compact groups all solutions Ž5.22.
are strictly dynamical Žnon-constant., since in this case yn 2 0 0. Finally, we remark
that the existence of a suitable local 2-form r corresponding to the r-matrix Ž5.18.,
Ž5.22. is guaranteed by the solvability of Ž3.36..
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 533
We have seen that the classical G-symmetries discussed in Subsection 3.5 and the
special PL symmetries treated in Subsection 3.4 are generated by momentum maps.
Without going into details, we wish to mention that it is possible to show the existence
of a non-abelian momentum map also in the general case of the above PL symmetries.
The momentum map is given by a function mŽ M . on Mˇchir Ždepending on the
monodromy matrix only., which takes its values in the dual PL group G ) in such a way
that for all phase space functions F
Xa F s Ž my1 m, F 4 . a , Ž 5.23 .
where the Ž .a component on the right-hand side is evaluated in the dual Lie-algebra,
G ) . Moreover, the G )-valued momentum map satisfies the PBs
m
½ m , m5 s h )
Ž m. m m m , Ž 5.24 .
Let now suppose that a compact simple Lie algebra G acts on a phase space as a
classical symmetry generated by a G ) -valued, equivariant Ž‘abelian’. momentum map.
In this situation one can always define also an infinitesimal PL action of the group G
equipped with the so-called standard PL structure. This is a consequence of the fact w40x
that there exists a diffeomorphism between G ) and G ) that converts the natural linear
Poisson structure on G ) into the standard PL structure on G ) . Applying this to the
classical G-symmetry studied in Subsection 3.5, we can thus find a map,
G ) 2 G˜¨ mŽ G˜ . g G , )
Ž 5.25 .
where G˜ : Mˇ ™ G is given by Ž3.96., such that mŽ G˜ . satisfies Ž5.24. with respect to
chir
)
)
the standard PL structure. The resulting G -valued momentum map then generates a PL
action on the phase space Mˇchir as outlined above.
This somewhat surprising construction is not specific to the chiral WZNW phase
space, since it can be used whenever one has a classical G-symmetry based on a
compact simple Lie algebra. When applying it to the chiral WZNW phase space, the Lie
algebra action Ž5.23. constructed using the momentum map Ž5.25. will be different from
the standard one Ž5.3.. It is an interesting question whether this Lie algebra action can be
gauge transformed to the standard form and, if the answer is positive, to find the
corresponding gauge transform of the r-matrix r 0 Ž3.95.. We wish to discuss this in a
future publication.
534 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
The CDYB equation Ž4.32. can be regarded as the guarantee of the Jacobi identity in
a PL groupoid w25x. Below we show that Eq. Ž3.63. admits an analogous interpretation.
For this, we introduce a family of PL groupoids in such a way that a subfamily of these
is naturally associated with the possible PBs on the chiral WZNW phase space.
Remarkably, these groupoids are finite-dimensional Poisson manifolds that encode
practically all information about the infinite-dimensional chiral WZNW PBs.
Roughly speaking, a groupoid is a set, say P, endowed with a ‘partial multiplication’
that behaves similarly to a group multiplication in the cases when it can be performed.
To understand the following construction one does not need to know details of the
notion of a groupoid Žsee, e.g., Ref. w41x., since we shall only use the most trivial
example of such a structure, for which
PsS=G=Ss Ž M F, g , M I .4 , Ž 6.1 .
where G is a group and S is some set. The partial multiplication is defined for those
triples Ž M F, g, M I . and Ž M F, g, M I . for which M I s M F, and the product is
Ž M F , g , M I . Ž M F , g , M I . :s Ž M F , gg , M I . for M I s M F . Ž 6.2 .
In other words, the graph of the partial multiplication is the subset of
P=P=Ps Ž M F, g , M I .4 = ½Ž M F
,g,MI . =5 ½ ž Mˆ F
, gˆ , Mˆ I /5 Ž 6.3 .
defined by the constraints
M I sM F, Mˆ F s M F , Mˆ I s M I , gˆ s gg , Ž 6.4 .
where the hatted triple encodes the components of the product. A PL groupoid w42x P is
a groupoid and a Poisson manifold in such a way that the graph of the partial
multiplication is a coisotropic submanifold of P = P = Py, where Py denotes the
manifold P endowed with the opposite of the PB on P. In other words, this means that
the constraints that define the graph are first class. This definition reduces to that of a
PL group in the particular case for which the set S in Ž6.1. consists of a single point.
In the interpretation of Ž3.70. given in w25x the groupoid P is of the form above with
S taken to be a domain in the dual of a Cartan subalgebra of a simple Lie group G. By
thinking about a generic monodromy matrix, we now take P to be
P s Gˇ = G = G,
ˇ Ž 6.5 .
where Gˇ is some open domain in G. On this P, we postulate a PB , 4P defined, by
using the usual tensorial notation, as follows:
k g 1 , g 2 4 P s g 1 g 2 rˆ Ž M I . y rˆ Ž M F . g 1 g 2 ,
k g 1 , M2I 4 P s g 1 M2I Qˆ Ž M I . ,
k g 1 , M2F 4 P s M2FQˆ Ž M F . g 1 ,
k M1I , M2I 4 P s M1I M2I Dˆ Ž M I . ,
k M1F , M2F 4 P s yM1F M2FDˆ Ž M F . ,
k M1I , M2F 4 P s 0. Ž 6.6 .
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 535
Dˆ Ž M . s D ab Ž M . Ta m Tb . Ž 6.7 .
It is quite easy to verify that a PB given by the ansatz Ž6.6. always yields a PL groupoid,
since the constraints in Ž6.4. will be first class for any choice of the structure functions.
Of course, the structure functions must satisfy a system of equations in order for the
above ansatz to define a PB. The antisymmetry of the PB is ensured by
rˆ s yrˆ21 Ž rˆ21 :s r ab Tb m Ta . , Dˆ s yDˆ21 , Ž 6.8 .
while the Jacobi identity is, in fact, equivalent to the following system of equations:
rˆ12 ,rˆ13 q Qa b T1a R b rˆ23 q cycl. perm.s m fˆ, m s constant, Ž 6.9 .
Dˆ12 , Dˆ13 q Da b T1a R bDˆ23 q cycl. perm.s 0, Ž 6.10 .
rˆ12 ,Qˆ 13 q Qˆ 23 q Qˆ 13 ,Qˆ 23 q Da b T3a R b rˆ12
It can be checked that the Jacobi identities of the PL groupoid Ž6.9. – Ž6.12. are
ˆ Qˆ , Dˆ that arises in the WZNW model. This actually follows
satisfied for any triple r,
without any computation since, indeed, the Jacobi identities of the chiral WZNW PBs in
Ž3.62., Ž4.16., Ž4.17. lead to the same equations, with m s y 14 , and they are satisfied
r
since they follow from the symplectic form kV chir .
Among the ‘chiral WZNW PL groupoids’ described above there are those special
cases for which Kˆ s rˆ y Rˆ satisfies the equivariance condition Ž5.20. in relation with
some constant r-matrix Rˆ subject to Ž5.15.. In these cases, we equip the group G s h4
with the Sklyanin bracket opposite to that in Ž5.13.,
m 1
½ h , h s w h m h, Rˆ x ,
5 Ž 6.15 .
k
¨ Ž M , gh,h M h. g P
and consider its commuting right and left actions on P given respectively by the maps
P = G 2 Ž Ž M F , g , M I . ,h . F y1 I
Ž 6.16 .
G = P 2 Ž h, Ž M , g , M . . ¨ Ž hM h ,hg , M . g P .
and
F I F y1 I
Ž 6.17 .
Then it is not difficult to verify that these are both Poisson maps, i.e. they define two PL
actions of the PL group G Žwith Ž6.15.. on the PL groupoid P. In the final analysis, this
is a consequence of the fact that, as explained in Section 5, in the present situation we
have a PL action of G on the chiral WZNW phase space whose Poisson structure is
encoded by Ž P, , 4P .. Here Gˇ ; G must be Ad -invariant, see footnote 6.
In w25x PL groupoids are associated with arbitrary subalgebras K ; G , although the
corresponding dynamical r-matrices are described only if K is a Cartan subalgebra.
The K s G special case of their groupoids is in fact equivalent to our PL groupoid
whose structure function is the r-matrix in Ž3.95.. Their PL groupoids are different from
ours in general.
7. Conclusion
In this paper we explored the Poisson structures on the chiral WZNW phase space of
group valued quasiperiodic fields with generic monodromy. We have shown that the
possible PBs are defined by the exchange r-matrices that are solutions of Ž3.63.. This
equation can be viewed as an analogue of the celebrated CDYB equation Ž3.70., since
ˆ for chiral WZNW fields with diagonal monodromy. An
the latter plays a similar role
analysis of chiral WZNW Bloch waves and their classical Wakimoto realizations in the
spirit of the present paper is contained in w24x.
We have given an interpretation of our dynamical YB equation Ž3.63. in terms of a
family of PL groupoids, whose further study may be fruitful. In this respect, the most
interesting open questions appear to be to quantize these PL groupoids and to find
applications for them outside the chiral WZNW context. It is known that Eq. Ž3.70.
admits interesting applications in the field of integrable systems w31x.
We also investigated the PL symmetries of the exchange algebra Ž3.62.. We have
found that for any PL structure on the WZNW group G there is a corresponding choice
of the exchange r-matrix such that the standard gauge action of G on the chiral fields
becomes a PL action. It would be desirable to understand if this result has any analogue
at the level of the quantized Žchiral. WZNW model.
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 537
Acknowledgements
This investigation was supported in part by the Hungarian National Science Fund
ŽOTKA. under T019917, T030099, T025120, M028418 and by the Ministry of Educa-
tion under FKFP 0178r1999, FKFP 0596r1999.
In this appendix we present an explicit, local formula for the most general exchange
r-matrix on the simplest compact Lie group G s SUŽ2.. The formula ŽA.15. below is
valid in a neighbourhood of the unit element. It illustrates some general results obtained
in Section 3, and it may prove useful in a future study of the question whether globally
defined exchange r-matrices exist for SUŽ2. or not.
As a basis for the Lie algebra suŽ2., we choose the generators T a :s 21i sa , where the
sa Ž a s 1,2,3. are the usual Pauli matrices. We parametrize the matrices r a b Ž M . and
q ab Ž M . that appear in Ž3.36. in terms of 3-component vectors as
r a b s e a b cR c , q a b s e a b c Qc , Ž A.1 .
where e a b c is the totally antisymmetric tensor for which e 123 s 1. Furthermore, we
identify the SUŽ2. group manifold with S 3 ; R 4 by writing M g SUŽ2. according to
M s x 0 s 0 q ix a sa , x 02 q x a x a s 1, Ž A.2 .
whereby x 0 , x a define smooth functions on SUŽ2. Ž s 0 is the 2 = 2 unit matrix.. It is
then a matter of straightforward calculation to translate Eq. Ž3.37. into the formula
2 R a R a q 12 D aq R a q e a b c R c Dby R a s y 18 . Ž A.5 .
Observe that we cannot have a constant solution since the R a must be real. It is also
worth noting that locally we have the inverse of ŽA.3. given by
Q a s 14 D˜ y1 Ž yx a q 4 x 0 R a q 4e a b c x b R c . , ˜ x0 q 4 x a R a .
D:s Ž A.6 .
This formula defines via ŽA.1. a solution of ŽA.4. out of any solution of ŽA.5..
538 J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542
We shall now derive the general local solution of ŽA.5. by making explicit the
general local solution Ž3.93. of ŽA.4. that we have obtained in Subsection 3.5 for any
group. For this we need the exponential parametrization of SUŽ2.,
a
M s e 2 p G s e 2 p G aT with < G < 2 s Ga Ga - 1, Ž A.7 .
which covers the domain SUŽ2. _ ys 0 4 . Upon comparison with ŽA.2., we get
Ga
x 0 s cos Ž p < G < . , xa sy sin Ž p < G < . . Ž A.8 .
<G <
We shall also use the following expressions for the powers of the operator ad G . For the
odd powers, we have
2 nq1 n
Ž ad G . s Ž y1 . < G < 2 n Ž ad G . , n 0 0,
Ž ad G . Ž T a . s w G ,T a x s e a b c Tb Gc . Ž A.9 .
For the even powers, defining the matrix of Žad G . n by Žad G . n ŽT b . s wŽad G . n x a b T a,
we have
2n n Ga G b
Ž ad G . bs Ž y1. < G < 2 n d a b y
ž / , n 0 1. Ž A.10 .
a < G <2
Using these relations, we can rewrite the formulae Ž3.94. and Ž3.95. as follows:
2p < G < y sin Ž 2p < G < .
q0a b s e a b c Q cŽ0. with Q cŽ0. s Gc Ž A.11 .
8 < G <sin2 Ž p < G < .
and
1 Gc 1
r 0a b s e a b c R Ž0.
c with R Ž0.
c s
4 <G < ž cot Ž p < G < . y
p <G < / . Ž A.12 .
One may check that Eqs. ŽA.3. – ŽA.6. hold for these expressions, which represent
smooth functions on SUŽ2. _ ys 0 4 .
To obtain the most general 2-form r on SUŽ2. _ ys 0 4 that satisfies ŽA.4., we have
to add an arbitrary closed 2-form to the 2-form, r 0 , that corresponds to the matrix q0ab.
In fact, the result can be written as
sin Ž 2p < G < . y 2p < G <
r Ž G . s d Ga n d G b e cb a Gc
ž 2< G <3
q Uc Ž G . ,
/ Ž A.13 .
where UaŽ G . is a smooth ‘ vector function’ in the interior of the unit ball, < G < - 1,
which is divergence free, i.e.
3 E Ua Ž G .
Ý s 0. Ž A.14 .
as1 EGa
We then have to rewrite this 2-form in the manner indicated by the second parts of ŽA.1.
and ŽA.4.. By means of ŽA.3., this will provide us with the general local solution of
J. Balog et al.r Nuclear Physics B 568 [PM] (2000) 503–542 539
ŽA.5.. By performing the necessary Žrather tedious. calculations, in this way we obtain
the following formula:
Ga 1 1
Ra s < <
4 cot Ž p G . y
<G < < G < Ž 4p y 2 G P U .
Ga 1
q Ž G P U . y < G < Ua , Ž A.15 .
<G < Ž 4p y 2 G P U . 2 p < G <
where G P U s GaUa . This expression is valid on the open subset of SUŽ2. that excludes
ys 0 and the points where G P U s 2p . In particular, R a is smooth in a neighbourhood
of the unit element, for which G s 0, since UaŽ G . is smooth there by assumption. One
can also verify explicitly that on its domain of validity R a solves the dynamical YB
equation ŽA.5. for any divergence free UaŽ G .. For this verification, one needs to spell
out ŽA.5. more explicitly. For instance, if one uses the x a in ŽA.2. as coordinates around
s 0 g SUŽ2., then ŽA.5. becomes
(1 y x b xb E Ra E Ra E Ra
2 Ra Ra y q2 xa Rb s y 18 .
y2 Rb xb Ž A.16 .
2 E xa E xb E xa
In summary, we have derived the form of the most general exchange r-matrix in a
neighbourhood of the unit element of SUŽ2.. The solution ŽA.15. may in general
develop singularities away from the unit element, and it is an open question if globally
smooth solutions of ŽA.5. exist on SUŽ2. or not.
ab
Y
a
R ŽX .s b
ž exp Ž Y . y 1
exp Ž Y .
/ . Ž B.5 .
It is clear that all terms in Ž5.21. are built from powers of Y and structure constants. It
will prove useful to contract all the indices with Lie algebra generators and thus
reformulate Ž5.21. as an equation in the triple tensor product of the Lie algebra. We
introduce the notation
ak bl gs
² k ,l,m: :s Ž Y k . Ž Y l. Ž Y m. fkls Ta m Tb m Tg . Ž B.6 .
Cyclic permutation of the indices now corresponds to cyclic permutation of the tensor
factors and we also introduce the symbol
w k ,l,m x :s ² k ,l,m: q ² l,m,k : q ² m,k ,l : . Ž B.7 .
We expand both sides of Ž5.21. in powers of n and every coefficient of n also in
powers of Y . In our notation this latter expansion corresponds to putting together all
terms with a fixed total N s k q l q m, and Eq. Ž5.21. requires separately the equality of
all such terms on the two sides.
We start with the n 0 terms. We find that the N s 0 piece is satisfied identically,
while for N s 2 m q 2 Ž m s 0,1, . . . . we get
2 mq2
a mq 1 Ý Ž y1. sq 1 w0, s,2 m q 2 y s x y Ý a r a s w 0,2 s q 1,2 r q 1 x
ss0 rqssm
2r
q Ý ar ak Ý Ž y1. sq 1 w2 k q 2, s,2 r y s x s 0 . Ž B.8 .
rqksm ss0
Before proceeding we note that using the Jacobi identity for the structure constants
we can write down the following identity:
² k q 1,l,m: q ² k ,l q 1,m: q ² k ,l,m q 1: s 0 . Ž B.12 .
Now it is easy to see that ŽB.9. is a special case and ŽB.11. is a simple consequence
of the above identity. In fact, to prove ŽB.11. we group w0,2 m q 3,2 r q 1x with the first
Žw0,2 r q 1,2 m q 3x with the last. m q 1 terms of the sum:
m
s
Z1 s w 0,2 m q 3,2 r q 1 x q Ý Ž y1. w2 r q 2, s,2 m q 2 y s x ,
ss0
m
Z2 s w 0,2 r q 1,2 m q 3 x q Ý Ž y1. j w2 r q 2,2 m q 2 y j, j x ,
js0
References
Abstract
Various structural properties of the space of symmetry breaking boundary conditions that
preserve an orbifold subalgebra are established. To each such boundary condition we associate its
automorphism type. We show that correlation functions in the presence of such boundary
conditions are expressible in terms of twisted boundary blocks which obey twisted Ward
identities. The subset of boundary conditions that share the same automorphism type is controlled
by a classifying algebra, whose structure constants are shown to be traces on spaces of chiral
blocks. T-duality on boundary conditions is not a one-to-one map in general. These structures are
illustrated in a number of examples. Several applications, including the construction of non-BPS
boundary conditions in string theory, are exhibited. q 2000 Elsevier Science B.V. All rights
reserved.
1. Introduction
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 6 9 - 0
boundary condition– are precisely the one-dimensional irreducible representations of
C Ž A ..
This paper is a continuation to w1x. A very brief summary of the pertinent results of
w1x is as follows. The algebra C Ž A . is a commutative associative semisimple algebra.
Thus its regular representation is fully reducible, and the structure constants are
expressible through the corresponding diagonalizing matrix S˜ by an analogue of the
Verlinde formula. This matrix S, ˜ in turn, can be expressed in terms of various quantities
that are already known from the chiral conformal field theory associated to A Žsee
formula ŽA.6...
The conformal field theory with chiral algebra A can be obtained from the A-theory
as an orbifold by the group G; conversely, the original A-theory is recovered from the
A-theory as an integer spin simple current extension, with the group G of simple
currents being the character group of the orbifold group, G s G ) . Denoting the labels
for the primary fields of the A-theory by l and those for the primary A-fields by l, a
natural basis of C Ž A . is labeled by pairs Ž l, w ., where l refers to a A-primary in the
untwisted sector and w g Sl) is a group character, while the boundary conditions are
labeled by pairs w r , cˆr x consisting of an arbitrary primary label r of the A-theory and a
character cˆr g Ul) . Here the stabilizer Sl and the untwisted stabilizer Ul are sub-
groups of the simple current group G s G ) ; Sl consists of all simple currents in G
that leave l fixed, and Ul is the subgroup of Sl on which a certain alternating
bi-homomorphism Fl, which can be defined through the modular properties of one-point
chiral blocks on the torus, is trivial. For the precise meaning of these terms we refer to
w1x Žin particular appendix A. and to w4x. Quite generally, the number of basis elements
of C Ž A . – or equivalently, the number of independent boundary blocks, i.e. chiral
blocks for one-point correlation functions of bulk fields on the disk – and the number of
boundary conditions have to be equal. It is a rather non-trivial result of the analysis that
this equality indeed holds for the two sets of labels, so that in particular the matrix
S˜' S˜Ž l , w .,w r , cˆr x is a square matrix.
The results established in w1x clearly demonstrate an unexpectedly nice behavior of
the space of conformally invariant boundary conditions. In the present paper we show
that indeed this space is endowed with even more structure. Section 2 deals with the
implementation of the orbifold group on the representation spaces of the chiral algebra
A; this discussion does not yet involve boundary conditions at all, but the results will be
needed in the sequel. Sections 3–8 are devoted to the discussion of several additional
generic features of symmetry breaking boundary conditions. We start in Section 3 by
analyzing the notion of automorphism type, which in the present setting is a derived
concept that arises as a direct consequence of the general structure and does not need be
introduced by hand. Next we show that boundary conditions of definite automorphism
type can be naturally formulated with the help of twisted boundary blocks, which satisfy
twisted Ward identities ŽSection 4.. Furthermore, to the boundary conditions of fixed
automorphism type one can associate their own classifying algebra, which is an invariant
subalgebra of the total classifying algebra C Ž A . and whose structure constants can be
understood in terms of suitable traces on chiral blocks; this is done in Section 5. It is
also shown that the individual classifying algebra for automorphism type g only
depends on the automorphism g, but not on the specific orbifold subalgebra A, i.e. not
on the group G containing g.
Afterwards, in Section 6, we turn to a detailed study of the dependence of the
classifying algebra on the chosen torus partition function, which leads to the concept of
T-duality of boundary conditions. First we show that only the ‘difference’ between an
automorphism characterizing the torus partition function and the automorphism type of a
boundary condition is observable, and then we discuss aspects of T-duality among
Žfamilies of. boundary conditions for fixed choice of the torus partition function. We
emphasize that T-duality on boundary conditions is not a one-to-one map, in general. In
Section 7 we establish an action of the orbifold group G on the space of boundary
conditions, which implies a certain ‘homogeneity’ among the boundary conditions for
fixed G-orbit w r x. Finally we introduce in Section 8 the concept of a uniÕersal
classifying algebra, which governs all conformally invariant boundary conditions at the
same time, and discuss the possibility to obtain this algebra by a suitable projective
limit.
In Sections 9 and 10 we turn our attention to a specific class of boundary conditions
to which we refer as involutary, namely those where the orbifold group is Z 2 . We first
address some general features and then, in Section 10, analyze several classes of
examples that are of particular interest. Afterwards, in Section 11 we provide several
classes of examples with more complicated orbifold groups, in which for instance
untwisted stabilizer subgroups occur that are proper subgroups of the full stabilizers.
Finally, some of the pertinent formulae from Ref. w1x that will be needed in the sequel
are collected in Appendix A.
The elements g of the orbifold group G are automorphisms of the chiral algebra A.
In the sequel we will have to deal with various subgroups of G and their properties. We
first observe that every automorphism g of A can be implemented on the physical
A-modules Hl by maps
Qg ' QgŽ l. : Hl ™ Hg wl , Ž 2.1 .
which obey the g-twisted intertwining property
Qg Y s g Ž Y . Qg for all Y g A , Ž 2.2 .
and the maps Qg are defined by this property up to a scalar multiple. ŽFor a concrete
realization of these maps in WZW theories, see Ref. w5x.. In general, such an implemen-
tation QgŽ l. maps a given space Hl to some other A-module Hg wl , thereby organizing
the primary fields of the A-theory into orbits, much like the simple current group
G ( G ) organizes w6x the A-primaries into orbits.
To every A-primary l we can associate the stabilizer
Sl :s g g G < g wl s l4 , Ž 2.3 .
which is a subgroup of G whose elements constitute endomorphisms Hl ™ Hl .
Stabilizers of A-primaries on the same G-orbit are identical Žin the more general case of
non-abelian G, they are conjugate subgroups.; the vacuum has a maximal stabilizer,
SV s G. Also, A-modules on the same G-orbit are isomorphic as A-modules. The
endomorphisms Qg for g g Sl provide us with an action of Sl on Hl which is, in
general, only projective, and hence determines a two-cocycle El of Sl or, more
precisely Žin agreement with the fact that the maps Qg are defined only up to
normalization., the cohomology class of El. We denote by Ul the subgroup of Sl that
corresponds to the regular elements of the associated twisted group algebra C El Sl , i.e.
Ul :s g g Sl < El Ž g , g X . s El Ž g X , g . for all g X g Sl 4 . Ž 2.4 .
The scalar factors in the definition of the implementers Qg can be chosen in such a
way that the maps Qg with g g Ul provide us with a honest representation of Ul on
Hl. It follows that the A-modules Hl can be decomposed as
Hl ( [ VJˆ m Hl ,Jˆ , Ž 2.5 .
ĴgUl)
where the spaces VJˆ are projective Sl-modules and the spaces Hl,Jˆ are A-modules. We
make the mild technical assumption that all these modules VJˆ and Hl ,Jˆ are irreducible;
this holds true in all known examples, and is rigorously proven for the vacuum V w7,8x
as well as w9x for other A-modules, including twisted sectors. In the case of the vacuum,
no multiplicities appear in this decomposition; thus the action of G is genuine and we
have UV s SV s G.
Since by construction g leaves the subalgebra A of A fixed, the maps Qg are
ordinary intertwiners for A; hence in the decomposition Ž2.5. they act solely on the
degeneracy space VĴ . Moreover, by the general properties of twisted group algebras
Žcompare Appendix B of w1x., all the spaces VĴ have the same dimension < Sl <r< Ul < , and
(
the basis of the twisted group algebra C El Sl can be chosen in such a way that every
g g Ul is implemented as a diagonal matrix acting on VJˆ.
The result Ž2.5. should be compared to the similar decomposition Ž3.9. of Ref. w1x
that arises from the simple current point of view, i.e.
Hl ' Hw l , cˆ x s [ Vcˆ m HJ l . Ž 2.6 .
Jg GrSl
Here the spaces Vĉ are projective Sl-modules, with corresponding cocycle Fl of Sl ,
while the spaces HJ l are A-modules; by assumption, the latter modules are irreducible
Žthis assumption is indeed satisfied for all cases we know of.. Simple current theory
w4,6,10,11x shows that in the decomposition Ž2.6. isomorphic A-modules appear pre-
cisely as a consequence of fixed point resolution; therefore the multiplicity of HJ l in
this decomposition is given by < Ul) <. On the other hand, as seen above, elements of the
orbifold group G that are not in the stabilizer Sl relate isomorphic A-modules. Thus we
can identify the groups Ul) and GrSl . To make this manifest, we dualize the exact
sequence 0 ™ Ul ™ G and complete it to an exact sequence
0 ™ Sl ™ G s G ) ™ Ul) ™ 0 . Ž 2.7 .
Conversely, in the decomposition of a A-module Hl there appear < Ul) <
many irre-
ducible A-modules. In simple current language, the number of irreducibles is just the
length of the orbit, and hence again we can dualize 0 ™ Ul ™ G and complete it to
0 ™ Sl ™ G s G ) ™ Ul) ™ 0 . Ž 2.8 .
As a consequence, the cardinalities of the respective subgroups of G and G are related
by
< Sl < < Ul < s < G < s < G < s < Sl < < Ul < , Ž 2.9 .
and hence in particular the dimensions of the degeneracy spaces in the decompositions
Ž2.5. and Ž2.6. coincide:
2 2
Ž dim Vcˆ . s < Sl <r< Ul < s < Sl <r< Ul < s Ž dim VJˆ . . Ž 2.10 .
We abbreviate these dimensions by
dl :s dim Vcˆ s dim VJˆ . Ž 2.11 .
There is also a manifest relationship between the groups Sl and GrUl . First we
realize that the implementation of G on the whole G-orbit of l provides us with a
two-cocycle of G with values in UŽ1. whose restriction to Sl = Sl coincides with El ; we
again denote this cocycle by the symbol El. Given such a cocycle, its commutator
cocycle El, which is defined by
El Ž g , g X . :s El Ž g , g X . rE El Ž g X , g . Ž 2.12 .
X
w x
for all g, g g G, constitutes 1 a bi-homomorphism on G = G which is alternating in
the sense that ElŽ g X , g . s ElŽ g, g X . ) . Now let us characterize for every J g Sl an
element h J of G by the property that
El Ž h J , g . s J Ž g . for all g g G ; Ž 2.13 .
such a group element h J exists because, owing to the exactness of the sequence Ž2.8.,
for every J g Sl we have JŽ g . s 1 for all g g Ul , and Ž2.13. characterizes h J
uniquely up to an element of Ul. Furthermore, as a consequence of the character
property of El in the first argument, we have
El Ž h JJX , g . s El Ž h J h JX , g . Ž 2.14 .
for all g g G, which tells us that h J h JX s h JJX modulo Ul . It follows that the mapping
J l h J Ul Ž 2.15 .
constitutes an isomorphism between Sl and GrUl . It is worth noting that this
isomorphism is logically independent from the isomorphism between G )rS Sl and Ul)
that exists according to the sequence Ž2.8..
Again this result has an obvious dual analogue. To work this out we recall from
Appendix A of w1x that the commutator cocycle Fl of Fl , defined by FlŽJ,L. s
FlŽJ,L.rF FlŽL,J. for J,L g Sl , possesses a natural extension. Fl is an alternating
bi-homomorphism on Sl = Sl , while its extension is a bi-homomorphism on G = Sl ;
by imposing the alternating property it can be further extended to a bi-homomorphism
on G = G , still to be denoted by Fl. By the character property of Fl in the second
argument, we can then associate to every g g Sl an element K g ' K Žgl. of G by
stipulating that
Fl Ž K g ,L . s g Ž L . for all L g G , Ž 2.16 .
which determines K g uniquely up to elements of Ul . The character property of Fl in the
first argument implies
Fl Ž K g g X ,L . s Fl Ž K g K g X ,L . , Ž 2.17 .
so that K g K g X s K g g X modulo Ul . Thus the map
g l K g Ul Ž 2.18 .
between Sl and GrU
Ul is an isomorphism.
For later reference we also mention another isomorphism that is similar to Ž2.18..
Namely, we now consider a character gˆ of Sl rather than a character g of G . Then the
requirement that
ˆ gˆ ,L s g Ž L .
Fl Kž / for all L g Sl Ž 2.19 .
ˆ gˆ g Sl uniquely modulo Ul , and
determines K
ˆ gˆ Ul
gˆ l K Ž 2.20 .
is an isomorphism between those elements of Sl) which are the identity on Ul Žand
hence can be regarded as restrictions of elements of Sl to Sl . and SlrU Ul . Furthermore,
combining the prescriptions above we learn that the restrictions of the relevant bi-homo-
morphisms to the stabilizer groups are closely related. Indeed, denoting by gˆ the
restriction of a given element g of Sl to Sl , we have
ˆ gˆX Ž g . s gˆ Kˆ gˆX s Fl Kˆ gˆ ,Kˆ gˆX
El Ž g , g X . s K ž / ž / Ž 2.21 .
ˆ gˆ ,Kˆ gˆX g Sl.
with g, g X g Sl and K
3. Automorphism types
ˆ
We have demonstrated in w1x that the reflection coefficients R wŽ lr,, cwr.;x V , i.e. the operator
product coefficients in the expansion ŽA.5. of a bulk field approaching the boundary, are
equal to one-dimensional irreducible representations R w r , cˆr x of the classifying algebra
C Ž A ., evaluated at the basis element F˜ Ž l, w . of C Ž A .. Thus they are given by Žsee
formula Ž5.48. of Ref. w1x.
ˆ
R wŽ lr ,, cwr.;x V s R w r , cˆr x F˜ Ž l , w . s S˜Ž l , w .,w r , cˆr xrS˜V ,w r , cˆr x .
ž / Ž 3.1 .
Moreover, it follows from the sum rule ŽA.7. and the fact that C Ž A . is semisimple, that
the reflection coefficients even provide all inequivalent irreducible C Ž A .-representa-
tions. The isomorphism classes of irreducible C Ž A .-representations are in one-to-one
correspondence with the conformally invariant boundary conditions that preserve the
orbifold subalgebra A s A G of the chiral algebra A. In this section we discuss some
implications of this basic result.
Let us associate to each boundary condition r ' w r , cˆr x the collection of all mon-
odromy charges QJ Ž r ., J g G , of r . The monodromy charges do not depend on the
choice of a representative of the G-orbit w r x, and via the prescription
g r Ž J . :s exp Ž 2p iQJ Ž r . . Ž 3.2 .
for all J g G , they furnish a character
gr g G ) Ž 3.3 .
of the simple current group. ŽThis G-character should not be confused with cˆr , which is
a character of the subgroup Ur : G .. The group G ) s Ž G ) . ) can be naturally
identified with the orbifold group, G ) ' G, and hence the quantity g r can be regarded
as an element of G.
To proceed, we observe that because of the simple current symmetry
S˜JŽ l , w .,w r , cˆ x s g r Ž J . S˜Ž l , w .,w r , cˆ x Ž 3.4 .
that was established in formula Ž5.16. of Ref. w1x, we have
S˜JŽ l , w .,w r , cˆr x S˜Ž l , w .,w r , cˆr x
R w r , cˆr x F˜ JŽ l , w . s
ž / s gr Ž J. s g r Ž J . P R w r , cˆr x F˜ Ž l , w .
ž /
S˜V ,w r , cˆr x S˜V ,w r , cˆr x
Ž 3.5 .
for every simple current J g G . Now the reflection coefficients constitute the main
ingredient in the relation between the boundary blocks B ˜ Ž l, w . Ždefined in formula ŽA.1..
and the boundary states Bw r , cˆ x. When using the notation introduced in Ž3.2., the precise
relationship, established in formula Ž6.4. of Ref. w1x, reads
ˆ r , cˆ x w r , cˆ x
R wŽ lr ,, cw .;x V ²CV ˜ Ž l,w . .
w
Bw r , cˆ x s [ [ :B Ž 3.6 .
l wg Sl)
g l'1
Thus the observation Ž3.5. tells us that the boundary blocks B ˜ Ž l, w . for primary fields l
of the A-theory that lie on one and the same G-orbit contribute to the boundary states
Bw r , cˆ x with a fixed relative phase, which is determined by the element g r of the orbifold
group. Put differently, in the presence of the boundary condition r ' w r , cˆ x the
reflection of a bulk field at the boundary is twisted by the action of the group element
g r g G.
This observation suggests that, in the terminology of w12x, the orbifold group element
g r provides us with the automorphism type1 of the boundary condition r . To establish
that this is indeed the case, we insert the expression Ž3.1. for the reflection coefficients
and the explicit values of the one-point correlators of the boundary vacuum fields
w r , cˆ xw r , cˆ x
CV into formula Ž3.6., so as to arrive at
Bw r , cˆ x s [ [ S˜Ž l , w .,w r , cˆ x B˜ Ž l , w . . Ž 3.7 .
l wg Sl)
g l'1
Next we split the summation over all untwisted l into a summation over G-orbits and
one within orbits, and the summation over Sl) into one over Ul) and one over
Sl) rUUl) . To this end we choose Žonce and for all. arbitrarily a set l ( 4 of representa-
tives of the set of G-orbits and a set w( 4 of representatives of the classes of Sl) rU
Ul) ;
more precisely, the symbol l ( will refer to the chosen representative of the orbit w lx,
1
The corresponding term in w13x is the gluing automorphism. The information contained in a boundary
condition that goes beyond the automorphism type was referred to as the Chan – Paton type in w12x. Thinking
in analogy with the general analysis of modular invariant partition functions on the torus, it may seem to be
more suggestive to take the fusion rule automorphism g w as a starting point for the description of the
automorphism type w12x. However, several different automorphism types may give rise to one and the same
permutation g w. For instance, in the case of WZW theories, automorphisms of the underlying finite-dimen-
sional compact simple Lie algebra g provide us with an automorphism type g, but whenever that
automorphism of g is inner, the associated map g w is just the identity. Another example is given by the inner
automorphisms of the rational free boson theories whose fixed point algebras correspond to the boson theory
compactified at an integral multiple of the original radius; this will be discussed in Subsection 11.1.
and w( g Sl) to the chosen representative of the class in Sl)rU
Ul) that restricts to
wˆ g Ul , i.e. satisfies w( < Ul s wˆ . Then 3.7 becomes
) Ž .
Bw r , cˆ x s [ [ )
[ S˜JŽ l ( , w ( .,w r , cˆ x
˜ JŽ l ( , w ( .
B
w l ( x wˆ g Ul Jg GrUl
g l'1
s [ [ S˜Ž l ) ( , w ( .,w r , cˆ x [ ˜ JŽ l ( , w ( . .
g r Ž J. B Ž 3.8 .
w l ( x wˆ g Ul Jg GrUl
g l'1
Here in the second line we have used the simple current relation Ž3.4., as well as the fact
that by this identity the matrix element S˜Ž l, w .,w r , cˆ x vanishes when g r ŽJ. / 1 for any
J g Ul. The latter observation shows that only those boundary blocks B ˜ Ž l, w . contribute
to the boundary state Bw r , cˆ x for which the character g r is equal to one on the whole
untwisted stabilizer Ul : G , which in turn implies that g r g G is actually an element
of Sl : G, i.e.
g r g Sl . Ž 3.9 .
Thus as a character of G the function g r factorizes to a character of GrU Ul ; according
to the sequence Ž2.7., the latter group can be identified with Sl) . As we will see later,
this result is perfectly natural.
We now concentrate on the J-summation for fixed values of l ( and w( . To proceed,
we need a few further tools. First, it turns out to be useful to introduce for every g g Sl
and every J g G the endomorphism
Og ,J :s dly1 r2 Ý g Ž L . OJL w ( Ž 3.10 .
Lg SlrUl
of Vĉ , where Oc are the endomorphisms defined by ŽA.3.. Inserting that formula for Oc
and interchanging the order of summations, the maps Og ,J can also be written in the
form
g Ž LX . Fl Ž L,LX .
) )
Og ,J s dly2 Ý w( Ž L . Rwˆ Ž L . Ý
X
Lg SlrUl J L g Sl rUl
)
s Ý
)
ˆg
w( Ž L . Rwˆ Ž L . P d L ,Kˆ g sJ w( K ž / ˆg ,
Rwˆ K
ž / Ž 3.11 .
Lg SlrUl J
where Og ,J ( is defined in Ž3.10. and Pm is the projector from the A-module Hl to its
isotypical component of type Hm . Owing to the identity Ž3.12., Qg is in fact indepen-
dent of the choice of representatives J( Žwhereas Og ,J ( again depends on that choice.,
and from Ž3.13. it follows that
ˆ g ,Kˆ g X Qg g X
Qg Qg X s Fl K ž / Ž 3.17 .
X
for all g, g g Sl. Let us also note that when specializing to the A-vacuum sector,
where UV s SV s G, we simply have K ˆ g s 1 and hence Og ,J s id for all J g G , so that
ŽV .
Qg s [J g G g J PJ .
Ž .
With these results at hand, we can now address the J-summation that appears in
formula Ž3.8.. Inserting the definition ŽA.1. of the boundary blocks B,˜ we find that
[ ˜ JŽ l ( , w ( . s
g Ž J. B [ g Ž J( . [ ˜ Ž J(l ( , J
g Ž L. B w(.
(L
Jg GrUl J(g GrSl Lg Sl rUl
s Nl dly2 [ g Ž J( . [ g Ž L . b( (
J(g GrSl Lg Sl rUl
w( Ž LX . Rwˆ Ž LX . m id m B J ( l ( , Ž 3.18 .
)
= ž X
[
L g SlrUl J L
(
/
Vcˆl m Vcˆlq ™ C is the non-degenerate linear form defined in Ž4.21. of Ref. w1x.
where b( :V
Performing the L-summation, this becomes
[ ˜ JŽ l ( , w ( . s Nl
g Ž J. B [ g Ž J( . b ( (
Jg GrUl J(g GrSl
)
= ž w ž Kˆ /
J( ( g
ˆ g m id m B J ( l (s Nl BlŽ g .
Rwˆ K
ž / /
Ž 3.19 .
with
BlŽ g . :s Bl ( Ž Qg m id . . Ž 3.20 .
Here Bl s BlŽ1. are the boundary blocks of the A-theory, which are given by expression
Ž4.25. of Ref. w1x, and we have used that via the identity ŽA.4. they can be written in the
form
Bl ' Bw l , wˆ x s Nly1 [ [ B˜ Ž Jl ( ,w .
wg Sl) Jg GrSl
w% ŵ
s Nly1 [ [ B˜ Ž Jl ( ,Lw ( .
Jg GrSl Lg Sl rUl
s dly1 r2 [ b( ( ž Ý /
OL w ( m id m B J l s [ b( m B J l . Ž 3.21 .
Jg GrSl Lg Sl rUl Jg GrSl
At this point it is worth realizing that the twisted intertwining property Ž2.2. of Qg is
formulated independently of the subalgebra A, and hence Qg only depends on the
automorphism g itself, but not on the particular orbifold group G containing g we are
considering. By the result Ž3.20., this independence on the choice of G then holds for
the quantities BlŽ g ., too.
Inserting the result Ž3.19. into formula Ž3.8., we finally see that
Br s [ Nl [ S˜Ž l ( , w ( ., r
BlŽ g r . . Ž 3.22 .
w l( x wˆ g Ul)
g l'1
Thus, in summary, the boundary state Br can be entirely constructed from the
information contained in the boundary blocks B w l , wˆ x together with the action of Sl they
carry and in the character g r of G. As we will see, this implies that g r indeed
constitutes the automorphism type of the boundary condition r . We have also seen that
only those boundary blocks B ˜ Ž l, w . contribute to the boundary state Br for which the
stabilizer Sl contains g r which, incidentally, shows that the factorization to a character
Ž
of GrU Ul is a rather natural property of the elements of Sl .. This should be regarded as
a selection rule on the possible boundary blocks that show up in the boundary state; the
concrete form of this selection rule is completely determined by the automorphism type
of the boundary condition.
Our derivation also demonstrates that in the case of our interest one can associate an
automorphism of the chiral algebra A to eÕery boundary condition. This comes as a
result of our analysis and does not have to be put in as an assumption. In contrast, when
the subalgebra A that is preserved by a boundary condition is not an orbifold subalgebra
A G , then the boundary condition need not necessarily possess an automorphism type.
Indeed, as we will see in Section 11.3, most conformally invariant boundary conditions
of the Z 2-orbifold of a free boson, compactified at a rational radius squared, do not
possess an automorphism type.
Here JX is defined by J s JX J(X with J(X the representative for a class in GrS Sl Žsee before
formula Ž3.16.., and RJ ( l (Ž YJ . is a map from HJ ( l ( to HJX( J ( l ( . To proceed we use the
commutation properties
ˆ g s Fl JX ,Kˆ g Rwˆ Kˆ g Rwˆ Ž JX . s g Ž JX . Rwˆ Kˆ g Rwˆ Ž JX .
Rwˆ Ž JX . Rwˆ K
ž / ž / ž / )
ž / Ž 4.2 .
and
[ ž id m R Ž Y . / ( P
J(l ( J J ( l ( s RJ ( l (
XX XX Ž YJ .
J(g GrSl
2
This relation is needed for compatibility with the fact that while the isomorphism class of the projective
Sl-representation Rwˆ is the same for all values of l within the class w l ( x, its explicit realization can depend
on l, and this dependence should precisely be characterized by Fl.
a character of the simple current group G s G ) . Formula Ž4.5. reproduces the twisted
intertwiner property Ž2.2. and hence is the desired result.
We also remark that according to Section 2 the twisted intertwiners carry a projective
representation that is characterized by the cohomology class of the cocycle El or,
equivalently, by the commutator cocycle El. On the other hand, according to relation
Ž3.17. the concrete realization Ž3.16. of the twisted intertwiners can be characterized by
the commutator cocycle Fl. This is compatible because of the identity Ž2.21..
Now for every field Y Ž z . s Ý n g Z Yn zyn y D Y of conformal weight D Y in the chiral
algebra A, the ordinary boundary block BlŽ1. of the A-theory obeys the Ward identity
appropriate for a two-point block on P 1. That is,
BlŽ1. ( Ž Rl Ž Yn . m 1 q z Y 1 m Rl Ž Yyn . . s 0 Ž 4.6 .
with z Y s Žy1 . DYy1
. When combined with the definition Ž3.20. of the twisted boundary
blocks, the twisted intertwiner property Ž4.5. therefore allows us to write Žsuppressing
from now on the representation symbol Rl .
BlŽ g . ( Ž Yn m 1 . s Bl ( Ž Qg m id . ( Ž Yn m 1 . s Bl ( Ž g Ž Yn . m 1 . ( Ž Qg m id .
s yz Y Bl ( Ž 1 m g Ž Yyn . . ( Ž Qg m id .
We now restrict our attention to the collection of boundary conditions that possess
some fixed automorphism type g. According to the results of Section 8 the correspond-
ing boundary states can all be written as linear combinations of the twisted boundary
blocks Ž3.20. with fixed g. This suggests to study analogous elements of the classifying
algebra; accordingly we introduce for every l with g g Sl the linear combination
< Ul <
g Ž J . F˜ JŽ l ( , c ( .
)
FlŽ g . :s Ý Ž 5.1 .
< G< Jg GrUl
that follows for every g g Sl with the help of the simple current symmetry Ž3.4..
ŽWhen g is not in Sl, then according to the remarks after Ž3.8. this expression
vanishes..
Using the Verlinde-like formula that expresses the structure constants of C Ž A . in
˜ the result Ž5.2. allows us to compute the product of
terms of the diagonalizing matrix S,
two elements of C Ž A . of the form Ž5.1. as
X XX
FlŽ g .wFlŽX g . s d g , g X Ý NlŽ ,gl.X lFlŽXXg . Ž 5.3 .
XX
l
with3
XX XX y1
NlŽ ,gl.X l ' NlŽ ,gl.X l Ž A . :s Ý S˜Ž l ( , c ( ., r
S˜Ž lX( , c X( ., r S˜Ž)lXX( , c XX( ., r S˜V , r
ž / . Ž 5.4 .
r
g rsg
This means that the elements FlŽ g . for all l with g g Sl span not only a subalgebra,
but even an ideal of the classifying algebra C Ž A .. We call this ideal of C Ž A . the
indiÕidual classifying algebra for automorphism type g and denote it by C Ž g . Ž A .. Also,
by construction the FlŽ g . are linearly independent, and hence they furnish a basis of
C Ž g . Ž A ., i.e. for every fixed g g G we have
3
Note that this expression is only defined when g g Sl l SlX l SlXX .
With the help of the identities Ž2.9. among the sizes of the various subgroups it thus
follows that
Ý < Sl < s Ý < Sl < . Ž 5.6 .
l l
Q G Ž l .s0
This decomposition may be regarded as expressing the fact that in the situation of our
interest every boundary condition has an automorphism type. Put differently, the algebra
C Ž A . provides a unified description of the boundary conditions for the < G < different
automorphism types that correspond to the elements of G.
In the special case of trivial automorphism type, g s 1, we can use the result w1x that
S˜Ž l, c ., r s Sl , r for all r with g r s 1 to see that the ideal C Ž1. Ž A . is nothing but the
fusion rule algebra of the A-theory, so that we recover the known results w3,17x for
boundary conditions that do not break any of the bulk symmetries. It should also be
noticed that the precise form of the structure constants of the ideals C Ž g . Ž A . does
depend on the choice of representatives l ( and c( Žexcept when g s 1, where
independence of this choice follows as a consequence of the simple current relation
Ž3.4... This is, however, perfectly fine, because the twisted boundary block depends on
the choice of representatives as well, and in fact in a manner so as to cancel the overall
dependence in all physically meaningful quantities like one-point correlators for bulk
fields on the disk.
To analyze the prefactor appearing here, we first remark that the index of the subgroup
SlXXX in Sl is some divisor nl of < GX <. There is a simple current Jl g Sl which plays an
analogous role for the embedding SlXXX : Sl as Jg plays for the embedding of G XX in G ,
i.e. nl is the smallest power such that Jlnl is in SlXXX , and the elements of Sl are of the
form Jlm KXX with KXX g SlXXX and m s 1,2, . . . ,nl y 1. Moreover, from the fact that
g g Sl it follows with the help of duality that Jlm f Ul for all m s 1,2, . . . ,nl y 1, and
hence we have Ul : SlXX . Thus when forming the untwisted stabilizer associated to SlXXX
one does not lose any elements of Ul, so that Ul : UlXXX . Observing that Ul is precisely
the kernel of the group homomorphism from UlXXX to C that maps KXX g UlXXX to the nlth
root of unity FlŽJl ,K XX ., it follows that the index of Ul in UlXXX is nl . Thus we have
< Sl <r< SlXXX < s nl s < UlXXX <r< Ul < , Ž 5.16 .
so that Ž5.15. reduces to
C Ž g . Ž A G . s C Ž g . Ž A ² g : . s :C
CŽ g. Ž 5.18 .
for every finite abelian orbifold group G with G 2 g. It is worth pointing out that the
fact that the individual classifying algebra does not depend on the preserved subalgebra
˜
constitutes another quite non-trivial check of our ansatz for the diagonalizing matrix S.
Ž g.
We also learn that the structure constants of the algebra C read
X X X 2
SlX X , r X SmX X , r X SnX )X , r X
NlŽXg, m. Xn s NlŽXg, m. Xn ŽA ² g:
. s<G < Ý X . Ž 5.19 .
w rX x X
SV X
,r
X
g rXsg
The independence on the specific group G finds its natural explanation in the fact
that these numbers are interpretable as traces of appropriate maps on bundles of chiral
blocks w12x. Namely, since for every l we are given the twisted intertwiner maps
QgŽ l. : Hl ™ Hg wl Ž2.1., we also have the corresponding tensor product maps
Qg ' QgŽ,lg1,l. 2. ..,.g. l m . :s QgŽ l1 . m QgŽ l 2 . m . . . m QgŽ l m . Ž 5.21 .
on tensor products of A-modules. In view of the definition w18,19x of chiral blocks
Vl1l 2 .. . l m as singlets Žwith respect to a suitable block algebra. in the algebraic dual
Ž Hl m Hl m . . . m Hl . ) of these tensor products, the twisted intertwining property
1 2 m
together with the fact that the automorphism g respects the grading of the chiral algebra
implies the existence of a linear map
Qg) ' Qg), gŽ,l.1. .l,2g. . . l m . : Vl1 l 2 . . . l m ™ Vg wl1 g wl 2 . . . g wl m Ž 5.22 .
between spaces of chiral blocks. When g wl i s l i for all i s 1,2, . . . ,m, this linear map
Qg) is an endomorphism so that one can compute its trace; we will be interested in the
traces of three-point blocks. ŽA concrete description of the block algebras and hence of
the maps Qg) is so far only available for the case of WZW theories, where the situation
can be analyzed in terms of the horizontal subalgebra of the relevant affine Lie algebra..
Now in the case of our interest, where all simple current stabilizers in the AX-theory are
trivial, the fusion rules of the A-theory can be expressed through the modular S-matrix
of the AX-theory as
n X 2
1 SlX X , r X SmX X , r X SnX )X , r X
Nl , m s < G < Ý < SrXX < SVX X , r X
w rX x X
X
Q G X Ž r .s0
rX .
SlX X , r X SmX X , r X SnX )X , r X X
s Ý Ý e2p i Q JX Ž
X s Ý NlX X J, nmX Ž 5.23 .
X
J gG
X
rX SV X
,r
X X
J gG
X
This indicates that the chiral three-point blocks of our interest can be decomposed into
the direct sum of spaces of chiral blocks of the AX-theory. Such a decomposition should
in fact be expected on general grounds, and the chiral blocks of the AX-theory should fit
together to sub-bundles of the bundles of chiral blocks of the A-theory. Now when
restricted to irreducible AX-modules the maps Qg are ordinary intertwiners, and as a
consequence the map Qg) acts on the subspaces of chiral blocks as a multiple of the
X
identity. On the subspace of dimension NlŽXg, m. XJn the map Qg) should therefore act with
eigenvalue g ŽJ.. It thus follows that upon choosing representatives of the G X-orbits, the
trace of this map is precisely given by the number Ž5.20.:
X
tr Vlmn Qg) Ž lmn . s NlŽXg, m. Xn Ž A ² g : . . Ž 5.24 .
6. T-duality
In all the considerations so far, we have required that the torus partition function, and
correspondingly the pairing of the labels l and l˜ of the bulk fields fl, l˜ , is given by
charge conjugation, which we denote by sc ) : l˜ s sc ) Ž l. ' lq. In this section we
analyze what happens when a different torus partition function is chosen.4 To this end
we first have to state what we mean by a chiral algebra A and its automorphisms in
more concrete terms than was done so far. In mathematical terms, a chiral algebra5 is a
Õertex operator algebra w21–23x; the relevant data are therefore the vector space HV ,
the vacuum vector V g HV , the Virasoro element Õ Vir g HV , and a ‘ vertex operator
map’ Y. The latter realizes the state–field correspondence, i.e. associates to every
Õ g HV a field operator Y Ž Õ; z . Žtechnically, a linear map from HV to EndŽ HV . m
Cww z, zy1 xx with z a formal variable., e.g. the energy–momentum tensor T to the
Virasoro element, Y Ž Õ Vir ; z . s T Ž z .. By an automorphism of a vertex operator algebra
A we mean an invertible linear map
s: HV ™ HV Ž 6.1 .
that is compatible with state–field correspondence, i.e. satisfies
sy1 Y Ž s Õ ; z . s s Y Ž Õ ; z . Ž 6.2 .
for all Õ g HV . ŽLet us stress that – unlike e.g. in w24x – at this point we do not require
that the map s leaves the Virasoro element fixed.. As already outlined in Section 2,
each such map is accompanied by a permutation s ) of the label set I:s l4 of
A-primaries and by twisted intertwiners Qs ' QsŽ l. : Hl ™ Hs ) l between the corre-
sponding irreducible A-modules.
4
It is by no means necessary that the permutation that Žfor maximally extended chiral algebra. characterizes
the torus partition function is equal to the permutation g w defined via the automorphism type– if such an
automorphism type exists at all – of a boundary condition. While both mappings are associated to the
transition from chiral conformal field theory Ži.e. conformal field theory on a complex curve. to full
conformal field theory Žconformal field theory on a real two-dimensional surface., they refer to such a
transition for two different world-sheets – the disk and the torus, respectively– which are not related by any
factorization rules. As a consequence, they can be chosen independently.
5
Not to be confused with the global object for which the term chiral algebra has also been used in the
recent mathematical literature w20x, which we prefer to call a block algebra.
To proceed, we introduce two particular subgroups of the group G of all maps f
from I to AutŽ A .. For every f g G and every l g I, the image
fl :s f Ž l . : HV ™ HV Ž 6.3 .
is an automorphism in AutŽ A .. We denote by GZ the subgroup of all those elements f
of G for which the map 6
p f) : I ™ I, l ¨ fl) Ž l . Ž 6.4 .
preserves conformal weights modulo integers, i.e. fulfills Dp f) Ž l . s Dl mod Z for all
l g I, and is an automorphism of the fusion rules, i.e. satisfies
Np f) Ž l .,p f) Ž m .,p f) Ž n . ' N fl) Ž l ., fm) Ž m ., fn) Ž n . s Nl , m , n Ž 6.5 .
for all l, m , n g I. ŽExamples for such automorphisms are those induced by simple
currents, see e.g. Refs. w6,25x.. Every element of GZ gives rise to a modular invariant
torus partition function
)
Z f Ž t . :s Ý xl Ž t . xp ) q
f Žl .
Žt . . Ž 6.6 .
lgI
The second subgroup of G of our interest consists of constant maps f whose image
– to be denoted by g f – is an automorphism of A that leaves the Virasoro element
fixed, g f Ž Õ Vir . s Õ Vir Žand hence in particular obeys g f) Ž V . s V and preserves confor-
mal weights exactly, not only modulo integers.. We denote the subgroup of those maps
of this kind that also lie7 in GZ by GB . Every automorphism g f of this subgroup of GZ
with f g GB can be used to define conformally invariant boundary conditions. Further-
more, for every f g GB and every f X g GZ the modular invariant partition functions Z f X
and Z f f X are physically indistinguishable, i.e. upon a suitable relabelling of the fields all
correlation functions in the associated conformal field theories coincide. Accordingly, it
is appropriate to refer to GB as the T-duality group of the theory. For every chiral
algebra A the T-duality group GB contains in particular the map f c whose image is the
charge conjugation automorphism sc , i.e. f c Ž l. s sc for all l g I, and p f)c Ž l. s sc Ž l.
s lq.
The two theories with partition functions Z f X and Z f f X being indistinguishable, in
particular the respective sets of all conformally invariant boundary conditions must be
the same. It is worth investigating this correspondence in some detail. Let us denote for
any f g GB and any f X g GZ by
X X
CŽ gf ; f . 'CŽ gf ;f .Ž A² g: . Ž 6.7 .
the classifying algebra for boundary conditions of automorphism type g f for a confor-
mal field theory with torus partition function Z f X . Recall from Section 5 that this algebra
Žas well as its distinguished basis. can be constructed by starting with any arbitrary finite
6
Note that typically several distinct f g G will give rise to one and the same permutation p f) .
7
In fact one should expect that the property of inducing a fusion rule automorphism need not be required
independently, but is satisfied automatically as a consequence of the consistency of the relevant orbifold
theory. This has been demonstrated in the case of order-2 automorphisms in w5x. Indeed, consistency of the
orbifold theory requires that the S-matrix of the A-theory behaves with respect to the permutation g w that is
induced by the non-trivial element g of Z 2 via the maps Qg as Sl , g wm s S g wl , m . When combined with the
Verlinde formula, this implies that g w furnishes an automorphism of the fusion rules of the A-theory.
abelian group G containing g. As also discussed there, for the case X
where f X s f c
corresponds to charge conjugation, the structure constants of C Ž g f ; f . are given by the
traces over the linear maps induced by Qg f on the three-point chiral blocks:
8
One might have expected that here products of Q-maps rather than the Q-map for the product of
automorphisms appears. But when the group GZ is realized projectively, this would lead to inconsistencies.
When GZ is realized genuinely, then the two descriptions are equivalent.
While in the considerations above the T-duality transformations had to be applied to
the torus partition function and to the boundary conditions simultaneously, there is also a
slightly different notion of T-duality for boundary conditions, to which we now turn our
attention. Namely, we keep the bulk theory fixed, and ask whether boundary conditions
of different automorphism type can be associated to each other through a suitable
element of the T-duality group. While in many cases this question turns out to have an
affirmative answer, the relationship in question is between families of boundary
conditions rather than between individual boundary conditions. In short, T-duality on
boundary conditions is not a one-to-one map, in general. Closer inspection shows that
the relevant families of boundary conditions can be understood as orbits of boundary
conditions with respect to some suitable symmetry. Here by the term symmetry we mean
a bijection g of the space of boundary conditions of a given automorphism type g such
that the annulus amplitudes coincide, i.e.
A g Ž r . g Ž rX . Ž t . s A r rX Ž t . Ž 6.12 .
X
for all r , r with automorphism type g r s g s g r X .
Let us study the presence of such a symmetry first in the example of the critical
three-state Potts model. It is known w29x that the duality symmetry of this model maps
the free boundary condition to any of the three fixed boundary conditions, and indeed
this is a specific case of the general duality w30x between free and ‘configurational’
boundary conditions of lattice spin models. Similarly, the new boundary condition
discovered in w29x gets mapped to any of the three mixed boundary conditions. In this
case the symmetry group H s Z 3 of the fixed or mixed boundary conditions is directly
inherited from the lattice realization of the Potts model. We also observe that precisely
those boundary conditions for which this Z 3-symmetry is spontaneously broken are
grouped in non-trivial orbits. Furthermore, one readily checks that for orbits that are
related by the T-duality p T) s sc ) , the sum rule
Npy1
)
T Ž r.
Npy1
) X
T Žr .
Ý A gp T) Ž r . g X p T) Ž r X . s Nry1 Nry1
X Ý A g Ž r . g XŽ rX . Ž 6.13 .
X
g ,g gH g ,g XgH
for the annulus coefficients holds, where Nr is the order of the stabilizer of the H-action
on r . Roughly speaking, the sum rule Ž6.13. tells us that T-dual orbits give rise to an
equal number of open string states on the boundary.
This pattern can be detected in various other examples as well. For instance, in the
theory of a single uncompactified free boson, there is a single Neumann boundary
condition, whereas the Dirichlet boundary conditions are labeled by a position in R,
which should be interpreted as the affine space over the group H s R of translations.
This group is spontaneously broken for Dirichlet boundary conditions, and it is
straightforward to check that relations Ž6.12. and Ž6.13. are satisfied in this case as well.
Another class of examples is provided by boundary conditions in WZW theories that
break the bulk symmetry via an inner automorphism of the underlying simple Lie
algebra g. In these cases the boundary conditions of each automorphism type are labeled
by the same set, namely by the primary labels of the original theory Žsee w5x for the case
of automorphisms of order two.. The group H is in this case realized by the action of
simple currents of the original theory, which account for the different possible choices of
the shift vector that characterizes the inner automorphism of g. Again the validity of
relations Ž6.12. and Ž6.13. is easily verified.
We would like to emphasize, though, that the existence of such T-duality relations is
in fact a quite special feature of an individual model. In general we do not expect that
boundary conditions of different automorphism type are related in such a manner. For
instance, in the case of boundary conditions of WZW theories that break the bulk
symmetry via an outer automorphism of g, no relations of the form Ž6.12. or Ž6.13. are
known to us.
7. Boundary homogeneity
In this section we exhibit another general aspect of the space of conformally invariant
boundary conditions that preserve only a subalgebra A of A. Namely, we show that the
orbifold group G is realized as a group of symmetries on the space of boundary
conditions. These symmetries permute the boundary conditions within each set
w r , cˆr x < cˆr g Ur) 4 with fixed G-orbit w r x; the permutation is the same for all G-orbits.
This behavior, to which we refer as boundary homogeneity, is similar to the so-called
fixed point homogeneity that is present in simple current extensions, and indeed the
arguments closely resemble the ones needed in the latter context w4x.
We start from the observation that the orbifold group G can be identified with the
dual G ) of the simple current group G s G ) , and consider some character C of G . For
every field r of the A-theory and every character cˆ g Ur) we then define a new
character C cˆ g Ur) by
C
cˆ Ž J . :s C Ž J . cˆ Ž J . Ž 7.1 .
for all J g Ur . This is indeed again a character of Ur . Moreover, manifestly the group
law of G is reproduced for different choices of C , so that our construction provides an
action of G on the group Ur) . Typically G does not act freely, but it does act
transitively.
The next step is to realize that this prescription supplies us with a well defined action
on the space of boundary conditions. This is not entirely trivial, because the labels for
the boundary conditions are not pairs of orbits, but rather are obtained by the equiva-
lence relation ŽA.9. of Ref. w1x, i.e. Ž l, cˆl . ; J Ž l, cˆl . ŽJwl, J cˆl ., which involves a
non-trivial manipulation of the characters. However, this complication does not do any
harm, because just as for the action of C g G ) it consists of a multiplication, so that
the two operations commute. We write
C
r :s r ,Ccˆ for r s r , cˆ . Ž 7.2 .
X
Similarly, when we extend by some smaller group G ; G , we can define the analogous
object C w r , cˆ X x X :s w r ,Ccˆ X x X .
Let us now explain in which sense the elements of G are to be regarded as
symmetries. Using the explicit expression Ž6.29. of Ref. w1x for the annulus coefficients,
we can establish the identity
X X
ˆ ˆ
ACC wwsr 1,,ccˆs1xxC w r 2 , cˆ 2 x s Awwsr ,1c, csˆ 1x x w r 2 , cˆ 2 x Ž 7.3 .
between annulus coefficients. Thus when we also act on the corresponding chiral labels
of the open string states in the annulus amplitude, we can absorb the transformation into
a relabelling of the summation variables, so as to conclude that the annulus partition
function is invariant. We expect that this extends indeed to a full-fledged symmetry at
the level of correlation functions, where again one has to act on the insertions on the
boundary.
As a consequence, the boundary conditions that correspond to one and the same
G-orbit w r x should in fact better be labeled by the elements of what may be called the
affinum over the character group Ur) rather than by Ur) itself. Here by the term
affinum over a group G we refer to the elementary structure of a set A G that carries a
free and transitive action of G.9 Conversely, the group G can be identified with the
quotient of A G = A G by the equivalence relation Ž p,q . ; Ž p h ,q h . for all h g G. For
every p g A G we are given an identification g l p g between the group G and its
affinum A G , but there does not exist any canonical identification. Roughly speaking, in
the structure of the affinum one ignores the special role played by the identity element;
thus an affinum and its group are related in much the same way as an affine space AV is
related to the corresponding vector space V, which can be identified with its group of
translations. Indeed, the group of automorphisms Žthat is, bijections intertwining the
action of G. of an affinum A G is precisely G. It follows that any two affina over G are
isomorphic; but the isomorphism is never canonical; it is always determined only up to
the isomorphism group G.
We should admit that, even though we avoided using the term, the concept of an
affinum is already implicit at several other places of our discussion of boundary
conditions. For instance, the quantities cl and cˆr that appear in the definition ŽA.6. of
the matrix S˜ are best regarded as elements of the affina of the respective character
groups, because w31x the matrices S J are only defined up to certain changes of basis in
the space of one-point blocks on the torus and because such a change amounts to a
relabelling of the characters.
We finally add a comment on how this symmetry property of the boundary conditions
looks like in the case of the three-state Potts model. In this case it exchanges
simultaneously two fixed and two mixed boundary conditions, while it leaves the third
fixed and mixed boundary conditions invariant. In the Potts model we actually have yet
another symmetry on the space of boundary conditions, the action of a Z 3 group. These
two symmetries combine to the symmetric group S3 .
9
Thus there exists a map A=G ™ A acting as Ž p, g . ¨ p g such that p g s p if and only if g s e,
p s Ž p g . h , and such that for each pair Ž p,q . g A= A there is a unique g g G with p g s q. When G is
gh
non-abelian, one must distinguish between left and right actions, and hence left and right affina.
subalgebras of A that contain A G.10 In our situation the orbifold group G is finite and
abelian, but the latter result continues to hold for all finite groups, and there is even a
version for compact Lie groups w9,33,34x.11
In this section we would like to address the issue of consecutive breakings of bulk
symmetries in more generality, which leads us in particular to introduce the notion of a
uniÕersal classifying algebra. While our detailed studies have so far been restricted to
cases where the preserved subalgebra A of the bulk symmetries satisfies A s A G for
some finite abelian group G, it is reasonable to expect that several features of our
analysis will persist for general A. In particular, it should again be possible to determine
a classifying algebra, provided that the following two pieces of information are
available:
We also expect that the statements about inclusions of classifying algebras valid for
the case of finite abelian orbifold groups generalize as follows. To every inclusion
A ® A of preserved bulk symmetry algebras there is associated a projection of the
corresponding classifying algebras; the classifying algebra for A is a suitable quotient
of the one for A. More generally, for every chain of inclusions
A ® AX ® A Ž 8.1 .
of symmetry algebras, there should exist a corresponding chain of projections
p pX
C Ž A . ∏ C Ž AX . ∏ C Ž A . Ž 8.2 .
for the associated classifying algebras. As a consequence, every irreducible representa-
tion of C Ž AX . gives rise to an irreducible representation of C Ž A .. This makes sense
indeed: an irreducible representation of C Ž AX . corresponds to a boundary condition that
preserves AX and thus, a fortiori, also preserves the smaller algebra A; it should
therefore correspond to some irreducible representation of C Ž A .. Relation Ž8.2. clearly
holds when both A s A G and AX s A H are orbifold subalgebras for abelian orbifold
groups with H : G. ŽMoreover, the projections are compatible with the distinguished
bases of the algebras, compare e.g. the arguments leading to formula Ž5.18. for the case
H s ² g :.. More generally, one can hope to obtain this way also quantitative informa-
tion on solvable orbifold groups.
This way the following picture emerges. The set M of all consistent subalgebras of a
given chiral algebra A that possess the same Virasoro element as A is partially ordered
10
A similar Galois correspondence has been established in the context of braided monoidal )-categories in
w32x.
11
Assuming that the statement holds for every group within a certain class X, it follows in particular that
whenever there exists at least one boundary condition that does not possess an automorphism type, then the
Virasoro algebra by itself cannot be the orbifold subalgebra of A with respect to any group that belongs to X.
by inclusion. It is reasonable to expect that M is even an inductive system; that is, given
any two consistent subalgebras A 1 and A 2 of A, one can find another consistent
subalgebra A 3 that is contained in their intersection,
A 3 : A1 l A 2 . Ž 8.3 .
Note that this implies in particular that we do not have to make the assumption that the
intersection of all consistent subalgebras of A contains a consistent subalgebra; rather,
one only needs to deal with intersections of finitely many subalgebras.
Assuming that also in general the inclusion A 1 ; A 2 implies that the classifying
algebra for A 2 is a quotient of the one for A 1 , one arrives at a projective system Ž A i .
of classifying algebras. Taking the projectiÕe limit over this system, we then arrive at a
uniÕersal classifying algebra
C ` :s lim A i . Ž 8.4 .
§
ŽThe projective limit of the closely related structure of a fusion ring has been studied in
w35x.. By construction, this algebra C ` governs all conformally invariant boundary
conditions. In other words, it is the classifying algebra C Ž V ir. for the case where the
preserved subalgebra just consists of the Virasoro algebra. The universal classifying
algebra can be found explicitly in simple models, e.g. for the free boson compactified on
a circle or for the Z 2-orbifold of these theories Žsee Subsection 11.1..
The construction of a surjective homomorphism p : C Ž A . ™ C Ž AX . that maps the
elements of the distinguished basis B of C Ž A . to elements of the distinguished basis
BX of C Ž AX . is in fact straightforward as far as the l-part of the labels Ž l, w . for B are
concerned. In contrast, concerning the w-part one faces complications which stem from
the absence of a simple relationship between UlX and Ul Žthis fact had also to be taken
into account in the manipulations that were necessary to establish integrality of the
annulus coefficients, see Subsection 6.4 of w1x.. As a matter of fact, for a complete
discussion of this issue even in the case of abelian orbifold subalgebras additional
simple current technology is required that goes beyond the results of w4x. In particular it
will be necessary to implement the powerful results that have recently been obtained in
w36x.
We also note that for consistency, along with the projection p there should come an
injection i from the set of boundary conditions that preserve AX to those that preserve
A, in such a way that the diagonalizing matrices for C Ž A . and for C Ž AX . are related as
In this section we focus our attention to the situation where the symmetries A s A G
that are preserved by the boundary conditions form a subalgebra that is fixed by an
involution √ . In other words, for such inÕolutary boundary conditions the orbifold
group G is just the Z 2 group consisting of √ and the identity. For the associated Z 2
orbifolds, a lot of information is available Žsee e.g. Refs. w5,37x. The vacuum sector of
the A-theory decomposes into subspaces as
HV ( HV [ HJ Ž 9.1 .
with J a simple current of order two, and the automorphism √ acts as
In the case at hand, the orbifold group G and the simple current group G s G ) are
both isomorphic to Z 2 . Thus in particular the exponentiated monodromy charge g takes
its value in Z 2 , so that there are just two automorphism types of boundary conditions.
We refer to those boundary conditions whose automorphism type is given by the identity
as eÕen boundary conditions while those with automorphism type √ will be called odd.
The Chan–Paton types for the even boundary conditions are labeled by the primary
fields l of the A-theory, while those for the odd boundary conditions are labeled by the
orbits of primary fields of the orbifold theory Žwhose chiral algebra is A s A Z 2 . with
monodromy charge Q J s 1r2. The simple current group is G s V ,J4 , so in particular
its orbits w lx either have length two Ži.e. have stabilizer Sl s V 4. or length one Ži.e. are
fixed points, with Sl s Z 2 .. Only fields l with vanishing monodromy charge can be
fixed points; therefore fixed points cannot give rise to odd boundary conditions.
The even boundary conditions preserve, of course, all bulk symmetries. The relevant
boundary blocks e Bl can therefore be expressed in terms of the boundary blocks of the
A-theory as
e
1
Bl s žB [ B / ,
l Jl Ž 9.3 .
< Sl <
and the Ward identities that come from a field Y in the chiral algebra A look like
e
Bl ( Ž Yn m 1 q z Y 1 m Yyn . s 0 Ž 9.4 .
with z Y s Žy1. DYy1 . As for any boundary conditions that preserve all of A, the
classifying algebra C Ž A . for the even boundary conditions is the fusion algebra of the
A-theory, with structure constants expressible via the Verlinde formula in terms of the
modular transformation matrix S of the A-theory.
As established in Subsection 4, in the case of odd boundary conditions, the Ward
identities Ž9.4. get replaced by those for twisted boundary blocks; here they read
o
Bl ( Ž Yn m 1 q z Y 1 m √ Ž Yyn . . s 0 . Ž 9.5 .
o
The odd boundary blocks Bl satisfying these constraints are ‘differences’
o
Bl s Bl [ Ž yB J l . Ž 9.6 .
of the boundary blocks of the A-theory. They are thus related to the ordinary boundary
blocks as in formula Ž3.20., i.e. we have
o
Bl se Bl ( Ž Q√ m id . , Ž 9.7 .
where the maps Q√ satisfy the √-twisted intertwining property Q√ Y s √ Ž Y . Q√
Žwhich together with their action on the highest weight vector determines them
uniquely..
Let us now display the total classifying algebra C Ž A . which governs even and odd
boundary conditions simultaneously. We already know what the labels for the basis of
C Ž A . and for its one-dimensional irreducible representations look like. Moreover, in the
formula ŽA.6. for the diagonalizing matrix S, ˜ in the Z 2 case only two different matrices
appear, namely the modular S-matrix S ' S V of the orbifold theory and the matrix
˘ S J for the simple current J. The length Nl of an orbit of J can be either one or two;
S:s
it will be convenient to use different symbols for labels for fixed points and those for
length-two orbits; we choose roman letters f, g, . . . for the former and greek letters
a , b , . . . from the beginning of the alphabet for the latter. Also, for simplicity we will
use one and the same symbol c to refer to a G-character and to its value on the
non-trivial element J g G , which can be either of "1. For the subsets of the whole set
I s l4 of primary labels of the A-theory that consist of the labels for full orbits and for
fixed points we write I( and If , respectively, i.e.
I( :s m g I < Nm s 2 4 , If :s m g I < Nm s 1 4 . Ž 9.8 .
We also introduce the corresponding subsets
I( :s a g I < a s a ,J a 4 g I( 4 , If :s f < Ž f , c . g If 4 Ž 9.9 .
of the label set I of the orbifold theory. With these notations the dimension of C Ž A .
reads dim C Ž A . s < I( < q 2 < If <. By the sum rule ŽA.7. this must equal the number of
boundary conditions, i.e. the number of Z 2-orbits of the A-theory, counted with their
stabilizer:
dim C Ž A . s 12 < I( < q 12 < I1r2 < q 2 < If < . Ž 9.10 .
This tells us that for every Z 2-orbifold the number < I( < of length-2 Q s 0 orbits
coincides with the total number < I1r2 < of Q s 1r2 orbits. On the other hand the number
< If < of fixed points, which necessarily have Q s 0, can be arbitrary. Also note that
dim C Ž A . ' < I < s < I( < q < If <, i.e. there are always at least as many even as there are odd
boundary conditions. The numbers of odd and even boundary conditions are equal
precisely in those cases where there are no fixed points, which happens precisely when
the associated automorphism of the fusion rules is the identity.
Further, the entries of the diagonalizing matrix S˜ read explicitly
˜ a , b ,g s 2 Na , b ,g ,
N ˜ a , b ,Ž f , c . s Na , b , f ,
N
˜ a ,Ž f , c .,Ž g , c X . s 12 Na , f , g q cc X N˘ f , a , g se Na ,Ž f , c .,Ž g , c X . ,
N ž /
˜ Ž f , c .,Ž g , c X .,Ž h , c XX . se NŽ f , c .,Ž g , c X .,Ž h , c XX .
N Ž 9.12 .
e
for the structure constants with three lower indices. Here N and N denote the fusion
coefficients of the A-theory and of the A-theory, respectively. Moreover, we have
˘
introduced twined fusion coefficients, defined with the help of the twined S-matrix S,
according to
S˘f , h Sr , h S˘g), h
N̆ f , r , g :s Ý . Ž 9.13 .
hgIf SV , h
Recall from Subsection 5.3 that these twined fusion coefficients are the traces of the
action of the outer automorphisms associated to J on bundles of chiral blocks.
Furthermore, from equation Ž9.12. we read off that the matrix C B s SS ˜˜t furnishes a
conjugation on the basis labels Žnot just an involution as in the generic case.. We have
˜ a , b g s Na , b g ,
N ˜ a , b Ž f , c . s Na , b f ,
N
X X
˜ a ,Ž f , c . g s 12 e Na ,Ž f , c . g ,
N ˜ a ,Ž f , c . Ž g , c . se Na ,Ž f , c .
N Ž g ,c .
,
XX XX
˜ Ž f , c .,Ž g , c X . g s 12 e NŽ f , c .,Ž g , c X . g ,
N ˜ Ž f , c .,Ž g , c X . Ž h , c . se NŽ f , c .,Ž g , c X . Ž h , c . .
N
Ž 9.15 .
o
Sa , m Sb , m
Na , b g s Ý žS ) )
g , m y SJg , m /. Ž 9.20 .
mgI SV , m
Here a priori the summation is over all sectors of the orbifold theory. But only the
twisted fields give a non-vanishing contribution, and in that case the two terms with g
and Jg are equal. Labeling these twisted fields by roman letters a,b, . . . and calling the
corresponding index set I1r2 , we thus obtain o Na , b g s 2Ý d g I1r 2 Sa , d Sb , d Sg), drSV , d .
Moreover, elements d on the same Z 2-orbit give identical results, hence we may rewrite
this formula as a sum over orbits d:s d,Jd4 . Denoting the set of these orbits by I1r2 , so
that
I1r2 :s a g I < a s a,Ja4 g I1r2 ,
½ 5 Ž 9.21 .
we arrive at
o
Sa , d Sb , d Sg), d
Na , b g s 4 Ý . Ž 9.22 .
dgI1r2 SV , d
We can interpret this result as stating that the structure constants o N are governed by the
matrix o S with entries
o
Sa , b :s 2 Sa , b with a g I( , b g I1r2 ; Ž 9.23 .
this is the diagonalizing matrix for o C Ž A .. Also note that owing to o SV , b s 2 SV , b ) 0
for all b g I1r2 , this matrix shares the positivity property of modular S-matrices Žthis
does not already follow from the commutativity and semisimplicity of o C Ž A ...
Moreover, by combining unitarity of S with the simple current symmetry SJm, n s
Žy1. 2 Q J Ž n.Sm, n one obtains
Ž 9.24 .
with the help of which one can show that the matrix Ž9.23. is unitary, which in turn tells
us once again that o S is a square matrix.
The structure constants o Na , b g as defined by Ž9.18. do depend on the choice of
representatives that has been made. Indeed, upon replacing g by J g , o Na , b g goes to
minus itself. But this does not change the classifying algebra o C Ž A ., since the choice of
a sign constitutes a one-cocycle on o C Ž A ., which in turn can be absorbed by choosing a
correlated sign for the boundary blocks. More concretely, for the full Z 2-orbits we have
the isomorphism Ha ( Ha [ HJ a ( H√ w a of A-modules, so that the associated even
boundary blocks are precisely e Ba s Ba [ B J a , as given by formula Ž9.3., where
B m : Hm m Hm q™ C are the orbifold boundary blocks. In contrast, there are two
possible choices of odd boundary blocks, namely o Ba s Ba [ ŽyB J a . as in Ž9.6. as well
as
o
B√ w a s B J a [ Ž yBa . s yo Ba . Ž 9.25 .
o o
By construction both of these linear forms Ba and B√ w a on Ha m Haq satisfy the
appropriate Ward identities, but of course we must keep just one out of the two. The
right prescription is to keep o Bb when the label b g I( is the chosen representative for
the orbit b g I( . ŽAlso, positivity of mixed annulus amplitudes is guaranteed only with
this choice, see below.. In short, the label of the orbifold boundary state that appears
with a positive sign in the boundary block Ž9.6. is the one that is to be chosen as the
representative of the orbit.
9.4. Annulus amplitudes
Using our general results from Section 6 of w1x it is also straightforward to calculate
all annulus amplitudes. For the case of two even boundary conditions we obtain
a
Aa b s Ý žN b ,m q Nb ,Jm a xm Ž i tr2 . ,
/
mgI(jIf
A a Ž f ,c . s Ý N f , m a xm Ž i tr2 . ,
mgI(jIf
1
A Ž f ,c . Ž g ,c X . s Ý žN g ,m
f ˘ g ,m
q cc X N f
/ x Ž i tr2. .
m Ž 9.26 .
2 mgI(jIf
Similarly, for two odd boundary conditions the annulus amplitudes are
1
A a bŽ t. s Ý N N a q Nb ,Jm a xm Ž i tr2. .
2 mgI jI m b , m ž / Ž 9.27 .
( f
Finally, for mixed annuli, i.e. annuli with one even and one odd boundary, we find
1
A a mŽ t . s Ý Nm Nm ,c a q Nm ,Jc a xc Ž i tr2 . .
ž / Ž 9.28 .
2 cgI1r2
for all possible values of the r i , and that they are ‘complete’ in the sense that
A r1 A r 2 s Ý Mr1 , r 2 r3
A r3 , Ž 9.31 .
r3
where the A’s are regarded as matrices in their two lower indices. The non-vanishing
coefficients M are
Ml , m n se Nl , m n ,
M a , b n s 12 Nn N a , b n q N a ,Jb n ,
ž /
2
M a , m c s 14 Ž Nm . žN a,m
c
q NJa , m c .
/ Ž 9.32 .
Note that the matrix M V is the identity matrix.
12
As already pointed out in Subsection 6.5 of w1x for the case of general orbifold group G, a rigorous
derivation of these relations is, however, not yet available.
10. Examples for involutory boundary conditions
which is nothing but the Žone-by-one. matrix S J. In contrast, for the diagonalizing
matrix o S of o C Ž A . there will typically not exist any preferred ordering of the labels
a g I( Žfor the rows of o S . and a g I1r2 Žfor the columns.; amusingly, in the present
case it is possible to order rows and columns in such a fashion that o S is symmetric. We
have
8 Ž a q 1. Ž b q 1. p
o
Sa , b s 2 Sa , b s ( kq2
sin
ž kq2 / . Ž 10.7 .
Now ordering again I( according to the value of the weight, i.e. as a s 0,2,4, . . . ,kr2
y 2, symmetry is achieved when I1r2 is ordered by taking first the weights of the form
4 j q 1 in ascending order and afterwards the weights of the form 4 j q 3 in descending
order, i.e. a s 1,5,9,13, . . . ,15,11,7,3. Doing so, the matrix Ž10.7. simply becomes the
matrix with entries
& 8 Ž 2 p y 1. Ž 2 q y 1. p
o
Sp , q s ( kq2
sin
ž kr2 q 1 / , Ž 10.8 .
where the integers p and q run from 1 to kr4. In particular, inspection shows that
o
C Ž A . coincides with the fusion algebra w40x of the Ž 2k q 1,2. Virasoro minimal models.
Via the simple current symmetries of the matrices S and S J, the diagonalizing matrix
˜
S of the classifying algebra is already completely determined by eS and o S. Concretely,
when we choose the ordering of the rows of S˜ as
Ž l , w . s 0 , 2 , 4 , . . . , 2k y 2 , Ž 2k ,q . , Ž 2k ,y . , k , k y 2 , . . . , 2k q 2 Ž 10.9 .
and the ordering of the columns according to
Ž r , c . s 0 , 2 , 4 , . . . , 2k y 2 , Ž 2k ,q . , Ž 2k ,y . , 1 , 3 , . . . , 2k y 1 , Ž 10.10 .
then S˜ is block diagonal of the form
e eo
S S
S˜s ž oe o /, Ž 10.11 .
S S
and the off-diagonal blocks are related to the diagonal ones by
eo
S < full s yo S , eo S < fixed s 0 , oe S seS < full , Ž 10.12 .
< <
where the symbols full and fixed stand for restriction of the rows to those corresponding
to full orbits and to fixed points, respectively.
In the s l Ž2. case under consideration, it is not too difficult to establish that – just
like the algebra o C Ž A . – also the total classifying algebra C Ž A . constitutes a structure
that has been encountered in conformal field theory before. Indeed, we will construct an
isomorphism between C Ž A . and an algebra that appears in the work of Pasquier et al.
To simplify some of the formulæ below, let us write k s 4 l with l g Z. Then a
2 l q 2-dimensional associative algebra with structure constants
for r, s,t g 1,2, . . . ,2 l q 24 has been considered in w41–43x and been called the
Pasquier algebra associated to the situation of our interest. In formula Ž10.13., Sˆ is the
matrix with entries
° ž'1
2
1
qI
/ for r s 2 l q 1 , s s l q 1 or r s s s 2 l q 2 ,
2 lq1
1 1
2 ž' 2 lq1
yI
/ for r s 2 l q 1 , s s 2 l q 2 or r s 2 l q 2 , s s l q 1 ,
~' l
Ŝr , s :s
1
2 q1
Ž y1 . Ž ry 1 . r2 Ž 1 y Ž y1 . .
r
for r s 1 ,2 , . . . ,2 l , s s l q 1 ,2 l q 2 , Ž 10.14 .
1
for r s 2 l q 1 ,2 l q 2 , s / l q 1 ,2 l q 2 ,
'4 l q 2
1 Ž2 l y r q 1 . Ž 2 s y 1 . p
¢' 4 lq2
P 2 cos
ž 4 lq2 / otherwise ,
with I as in Ž10.6.. Note that the matrix Nˆ2 with entries Ž Nˆ2 .st s N
ˆ 2, s t is nothing but the
incidence matrix of the graph D 2 lq2 . Conversely, up to a phase the columns of Sˆ are
uniquely determined by the two requirements that Sˆ is unitary and diagonalizes Nˆ2 –
with the exception, however, of the columns numbered l q 1 and 2 l q 2, which both
are eigenvectors to the same eigenvalue zero. For the latter two columns, in formula
Ž10.14. Žunlike in Table 2 of Ref. w41x. we have chosen specific linear combinations that
are singled out by the property that all structure constants N ˆ r, s t are non-negative
integers. We also remark that the matrix Ž10.14. is unitary, but it is not symmetric, nor
can it be made symmetric by any re-ordering of rows and columns,13 and finally that
rq 1
Sˆr ,2 lq2 ys s Ž y1 . Sˆr , s for r / 2 l q 1,2 l q 2 , s / l q 1,2 l q 2 . Ž 10.15 .
By inspecting formulæ Ž10.14. and Ž10.11., we observe the relation
´l
SˆrŽw r , c x.,sŽŽ l , w .. s S˜Ž l , w .,w r , c x Ž 10.16 .
(N l
between the matrices Sˆ and S. ˜ Here Nl is the length of the G-orbit through l Ži.e.
Nl s 2 except for N2 l s 1., ´ is the sign factor
1 for l s 2 l ,
´l :s ½ Ž y1.
lr2
otherwise ,
Ž 10.17 .
13
However, as follows from the identifications below, when combining suitable re-orderings with rescalings,
certain submatrices of Sˆ become symmetric.
and we introduced bijections r and s between the respective index sets of Sˆ and S˜ which
act as
rq1° for r / 2 l ,
~
r Ž w r , c x . :s 2 l q 1 for Ž r , c . s Ž 2 l ,q . , Ž 10.18 .
¢
2 lq2 for Ž r , c . s Ž 2 l ,y .
and
°lr2 q 1 for l / 2 l ,
~
s Ž Ž l , w . . :s l q 1 for Ž l , w . s Ž 2 l ,q . , Ž 10.19 .
¢2 l q 2 for Ž l , w . s Ž 2 l ,y . ,
respectively. In particular, the diagonalizing matrices Ž10.5. and Ž10.7. for the even and
odd boundary conditions obey
mrNm
e
(
Sm , n s Nm Ž y1 . SˆrŽ n .,sŽ m . Ž 10.20 .
and
ar2
o
Sa , b s '2 Ž y1 . Sˆbq1 ,sŽ a . , Ž 10.21 .
respectively.
Using the fact that S is symmetric, the result Ž10.16. tells us that up to normalizations
of the rows of S, ˜ transposition, and reordering of the rows and columns, the two
matrices S˜ and Sˆ coincide. As all these manipulations can be absorbed into a basis
transformation, it follows that the two algebras that via these matrices are associated to
the simple current extension of s l Ž2. l , i.e. the classifying algebra C Ž A . and Pasquier’s
algebra are isomorphic.
For concreteness, let us also display explicitly these matrices in the simplest case, i.e.
for l s 1:
°1 '2 1 '2 ¶
'3 0 y'3 0
1 1 1
Sˆs 1 ' '
'6 '2 Ž 1 q i 3 . 1
'2 Ž 1 y i 3 . ,
1 1
¢1 '
'2 Ž 1 y i 3 . 1
'2 Ž 1 q i 3 .ß
'
°1 1 1 y'3 ¶
1 1
1
1
2
Ž 1 q i'3 . 2
Ž 1 y i'3 . 0
S˜s . Ž 10.22 .
'3 1 1
1
2
Ž 1 y i'3 . 2
Ž 1 q i'3 . 0
¢1 1 1 '3 ß
10.4. Virasoro minimal models
The unitary minimal models of the Virasoro algebra are labeled by m s 2,3, . . . ; they
have conformal central charge c s c m :s 1 y 6rŽ m q 1.Ž m q 2.. Via their realization as
a coset theory s l Ž2. my 1 [ s l Ž2.1rs l Ž2. m , the s l Ž2. WZW situation of the previous
subsection gives rise to similar effects in these minimal models. The requirement that
the level must be divisible by four translates to the condition m g 4Z j Ž4Z q 1. on the
label m. In these cases the chiral algebra of the A-theory is obtained from A, which is
just the Virasoro algebra, via extension by the field J with label lJ s Ž m,1. ŽKac table
notation., which has conformal weight D J s mŽ m y 1.r4.
In these cases we even know that the Virasoro algebra A is the only consistent
subalgebra of A, simply because no other unitary conformal field theories exist at the
same value of c. Thus our methods supply us14 with all conformally invariant boundary
conditions of the A-theory. In particular there are precisely two automorphism types, the
even boundary conditions which preserve the full bulk symmetry A, and the odd
boundary conditions which preserve only the Virasoro subalgebra.
The primary fields of the A-theory with central charge c m are labeled by l ' Ž r,r X .
with
1(r(m , 1 ( rX ( m q 1 , Ž 10.23 .
X X
modulo the identification Ž r,r . ; Ž m q 1 y r,m q 2 y r ., so that there is a total of
mŽ m q 1.r2 sectors. We first look at the cases with m s 4 l for some l g Z ) 0 . Then
the A- and A-theory are commonly w39x denoted by Ž A 4 l , A 4 lq1 . and Ž A 4 l , D 2 lq2 .,
respectively. ŽIn the simplest of these, obtained for l s 1, the A-theory is the tetracriti-
cal Ising model Ž A 4 , A 5 . while the A-theory is the three-state Potts model Ž A 4 , D4 ... We
have
I( s Ž r ,r X . < r s 1,3,5, . . . ,4 l y 1, r X s 1,3,5, . . . ,2 l y 1,2 l q 3, . . . ,4 l q 1 4 ,
If s Ž r ,2 l q 1 . < r s 1,3,5, . . . ,4 l y 1 4 ,
I1r2 s Ž r ,r X . < r s 2,4,6, . . . ,4 l , r X s 2,4,6, . . . ,4 l 4 , Ž 10.24 .
and hence < I( < s < I1r2 < s 4 l 2 , < If < s 2 l . Thus we obtain 2 l Ž l q 2. even and 2 l 2 odd
boundary conditions, and hence a total of 4 l Ž l q 1. conformally invariant boundary
conditions.
The similar series with m s 4 l q 1 for l g Z ) 0 can be treated analogously. The A-
and A-theory are now known under the names Ž A 4 lq1 , A 4 lq2 . and Ž D 2 lq2 , A 4 lq2 .,
respectively. We have
I( s Ž r ,r X . < r s 1,3,5, . . . ,2 l y 1,2 l q 3, . . . ,4 l q 1 , r X s 1,3,5, . . . ,4 l q 1 ,
If s Ž 2 l q 1,r X . < r X s 1,3,5, . . . ,4 l q 1 ,
I1r2 s Ž r ,r X . < r s 2,4,6, . . . ,4 l , r X s 2,4,6, . . . ,4 l q 2 , Ž 10.25 .
so that < I( < s < I1r2 < s 2 l Ž2 l q 1., < If < s 2 l q 1, and the number of even and odd
boundary conditions is Ž2 l q 1.Ž l q 2. and l Ž2 l q 1., respectively.
14
At least modulo what goes w44x under the name of ‘complex charges’.
Via the coset construction, it is possible to express all the ingredients in the formula
for S˜ through quantities of the underlying s l Ž2. WZW models, so that again the
classifying algebra C Ž A . can easily be obtained explicitly. We refrain from displaying
any details, which are not too illuminating. We would like to mention, however, that
these results are in perfect agreement with the findings of w45,46x. In the latter papers,
various statements were encoded in the language of graphs; the following remarks allow
to make contact to that point of view.
The total number of conformally invariant boundary conditions is
Ž 10.26 .
Regarding the graphs A 4 l and D 2 lq2 Ži.e. the Dynkin diagrams of the respective simple
Lie algebras. as bi-colored, starting with Žsay. a black node, this can be understood as
follows. The even boundary conditions are in one-to-one correspondence with pairs of
black nodes from the ‘product’ of the two graphs, while odd boundary conditions are in
one-to-one correspondence with pairs of white nodes. Mixed pairs of nodes do not
correspond to boundary conditions, which accounts for the factor of 1r2 in Ž10.26.. The
latter selection rule may be implemented by a suitable folding of the A 4 l graph. The
resulting graph has a loop, hence in particular it is no longer bi-colorable; pairs of nodes
from D 2 lq2 and the folded graph are then in one-to-one correspondence with all
conformally invariant boundary conditions, including both even and odd ones.
Further, let us denote by EŽ r˜ . the r˜ th exponent of the Lie algebra D 2 lq2 . For every
r˜ s 1,2, . . . ,2 l q 2, the integer EŽ r˜ . lies in the label set of the A 4 lq1 graph; indeed, the
exponents correspond precisely to the black nodes of A 4 lq1 , with the middle node
appearing twice. We can therefore define for every s s 1,2, . . . ,4 l q 1 a matrix Vs
through
where S Ž A. and S Ž D. are the unitary diagonalizing matrices for the graphs A 4 lq1 and
D 2 lq2 . Thus S Ž A. is nothing but the modular S-matrix of the s l Ž2.4 l WZW model.15
By direct calculation one checks that the matrices Ž10.27. furnish a representation of the
fusion ring of s l Ž2.4 l . Further, it can be shown that V2 coincides with the incidence
15
On the other hand, the matrix S Ž D . defined this way is by far not unique; in particular, the incidence
matrix of D 2 lq2 has an eigenvalue with multiplicity 2. For a generic choice of diagonalizing matrix the
numbers N˜ r,˜ s˜ t̃ Ž4.6. will not be integral. But w45x there is a unique choice such that these numbers are integral.
matrix of D 2 lq2 . By the explicit form of the s l Ž2.4 l fusion rules, this implies that one
may equivalently define the matrices Vs inductively via V1:s 1, V2 :s incidence matrix
of D 2 lq2 and
Vs :s V2 Vsy1 y Vsy2 for s s 3,4, . . . ,4 l q 1 . Ž 10.28 .
One also has the matrix equation
s̃
˜ r˜ s Ý Ž Vq . r̃ N˜ s˜
Vq N Ž 10.29 .
s̃
˜ s̃ are the ‘graph fusion
for all q s 1,2, . . . ,4 l q 1 and all r˜ s 1,2, . . . ,2 l q 2, where N
Ž D.
matrices’ associated to S , i.e. the matrices with entries
Ñr˜ , s˜ t̃ :s Ý SrŽ˜ ,u
D. Ž D. Ž D.)
˜ S s,u
Ž D.
˜ ˜ S t̃ ,ũ rS1,u˜ Ž 10.30 .
ũ
˜ ˜s 1,2, . . . ,2 l q 2.
˜ s,t
for r,
In w45,46x also the E-type series of modular invariants for minimal models were
discussed in the framework of graphs, and cyclic groups larger than Z 2 have been
addressed in w47x. It will certainly be interesting to compare the results obtained there
with the ones that can be derived by the methods of the present paper, and in particular
to study how the graph oriented approach deals with the case of non-cyclic abelian
groups, where non-trivial two-cocycles appear.
In this section we present a few further examples, in which the orbifold group G is
larger than Z 2 . We first discuss two examples of direct relevance to string theory.
Afterwards we turn to some specific examples in which the effects of non-trivial
two-cocycles can be analyzed in detail.
Let us study a situation of immediate interest in which the orbifold group is cyclic
and hence has trivial second cohomology, so that the untwisted and full stabilizers
coincide. We start with the c s 1 theory of a free boson, compactified on a circle of
radius R s '2 N with N g Z ) 0 . The chiral algebra A of this theory consists of
operators of the form Ž10.1.. Inspection shows that for every m g Z ) 0 there is a
subalgebra A Ž m. of A which is obtained by restricting the value of n in Ž10.1. to be a
multiple of m. The algebra A Ž m. is nothing other than the chiral algebra of another free
boson theory, with the free boson X compactified on a circle of radius m'2 N .
Therefore it is a consistent subalgebra in the sense that it allows for the construction of
chiral blocks which obey factorization rules and have a Knizhnik–Zamolodchikov
connection. The orbifold group is G s Z m ; its generator acts on an operator of the form
Ž10.1. by multiplication with the phase expŽ2p i nrm.; in terms of the Fubini–Veneziano
field X this means
X ¨ X q 2pr Ž m'2 N . . Ž 11.1 .
This leaves the u Ž1. current j invariant and multiplies expŽi n'2 N X . by the phase
expŽ2p i nrm.. ŽTogether with the automorphism X ¨ yX Ž10.2., the transformation
Ž11.1. generates the dihedral group Dm ..
In this example no fixed points are present, so that it is most straightforward to write
down the classifying algebra. The A Ž m.-theory has 2 m2N primary fields, which may be
labeled by integers l mod 2 m 2N ; their u Ž1. charge is ql s lrm'2 N . The twist sector
is determined by the value of l mod m; in particular, those l s ml which are multiples
of m are in the untwisted sector and label a basis of the classifying algebra C Ž A Ž m. ..
The multiplication in C Ž A Ž m. . is just given by the restriction of the fusion product of the
A Ž m.-theory to the untwisted sector, i.e.
XX
˜ l ,lX l s d lqlXqlXX mod 2 m N .
N Ž 11.2 .
Ž m. .
Thus C Ž A is the group algebra of the cyclic group Z m N . The reflection coefficients
R ma ; V must therefore obey the relation R am l 1 ; V R am l 2 ; V s R amŽ l 1ql 2 .; V , where the addition is
modulo 2 m N . The solutions to this requirement are
R am l ; V s exp Ž 2p i mlar2 m2N . s exp Ž p i larm N . Ž 11.3 .
with a g Z. Moreover, a must be taken only modulo 2 m N , and hence the possible
values of a are in one-to-one correspondence with the Z m -orbits of the A Ž m.-theory, in
accordance with the general theory.
This result allows for the following geometric interpretation. After performing a
suitable T-duality transformation, we can assume that we are dealing with a Dirichlet
boundary condition, so that we can characterize the boundary state by the position a of a
point-like defect on the circle. Breaking the bulk symmetry to the subalgebra A Ž m.
correspond to defects located at 2 m N th roots of unity on the unit circle. In general, we
expect that boundary conditions breaking more chiral symmetries of the bulk correspond
to more generic locations in the space of boundary conditions.
In the case at hand we can also study explicitly the projective limit of classifying
algebras that was used in the definition ofX the universal classifying algebra in Section 8.
Manifestly, when mX divides m, then A Ž m . is a subalgebra of A Ž m.. In more fancy terms
this can be expressed as follows. Divisibility introduces a partial ordering on the set I
non-negative integers; the subalgebras A Ž m. form an inductive system over I. The
inductive limit consists just of the subalgebra A Ž`. of uncharged elements of A.
Moreover, there is a natural projection relating the classifying algebras:
X
C Ž A Ž m. . ( CZ m N ™ CZ mX N ( C Ž A Ž m . . . Ž 11.4 .
There then exists a projective limit
Ẑ N :s lim Z m N Ž 11.5 .
§
This decomposition does not constitute a grading of A over the additive group Z 0 0 ,
because the fusion structure within the chiral algebra reads w q1 x w w q2 x s w q1 q
q2 x q w < q1 y q2 <x. Still these fusions imply that for every integer l 0 2 the subspace
18
In coset theories there can, however, exist additional simple currents which arise from resolving fixed
points with integral quantum dimension.
action of the three polyhedral groups on the free boson Žcompactified at the self-dual
radius. that gives rise to the exceptional c s 1 theories.
In the case at hand, the relevant automorphisms of the chiral algebra A can be
understood in terms of the finite-dimensional Lie groups G and GX associated to g and
gX . Namely, for every boundary block Bl: Hl m Hlq that preserves the full bulk
symmetry and every element g g G the combination BlŽ g . :s Bl (Žg m id. provides us
with a twisted boundary block. The corresponding automorphism on the affine Lie
algebra g is the inner automorphism that acts on the modes Jna of g as Jna ¨ Žg J agy1 . n .
This automorphism preserves the symmetries in the subalgebra gX if and only if g is in
the centralizer CG Ž GX . of GX in G, and it acts trivially if and only if g is even in the
center ZŽ G . of G. Thus the non-trivial twists are those by elements in the group
CG Ž GX .rZŽ G ..
In the case of our interest, the relevant embedding on the level of Lie groups reads
GX :s Ž Spin Ž d1 . = Spin Ž d 2 . = Spin Ž d 3 . . r Ž Z 2 = Z 2 .
® Spin Ž d1 q d 2 q d 3 . s :G ; Ž 11.16 .
X X
note that the subgroup G is not simply connected. For determining CG Ž G .rZŽ G . it is
instructive to consider first the embedding
G˜ X :s SO Ž d1 . = SO Ž d 2 . = SO Ž d 3 . ® SO Ž d1 q d 2 q d 3 . s :G˜ Ž 11.17 .
that is obtained from Ž11.16. by dividing out the center of SpinŽ d1 q d 2 q d 3 .. Then the
center of the G˜ is trivial; moreover, using the matrix realization of these groups, one
shows that the centralizer CG̃ Ž G˜ X . is a Z 2 = Z 2 group consisting of the unit matrix
1 d1qd 2qd 3 and the diagonal matrices
1 d1 0 0 y1 d1 0 0
M˜ 23 :s
0
0
y1 d1
1 d2
0
0
0
1 d3 0 ,
0
M˜ 13 :s
0
0
1 d2
0
0
y1 d 3 0 ,
M˜ 12 :s
0
0
y1 d 2
0 1 d3
0
0 .
Already at this stage we can conclude that the centralizer CG X Ž G . is an extension of this
Ž 11.18 .
It can be checked that these matrices commute with all elements in GX . They form a
group of order 8; the structure of this group depends on the values of d1 ,d 2 and d 3 .
First, when all d i leave the same rest modulo 4Z, then the group is isomorphic to the
eight-element generalized quaternion group; this group has one two-dimensional and
four one-dimensional irreducible representations. Otherwise, i.e. when only two of the
d i leave the same rest modulo 4Z, the group is isomorphic to the dihedral group D4 of
8 elements. This group has four one-dimensional and one two-dimensional irreducible
representation, too. In both cases the centralizer is non-abelian.
Let us present some more details. The boundary blocks are labeled by the fields of
monodromy charge zero and characters of their Žfull. stabilizers; thus there are 8
boundary blocks coming from full orbits; we label them lexicographically,
B 1 :s B ooo , B 2 :s B oov , B 3 :s B ovo , B 4 :s B ovv ,
B 5 :s B voo , B 6 :s B vov , B 7 :s B vvo , B 8 :s B vvv . Ž 11.20 .
Note that the blocks numbered as 1,4,6,7 come from the vacuum of the A-theory, while
the others come from the field that carries the vector representation of soŽ d1 q d 2 q d 3 ..
In addition we have 4 boundary blocks coming from the fixed point Ž ß, ß, ß .. They
correspond to the four irreducible characters c of Z 2 = Z 2 ; we label them as
B 9 :s Bqqqq , B 10 :s Bqqyy , B11 :s Bqyqy , B 12 :s Bqyyq ,
Ž 11.21 .
where the " labels indicate the values "1 of c on the four elements of Z 2 = Z 2 , in the
lexicographic order chosen in formula Ž11.15..
The boundary conditions are labeled by the orbits and characters of their untwisted
stabilizers. Thus in addition to the three boundary conditions that preserve all of A,
there are three conditions from the length-4 orbits and two conditions for each of the
three length-2 orbits which have stabilizer Z 2 . We label them according to
B1 ˆ
s Ž ooo. , Ž ovv. , Ž vov. , Ž vvo. 4 , B7 ˆ
s Ž o ßß . , Ž vßß . 4 with c s1 ,
B2 ˆ
s Ž oov. , Ž ovo. , Ž voo. , Ž vvv. 4 , B8 ˆ
s Ž o ßß . , Ž vßß . 4 with c sy1 ,
B3 ˆ
s Ž ßßß . 4 , B9 ˆ
s Ž ßo ß . , Ž ß vß . 4 with c s1 ,
B4 ˆ
s Ž ßoo. , Ž ßov . , Ž ß vo . , Ž ß vv. 4 , B10 ˆ
s Ž ßo ß . , Ž ß vß . 4 with c sy1 ,
B5 ˆ
s Ž o ßo . , Ž o ß v . , Ž vßo . , Ž vß v . 4 , B11 ˆ
s Ž ßßo . , Ž ßß v . 4 with c s1 ,
B6 ˆ
s Ž oo ß . , Ž ovß . , Ž vo ß . , Ž vvß . 4 , B12 ˆ
s Ž ßßo . , Ž ßß v . 4 with c sy1 .
Ž 11.22 .
With this numbering, the diagonalizing matrix S˜ looks as follows:
° 1 1 '2 '2 '2 '2 1 1 1 1 1 1 ¶
1 1 '
- 2 '2 '2 '
- 2 y1 y1 y1 y1 1 1
1 1 -'2 '2 -'2 '2 y1 y1 1 1 y1 y1
1 1 '2 '2 -'2 -'2 1 1 y1 y1 y1 y1
1 1 -'2 -'2 '2 '2 1 1 y1 y1 y1 y1
Acknowledgements
We would like to thank Valya Petkova and Patrick Dorey for discussions, and Peter
Bantay and Bert Schellekens for helpful correspondence.
Appendix A. Collection of formulae
Here we collect a few basic formulae from w1x that are used in this paper. The
equation numbers are the same as in w1x.
v The boundary blocks B ˜ Ž l, w . are the linear forms
bc s b( ( Ž Oc m id . Ž A.2 .
where Rĉ denotes the irreducible representation of the twisted group algebra that is
labeled by cˆ $ c . The Oc with c % cˆ Žthat is, c < Ul s cˆ . form a partition of unity:
Ý Oc s dl1r2 1 dl . Ž A.4 .
cg Sl)
c% cˆ
The operator product expansion that describes the excitation on the boundary caused by
a bulk field approaching it reads
y2 Dlq D
fŽ l , c .,Ž lq , cq . Ž re i s . s Ý Ž 1 y r 2 . m R aŽ l , c .; m Cma a Ž e i s .
m
<G<
S˜Ž l , cl.,w r , cˆr x :s cl Ž J . cˆr Ž J . SlJ , r ,
)
1r2 Ý Ž A.6 .
SlUl Sr Ur Jg SllUr
where the matrices S J represent the modular S-transformation on the one-point chiral
blocks with insertion J on the torus. The result that S˜ is a square matrix is equivalent to
the sum rule
Ý < Sl < s Ý < Ur < . Ž A.7 .
l w rx
Q G Ž l .s0
References
w1x J. Fuchs, C. Schweigert, Symmetry breaking boundaries I. General theory, hep-thr9902132, Nucl. Phys.
B, in press.
w2x D.C. Lewellen, Nucl. Phys. B 372 Ž1992. 654.
w3x G. Pradisi, A. Sagnotti, Ya.S. Stanev, Phys. Lett. B 381 Ž1996. 97.
w4x J. Fuchs, A.N. Schellekens, C. Schweigert, Nucl. Phys. B 473 Ž1996. 323.
w5x L. Birke, J. Fuchs, C. Schweigert, Symmetry breaking boundary conditions and WZW orbifolds, Adv.
Theor. Math. Phys. 3 Ž1999. No. 3.
w6x A.N. Schellekens, S. Yankielowicz, Int. J. Mod. Phys. A 5 Ž1990. 2903.
w7x C. Dong, H. Li, G. Mason, Int. Math. Res. Notices 18 Ž1996. 913.
w8x C. Dong, G. Mason, Duke Math. J. 86 Ž1997. 305.
w9x C. Dong, G. Mason, On the operator content of nilpotent orbifold models, hep-thr9412109.
w10x G. Moore, N. Seiberg, Phys. Lett. B 220 Ž1989. 422.
w11x A.N. Schellekens, S. Yankielowicz, Nucl. Phys. B 327 Ž1989. 673.
w12x J. Fuchs, C. Schweigert, Nucl. Phys. B 530 Ž1998. 99.
w13x A. Recknagel, V. Schomerus, Nucl. Phys. B 531 Ž1998. 185.
w14x C. Dong, J. Lepowsky, Generalized Vertex Algebras and Relative Vertex Operators ŽBirkhauser,
¨ Boston,
1993..
w15x C. Dong, H. Li, G. Mason, Commun. Math. Phys. 180 Ž1996. 671.
w16x M.B. Green, M. Gutperle, Nucl. Phys. B 460 Ž1996. 77.
w17x J.L. Cardy, Nucl. Phys. B 324 Ž1989. 581.
w18x K. Ueno, in: Physics and Geometry, ed. J.E. Andersen, H. Pedersen, A. Swann ŽMarcel Dekker, New
York, 1997. p. 603.
w19x A. Beauville, in: Hirzebruch 65 Conference on Algebraic Geometry wIsrael Math. Conf. Proc. 9x, ed. M.
Teicher ŽBar-Ilan University, Ramat Gan, 1996. p. 75.
w20x D. Gaitsgory, in: Quantum Fields and Strings: A Course for Mathematicians, ed. P. Deligne, P. Etingof,
D.S. Freed, L. Jeffrey, D. Kazhdan, J. Morgan, D.R. Morrison, E. Witten ŽAmerican Mathematical
Society, Providence, 1999..
w21x I.B. Frenkel, J. Lepowsky, A. Meurman, Vertex Operator Algebras and the Monster ŽAcademic Press,
New York, 1988..
w22x I.B. Frenkel, Y. Huang, J. Lepowsky, Memoirs Amer. Math. Soc. 104 Ž1993. 1.
w23x V.G. Kac, Vertex Algebras for Beginners ŽAmerican Mathematical Society, Providence, 1996..
w24x C. Dong, H. Li, G. Mason, Modular-invarianceof trace functions in orbifold theory, q-algr9703016.
w25x B. Gato-Rivera, A.N. Schellekens, Nucl. Phys. B 353 Ž1991. 519.
w26x I. Brunner, R. Entin, C. Romelsberger,
¨ J. High Energy Phys. 9906 Ž1999. 16.
w27x G. Pradisi, A. Sagnotti, Ya.S. Stanev, Phys. Lett. B 356 Ž1995. 230.
w28x J. Fuchs, C. Schweigert, Phys. Lett. B 414 Ž1997. 251.
w29x I. Affleck, M. Oshikawa, H. Saleur, Boundary critical phenomena in the three-state Potts model, preprint
cond-matr9804117.
w30x K. Druhl,
¨ H. Wagner, Ann. Phys. 141 Ž1982. 225.
w31x P. Bantay, Int. J. Mod. Phys. A 13 Ž1998. 199.
w32x M. Muger,
¨ Galois theory for braided tensor categories and the modular closure, math. CTr9812040.
w33x C. Dong, G. Mason, J. Algebra 214 Ž1998. 92.
w34x A. Hanaki, M. Miyamoto, D. Tambara, Duke Math. J. 97 Ž1999. 541.
w35x J. Fuchs, C. Schweigert, Commun. Math. Phys. 185 Ž1997. 641.
w36x A.N. Schellekens, Nucl. Phys. B 558 Ž1999. 484.
w37x R. Dijkgraaf, C. Vafa, E. Verlinde, H. Verlinde, Commun. Math. Phys. 123 Ž1989. 485.
w38x J. Fuchs, C. Schweigert, The action of outer automorphisms on bundles of chiral blocks, hep-thr9805026,
Commun. Math. Phys., in press.
w39x A. Cappelli, C. Itzykson, J.-B. Zuber, Nucl. Phys. B 280 Ž1987. 445.
w40x W. Wang, Duke Math. J. ŽInt. Math. Res. Notes. 71 Ž1993. 197.
w41x V. Pasquier, Nucl. Phys. B 285 Ž1987. 162.
w42x P. di Francesco, J.-B. Zuber, Nucl. Phys. B 338 Ž1990. 602.
w43x V.B. Petkova, J.-B. Zuber, Nucl. Phys. B 438 Ž1995. 347.
w44x A. Sagnotti, Ya.S. Stanev, Fortschr. Phys. 44 Ž1996. 585.
w45x R.E. Behrend, P.A. Pearce, J.-B. Zuber, J. Phys. A 31 Ž1998. L763.
w46x R.E. Behrend, P.A. Pearce, V.B. Petkova, J.-B. Zuber, Phys. Lett. B 444 Ž1998. 163.
w47x V.B. Petkova, J.-B. Zuber, in: GROUP21 Physical Applications and Mathematical Aspects of Geometry,
Groups, and A’s, Vol. 2, ed. H.-D. Doebner, W. Scherer, C. Schulte ŽWorld Scientific, Singapore, 1997.
p. 627.
w48x C. Schweigert, Theor. Math. Phys. 98 Ž1994. 326.
w49x P. Di Vecchia, M. Frau, I. Pesando, S. Sciuto, A. Lerda, R. Russo, Nucl. Phys. B 507 Ž1997. 259.
w50x A. Sen, Non-BPS states and branes in string theory, hep-thr9904207.
w51x A. Lerda, R. Russo, Stable non-BPS states in string theory: a pedagogical review, hep-thr9905006.
w52x H. Ooguri, Y. Oz, Z. Yin, Nucl. Phys. B 477 Ž1996. 407.
w53x D.-E. Diaconescu, M.R. Douglas, J. Gomis, J. High Energy Phys. 9802 Ž1998. 13.
w54x M. Oshikawa, I. Affleck, Phys. Rev. Lett. 77 Ž1996. 2604.
w55x V.G. Kac, I.T. Todorov, Commun. Math. Phys. 190 Ž1997. 57.
w56x A.N. Schellekens, N.P. Warner, Phys. Rev. D 34 Ž1986. 3092.
w57x F.A. Bais, P. Bouwknegt, Nucl. Phys. B 279 Ž1987. 561.
Nuclear Physics B 568 wPMx Ž2000. 594–640
www.elsevier.nlrlocaternpe
$
On the construction and solution of Uq Ž gl Ž2,1;C ..-symmetric
models
J. Gruneberg
¨ Theoretische Physik, UniÕersitat
Institut fur ¨ zu Koln,
¨ Zulpicher
¨ Straße 77, 50937 Cologne, Germany
Received 16 September 1999; accepted 22 October 1999
Abstract
$ X
The Uq Ž gl Ž2,1;C.. symmetric R-matrix, acting on the direct product of a four-dimensional
representation of this algebra with itself, constructed by Gould et al., is generalized to the case of
two different four-dimensional not necessarily unitary representations and to the case of a product
of these with the fundamental one. The latter is used in the algebraic Bethe ansatz solution of
some models. Together with the R-matrix acting on the product of the fundamental representation
with itself, these fundamental R-matrices form the starting point of an infinite fusion hierarchy of
exactly solvable models. A number of these are constructed explicitly. Some Bethe ansatz
equations for the eigenvalues of their transfer matrices are derived. q 2000 Elsevier Science B.V.
All rights reserved.
PACS: 75.10.Jm
Keywords: Graded Yang–Baxter equation; Quantum algebras; Bethe ansatz; Fusion hierarchies
1. Introduction
An important class of solvable models in statistical mechanics are the Bethe ansatz
w1x integrable ones. Baxter realized w2x that the unifying structure behind these models is
the famous Yang–Baxter equation ŽYBE.
X XX X XX X XX XX X
VV VV
R 12 Ž u,Õ . R13 Ž u,w . R V23V Ž Õ,w . s R V23V Ž Õ,w . R13
VV VV
Ž u,w . R12 Ž u,Õ . . Ž 1.1a .
X
V,V X and V XX are three in general different spaces. The operators RVV Žu. act in the direct
product V m V X . Both sides of Ž1.1a. are assumed to act in the three-fold product
V m V X m V XX . The lower indices i,j g 1,2,3 on the R-operators denote the two factors in
this product, on which the corresponding R-operator acts non-trivially. In general the
0550-3213r00r$ - see front matter q 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 5 5 0 - 3 2 1 3 Ž 9 9 . 0 0 6 7 5 - 6
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 595
2. Definitions
is defined via
†
Ž i :m j :. s Ž y1. p i p j ² i m ² j .
Ž . Ž .
Ž 2.3d .
Eqs. Ž2.3b. – Ž2.3d. are extended naturally to N-fold products Ž2.3a..
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 597
$
Here Yamane’s construction w18–20x of Uq ŽslŽ1. Ž2,1;C.. will be used: First a complex
vector space E f C 3 with a non-degenerate, symmetric, bilinear form ² P ,P :: E m E C, ™
is defined. The linear independent subset P ; E
P s a0 , a1 , a2 4 Ž 2.4a .
contains the simple, positiÕe roots a i . A symmetric bilinear form ² P ,P : is defined on P
according to
² a 0 , a 0 : :s 0 , ² a 0 , a 1 : :s y1 , ² a 0 , a 2 : :s 1,
² a 1 , a 1 : :s 2 , ² a 1 , a 2 : :s y1 , ² a 2 , a 2 : :s 0 Ž 2.4b .
and extended per linearity to E. Also, a parity function p: P ™ 0,14 is defined on P
according to
p Ž a1 . s 0 , pŽ a0 . s pŽ a2 . s 1 . Ž 2.4c .
It is well known that the choice of simple roots is not unique. To be precise, Ž2.4b. and
Ž2.4c. correspond to the distinguished choice preferred by mathematicians; see e.g. Ref.
w16x. The triple ŽE, P ,p. is called a datum w19x. To each g g E an unique element Hg
from the dual space H :s E ) can be related via
d Ž Hg . s² d ,g : ;d g E . Ž 2.4d .
Let " be an indeterminate. q is defined formally as
q :s e " . Ž 2.5a .
Let Cww " xx denote the C-algebra of formal power series in ". The datum ŽE, P ,p.
characterizes an associatiÕe, Z 2-graded Hopf algebra A" over the ring of formal power
series Cww " xx with generators Hg for each g g E, and Ea i , Fa i with i s 0,1,2. The
parities of the generators are defined as
p Ž Hg . s 0, p Ž Ei . s p Ž Fi . :s p Ž a i . . Ž 2.5b .
On the superalgebra A" , the supercommutator wP,P x is defined as
The Hopf algebra structure maps are coproduct D: A" A" m A" , antipode S: A" ™ ™A
™
and co-unit ´ : A" C. They are explicitly defined via
"
D Ž H . s H m I q I m H ;H g H ,
D Ž Ei . s Ei m I q K a i m Ei ,
D Ž Fi . s Fi m Ky1
a i q I m Fi ,
S Ž H . s yS Ž H . ;H g H ,
S Ž Ei . s yKy1
a i Ei , S Ž Fi . s yFi K a i ,
´ Ž Hg . s ´ Ž Ei . s ´ Ž Fi . s 0 , Ž 2.7 .
where I is the identity, and provide the Z 2-graded Hopf algebra structure w7,17x. A"
allows for a decomposition
A" s Ny [ C w " x H [ Nq , Ž 2.8 .
2 2
where Nq and Ny are the subalgebras generated by Ei 4is0 and Fi 4is0 respectively.
Obviously Nq( Ny and C " H is a Cartan subalgebra.
ww xx
According to Yamane w18,19x Uq Ž sl Ž1. Ž2,1;C.. is the superalgebra A" , defined above,
i.e. by Ž2.4., Ž2.5., with the Hopf algebra structure Ž2.7. on which the standard Serre
relations, appearing already in the paper of Palev and Tolstoy w24x,
w Ei , Ei x s w Fi , Fi x s 0 for i s 0,1,2, Ž 2.9a .
E1 , w E1 , E 2 x s F1 , w F1 , F2 x s0 Ž 2.9b .
and Yamanes extra Serre relations w18x
E0 , E2 , E 0 , w E 2 , E1 x s E2 , E0 , E 2 , w E 0 , E1 x ,
F0 , F2 , F0 , w F2 , F1 x s F2 , F0 , F2 , w F0 , F1 x
Ž 2.10 .
are imposed. In Ž2.9b. and Ž2.10. wwP,P xx is defined via
pŽ a i . pŽ a j .
Ei , E j s Ei E j y Ž y1 . q ² a i , a j : E j Ei Ž 2.11 .
for the generators Ei , E j g Nq and similarly for Fi , Fj g Ny. For non-graded simple
affine Lie algebras Gabber and Kac proved the Serre theorem w22x. The q-deformed
relations appear naturally in Drinfel’d’s theory w7x. For affine simple Lie superalgebras
and their q-deformed counterparts Serre relations, were systematically constructed by
Yamane w18,19x. $
Uq Ž sl Ž2,1;C.. is the subalgebra of Uq Žsl Ž1. Ž2,1;C.., obtained from the latter by
omitting the generators with index 0 in the defining relations. The elements
EH :s w E1 , E2 x , FH :s w F1 , F2 x Ž 2.12a .
are canonically related to the highest root,
a H :s a 1 q a 2 , Ž 2.12b .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 599
of Uq Ž sl Ž2,1;C... The notion of grading Ž2.5b. is naturally extended to the set Dq of all
non-zero, positiÕe roots of Uq Ž sl Ž2,1;C.., i.e. Dqs DŽ0.
q q
j DŽ1. , with
q q
DŽ0. s a14 , DŽ1. s a2 , aH 4 . Ž 2.13 .
$ Ž1.
$
Ž2,1;C.. is related to Uq sl Ž1. Ž2,1;C.. by
Finally U Ž gl Ž
$ $
q
3. Representations
™
The graded complex vector spaces Vd Ž2.1. are used as carrier spaces for the
finite-dimensional matrix representations G Vd :Uq Ž gl Ž2,1;C.. EndŽ Vd .. Operators from
EndŽ Vd . are expressed in the basis Ž2.2b.. They can also be written as a matrix S d , i.e.
A B
Sd s ž C D
,/ Ž 3.1a .
where A and D are eÕen d 0 m d 0 and d 1 m d 1 matrices, while B and C are the
complementary odd submatrices. The supertrace of S d Ž3.1a. is defined as
s-tr S d 4 s tr A4 y tr D 4 . Ž 3.1b .
Here it is useful to regard q g C in Ž2.5a. or
q s e 2h Ž 3.2a .
with h g C. For this choice some of the formal power series " may not converge. To
avoid this difficulty the generators Ha i g H can be eliminated in favor of the K a i Ž2.6..
All finite-dimensional irreducible representations of glŽN,1;C. for N 0 1 can be conti-
nously deformed into corresponding ones of UqŽ gl ŽN,1;C.., iff q is generic, that is not a
root of unity, which will be excluded here. Except if stated otherwise, h is assumed to
600 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
be real and positive and the results of Kac w16,23x remain valid. This was proved by
Palev and Tolstoy w24x. For a g A"
q a y qya sinh Ž 2h a .
w ax q s y1
s Ž 3.2b .
qyq sinh Ž 2h .
™
is defined as usual. Since sl Ž2,1;C. is basic w8,16x, a generalized, supersymmetric
Killing form B: H m H C is defined for any G Vd Ž2.1c. by
B Ž H , H X . s s-tr Vd G VdŽ H . G VdŽ H X . 4 ;H , H X g H . Ž 3.3a .
This definition could be extended to the full algebra. Here it is used to define the simple
roots a i Ži s 1,2.:
a i Ž H . s B Ž Ha i , H . ;H g H . Ž 3.3b .
The non-degenerate bilinear form ² P ,P : is realized as
² a , b :s B Ž Ha , Hb . ;a , b g E . Ž 3.3c .
™
Any representation G Vd can be made diagonal for all H g H by a convenient choice of
the basis for its carrier space Vd . The weights l i : H C are defined by
G VdŽ H . i :s l i Ž H . i :. Ž 3.4a .
A highest weight Õector c :g Vd , corresponding to the highest weight L Vd , is defined
by
G VdŽ a . c :s 0 ;a g Nq , Ž 3.4b .
where the decomposition Ž2.8. is applied to UqŽslŽ2,1;C... Being a functional on H , the
2
L Vd can be expressed in the basis a i 4is1 , i.e.
LVd s m 1 a 1 q m 2 a 2 . Ž 3.4c .
Note that m 1 , m 2 g C. The highest weights for any graded, irreducible representation are
unique and every other weight li of this representation can be written as
l i s LVd y p 1 a 1 y p 2 a 2 , Ž 3.4d .
where p 1 and p 2 are non-negative integers as usual. Two graded, irreducible representa-
tions are equivalent, iff they share the same highest weight w16x. This can be formulated
in a similar manner for lowest weights.
Uq Ž gl Ž2,1;C.. is naturally realized by its V3-module
V3at s Ž V3 , G V 3 . Ž 3.5a .
with Ž2,1.-grading on V3 Ž2.1c.. The generators are
G V 3Ž Ha 1 . s e11 y e 22 ,
G V 3Ž Ha 2 . s e 22 q e33 ,
G V 3Ž Hu . s e11 q e22 q e33 s IV 3 ,
G V 3Ž E1 . s e12 , G V 3Ž E2 . s e 23 ,
†
G V 3Ž Fi . s G V 3Ž Ei . for i s 1,2 . Ž 3.5b .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 601
They even coincide with the ones in the fundamental representation of sl Ž2,1;C. w16x.
The defining relations of Uq Ž gl Ž2,1;C.., as given in Section 2, can be checked easily.
Ž G V ,V3 . is irreducible. The weights for the V3-module Ž3.5. are listed in Appendix A.
3
Note that an equivalent representations G VX d can be obtained from any graded
representation G Vd by means of a similarity transformation,
G VX dŽ a . s S d ( G VdŽ a . ( Sy1
d ;a g Uq Ž sl Ž 2,1;C . . . Ž 3.6 .
S d is block-diagonal, i.e. B and C vanish in Ž3.1a., in order to conserve grading. Apart
from a trivial exchange of even and odd subspace in Ž2.1., these are the only
equivalence transformations w16x.
V4 with a Ž2,2.-grading Ž2.1c. is the carrier space of another parameter dependent
representation, G V4 of Uq Ž gl Ž2,1;C..: The module is denoted by
V4t Ž C . :s Ž V4 , G V4 . . Ž 3.7a .
The generators are given explicitly as
G V4Ž Ha 1 . s e 33 y e 44 ,
G V4Ž Ha 2 . s C Ž e11 q e33 . q Ž C q 1 . Ž e22 q e44 . ,
G V4Ž Hu . s e11 q e 22 q e 33 q e44 ,
C Cq1
G V4Ž E1 . s e 34 , G V4Ž E2 . s yk q 2 e 31 q m q 2
e 42 ,
C Cq1
) y 2 ) y 2
G V4Ž F1 . s e 43 , G V4Ž F2 . s yk q e13 q m q e 24 . Ž 3.7b .
The complex numbers k and k ) as well as m and m ) are not necessarily complex
conjugate to each other, but related to the parameter C by
w C x q s k )k , w C q 1 x q s m )m . Ž 3.7c .
Obviously q-numbers Ž3.2b. enter Ž3.7c.. Using Ž3.7., Ž2.5c. and Ž3.3., the defining
relations, stated in Section 2, can be checked immediately. It can easily be seen that
different choices of k , k ) and m , m ) subject to Ž3.7c. can be related by similarity
transformations Ž3.6..The representation G V4 is irreducible, iff C / 0,y 1. The weights
ŽA.2. are listed in Appendix A, in particular
LV4 s yC a 1 y 2C a 2 . Ž 3.7d .
™
In the limit h 0 Ž3.2., G V4 is well known, because it contains the lowest-dimensional
typical representation of sl Ž2,1;C. w16,23x. Eq. Ž3.7. can be derived from this limit w24x.
An irreducible representation G Vd is called typical, if any reducible, graded represen-
tation G with the same highest weight LVd Ž3.4b. can be put into the form of a direct
sum of G Vd and some other representation w16,23x. Since Nq( Ny, highest weight can
also be replaced by lowest weight in this definition.
q
For h s 0 Kac proved w16,23x that G Vd is typical, iff there is no a g DŽ1. , such that
q Ž
2 a f D 2.13 , for which
.
² LV q r , a :s 0
d
Ž 3.8a .
602 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
ž V˜ Ž1.
4 , G V˜4Ž1.
t
/ (V ŽCqC . ,
4
X
ž V˜ Ž 2.
4 , G V˜4Ž2.
t
/ ( V Ž C q C q 1.
4
X
Ž 3.11a .
exists, which also holds as a vector space direct sum. Explicit realizations of these
modules, ŽA.4. and ŽA.5., are given in Appendix A. Using the explicit realization of
ŽV &
˜ 8 , G ., also listed there ŽA.11., the relation
V8
ž Ṽ , G& / ( V
8 V8 8
t
Ž C q CX . Ž 3.11b .
is realized immediately. Similarly to Ž3.10. this yields
V4t Ž C . m V4t Ž C X . ( V4t Ž C q C X . [ V8t Ž C q C X . [ V4t Ž C q C X q 1 . . Ž 3.11c .
G V 3mV 3 can be analyzed similarly, leading to
V3at m V3at ( V5at [ V4t Ž 1 . . Ž 3.12 .
The weights ŽA.8. and an explicit realization of the weight vectors ŽA.7. on V3 m V3 of
V5at :s Ž Vˇ5 , G Vˇ5 . are listed in Appendix A. The analysis of representations here is solely
based on Kac’s criterion Ž3.8.. It requires only a little effort to construct the modules as
given in Appendix A. The Queensland group developed more sophisticated techniques
w9,12,13x, which are useful in order to deal with more complicated situations.
$
If G V is any finite-dimensional graded representation of Uq Ž gl Ž2,1;C.., an eÕalua-
tion representation G Vx , of Uq Ž gl Ž1. Ž2,1;C.. is constructed via
G Vx Ž Ha 0 . s K y IV m G V Ž Ha H . ,
G Vx Ž Ha i . s IV m G V Ž Ha i . for i s 1,2 ,
G Vx Ž Hu . s IV m G V Ž Hu . ,
G Vx Ž E0 . s x m G V Ž EH . ,
G Vx Ž Ei . s IV m G V Ž Ei . for i s 1,2 ,
G Vx y1
Ž F0 . s x m G V Ž FH . ,
G Vx Ž Fi . s IV m G V Ž Fi . for i s 1,2 Ž 3.13a .
for the generators. Here x is a letter like q in Section 2 and K a central element. For
q s 1, this is the well-known loop extension of UŽ gl Ž2,1;C.., see e.g. Refs. w16,23x.
$
Quantization of the latter w17x can be omitted by directly checking Yamanes defining
relations of UqŽ gl Ž1. Ž2,1;C.., as stated in Section 2.The representation Ž3.13a. is not
$
faithful, since the deriÕatiÕe, well known for q s 1 w16,23x, has been omitted. The
X
restricted superalgebra is called UqŽ gl Ž2,1;C...The evaluation representation is obtained
604 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
from Ž3.13a. by setting K equal to zero and the loop-variable x equal to a complex
number, i.e.
x s e2 u . Ž 3.13b .
Note that Ž3.13a. is not a Hopf-algebra automorphism. Also Ž3.13. is neither a highest
nor lowest weight representation. However, it was discovered by Drinfel’d w25x and
Jimbo w26x that this representation is related to the YBE Ž1.1b.. In particular u from
Ž3.13b. becomes a spectral parameter of difference type.
4. Fundamental R-matrices
Note that in Ž1.1b. the graded multiplication rule Ž2.3b. has to be applied. It is also
convenient to define
X
D Vd VdX :s Ž G Vd m G VdXX . ( D .
For all finite-dimensional quantized Lie superalgebras, e.g. for Uq Ž gl Ž2,1;C.. the
universal R-matrix has been constructed by Koroshkin and Tolstoy w28x. They also
provided a construction scheme for R X w29x, but the result X
would be of no use here,
because it is extremely cumbersome to evaluate X R Vd VdX Ž u. from Ž4.3.. However, the
existence of R X guarantees the existence of R Vd VdX Ž u. g EndŽ Vd m VdXX . for any pair of
irreducible, finite-dimensional representations G Vd , G VdXX of Uq Ž g ., with g finite, which
allow an extension Ž3.13. to the affinization Uq Ž gˆ X ..
The graded YBE itself was X
™
introduced by Kulish and Sklyanin long ago w27x. The
elementary intertwiner Pˇ Vd VdX :Vd m VdX VdXX m Vd is defined by its action
X
Pˇ V d V dX pŽ i . pŽ j .
i :m j :s Ž y1 . j :m i : Ž 4.4a .
on the canonical basis Ž2.3. of Vd m VdXX . Obviously
X X
Pˇ VdX Vd ( Pˇ Vd VdX s IVdmVdXX s IVd m IVdXX .
A more general intertwiner is defined as
X X X
RˇVd VdX Ž u . :s Pˇ Vd VdX ( R Vd VdX Ž u . . Ž 4.4b .
Immediately from Ž1.1b. and Ž4.4b. follows w27x
X XX XX X X XX X XX
Rˇ12 Ž u y Õ . RˇV23d VdX Ž u . Rˇ12 Ž Õ . s RˇV23d VdX Ž Õ . Rˇ12 Ž u . RˇV23dX VdX Ž u y Õ . .
V dX V dX X X V d V dX V d V dX X X
Ž 4.4c .
The interpretation as consistency equations for intertwiners is obvious. Moreover in this
form of the YBE additional signs Ž2.3b X
. due to grading vanish. These signs can be
VV Ž .
absorbed into the definition of R u in order to obtain a new solution to the
non-graded YBE Ž1.1b., see Ref. w27x. X
Having assured the existence of R V 3V 3 Ž u., R V 3V4 Ž u. and R V4 V4 Ž u. the simplest way of
constructing them is Jimbo’s method w14x, see also the book by Ma w30x: Using Ž4.1.,
Ž4.3. and Ž4.4. the first equation of Ž4.2a. can be written as
X X X X
RˇVd VdX Ž u . D Vd Vd Ž a . s D VdX Vd Ž a . Rˇ Vd VdX Ž u . Ž 4.5a .
for any a g UqŽglŽ2,1;C.. and almost all u g C. Using Ž3.13. this is done more explicitly
for E0 and F0 , i.e.
X
Ř Vd VdX Ž u . e 2 u G VdŽ Fa H . m IVdXX q G Vd Ž Ky1
a H . m G V dX Ž Fa H .
X
X
s G VdXX Ž Fa H . m IVd q e 2 u G VdXX Ž Ky1 ˇ V d V dX Ž u .
a H . m G V d Ž Fa H . R Ž 4.5b .
and
X
Ř Vd VdX Ž u . ey2 u G VdŽ Ea H . m G VdXX Ž K a H . q IVd m G VdXX Ž Ea H .
X
s G VdXX Ž Ea H . m G VdŽ K a H . q ey2 u IVdXX m G VdŽ Ea H . Rˇ Vd VdX Ž u . . Ž 4.5c .
X
Eqs. Ž4.5. are Jimbo’s equations. They determine Rˇ V d V dX Ž u. almost uniquely w14x. It is
easy to see that they imply
X X
RˇVd Vd Ž 0 . A IVdmVd , RˇVdX Vd Ž u . RˇVd VdX Ž yu . A IVdmVdXX .
606 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
X
Therefore a natural normalization of RˇVd VdX Ž u. is
Ý PiV V s IV mV d d
d d
. Ž 4.8c .
i
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 607
X
The PiVdX Vd have been calculated for V3 = V4 Ž3.10., V4 = V4X Ž3.11. and V3 = V3 Ž3.12.
in Appendix B. For V3 = V4 the normalization N Ž u. in Ž4.6. is fixed by N Ž u. s ry1
2
Ž u.:
RˇV 3V4 Ž u . s r Ž u . P1V 3V4 q P2V 3V4 Ž 4.9a .
V3V4 V V
with P1 from ŽB.1. and P2 3 4 from ŽB.7.. This ansatz fulfills Ž4.5. per construction.
In order to determine r Žu. one has to solve simply one out of eleven non-trivial
components of, e.g., Eq. Ž4.5c.. This yields
sinh Ž h Ž C q 2 . q u .
r Ž u. s . Ž 4.9b .
sinh Ž h Ž C q 2 . y u .
It is however a valuable test to check all non-trivial equations from Ž4.5b. and Ž4.5c.,
which has been done.
Similarly for V4 m V4X the normalization is fixed by N Ž u. s ry1 2
Ž u.. So the ansatz
Ž4.6. reads
X X X X
RˇV4 V4 Ž u . s f Ž u . P1V4 V4 q P2V4 V4 q g Ž u . P3V4 V4 Ž 4.10a .
X
V4V4
with Pi for i s 1,2,3 from B.2 , B.8 and B.3 . From 4.5b and 4.5c follows
Ž . Ž . Ž . Ž . Ž .
sinh Ž h Ž C q C X . q u .
f Ž u. s ,
sinh Ž h Ž C q C X . y u .
sinh Ž h Ž C q C X q 2 . y u .
g Ž u. s . Ž 4.10b .
sinh Ž h Ž C q C X q 2 . q u .
Again all non-trivial components of Ž4.5b. and Ž4.5c. have been checked.
Actually f Ž u., g Ž u. have been known before w13x. They can be calculated in terms of
the Žsecond-order. Casimir operator, see e.g. Ref. w16x of the representations in Ž3.11.,
without knowledge of the intertwiners. The formula is apparently due to Reshetikhin
w32,33x and explained in detail in the textbook by Ma w30x. This formalism was further
developed by Delius et al. w12x and used by Bracken et al. to calculate f Ž u. and g Ž u.
w13x. In the limiting case V4 s V4X the intertwiners RˇV4 V4 Ž u. have been calculated by
Gould et al. w9x. For further details see Appendix B.
For completeness RˇV 3V 3 is given as
sinh Ž 2h y u . V V
Ř V 3V 3 s P1V 3V 3 q P 3 3 Ž 4.11 .
sinh Ž 2h q u . 2
with PiV 3V 3 for i s 1,2 from ŽB.4. and ŽB.5..
From the definition Ž4.4b. follows immediately
X X X
R Vd VdX Ž u . s Pˇ VdX Vd ( RˇVd VdX Ž u . .
This can be used to obtain the final results:
R V 3V4 Ž u . s r Ž u . e11 m Ž e11 q e 33 . q e 22 m Ž e11 q e 44 .
q a 0 Ž u . e11 m Ž e 22 q e 44 . q e 22 m Ž e 22 q e 33 . q e 33
m b 0 Ž u . e11 y e 22 q g 0 Ž u . Ž e 33 q e44 . q d 1 Ž u . e12 m e43
q d 2 Ž u . e 21 m e 34 y ´ 1 Ž u . w e13 m e 23 q e 23 m e 24 x
q ´ 2 Ž u . w e 31 m e 32 q e 32 m e 42 x y z 1 Ž u . e13 m e 41 y qy1 e 23 m e 31
q z 2 Ž u . w e 31 m e14 y q e 32 m e13 x , Ž 4.12 .
608 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
with r Ž u. from Ž4.9b. and the remaining coefficients ŽC.1. listed in Appendix C. The
q s 1-limit of this expression was obtained by Kulish in w34x. From Ž4.10. follows
X
R V4 V4 Ž u . s f Ž u . e11 m e11 q g Ž u . e 22 m e 22 y e33 m e33 y e44 m e44 q r5 e22 m e11
q r5X e11 m e 22 y r 10 Ž e 33 m e44 y e44 m e33 . y r 7 Ž e33 q e44 . m e11
y r 7X e11 m Ž e 33 q e44 . y r 9 Ž e 33 q e 44 . m e 22 y r 9X e 22 m Ž e 33 q e44 .
q r 1 e 21 m e12 q r 1X e12 m e 21 y r4 e 43 m e 34 y r4X e34 m e43
q r 2 Ž e 31 m e13 q e 41 m e14 . y r 2X Ž e13 m e 31 q e14 m e 41 .
q r 3 Ž e 32 m e 23 q e 42 m e 24 . y r 3X Ž e 23 m e 32 q e 24 m e 42 .
y r6 Ž e 24 m e13 y qy1 e 23 m e14 . q r 8X Ž e42 m e 31 y q e 32 m e 41 .
q r6X Ž e13 m e 24 y q e14 m e 23 . y r 8 Ž e 31 m e42 y qy1 e 41 m e32 .
Ž 4.13 .
X
with f Ž u. and g Ž u. from Ž4.10. and riŽ . for i s 1, . . . ,10 ŽC.2. also listed in Appendix
C. Note that the rescaled parameters Ž3.9. have been used in Ž4.9. but not in Ž4.12., in
order to unify the notation of Ž4.12. and Ž4.13..
Applying the same operation to Ž4.11. yields
R V 3V 3 Ž u . s e11 m e11 q e 22 m e 22 y d Ž u . e 33 m e 33
q c Ž u . e11 m Ž e 22 q e 33 . q e22 m Ž e11 q e 33 . q e33 m Ž e11 q e22 .
q a Ž u . w e 21 m e12 q e 31 m e13 q e32 m e23 x
q b Ž u . w e12 m e 21 y e13 m e 31 y e 23 m e32 x Ž 4.14 .
with aŽ u.,bŽ u.,cŽ u. and dŽ u. listed in ŽC.3. in Appendix C. Ž4.14. was known to Perk
and Schultz w35x. It is the standard q-deformation of the Yh Ž gl Ž2,1;C..-symmetrical
R-matrix listed by Kulish and Sklyanin w27x. Comparing Ž4.14. with the general
$
Yh Ž gl Ž m,n;C..-symmetrical R-matrix given there, it can immediately be generalized to
X
an Uq Ž gl Ž m,n;C..-symmetrical R-matrix R V mq nV mq n Ž u..
Also R V4 V 3 Ž u. can be obtained trivially from Ž4.12. via
X X X X X X
R VdX Vd Ž u . s Pˇ V dX V d
( R Vd VdX Ž u . ( Pˇ V dX V d
s Rˇ Vd VdX Ž u . ( Pˇ V dX V d
, Ž 4.15 .
X
V d V dX Ž
which holds for arbitrary R u.. This yields
R V4 V 3 Ž u . s r Ž u . Ž e11 q e 33 . m e11 q Ž e11 q e44 . m e 22 .
q a 0 Ž u . Ž e 22 q e 44 . m e11 q Ž e 22 q e 33 . m e 22
q b 0 e11 y e 22 g 0 Ž u . Ž e 33 q e 44 . m e 33 q d 1 Ž u . e 43 m e12
q d 2 Ž u . e 34 m e 21 q ´ 1 Ž u . w e 23 m e13 q e 24 m e 23 x
y ´ 2 Ž u . w e 32 m e 31 q e 42 m e 22 x q z 1 Ž u . e 41 m e13 y qy1 e 31 m e 23
y z 2 Ž u . w e14 m e 31 y q e13 m e 32 x , Ž 4.16 .
with r Ž u. from Ž4.9b. and the remaining coefficients from ŽC.1. in Appendix C.
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 609
X
$
Directly from the existence of the universal R-matrix for Uq Ž gl Ž2,1;C.. w29x, and
Drinfel’d’s YBE Ž4.2b. follows that Ž4.12. together with Ž4.14., both constructed as
solutions to Jimbo’s equations Ž4.5., fulfill the YBE
V 3V 3 V 3V4
R 12 Ž u y Õ . R13 Ž u . R V233V4 Ž Õ . s R V233V4 Ž Õ . R13
V 3V4 V 3V 3
Ž u . R12 Ž uyÕ. . Ž 4.17a .
It does not matter whether the rescaling Ž3.9. is applied or not, or whether k and k )
from Ž3.7. are complex conjugates of each other. The only restrictions on k , k ) , m and
m ) , appearing in Ž4.12., are indeed given by Ž3.7c.. Different choices of these
parameters, preserving Ž3.7c., are related by Ž3.6.. It is well known, see e.g. Ref. w27x,
that this leads to an invariance of the YBE Ž1.1b. under
X
R Vd Vd Ž u . ™ Ž S mS d
X
d
X
X
. R Vd VdX Ž u . Ž Sy1 X
d m Ž Sd .
X
y1
.,
X
where S d acts on G Vd and S dX on G VdX X . Note that C is allowed to be complex.Similarly
the R-matrix Ž4.13. fulfills the YBE
X XX X XX X XX XX X
V4 V4 V4 V4
R 12 Ž u y Õ . R13 Ž u . R V234 V4 Ž Õ . s R V234 V4 Ž Õ . R13
V4 V4 V4 V4
Ž u . R12 Ž uyÕ. Ž 4.17b .
and together with Ž4.12.:
X X X X
V 3V 4 V 3V4
R 12 Ž u y Õ . R13 Ž u . R V234 V4 Ž Õ . s R V234 V4 Ž Õ . R13
V 3V 4 V 3V4
Ž u . R12 Ž uyÕ. . Ž 4.17c .
The argument given above applies also to the other V4-representations in Ž4.17b.,
Ž4.17c.. The YBE Ž1.1b. with V s V X s V XX s V3 was known to Perk and Schultz w35x.
X X X XX X X
0 s Ž Pˇ Ž u . Ž Pˇ
V dX V d X
( P1Vd VdX . 12 R 12,3
ŽV dmV dX .V dX V dX V d
( P2Vd VdX . 12 .
This is equivalent to the triangularity condition
X X XX X
Ž Pˆ1V V . 12 R12,3
d dX ŽV mV d dX .V dX
X
Ž u . Ž Pˆ2Vd VdX . 12 s 0 . Ž 5.2b .
X
Here Pˆ i
VdVdX
for i s 1,2 are the proper projectors from Vd m Vd on VdŽi.i Ž4.8a.. Note that X
X
only this property is necessary for fusion. In particular there is no need for an underlying
610 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
group structure. This is illustrated e.g. in a very recent preprint by Maassarani w31x.
Using Ž5.2b., Ž4.8. and the YBE Ž1.1b. again, it can be concluded that
XX X X XX X
Ž u . :s Ž Pˆ1Vd VdX . 12 R12,3 Ž u . Ž Pˆ1Vd VdX . 12
Ž1.
V d V dX X ŽV dmV dX .V dX X
R Ž12.3
1 Ž 5.2c .
is indeed an R-matrix, i.e. a solution to the YBE, acting in VdŽ1.
1
m VdXXXX , with VdŽ1.
1
; Vd m
X
VdX . Similarly from the definition
XX X XX XX X
X
V dX ŽV dmV dX . V dX V d X
V d X V dX X
R 1,23 Ž u . :s R13 Ž u q u1 . R12 Ž u y u2 . Ž 5.3a .
and the corresponding triangularity condition
X XX X X
Ž Pˆ1V V . 23 R1,23
d dX V ŽV mVdX
X
d dX .
Ž u . Ž Pˆ2Vd VdX . 23 s 0 Ž 5.3b .
the R-matrix acting in VdXXXX m VdŽ1.
1
with VdXXXX arbitrary and VdŽ1.
1
; Vd m VdXX can be derived
analogously
XX X XX X X
X
V dX V d
R 1Ž23.
Ž1.
ˆ1Vd VdX . 23 R1,23
1 Ž u . :s Ž P
V dX ŽV dmV dX .
Ž u . Ž Pˆ1Vd VdX . 23 .
X
Ž 5.3c .
The results Ž5.2. and Ž5.3. can be combined to
X X X X X X
V d V dX Vd Vd
ŽV dmV dX .ŽV dmV dX .
R 12 ,34 Ž u . :s R14 Ž u q u 0 . R13 Ž u . R V24dX VdX Ž u . R V23dX Vd Ž u y u 0 . , Ž 5.4a .
which fulfills the triangularity conditions
X X X X
Ž Pˆ1V V . 12 R12,34
d dX ŽV mV .ŽV mV .
d dX
Ž u . Ž Pˆ2V V . 12 s 0,
d dX d dX
X X X X
vŽN1 . Ž u . 0 0
V 3at
Tˆa Ž 1, N < u . V l w :s
0
Bˆ1 Ž u y h .
v NŽ2. Ž u .
Bˆ2 Ž u y h .
0
v NŽ3. Ž u. 0 a
V l w :. Ž 5.5 .
Žn.
where d Žn. are the inhomogeneities from Ž1.2a.. v Vi Žu. are the local Õacuum ampli-
tudes. They are indexed according to the basis of V, in which the corresponding
diagonal elements are taken. Bˆ i Žu. are some operators acting on quantum space. The
actual eigenvectors can be written as
Ž 5.6b .
which are a consequence of the YBE Ž1.1b.. For V s V X s V3at and V :s V l w : the
Bethe equations read w11x
N sinh Ž la y d Ž i. q Ž C q 1 . h . m sinh Ž la q m k q h .
Ł sinh Ž l Ž i.
s Ł Ž 5.7a .
is1 a yd y Ž C q 1. h . ks1 sinh Ž la q m k y h .
for a s 1, . . . ,M and
M sinh Ž l j q mb q h . m sinh Ž mb y m k q 2h .
Ł s Ł Ž 5.7b .
js1 sinh Ž l j q mb y h . ks1, k/ b sinh Ž mb y m k y 2h .
at
Ž4.6a. in all coefficients as listed in Appendix C. After that the eigenvalues of tˆ V 3 Ž N < u.
to Ž5.6b. read
at
LVN3 Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j q h . m sinh Ž u y m k y 2h .
s v NŽ1. Ž u .
½ Ł Ł
js1 sinh Ž u y l j y h . ks1 sinh Ž u y m k .
m sinh Ž u y m k q 2h . M sinh Ž u y l j q h .
qŁ
ks1 sinh Ž u y m k . 5 y v NŽ 3 . Ž u . Ł
js1 sinh Ž u y l j y h .
Ž 5.8a .
t X
LVN4 ŽC . Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j q Ž C X q 1 . h .
s v NŽ1. Ž u . Ł
js1 sinh Ž u y l j y Ž C X y 1 . h .
M sinh Ž u y l j q Ž C X q 1 . h .
q v NŽ2. Ž u . Ł
js1 sinh Ž u y l j y Ž C X q 1 . h .
M sinh Ž u y l j q Ž C X q 1 . h . m sinh Ž u y m k y Ž C X y 2 . h .
y v NŽ3. Ž u. ½ Ł
js1 sinh Ž u y l j y Ž C X y 1 . h .
Ł
ks1 sinh Ž u y m k y C Xh .
M sinh Ž u y l j q Ž C X q 1 . h . m sinh Ž u y m k y Ž C X q 2 . h .
qŁ
js1 sinh Ž u y l j y Ž C X q 1 . h .
Ł
ks1 sinh Ž u y m k y C Xh . 5 .
Ž 5.9a .
For details see Ref. w11x. The local vacuum amplitudes are
t
v 1V4 ŽC . Ž u . s sinh Ž u q Ž C y C X . h . sinh Ž u q Ž C y C X q 2 . h . ,
t
v V2 4 ŽC . Ž u . s sinh Ž u y Ž C q C X q 2 . h . sinh Ž u y Ž C q C X . h . ,
t t
v V3 4 ŽC . Ž u . s v4V4 ŽC . Ž u . s sinh Ž u q Ž C y C X . h . sinh Ž u y Ž C q C X . h . . Ž 5.9b .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 613
Naturally the Bethe equations Ž5.7. are also analyticity conditions for Ž5.9.. Note that
Ramos and Martins developed another Bethe ansatz scheme w41x, which was established
as a general, constructive method recently w42x and could be used here. With u 0 s u1 q u 2
the fusion relations Ž5.2. for R-matrices can be extended trivially to transfer matrices
X 1 2
tˆ Vd Ž N < u q u1 . P tˆ VdX Ž N < u y u 2 . s tˆ Vd1 Ž N < u . q tˆ Vd 2 Ž N < u . . Ž 5.10 .
The transfer matrices on the right-hand side are fixed only up to a shift in the spectral
parameter u and a normalization factor.
From Ž3.12. and Ž4.14. the conditions Ž5.1. are fulfilled with u 0 s 2h. The analytical
normalization yields
at at
tˆ V 3 Ž N < u q h . P tˆ V 3 Ž N < u y h .
N
t at
s tˆ V4 Ž1. Ž N < u . q
ž /
Ł sinh Ž u y dŽ i . q Ž C q 1. h . tˆ V Ž N < u . .
is1
at
5
From this Ž5.8a. and Ž5.9a. the eigenvalue for tˆ V5 Ž N < u. in the eigenvectors Ž5.6b. can
be calculated immediately:
at
LVN5 Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j q 2h . m sinh Ž u y m k y 3h .
s v NŽ1. Ž u .
½Ł Ł
js1 sinh Ž u y l j y 2 h . ks1 sinh Ž u y m k q h .
M sinh Ž u y l j . m sinh Ž u y m k q 3h . sinh Ž u y m k y 3h .
qŁ Ł
js1 sinh Ž u y l j y 2 h . ks1 sinh Ž u y m k q h . sinh Ž u y m k y h .
m sinh Ž u y m k q 3h .
qŁ
ks1 sinh Ž u y m k y h . 5
M sinh Ž u y l j q 2h . m sinh Ž u y m k y h .
y v NŽ 4 . Ž u .
½Ł Ł
js1 sinh Ž u y l j y 2 h . ks1 sinh Ž u y m k q h .
M sinh Ž u y l j . m sinh Ž u y m k q 3h .
qŁ Ł
js1 sinh Ž u y l j y 2 h . ks1 sinh Ž u y m k q h . 5 Ž 5.11a .
Similarly from Ž3.10. and Ž4.12. the conditions Ž5.1. are fulfilled with u 0 s yŽ C X q
2.h. This yields
at t X
tˆ V 3 Ž N < u y Ž C X q 1 . h . P tˆ V4 ŽC . Ž N < u q h .
N X
t
s
ž Ł sinh Ž u y dŽ i . q Ž CŽ i . y C q 1. h . tˆ V ŽC . Ž N < u .
is1
X
/ 8
N X
t
y
ž is1 /
Ł sinh Ž u y dŽ i . y Ž CŽ i . q C y 1. h . tˆ V ŽC q1. Ž N < u . .
X 4
t X
Again from this, Ž5.8a. and Ž5.9a. the eigenvalue for tˆ V 8 ŽC . to the eigenvectors
Ž5.6b. can be calculated:
t X
LVN8 ŽC . Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j q Ž C X q 2 . h .
s v NŽ1. Ž u . Ł
js1 sinh Ž u y l j y Ž C X y 2 . h .
M sinh Ž u y l j y C Xh . m sinh Ž u y m k y Ž C X q 3 . h .
=
½Ł js1 sinh Ž u y l j y Ž C X q 2 . h .
Ł
ks1 sinh Ž u y m k y Ž C X q 1 . h .
m sinh Ž u y m k y Ž C X y 1 . h .
qŁ
ks1 sinh Ž u y m k y Ž C X q 1 . h . 5
M sinh Ž u y l j q Ž C X q 2 . h . m sinh Ž u y m k y Ž C X q 1 . h .
q v NŽ 3 . Ž u . Ł
js1 sinh Ž u y l j y Ž C X q 2 . h . ½ Ł
ks1 sinh Ž u y m k y Ž C X y 1 . h .
M sinh Ž u y l j y C Xh . m sinh Ž u y m k y Ž C X y 3 . h .
qŁ
js1 sinh Ž u y l j y Ž C X y 2 . h .
Ł
ks1 sinh Ž u y m k y Ž C X y 1 . h . 5
M sinh Ž u y l j y C Xh . sinh Ž u y l j q Ž C X q 2 . h .
y v NŽ 5 . Ž u. Ł
js1 sinh Ž u y l j y Ž C X q 2 . h . sinh Ž u y l j y Ž C X y 2 . h .
M sinh Ž u y l j q Ž C X q 2 . h . m sinh Ž u y m k y Ž C X q 3 . h .
y v NŽ 6 . Ž u. ½ Ł
js1 sinh Ž u y l j y Ž C X q 2 . h .
Ł
ks1 sinh Ž u y m k y Ž C X y 1 . h .
M sinh Ž u y l j y Ž C X . h . sinh Ž u y l j q Ž C X q 2 . h .
qŁ
js1 sinh Ž u y l j y Ž C X q 2 . h . sinh Ž u y l j y Ž C X y 2 . h .
m sinh Ž u y m k y Ž C X q 3 . h . sinh Ž u y m k y Ž C X y 3 . h .
=Ł
ks1 sinh Ž u y m k y Ž C X q 1 . h . sinh Ž u y m k y Ž C X y 1 . h .
M sinh Ž u y l j q Ž C X q 2 . h . m sinh Ž u y m k y Ž C X y 3 . h .
qŁ
js1 sinh Ž u y l j y Ž C X y 2 . h .
Ł
ks1 sinh Ž u y m k y Ž C X q 1 . h . 5 .
Ž 5.12a .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 615
which is apparently different from Ž5.5. and requires a different Bethe ansatz, although
the eigenvectors take the form Ž5.6b., due to the convenient choice of Bˆi Ž u. in Ž5.14..
For V Ž n. s V5at for n s 1, . . . , N in Ž1.2a. the action on the direct product of highest
weight vectors 1 :n is precisely the same. From the previous remarks it is sufficient to
solve this problem and replace the local vacuum amplitudes according to Ž5.13.. Note
that V Ž n. s V3at ,V5at can be mixed arbitrarily in Ž1.2.. This contributes just the corre-
sponding local vacuum amplitude to the eigenvalue equation via Ž5.6a.. The Bethe
equations are
Ž n.
N v 1V Ž la y d Ž i. y h .
Ł Ž n.
is1 v V2 Ž la y d Ž i. y h .
M sinh Ž la y l j q 2h . m sinh Ž la y m k q h .
s Ł Ł Ž 5.15a .
js1, j/ a sinh Ž la y l j y 2 h . ks1 sinh Ž la y m k y h .
for a s 1, . . . ,M and
M sinh Ž l j y mb q h .
Ł s1 Ž 5.15b .
js1 sinh Ž l j y mb y h .
at
LVN3 Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j y h .
s v NŽ1. Ž u . Ł
js1 sinh Ž u y l j q h .
M sinh Ž u y l j q 3h . m sinh Ž u y m k .
q v NŽ2. Ž u .
½ Ł
js1 sinh Ž u y l j q h .
y1
5 Ł
ks1 sinh Ž u y m k q 2h .
,
Ž 5.16a .
with local vacuum amplitudes, which enter Ž5.15a., from
at
v 1V 3 Ž u . s sinh Ž u q 2h . ,
at at
v V2 3 Ž u . s v V3 3 Ž u . s sinh Ž u . ,
at
v 1V 5 Ž u . s sinh Ž u q 3h . ,
at at
v V2 5 Ž u . s v V3 5 Ž u . s sinh Ž u y h . . Ž 5.16b .
$
In contrast to Ž5.7. the second Žnested. Bethe ansatz is graded and involves the
X
Uq Ž gl Ž1,1;C..-symmetric R-matrix. At least superficially any similarity to the equiva-
lent Bethe ansatz based on Ž5.7. is lost. However, the equivalence of eigenvalue
equations can be shown using standard methods from the theory of complex functions.
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 617
For V s V4t Ž C X . in Ž1.2b. it is convenient to apply the strategy from w11x. The
calculations are involved, but since the method has been explained in detail there, it is
sufficient to state the result:
t X
LVN4 ŽC . Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j q Ž C X y 1 . h . m sinh Ž u y m k y C Xh .
s v NŽ1. Ž u . 1 y Ł
½ js1 sinh Ž u y l j q Ž C X q 1 . h . 5 Ł
ks1 sinh Ž u y m k q C Xh .
M sinh Ž u y l j q Ž C X q 3 . h .
q v NŽ2. Ž u . 1 y Ł
½ js1 sinh Ž u y l j q Ž C X q 1 . h . 5
m sinh Ž u y m k y C Xh .
=Ł Ž 5.17a .
ks1 sinh Ž u y m k q Ž C X q 2 . h .
with the local vacuum amplitudes
at at
v 1V 3 Ž u . s v V3 3 Ž u . s sinh Ž u q Ž C X q 2 . h . ,
at at
v V2 3 Ž u . s v4V 3 Ž u . s sinh Ž u q C Xh . ,
at at
v 1V 5 Ž u . s v V3 5 Ž u . s sinh Ž u q Ž C X q 3 . h . ,
at at
v V2 5 Ž u . s v4V 5 Ž u . s sinh Ž u q Ž C X y 1 . h . . Ž 5.17b .
The fusion is completely analogous to the previous one:
N
at at at
tˆ V 3 Ž N < u q h . P tˆ V 3 Ž N < u y h . s
ž /
Ł sinh Ž u y dŽ i . q h . tˆ V Ž N < u .
is1
5
N
t
q
ž is1 /
Ł sinh Ž u y dŽ i . y h . tˆ V Ž1. Ž N < u .
4
Ž n.
for V s V3at
and similarly for V Ž n.
s V5at
for n s 1, . . . , N, which will be left for the
reader. From this, Ž5.16a. and Ž5.17a. follows
at
LVN5 Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j y 2h . m sinh Ž u y m k y h .
s v NŽ1. Ž u . Ł Ł
js1 sinh Ž u y l j q 2 h . ks1 sinh Ž u y m k q h .
M sinh Ž u y l j y 2h . m sinh Ž u y m k q h .
q v NŽ2. Ž u . Ł Ł
js1 sinh Ž u y l j . ks1 sinh Ž u y m k q 3h .
M sinh Ž u y l j q 4h . M sinh Ž u y l j q 4h .
=
½ Ł
js1 sinh Ž u y l j q 2h . 5
y 1 q v NŽ3. Ž u .
½ Ł
js1 sinh Ž u y l j .
M sinh Ž u y l j y 2h . m sinh Ž u y m k y h .
yŁ
js1 sinh Ž u y l j . 5 Ł
ks1 sinh Ž u y m k q 3h .
Ž 5.18a .
618 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
at t X
tˆ V 3 Ž N < u y Ž C X q 1 . h . P tˆ V4 ŽC . Ž N < u q h .
X
N X
at t
s tˆ V 8 ŽC .
Ž N < u. y ž is1
X
/
Ł sinh Ž u y dŽ i . y Ž C q 1. h . tˆ V ŽC q1. Ž N < u . .
4
For V Ž n. s V5at the relation is similar and shall be left to the reader. Note that the actual
local quantum spaces V3at and V5at can be mixed in the fusion relations too. This leads of
course to ‘‘mixed prefactors’’.
For V s V3at and V Ž n. s V4t Ž C Ž n. . in Ž1.2. the action of the monodromy matrix Ž1.2a.
on the direct product V hw : of local highest weight vectors 1 : can be obtained trivially
and leads again to a different pattern:
vŽN1 . Ž u . 0 Bˆ1 Ž u q 2h .
V 3at
Tˆa Ž 1, N < u . V hw :s
0
0
v NŽ2. Ž u .
0
Comparing with Ž5.14., one concludes that the corresponding Bethe ansatz is entirely
Bˆ2 Ž u q 2h .
v NŽ3. Ž u. 0 a
V hw :. Ž 5.19 .
different and local quantum spaces cannot be mixed. However, the eigenvectors still
take the form Ž5.6b. and the whole calculation is very similar to the lowest weight Bethe
ansatz. In particular the Bethe equations read
N sinh Ž la y d Ž i. y Ch . m sinh Ž la q m k q h .
Ł sinh Ž l Ž i.
sŁ Ž 5.20a .
is1 a yd q Ch . ks1 sinh Ž la q m k y h .
for a s 1, . . . ,M and
M sinh Ž l j q mb q h . m sinh Ž mb y m k q 2h .
Ł s Ł Ž 5.20b .
js1 sinh Ž l j q mb y h . ks1, k/ b sinh Ž mb y m k y 2h .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 619
for b s 1, . . . ,m. For V s V3at in Ž1.2. the corresponding ‘‘highest weight eigenvalue
equations’’ read
at
LVN3 Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j . m sinh Ž u q m k q 3h .
s v NŽ1. Ž u .
½ Ł Ł
js1 sinh Ž u y l j q 2 h . ks1 sinh Ž u q m k q h .
m sinh Ž u q m k y h . M sinh Ž u y l j .
qŁ
ks1 sinh Ž u q m k q h . 5 y v NŽ 3 . Ž u . Ł
js1 sinh Ž u y l j q 2h .
Ž 5.21a .
t X
LVN4 ŽC . Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j y C Xh . M sinh Ž u y l j y C Xh .
s v NŽ1. Ž u. Ł q v NŽ2. Ž u. Ł
js1 sinh Ž u y l j q C Xh . js1 sinh Ž u y l j q Ž C X q 2 . h .
M sinh Ž u y l j y C Xh . m sinh Ž u q m k q Ž C X q 1 . h .
y v NŽ3. Ž u. ½Ł js1 sinh Ž u y l j q Ž C X q 2 . h .
Ł
ks1 sinh Ž u y m k q Ž C X y 1 . h .
M sinh Ž u y l j y C Xh . m sinh Ž u y m k q Ž C X y 3 . h .
qŁ
js1 sinh Ž u y l j q C Xh .
Ł
ks1 sinh Ž u y m k q Ž C X y 1 . h . 5 Ž 5.22a .
Nevertheless the fusion procedure for the ‘‘highest weight ansatz’’ with V4t Ž C X . as
local quantum space is completely analogous to the ‘‘lowest weight ansatz’’ and reads
at at
tˆ V 3 Ž N < u q h . P tˆ V 3 Ž N < u y h .
N
t at
s tˆ V4 Ž1. Ž N < u . q
ž /
Ł sinh Ž u y dŽ i . q Ž CŽ i . q 1. h . tˆ V Ž N < u . .
is1
at
5
From this the corresponding eigenvalue equation for tˆ V5 Ž N < u. with V Ž n. s V4t Ž C Ž n. .
follows as
at
LVN5 Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j y h . m sinh Ž u q m k q 4h .
s v NŽ1. Ž u .
½ Ł Ł
js1 sinh Ž u y l j q 3h . ks1 sinh Ž u q m k q 2 h .
M sinh Ž u y l j q h . m sinh Ž u q m k q 4h .
qŁ Ł
js1 sinh Ž u y l j q 3h . ks1 sinh Ž u q m k q 2 h .
sinh Ž u q m k y 2h . m sinh Ž u q m k y 2h .
=
sinh Ž u q m k .
qŁ
ks1 sinh Ž u q m k q 2h . 5
M sinh Ž u y l j y h . m sinh Ž u q m k q 2h .
y v NŽ 4 . Ž u .
½ Ł Ł
js1 sinh Ž u y l j q 3h . ks1 sinh Ž u q m k .
M sinh Ž u y l j q h . m sinh Ž u q m k y 2h .
qŁ Ł
js1 sinh Ž u y l j q 3h . ks1 sinh Ž u q m k . 5 Ž 5.23a .
$ $
t
Of course this can be done for ˆ t NV 8 ŽC . Ž N < u.
with V Ž n.
analogously. The s V4t Ž C Ž n. .
X X
w x
abstract results of Tsuboi 44,45 for Uq gl m,n , specialized to Uq gl Ž2,1.. allow us to
Ž Ž .. Ž
obtain the eigenvalues given above, without any Bethe ansatz calculation, if the notation
is adjusted to the one used here. On the other hand, the present method could be used
without difficulties to obtain eigenvalues corresponding to higher dimensional atypical
representations V2atnq1 with n g N, which is omitted here, because their physical
applications are not very realistic at all. Frahm has done this in the q s 1-limit w47x. The
calculations of Tsuboi and Frahm basically rely on Schulz’s result w46x, respectively
Lai’s and Sutherland’s coordinate Bethe ansatz calculations w48,49x on the fundamental
representations and the fusion procedure explained above. Instead of calculating missing
eigenvalues algebraically like in w11x they obtain the eigenvalues for V different from
V3at in Ž1.2. by analytical considerations in the spirit of Reshetikhin’s analytical Bethe
ansatz w50x. This is admittedly very efficient and surely correct in the simple cases
considered so far, but seems to fail in more complicated situations in particular for
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 621
V s V Ž n. s V8t Ž C . in Ž1.2. w47x, which will be solved in the following. The reason seems
to be that their method in contrast to Reshetikhin’s original work requires some intuitive
input, which is difficult to fix especially in the latter case. This also prevents a proof of
the resulting equations, which remained in the state of a conjecture.
For V s V3at ,V4t Ž C X . and V Ž n. s V t Ž C Ž n. . in Ž1.2. the local vacuum amplitudes
V 8t ŽC . Ž .
vi u corresponding to the highest weight vector u1 : from ŽŽA.9.. can be obtained
via
t at t
v iV 8 ŽC . Ž u . , v iV 3 Ž u y Ž C q 1 . h . P v iV4 ŽC . Ž u q h . , Ž 5.24 .
which is the analog of 5.13 . However, in order to demonstrate the generality of the
Ž .
present approach the lowest weight vector u 4 : from ŽA.9. will be taken as local
pseudo-vacuum here. Since this is not a direct product of lowest weight vectors, one has
to calculate the action of the direct product V l w : of these vectors for V s V3at and
V Ž n. s V8t Ž C Ž n. . in Ž1.2a. directly from Ž5.2c. with u1 s yŽ C q 1.h , u1 s yh by
means of ŽA.9., Ž4.12. with ŽC.1. and Ž4.14. with ŽC.3.. This yields
vŽN1 . Ž u . 0 0
V 3at
Tˆa Ž 1, N < u . V l w :s
Bˆ1 Ž u .
Bˆ2 Ž u .
v NŽ2. Ž u .
Bˆ3 Ž u q h .
0
v NŽ3.
which is apparently different from the known cases Ž5.5., Ž5.14. and Ž5.19., where at
Ž u. 0 a
V l w :, Ž 5.25 .
least some v NŽ i. Ž u. coincide. One might look for a different pseudo-vacuum – not
necessarily a product of highest or lowest weight vectors, in which Ž5.25. takes one of
these forms. An inspection of all local basis vectors from ŽA.9. shows not only that such
a vector does not exist, but also that the appearance of three non-trivial off-diagonal
operators Bˆi Ž u. is generic.
The general eigenvectors are too complicated to be discussed here, but it is easy to
verify by a direct algebraical calculation that
l1 , . . . , l M :1 s Bˆ1 Ž l1 . . . . Bˆ1 Ž l M . V l w : , Ž 5.26a .
which is totally symmetric in the Bethe parameters l1 , . . . , lM and
m 1 , . . . , m m :2 s Bˆ3 Ž m 1 . . . . Bˆ3 Ž m m . V l w : , Ž 5.26b .
V at3 Ž
which is totally antisymmetric in m 1 , . . . , m m , are eigenvectors of tˆ N < u., provided
these parameters solve the Bethe equations
N sinh Ž la y d Ž i. y Ž C q 1 . h .
Ł sinh Ž l Ž i.
is1 ayd y Ž C y 1. h .
M sinh Ž la y l j y 2h . m sinh Ž la y m k y h .
s Ł Ł Ž 5.27a .
js1, j/ a sinh Ž la y l j q 2 h . ks1 sinh Ž la y m k q h .
for a s 1, . . . M and
N sinh Ž mb y d Ž i. q Ž C q 2 . h . M sinh Ž l j y mb q h .
Ł sinh Ž i.
sŁ for b s 1, . . . ,m .
is1 Ž mb y d y Ž C q 2. h . js1 sinh Ž l j y mb y h .
Ž 5.27b .
622 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
3
The corresponding eigenvalue equations for tˆ Vat Ž N < u. with V Ž n. s V8t Ž C Ž n. . read
at
LVN3 Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j q 2h . M sinh Ž u y l j y 2h .
s v NŽ1. Ž u . Ł q v NŽ2. Ž u . Ł
js1 sinh Ž u y l j . js1 sinh Ž u y l j .
m sinh Ž u y m k q h . m sinh Ž u y m k q h .
=Ł y v NŽ3. Ž u . Ł Ž 5.28a .
ks1 sinh Ž u y m k y h . ks1 sinh Ž u y m k y h .
with the local vacuum amplitudes
t
v 1V 8 ŽC . Ž u . s sinh Ž u q Ž C q 1 . h . sinh Ž u y Ž C q 1 . h . ,
t
v V2 8 ŽC . Ž u . s sinh Ž u q Ž C q 1 . h . sinh Ž u y Ž C y 1 . h . ,
t
v V3 8 ŽC . Ž u . s sinh Ž u y Ž C q 3 . h . sinh Ž u y Ž C y 1 . h . . Ž 5.28b .
Similarly one has to calculate the action of V l w : on the monodromy matrix Ž1.2a. for
V s V4t Ž C X . and V Ž n. s V8t Ž C Ž n. . from Ž5.2c. with u 0 s yŽ C q 1.h , u1 s yh , by means
of ŽA.9., Ž4.16. with ŽC.1. and Ž4.13. with ŽC.2., which yields
vŽN1 . Ž u . 0 0 0
at
TˆaV 3 Ž 1, N < u . V l w :s
)
)
)
v NŽ2.
0
0
Ž u. )
v NŽ3. Ž u .
)
)
0
v Ž4. Ž u .
0 a
V l w :,
Ž 5.29 .
where a star denotes some non-trivial off-diagonal entry. SoX the method from Ref. w11x
4
can be applied in order to calculate the eigenvalues of tˆ V t ŽC . Ž N < u. corresponding to the
simple eigenvectors Ž5.26. algebraically:
t X
LVN4 ŽC . Ž u < l1 , . . . , l M < m 1 , . . . , m m .
m sinh Ž u y m k q Ž C X q 1 . h .
s v NŽ1. Ž u. Ł
ks1 sinh Ž u y m k y Ž C X y 1 . h .
m sinh Ž u y m k q Ž C X q 1 . h .
q v NŽ2. Ž u . Ł
ks1 sinh Ž u y m k y Ž C X q 1 . h .
M sinh Ž u y l j y Ž C X y 2 . h .
y v NŽ3. Ž u . Ł
js1 sinh Ž u y l j y C Xh .
m sinh Ž u y m k q Ž C X q 1 . h .
=Ł
ks1 sinh Ž u y m k y Ž C X y 1 . h .
M sinh Ž u y l j y Ž C X q 2 . h .
y v NŽ4. Ž u . Ł
js1 sinh Ž u y l j y C Xh .
m sinh Ž u y m k q Ž C X q 1 . h .
=Ł Ž 5.30a .
ks1 sinh Ž u y m k y Ž C X q 1 . h .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 623
at
q
žŁ Ž Ž is1
5
/
sinh u q C Ž i . q 2 . h y d Ž i . . sinh Ž u y C Ž i . h y d Ž i . . tˆ V5 Ž N < u . .
The corresponding eigenvalue equation for tˆ Vat Ž N < u. with V Ž n. s V8t Ž C Ž n. . can be
written as
at
LVN5 Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j q 3h .
s v NŽ1. Ž u . Ł
js1 sinh Ž u y l j y h .
M sinh Ž u y l j q 3h . sinh Ž u y l j y 3h .
q v NŽ2. Ž u . Ł
js1 sinh Ž u y l j q h . sinh Ž u y l j y h .
m sinh Ž u y m k . M sinh Ž u y l j y 3h .
=Ł q v NŽ3. Ž u . Ł
ks1 sinh Ž u y m k y 2h . js1 sinh Ž u y l j q h .
m sinh Ž u y m k q 2h . M sinh Ž u y l j q 3h .
=Ł y v NŽ4. Ž u . Ł
ks1 sinh Ž u y m k y 2h . js1 sinh Ž u y l j q h .
m sinh Ž u y m k . M sinh Ž u y l j y h .
=Ł y v NŽ5. Ž u . Ł
ks1 sinh Ž u y m k y 2h . js1 sinh Ž u y l j q h .
m sinh Ž u y m k q 2h .
=Ł Ž 5.31a .
ks1 sinh Ž u y m k y 2h .
with the local vacuum amplitudes
t
v 1V 8 ŽC . Ž u . s sinh Ž u q Ch . sinh Ž u y Ž C q 2 . h . ,
t
v V2 8 ŽC . Ž u . s sinh Ž u q Ch . sinh Ž u y Ch . ,
t
v V3 8 ŽC . Ž u . s sinh Ž u q Ch . sinh Ž u y Ž C y 2 . h . ,
t
v4V 8 ŽC . Ž u . s sinh Ž u y Ch . sinh Ž u y Ž C q 4 . h . ,
t
v 5V 8 ŽC . Ž u . s sinh Ž u y Ž C y 2 . h . sinh Ž u y Ž C q 4 . h . . Ž 5.31b .
624 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
/ 8
N
y
ž Ł sinh Ž u y dŽ i . y Ž C y C y 2. h . sinh Ž u y d Ž i. y Ž C q C q 2. h .
is1
X
X X
/
t
=tˆ V4 ŽC q1 . Ž N < u . .
t X
The corresponding eigenvalues of tˆ V 8 ŽC . Ž N < u. follow as
t X
LVN8 ŽC . Ž u < l1 , . . . , l M < m 1 , . . . , m m .
M sinh Ž u y l j q Ž C X q 3 . h . m sinh Ž u y m k q C Xh .
s v NŽ1. Ž u . Ł Ł
js1 sinh Ž u y l j q Ž C X q 1 . h . ks1 sinh Ž u y m k y C Xh .
M sinh Ž u y l j q Ž C X y 1 . h . m sinh Ž u y m k q Ž C X q 2 . h .
q v NŽ2. Ž u . Ł Ł
js1 sinh Ž u y l j q Ž C X q 1 . h . ks1 sinh Ž u y m k y C Xh .
M sinh Ž u y l j q Ž C X q 3 . h . m sinh Ž u y m k q C Xh .
q v NŽ3. Ž u. Ł Ł
js1 sinh Ž u y l j q Ž C X q 1 . h . ks1 sinh Ž u y m k y Ž C X q 2 . h .
M sinh Ž u y l j q Ž C X y 1 . h . m sinh Ž u y m k q Ž C X q 2 . h .
q v NŽ4. Ž u . Ł Ł
js1 sinh Ž u y l j q Ž C X q 1 . h . ks1 sinh Ž u y m k y Ž C X q 2 . h .
M sinh Ž u y l j q Ž C X q 3 . h . sinh Ž u y l j y Ž C X y 1 . h .
y v NŽ5. Ž u. Ł
js1 sinh Ž u y l j q Ž C X q 1 . h . sinh Ž u y l j y Ž C X q 1 . h .
m sinh Ž u y m k q C Xh . M sinh Ž u y l j q Ž C X q 3 . h .
=Ł y v NŽ6. Ž u . Ł
ks1 sinh Ž u y m k y C Xh . js1 sinh Ž u y l j q Ž C X q 1 . h .
sinh Ž u y l j y Ž C X q 3 . h . m sinh Ž u y m k q C Xh .
= Ł
sinh Ž u y l j y Ž C X q 1 . h . ks1 sinh Ž u y m k y Ž C X q 2 . h .
M sinh Ž u y l j q Ž C X y 1 . h . sinh Ž u y l j y Ž C X y 1 . h .
y v NŽ7. Ž u . Ł
js1 sinh Ž u y l j q Ž C X q 1 . h . sinh Ž u y l j y Ž C X q 1 . h .
m sinh Ž u y m k q Ž C X q 2 . h . M sinh Ž u y l j q Ž C X y 1 . h .
=Ł y v NŽ8. Ž u. Ł
ks1 sinh Ž u y m k y C Xh . js1 sinh Ž u y l j q Ž C X q 1 . h .
sinh Ž u y l j y Ž C X q 3 . h . m sinh Ž u y m k q Ž C X q 2 . h .
= Ł Ž 5.32a .
sinh Ž u y l j y Ž C X q 1 . h . ks1 sinh Ž u y m k y Ž C X q 2 . h .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 625
The completely analogous calculations for the highest weight Bethe ansatz starting
from Ž5.24. will be left for the reader.
In principle all results concerning non-fundamental representations in auxiliary space
in this section could be achieved by directly applying the method from Ref. w11x, without
reference to the fusion procedure. The latter however reduces the effort considerably, so
that it would be unwise to disregard this tool. On the other hand, rigorous results from
fusion require a complete input, which in general can be provided only by calculations
similar to the one in w11x. The R-matrices derived in Section 4 provide the elementary
building blocks of the fusion hierarchy that have to be diagonalized by direct methods in
order to obtain the consecutive results almost trivially by fusion. This justifies to call
them fundamental.
¨ in
Finally it should be noted that completeness of all extremal weight Bethe ansatze
the sense that missing states can be produced from them by applying Uq Ž gl Ž2,1;C..
626 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
generators, and equivalence of highest and lowest weight eigenvalue equations can be
proved by well known standard arguments not to be repeated here.
6. Conclusions
$ X
Uq Ž gl Ž2,1;C.. symmetry has been explained in Section 2; some low-dimensional
representations, relevant for physical applications, have been given in Section 3; the
fundamental R-matrices have been constructed in Section 4. Eigenvalue equations of the
corresponding transfer matrices have been obtained in Section 5 by combining Bethe
ansatz calculations with the fusion procedure. This procedure can be iterated to higher
dimensional representations along the lines presented here. Partially this can be read off
from the results of Tsuboi, who worked with a purely analytical ansatz w44,45x. Several
conclusions may be drawn: Baxter noted w6x that all coefficients of R-matrices,
calculated in Sections 4 and 5 generically, are meromorphic functions in q, u and here
also in C. Therefore they can be analytically continued to almost all complex q-, u- and
C-values. All singularities can be removed by multiplying the R-matrices with the
vanishing denominators, as was done in Section 5. The remainingX zeroes indicate that
generically irreducible representations become reducible, i.e. Rˇ Vd VdX Ž u. Ž4.4b. is not an
intertwiner at these points, see e.g. Ref. w17x. The restriction to real q and C values in
Section 3 is due to the fact that an Euclidean metric has been chosen in Ž2.1a.. A simple
‘‘deformation’’ of the latter, e.g. absorbing some factors in the definition of vectors and
1-forms, leads to different expressions for the weight vector bases in Appendix A, but to
the same intertwiners listed in Appendix B. Therefore the product decompositions in
Section 3, hold for all generic q and C values, if a convenient metric has been chosen
for Vd and VdXX . The q-deformed Clebsch–Gordan coefficients
X
appearing implicitly in
Appendix A are artificial. The intertwiners RˇVd VdX Ž u. in the form Ž4.6. provide a
complete, metric-free characterization of G Vxd m G VydXX Ž3.13.. In addition ratios of Drin-
fel’d polynomials, which characterize the structure of the tensor products involved in the
$
construction of Ž1.2. appear on the left-hand side at least once in any set of Bethe
X
equations. Tsuboi formulated his results on Uq Ž gl Ž m,n;C.. in terms of roots and Dynkin
diagrams, not used here, which is a further step in establishing this viewpoint w44,45x.
However, many details are not understood at present and some work by mathematicians
would be appreciated.
The following physical aspect is easier to understand. Define
X X X X X X
R˜V4 V4 Ž C,C X . :sR V4 V4 Ž u . us0 s P
ˇ V4 V4 ( P1V4V4 q P2V4V4 q P3V4V4 . Ž 6.1a .
This is a purely UqŽglŽ2,1;C..-symmetric R-matrix. Ž4.17c. with u s v s 0 can be
written as
X XX X XX
R˜12
V4 V4
Ž C,C X . R˜13
V4 V4
Ž C,C XX . R˜V234 V4 Ž C X ,C XX .
X XX XX X
s R˜V234 V4 Ž C X ,C XX . R˜13
V4 V4
Ž C,C XX . R˜12
V4 V4
Ž C,C X . , Ž 6.1b .
which is the general form Ž1.1a. of the YBE. Note that C s C does imply V4 ( V4X only
X
up to a ‘‘deformation of the metric’’. This and not the strict identification is however
already sufficient to establish the Hamiltonian limit. Choosing all parameters in V4 and
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 627
t X
V4X with C :s C Ž n. s C X real, the logarithmic derivative of tˆ V4 ŽC . Ž N < u. with respect to
u at u s 0 yields the Hamiltonian of the 2-parameter model with periodic boundary
conditions, which was established previously by Bariev et al. w10x, only for U 0 y1
ŽHubbard parameter. w9x, while the eigenvalues from Ž5.9. or its highest weight
counterpart are continuous at this point. However, staggering the V Ž n. ( V4t Ž C . conve-
niently allows to continue the Hamiltonian to U ( y1, as is expected for physical
reasons. In addition phase factors can be assigned to particle operators this way, which
are known not to change the eigenvalues. Note that this Hamiltonian is not Hermitian in
general and may find applications in non-equilibrium physics w51x.
Bariev et al. found that the q-deformed Bariev model, whose Hamiltonian violates
parity invariance admits long-range pair correlations, which are regarded as a one-di-
mensional analog of superconductivity w10x. Using X
C X as a spectral parameter, as
V 4t ŽC . Ž
indicated above, the logarithmic derivative of tˆ N <0. with respect to C X at C X s C
turns out to violate parity too and can be regarded as a conserved 2-particle current w51x.
It is plausible that long range pair correlations are due to the existence of this conserved
quantity, which trivially commutes with the Hamiltonian.
at
For V s V Ž n. s V5at in Ž1.2. the logarithmic derivative of tˆ V5 Ž N < u. with respect to u
at u s 0 yields a generalization of the spin-1 generalization of the tJ model’’, treated by
¨ and Tsvelick w52x. Its eigenvalues can be obtained from Ž5.18.. The
Frahm, Pfannmuller
quantization and diagonalization of higher dimensional atypical representations V2atnq1
with n g N is a simple generalization of the V3at and V5at cases. Tsuboi has shown, how
to obtain the eigenvalues w44,45x. It seams however difficult to find a realistic physical
application for these. X
t
On the other hand, the conventional Hamiltonian limit of tˆ V 8 ŽC . Ž N < u., diagonalized
in Ž5.32., seems very promising. Obviously the lowest weight Bethe ansatz Ž5.25. does
not allow to incorporate lower dimensional local quantum spaces. The same is true for
the highest weight ansatz only mentioned in passing. Preliminary results are available
and will be published elsewhere w53x. The existence of a second Hamiltonian limit
analogous to Ž6.1. is obvious. Its physical meaning is still unclear at the moment.
In general the solution of the underlying statistical model provides not only the
integrals of motion for the corresponding Hamiltonian, but is also the starting point for
the calculation of thermodynamical properties in the exact approach via the Trotter-
Suzuki mapping pioneered by Klumper ¨ w54,55x, which is now well developed in order to
treat graded models, related to limiting cases of the ones treated here, by Juttner, ¨
¨
Klumper and Suzuki w56,57x, culminating in Ref. w58x. The author hopes to come back to
some of these interesting problems in the near future.
Acknowledgements
This work has been performed within the research program of the Sonderforschungs-
¨ ¨ .. The author thanks J. Zittartz and A. Klumper
bereich 341 ŽKoln-Aachen-Julich ¨ for
¨
continuous support, G. Juttner, Y. Kato, K. Klauck, A. ¨
Klumper, J. Suzuki, A. Zvyagin,
and especially A. Fujii for stimulating discussions and encouragement. Special thanks
¨
goes to A. Klumper for carefully reading the manuscript and useful suggestions,
incorporated in the final version.
628 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
The weights for the modules Ž Vd , G Vd . are defined in Section 3, Eq. Ž3.4.. All weight
vectors are chosen to be orthonormal. To achieve this, q is assumed to be real and
positive, while k , k ) and m , m ) are taken to be complex conjugate to each other.
For the V3-module Ž3.5. the basis vectors Ž2.1a. are weight vectors Ž3.4a.. The
corresponding weights are
LV 3 s l1 s ya 2 ,
l2 s LV 3 y a 1 ,
l3 s LV 3 y a 1 y a 2 . Ž A.1 .
G V4 Ž3.7. is irreducible, iff C / 0,y 1, which is assumed from here on.
For the V4-module again the basis vectors Ž2.1a. are weight vectors Ž3.4a. and the
corresponding weights are
LV4 s l1 s yC a 1 y 2C a 2 ,
l2 s LV4 ya 1 y 2 a 2 ,
l3 s LV4 ya 2 ,
l4 s LV4 ya 1 y a 2 . Ž A.2 .
Ž V4 , G V . can be characterized by fixing C Ž3.7a..
4
The elements G V4 of the weight vector basis of
Ž V4 , G V . ( V4t Ž C q 1.
4
Õ 2 :s 3 :m 2 : ,
Cq1 1
1 y
Õ 3 :s
(w C q 2x q
½ q 2 2
1 :m 2 :y m q 3 :m 3 : , 5
Cq1 1
1 y
Õ4 :s
(w C q 2x q
½ q 2 2 :m 2 :y m q 2 3 :m 4 : . 5 Ž A.3 .
ž V˜
4
Ž1.
, G V˜4Ž 1 . ( V4t Ž C q C X .
/
are u˜ i :, i :. In the product basis of V4 m V4X , they read
u˜ 1 :s 1 :m 1 : ,
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 629
X
u˜ 2 :s w C x q w C q 1 x q qy2 C q w C X x q w C X q 1 x q q 2 C
1
X y
q w 2 x q w C q 1 x q w C X q 1 x q q C yC 4 2
½
= k X )mX ) q C 1 :m 2 :q k )m ) qyC 2 :m 1 :
C yC
X
1 1
y
yk k q ) X) 2
q 2 3 :m 4 :y q 2 4 :m 3 :
5 ,
X
C C
1 y 2
u˜ 3 :s
(w C x qq
yC
q w CX x q q C
X ½ kq 2 1 :m 4 :q k q X)
5
4 :m 1 : ,
X
C C
1 y 2
u˜ 4 :s
(w C x qq
yC
q wC xq q X C
X ½ kq 2 1 :m 3 :q k X ) q 5
3 :m 1 : . Ž A.4 .
ž V˜
4
Ž 2.
, G V˜4Ž 2 . ( V4t Ž C q C X q 1 .
/
are w˜ i :, i : or explicitly
X
w˜ 1 :s w C x q w C q 1 x q qy2 Ž Cq1 . q w C X x q w C X q 1 x q q 2 Ž C q1 .
1
X y
q w 2 x q w C q 1 x q w C X q 1 x q q C yC 4 2
½
= km qy Ž Cq1 . 1 :m 2 :q km
X
X X C q1
q 2 :m 1 :
C yC 1
1
qmm q X 2
qy 2 3 :m 4 :y q 2 4 :m 3 :
5 ,
1
w˜ 2 :s 2 :m 2 : w˜ 3 :s X
(w C q 1x qq
yC y1
q w C X q 1 x q q C q1
X
Cq1 C q1
y
= mq½ 2 3 :m 2 :q m q X 2
5
2 :m 3 : w̃4 :
1
s X
(w C q 1x qq
yC y1
q w C X q 1x q q C q1
X
Cq1 C q1
y
½
= mq 2 4 :m 2 :q m q X 2
2 :m 4 : . 5 Ž A.5 .
630 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
ž Vˇ , G ˇ / ( V Ž 1.
4 V4 4
t
The Ž3,2.-graded carrier space Vˇ5 of the atypical representation G Vˇ5 is spanned by
weight vectors, expanded in the product basis Ž2.3. of V3 m V3
uˇ 1 :s 1 :m 1 :,
1 1
1 y
uˇ 2 :s
(w2x q
½ 2
q 1 :m 2 :q q 2
5
2 :m 1 : ,
uˇ 3 :s 2 :m 2 :,
1 1
1 y
uˇ 4 :s
(w2x q
½ 2
q 1 :m 3 :q q 2
5
3 :m 1 : ,
1 1
1 y
uˇ 5 :s
(w2x q
½ q 2 2 :m 3 :q q 2
5
3 :m 2 : . Ž A.7 .
1
1 Cq1
2
u 4 :s
(w C q 2x q
½ y
q 2 3 :m 4 :q m q 2 :m 2 : , )
5
u 5 :s 1 :m 3 :,
1
u 6 :s
(w2x q w C q 2x q
1
C 1
½
= w2x q q y
2 3 :m 1 :q k q q ) y
2
2
1 :m 4 :y q 2 :m 3 :
5 ,
1
1 1
u 7 :s
(w2x q
½ q 2 1 :m 4 :q qy 2 2 :m 3 : , 5
u 8 :s 2 :m 4 :. Ž A.9 .
The corresponding weights are
LV 8 s l1 s yC a 1 y Ž 2C q 1 . a 2 ,
l2 s LV 8 y a 1 ,
l3 s LV 8 y a 1 y 2 a 2 ,
l4 s LV 8 y 2 a 1 y 2 a 2 ,
l5 s L V 8 y a 2 ,
l6 s l 7 s L V 8 y a 1 y a 2 ,
l8 s L V 8 y 2 a 1 y a 2 . Ž A.10 .
Obviously Ž V8 , G V 8 . can also be labeled by C Ž3.10a.. Finally the weight vector basis of
ž V˜ , G ˜ / ( V Ž C q C . ,
8 V8 8
t X
X
C C
1 y 2
Õ˜ 2 :s
(w C x qq
yC
q w CX x q q C
X
½ kq 2 1 :m 4 :y k X ) q 5
4 :m 1 : ,
1
Õ˜ 3 :s X
(w C q 1x qq
yC y1
q w C X q 1 x q q C q1
X
Cq 1 C q1
y
½
= m q ) 2
2 :m 3 :y m q X 2
5
3 :m 2 : ,
632 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
1
Õ˜4 :s X
(w C q 1x qq
yC y1
q w C X q 1 x q q C q1
X
Cq 1 C q1
y
= m) q ½ 2
2 :m 4 :y mX q 2
4 :m 2 : , 5
Õ˜5 :s 3 :m 3 :,
X X 2
Õ˜6 :s w 2 x q q Cy C Ž w C x q w C q 1 x q qy2 Cy1 y w C X x q w C X q 1 x q q 2 C q1 .
½
X X 2
q Ž w C x q qyC q w C q 1 x q qyCy1 q w C X x q q C q w C X q 1 x q q C q1 .
1
y
= Ž w C q 1x q w C X x q q w C x q w C X q 1x q . 5 2
X
Cy C
X
=
½ Ž w C x q w C q 1x q q
1
y2 Cy1
1
X
y w C x q w C q 1x q q X 2 C q1
.q 2
y
= q 2 3 :m 4 :y q 2 4 :m 3 :
X X
q Ž w C x q qyC q w C q 1 x q qyC y1 q w C X x q q C q w C X q 1 x q q C q1 .
= m )k X 2 :m 1 :y kmX ) 1 :m 2 :
1 1
5 ,
1 y
Õ˜ 7 :s
w2x q ½ 2
q 3 :m 4 :q q 2 4 :m 3 : , 5
Õ˜ 8 :s 4 :m 4 :. Ž A.11 .
Cq2 1 1
k) 2 2
y
2
y q q e 34 m e11 y q e 33 m e 21
w C q 2x q
Cq 2
m) 2
q q e 23 m Ž e13 q e 24 .
w C q 2x q
Cq 2
m y
q 2
q Ž e31 q e42 . m e32 . Ž B.1 .
w C q 2x q
For V˜4Ž1. ; V4 m V4X ŽA.4. the same now with a positive sign in Ž4.7. procedure leads to
X X
X k )m )k m X
P1V4 V4 s e11 m e11 q q CqC e11 m e 22
w C q C X x q w C q C X q 1x q
kmk X )mX ) X
q X X
qyŽ CqC . e 22 m e11
w C q C x q w C q C q 1x q
X
Cq C
k )k X 2
q q e11 m Ž e 33 q e 44 .
w C q CX x q
Cq C X
kk X ) y
q 2
q Ž e33 q e44 . m e11
w C q CX x q
w C x q w CX x q
q
w C q C X x q w C q C X q 1x q
Ž qy1 e33 m e44 q qe44 m e33 .
1
q
w C q C X x q w C q C X q 1x q
= w C x q w C q 1 x q e12 m e 21 q w C X x q w C X q 1 x q e 21 m e12 4
w C x q w CX x q
y Ž e me qe me .
w C q C X x q w C q C X q 1 x q 34 43 43 34
1 X
q w C x q Ž e31 m e13 q e41 m e14 .
w C q CX x q
1
y w C x q Ž e13 m e 31 q e14 m e 41 . 4 q
w C q C x q w C q C X q 1x q
X
° CqC
X
1 1
=~q
y 2
¢
2 X
m k wC xq q
)
ž 2e
13 m e 24 y q e14 m e23 /
1 1
y 2
yk m w C X x q q
) X
ž 2e
31 m e 42 y q e 41 m e 32 /
634 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
CqC X 1
y 1
2
qq 2 X)
km w C x q q X
ž y
2 e 23 m e14 y q e 24 m e13 /
1
1
2
¶•
X)
ymk w C x q q ž y
2 e 32 m e 41 y q e 42 m e 31 /ß Ž B.2 .
= w C X x q w C X q 1 x q e12 m e 21 q w C x q w C q 1 x q e 21 m e12 4
w C q 1x q w C X q 1x q
y Ž e me qe me .
w C q C X q 1 x q w C q C X q 2 x q 34 43 43 34
1 X
y w C q 1x q Ž e32 m e23 q e42 m e24 .
w C q C X q 2x q
1
y w C q 1 x q Ž e 23 m e 32 q e 24 m e 42 . 4 q
w C q C q 1x q w C q C X q 2x q
X
° CqC Xq2 1 1
=~q
y y
2
¢ 2 X
m k w C q 1 x q q e14 m e 23 y q
) X
ž 2
e13 m e 24 /
1 1
yk m w C q 1 x q q e 41 m e 32 y q 2 e31 m e42
) X
ž 2
/
X
CqC q2 1 1
y
2 2
qq X)
km w C q 1 x q q e24 m e13 y q ž 2
e 23 m e14 /
1
y
1
2
¶•
ž
ymk X ) w C X q 1 x q q 2 e42 m e 31 y q e 32 m e 41 / ß. Ž B.3 .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 635
For Vˇ4 ; V3 m V3 ŽA.6. the result is now again with a minus sign in Ž4.7.
1
P2V 3V 3 s e 33 m e 33 y w e12 m e21 y e13 m e31 q e21 m e12 y e23 m e32
w2x q
q
qe31 m e13 q e 32 m e 23 x q e 22 m e11 q e33 m Ž e11 q e22 .
w2x q
qy1
q e11 m Ž e 22 q e 33 . q e 22 m e 33 . Ž B.4 .
w2x q
The intertwiner P1V 3V 3 for Vˇ5 ; V3 m V3 ŽA.7. is simply related to P2V 3V 3 ŽB.4. by Ž4.8c.,
i.e.
P1V 3V 3 s IV 3mV 3 y P2V 3V 3 . Ž B.5 .
In addition, the limit q ™ 1 is particularly simple, namely
P1V 3V 3 1
qs 1 s 2 žI V 3mV 3 q P
ˇ V 3V 3 / ,
P2V 3V 3 1
qs 1 s 2 žI V 3mV 3 y P
ˇ V 3V 3 / . Ž B.6 .
which was known to Kulish and Sklyanin long ago w27x. The intertwiner for V8 ; V3 m V4
ŽA.9. is obtained from Ž4.7. with a positive sign:
1
y q Cq 1 e 31 m e 24 q qyCy1 e 42 m e13
w C q 2x q
Cq2 1 1
k 2
y 2
q q 2e
q 11 m e 34 y q e12 m e 33
w C q 2x q
Cq2 1 1
k) y y
2
q q 2 q 2 e 34 m e11 y q e 33 m e 21
w C q 2x q
Cq 2
m) 2
y q e 23 m Ž e13 q e 24 .
w C q 2x q
Cq 2
m y
q 2
y Ž e31 q e42 . m e32 . Ž B.7 .
w C q 2x q
636 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
X
Finally P2V4 V4 corresponding to V˜8 ; V4 m V4X ŽA.11. is given by
X
P2V4 V4 s e 33 m e33 q e 44 m e 44
w2x q X X
q k )m )km e11 m e 22 q kmk X )mX ) e 22 m e11
ag
Cq C X
k )k X CqC
X kk X ) y
q e11 Ž e33 q e 44 . q q 2
q Ž e33 q e44 . m e11
a a
Cq C Xq2
mmX ) y
q q 2 e 22 m Ž e 33 q e 44 .
g
X
Cq C q2
m )mX 2
q q Ž e33 q e44 . m e22
g
q d 2 qy1 qy1 d 2q
q
ž w2x q
y
w 2 x q ag / e 33 m e 44 q
ž w2x q
y
w 2 x q ag / e 44 m e 33
w2x q X X
y w C q 1x q w C x q e12 m e21 q w C x q w C q 1x q e21 m e12 4
ag
1 X
q w C x q Ž e13 m e31 q e14 m e41 .
a
y w C x q Ž e 31 m e13 q e 41 m e14 . 4
1 X
q w C q 1x q Ž e23 m e32 q e24 m e42 .
g
y w C q 1 x q Ž e 32 m e 23 q e 42 m e 24 . 4
1 d2
q
ž w2x q
q
w 2 x q ag / w e34 m e43 q e43 m e34 x
1
1
1
y
ag ½ m k´ q
) X
y
2e
13 m e 24 y q
1
2
e14 m e 23
1
y
X) 2 2
ymk h q e 42 m e 31 y q e 32 m e 41
1 1
y 2
X) 2e
ykm ´ q 23 m e14 q q e 24 m e13
1 1
y
X
qk mh q e 41 m e 32 y q
) 2 2
e 31 m e 42
5 , Ž B.8 .
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 637
The elements of the R-matrix Ž4.12. are explicitly given by r Ž u. from Ž4.9b. and
1
a 0 Ž u . :s w C q 1x q r Ž u . y 14 ,
w C q 2x q
1
b 0 Ž u . :s w2x q r Ž u . y w C x q 4 ,
w C q 2x q
1
g 0 Ž u . :s r Ž u . y w C q 1x q 4 ,
w C q 2x q
1
d 1 Ž u . :s r Ž u . qyC y1 q q 4 ,
w C q 2x q
1
d 2 Ž u . :s r Ž u . q Cq 1 q qy1 4 ,
w C q 2x q
C C
m) y y1 q1
x ½ 5
´ 1 Ž u . :s rŽ . u q 2 qq 2 ,
wCq2 q
C C
m q1 y y1
x ½ 5
2 2
´ 2 Ž u . :s rŽ . u q qq ,
wCq2 q
638 J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640
Cq1 Cq 1
k) y
z 1 Ž u . :s
w C q 2x q ½ rŽ . u q 2 qq 2
5 ,
Cq 1 Cq1
k y
x ½ 5
2 2
z 2 Ž u . :s rŽ . u q qq . Ž C.1 .
wCq2 q
X
The coefficients of R V4 V4 Ž u. Ž4.12. are explicitly given by f Ž u., g Ž u. Ž4.10b. and
X X
k )m )k m X X
r1 s
abg
Ž g q Cq C f Ž u . q w2x q b q a qyCyC y2 g Ž u . . ,
kmk X )mX ) X X
r 1X s
abg
Ž g qyC yC f Ž u . q w2x q b q a q CqC q2 g Ž u . . ,
X
Cq C Cq C X
k )k X 2
y
r2 s ž
a
q f Ž u. q q 2
X
/ ,
X) Cq C X Cq C
kk y
2
r 2X s
a ž /
q 2 f Ž u. q q ,
X
X) CqC Xq2 Cq C q2
mm y
2
g ž /
r3 s q 2 qq g Ž u. ,
X
X Cq C q2 Cq C Xq2
X
mm )
2
y
g ž
r3 s q qq 2 g u Ž ./ ,
qy1 X
r4 s 1 q
abg
w C x q Ž w C x qg f Ž u . y w C q 1 x q b .
q w C X q 1x q Ž w C q 1x q a g Ž u . y w C X x q b . 4 ,
q
r4X s 1 q w C x q Ž w C X x qg f Ž u . y w C q 1 x q b .
abg
q w C X q 1x q Ž w C q 1x q a g Ž u . y w C X x q b . 4 ,
1 X
r5 s
abg
Ž w C x q w C q 1 x qg f Ž u . y w 2 x q w C x q w C q 1 x q b
q w C X xw C X q 1 x q a g Ž u . . ,
1
r5X s X X
Ž w C x q w C q 1 x qg f Ž u . y w 2 x q w C x q w C q 1 x q b X
abg
q w C xw C q 1 x q a g Ž u . . ,
X 1
Cq C q1 CqC Xq1
m )k X 2 y
r6 s
abg žw x
C qg q 2
f Ž u . y b´ q y w C q 1 x q a q X 2
X
/
g Ž u. ,
X
CqC q1 1 Cq C q1
kmX ) y y
2
X
r6 s
abg žw x C X
qg q
2 f Ž u . q b´ q y w C q 1x q a q 2
g Ž u. , /
J. Grunebergr Nuclear Physics B 568 [PM] (2000) 594–640 639
1 X
r7 s
a
Ž w C x q y w C x q f Ž u. . ,
1
r 7X s X
Ž w C x q y w C x q f Ž u. . ,
a
X 1
Cq C q1 CqC Xq1
k )mX 2 y
r8 s
abg žw X
C x qg q 2
f Ž u . y bh q y w C q 1 x q a q 2
X
/
g Ž u. ,
CqC Xq1 1 Cq C q1
mk X ) y y
2
X
r8 s
abg žw C x qg q 2 f Ž u . q bh q X
y w C q 1x q a q 2
g Ž u. ,/
1 X
r9 s
g
Ž w C q 1x q y w C q 1x q g Ž u . . ,
1
r 9X s X
Ž w C q 1x q y w C q 1x q g Ž u . . ,
g
1 X
r 10 s
abg
w C x q Ž w C q 1 x q b y w C x qg f Ž u . .
q w C X q 1x q Ž w C X x q b y w C q 1x q a g Ž u . . 4 . Ž C.2 .
where the abbreviations ŽB.9., as introduced in Appendix B, have been used for brevity.
Finally the coefficients of Ž4.14. are explicitly given as
sinh Ž 2h .
a Ž u . :s eyu ,
sinh Ž 2h q u .
sinh Ž 2h .
b Ž u . :s eu,
sinh Ž 2h q u .
sinh Ž u .
c Ž u . :s ,
sinh Ž 2h q u .
sinh Ž 2h y u .
d Ž u . :s . Ž C.3 .
sinh Ž 2h q u .
References
w12x G.W. Delius, M.D. Gould, Y.-Z. Zhang, Nucl. Phys. B 432 Ž1994. 377.
w13x A.J. Bracken, G.W. Delius, M.D. Gould, Y.-Z. Zhang, J. Phys. A 27 Ž1994. 6551.
w14x M. Jimbo, Comm. Math. Phys. 102 Ž1986. 537.
w15x P.P. Kulish, E.K. Sklyanin, Quantum Spectral Transform Method – Recent Developments, in: Integrable
Quantum Field Theories, ed. J. Hietarina, C. Montonen, Lecture Notes in Physics 151 ŽSpringer, Berlin,
1981..
w16x J.F. Cornwell, Group Theory in Physics, Vol. 3, Supersymmetry and Infinite-dimensional Algebras
ŽAcademic Press, London, 1989..
w17x V. Chari, A. Presley, A Guide to Quantum Groups ŽCambridge Univ. Press, Cambridge, 1994..
w18x H. Yamane, Publ. Proc. Japan Acad. Ser. A 70 Ž1994. 31.
w19x H. Yamane, q-algr9603015 Ž1996..
w20x H. Yamane, Publ. RIMS Kyoto Univ. 30 Ž1994. 15.
w21x V.G. Kac, Infinite-dimensional Lie Algebras, 3rd ed. ŽCambridge Univ. Press, Cambridge, 1990..
w22x O. Gabber, V.G. Kac, Bull. Amer. Math. Soc. 5 Ž1981. 185.
w23x V.G. Kac, in: Differential Geometrical Methods in Mathematical Physics, ed. K. Bleuler, H.R. Petry, A.
Reed Ž1978. 597–626.
w24x T.D. Palev, V.N. Tolstoy, Commun. Math. Phys. 141 Ž1991. 549.
w25x V.G. Drinfel’d, Sov. Math. Dokl. 32 Ž1985. 254.
w26x M. Jimbo, Lett. Math. Phys. 10 Ž1985. 63.
w27x P.P. Kulish, E.K. Sklyanin, J. Sov. Math. 19 Ž1982. 1596.
w28x S.M. Koroshkin, V.N. Tolstoy, Commun. Math. Phys. 141, 599-607.
w29x S.M. Koroshkin, V.N. Tolstoy, in: Proc. International Workshop on Mathematical Physics ‘Quantum
Symmetries’ ed. H.-D. Doebner, V.K. Dobrev Ž1993. 336–351.
w30x Z.-Q. Ma, Yang–Baxter Equation and Quantum Enveloping Algebras ŽWorld Scientific, Singapore,
1993..
w31x Z. Maassarani, sol-int9903003.
w32x N.Y. Reshetikhin, Quantized universal enveloping algebras, the Yang–Baxter equation and invariants of
links I, LOMI-preprint E-4-87 Ž1987..
w33x N.Y. Reshetikhin, Quantized universal enveloping algebras, the Yang–Baxter equation and invariants of
links II, LOMI-preprint E-17-87 Ž1987..
w34x P.P. Kulish, J. Sov. Math. 35 Ž1986. 2648.
w35x J.H.H. Perk, C.L. Schultz, Phys. Lett. A 84 Ž1981. 407.
w36x E.K. Sklyanin, L.A. Takhtajan, L.D. Faddeev, Theor. Math. Phys. 40 Ž1980. 688.
w37x L.A. Takhtajan, L.D. Faddeev, Russ. Math. Surveys 34 Ž1979. 11.
w38x L.D. Faddeev, Sov. Sci. Rev. C 1 Ž1980. 107.
w39x P.P. Kulish, N.Y. Reshetikhin, JETP 80 Ž1981. 158.
w40x Z. Maassarani, J. Phys. A 28 Ž1995. 1305.
w41x J. Ramos, M. Martins, J. Phys. A 30 Ž1997. L195.
w42x M. Martins, solv-int 9901002 and private communication.
w43x ¨
H. Frahm, M.P. Pfannmuller, Nucl. Phys. B 479 Ž1996. 575.
w44x Z. Tsuboi, Physica A 252 Ž1998. 565.
w45x Z. Tsuboi, J. Phys. A Math. Gen. 31 Ž1998. 5485.
w46x C.L. Schultz, Physica A 122 Ž1983. 71.
w47x H. Frahm, private communication.
w48x C.K. Lai, J. Math. Phys. 15 Ž1974. 1675.
w49x B. Sutherland, Phys. Rev. B 12 Ž1975. 3795.
w50x N.Y. Reshetikhin, Sov. Phys. JETP 57 Ž1983. 691.
w51x J. Gruneberg, unpublished.
w52x ¨
H. Frahm, M.P. Pfannmuller, A.M. Tsvelick, Phys. Rev. Lett. 81 Ž1998. 2116.
w53x J. Gruneberg, to be published.
w54x ¨
A. Klumper, Ann. Physik 1 Ž1992. 540.
w55x ¨
A. Klumper, Z. Phys. B 91 Ž1993. 507.
w56x ¨
G. Juttner, ¨
A. Klumper, J. Suzuki, Nucl. Phys. B 487 Ž1997. 650.
w57x ¨
G. Juttner, ¨
A. Klumper, J. Suzuki, J. Phys. A 30 Ž1997. 1181.
w58x ¨
G. Juttner, ¨
A. Klumper, J. Suzuki, Nucl. Phys. B 522 Ž1998. 471.
Nuclear Physics B 568 wPMx Ž2000. 641–646
´
Gomez Dumm, D. B563 Ž1999. 82 Jamin, M. B565 Ž2000. 3
´
Gonzalez-Arroyo, A. B564 Ž2000. 159 Janik, R.A. B565 Ž2000. 193
Gorbahn, M. B565 Ž2000. 3 Jegerlehner, F. B566 Ž2000. 423
Gould, M.D. B566 Ž2000. 529 Johansen, A. B562 Ž1999. 291
Gracey, J.A. B563 Ž1999. 390 Julia, B. B563 Ž1999. 448
Graf, G.M. B567 Ž2000. 231
Gray, J.P. B564 Ž2000. 225 Kaiser, D.I. B561 Ž1999. 188
Greene, B.R. B566 Ž2000. 599 Kakushadze, Z. B562 Ž1999. 78
Grianti, F. B563 Ž1999. 97 Kakushadze, Z. B562 Ž1999. 158
Grignani, G. B561 Ž1999. 243 Kaloper, N. B567 Ž2000. 189
Grillo, A.F. B564 Ž2000. 489 Kamani, D. B561 Ž1999. 57
Grojean, C. B561 Ž1999. 77 Kamimura, K. B561 Ž1999. 341
Grojean, C. B671 Ž1999. 133 Karabali, D. B566 Ž2000. 331
Gruneberg, J. B568 Ž2000. 594 Karsch, F. B564 Ž2000. 303
Guan, X.-W. B562 Ž1999. 433 Kawai, H. B565 Ž2000. 176
Guasch, J. B562 Ž1999. 3 Kawano, T. B565 Ž2000. 427
¨
Guijosa, A. B565 Ž2000. 157 Keski-Vakkuri, E. B563 Ž1999. 279
Guitter, E. B567 Ž2000. 515 Khalil, S. B564 Ž2000. 19
¨
Gursoy, U. B561 Ž1999. 473 Kilgore, W.B. B566 Ž2000. 252
Gusynin, V.P. B563 Ž1999. 361 Kim, C. B564 Ž2000. 430
Kim, C. B565 Ž2000. 611
Hagiwara, K. B565 Ž2000. 483 Kim, C. B566 Ž2000. 331
Halbersma, R. B564 Ž2000. 29 Kim, H.-C. B562 Ž1999. 213
Halyo, E. B561 Ž1999. 17 Kim, J.E. B564 Ž2000. 3
Hambye, T. B564 Ž2000. 391 Kinar, Y. B566 Ž2000. 103
Harikumar, E. B565 Ž2000. 385 King, S.F. B562 Ž1999. 57
Hasler, D. B567 Ž2000. 231 Kirsten, K. B563 Ž1999. 603
Hassan, M. B561 Ž1999. 385 Kitanine, N. B567 Ž2000. 554
Hassan, S.F. B568 Ž2000. 145 Kitazawa, Y. B565 Ž2000. 176
Hatsuda, M. B561 Ž1999. 341 Kleijn, B. B568 Ž2000. 475
Hauer, T. B562 Ž1999. 497 Kniehl, B.A. B563 Ž1999. 200
Hauer, T. B567 Ž1999. 61 Kniehl, B.A. B568 Ž1999. 305
Haug, O. B565 Ž2000. 38 ¨
Kohler, G.O. B564 Ž2000. 391
Hautmann, F. B563 Ž1999. 153 Korchemsky, G.P. B566 Ž2000. 203
He, H.L. B563 Ž1999. 97 Korff, C. B567 Ž2000. 409
Hebecker, A. B568 Ž2000. 287 Kosower, D.A. B563 Ž1999. 477
Henneaux, M. B563 Ž1999. 448 Kovalenko, S. B565 Ž2000. 38
Hikami, K. B566 Ž2000. 511 Kraus, P. B563 Ž1999. 259
Ho, P.-M. B568 Ž2000. 447 Kristjansen, C. B567 Ž2000. 515
Hofman, C. B561 Ž1999. 125 Krogh, M. B564 Ž2000. 259
Hoppe, J. B567 Ž2000. 231 Kruczenski, M. B563 Ž1999. 279
Hull, C.M. B561 Ž1999. 293 Kuang, H.H. B563 Ž1999. 97
Hull, C.M. B564 Ž1999. 29 Kummer, W. B565 Ž2000. 291
Kunszt, Z. B563 Ž1999. 153
Ignesti, G. B563 Ž1999. 97 Kuraev, E.A. B568 Ž2000. 40
Ilieva, N. B565 Ž2000. 629 Kurihara, Y. B565 Ž2000. 49
Incicchitti, A. B563 Ž1999. 97 Kuroda, M. B565 Ž2000. 49
Ishibashi, N. B565 Ž2000. 176 Kyae, B. B564 Ž2000. 3
Isidori, G. B566 Ž2000. 3
Iso, S. B565 Ž2000. 176 Lalak, Z. B563 Ž1999. 107
Izquierdo, J.M. B567 Ž2000. 293 Langlois, D. B565 Ž2000. 269
Langmann, E. B563 Ž1999. 506
¨
Jager, S. B565 Ž2000. 3 Larsen, F. B563 Ž1999. 259
Jakob, R. B564 Ž2000. 471 Lautenbacher, M.E. B565 Ž2000. 3
644 CumulatiÕe Author Index B561–B568
Urrutia, L.F. B566 Ž2000. 547 Wadia, S.R. B564 Ž2000. 103
Uwer, P. B563 Ž1999. 477 Wadia, S.R. B564 Ž2000. 128
Weigt, G. B568 Ž2000. 457
Vaidya, S. B564 Ž2000. 128 Wetterich, C. B562 Ž1999. 524
Vainshtein, A. B566 Ž2000. 311 Wilczek, F. B566 Ž2000. 33
Vairo, A. B566 Ž2000. 275 Williams, J. B566 Ž2000. 373
Valle, J.W.F. B566 Ž2000. 92 Woynarovich, F. B562 Ž1999. 497
Van Baal, P. B564 Ž2000. 159
Van der Loo, S. B562 Ž1999. 277 Xiong, Z. B561 Ž1999. 3
Van der Schaar, J.P. B564 Ž2000. 29
Vandoren, S. B568 Ž2000. 475
Van Gils, S. B562 Ž1999. 277 Yamada, Y. B566 Ž2000. 642
Van Neerven, W.L. B568 Ž2000. 263 Yang, B. B565 Ž2000. 611
Van Ritbergen, T. B564 Ž2000. 343 Yang, S.-K. B566 Ž2000. 642
Vassilevich, D.V. B563 Ž1999. 603 Yau, S.-T. B567 Ž2000. 231
´
Vazquez-Mozo, M.A. B568 Ž2000. 405 ´ F.J.
Yndurain, B563 Ž1999. 45
Veneziano, G. B568 Ž2000. 93 Yung, A. B562 Ž1999. 191
Veretin, O.L. B566 Ž2000. 469
Vergados, J.D. B565 Ž2000. 38 Zanon, D. B564 Ž2000. 241
Vergara, J.D. B566 Ž2000. 547 Zdesenko, Yu.G. B563 Ž1999. 97
Vinti, S. B562 Ž1999. 549 Zhang, Y.-Z. B566 Ž2000. 529
Vogelsang, W. B568 Ž2000. 60 Zhukov, S. B567 Ž2000. 61
Vogt, A. B568 Ž2000. 263 Zvyagin, A.A. B565 Ž2000. 555
Volkov, M.S. B566 Ž2000. 173 Zwiebach, B. B567 Ž2000. 61