Академический Документы
Профессиональный Документы
Культура Документы
Accruals Models*
*
We are grateful for helpful comments and suggestions provided by David Mest, Ashni Singh, Andrew
Stark, participants at the 1999 Annual Meeting of the American Accounting Association, participants at
the 1999 Financial Reporting and Business Communication Conference, and two anonymous
reviewers. Financial support was provided by the Research Board of the Institute of Chartered
Accountants in England and Wales, The Leverhulme Trust, and the Economic and Social Research
Council. Correspondence to: Steven Young, I.C.R.A., The Management School, Lancaster University,
Lancaster, LA1 4YX, U.K., Tel: (+44)-1524-594242, Fax: (+44)-1524-594334, E-mail:
s.young@lancaster.ac.uk
Detecting Earnings Management Using Cross-sectional Abnormal Accruals
Models
Abstract
(Jones, 1991) and modified-Jones (Dechow et al., 1995) models, we also develop and
test a new specification, labeled the “margin model”. Empirical tests suggest that all
three models are well specified when applied to a random sample of firm-years.
However, the margin model appears to generate relatively better specified estimates
of abnormal accruals when cash flow performance is extreme. Analysis of the models’
ability to detect artificially induced earnings management indicates that all three
plausible levels of accruals management (e.g., less than ten percent of lagged total
models are found to be more powerful for revenue and bad debt manipulations. In
contrast, the margin model appears to be more powerful at detecting non-bad debt
expense manipulations. These results suggest that different models may be required in
different circumstances.
2
Detecting Earnings Management Using Cross-Sectional Abnormal
Accruals Models
1. Introduction
obstacle associated with the implementation of this approach, however, is the need to
achieving this separation are the standard-Jones model (Jones, 1991) and the
Guay et al. (1996), Dechow et al. (1995) and Kang and Shivaramakrishnan (1995) for
the US suggests that both models estimate discretionary accruals with considerable
imprecision. These findings have led some commentators to question the reliability of
the emerging body of empirical evidence on the uses made by managers of accounting
1
Following conventional practice, we use the terms “managed accruals”, “discretionary accruals”, and
“abnormal accruals” interchangeably throughout the remainder of the paper. Similarly, we also use the
terms “unmanaged accruals”, “non-discretionary accruals” and “normal accruals” interchangeably.
1
models, we also develop and test a new specification, labeled the “margin model”.2
specification (i.e., the probability of a Type I error) and power (i.e., the probability of
all three procedures are well specified when applied to a random sample of firm-
years. Additional tests suggest that the margin model is relatively better specified
when cash flow performance is extreme. We assess the power of each model to detect
accrual management using simulation procedures that allow for three different forms
debts) and bad debt manipulation. Findings suggest that all three procedures are
all three forms of accruals management (i.e., accruals less than ten percent of lagged
total assets). However, important differences in the relative power of the models are
substantially more powerful for detecting revenue and bad debt manipulations. In
contrast, the margin model is significantly better at detecting non-bad debt expense
manipulations.
The paper contributes to the literature in three ways. First, in contrast to the
studies by Guay et al. (1996), Dechow et al. (1995) and Kang and Shivaramakrishnan
warranted as such procedures are now widely employed in the earnings management
2
Kang and Shivaramakrishnan (1995) develop a model that uses an instrumental variables technique to
isolate the discretionary component of total acrruals. Based on simulation results, they present evidence
that their model is more powerful than the standard-Jones model at detecting abnormal accruals. We do
not, however, evaluate this model in the present paper for two reasons. First, other than in the original
Kang and Shivaramakrishnan (1995) paper, we are not aware of any other study that has used this
2
literature. Second, we develop an alternative procedure for estimating managed
under certain conditions. Third, the results offer some practical guidance for detecting
earnings management activity. Specifically, our findings suggest either that the choice
combination may afford the best opportunity of detecting accrual management, the
review of the standard-Jones and modified-Jones models and develops our alternative
model for estimating abnormal accrual activity. Section 3 provides details of the
research design and sample selection procedure. Section 4 presents our empirical
total accruals into their managed and unmanaged components.3 In the first stage, total
accruals (TA) are regressed on the change in sales (∆REV) and the gross level of
property, plant, and equipment (PPE) for each sample firm, using the longest available
time-series of data immediately prior to the event year. In the second stage, the
estimated parameters from this regression are combined with TA, ∆REV and PPE
data from the event year to determine the abnormal component of total accruals. The
procedure to estimate accrual manipulation. Secondly, the original time-series formulation of the
procedure is not amenable to the cross-sectional estimation techniques that are the focus of this paper.
3
In the discretionary accruals literature, total accruals are typically defined as the change in non-cash
working capital accounts, minus depreciation and amortisation.
3
intuition underlying the choice of PPE and ∆REV as explanatory variables in the first-
stage regression is that they help to control for unmanaged accruals associated with
the standard-Jones model, with the exception that the change in debtors (∆REC) is
subtracted from ∆REV at the second stage. In effect, the model therefore implicitly
assumes that all changes in credit sales in the event period result from earnings
management.5 In tests comparing the power of the modified-Jones model with that of
the standard-Jones model, Dechow et al. (1995) find that the former procedure is
in absolute terms, both models are found to generate tests of low power for earnings
percent of total assets). In addition, both models are shown to be poorly specified
of Type I errors when applied to firms with extreme cash flows. These results cast
4
The coefficient on PPE is predicted to be negative since property, plant, and equipment are linked to
the income-decreasing depreciation accrual. In contrast, the predicted sign for the ∆REV coefficient is
more ambiguous since a given change in sales can cause income-increasing changes in some working
capital accounts (e.g., trade debtors) and income-decreasing changes in others (e.g., trade creditors).
The ambiguous predictions for the estimated coefficient on ∆REV highlights the somewhat ad hoc
nature of the model. For her sample of 23 firms facing import relief investigations by the United States
International Trade Commission, Jones reports an insignificant positive mean coefficient on ∆REV
(0.035) and a significant negative mean coefficient on PPE (-0.033).
5
As Beneish (1998) discusses, to the extent that ∆REVt – ∆RECt can be re-written as CRt – REVt-1,
where CRt equals cash received in period t, the modified-Jones model lacks economic intuition.
4
isolating accruals manipulation. Consistent with this sceptical view, Guay et al.
procedure and conclude that neither procedure generates a reliable measure of accrual
management.
Jones model, Teoh et al. (1998) and DeFond and Jiambalvo (1994) focus exclusively
on the working capital component of total accruals. As Beneish (1998) and Young
(1999, p. 842) discuss, this formulation is potentially more appealing since continuous
(i.e., year-on-year) earnings management via the depreciation accrual is likely to have
limited potential. In addition, Young (1999) reports that Jones-style models based on
the procedures empirically because of the need for a sufficiently long time-series of
requirement raises several concerns. First, issues of survivorship bias naturally arise.6
Secondly, the assumption that the coefficient estimates on ∆REV and PPE remain
stationary over time may not be appropriate. Finally, the self-reversing property of
residuals. In an effort to overcome these problems, recent studies have begun to use
cross-sectional versions of the models (e.g., Becker et al., 1998; Subramanyam, 1996;
DeFond and Jiambalvo, 1994).7 Under this approach, the first stage regression is
6
For example, Jones (1991) requires that her sample firms have at least 10 time-series observations.
7
One should not interpret the current preference for cross-sectional models in the literature as evidence
of their improved ability to detect earnings management. It is important to recognise that the
replacement of time-series models by their cross-sectional counterparts introduces new problems. For
5
estimated separately for each industry-year combination, after which the resulting
industry- and time-specific parameter estimates are combined with firm-specific data
literature. However, little research has been conducted to date that evaluates the
abnormal accruals using a two-stage procedure, where the first stage involves
capture unmanaged accruals. In contrast to the two Jones models, however, the
explanatory variables included in our first-stage regression are derived from a formal
model linking sales, accruals and earnings.10 Following the discussion in the
grounds that this item represents an unlikely or unsuitable vehicle for systematic
earnings management. The starting point for our model is therefore working capital
accruals. We then seek to model “normal” changes in the following three key working
capital accrual components: stocks (∆STOCK), debtors net of bad debt allowance
example, cross-sectional models are less likely to capture the effects of (a) mean reversion in accruals,
(b) dynamic accrual management strategies, and (c) industry-wide earnings management. With valid
arguments for and against both estimation approaches, the relative power of cross-sectional and time-
series models to detect earnings management remains an open empirical question that is beyond the
scope of the present paper.
8
In the event that the sample firm is included in the first-stage regression model, its estimate of
discretionary accruals is equal to the corresponding regression residual and consequently the two-stage
estimation procedure collapses to a single stage.
9
An exception is the study by Jeter and Shivakumar (1999) which evaluates cross-sectional models
using quarterly and annual data for a sample of US firms.
10
Our model is consistent with that developed by Dechow et al. (1998).
6
(∆DEBT), and creditors (∆CREDIT). The remaining non-cash components of
working capital are not explicitly modeled because of their diverse natures.
Ignoring for the moment such expenses as wages and rent, we define the
where PUR is purchases of materials, COGS is cost of finished goods sold, REVC is
revenue from credit sales, CRC is cash received from customers, BDE is the bad debt
expense, and CPS is cash paid to suppliers. Note that while ∆STOCK in (1) includes
transfers between these inventory categories involve cancelling entries that can be
ignored when the inventories are aggregated. Next, we write working capital accruals
(WCA) as:
where sm equals the gross margin on recorded sales, cm equals the gross cash
contribution on cash collections from customers, and OTHER includes all non-cash
current assets other than stocks and trade debtors and all current liabilities other than
creditors. We assume that OTHER is orthogonal to REVC and CRC in equation (4).
two contribution margins: the gross margin on sales and its cash flow analogue, the
7
margin on cash received (the “cash margin”). Under this approach, working capital
accruals that do not result from sales and cash collections in the period are classified
as “abnormal” in nature and are considered the most likely manifestation of earnings
management.
where REV is total sales (Datastream item 104) and represents our proxy for REVC,
CR is total sales minus the change in trade debtors (Datastream items 104 – ∆370)
and represents our proxy for CRC, λ0, λ1 and λ2 are regression coefficients and ηt is
the regression residual. The λ1 coefficient represents an estimate of the sales margin
cash margin and is predicted to be negative. We refer to this procedure as the “margin
model” in the remainder of the paper. The primary difference between the margin
model and the standard-Jones and modified-Jones models is that the margin model
disaggregates the change in revenue term into two components at the parameter fitting
stage, substituting cash receipts in the current period for revenues in the prior period.
The primary advantage of this approach is its improved economic intuition, which
should, in turn, lead to a more precise estimate of normal accruals. The downside,
such, we anticipate that the margin model will be less powerful than the modified-
8
3. Research design
(s-J), the modified-Jones (m-J) and the margin models. To ensure comparability with
the margin model, both the s-J and m-J models are formulated using working capital
accruals (e.g., Teoh et al., 1998; DeFond and Jiambalvo, 1994).12 The first stage
regressions are estimated cross-sectionally for each industry (Datastream level-6) and
year combination using all firms with available accrual data on the Datastream “Live”
and “Dead” stocks files. All variables (with the exception of the intercept) are scaled
containing fewer than ten observations are dropped from the analysis to ensure more
Model specification
We evaluate model specification by examining the extent to which each of the
11
We test this prediction empirically in section 4.
12
Working capital accruals are defined as the change in non-cash current assets minus the change in
current liabilities, excluding the current portion of long-term debt [Datastream items ∆(376 – 375) –
∆(389 – 387)].
13
As such, the s-J and m-J models reported in this paper differ slightly from those estimated in extant
studies where the intercept is also scaled by total assets and the resulting regression is estimated with
the true constant term suppressed. We did not adopt this approach in the current paper for two reasons.
First, there is no theoretical reason for forcing the regression through the origin (e.g., we have no
reason to believe that total accruals will be zero when, say, ∆REV is zero). Secondly, regressions
estimated with the constant suppressed preclude an analysis of the goodness of fit of the models
because the associated R-square values are unreliable. However, to ensure comparability with extant
research, we repeated all tests using this alternative specification. In all cases the conclusions were
identical to those based on the findings reported in tables 2-5.
9
(a) Estimate the first stage regressions for each of the three abnormal accrual models
(s-J, m-J and margin) for each Datastream level-6 industry group in year t;
(b) Select 25 firms at random from year t and construct an indicator variable
(PART) defined as one if the firm has been selected and zero otherwise;
(c) Randomise all observations in year t and compute abnormal accruals (AA) for
each of the three models using the coefficient estimates obtained in (a);
(d) Estimate the following univariate regression for each measure of abnormal
accruals
from zero.
Steps (a) – (d) are then repeated 100 times for each sample year. Since observations at
stage (b) are selected at random, they should not be characterised by any systematic
reject the null hypothesis of βˆ = 0 at rates that significantly exceed the appropriate
test statistic (e.g., five percent or one percent levels). Accordingly, we interpret
rejection frequencies close to (significantly different from) the specified test level as
(measured using operating cash flows) is extreme. Empirical tests exploit the fact that
earnings management and measurement error. Since the true earnings management
10
attributable to measurement error. As such, a comparison of the distributional
performance, prior research has found that unusually high (low) operating cash flows
1999; Dechow et al., 1995). Accordingly, average abnormal accruals estimated using
a model that controls for high (low) cash flows are expected to be less negative (less
positive) than abnormal accruals estimated using a procedure that affords little
effective protection against extreme cash flow performance, all else equal. The test is
implemented by forming decile portfolios, ranked within each sample year, based on
operating cash flow scaled by lagged total assets.15 Abnormal accruals for each decile
portfolio are then computed using the three cross-sectional models. Finally, mean and
median abnormal accrual estimates generated by the three models are compared
14
While this approach allows direct comparisons of the relative magnitude of the measurement error
associated with alternative accrual models, a limitation of the procedure is that affords no measure of
the absolute level of misspecification.
15
Operating cash flow is measured as operating profit (Datastream item 137) minus working capital
accruals (Datastream items ∆376 – ∆375 – ∆389 + ∆387), plus depreciation and amortisation
(Datastream items 402 + 562).
16
Using a procedure equivalent to that employed to test for general model specification, Dechow et al.
(1995) evaluate model specification when cash flows are extreme by examining observed rejection
frequencies for samples of firms selected from the top and bottom deciles of the operating cash flow
distribution. We do not tabulate results based on this approach in the current paper because as Guay et
al. (1996) discuss, the test is based on the assumption of no earnings management. While this
assumption is expected to hold for the random samples used to test for general model misspecification,
it is unlikely to hold in the case of firms with extreme cash flow performance, given the strong
incentives for earnings management faced by these firms. For completeness, however, we did perform
such tests. Consistent with Dechow et al. (1995), rejection frequencies for all three models significantly
exceeded the specified test level for both the high and low cash flow partitions. However, the rejection
frequencies for the margin model were always lower than those obtained for the s-J and m-J models,
11
exist. This is achieved by adding a pre-determined amount of positive accruals to the
reported accruals of a randomly selected set of firms and then examining the ability of
the models to detect this artificial earnings management. The procedure is similar to
that described above for testing general model specification with the exception that at
stage (b), artificial income-increasing accruals are added to the reported accruals for
firms where PART equals one.17 As before, steps (a) – (d) are then repeated 100 times
for each sample year. However, since the observations where PART equals one are
expect a powerful model to reject the null hypothesis that βˆ = 0 , in favour of the
alternative that βˆ > 0 , at rates that significantly exceed the specified test level. All
else equal, the higher the rejection frequencies associated with a particular model, the
income-increasing accruals ranging from zero percent to 100 percent of firms’ lagged
(b) Bad Debt Manipulation – e.g., write-back of an existing bad debt provision. This
consistent with the view that the margin model is relatively better specified when cash flows are
extreme.
17
As such, the parameter estimates from the first stage regressions in part (a) are not contaminated by
the artificially induced earnings management.
12
(c) Revenue Manipulation – e.g., premature recognition of a sale, assuming all costs
are fixed (Dechow et al. 1995, p. 202). This approach is implemented by adding
3.3 Sample
The starting point for our sample is the population of firms on the Datastream
“Live” and “Dead” stocks files with the necessary accounting data for the
computation of the s-J, m-J and margin models and which have year-ends between 30
June 1990 and 31 May 1997. From this initial sample, we exclude all financial firms
because (a) their financial reporting environments differ from those of industrial firms
and (b) they have fundamentally different accrual processes that are not captured very
well by our expectations models of normal accrual activity. Firm-years with missing
Datastream level-6 industry codes are also omitted. Since the estimation of our cross-
sectional accrual models requires at least ten firms per industry-year combination, we
also exclude all Datastream level-6 industry groups with fewer than ten observations
in a given sample year. Finally, because the distribution of working capital accruals is
for which scaled working capital accruals lie outside the one and ninety-nine
percentiles.
groups. Of these, 418 firms (49.9%) are represented in all seven years, while 572
firms (68.3%) are represented at least five times. Firms with only a single year of data
18
Unlike Dechow et al. (1995) and Kang and Shivaramakrishnan (1995), because our models are
estimated cross-sectionally we do not have to make corresponding adjustments for accrual reversals in
13
number 78 (9.3%). Annual sample sizes are 601, 590, 607, 580, 624, 657 and 693 for
1990, 1991, 1992, 1993, 1994, 1995 and 1996, respectively. The largest industry
group in any given year is general engineering, with the number of firms ranging from
4. Results
Table 2 presents descriptive statistics for the Jones and margin models. Panel
A reports the coefficient estimates and associated fit statistics for the first-stage
panel A. Since the s-J and m-J models are equivalent at the estimation stage, table 2
reports only a single set of results for these two models in the columns headed “Jones
ability to capture, and hence control for, normal working capital accrual activity.19 For
the Jones models, the coefficient on ∆REV is positive in all years, with the exception
of 1994. However, the average magnitudes of the βs are generally very low, ranging
from a high of 0.068 in 1993 to a low -0.007 in 1994. The associated t-statistics
highlight the lack of association between working capital accruals (WCA) and the
∆REV term for our sample: median t-statistics for the industry-specific estimates of β
never attain significance in any of the seven estimation periods. The lack of
future accounting periods. This holds for all three types of accrual manipulation examined.
19
Although a relatively high R-squared statistic does not necessarily guarantee that a particular model
is effectively partitioning accrual activity into normal and abnormal components, a model that accounts
for only a small proportion of the variation in working capital accruals seems inconsistent with (a) the
rationale underlying the use of a regression-based modelling procedure and (b) management’s limited
flexibility to manipulate revenues and expenses under GAAP.
14
association between WCA and ∆REV is also reflected in the explanatory power of the
models: the median adjusted R-squared statistic ranges from a maximum of 15% in
1993 to a minimum of –2% in 1992. Moreover, the adjusted R-squared statistics are
negative for more than 45% of industries in four out of the seven sample years. In
other words, the Jones models have zero explanatory power for a large number of
industry years. Even at their best, the cross-sectional Jones models fail to account for
Panel A of table 2 also reports the corresponding statistics for the margin
model. As predicted, the coefficient on REV (λ1) is positive in all seven sample years.
Further, the magnitude of the coefficient lies within the feasible range for the gross
profit margin on sales. The median coefficient on the CR term (λ2) is negative in all
reported for λ1. The median adjusted R-squared statistic ranges from 41% (1993) to
7% (1994) and is significantly higher (p < 0.001) than that reported for the Jones
models in six of the seven sample years. Consistent with this, the proportion of
industries with essentially zero fit (i.e., zero or negative adjusted R-squared statistics)
in a given sample year is also significantly lower than the equivalent figures for the
Jones models. These results provide evidence that the margin model may be
somewhat better specified, in terms of the first stage estimation, than either of the two
Jones models.
panel A of table 2 to compute the elasticity of working capital accruals with respect to
15
the drivers of “normal” accruals for each of the three accruals models.20 For the
margin model, the net effect of a one percent change in REV and CR on working
capital accruals ranges from 152% in 1993 to –5% in 1994. In contrast, the effect of a
one percent change in ∆REV on working capital accruals for the s-J model ranges
from 31% in 1993 to –0.18% in 1994. The equivalent elasticities for the m-J model
are 26% in 1993 and –0.14% in 1994. The results indicate the higher responsiveness
of working capital accruals to the drivers of normal accruals in the case of the margin
model. This has both positive and negative implications regarding the relative
performance of the margin model. On the positive side, a high elasticity is expected to
normal accruals. On the negative side, a high elasticity means that the model will
elasticities for the s-J and m-J models: the higher elasticity associated with the s-J
In an effort to shed light on the reasons underlying the low explanatory power
20
An elasticity measures the effect of a one percent change in an independent variable (Xj) on the
dependent variable (Y). Elasticities, measured at the point of the means of the independent variables,
16
according to the adjusted R-squared statistics obtained from the first stage regression
models. For both the Jones and the margin models, an F-test fails to reject the null
hypothesis that the mean number of observations in the estimation portfolios differs
across the three goodness-of-fit partitions. These findings indicate that sparse data is
unlikely to be the primary factor driving the low explanatory power for certain
variability of WCA within the industry-year portfolios underlies the low adjusted R-
square values. Finally, panel B also reports the primary industry groups (i.e., those
patterns are apparent. As such, the factors that determine industry-year combinations
WCA from those where the first-stage regressions account for little or no variation in
Results of the tests designed to assess the general specification of the s-J, m-J,
and margin models are presented in Table 3. Type I errors from one-tailed tests are
reported for both the null hypothesis that abnormal accruals are greater than or equal
hypothesis that abnormal accruals are less than or equal to zero (alternative
procedure helps ensure that firms where PART equals one are unlikely to be
should not reject the null hypothesis of no earnings management at rates that
Xj
are computed as E j = βˆ j .
Y
17
significantly exceed the test level (e.g., 5% or 1%). Consistent with the findings
corresponds to the specified test levels for all three models: in all cases a binomial test
fails to reject the null hypothesis that the observed rejection frequencies equal the
specified test levels. These findings suggest that all three cross-sectional accrual
accrual models when cash flow performance is extreme. Results for the low (high)
CFO partition reveal that all three models generate positive (negative) abnormal
accrual estimates of around six to seven percent of lagged total assets. This is
consistent with all three models inducing positive (negative) measurement error when
cash flows are unusually low (high).21 Examination of the relative magnitudes reveals
that the margin model produces less extreme estimates of abnormal accruals than
those generated by the two Jones models. For example, in the low CFO portfolio,
mean (median) abnormal accruals for the margin model are 6.0% (5.6%) of lagged
total assets, compared with around 7.5% for the s-J and m-J models. These differences
are significant at the 0.001 level. Similarly, in the high CFO portfolio, mean abnormal
accruals for the margin model are –5.6% of lagged total assets, compared with –6.5%
for the s-J and m-J models (difference significant at the 0.001 level). To the extent
that low operating cash flows induce bias in estimated abnormal accruals and vice
versa, these results are consistent with the margin model being somewhat better
specified than either the s-J model or the m-J model when cash flow performance is
extreme. Note, however, that due to the nature of our empirical tests we are unable to
21
An alternative explanation for the positive (negative) abnormal accruals observed in the low (high)
cash flow portfolio is that these firms are systematically managing their earnings upwards (downwards)
as a means of smoothing reported income. Indeed, it is the inability to distinguish between
18
make inferences regarding the absolute level of measurement error. In particular, we
make no claim that the margin model is completely free of measurement error.
First, the margin model’s apparent superior specification does not appear to be
confined to the two extreme CFO portfolios. For example, relative to estimates
generated by the s-J and m-J models, margin model abnormal accruals are less
positive in CFO decile 2 and less negative in CFO deciles 7 - 9. Secondly, abnormal
accruals generated by the s-J model are significantly lower than those generated by
the m-J model when cash flows are unusually low (CFO deciles 1 and 2), suggesting
the s-J model’s relative superiority when applied to firms with low operating cash
flows. Finally, estimated abnormal accruals produced by the three models are
indistinguishable from each other and close to zero in the mid cash flow portfolios
(CFO deciles 5 and 6), suggesting that all models are reasonably well-specified when
plot the frequency with which the null hypothesis of no earnings management is
rejected (vertical axis) against the magnitude of the induced earnings management
(horizontal axis).22 Separate graphs are presented for each assumed source of earnings
manipulation, using two test levels (five percent and one percent). Rejection rates for
measurement error and earnings management that makes it impossible to draw inferences about
absolute model specification in these portfolios.
22
While we experiment with different magnitudes of income-increasing accruals management ranging
from zero percent to 100 percent of firms’ lagged total assets, the graphs in figure 1 only plot earnings
19
both test levels are computed using a one-tailed test. The graphs in panel A provide
the power functions for expense manipulation (excluding bad debts), evaluated at the
five percent and one percent levels. The equivalent results for bad debt manipulation
A striking feature of the results presented in figure 1 is that all three models
(or close to) 100% for artificially induced accrual manipulations of around six percent
of lagged total assets or greater. These results hold for rejection frequencies evaluated
at either the five percent or one percent test levels. Even for relatively low accruals
manipulations of around two percent of lagged total assets, the rejection frequencies
can be as high as 40%, depending on the particular accrual model used and the
specific type of earnings management considered. These results contrast with the
much lower rejection frequencies reported by Dechow et al. (1995) for the time-series
versions of the Jones and modified-Jones models. For example, Dechow et al. (1995)
report rejection rates of less than 30% for earnings management equal to five percent
of total assets, with the rates only approaching 100% when artificially induced
striking feature of the power functions presented in figure 1 is the apparent similarity
manipulation (panel C). Our results therefore fail to confirm the findings reported by
Dechow et al. (1995) that the m-J model is significantly more powerful at detecting
management levels up to ten percent of lagged total assets as in most cases, the detection rates reached
100% at or well before this level.
20
revenue-based manipulations. We attribute this result to the small coefficient
estimates reported for the Jones models in table 2, which act to dampen-down the
effect of revenue manipulations that contaminate the ∆REV term. In other words,
when the estimated coefficient on ∆REV is as low as that reported in table 2, it makes
little difference whether or not ∆REV is adjusted for the change in debtors associated
While all three models evaluated in this study appear capable of detecting
suggest some important differences with respect to the performance of the Jones
models (s-J and m-J) and the margin model. For example, the findings reported in
panel A for expense manipulation (excluding bad debts) indicate the margin model’s
relative superiority for low levels of earnings management (i.e., accruals less than six
percent of lagged total assets). A Z-test reveals that the rejection frequencies obtained
for the margin model are significantly higher (p < 0.05) than those obtained for the s-J
1.5% and 6.5% of lagged total assets. As discussed above, this is consistent with the
where the drivers of normal accrual activity are not contaminated by earnings
management. In contrast, the margin model is significantly worse than the s-J and m-J
models at detecting both bad debt manipulation (panel B) and revenue manipulation
(panel C). This is because the smaller coefficient estimates obtained in the first stage
fit of the Jones models serve to dampen-down the effect of earnings management that
estimates reported in table 2 for the margin model are associated with a reduction in
the model’s power to detect earnings management because a greater fraction of the
21
artificially induced accruals manipulation is unintentionally attributed to normal
accruals activity.
Guay et al. (1996), Dechow et al. (1995) and Kang and Shivaramakrishnan (1995)
examination of the literature indicates that cross-sectional models are now more
sectional versions of the models proposed by Jones (1991) and Dechow et al. (1995),
we also develop and test an alternative procedure, labelled the margin model. The
margin model differs from existing procedures in that the drivers of normal accruals
are derived from a formal model linking sales, accruals and earnings.
Results indicate that all three cross-sectional models appear well specified
that the margin model generates relatively better specified estimates of abnormal
accrual estimates produced by the margin model are significantly lower (higher) than
operating cash flows are unusually high (low). Further analysis designed to assess the
models’ ability to detect known cases of accruals management indicates that all three
procedures appear capable of generating high power tests for earnings management
upwards of five percent of lagged total assets: for earnings management exceeding
22
five percent of lagged total assets, all three models yield detection rates close to
100%. In contrast, Dechow et al. (1995) report rejection frequencies of between 20%
and 30% for their time-series models for artificially induced earnings management
equal to five percent of total assets. To the extent comparisons across different studies
conducted in different institutional regimes are meaningful, these results provide some
support for Subramanyam’s (1996) findings that his cross-sectional accrual model
may be more powerful than its time-series counterpart. Note, however, that the
Dechow et al. (1995) models are based on a measure of total operating accruals (i.e.,
including depreciation) while the models examined in this paper are defined in terms
of working capital accruals. This raises the possibility that the differences in rejection
frequencies between the two studies may be due to differences in the way accruals are
alternative interpretation of the high rejection rates documented for the models in this
study is that a pure working capital accruals measure may be more powerful than an
operating accruals measure when earnings are managed via working capital accounts.
Future research aimed at formally comparing the power of (a) alternative estimation
Regarding the relative power of the three models, findings suggest that the
detecting subtle revenue and bad debt manipulations (i.e., less than 10% of lagged
total assets in magnitude). Thus, despite their ad hoc nature, these models still appear
accrual management. The price for this improved detection power, however, is greater
23
(1995), we fail to document any significant difference in the relative power of the
result to the relatively small coefficient estimates obtained from the first stage
findings for revenue and bad debt manipulations, the margin model outperforms the
manipulations ranging from 1.5% to 6.5% of lagged total assets. We note, however,
that the improved power of the margin model over that of the s-J and m-J models in
relation to non-bad debt manipulations is more than offset by its relative inability to
abnormal accruals model to use, our results imply that this decision is likely to be
management via revenue or bad debt accounts is anticipated, then the standard-Jones
and modified-Jones models appear to offer the greatest chance of detecting it.
margin model may be more appropriate. In the absence of any strong priors regarding
the specific type of manipulation used, our results suggest that using all three models
accruals management (i.e., less than five percent of total assets in magnitude) remains
24
References
Becker, C., DeFond, M. L., Jiambalvo, J. and Subramanyam, K. R. (1998). ‘The
effect of audit quality on earnings management’. Contemporary Accounting
Research, 15: 1-24.
Beneish, M. D. (1998). ‘Discussion of “Are accruals during initial public offerings
opportunistic?”’. Review of Accounting Studies, 3: 209-221.
Dechow, P., Kothari, S. P. and Watts, R. (1998). ‘The relation between earnings and
cash flows’. Journal of Accounting and Economics, 25: 133-168.
Dechow, P., Sloan, R. and Sweeney, A. (1995). ‘Detecting earnings management’.
The Accounting Review, 70: 193-225.
DeFond, M. L. and Jiambalvo, J. (1994). ‘Debt covenant violation and the
manipulation of accruals’. Journal of Accounting and Economics, 17: 145-176.
Guay, W.R., Kothari, S. P. and Watts, R. (1996). ‘A market-based evaluation of
discretionary accrual models’. Journal of Accounting Research, 34: 83-105.
Healy, P. (1985). ‘The effect of bonus schemes on accounting decisions’. Journal of
Accounting and Economics, 7: 85-107.
Jeter, D.C. and Shivakumar, L. (1999). ‘Cross-sectional estimation of abnormal
accrual models using quarterly and annual data: effectiveness in detecting event-
specific earnings management’. Accounting and Business Research, 29: 299-319.
Jones, J. (1991). ‘Earnings management during import relief investigations’. Journal
of Accounting Research, 29: 193-228.
Kang, S.-H. and Shivaramakrishnan, K. (1995). ‘Issues in testing earnings
management and an instrumental variable approach’. Journal of Accounting
Research, 33(2): 353-367.
Subramanyam, K. R. (1996). ‘The pricing of discretionary accruals’. Journal of
Accounting and Economics, 22: 249-281.
Teoh, S. H., Wong, T. J. and Rao, G. R. (1998). ‘Are accruals during initial public
offerings opportunistic?’. Review of Accounting Studies, 3: 175-208.
Young, S. (1999). ‘Systematic measurement error in the estimation of discretionary
accruals: an evaluation of alternative modelling procedures’. Journal of Business,
Finance and Accounting, 26: 833-862.
25
Table 1
Sample Selection Criteria. The Sample Period Comprises all Non-Financial Firms on
the Datastream “Live” and “Dead” Stocks Files with Year-Ends Between 1 June 1990
and 31 May 1997 (Inclusive)
Criteria Firm-Years
Firm-years with non-missing accrual data: non-
financials only 4,600
26
Table 2
Descriptive Statistics for the Jones (s-J and m-J) Models and the Margin Model, Estimated Cross-Sectionally for Each Industry
(Datastream Level-6) and Year Combinationa
Panel B: Industry-year portfolios, partitioned according to the goodness-of-fit of the first-stage regressions
Jones Models Margin Model
Mean of industry-year Very Poor Fit: Poor Fit: Good Fit: Very Poor Fit: Poor Fit: Good Fit:
portfolios (R2 ≤ 0) (0 < R2 < 5%) (R2 > 20%) Fd (R2 ≤ 0) (0 < R2 < 5%) (R2 > 20%) Fd
Number of firms 18.716 20.130 19.807 0.490 18.277 18.762 21.275 1.728
Stand. dev. of WCA 0.075 0.074 0.077 0.231 0.075 0.067 0.077 1.654
Recurring industries:e Clothing & NA Breweries, pubs House building NA Clothing &
footwear & restaurants footwear
Vehicle Electrical Electrical
distributors equipment equipment
Other building Oil exploration Breweries, pubs &
materials & production restaurants
Engin. vehicle Paper, packaging
components & printing
27
Table 2 (Continued)
Aerospace Other construction
Broadcasting Broadcasting
Chain Stores Medical products
House building Engineering
Leisure facilities Fabricators
Multi-stores Electronics
Business support Ind. components
Publishers Diversified
industrials
a
The Jones models and the margin model are estimated separately for each industry-year combination. The minimum number of observations in an estimation portfolio is ten.
Since the s-J and m-J models are equivalent at the estimation stage, we report only a single set of results for these two models in the columns headed “Jones Models”. Working
capital accruals (WCA) are defined as non-cash current assets minus current liabilities (excluding the current portion of long-term debt). REV is total sales revenue, CR is cash
received, measured as total sales minus the change in trade debtors.
b
The number of industry groups in 1990 (1991, 1992, 1993, 1994, 1995, 1996) is 31 (30, 31, 29, 31, 31, 32).
c
The percentage of industry groups where the adjusted-R2 is negative.
d
The F-statistic from a one-way ANOVA testing the difference in means across the three goodness-of-fit categories.
e
Datastream level-6 industry groups where the adjusted-R2 meets the goodness-on-fit criteria in at least four years during the sample period.
28
Table 3
Comparison of Type I Error Rates for Abnormal Accruals Estimated Using the Standard-
Jones (s-J), the Modified-Jones (m-J) and Margin. Percentage of 700 Randomly Selected
Firm-Years for which the Null Hypothesis of No Earnings Management is Rejected
Using a One-Tailed Test.a
29
Table 4
Descriptive Statistics for Abnormal Accruals, Partitioned by Cash Flow from Operations
Scaled by Lagged Total Assets (CFO). Abnormal Accruals are Estimated Using the
Standard-Jones (s-J), the Modified-Jones (m-J) and Margin Modelsa
30
Figure 1
Simulation Results of Power Functions for Tests of Earnings Management Using
Abnormal Accruals. Simulations are Performed Using Artificially Induced Amounts of
Earnings Management Ranging from Zero Percent to 10 Percent of Lagged Total Assets
Using One-tailed Test Levels of Five Percent and One Percent. The Number of
Simulations is Equal to 700. Abnormal Accruals are Estimated Using the Standard-Jones
(Solid Line), the Modified-Jones (Dashed Line) and Margin (Dotted Line) Models
90 90
80 80
70 70
60 60
50 50
40 40
30 30
20 20
10 10
0 0
0 2 4 6 8 10 0 2 4 6 8 10
Induced Earnings Management (%) Induced Earnings Management (%)
90 90
80 80
70 70
60 60
50 50
40 40
30 30
20 20
10 10
0 0
0 2 4 6 8 10 0 2 4 6 8 10
Induced Earnings Management (%) Induced Earnings Management (%)
80 80
70 70
60 60
50 50
40 40
30 30
20 20
10 10
0 0
0 2 4 6 8 10 0 2 4 6 8 10
Induced Earnings Management (%) Induced Earnings Management (%)
31