Вы находитесь на странице: 1из 6

DIAGNOSTICS

ABSTRACT
AI systems provide a lot of promise in
Successful
application of
the analysis and evaluation of power
system data. This article shows some
of the benefits of the application of AI,

AI techniques:
identifying the strengths and weak-
nesses of the approach, and provides
a way forward to apply AI in a mean-
ingful and controlled manner.

KEYWORDS A hybrid
approach
application, artificial intelligence, ex-
perience, machine learning, scientific
machine learning

92 TRANSFORMERS MAGAZINE | Special Edition: Digitalization | 2020


Tom RHODES, Tony MCGRAIL

The major applications of AI and ML in the


transformer industry are the diagnostics and
giving the predictions where and when the
next transformer will fail

not perfect – it has its role in the analysis specific approaches meaning that analysis
of well-described problems with sufficient could be inconsistent based on the region
data to cover all possible situations that and the individual involved. So, the push to
may be found. Let us consider two things more uniform approaches based on AI and
which are true in our industry: ML seems both rational and sensible, espe-
cially as most experienced personnel, who
• we are almost always faced with incom- understand the data, are retiring.
plete and possibly ambiguous data,
• the analysis of data does not take place So, what can AI and ML do for us? Some
in a vacuum as we have a history and a examples of benefits include [5]:
knowledgebase to call on to check the
results. • In weather forecasting, AI has been
used to reduce human error,
So in simple terms, if an AI system is de- • Banks use AI in identity verification
veloped which analyses data for power processes,
transformers, then based on the data • A number of institutes use AI to sup-
available it should be able to replicate port helpline requests, sometimes via
what has already been developed as 'com- chatbots,
mon knowledge' or industry expertise. • Siri, Cortana and OK Google all build
For example, in DGA analysis, identi- on AI apps,
fying increased levels of acetylene with • AI systems can classify well-organised
increased probability of failure should be data – such as X-rays.
a rule which is identified [4]. If the AI is
unable to state the rule in clear terms, then On the downside, there are some issues [6]:
we may not trust other analyses described:
we have to have a believable audit trail for • AI may be good at interpolation with-
the analysis to justify actions. in a dataset, but not at extrapolation to
'new' data,
• 'Giraffing' – the generic name for iden-
Business environment: tifying the presence of objects where
In an ideal world, we would have com- those objects do not exist,
plete and detailed information on each of • Providing bias in the analysis based on
our transformers: maintenance history, unrepresentative datasets,
test data, monitoring data, fault data, and • Using a black-box approach, so the
Introduction
so on. There would be standards and ana- reason for a 'decision' is not clear and
There is a large push to use artificial intelli- lytic tools to tell us about each individual transparent.
gence (AI) and machine learning (ML) to transformer: the health, probability of fail-
help reduce the time of performing main- ure, remaining life and so on. In practice, In fact, many of the benefits of AI applica-
tenance on transformers and predicting the data may be incomplete, inconsistent, tion rely on having clean and well-ordered
where and when the next transformer will or missing. data – in terms of data mining, it is esti-
fail [1, 2, 3]. Major companies in differ- mated that 95 % of the possible benefits
ent industries are promoting and telling It is common for a subject matter expert can be achieved through data clean up
the wonders of AI and ML: managing the (SME) or a technician to analyse and eval- and standard statistical methods [7]. It is,
replacement plans of an ageing or aged uate all available data to make decisions however, also noted that AI systems can
fleet, reduction in maintenance while ex- about actions and interventions in their work 24/7 and do not get bored with re-
tending asset life, operational efficiency, region or area. Transformers would be petitive tasks.
all while capturing the expertise available ranked manually and grouped for prior-
so that it is not lost. These are lofty goals, itisation of maintenance, replacement or So it would seem that an appropriate ap-
and claims are already being made for other intervention. Some of the analysis proach to apply AI tools is to use them
the benefits of AI applications in 'the real methods may be used only by some SMEs where they are strong: analysing data to
world'. The problem we face is that AI is and not others, and they may have their own identify the majority of 'standard' or 'nor-

w w w . t ra n sfo r m e r s - m a g a z i n e . co m 93
DIAGNOSTICS

Figure 1. Sheep and goats

AI and ML algorithms have their pros and dimensions of the provided data. An
expert then classifies the resulting clus-
cons, and it is important to be aware of that ters and tests them against new cases.
in order to apply AI and ML techniques in As an example, consider an ML tool devel-
the best possible way oped to recognise sheep and / or goats in
pictures supplied, as per Fig. 1. In a super-
vised ML approach, an expert would clas-
mal' cases and allowing the SME's to con- ferent cases, for example, oil samples sify each picture, and the tool would try to
centrate on the data which are not clear which indicate overheating or paper find data differences between the pictures
or needs 'real attention'. Let the AI / ML degradation. A machine learning tool which reflects the classification. We may
interpolate but not extrapolate. tries to learn from parameters within not know why the tool does what it does
the data, for example, hydrogen con- – the ML can be considered a black box.
tent, moisture level, presence of PD, Once trained, we show the ML tool more
Machine learning types etc., and these parameters best reflect pictures for it to classify to see how well it
In general, machine learning may be split the expert classification. Then test the does – and if we just show pictures used
into two similar approaches, both requir- resulting tool against new cases to see in the training data, it will likely do very
ing large data sets which are split into test how effective it is. well. It is when we show it more complex
and training subsets [8]: b. In unsupervised machine learning, pictures, or pictures of another animal, the
a similar approach is used, but in this ML tool may fail.
a. In supervised machine learning, an case, the machine learning tool groups
'expert' classifies the data set into dif- the cases based on clusters in the many In unsupervised ML, the tool clusters the
data and the expert classifies it afterwards.
In both supervised and unsupervised ML
tools, the Ml performs very well when the
There are two basic ML learning types: test cases are similar to the training cases
but much less well when the supplied cases
supervised learning – that requires an ex- are different from the training cases. What
pert for initial data classification before the happens if there are multiple animals in a
picture? Or if there is a llama – how does
learning process and unsupervised learning that get classified? The effect called 'giraff-
– where the expert evaluates data after the ing' where an ML tool trained to identify
giraffes in supplied pictures then identi-
learning process fies giraffes in pictures where no giraffe

94 TRANSFORMERS MAGAZINE | Special Edition: Digitalization | 2020


is present - the effect is a result of the ML Unsupervised learning showed better re-
training where giraffes are over-represent-
ed in the training cases, but the cases of 'no sults compared to supervised learning in
giraffes' are underrepresented [9]. The ef-
fect can be seen in a visual chatbot which
the example of the classification of the par-
identifies the content of pictures – try ask- tial discharge EMI spectra
ing it how many giraffes are in a picture
you supply [10].
and guidelines available for support, not- ery ML solution they were offered or
Fig. 2 shows a high-level view of an ML ing that these can be inconsistent and tried for predictive maintenance was
classification process for partial discharge may not provide a good interpretation an assumption that given enough data
EMI spectra, conducted by Dr Imene Mi- in all cases. In practice, there is a need to we can make accurate predictions using
tiche as part of a Doble Engineering spon- focus, as there is a large amount of data. Gaussian modelling of the available data;
sored R&D project at Glasgow Caledo- For example, at Duke Energy, there are unfortunately, that assumption is not
nian University in the UK. Expert analysis over 10,000 Large Power Transformers true.
of EMI spectra was initially used as a base (Banks > 7.5 MVA) in their transformer
for a supervised ML approach – features fleet. These transformers have dozens of A Gaussian, or normal, distribution is
extracted from the data based on the en- data sources from DGA to offline tests to symmetrical about an expected value. In
tropy (orderliness) of the data are used to maintenance history to condition moni- practice, distributions of DGA values,
cluster the data, as shown. toring and generate millions of individual power factor levels, PD inception voltages
data points. Like most companies, Duke and other are not Gaussian, and that trend
The original EMI spectra cases from a has ever fewer people to manage that age- follows through the analysis to the point
number of different generator analyses ing fleet, and they need to be able to focus of classification. In addition, the realities
taken around the world are analysed by an on what is most critical, most important for transformer data include:
expert and classified; those classifications and most relevant.
are then used to drive the supervised ML • Limited and bad data,
analysis based on the entropic features ex- • Failure to document and maintain
tracted. The supervised approach yielded Practicalities at Duke Energy failed asset data,
an accuracy of subsequent test classifica- Duke Energy performed exhaustive re- • No investment in cleaning and verify-
tion of ~75 %. An unsupervised approach search over a number of years, looking ing data available,
was also performed, using the same en- for a 'good' AI / ML tool: by 'good' we • Data not normalised across multiple
tropic data, with the clusters plotted on an mean one which classifies cases well sources nor within a single source,
entropy chart to indicate the cluster inde- when they are clear but identifies those • Unique characteristics of data related
pendence. Subsequent classification of the which are 'less clear' as needing further to the manufacturing process for sister
unsupervised clusters yielded an accuracy analysis. One thing in common to ev- units (they are handmade),
in excess of 80 %. The improvement in
results from the unsupervised approach
demonstrates both the difficulty in clas- Duke Energy has developed a hybrid model
sifying the spectra and benefits of not as-
suming perfect a priori knowledge from which takes the best of available analysis
the expert. The application of the result-
ing ML system is being incorporated into
tools and ML systems, combined with the
Doble's EMI survey tools to support users scientific knowledge, to make the most ac-
in the field with their analyses.
curate decisions using Scientific Machine
For many analyses, there are standards Learning

Figure 2. Feature extraction approach to partial discharge EMI Spectra Analysis

w w w . t ra n sfo r m e r s - m a g a z i n e . co m 95
DIAGNOSTICS

The SciML tool applies takes the best of every failure is driven by condition related
failure modes and not every failure is pre-
both worlds – applying standards / guide- dictable.
lines and benefitting from the broad appli-
The first step in the development of a use-
cation of ML ful Health and Risk Management (HRM)
tool was to invest in data clean up and sub-
sequent data hygiene management – this
whereas the realities for the data scientists ogists and other scientific groups were de- is an ongoing task and needs constant vig-
include: veloping similar techniques, finding that ilance to prevent rogue data errors causing
pure machine learning was not producing false positives in the analyses. Data is made
• An assumption that the answer lies in accurate results in practice. The hybrid ap- available through a single-user interface,
the available data, without necessarily proach is now termed 'Scientific Machine and standard engineering algorithms are
referencing Transformer SMEs, Learning' (SciML), where actionable deci- applied to identify issues and data which
• ML assumes a Gaussian data distribu- sions are made based on reliable data sup- need a deeper analysis: condition-based
tion, but most failure modes are not ported by subject matter expertise. maintenance data (CBM), load variation,
based on Gaussian data, oil test, electrical test, and work order data
• Major companies like Dow Chemical, SciML is noted for needing less data, be- all provide the context in one interface for
Audi, and Intel have been open about ing better at generalisation, being more decision support. Analytics such as the
predictive models for major plant as- interpretable, and more reliable, than Doble Frank scores, TOA4 gassing scores
sets not being effective, both unsupervised and supervised ma- / severity, EPRI PTX indices are applied
• IT and data scientists do not usually chine learning [11]. Duke's use of SciML initially, and the results are normalised as
understand failure modes and may not went into effect in January 2019, while the a linear feature set which can be analysed
take them into account for their mod- terminology and papers on the concept with a supervised ML tool. The combina-
elling. from academic and commercial AI / ML tion of approaches allows data related to
platforms did not come into common use each transformer to be classified into one
Consequently, a lot of time, effort and re- until late 2019 / 2020. of the several predefined classifications or
source can be targeted at Ml systems which states: Normal, Monitor, Service, Stable,
do not support the 'real world'. Based on The SMEs are regularly asked by the as- Replace, and Risk Identified.
experience and SME inputs, Duke Energy set / finance group to "Provide a list of
has developed a hybrid model which takes transformers most likely to fail, or in the The approach is shown, at a high level, in
the best of available analysis tools and ML poorest condition, for our proactive re- Fig. 3.
systems, to allow SMEs and technicians placement project." The response was
to focus effectively, accessing data so they regionally based, with different spread- The SciML tool takes the best of both
can make the most accurate decisions sheets and different analyses and different worlds – applying standards / guidelines
where they are needed with fewer things collations as some SMEs have over 1,000 and benefitting from the broad applica-
'slipping through the cracks'. transformers to evaluate. Then there is a tion of ML. The process at Duke has re-
call coming in about a transformer that duced time for SMEs to perform annual
Duke's hybrid model methodology devel- failed and that is not on any of the sup- fleet evaluations in a few days rather than
opment occurred at the same time as biol- plied lists. Such failures are inevitable: not several weeks, consistently across the or-
ganisation. The number of 'bad actors'
slipping through the cracks is lower, but
not yet zero.

One of the features of the Hybrid system


is the ability of the system to automatically
change some states:

• a state may be automatically changed


to 'Monitor' or 'Service' based on raw
data,
• the state may be changed to 'Risk Iden-
tified' based on engineering analytics
and ML classification,
• no transformer state can be automat-
ically changed to 'Stable' or 'Replace':
that requires SME intervention. After
reviewing the data, the SME deter-
mines if a transformer is 'Stable' or
should be 'Replaced', with comments
Figure 3. Overview of hybrid engineering ML transformer fleet analysis tool – now called SciML recorded.

96 TRANSFORMERS MAGAZINE | Special Edition: Digitalization | 2020


Duke Energy's hybrid model of engi-
neered analytics and machine learning
AI / ML tools can provide benefit in the in-
has proven to be an excellent but imper- terpretation and classification of complex
fect tool, being far more 'accurate' than
either pure AI / ML tools have proven
data, but they can be fooled by data incon-
or engineered analytics alone. The trans- sistent with their training set, which is why
former state, as updated by SMEs is now
far more useful in making sound planning the ML tools require inputs and check from
decisions. the expert
Success, in terms of uptake and use of the
Hybrid model, has been based on a num-
ber of activities: data hygiene, collation of Bibliography [5] https://towardsdatascience.com/advan­
data sources, application of standards / tages-and-disadvantages-of-artificial-
guidelines for engineered analytics, data [1] E. Gulski, et al., “Data mining tech- intelligence-182a5ef6588c
normalisation for features to feed the ML, niques to assess the condition of high
continuous SME input and refinement in voltage electrical plant”, Paper 15-107, [6] https://abad1dea.tumblr.com/post/­­
a closed-loop evaluation. CIGRE Technical Session, Paris, France, 182455506350/how-math-can-be-racist-­
2002 giraffing
The benefits of the hybrid approach have
been to allow SMEs and field technicians [2] N.N. Ravi, S. Mohd Drus, P.S. Krish- [7] https://www.kdnuggets.com/2018/04/
to focus on important and critical cases. nan, "Data mining techniques for trans- dirty-little-secret-data-scientist.html
The system is not perfect, but it has iden- former failure prediction model: A sys-
tified bad actors more consistently and tematic literature review", 2019 IEEE [8] https://towardsdatascience.com/­
more accurately than any previous ap- 9th Symposium on Computer Applica- machine-learning-for-beginners-d247a­
proach used at Duke Energy. tions & Industrial Electronics, Malaysia, 9420dab
2019
[9] https://abad1dea.tumblr.com/post/­
Conclusions [3] CIGRE Technical Brochure 292, 2006, 182455506350/how-math-can-be-racist-­
AI / ML tools can provide benefit in the “Data mining techniques and applica- giraffing
interpretation and classification of com- tions in the power transmission field”
plex data, but they can be fooled by data [10] http://demo-visualdialog.cloudcv.org/
inconsistent with their training set. The [4] CIGRE Technical Brochure 296, 2006,
application of ML tools requires input “Recent developments in DGA interpre- [11] https://www.alcf.anl.gov/events/scientific
from the SMEs who can guide the devel- tation” -machine-learning-learning-­small-data
opment in specific applications. Under-
standing the raw data and making the
best use of data hygiene / management Authors
activities is a base for building an over- Dr. Tony McGrail of Doble Engineering Company
all analysis system which combines the provides condition, criticality, and risk analysis for
best practice, application of standards / substation owner / operators. Previously, he has spent
guidelines, and targeted use of AI / ML over 10 years with National Grid in the UK and the US as
systems. Doble Engineering has shown a substation equipment specialist, with a focus on power
the development of targeted AI / ML tools transformers, circuit breakers, and integrated condition
can bring benefit in practical data analy- monitoring, and has also taken on the role of substation
sis in the field; Duke Energy has shown asset manager identifying risks and opportunities for
that application of targeted ML tools can investment in an ageing infrastructure. McGrail is a Fellow of the IET, past-
support SMEs in their asset performance Chairman of the IET Council, a member of the IEEE, ASTM, ISO, CIGRE
analyses. and the IAM, and a contributor to SFRA and other standards.

Acknowledgements Tom Rhodes graduated from the Upper Iowa Univer-


sity with a B.S. Professional Chemistry. He has over 30
The authors would like to thank our col- years of data analysis for asset management of indust-
leagues at Duke Energy and at Doble rial systems. Mr. Rhodes worked as Implementer / Pro-
Engineering Company, and many more ject Leader at CHAMPS Software - implementing new
across the industry who have provided CMMS / Asset Management technology, and he has
comment, feedback and discussion of the held titles of Sr. Science and Lab Services Specialist, Sci-
application of AI techniques. And many entist and Lead Engineering Technologist at Duke Ener-
thanks to Dr Mitiche at Glasgow Caledo- gy. He is an author and regular presenter at Doble, IEEE, Distributec and
nian University for sharing her results of ARC conferences on Oil analysis and asset management.
AI analysis of PD / EMI data.

w w w . t ra n sfo r m e r s - m a g a z i n e . co m 97

Вам также может понравиться