Академический Документы
Профессиональный Документы
Культура Документы
The U.S. Food and Drug Administration (FDA) has published a white paper titled
Data Mining at FDA. The paper summarizes past and current data mining
activities at the FDA and addresses data miners in all sectors anyone interested
in the safety of FDA-regulated products, and those with a general interest in FDA
activities. The topics covered in the paper include:
This brief references the content of the white paper and draws upon the findings
specifically the FDAs use of Oracle Health Sciences Empirica Signal and Study
solutions in their data mining activities. The full white paper can be downloaded from
http://www.fda.gov/ScienceResearch/HealthInformatics/ucm446239.htm.
In response to the need to develop an FDA-wide data mining collaboration and strategy, the
FDA Data Mining Council (DMC) was formed in 2007. The DMC is collaborative and explores
standardized manner.
methods and best practices recommended by experts from other federal agencies, industry,
and academiaall of whom have analogous experience in knowledge discovery through
various data mining approaches. The Council serves as a forum for FDA scientists to share
their experiences and challenges in analyzing data contained in the vast databases the FDA
maintains to discuss new methods for such analyses. The FDA currently receives
approximately two million adverse event, use error, and product complaint reports each year
from consumers, healthcare professionals, manufacturers, and others. Since the early 1990s,
the FDA has advocated data mining to the industry in an effort to better understand the signals
within the safety data. Now, the FDA data mining experts have expanded their attention to
adding more sophisticated data mining methods and applying data mining to other types of
product safety-related FDA and non-FDA databases.
The Proportional Reporting Ratio (PRR) is the foundational concept for many disproportionality
methods. However, because this method does not adjust for small observed or expected
ORACLE BRIEF
numbers of reports of the product-event pair of interest, other more advanced statistical
methods are employed, such as the Multi-Item Gamma Poisson Shrinker (MGPS), which
produces Empirical Bayesian Geometric Mean (EGBM) scores. Several FDA Centers, including
CDER, CBER, and CFSAN, use the MGPS algorithm for their routine surveillance activities.
Various commercially available software programs generate PRR and/or EBGM scores, e.g.,
Oracle Health Sciences Empirica Signal. Oracle Health Sciences Empirica Signal is utilized by
the FDA for routine mining of drugs, foods, cosmetics, and dietary supplements, as shown in the
table below.
Product type
Drugs
Foods,
Cosmetics, and
Dietary
Supplements
Current #
reports
received
770,000 in
2013
Database
start date
Cumulative
# of reports
Stage of
use
Method or
tool
1968
>7,000,000
Routine
6,000 in
2013
2002
40,500
Routine
MGPS
with
Empirica
Signal
MGPS
with
Empirica
Signal
KEY POINTS
Encouraged by the success of using data mining methods for safety report analysis, FDA
experts have started to apply the techniques to other types of data.
Type of
data
Clinical
study data
in drug
applications
Stage of use
of data mining
Routine
Data mining
purpose
Save reviewers
from having to
create the tables
and graphs
CDER has applied Oracle Health Sciences Empirica Study to analyze drug clinical trial data in
either New Drug Applications or supplemental applications. Oracle Health Sciences Empirica
Study interfaces with data that conforms to the standardized Study Data Tabulation Model
(SDTM) of the Clinical Data Interchange Standards Consortium (CDISC) data standards to
create a wide set of automatically generated analytical outputs and tailor-made, reusable
tables and graphs. These outputs have helped reviewers to more efficiently analyze potential
safety issues in the clinical trial data of drugs approved by the FDA. The FDA notes the
benefits of data mining with these tools in the areas of standard processes (because data
mining is automated, the outputs are statistically objective and devoid of manual analyses),
simultaneous analysis (across an entire database at once), efficiency (analyses computed in
minutes), and the benefits of automated signal investigations (transparency with audit trails,
CONNECT W ITH US
oracle.com/healthsciences
drill-down capability, observation of signals over time, and study of a product in populations).
healthsciences_ww_grp@oracle.com
These tools are in the early stages of the automation efforts at the FDA. In the meantime, until
technology advancements and standardization practices proliferate, hands-on case reviews,
analysis of other data sources (e.g., FDA regulatory databases, the World Health Organization
drug safety report database, public scientific literature, and public knowledge databases) and
further epidemiologic assessments are necessary to characterize the clinical and public health
significance of signals generated by data mining analyses.
youtube.com/user/oraclehealthsciences
facebook.com/oraclehealthsciences
twitter.com/oraclehealthsci
blogs.oracle.com/health-sciences
Source: http://www.fda.gov/ScienceResearch/HealthInformatics/ucm446239.htm
To find out more and to access the full report, please click on the above link.
Contact: 1.800.633.0643
Copyright 2016, Oracle and/or its affiliates. All rights reserved. Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names
may be trademarks of their respective owners. Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks
are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron
logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group. 0116