Академический Документы
Профессиональный Документы
Культура Документы
Probability Estimation
Case Study:
Molecular Dynamics
1
24 Carbon Vapor Deposition (CVD)
2
24 Variable Definitions
Impact parameter b is the distance between the location of the central atom and the
point of intersection of the initial velocity vector and the diamond surface (the
origin of the axes).
Angle represents the angle between the x axis and the line from the origin to the
central atom.
3
24 Neural Network
PCNN p
NN
p b Neural Network ap PS p
PCNN p
trans
rot
4
24 Monte Carlo Data Collection
Data for training the neural network are obtained by molecular dynamics
(MD) simulations, where the motion of atoms and molecules in a material
under a given force are simulated, using known laws of physics to
calculate the forces on individual atoms
We use a total of 324 atoms to model the CVD system. Out of these, 282
atoms of diamond substrate are used to model the crystalline face with 40
atoms of hydrogen on the top layer of the diamond surface, and 2 atoms
in the C2 dimer.
The term Monte Carlo refers to the set of simulations that are obtained by
setting a number of the variables to random values for each trajectory. We
refer to the simulation of a single trajectory as an MD simulation, since
the principles of molecular dynamics are used to perform the
computations.
5
24 Training Data
The targets are obtained by estimating the probabilities of chemisorption,
scattering or desorption from the Monte Carlo trials:
PMC
X p N X
NT
where NX is the number of MD trajectories that resulted in reaction X, and NT
is the total number of MD trajectories computed in the Monte Carlo trials.
Since we do not know the true underlying reaction probabilities, we use the
estimates obtained from the Monte Carlo trials as target outputs for the neural
network. We can think of these estimates as noisy versions of the true
probabilities.
2000 different input/target pairs were generated. Of these, 70% were
randomly selected for training, 15% for validation, and 15% for testing. For
each trajectory, the p were generated randomly, using physically-appropriate
distributions for each variable. A total of 50 different trajectories were run to
obtain each estimated probability. This means that 2000x50 trajectories were
run to create the entire data set.
6
24 Network Architecture
Inputs Tan-Sigmoid Layer Softmax Layer
p1 1
a1 2
a2
5x1
W 1
S x1
W 3x1
1
S x5
n1 3xS 1 n2
1
S x1 3x1
1 b
1
1 b
2
1
5 S1x1 S 3x1 3
1 1 1 2 2 1 2
a = tansig(W p + b ) a = softmax(W a + b )
PCMC p
MC
t PS p
PCMC p
7
24 Softmax Transfer Function
expni
ai f ni S
expn
j 1
j
m Sm m
a1 ai a1
m
a1m a2m m m
a a
1 Sm
i 1
Sm
F m n m
m m
a2 a1 a2 ai a2
m m m
m m
a2 a S m
i 1
Sm
aSmm a1m aSmm a2m aS m ai aS m
m m m
i 1
8
24 Training Performance (S1=10)
1
10
Training
Validation
MSE
Validation
2
10
Training
3
10 0 1 2
10 10 10
Iteration # 9
24 Training and Validation Performance
10
24 Multiple Initial Conditions
11
24 Output vs Target Scatter Plots
1 1
0.8 0.8
0.6 0.6
PCNN PSNN
0.4 0.4
0.2 0.2
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
MC
P
C P
S
MC
1
0.8
0.6
NN
P D
0.4
0.2
0
0 0.2 0.4 0.6 0.8 1
MC
P D
12
24 Statistical Error in PMCX
0.8
PX 2 PXMC PX 2
0.6
PXMC PX 1 PX
0.4
NT
0.2
0
0 0.2 0.4 0.6 0.8 1
PX
13
24 Outputs vs Targets for NT=500
1 1
0.8 0.8
0.6 0.6
PCNN PSNN
0.4 0.4
0.2 0.2
0 0
0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1
PCMC PSMC
1
0.8
0.6
NN
P
D
0.4
0.2
0
0 0.2 0.4 0.6 0.8 1
MC
P
D
14
24 Reaction Prob. vs Impact Parameter
PCNN 0.5
0
0 0.5 1 1.5 2 2.5 3 3.5
PSNN 0.5
0
0 0.5 1 1.5 2 2.5 3 3.5
PDNN 0.5
0
0 0.5 1 1.5 2 2.5 3 3.5
b
15