Академический Документы
Профессиональный Документы
Культура Документы
A segment contribution activity coefficient model, derived from the polymer nonrandom twoliquid model, is proposed for fast, qualitative estimation of the solubilities of organic nonelectrolytes in common solvents. Conceptually, the approach suggests that one account for the liquid
nonideality of mixtures of complex pharmaceutical molecules and small solvent molecules in
terms of interactions between three pairwise interacting conceptual segments: hydrophobic
segment, hydrophilic segment, and polar segment. In practice, these conceptual segments become
the molecular descriptors used to represent the molecular surface characteristics of each solute
and solvent molecule. The treatment results in component-specific molecular parameters:
hydrophobicity X, polarity Y, and hydrophilicity Z. Once the molecular parameters are identified
from experimental data for common solvents and solute molecules, the model offers a simple
and practical thermodynamic framework to estimate solubilities and to perform other phase
equilibrium calculations in support of pharmaceutical process design.
Introduction
Solvent selection is a critical task in the chemical
synthesis and recipe development phase of the pharmaceutical and agricultural chemical industries.1-3 The
choice of solvents directly impacts the reaction rates,
extraction efficiency, crystallization yield, etc. Proper
solvent selection results in faster product separation and
purification, reduced solvent emission and lesser waste,
higher yield, lower overall cost, and better production
processes.
Solubility is a key property of concern in solvent
selection because pharmaceutical product isolation is
often done through crystallization at reduced temperature and/or with the addition of antisolvent. Solubility
data involving new drug molecules and their precursors
in the solvents rarely exist, if any. Although limited
solubility experiments are taken for a few solvents as
part of the process development practice, the experimental task can multiply rapidly when one considers
the choices of solvents and solvent-antisolvent mixtures, the effect of temperature, the impacts of impurities, the possibilities of multiple polymorphs, etc. As a
result, solvent selection is largely dictated by researchers preferences or prior experiences.
Existing solubility estimation techniques are best
represented by the Hansen model,4 the UNIFAC group
contribution model,5 and perhaps the Abraham solvation model.6 From the three, Hansen and UNIFAC are
activity coefficient models that can be used for the
estimation of solubilities in pure solvents and in solvent
mixtures. Other popular activity coefficient models, such
as van Laar, Wilson, nonrandom two liquid (NRTL), or
UNIQUAC, are not practical because use of these
models requires the determination of binary interaction
parameters from phase equilibrium data for each of the
solute-solvent and solvent-solvent binary mixtures.
Solute-solvent phase equilibrium data are rarely avail* To whom correspondence should be addressed. Tel.: (617)
949-1202. Fax: (617) 949-1030. E-mail: chauchyun.chen@
aspentech.com.
Ind. Eng. Chem. Res., Vol. 43, No. 26, 2004 8355
ln
lc
m
j xjGjmjm
k xkGkm
Solubility Modeling
xmGmm
mm
m
k xkGkm
k xkGkm
ln xSAT
)
I
fusS
Tm
1- ln SAT
I
R
T
(1)
ln lc,I
m )
for T e Tm
fusS ) fusH/Tm
j xj,IGjmjm
(2)
+ ln SAT
ln Ksp ) ln xSAT
I
I
(3)
ln I ) ln CI + ln RI
(4)
ln RI ) ln lc
I )
lc,I
rm,I[ln lc
m - ln m ]
m
k xk,IGkm
xj )
J xJrj,J
I i
xj,I )
j xj,IGjmjm
mm
m
k xk,IGkm
k xk,IGkm
(7)
(8)
xIri,I
rj,I
i ri,I
(9)
G ) exp(-R)
(10)
lc,S
rm,S[ln lc
m - ln m ]
m
(11)
with
(5)
(6)
xm,IGmm
xSAT
I
where
is the mole fraction of the solute I dissolved
in the solvent phase at saturation, fusS is the entropy
is the activity coefficient of
of fusion of the solute, SAT
I
the solute in the solution at saturation, R is the gas
constant, T is the temperature, and Tm is the melting
point of the solute. Given a polymorph, fusS and Tm
are fixed. At a fixed temperature, the solubility is only
a function of the activity coefficient of the solute in the
solution. Clearly, the activity coefficient of the solute
in the solution plays a key role in determining the
solubility.
Equation 1 is a simplified expression for solubility.
It ignores the contributions due to the difference between solid and liquid heat capacities at the melting
point and due to the pressure correction. When the
values of fusS and Tm are not available, the solubility
product constant, Ksp, can be introduced into eq 1 as an
adjustable parameter for data regression:
j xjGjmjm
Therefore
rm,S ) 1
(12)
ln lc,S
m ) 0
(13)
8356
lc
ln I)S
)
j xjGjSjS
k xkGkS
xmGSm
j xjGjmjm
Sm
m
k xkGkm
k xkGkm
(14)
GjS ) exp(-RjSjS)
(15)
GSj ) exp(-RjSSj)
(16)
ln CI ) ln
I
xI
rI )
I )
+ 1 - rI
J r
(17)
i ri,I
rIxI
J rJxJ
(18)
(19)
Ind. Eng. Chem. Res., Vol. 43, No. 26, 2004 8357
Table 1. NRTL Binary Parameters for Common Solvents in Pharmaceutical Process Design
solvent (component 1)
acetic acid
acetone
acetonitrile
anisole
benzene
1-butanol
2-butanol
n-butyl acetate
methyl tert-butyl ether
carbon tetrachloride
chlorobenzene
chloroform
cumene
cyclohexane
1,2-dichloroethane
1,1-dichloroethylene
1,2-dichloroethylene
dichloromethane
1,2-dimethoxyethane
N,N-dimethylacetamide
N,N-dimethylformamide
dimethyl sulfoxide
1,4-dioxane
ethanol
2-ethoxyethanol
ethyl acetate
ethylene glycol
diethyl ether
ethyl formate
formamide
formic acid
n-heptane
n-hexane
isobutyl acetate
isopropyl acetate
methanol
2-methoxyethanol
methyl acetate
3-methyl-1-butanol
methyl butyl ketone
methylcyclohexane
methyl ethyl ketone
methyl isobutyl ketone
isobutyl alcohol
N-methyl-2-pyrrolidone
nitromethane
n-pentane
1-pentanol
1-propanol
isopropyl alcohol
n-propyl acetate
pyridine
sulfolane
tetrahydrofuran
1,2,3,4-tetrahydronaphthalene
toluene
1,1,1-trichloroethane
trichloroethylene
m-xylene
water
triethylamine
1-octanol
12a
21a
12b
1.365
0.880
1.834
0.797
0.935
1.643
2.445
0.806
0.707
-1.108
1.244
1.787
1.490
-0.113
-0.165
-0.614
2.639
2.149
-0.148
1.309
0.884
1.121
0.368
-0.850
-0.194
-0.424
0.269
-0.168
1.430
1.534
2.870
3.021
2.131
4.263
-0.824
1.576
1.054
-0.138
0.589
0.325
1.245
1.636
1.246
0.533
-0.319
0.771
0.097
2.192
2.560
0.190
-0.940
1.400
-0.414
0.398
1.478
1.155
0.062
2.374
1.412
-0.036
-1.054
1.273
0.021
-0.583
2.027
3.270
0.496
-0.320
0.049
0.657
-0.523
2.567
2.558
1.099
-0.665
1.664
0.631
1.134
-0.869
0.535
1.026
1.981
-0.631
1.292
-0.197
-0.560
10.949
-0.908
-0.888
6.547
1.285
3.153
21b
3.207
4.284
1.983
0.450
-0.564
-1.167
-2.139
1.003
-0.024
-1.593
3.828
1.952
1.109
2.044
0.955
1.010
1.597
1.853
1.380
-1.660
12c
21c
3.692
-2.157
-1.539
5.977
5.843
5.083
5.314
4.013
3.587
7.369
7.026
4.954
6.012
2.833
9.519
4.783
0.508
3.828
1.612
3.103
-0.340
-1.202
6.547
10.949
6.547
10.949
0.103
1.389
0.715
-0.042
0.396
-0.566
2.751
3.029
-0.598
5.680
0.823
0.977
0.592
-0.235
1.968
2.128
4.868
2.702
0.437
2.556
-0.769
3.883
-1.479
5.269
-0.029
0.197
0.079
1.409
-0.990
1.045
1.773
3.583
2.541
2.032
2.571
3.146
0.396
0.563
4.241
7.224
-0.169
0.301
4.997
8.939
1.200
1.763
solvent characteristics
complex
polar
polar
hydrophobic
hydrophobic
hydrophobic/hydrophilic
hydrophobic/hydrophilic
hydrophobic/polar
hydrophobic
hydrophobic
hydrophobic
hydrophobic
hydrophobic
hydrophobic
hydrophobic
hydrophobic
hydrophobic
polar
polar
polar
polar
polar
polar
hydrophobic/hydrophilic
hydrophobic/hydrophilic
hydrophobic/polar
hydrophilic
hydrophobic
polar
complex
complex
hydrophobic
hydrophobic
polar
polar
hydrophobic/hydrophilic
hydrophobic/hydrophilic
polar
hydrophobic/hydrophilic
hydrophobic/polar
polar
hydrophobic/polar
hydrophobic/polar
hydrophobic/hydrophilic
hydrophobic
polar
hydrophobic
hydrophobic/hydrophilic
hydrophobic/hydrophilic
hydrophobic/hydrophilic
hydrophobic/polar
polar
polar
polar
hydrophobic
hydrophobic
hydrophobic
hydrophobic
hydrophobic
hydrophilic
hydrophobic/polar
hydrophobic/hydrophilic
a NRTL binary parameters for various solvent-hexane systems. The NRTL nonrandom factor parameter, R, is fixed as a constant
of 0.2. In these binary systems, the solvent is component 1 and hexane is component 2. s were determined from available VLE and LLE
data. b NRTL binary parameters for various solvent-water systems. The NRTL nonrandom factor parameter, R, is fixed as a constant
of 0.3. In these binary systems, the solvent is component 1 and water is component 2. s were determined from available VLE data.
c NRTL binary parameters for various solvent-water systems. The NRTL nonrandom factor parameter, R, is fixed as a constant of 0.2.
In these binary systems, the solvent is component 1 and water is component 2. s were determined from available LLE data.
8358
X
Y1.643
1.834
0.2
X
Z
6.547
10.949
0.2
YZ
-2.000
1.787
0.3
Y+
Z
2.000
1.787
0.3
X
Y+
1.643
1.834
0.2
Ind. Eng. Chem. Res., Vol. 43, No. 26, 2004 8359
Table 3. NRTL-SAC Molecular Parameters for Common
Solvents
solvent name
Y-
Y+
acetic acid
acetone
acetonitrile
anisole
benzene
1-butanol
2-butanol
n-butyl acetate
methyl tert-butyl ether
carbon tetrachloride
chlorobenzene
chloroform
cumene
cyclohexane
1,2-dichloroethane
1,1-dichloroethylene
1,2-dichloroethylene
dichloromethane
1,2-dimethoxyethane
N,N-dimethylacetamide
N,N-dimethylformamide
dimethyl sulfoxide
1,4-dioxane
ethanol
2-ethoxyethanol
ethyl acetate
ethylene glycol
diethyl ether
ethyl formate
formamide
formic acid
n-heptane
n-hexane
isobutyl acetate
isopropyl acetate
methanol
2-methoxyethanol
methyl acetate
3-methyl-1-butanol
methyl butyl ketone
methylcyclohexane
methyl ethyl ketone
methyl isobutyl ketone
isobutyl alcohol
N-methyl-2-pyrrolidone
nitromethane
n-pentane
1-pentanol
1-propanol
isopropyl alcohol
n-propyl acetate
pyridine
sulfolane
tetrahydrofuran
1,2,3,4-tetrahydronaphthalene
toluene
1,1,1-trichloroethane
trichloroethylene
m-xylene
water
triethylamine
1-octanol
0.045
0.131
0.018
0.722
0.607
0.414
0.335
0.317
1.040
0.718
0.710
0.278
1.208
0.892
0.394
0.529
0.188
0.321
0.081
0.067
0.073
0.532
0.154
0.256
0.071
0.322
0.164
0.109
0.131
0.157
0.513
0.883
0.217
0.448
0.257
0.707
1.340
1.000
1.660
0.552
0.088
0.052
0.236
0.419
0.673
1.162
0.247
0.673
0.566
0.197
0.025
0.898
0.474
0.375
0.351
0.514
0.205
0.210
0.235
0.443
0.604
0.548
0.426
0.758
0.190
0.007
0.082
0.030
0.219
0.194
0.030
0.564
2.890
0.086
0.081
0.318
0.049
0.141
0.041
0.089
2.470
0.154
0.149
0.043
0.224
0.036
0.224
0.485
0.355
0.330
0.172
0.141
0.424
0.039
0.541
0.691
0.208
0.832
1.262
0.858
0.157
0.372
0.401
0.507
0.237
0.421
0.338
0.165
0.280
0.341
0.108
0.498
0.027
0.251
0.337
0.538
0.469
0.251
0.480
0.469
0.067
0.322
0.252
0.562
0.560
0.314
0.485
0.305
1.216
0.223
0.030
0.070
0.134
0.135
0.426
0.040
0.555
0.320
0.003
0.587
0.174
0.248
0.511
0.353
0.457
0.304
0.287
0.285
0.021
0.316
1.000
0.557
0.766
0.105
0.032
0.624
0.335
8360
wt %
mole
fraction
methanol
acetone
ethanol
1,4-dioxane
acetic acid
methyl ethyl ketone
2-propanol
isoamyl alcohol
chloroform
diethyl ether
n-octanol
1,2-dichloroethane
1,1,1-trichloroethane
cyclohexane
33
29
20
19
12
12
10
10
6
5
3
3
0.5
0.005
8.053 10-2
1.163 10-1
6.007 10-2
1.029 10-1
4.347 10-2
5.174 10-2
5.924 10-2
5.155 10-2
4.057 10-2
2.119 10-2
2.186 10-2
1.670 10-2
3.706 10-3
2.335 10-5
NRTL-SAC
(mole fraction)
NRTL-SAC
(four solvents)b
(mole fraction)
UNIFAC
(mole fraction)
Hansen
(mole fraction)
7.950 10-2
1.084 10-1
3.907 10-2
1.130 10-1
1.709 10-1
4.838 10-2
3.257 10-2
4.552 10-2
4.547 10-2
1.127 10-2
2.491 10-2
1.352 10-2
2.743 10-3
4.962 10-5
8.053 10-2
1.163 10-1
3.208 10-2
1.204 10-1
1.670 10-1
5.016 10-2
2.903 10-2
4.195 10-2
4.057 10-2
9.081 10-3
2.015 10-2
1.232 10-2
2.001 10-3
2.335 10-5
7.722 10-2
8.782 10-2
1.606 10-2
5.699 10-2
9.522 10-2
6.596 10-2
2.897 10-2
1.490 10-2
9.735 10-2
1.685 10-2
1.453 10-2
3.969 10-2
3.750 10-2
9.351 10-4
4.256 10-2
7.892 10-2
4.643 10-2
1.997 10-2
9.053 10-2
5.642 10-2
7.174 10-2
5.155 10-2
3.369 10-2
2.558 10-2
3.664 10-2
2.809 10-2
2.238 10-2
4.695 10-3
aLiterature data, UNIFAC prediction results, and Hansen correlation results are taken from Frank et al.2
solvents are acetone, cyclohexane, methanol, and chloroform.
MW
no. of solvents
T (K)
aspirin
aspirin
p-aminobenzoic acid
benzoic acid
camphor
ephedrine
lidocaine
methylparaben
testosterone
theophylline
estriol
estrone
morphine
piroxicam
hydrocortisone
haloperidol
180.16
180.16
137.14
122.12
152.23
165.23
234.33
152.14
288.41
180.18
288.38
270.37
285.34
331.35
362.46
375.86
14
4
7
7
7
7
7
7
7
7
9a
12
6
14b
11c
13d
298.15
298.15
298.15
298.15
298.15
298.15
298.15
298.15
298.15
298.15
298.15
298.15
308.15
298.15
298.15
298.15
0.103
0.039
0.218
0.524
0.604
0.458
0.698
0.479
1.051
Y-
0.853
0.499
0.773
0.665
0.401
0.827
0.681
0.089
0.124
0.068
0.596
0.484
0.771
0.757
Y+
ln Ksp
rms error in ln x
1.160
1.372
1.935
0.450
0.478
0.777
0.799
0.760
0.405
-2.630
-2.582
-2.861
-1.540
-0.593
-0.296
-0.978
-2.103
-3.797
-6.110
-7.652
-6.531
-4.658
-7.656
-6.697
-4.398
0.506
0.533e
0.284
0.160
0.092
0.067
0.027
0.120
0.334
0.661
0.608
0.519
1.007
0.665
0.334
0.311
0.679
0.293
1.218
0.233
1.208
0.291
1.521
0.970
1.803
1.248
0.193
0.172
0.683
0.669
0.341
1.928
0.196
1.811
0.169
0.611
0.131
a With THF excluded. b With 1,2-dichloroethane, chloroform, diethyl ether, and DMF excluded.
chloroform and DMF excluded. e 14 solvents.
SAC including both molecular descriptors and parameters, the results are considered to be very satisfactory.
To test the predictive capability of NRTL-SAC, we
also fit the aspirin solubility data using only four
representative solvents (i.e., acetone for the polar
solvent, cyclohexane and chloroform for the hydrophobic
solvents, and methanol for the hydrophilic solvent) and
then use the identified molecular parameters to estimate the aspirin solubilities in the other 10 solvents.
As shown in Table 5, the molecular parameters for
With
Ind. Eng. Chem. Res., Vol. 43, No. 26, 2004 8361
extract and test the solubility data for the eight molecules reported by Lin and Nash.14 We also test the
model against six additional molecules with sizable
solubility data sets.
We apply the model with the solvents that are
included in Table 3. The molecular parameters determined for the solutes and the rms errors in ln x for the
fits are summarized in Table 5. A good representation
of the solubility data is obtained. The average rms error
in ln x for the 14 solutes (aspirin excluded) summarized
in Table 5 is 0.37. It corresponds to (45% accuracy in
solubility predictions. Certainly, the quality of the fit
reflects both the effectiveness of NRTL-SAC and the
quality of the molecular parameters identified from the
limited available experimental data for the solvents.
Figure 6. UNIFAC results for aspirin solubility at 298.15 K.
Solubility data2 for all 14 solvents are predicted with UNIFAC.
Conclusions
The NRTL-SAC model is a practical thermodynamic
framework for solubility modeling in pharmaceutical
process design. The model requires only componentspecific molecular parameters that represent the surface
interaction characteristics of the molecules. For solute
molecules, these parameters are identified from solubility measurements of the solute in a few representative
solvents, i.e., hydrophobic, hydrophilic, and polar solvents. The model is a useful tool for qualitative correlation and prediction of phase behavior, i.e., solubility, of
systems with large, complex pharmaceutical solutes in
common solvents.
Figure 7. Hansen correlation results for aspirin solubility at
298.15 K. Solubility data2 for all 14 solvents are fit simultaneously
with Hansen.
NRTL-SAC molecular descriptors are good representations of molecular surface interaction characteristics and
that the solvent molecules used to identify molecular
parameters for the solute can be thought of as molecular
sensors used to elucidate the surface interaction
characteristics of the solute molecule in solution. These
molecular sensors probe and express the solutesolvent interactions in terms of binary phase equilibrium data, i.e., solubility.
Note that, during the data regression, all experimental solubility data, regardless of their order of magnitude, were assigned with a standard deviation of 20%.
In addition to the experimental data and the NRTLSAC results for aspirin at room temperature, Table 4
also includes the UNIFAC prediction results and the
Hansen correlation results reported by Frank et al.2 To
be sure of the UNIFAC predictions and the Hansen
correlations, we duplicated Franks results. With UNIFAC and Hansen, the rms errors in ln x for the 14
solvents are 1.352 and 1.600, respectively. Figures 6 and
7 show the comparisons of experimental data and
computed solubilities with UNIFAC and Hansen. The
outliers could be attributed to either poor experimental
data or poor model representations. Given that the
NRTL-SAC results are clearly superior to those of
UNIFAC and Hansen, the results illustrate the relative
inability of UNIFAC and Hansen to capture solvent
effects on the solubility of aspirin.
The data compilation of Marrero and Abildskov13
provides a good source of public solubility data for large,
complex chemicals. To further test NRTL-SAC, we first
Acknowledgment
The authors are grateful to Hsien-Hsin Tung, Daniel
E. Bakken, Christopher Rentsch, and Jose E. Tabora
of Merck for their critical evaluation of NRTL-SAC,
UNIFAC, and Hansen models for solubility modeling
of Merck compounds in solvents and solvent mixtures.
We also thank Prof. John Prausnitz for his warm
encouragement and insightful critiques on the manuscript.
Literature Cited
(1) Gupta, A.; Gupta, S.; Groves, F. R., Jr.; McLaughlin, E.
Correlation of Solid-Liquid and Vapor-Liquid Equiibrium Data
for Polynuclear Aromatic Compounds. Fluid Phase Equilib. 1991,
64, 201.
(2) Frank, T. C.; Downey, J. R.; Gupta, S. K. Quickly Screen
Solvents for Organic Solids. Chem. Eng. Prog. 1999, Dec, 41.
(3) Kolar, P.; Shen, J.-W.; Tsuboi, A.; Ishikawa, T. Solvent
Selection for Pharmaceuticals. Fluid Phase Equilib. 2002, 194197, 771.
(4) Hansen, C. M. Hansen Solubility Parameters: A Users
Handbook; CRC Press: Boca Raton, FL, 2000.
(5) Fredenslund, A.; Jones, R. L.; Prausnitz, J. M. GroupContribution Estimation of Activity Coefficients in Nonideal Liquid
Mixtures. AIChE J. 1975, 21, 1086.
(6) Acree, W. E., Jr.; Abraham, M. H. Solubility Predictions for
Crystalline Nonelectrolyte Solutes Dissolved in Organic Solvents
Based upon the Abraham General Solvation Model. Can. J. Chem.
2001, 79, 1466.
(7) Klamt, A.; Eckert, F. COSMO-RS: a Novel and Efficient
Method for the a Priori Prediction of Thermophysical Data of
Liquids. Fluid Phase Equilib. 2000, 172, 43.
(8) Lin, S.-T.; Sandler, S. I. A Prior Phase Equilibrium Prediction from A Segment Contribution Solvation Model. Ind. Eng.
Chem. Res. 2002, 41, 899.
8362