Академический Документы
Профессиональный Документы
Культура Документы
Natural Products
on CD-ROM
This introduction screen gives access to (a) a general introduction to the
scope and content of DNP on CD-ROM, followed by (b) an extensive
review of the different types of natural product and the way in which they
are organised and categorised in DNP.
You may access the section of your choice by clicking on the
appropriate line below, or you may scroll through the text forwards or
backwards from any point.
Polyketides 29
Carbohydrates 35
Oxygen heterocycles 44
Benzofuranoids 48
Benzopyranoids 49
1
Flavonoids page 51
Tannins 60
Lignans 64
Terpenoids 72
Monoterpenoids 73
Sesquiterpenoids 77
Diterpenoids 101
Sesterterpenoids 118
Triterpenoids 121
Tetraterpenoids 131
Miscellaneous terpenoids 133
Meroterpenoids 133
Steroids 135
The sterols 140
Alkaloids 154
Alkaloids derived from ornithine 154
Alkaloids derived from lysine 156
Alkaloids derived from nicotinic acid 158
Alkaloids of polyketide origin 159
Alkaloids derived from anthranilic acid 160
Alkaloids derived wholly or in part from phenylalanine or
tyrosine 163
Isoquinoline alkaloids 163
Alkaloids derived from tryptophan 181
Monoterpenoid indole alkaloids 184
Terpenoid alkaloids 196
Steroidal alkaloids 200
Imidazole alkaloids 202
Oxazole alkaloids 202
Thiazole alkaloids 203
Pyrazine and quinoxaline alkaloids 203
Pyrrole alkaloids 203
Putrescine alkaloids 203
Spermine and spermidine alkaloids 203
Peptide alkaloids 204
Purines 205
Pteridines and analogues 205
Polypyrroles 206
2
Introduction to the DNP database
The Chapman & Hall/CRC Chemical Database is a structured database holding
information on chemical substances. It includes descriptive and numerical data
on chemical, physical and biological properties of compounds; systematic and
common names of compounds; literature references; structure diagrams and
their associated connection tables. The Dictionary of Natural Products on
CD-ROM is a subset of this database and includes all compounds contained in
the Dictionary of Natural Products (Main Work and Supplements).
The Dictionary of Natural Products (DNP) is the only comprehensive and
fully-edited database on natural products. It arose as a daughter product of the
well-known Dictionary of Organic Compounds (DOC) which, since its
inception in the 1930s has, through successive editions, always been a leading
source of natural product information.
In the early 1980s, following the publication of the Fifth Edition of DOC, the
first to be founded on database methods, the Editors and contributors for the
various classes of natural products embarked on a programme of enlargement,
rationalisation and classification of the natural product entries, while at the same
time keeping the coverage up-to-date. In 1992 the results of this major project,
which had grown to match DOC in size, were separately published in both book
(7 volumes) and CD-ROM format, leaving DOC with coverage of only the most
widely distributed and/or practically important natural products. DNP compi-
lation has since continued unabated by a combination of an exhaustive survey of
current literature and of historical sources such as reviews to pick up minor
natural products and items of data previously overlooked.
The compilation of DNP is undertaken by a team of academics and
freelancers who work closely with the in-house editorial staff at Chapman &
Hall. Each contributor specialises in a particular natural product class (e.g.
alkaloids) and is able to reorganise and classify the data in the light of new
research so as to present it in the most consistent and logical manner possible.
Thus the compilation team is able to reconcile errors and inconsistencies.
The resulting CD-ROM version, which is re-issued every six months,
represents an extremely well organised dictionary documenting virtually every
known natural product.
A valuable feature of the design is that closely related natural products (e.g.
where one is a glycoside or simple ester of another) are organised into the same
entry, thus simplifying and bringing out the underlying structural and
biosynthetic relationships of the compounds. Structure diagrams are drawn and
numbered in the most consistent way according to best stereochemical and
biogenetic relationships. In addition, every natural product is indexed by
structural/biogenetic type under one of more than 1000 headings, allowing the
rapid location of all compounds in the category, even where they have under-
gone biogenetic modification and no longer share exactly the same skeleton.
There is extensive (but not complete) coverage of natural products of
unknown structure, and the coverage of these is currently being enhanced by
various retrospective searches.
3
together into approximately 40,000 entries. The structure of an entry is shown
below.
Data Types
The format of a typical entry is given in Fig. 1, and shows the individual types
of data that may be present in an entry.
3
O
9
O
p-Menthan-9,3-olid-7-oic acid
4
DNP name Laudanosine
MeO
MeO NMe
Structural formula
H
OMe
(S)-form
OMe
(a) There are many instances in the primary literature of compounds being
named in ways which are gross violations of good IUPAC practice, e.g., where
the substituents are ordered non-alphabetically. These have been corrected.
(b) The number of trivial names used for acylating substituents has been kept
to a minimum but the following are used throughout.
Ph H
C C
H CO—
Cinnamoyl = 3-phenyl-2-propenoyl
Many other trivial appellations have from time to time appeared in the
literature for other acyl groups (e.g., Senecioyl = 3-methyl-2-butenoyl,
5
Feruloyl = 3-(4-hydroxy- 3-methoxyphenyl)-2-propenoyl or 4-hydroxy-
3-methoxycinnamoyl) but the systematic forms are usually employed except in
a few cases where the shortened form is used to abbreviate a very long and
unwieldy derivative descriptor as much as possible (e.g., for some of the
complex flavonoid glycosides).
(c) The term prenyl for the common 3-methyl-2-butenyl substituent,
(H3C)2C=CHCH2–, is used throughout.
(d) Names which are known to be duplicated within the chemical literature
(not necessarily within DNP), are marked with the sign†.
6
description which cannot be assigned to a particular stereoisomer described in
the entry.
For example, the CHCD entry for 2-Amino-3-hydroxy-3-phenylpropanoic
acid (β-Hydroxyphenylalanine, 9CI) has a general CAS number [1078-17-7]
and CAS numbers for all four optically active diastereoisomers [7352-06-9,
32946-42-2, 109120-55-0, 6524-48-4] as well as the two possible racemates
[2584-74-9] [2584-75-0]. However, among the additional registry numbers
quoted are the following:
[7687-36-7] – number for erythro-β-Hydroxyphenylalanine
[50897-27-3] – number for β-Hydroxy-L-phenylalanine
[68296-26-4] – number for β-Hydroxy-D-phenylalanine
[39687-93-9] – general number for the methyl ester, hydrochloride which
cannot be placed under any of the individual stereoisomers of
this compound described in the entry.
(e) Numbers may refer to derivatives similar to those described in the DNP
entry for which no data is available, or which have not yet been added to the
entry.
(f) Some DNP entries refer to families of compounds, such as the entry for
Calcitonin where only the porcine and human variants are described in detail.
The additional registry numbers given in this entry are those of a number of
other species variants which appear to have been identified according to CAS
but for which no attempt has been made to collate full data for DNP.
Diagrams
In each entry display there is a single diagram which applies to the parent entry.
Separate diagrams are not given for variants or derivatives.
Every attempt has been made to present the structures of chemical substances
as accurately as possible according to current best practice and IUPAC
recommendations. In drawing the formulae, as much consistency as possible
between closely related structures has been aimed at. Thus, for example, sugars
have been standardised as Haworth formulae and, wherever possible in complex
structures, the rings are oriented in the standard Haworth manner so that
structural comparisons can quickly be made. In formulae the pseudoatom
abbreviations Me, Et and Ac for methyl, ethyl and acetyl respectively, are used
only when attached to a heteroatom. Ph is used throughout whether attached to
carbon or to a heteroatom. Other pseudoatom abbreviations such as Pri for
isopropyl and Bz for benzoyl are not used in DNP.
Care must be taken with the numbering of natural products, as problems may
arise due to differences in systematic and non-systematic schemes. Biogenetic
numbering schemes which are generally favoured in DNP may not always be
contiguous, e.g., where one or more carbon atoms have been lost during
biogenesis.
Structures for derivatives can be viewed in Structure Search, but remember
that these structures are generated from connection tables and may not always
be oriented consistently.
Stereochemical conventions
Where the absolute configuration of a compound is known or can be inferred
from the published literature without undue difficulty, this is indicated. Where
only one stereoisomer is referred to in the text, the structural diagram indicates
that stereoisomer. Wherever possible, stereostructures are described using the
Cahn-Ingold-Prelog sequence-rule (R,S) and (E,Z) conventions but, in cases
where these are cumbersome or inapplicable, alternatives such as the
7
α,β-system are used instead. Alternative designations are frequently presented
in such cases.
The structure diagrams for compounds containing one or two chiral centres
are given in DNP as Fischer-type diagrams showing the stereochemistry
unequivocally. True Fischer diagrams in which the configuration is implied by
the North-South-East-West positions of the substituents are widespread in the
literature; they are quite unambiguous but need to be used with caution by the
inexperienced. They cannot be reoriented without the risk of introducing errors.
COOH COOH COOH COOH
H2N C H H2N C H or H2N H or H2N
CH3 CH3 CH3 CH3
DNP representation True Fischer projections
The relative stereochemical label (R*,S*) is first applied with the R* applying
to the chiral centre of higher priority (C-3). The absolute stereochemical
descriptor (S)- is then applied changing R* to S for the chiral centre of higher
priority and S* to R for the chiral centre of lower priority (C-2). For further
details, see the current CAS Index Guide.
For simplicity, the enantiomers of bridged-ring compounds, such as camphor,
are described simply as (+)- and (–)-. Although camphor has two chiral centres,
steric restraints mean that only one pair of enantiomers can be prepared.
For further information on the (R,S)-system, see Cahn, R,S et al, J. Chem.
Soc., 1951, 612; Experientia, 1956, 12, 81; Angew. Chem. Int. Ed. Engl., 1966,
5, 383.
Where appropriate, alternative stereochemical descriptors may be given using
the D, L or α,β-systems. For a fuller description of these systems, consult The
Organic Chemist's Desk Reference (Chapman & Hall, 1995).
8
general given for salts, hydrates or complexes (e.g. picrates) nor for most
"characterisation" derivatives such as acetates and methyl ethers of complex
natural products.
Where a derivative appears to have characterised only as a salt, the properties
of the salt may be given under the heading for the derivative. In such cases the
data is clearly labelled, e.g., Mp 179° (as hydrochloride).
Source
The taxonomic names for organisms given throughout are in general those given
in the primary literature. Standardisation of minor orthographical variations has
been carried out. Data in this field may be searched under Source/Synthesis or
All Text. Standards used are: Brummitt, R.K. (1992) Vascular Plant Families
and Genera, Royal Botanic Gardens, Kew; Willis, J.C. (1973) A Dictionary of
the Flowering Plants, Cambridge University Press, Cambridge; Gozmany, L.
(1990) Seven Language Thesaurus of European Animals, Chapman & Hall
London; Chemical Abstracts Service.
Importance/use
Care has been taken to make the information given on the importance and uses
of chemical substances as accurate as possible. Data in this field may be
searched under Use/Importance or All Text.
Type of Compound
All natural products are classified under one of more than 1050 headings
according to structural type, e.g., daucane sesquiterpenoid, pyrrolizidine
alkaloid, withanolide. Each structural type is assigned as a type of compound
code, e.g., VG0300, VX0150. Type of compound words and type of compound
codes may both be searched in Menu and Command search.
The full type of compound code index is given in Table 3, page 128 of the
printed User Manual, and in the Description of Natural Product Structures that
follows, each descriptive paragraph is followed by its Type of Compound
code(s).
Physical Data
Appearance
Natural products are considered to be colourless unless otherwise stated. Where
the compound contains a chromophore which would be expected to lead to a
visible colour, but no colour is mentioned in the literature, the DNP entry will
mention this fact if it has been noticed by the contributor.
An indication of crystal form and of recrystallisation solvent is often given
but these are imprecise items of data; most organic compounds can be
crystallised from several solvent systems and the crystal form often varies. In
the case of the small number of compounds where crystal behaviour has been
intensively studied (e.g. pharmaceuticals), it is found that polymorphism is a
very common phenomenon and there is no reason to believe that it is not
widespread among organic compounds generally.
9
(b) Where two or more melting points are recorded and differ by several
degrees (the most likely explanation being that one sample was impure), the
lower figure is given in parentheses, thus: 139° (134–135°).
(c) Where quoted figures differ widely and some other explanation such as
polymorphism or incorrect identity seems to be the most likely explanation,
both figures are quoted without parentheses, thus Mp 142°, Mp 205–206°.
(d) Known cases of polymorphism or double melting point are noted.
Boiling point determination is less precise than that of melting points and
conflicting boiling point data is not usually reported except when there appears
to be a serious discrepancy between the different authors.
Optical rotations
These are given whenever possible, and normally refer to what the DNP
contributor believes to be the best-characterised sample of highest chemical and
optical purity. Where available an indication of the optical purity (op) or
enantiomeric excess (ee) of the sample measured now follows the specific
rotation value.
Specific rotations are dimensionless numbers and the degree sign which was
formerly universal in the literature has been discontinued.
Solubilities
Solubilities are given only where the solubility is unusual. Typical organic
compounds are soluble in the usual organic solvents such as ether and
chloroform, and virtually insoluble in water. The presence of polar groups (OH,
NH2 and especially COOH, SO3H, NR+) increases water solubility.
pKa values
pKa values are given for both acids and bases. The pKb of a base can be
obtained by subtracting its pKa from 14.17 (at 20°) or from 14.00 (at 25°).
Spectroscopic data
Spectroscopic data such as uv wavelengths and extinction coefficients are given
only where the spectrum is a main point of interest, or where the compound is
unstable and has been identified only by spectroscopic data.
In many other cases, spectroscopic data can be rapidly located through the
references quoted.
10
The field of safety testing is a complex, difficult and rapidly expanding one,
and while as much care as possible has been taken to ensure the accuracy of
reported data, the Dictionary must not be considered a comprehensive source on
hazard data. The function of the reported hazard data is to alert the user to
possible hazards associated with the use of a particular compound, but the
absence of such data cannot be taken as an indication of safety in use, and the
Publishers cannot be held responsible for any inaccuracies in the reported
information, neither does the omission of hazard data in DNP imply an absence
of this data from the literature. Widely recognised hazards are included
however, and where possible key toxicity reviews are identified in the
references. Further advice on the storage, handling and disposal of chemicals is
given in The Organic Chemist's Desk Reference.
Finally, it should be emphasised that any chemical has the potential for harm
if it is carelessly used. For many newly isolated materials, hazardous properties
may not be apparent or may have been cited in the literature. In addition, the
toxicity of some very reactive chemicals may not have been evaluated for
ethical reasons, and these substances in particular should be handled with
caution.
Bibliographic References
The selection of references is made with the aim of facilitating entry into the
literature for the user who wishes to locate more detailed information about a
particular compound. Thus, in general, recent references are preferred to older
ones, particularly for chiral compounds where optical purity and absolute
configuration may have been determined relatively recently. The number of
references quoted cannot therefore be taken as an indication of the relative
importance of a compound, and the references quoted for important substances
may not be the most significant historically.
References are given in date order except for references to spectroscopic
library collections, which sort at the top of the list, and those to hazard/toxicity
sources which sort at the bottom.
The content of most references is indicated by means of suffixes, known as
reference tags. A list of the most common ones is given in Table 4, p. 145 of the
printed User Manual. For references describing a minor natural product which
has been included in DNP as a derivative of a parent compound, the reference
tag may be the identifying name of the natural product, e.g. (Laciniatoside II).
*RTECS® Accession Numbers are compiled and distributed by the National Institute for Occupational Safety and Health Service
of the U.S. Department of Health and Human Services of The United States of America. All rights reserved. (1996)
11
Some reference suffixes are now given in boldface type, where the editors
consider the reference to be particularly important, for example the best
synthesis giving full experimental details and often claiming a higher yield than
previously reported methods.
In some entries, minor items of information, particularly the physical
properties of derivatives, may arise from references not cited in the entry.
Journal abbreviations
In general these are uniform with the Chemical Abstracts Service Source Index
(CASSI) listing except for a short list of very common journals:
12
Description of
Natural Product Structures
This Description of Natural Product Structures is adapted from the printed
version of Dictionary of Natural Products, and revised for Dictionary of
Natural Products on CD-ROM
The purpose of this general introduction and review is to facilitate access to
the DNP Type of Compound Index which in turn leads on to the individual
DNP entries. The order of main sections is the same as in the Type of
Compound Index, and within the main sections the order of description of types
of compound in general parallels the order in which they appear in the Type of
Compound Index (except in the case of aliphatic natural products). Throughout
this Description, the names of natural products which are not specially
illustrated here but which are documented in DNP entres are given in boldface
type. (The names used in this Description may not necessarily be the Dictionary
entry names: use the Compound Name Index to locate substances if necessary.)
The various classes of natural product are described in respect of: (a)
structural characteristics; (b) nomenclature, including Chemical Abstracts
nomenclature; (c) biogenesis; (d) general biochemical significance and (e) any
other information about the class which is of general importance. For detailed
information about individual natural products it is necessary to locate the
compound within its entry, which will in turn facilitate access to the primary
literature.
Contents
Aliphatic natural products page 15
Semiochemicals 15
Lipids 22
Polyketides 29
Carbohydrates 35
Oxygen heterocycles 44
Benzofuranoids 48
Benzopyranoids 49
Flavonoids 51
Tannins 60
Lignans 64
13
Terpenoids page 72
Monoterpenoids 73
Sesquiterpenoids 77
Diterpenoids 101
Sesterterpenoids 118
Triterpenoids 121
Tetraterpenoids 131
Miscellaneous terpenoids 133
Meroterpenoids 133
Steroids 135
The sterols 140
Alkaloids 154
Alkaloids derived from ornithine 154
Alkaloids derived from lysine 156
Alkaloids derived from nicotinic acid 158
Alkaloids of polyketide origin 159
Alkaloids derived from anthranilic acid 160
Alkaloids derived wholly or in part from phenylalanine or
tyrosine 163
Isoquinoline alkaloids 163
Alkaloids derived from tryptophan 181
Monoterpenoid indole alkaloids 184
Terpenoid alkaloids 196
Steroidal alkaloids 200
Imidazole alkaloids 202
Oxazole alkaloids 202
Thiazole alkaloids 203
Pyrazine and quinoxaline alkaloids 203
Pyrrole alkaloids 203
Putrescine alkaloids 203
Spermine and spermidine alkaloids 203
Peptide alkaloids 204
Purines 205
Pteridines and analogues 205
Polypyrroles 206
14
Aliphatic natural products (VA)
A wide variety of small aliphatic and alicyclic compounds occur in nature.
Because they are diverse, no attempt will be made here to give a general
account; for information on specific aliphatics, their individual entries should be
consulted. Accounts are given below, however, of the semiochemicals, which
are structurally diverse but which are defined functionally and include many of
the aliphatic compounds of greatest biochemical importance and current
research interest, and of the lipids, which are structurally more or less well
defined.
In the Type of Compound Index, the aliphatic compounds included in the
Dictionary are simply classified by functional group and ring/chain structure.
A wide range of aliphatic compounds is documented as secondary products
from natural starting materials, e.g. in cooked foods, but these are outside the
scope of the Dictionary.
Semiochemicals
Semiochemicals are defined as chemicals that mediate communication between
individual organisms. The word is derived from the Greek word semeion which
means mark, sign or signal. Although some semiochemicals are released
purposefully (sex pheromones, the scent of flowers), others are released as a
consequence of normal metabolism, but nevertheless still convey information.
An example of the latter is the attraction of the tsetse fly to carbon dioxide in
the breath of cattle. It is of no consequence whatsoever to the fly that the
‘signal’ is intentional or otherwise.
Most workers exclude the consumption of food (but not its detection) and the
use of defence chemicals but again not their subsequent detection (cf. the stink
of the skunk) as semiochemical interactions. The study of semiochemical
interactions is termed chemical ecology. An alternative nomenclature
emphasises the transfer of information, and uses the term infochemical, instead
of semiochemical.
When semiochemicals act between members of the same species they are
known as pheromones from the Greek pherein to carry and horman to excite or
stimulate. They are the external counterparts of hormones which act as mes-
sengers between organs in the body. Pheromones have been divided into two
categories based on temporal criteria. Releaser pheromones elicit a response
which is immediate and usually behavioural, whereas primer pheromones cause
longer term physiological changes.
Pheromonal systems are usually the most highly developed semiochemical
systems because the species directly benefits from any improvement. Highly
developed in this context means that release of the pheromone is efficient and
timely and that the receiver has a sensitive and selective detection system.
Moreover because most pheromones are involved in reproductive functions
(mate attraction, courtship and copulation), increased efficacy is immediately
apparent in higher fecundity.
Semiochemicals that act between members of different species are called
allelochemics and these are further divided into allomones which cause an effect
favouring the emitter (such as the stink of the skunk) and kairomones which
favour the receiver (e.g. the odour of a prey species that attracts its predator).
Semiochemicals which favour both the receiver and the emitter are known as
synomones. A good example of this is the scent of a flower which attracts bees
to feed on nectar and pollinate them.
The most widely known semiochemicals are the volatile sexual attraction
15
pheromones of insects but volatility, although common, is not a prerequisite.
Semiochemicals may also be transferred by touch (e.g. the aphrodisiac
polypeptides of the golden hamster) or in solution (fish maturation factors).
They range in structure from carbon dioxide, ethylene (an attractant for bark
beetles), aliphatics, mono and polycyclic hetero and carbocycles through
steroids to polypeptides. All classes of organisms including yeast, corals,
crustacea, newts, fungi, plants, insects, spiders, mites, fish and mammals
employ some form of semiochemical. However only limited information is
available for amphibians, reptiles and birds.
Insect pheromones
There has been much interest in the use of insect pheromones for the monitoring
and control of insect pests in the field, gardens, stored products, food processing
factories and in commercial kitchens.
Semiochemicals of social insects, ants, termites, locusts, heteroptera and the
coniferphagus choristoneura have been reviewed (see bibliography); other
representative classes are reviewed below.
The structures of more lepidopteran pheromones are known than all other
pheromones together and moreover they are chemically fairly homogenous.
Typically they consist of an unbranched carbon chain with an even number of
carbon atoms, terminated by an oxygen containing group (acetate, alcohol or
aldehyde) with 0–4 alkenic bonds located predominantly towards the
hydrocarbon terminus of the chain.
The 1992 compilation of attractants for lepidoptera and other species by Arn
et al. includes 2292 semiochemicals from 1068 species, but only 264 unique
chemical structures. The most common attractants are Z9–14Ac (168 cases),
Z11–14Ac (157), Z11–16Ac (133), E11–14Ac (124) and Z7–12Ac (111).
Taken together these five chemicals are used by 30% of all species; however
80% of species use at least two components. Approximately 40% of the
structures have a terminal acetate group and the remaining 60% are evenly
divided between aldehydes, alcohols and hydrocarbons with a few 2-ketogroups.
Recently a nitrate terminal group was reported and presumably others remain to
be discovered. The commonest chain lengths in decreasing order of abundance
are C14, C12, C16 and C18. Approximately 40% are monoenes and 40%
dienes, with the double bonds predominantly located at the (ω-3) and (ω-5)
positions.
The biosynthesis of lepidopteran pheromones is controlled by a neuropeptide
(PBAN, pheromone biosynthesis activating neuropeptide). The pathway
commences with saturated fatty acids which undergo desaturation and
successive losses of terminal acetate units, followed by modification of the
terminal group. ∆9 and ∆11 desaturases for dodecanoic and tetradecanoic acids
(e.g. Bombykol) have been identified.
Methyl substituted straight-chain pheromones are fairly common in nature.
These range from simple systems with a sole methyl group remote from any
functionality or at the antiso position (ω-2) through n, (n + 6) and the more
common n, (n + 4) dimethyl compounds.
It is notable that n, (n + 4) polymethyl compounds have predominantly the
R-configuration. This results from incorporation of propionate into the normal
fatty acid biosynthetic pathway, in which the 2-pro-R proton is removed from
propionate. An interesting example which has been investigated in considerable
detail is 4R,8R-Dimethyldecanal, the aggregation pheromone of flour beetles.
2,6-Dichlorophenol is the female-produced pheromone for a number of ticks.
Several other species use Phenol and p-Cresol. However these are frequently
insufficient to elicit mating and a contact pheromone, Cholesteryl oleate, has
been identified from the American dog tick.
16
The aggregation pheromone of the acarid mite Lardoglyphus konoi is
Lardolure.
Fruit flies (Diptera) and some species of wasp (Hymenoptera) use simple
spiroketals as sex pheromones. 1,7-Dioxaspiro[5.5]undecane is the major
female produced sex attractant of the olive fly Bactrocera oleae (formerly
Dacus oleae). Two other minor components, the 3-hydroxy and 4-hydroxy
derivatives were also identified.
Simple spiroketal ring systems have also been identified as the structural
motif of a wide range of other pheromones, e.g. rectal gland secretions of male
Asian fruit flies, Chalcogran, the aggregation pheromone of the bark beetle
Pityogenes chalcographus, aggression inhibiting pheromones of the wasp
Paravespula vulgaris, sexual attractants of several species of Andrena bees and
cephalic secretions of cleptoparasitic bees.
Attack of a tree by a bark beetle is orchestrated by a complex score of
semiochemical notes. Initial attack by pioneers (females in monogamous
species) occurs in response to tree volatiles such as Myrcene. Upon landing
gallery excavation commences, followed by mating and release of pheromones
by the males. The synergistic blend of tree volatiles and the pheromones
produced by both sexes initiates mass attack, which overwhelms the tree’s
defences. Meanwhile fungal and yeast spores are introduced into the tree
adventitiously or from special chambers (mycangi) on the shoulders of the
beetles. These micro-organisms proliferate and block the sap channels which
prevents the galleries being flooded with resin and may produce the beetles’
aggregation pheromone. In some species such as the ambrosia beetle the larvae
feed on the fungus. Finally when the tree is fully colonised deterrent phero-
mones are produced by the beetle, or by the fungi.
Bridged spiroketals are predominantly found as pheromones of bark beetles
and usually have a 6,8-dioxabicyclo[3.2.1]octane skeleton. The first structure to
be elucidated was that of Brevicomin, the female-produced aggregation
pheromone of the western pine beetle, Dendroctonus brevicomis. Different
beetle species produce different isomeric Brevicomin compositions. Male
western pine beetles produce Frontalin which is the last component to be added
to the synergistic blend of exo-Brevicomin and Myrcene (from the host tree)
which initiates mass attack of the host tree by the beetles.
Some other notable examples are Multistriatin, the pheromone of the
European elm bark beetle, Scolytus multistriatis, responsible for ‘Dutch Elm’
disease, the male-produced pheromones of the swift moth, Hepialus hecta
which are one carbon higher homologues of the bark beetle pheromones
described above, and Lineatin which has the same carbon skeleton as
Grandisol.
Rodents
The behaviour and development of the house mouse (Mus musculus) is deter-
mined by a complex system of pheromonal effects mostly mediated by urine.
The best characterised of these are 3,4-Dehydro-exo-brevicomin and 2-sec-
Butyl-2-thiazoline which are testosterone-dependant aggression promotors,
isolated from the urine of adult male mice. One of the most interesting
developmental factors is the acceleration of puberty by volatiles from adult
urine. Recent evidence suggest that 3,4-Dehydro-endo-brevicomin is bound to a
lipocalin (a small peptide) in mouse urine. When the urine is deposited it
releases the volatile ligand which promotes investigation by other mice. When
immature mice pick up the lipocalin it accelerates development into puberty.
Dimethyl disulfide has been identified in male rabbit pellets and rat preputial
tissue. It is released in the vaginal secretions of the female hamster and acts as a
potent attractant.
17
Fish
The sexual development of many species of fish is determined by the presence
of steroids in the water. 17α, 20β-Dihydroxypregn-4-en-3-one is detected by
the medial olfactory tracts of male goldfish and causes gonadotrophin secretion.
The in vitro biosynthesis of several steroids has been demonstrated in the
ovaries of the female during the final stages of oocyte maturation. Male Atlantic
salmon are similarly affected by the corresponding 20-sulfate and Test-
osterone.
Herbivores
Hoofed mammals have many specialised glands for scent marking. The tail
gland in the red deer (Cervus elaphus) produces Phenol, m-Cresol, Benzoic
acid, 4-Ethylphenol, Dimethyl sulfone, o-Cresol, 3-Phenylpropanoic acid
and Phenylacetic acid. Many of these are common to other herbivore and
carnivore species.
Herbivores avoid areas which carnivores have marked. Treatment of conifer
saplings with red fox urine, a synthetic mixture of its volatile constituents or
Isopentenyl methyl sulfide suppressed feeding on the saplings by the snowshoe
hare (Lepus americanus); which is common prey for the red fox in winter.
Similarly trees in orchards were protected from feeding by voles and gophers by
these materials and by 2,5-Dihydro-2,4,5-trimethylthiazole, a constituent of
red fox faeces. (Z)-7-Dodecenyl acetate which is best known as a lepidopteran
sexual attractant in at least 111 species, is also produced by female elephants at
the time of ovulation.
Carnivores
The Carnivores (dogs, cats etc.) show a wide range of social behaviour ranging
from solitary individualism to hunting in packs. Territory marketing with urine,
faeces and various glands is common and social interactions initiated by sniffing
are common even amongst habitually solitary species.
The characteristic ‘skunky odour’ of fox urine is due to Methyl 2-phenyl-
ethyl sulfide and Isopentenyl methyl sulfide, which is also a component of wolf
urine and comprises more than 50% of the volatiles from female coyote urine.
Bis(3-isopentenyl) sulfide and Isopentanethiol are also found in coyote urine
and reach a maximum at oestrus. The latter is also a component of the defensive
‘scent’ of the striped skunk, together with a series of crotyl thiols. 2,2-Di-
methylthietane (Mustelan) was isolated from the anal glands of the mink and
the polecat which also contains 3,3-Dimethyl-1,2-dithiolane and Bis(3-iso-
pentenyl) sulfide. (2S)-2-Propylthietane is the major malodorous substance
from the anal gland of the stoat (Mustela erminea).
The anal sacs of the red fox secrete a mobile strongly smelling liquid con-
taining a complex mixture of short chain carboxylic acids, Trimethylamine and
diamines such as Putrescine and Cadaverine. The volatile fatty acids are
produced by anaerobic bacteria.
The anal sacs of the dog (Canis familiaris) and the coyote (Canis latrans)
also contain short chain carboxylic acids and Trimethylamine. Carboxylic acids
and medium chain esters, plus Indole and Hexanol were found in the anal gland
secretion of the aardwolf (Proteles cristatus), but no aliphatic amines were
detected. Indole is also present in the anal sac secretion of the ferret (Mustelo
furo), but the components which elicit the greatest attraction for conspecifics are
2-Propylthietane, 3-Propyl-1,2-dithiolane and cis and trans-2,3-Dimethyl-
thietane.
18
The supracaudal gland is present in most Canidae and is most highly
developed in the less social species. Sniffing the gland is common in social
interactions. The secretion of the gland has a floral odour and it is often
described as the ‘violet gland’. Dihydroactinidiolide and other volatile terpenes
have been detected in the distillate from the secretion of the red fox Vulpes
vulpes but very little is known about the other constituents.
Humans
The apocrine glands, which are mostly found in the armpit and on the head, are
believed to be the source of human pheromones. 5α-Androst-16-en-3-one (the
principal component of boar taint) and other steroids are secreted by males from
the armpits and modified by the microflora on the skin. Armpit extracts have
been shown to modify the menstrual cycle of women with irregular periods and
the curious phenomenon of menstrual synchronisation which occurs between
cohabiting women is believed to be mediated pheromonally. The odour of
steroids and the similar odour of musk has always played a significant role in
human culture, such as the base note of perfumes, the burning of incense and
the highly prized truffle.
Yeast
Saccharomyces cerevisiae (Bakers’ yeast) has two haploid mating types
designated as a and α which release the pheromones a-factor and α-factor
(respectively). These cause arrest of the cell cycle in the G1 phase and
development of pear shaped extrusions of the cell called shmoos. Agglutination,
cell fusion and nuclear fusion gives a zygote that produces a/α diploid buds.
Cells are able to discriminate amongst potential partners and select those with
the highest pheromone production. This has been termed courtship.
The genes MFα1 and MFα2 encode two α-factor precursors, consisting of
165 and 120 amino acids respectively. These are translocated into the classical
secretory pathway where they are glycosylated on asparagine and proteolysed to
give mature α-factor which is a 13 aminoacid peptide (WHWLQLKPGQPMY)
and released.
MFA1 and MFA2 are the two genes which encode 36 and 38 amino acid
peptide precursors of two forms of a-factor. They are synthesised and undergo
proteolysis in the cytoplasm to give 15-mers. These have a C-terminal -CAAX
motif, where C is Cysteine, A is an aliphatic aminoacid and X is variable. The
Cysteine undergoes S-farnesylation, the terminal tripeptide is cleaved, and
finally methylation of the terminal cysteine gives the a-factors (YIIKG(V or
L)FWDFA(S-Far)C-OMe). These both contain 12 aminoacid residues but differ
at residue 6.
The genes STE2 and STE3 encode the receptors for α and a-factor
respectively. The proteins are predicted to have seven potential transmembrane
domains, analogous to the β-adrenoceptor and rhodopsin receptors which
interact with mammalian G-proteins. Therefore G-proteins may also participate
in the mating signal transduction pathway in yeast.
Five unique pheromones have been identified from seven mating types of the
ciliated protozoa Euplotes raikovi. These consist of 38–40 aminoacid residue
polypeptides with an amino terminal aspartic acid residue and six conserved
half cystines, but otherwise they have little homology although Er-1189 and Er-
2 are bound equally well by some monoclonal antibodies. Different mating
types producing the same pheromone are able to mate and one type which
secretes Er-20 is unable to mate with any of the others.
Conjugative plasmid transfer by Enterococcus faecalis is induced by a
pheromone released by the prospective recipient.
19
Albone, E.S. (1984) in Mammalian Semiochemistry: The Investigation of Chemical
Signals Between Mammals, Wiley, Chichester, pp. 1–360.
Aldrich, J.R. (1988) Ann. Rev. Entomol., 33, 211.
Ali, M.F. et al, (1990) Biol. Rev. Cambridge Phil. Soc., 65, 227.
Anderson, D. et al, (1990) J. Mol. Biol., 216, 1.
Arn, H., Toth, M. and Priesner, E. (1986) List of Sex Pheromones of Lepidoptera and
Related attractants, OILB-SROP/IOBC-WRPS, Paris.
Arn, H., Toth, M. and Priesner, E. (1992) List of Sex Pheromones of Lepidoptera and
Related attractants, 2nd edn, OILB-SROP/IOBC-WRPS, Paris, pp. 1–179.
Ashby, M.N. et al, (1992) Prod. Natl. Acad. Sci. USA, 89, 4613.
Attygalle, A.B. et al, (1984) Chem. Soc. Rev., 13, 245.
Baker, R. et al, (1984) Nat. Prod. Rep., 1, 299.
Bedoukian, P. (1994) Perfum. Flavor, 19, 1 (rev).
Bell, W.J. and Carde, R.T. eds. (1984) The Chemical Ecology of Insects, Chapman &
Hall, New York.
Boivin, T.L.B. (1987) Tetrahedron, 43, 3309.
Brand, J.M. et al, (1979) Prog. Chem. Org. Nat. Prod., 37, 1.
Byers, J.A. (1991) Biol. Rev. Cambridge Phil. Soc., 66, 347.
Calkin, R.R. et al. (1994) Perfumery Practice and Principles, Wiley.
Claus, R. et al. (1979) Experientia, 35, 1674.
Claus, R. et al. (1981) Experientia, 37, 178.
Clissold, D. et al. (1994) Nat. Prod. Rep., 11, 621.
Dicke, M. et al. (1988) Functional Ecology, 2, 131.
Eriotou-Bargiota et al. (1992) Biochemistry, 31, 551.
Evans, D.A. et al. (1973) Chem. Soc. Rev., 2, 75.
Featherstone, C. (1990) Trends Biochem. Sci., 15, 169.
Fields, S. (1990) Trends Biochem. Sci., 15, 270.
Fiori, P.L. et al. (1990) J. Protozool., 37, 187.
Francke, W. (1984) in Advances in Invertebrate Reproduction, 3 (eds. E. Engels et al.),
Elsevier Science Publishers B.V., Amsterdam, pp. 493.
Freestone, N.P. (1988) Performance Chemicals, August, 26–29.
Galli, D. et al. (1991) J. Bacteriol., 173, 3029.
Gehrung, S. et al. (1990) J. Cell Biol., 111, 1451.
Hamilton, J.G.C. (1992) Parasitol. Today, 8, 130.
Harborne, J.B. (1993) Nat. Prod. Rep., 10, 327.
He, B. et al. (1991) Prod. Natl. Acad. Sci. USA, 88, 11373.
Hirsch, J.P. et al. (1992) Bioessays, 14, 367.
Hrycyna, C.A. et al. (1992) J. Biol. Chem., 267, 10457.
Hummer, H.E. and Miller, T.A. (eds.) (1984) Techniques in Pheromone Research,
Springer, Berlin.
Jackson, C.L. et al. (1991) Cell, 67, 389.
Jones, O.T. (1985) in Insecticides (D.H. Hutson and T.R. Roberts, eds.), Wiley,
Chichester, pp. 311–373.
Karlson, P. et al. (1959) Nature, 183, 155.
Kelly, D.R. (1990) Chem. Brit., 124.
Kelly, D.R. (1993) in The Chemistry of Natural Products, 2nd edn (ed. R.H. Thomson),
Blackie, Glasgow, pp. 382.
Kelly, D.R. (1996) Chem. Biol., 3, 595–602.
Kluge, A.F. (1986) Heterocycles, 24, 1699.
Ko, H.A. et al. (1990) J. Biol. Chem., 265, 21652.
Kurjan, J. (1992) Annu. Rev. Physiol., 54, 639.
Lake, M. (1989) Scents and Sensuality. The Essence of Excitement, John Murray,
London.
Law, J.H. et al. (1971) Ann. Rev. Biochem., 40, 533.
Lin, G. (1993) Pure Appl. Chem., 65, 1233 (rev, synth, insect pheromones).
Lipke, P.N. et al. (1992) Microbiol. Rev., 56, 180.
Luporini, P. et al. (1992) Dev. Genetics, 13, 9.
Lutz, R.J. et al. (1992) Prod. Natl. Acad. Sci. USA, 89, 3000.
Marcus, S. et al. (1990) Biochem. Biophys. Res. Commun., 172, 1310.
Marr, R.S. et al. (1990) J. Biol. Chem., 265, 20057.
20
Mayer, M.S. and McLaughlin, J.R. (1991) Handbook of Insect Pheromones and Sex
Attractants, CRC Press, Boca Raton.
Mori, K. (1992) Total Synth. Nat. Prod., 9, 1
Mori, K. (1997) Chem. Commun., 1153 (rev, pheromones)
Millar, J. and Haynes, K., Eds (1997) Methods in Chemical Ecology, Chapman & Hall,
N.Y., 1997
Morse, D. et al. (1986) J. Chem. Ecol., 12, 335.
Mundy, B.P. et al. (1977) Heterocycles, 6, 51.
Ortenzi, C. et al. (1990) J. Cell Biol., 111, 607.
Perron, F. et al. (1989) Chem. Rev., 89, 1617.
Pontius, L.T. et al. (1992) J. Bacteriol., 174, 1821, 3152.
Poucher, W.A. (1991) Poucher’s Perfumes, Cosmetics and Soaps, Volume 1, The Raw
Materials of Perfumery, 9th edn (A.J. Jouhar ed.), Chapman & Hall, London, pp.
236–237.
Prestwich, G.D. (1983) Sci. Am., 249, 68.
Prestwich, G.D. (1984) Ann. Rev. Entomol., 29, 201.
Proteau, G. et al. (1990) Biochem. Biophys. Res. Commun., 170, 182.
Rad, M.R. et al. (1992) Mol. Gen. Genet., 236, 145.
Raffioni, S. et al. (1992) Prod. Natl. Acad. Sci. USA, 89, 2071.
Rose, M.D. (1991) Ann. Rev. Microbiol., 45, 539.
Schafer, W.R. et al. (1990) Science, 249, 1133.
Schneider, D. (1992) Naturwissenschaften, 79, 241 (rev, pheromones)
Silk, P.J. et al. (1988) Ann. Rev. Entomol., 33, 83.
Silverstein, R.M. (1981) Science, 213, 1326.
Singer, A.G. (1991) J. Steroid Biochem. Mol. Biol., 39, 627.
Solomon, F. (1991) Ann. Rev. Cell Biol., 7, 633.
Stoddart, D.M. (1976) Mammalian Odours and Pheromones. The Institute of Biology’s
Studies in Biology no. 73, Edward Arnold, London.
Stoddart, D.M. (1988) Human odour culture: a zoological perspective, in Perfumery:
The psychology and biology of fragrance (eds S. Van Toller and G.H. Dodd),
Chapman & Hall, London, pp. 1–18.
Tesseire, P.J., Ed. (1995) Chemistry of Fragrant Substances, Allured Carol Stream, Il.,
USA.
Trotter, K.M. et al. (1990) Plasmid, 24, 57.
Tumlinson, J.H. (1989) Pure Appl. Chem., 61, 559.
Algae
The marine brown algae have evolved a unique pheromone system, the known
members of which are based on a range of acyclic or aliheterocyclic alkenes.
The majority of these are C11, examples being the Dictyopterenes, exemplified
by Dictyopterene A and Desmarestene. Some Fucus and Sargassum species
employ the smaller molecule 1,3,5-Octatriene (various stereoisomers). These
molecules modulate the navigation of the male gametophytes in the marine
environment over the very short ranges (~1 mm) required for fertilisation. They
are biosynthesised from highly unsaturated fatty acids by pathways not yet
understood; the precursor unsaturated acids such as eicosahexaenoic, are of a
type not found in terrestrial plants.
Dictyopterene A Desmarestene
21
recent characterisation of Lurlene, of a totally different structural type, as the
sex pheromone of the terrestrial green flagellate alga Chlamydomonas
allensworthii.
Jaenicke, L. et al, Angew. Chem. Internat. Ed. Engl., 1982, 21, 643–653.
Wirth, D. et al, (1992), Helv. Chim. Acta, 75, 734–744.
Lipids
Lipids have been defined in different ways at different times and there is still no
agreed definition of the term. Recent proposals are based mainly on chemical
structure and, in turn, on the underlying biosynthetic pathways; two recent
definitions read: ‘lipids are fatty acids and their derivatives, and substances
related biosynthetically or functionally to these compounds’ and ‘lipids are
compounds based on fatty acids or closely-related compounds such as the fatty
alcohols and the sphingosine bases’. This definition includes all the major
groups of materials generally recognised as lipids: it incorporates sterol esters
but not the free sterols.
22
(e.g. Isopalmitic, Anteisononadecanoic) or with carbocyclic units (e.g.
Sterculic).
(ii) Acids with one unsaturated centre are usually alkenic compounds with
cis- (Z-) configuration and with the double bond in one of a limited number of
preferred positions. This is most commonly ∆9 (e.g. Oleic) or n-9 (e.g. Erucic)
but the double bond can occur in other positions (e.g. Petroselinic) and
monoacetylenic acids are also known (e.g. Tariric).
(iii) Polyunsaturated acids are mainly poly-alkenic (cis-/Z-configuration) with
a methylene-interrupted arrangement of double bonds, i.e. double bonds are
separated from each other by one CH2 group. The pattern of 1,4-unsaturation is
characteristic of fatty acids and differs from that in isoprenoids which is usually
1,3-(conjugated) or 1,5-. Polyunsaturated fatty acids occur in biochemically
related families and the two most important are the n-6 family based on
Linoleic acid and the n-3 family based on α-Linolenic acid (see discussion on
biosynthesis below). Some acids have conjugated unsaturation which is both cis
and trans (e.g. Eleostearic, Calendic, Parinaric), some have mixed ene/yne
unsaturation both conjugated (e.g. Isanic) and non-conjugated (e.g.
Crepenynic), and there is a small group of acids in which unsaturation is not
entirely methylene-interrupted (e.g. Columbinic).
(iv) Fatty acids rarely have functionality apart from the carboxyl group and
the various types of unsaturation discussed above but acids are known with
fluoro, hydroxy, keto, and epoxy groups. Two important examples are
Ricinoleic (12-hydroxyoleic) and Vernolic (cis-12,13-epoxyoleic).
These generalisations have a biosynthetic basis and even some of the
exceptions can be accommodated in general biosynthetic schemes with only
minor modification.
The major biosynthetic pathways leading to fatty acids are summarised in
Figure 1. In the de novo pathway leading to saturated fatty acids, acetate (the
primer) condenses with malonate (the extender) to produce a C4 oxo acid which
is reduced in three steps to butanoate. This cycle of condensation and reduction
continues until, most commonly, palmitate has been obtained, though in lauric
oils rich in 12 : 0 the process stops mainly at the C12 level. The malonate is itself
derived from acetate by carboxylation in the presence of a biotin enzyme and
the carbon dioxide lost during condensation is that derived during carboxylation
so that the carbon atoms in butanoate and in the longer chain acids are entirely
acetate-derived.
a b
2:0 16:0 18:0
c b
n – 9 monoenes
b,c
18:1 n – 9 polyenes
d
b,c
18:2 n – 6 polyenes
b,c
18:3 n – 3 polyenes
23
If the acetate is replaced by a different primer then other fatty acids are
produced. This can be propionate (major product: heptadecanoate), 2-methyl-
propionate (iso acids), or 2-methylbutanoate (anteiso acids).
The chain-elongation process is similar in outline to the de novo process but
differs in some significant details. It operates with both saturated and
unsaturated acids and occurs with either acetate or malonate. Erucic acid is
made from Oleic acid by two chain-elongation steps:
Both Oleic acid and Erucic acid are n-9 monoenes. This chain-elongation process
is also important in the biosynthesis of polyenes (see below).
The most common route to monoene acids involves ∆9 desaturation. This
oxygen-requiring process occurs in plants, animals and microorganisms and
furnishes acids with a cis-double bond between carbon atoms 9 and 10, e.g. 9-
hexadecenoic, 9-octadecenoic (Oleic).
Further desaturation of Oleic acid to the 9,12-diene (Linoleic) and 9,12,15-
triene (α-Linolenic) occurs only in plants. The additional double bonds assume
a methylene interrupted pattern and lie between the existing double bond and
the methyl group. Animals requiring these acids for the production of n-6 and n-
3 polyene acids must obtain them through their dietary intake.
The bioconversion of 22 : 5 to 22 : 6 (and presumably of 22 : 4 to 22 : 5)
may occur by a more complicated pathway than that suggested in Figure 2
involving the sequence 22 : 5 → 24 : 5 (elongation) → 24 : 6 (desaturation) →
22 : 6 (chain shortening)
Desaturation between an existing double bond and the carboxyl group occurs
only rarely in plants (e.g. γ-Linoleic acid) but readily in animals. The additional
double bonds have cis- configuration and are introduced in a methylene-
interrupted pattern.
Some of these acids are required for the maintenance of health and are
designated essential fatty acids (efa). It is possible to produce efa deficiency in
n–6 n–3
Linolenic 18:2 18:3 α-Linolenic
desaturation
elongation
desaturation
— 22:4 22:5 —
desaturation
Figure 2. The n-6 and n-3 families of polynsaturated acids based on Linoleic and α-
Linolenic acids.
24
experimental animals but this state is rarely observed in humans who generally
ingest adequate quantities of Linoleic and α-Linolenic acid. Efa deficiency is
most likely to be observed under conditions where the normal enzymic
processes – especially desaturation – are impaired.
A wide variety of unusual fatty acids and phospholipids are found in sponges,
and these arise by totally different biosynthetic pathways.
Oxylipins (VA6150)
Three C20 acids – 20 : 3 (n-6), 20 : 4 (n-6), and 20 : 5 (n-3) – are precursors of
the PG1, PG2 and PG3 series of prostaglandins and of many other C20 metabo-
lites. These are known collectively as eicosanoids and are products of the
eicosanoid cascade.
The term oxylipin has been coined relatively recently to describe the class of
natural product, of which prostaglandins are the most widespread, that are
produced from C20 and in some cases C18 fatty acid precursors in at least one
stage of mono- or dioxygenase-dependent oxidation. Since it is now known that
C20 precursors are not universal, the term oxylipin is to be preferred to the
previous term eicosanoid.
The widest variety of structural types is found in marine organisms where
ring formation may produce three- (e.g. Constanolactones), five (e.g.
Ecklonialactone A) or six- (e.g. Manzamenone A) membered rings.
MeOOC
H (CH2)15CH3
H3C(CH2)15
COOMe
H O
HOOC
Manzamenone A
25
Glycerides (VA6700, VA6800, VA6900)
Fatty acids occur naturally as esters of Glycerol or of some other hydroxy
compound or as amides of long-chain amines such as Sphingenine. The less
common long-chain alcohols occur as esters or as ethers. Triacylglycerols are
major storage lipids whilst phospholipids (see below) are important membrane
constituents.
Acylglycerols are esters of glycerol and fatty acids. Partial glycerides are
important intermediates in metabolism and triacylglycerols are the major
constituents of natural fats and oils. In DNP glycerides are named as glycerol
triesters, e.g. entry name = Glycerol tri-9-octadecenoate.
In order to designate the stereochemistry of glycerol-containing components,
the carbon atoms of glycerol are numbered stereospecifically. When the glycerol
molecule is drawn in a Fischer projection with the secondary hydroxyl group to
the left of the central prochiral carbon atom, then the carbons are numbered 1, 2
and 3 from top to bottom. Molecules which are stereospecifically numbered in
this fashion have the prefix ‘sn’ immediately preceding the term ‘glycerol’ in
the name of the compound to distinguish them from compounds which are
numbered in a conventional fashion. The prefix ‘rac’ in front of the full name
shows that the compound is an equal mixture of both antipodes and ‘x’ is used
if the configuration is unknown or unspecified.
Any glycerolipid will be chiral when the substituents at the sn-1 and sn-3
positions are different. If both substituents are long-chain acyl groups then the
optical rotation will be extremely small.
Monoacylglycerols (monoglycerides) (VA6700) are fatty acid monoesters of
glycerol and exist in two isomeric forms:
α1 CH2OOCR CH2OH
β2 HO C H RCOO C H
α′ 3 CH2OH CH2OH
1-Monoacyl-sn-glycerol 2-Monoacyl-sn-glycerol
(α-isomer) (β-isomer)
CH2OOCR1 CH2OOCR1
R2COO C H HO C H
CH2OH CH2OOCR2
1,2-Diacyl-sn-glycerol 1,3-Diacyl-sn-glycerol
(α,β-isomer) (α,α′-isomer)
CH2OOCR1
R2COO C H
CH2OOCR3
Triacyl-sn-glycerol
These materials occur as mixtures with various acyl chains which may show
some fatty acid specificity. As a consequence particular fatty acids may be
concentrated in or excluded from particular positions in the glycerol ester. To
produce seed oils with more than 67% of a particular acid it may be necessary
to modify the acylating enzymes by genetic manipulation.
26
sn-glycerol-3-phosphate
1-acyl-sn-glycerol-3-phosphatea
1,2-diacyl-sn-glycerol-3-phosphate
2-monoacyl-sn-glycerolb
1,2-diacyl-sn-glycerol
phospholipids
triacylglycerol
(PC, PE, PS)
CH2OOCR1
R2COOCH O
CHOPOX
O–
27
H3C(CH2)14COSCoA
palmitoyl-CoA
serine
H3C(CH2)14COCH(NH2)CH2OH
3-ketodihydrosphinganine
H3C(CH2)14CH(OH)CH(NH2)CH2OH
dihydrosphinganine
H3C(CH2)12CH=CHCH(OH)CH(NH2)CH2OH
sphingenine
E
H3C(CH2)12CH=CHCH(OH)CHCH2OR2
NHR1
sphingenine (R 1 = R2 = H)
ceramides (R 1 = COR, R2 = H)
Fattorusso, E. et al. (1997) Prog. Chem. Org. Nat. Prod., 72, 215 (rev, glycolipids)
Gunstone, F.D. et al. (1994) The Lipid Handbook, 2nd edn, Chapman & Hall, London.
Gunstone, F.D. (1996) Fatty Acid and Lipid Chemistry, Blackie, London.
Jie, M.S.F.L.K. et al. (1997) Nat. Prod. Rep., 14, 163 (rev).
28
Polyketides (VC)
Fungi have the ability to produce a very wide range of structural types of
metabolite which are derived from a poly-β-ketomethylene chain. This chain is
formed by condensation of an acetyl unit (or other acyl unit) with malonyl or
methylmalonyl units, with concomitant decarboxylation as in fatty acid
biosynthesis but without the reduction of the intermediate β-dicarbonyl system.
The resulting polyketide chain can take part in internal aldol-type condensations
to give aromatic systems characterised by an alternating oxygenation pattern.
Alternatively reduction or partial reduction of the carbonyls during biosynthesis
can give rise to nonaromatic metabolites. One method of classifying polyketides
is by the number of acetate (or propionate) units in a metabolite; however, this
has the disadvantage of separating structurally similar types. The vast array of
polyketides is treated in DNP according to a mixture of structural, biosynthetic
and functional criteria. The advantage of this approach is that related
compounds are listed together. Aromatic polyketides are listed under the
appropriate aromatic grouping.
Herbert, R.B. (1989) The Biosynthesis of Secondary Metabolites, 2nd edn, Chapman &
Hall, London.
O’Hagan, D. (1991) The Polyketide Metabolites, Ellis Horwood, New York.
O’Hagan, D. (1992) Nat. Prod. Rep., 9, 447.
O’Hagan, D. (1995) Nat. Prod. Rep., 9, 447.
Simpson, T.J. (1991) Nat. Prod. Rep., 12, 1.
Turner, W.B. et al. (1983) Fungal Metabolites II, Academic Press, London.
H Cl
Br O
OH
Bermudenynol
29
OH
H O
O
O
H
H
O
H
OAc
Uvaricin
HO OH
22
OH
5
O
O CH3
15 1 O
O 1′
O NMe2
HO O OH
CH3 1″
OMe
3″
CH3
Erythromycin
H
O O O H O
HO O
O
CH3 CH3
OMe OMe H
O O
OH H
O
H
OMe
Avermectin A1a
30
Blizzard, T. et al. (1990) Recent Progr. Chem. Synth. Antibiot., 65 (synth).
Davies, H.G. et al. (1986) Nat. Prod. Rep., 3, 87 (Avermectins, Milbemycins).
Davies, H.G. et al. (1991) Chem. Soc. Rev., 20, 211; 271.
Fukagawa, Y. et al. (1988) Life Sci. Rep., 6, 267.
Kornis, G.I. et al. (1991) ACS Sympos. Ser. 443 (Avermectins, Milbemycins).
Nakata, M. et al. (1993) in Studies in Natural Product Chemistry, Vol. 12, (ed. Atta-ur-
Rahman), Amsterdam, Elsevier, pp. 35 (synth).
Neuzil, J. et al. (1986) Folia Microbiol. (Prague), 31, 402 (biosynth).
O’Hagan, D. (1989) Nat. Prod. Rep. 6, 205 (biosynth).
Omura, S. (1984) Macrolide Antibiotics, Chemistry, Biology and Practice, Academic
Press, London (general).
Omura, S. (1986) in Biotechnology, Vol. 4, (ed. H. Page), VCH, Weinheim, pp. 359
(general).
Paterson, I. et al. (1985) Tetrahedron, 41, 3569 (synth).
Seno, E.T. et al. (1986) in The Bacteria, Vol IX, (eds S.W. Queener et al.), Academic
Press, Orlando, pp. 231 (biosynth).
Tatsuta, K. (1990) Recent Prog. Chem. Synth. Antibiot., 1 (synth).
HO
AcO OH O
OH OH NH
MeO
H3C
O OH
O
O
Rifamycin
31
Polyenes (VC0300)
The group of antibiotics known collectively as polyenes is characterised by a
large lactone ring (20–44 membered) containing a series of conjugated double
bonds. This leads to the sub-division of the group into trienes, tetraenes etc. The
macrolide ring is often linked by a hydroxyl group to an aminosugar unit and
may have an aliphatic side chain possibly terminating with an aromatic residue.
Streptomyces are the usual producing organisms, and to date over 200
polyenes have been claimed. However, only some of these have established
structures. One reason for the paucity of structural information is that they are
often mixtures of closely related compounds. The advent of HPLC has enabled
better separation to be obtained and has indicated that many polyenes previously
considered to be defined were in fact mixtures of the same components but in
different proportions.
The macrolide ring is probably derived from acetate and propionate,
otherwise little is known about their detailed mechanism of biosynthesis.
Nystatin is a typical polyene antibiotic showing antifungal activity.
OH
COOH
OH OH OH OH
O
HO
OH
O O O
CH3
O
HO NH2OH
HO
Nystatin A1
Beau, J.M. (1990) Recent Prog. Chem. Synth. Antibiot., 135 (synth).
Bolard, J. (1986) Biochim. Biophys. Acta., 864, 257 (props).
Crandall, L.W. et al. (1986) in The Bacteria, Vol IX (ed. S.W. Queener) Academic
Press, Orlando.
Omura, S. (1984) Macrolide Antibiotics, Chemistry, Biology and Practice, Academic
Press, London.
Rinehart, K.L. (1983) Biotechnology, 1, 581 (ms).
Rychnovsky, S.D. (1990) Acta Pharm. Nord., 2, 155.
Simpson, T.J. (1985) Nat. Prod. Rep., 2, 321 (biosynth).
Thomas, A.H. (1986) J. Antimicrob. Chemother., 17, 269 (action).
32
The biochemistry of tetracycline production has been extensively studied
using mutant strains and cell-free systems to identify a variety of intermediates.
Biosynthetically tetracycline antibiotics are derived from oligoketides.
Angucyclines (VC0450)
The angucycline antibiotics are related to the tetracyclines but they have an
angular arrangement of the tetracyclic ring system as in Urdamycin A.
Angucyclinones are defined as natural products with a benz[a]anthracene
nucleus but no hydrolysable sugar moieties whereas the term angucycline
includes those with hydrolysable sugars.
O O
CH3
HO O
O OH
CH3 OH
O OH
O
HO O
O
CH3 CH3
O O
OH
HO
Urdamycin A
33
these properties have found little application. Their uses to date are mainly as
feed additives.
Biosynthetically, the polyethers are polyketide in origin. The major building
blocks are acetate, propanoate, and butyrate. There is evidence to suggest the
intermediacy of an epoxide in the formation of the tetrahydrofuran and
tetrahydropyran systems. Monensin A is a typical polyether antibiotic.
HO
CH2OH
O O O H H O OH
O H H
H
COOH
MeO
Monensin A
Berdy, J. (1986) in Biotechnology, Vol 4, (ed. H. Page) VCH, Weinheim, Ger., pp. 494.
Crandall, L.W. et al. (1986) in The Bacteria, Vol IX (ed. S.W. Queener et al.) Academic
Press, Orlando, pp. 385.
Denyer, S.P. et al. (1983) Antibiotics, Society for Applied Bacteriology, Washington, 77.
Dutton, C.J. et al. (1995) Nat. Prod. Rep., 12, 165.
Robinson, J.A. (1991) Prog. Chem. Org. Nat. Prod., 58, 1.
Siegel, M.M. et al. (1987) Biomed. Environ. Mass Spectrom., 14, 29 (ms).
Westley, J.W. (ed.) (1982) Polyether Antibiotics, Marcel Dekker, NY, (2 vols).
Westley, J.W. (1986) J. Nat. Prod., 49, 35 (biosynth).
Wieranga, W. (1981) in Total Synthesis of Natural Products (ed. J. Ap’Simon) Wiley,
New York.
Yonemitsu, O. et al. (1990) Recent Prog. Chem. Synth. Antibiot., 447 (synth).
O O
O
H
O
H O OMe
Aflatoxin B1
Heathcote, J.G. et al. (1978) Aflatoxins, Chemical and Biological Aspects, Elsevier,
Amsterdam.
Lacey, J. (1987) Trichothecenes and other Mycotoxins, Wiley, New York.
Simpson, T.J. (1984) Nat. Prod. Rep., 1, 287 (biosynth).
Steyn, P.S. et al. (1980) in Biosynthesis of Mycotoxins, Academic Press, New York
(biosynth).
Steyn, P.S. et al. (eds) (1986) Mycotoxins and Phycotoxins, Elsevier, Amsterdam.
34
Carbohydrates (VE)
This is an abbreviated account dealing only with aspects of carbohydrate
chemistry relevant to natural products. For a fuller coverage including synthetic
carbohydrates, see the companion disc Dictionary of Carbohydrates on CD-
ROM.
Carbohydrates comprise a family of polyhydroxy aldehydes, ketones and
acids, together with linear and cyclic polyols. They are diverse because they
exist as a wide range of stereoisomers.
These compounds are the most abundant and widespread organic substances
in nature and are essential constituents of all living matter. They are the most
important (in terms of volume and availability) of the non-nitrogenous
H C OH HO C H H C OH HO C H H C OH HO C H
H C OH HO C H HO C H HO C H HO C H H C OH
HO C H HO C H HO C H H C OH H C OH H C OH
H C OH H C OH H C OH H C OH H C OH H C OH
D-glucose, Hexitol; D-talose, Hexitol; D-galactose, Hexitol; D-mannose, Hexitol; D-glactose, Hexitol; D-altrose,
D-gulitol (– L- D-talitol galactitol D-mannitol D-gluctiol Hexitol;
glucitol) ( meso-) D-altritol,
(≡D-talitol)
CHO CHO
CHO CHO
HO C H H C OH
H C OH H C OH
CH2OH CH2OH
CHO
H C OH
CH2OH
D(R)-glyceraldehyde
Triol; glycerol
Figure 7. The fundamental aldoses, their corresponding alditols and their notional
relationships to glyceraldehyde, as illustrated for sugars of the D-series.
35
constituents of the chiral pool and are extremely important in chiral synthesis.
Of the 36 possible stereoisomeric pentoses, pentuloses, hexoses and hexuloses
only D-glucose, D-fructose, D-galactose, D-mannose and L-arabinose occur
naturally in the free state, but only the first two are found in significant amounts.
Photosynthesis is the means by which plants produce sugars from carbon
dioxide and water. In brief, it occurs by carbon dioxide being transferred to D-
erythro-pentulose-1,5-diphosphate to give, via an unstable β-keto-6-carbon acid,
two molecules of D-glyceric acid-3-phosphate, from which hexoses, for
example, D-fructose 1,6-diphosphate and D-glucose 1-phosphate can be formed.
Animals on the other hand use the reverse of the glycolysis metabolic pathway
to produce glucose from proteins and fats utilising phosphoenolpyruvate as an
intermediate. Most of the routes used by nature to interconvert sugars occur by
way of enzymic reactions on nucleoside diphosphate sugars, particularly
Uridine diphosphate glucose (UDPG) which gives D-galactose on
epimerization at C-4, D-glucuronic acid by oxidation at C-6 and D-xylose by
decarboxylation of this acid. Deoxygenation at C-6 and configuration changes at
C-4 and C-5 give L-rhamnose and by similar means the commonly occurring D-
sugars may be transformed into members of the L-series.
CH2OH CHO
HO C H H C OH
CH2OH
OH HO C H OH
H
CHO H CHO
OH H H C OH OH H
H HO
H C OH
H OH H OH
CH2OH
aldehydo – D-glucose
CH2OH CH2OH
HO C H HO C H
CH2OH CH2OH
O O OH O O OH
OH OH OH OH
OH HO OH HO
OH OH OH OH
α- D-glucofuranose β- D-glucofuranose α- D-glucopyranose β- D-glucopyranose
(Haworth)
HOH2C
O HOH2C
HO OH O
HO OH
HO
HO OH OH
(Mills) (Chair)
36
COCH2OH COCH2OH COCH2OH COCH2OH
H C H H C OH HO C H H C OH
HO C H HO C H H C OH H C OH
H C OH H C OH H C OH H C OH
COCH2OH COCH2OH
HO C H H C OH
H C OH H C OH
CH2OH CH2OH
D-threopentulose D-erythropentulose
(xylulose, lyxulose) (ribulose, adonose)
COCH2OH
H C OH
CH2OH
D-glycerotetrulose
(erythrulose, threulose)
CHO
H C OH
CH2OH
D(R)-glyceraldehyde
37
CHO
H C OH
D-gluco
HO C H
H C OH
H C OH
D-glycero
H C OH
CH2OH
D-glycero-D-gluco-heptose
38
In many natural products hydroxy groups other than at the anomeric centre of
the sugar are replaced by a thiol-group, an amino-group or a hydrogen atom.
Compounds arising from these changes are named respectively as thio-sugars
(e.g. 3-thioglucose), aminodeoxysugars (e.g. 3-amino-3-deoxyglucose) and
deoxy sugars but in this case the configuration of the remaining asymmetric
carbons must be described with a prefix (e.g. 3-deoxy-ribo-hexose not 3-
deoxyglucose).
Alditols (VE8600–VE8900)
The polyols obtained by reduction of the aldehyde function of an alditol (or the
keto function of a ketol) are known as alditols. An example is Mannitol. They
are named by a straightforward extension of the rules used for aldoses. The
alditol corresponding to a chiral sugar may be meso-, e.g. Galactitol.
Cyclitols (VE9000)
The polyhydroxycycloalkanes, known as cyclitols, are a group of natural
products closely related to the carbohydrates proper, of which the most
important are the inositols (1,2,3,4,5,6-cyclohexanehexols). Trivial names are
often used but systematic rules have been introduced to assign configurations at
each enumerated ring carbon atom and this requires the application of a
recommended numbering convention. Further information on the various
descriptions of stereochemistry for these compounds can be obtained by the
inspection of the individual Dictionary entries. It should be noted that some
meso-isomers in the series can have optically active derivatives.
39
myo-Inositol is the most abundant cyclitol, occurring both free and as its
derivatives. Its hexaphosphate is phytic acid and occurs in large amounts in
grain. myo-Inositol and its derivatives are universally present in cells and the
1,4,5-trisphosphate plays a vital role as a secondary messenger, which mediates
mobilization of intercellular calcium ions.
Disaccharides (VE9200)
These are formed by sugars combining with each other by way of glycosidic
links. If the anomeric position of a sugar is attached to the anomeric oxygen of
another, then non-reducing disaccharides such as the glucosylglucoside,
Trehalose, or the fructosyl-glucoside, Sucrose, are formed. The latter
compound is widespread, occurring in most plants. It constitutes a significant
part of man’s diet in Europe and the USA, being produced in pure form on a
larger scale than any other monomeric organic compound. If the glycosidic
bond in a disaccharide is to a non-anomeric hydroxyl of the second sugar, then
so-called reducing disaccharides are formed, Maltose and Lactose being typical
examples.
40
Glycosidic bonds in naturally occurring oligosaccharides and glycosides are
formed in natural glycosylations which take place primarily by way of the
nucleoside diphosphate sugars as follows:
nucleoside DP sugar + ROH → sugar OR + nucleoside DPH
Disaccharides or their phosphates are produced when ROH is a sugar or a
sugar phosphate. Polysaccharide biosynthesis is basically similar but requires an
oligomer primer as an acceptor; glycogen synthesis follows the course:
primer + UDPG → Gα-primer + UDP
there being one enzyme present which catalyses the formation of 1,4-bonds and
another responsible for glycosylations at position 6. The biosynthesis of
cellulose and other polysaccharides is basically similar, UDP being the
nucleoside diphosphate used predominantly. However, starch synthesis depends
rather on adenosine diphosphate.
Carbohydrates are important components of a variety of significant
biopolymers, the most obvious being DNA and RNA which contain 2-Deoxy-D-
ribose and D-Ribose respectively. Glycoproteins and glycolipids (described
elsewhere) are another important group, although the carbohydrates are usually
present as oligomers, comprising sugars drawn from the following: 2-
acetamido-2-deoxy-D-glucose, 2-acetamido-2-deoxy-D-galactose, D-mannose, D-
galactose, sialic acid and L-fucose, which are glycosidically attached to the
protein. They are widely distributed in all living organisms, occurring bound to
cell membranes and in a free soluble form. Their role has only recently been
appreciated as it becomes evident that they serve as recognition sites for a
variety of intra- and intermolecular communication events. They function as
specific binding sites for enzymes, hormones, soluble toxins, bacteria, and
viruses. They are also implicated in cell-cell recognition and interaction, such as
cell adhesion which is particularly important in inflammatory diseases. They
play a role in the development of normal cells and metastasis in cancer cells.
The structural basis for blood ABO(H) and Lewis group antigenicities resides in
the oligosaccharide portions of blood cell glycolipids and the associated
secreted glycoproteins.
Plant glycosides
Glycosides of many different aglycones are ubiquitous in the plant kingdom.
Many are formed from phenols, polyphenols, steroidal and terpenoidal alcohols
by glycosidic attachment to sugars. In a majority of cases D-glucose is present
but L-rhamnose, D- and L- fucose and L-arabinose occur quite frequently. Of the
pentoses, L-arabinose is more common than D-xylose and the sugars often occur
as oligosaccharides. For example, Digitonin and Digitoxin from Digitalis
contain respectively a branched heteropentasaccharide, comprising two glucose
41
and two galactose molecules and xylose, and a linear homotrisaccharide of 2,6-
dideoxy-β-D-ribo-hexose.
Tannins (see separate section below) are a special case of plant glycoside.
In general, the plant glycosides are so numerous that their sugar components
are not reported as such in the carbohydrate section of the Type of Compound
Index, except for a few special classes described below.
Aminoglycoside antibiotics
Aminoglycosides constitute a large and diverse group of metabolites produced
by both bacteria and Streptomyces. The group covers those antibiotics
containing a highly functionalised cyclohexane aglycone, i.e. cyclitol,
glycosidically linked to amino or neutral sugar residues.
The streptamine cyclitol sub-group are Streptomyces metabolites which have
broad spectrum activity. They find clinical use both as topical and systemic
agents but exhibit varying degrees of oto- and nephrotoxicity thus limiting their
application. In addition many are rendered ineffective by resistant strains
carrying aminoglycoside inactivating enzymes.
The group of aminoglycosides containing inositol and/or monoamino-
cyclitols, which are produced by Streptomyces and Nocardia spp. represent a
much smaller subdivision.
In the Type of Compound Index, the aminoglycoside antibiotics are listed
under one or more structural categories (e.g. cyclitol), but a general
bibliography is given here.
42
Whelton, A. et al. (1982) The Aminoglycosides, Marcel Dekker, New York.
Williams, N.R. (ed.) (1986) Carbohydr. Chem., Royal Society of Chemistry, London,
18, 176.
Nucleosides (VE9900)
These are glycosides of purines, pyrimidines and other heterocyclic bases. The
well-known quartet of Adenosine, Guanosine, Cytosine and Thymidine are
fundamental biomolecules essential to life through their participation in the
structure of DNA and RNA. A small number of ‘hypermodified’ nucleosides
such as Wybutosine occur in bacterial nucleic acids.
A more prolific source of different nucleoside structures is the nucleoside
antibiotics which are analogues of the essential purine and pyrimidine
nucleosides. They consist of a sugar linked to a base either via a ring nitrogen or
through a ring C atom (the latter are designated C-nucleosides). Structurally
they are rather diverse but a subclassification is given by Isono (loc. cit.).
Although most of the compounds are Streptomyces metabolites, fungal and
bacterial products have also been identified.
Many nucleosides show interesting antiviral and antitumour properties, and
find wide application as biochemical tools. The commercial importance of
synthetic and semisynthetic compounds has led to a concentration of research
effort on synthetic methods and several good reviews on this subject are
available.
A few nucleotides (nucleoside phosphate conjugates) are also found, e.g.
Agrocin 84.
43
Oxygen heterocycles (VF)
Many simple natural products contain basic oxygen heterocycles – for example
the furan derivative, Furaneol, the pyran-2-one derivative, Triacetic acid lactone
and the 4-pyrone, Kojic acid. Although most of these simple oxygen
heterocyclic compounds can be seen to be derived from polyketides or
carbohydrates, some have unknown biosynthetic origins. The oxygen
heterocycles are listed under the headings: β-Lactones (VF1000), Furans
(VF2000), Butanolides (VF3000), Pyrans (VF4000), Pentanolides (VF5000), 2-
Pyrones (VF6000) and 4-Pyrones (VF7000). Natural products that contain these
substructures in terpenoid, steroid or alkaloid skeletons are not listed here.
O
O OH
OH
HO
CH3 H3C CH2OH
H3C O O O O
Furaneol Triacetic acid lactone Kojic acid
Davies-Coleman, M.T. et al. (1989) Prog. Chem. Org. Nat. Prod., 55, 1.
Dickinson, J.M. (1993) Nat. Prod. Rep., 10, 71.
Ley, S.V. (1991) in Heterocycles in Bioorganic Chemistry, (eds J. Bergman et al.), RSC,
London.
Turner, W.B. et al. (1983) Fungal Metabolites II, Academic Press, London.
O OH
H H O O
O O O
O O
Avenaciolide Glauconic acid
Spiroketals (VF8000)
There are a number of biologically active spiroketals, clearly of acetate origin,
exemplified by 1,7-Dioxaspiro[5.5]undecane, a sex pheromone of the olive fly
(see also Aliphatic Natural Products above).
O O
1,7-Dioxaspiro[5.5]undecane
44
Simple aromatic natural
products (VG)
Simple benzene derivatives
These may be of terpenoid, polyketide or shikimate origin. Those of terpenoid
origin, such as the aromatic p-menthanes are listed in the terpenoid section.
Since there is a large number of benzenoid compounds they have been
subdivided into simple benzenes (VG0005), simple phenols (VG0010), simple
benzyl alcohols (VG0020), simple benzaldehydes (VG0030), simple aryl
ketones (VG0035), simple benzoic acids (VG0040), phenylacetic acid
derivatives (VG0050) and simple phenylpropanoids (VG0060). Benzoquinones
are listed according to their number of oxygen substituents. (VG0300–VG0330)
with a separate code for prenylated representatives (VG0370).
Fungi are a prolific source of simple benzoquinones which in the main arise
by the polyketide route.
Shikimic acid is derived from glucose in plants via the shikimate pathway.
Shikimic acid is the biogenetic precursor of the aromatic amino acids,
Phenylalanine, Tyrosine and Tryptophan. As the shikimate pathway is found
in plants but not in animals there is a great deal of interest in targeting shikimate
pathway enzymes for control of plant growth, particularly after the success of
Glyphosate as a herbicide.
COOH
HO OH
OH
Shikimic acid
The shikimic acid pathway feeds many biosynthetic routes including those
involving p-aminobenzoic acid, anthranilic acid, cinnamic acid and other
phenylpropanoids and hence to many other classes of natural products including
the flavonoids and lignans.
45
Diphenylmethanes, acylphloroglucinols and benzophenones (VG0450,
VG0460, VG0500–VG0506)
A small number of simple diphenylmethanes occur naturally but there is a
growing number of acylphloroglucinols being identified. Acylphloroglucinol
derivatives may have more than one diphenylmethane linkage and various alkyl
substituents. They are formed by coupling of aromatic units. Other couplings
can lead to benzophenone derivatives. The benzophenones, like the
benzoquinones, are subdivided in the Type of Compound index according to the
number of oxygen substituents.
O O
HO OH
HO OH HO OH
HO OH
Acetylphoroglucinol Benzophenone
Diphenylmethanone, 9CI
2
1 3
4 8 9 1
7 2
10 6 3
9 5 10 4
8
7 6 5 O
O
Dibenzo[b,d]pyran Dibenzo[b,e]pyran
9H-Xanthene, 9CI
O
8 1
7 2
6 3
5 4
O
Xanthone
9H-Xanthen-9-one, 9CI
46
Depsides and depsidones; other lichen substances (VG0600–VG0660)
Depsides are esters of polyketide aromatic acids with polyketide phenols such
as Lecanoric acid. Depsidones have an additional ether linkage to form a seven
membered ring as in Colensoic acid. Depsides and depsidones are
predominantly found in lichens, and often carry one or more chlorine
substituents; halogenated depsides and depsidones are indexed separately.
O
O
CH3
CH3 OH
COO
MeO O COOH
HO OH COOH
OH
Lecanoric acid
(Depside) Colensoic acid
(Depsidone)
Sargent, M.V. (1984) Prog. Chem. Org. Nat. Comp., 45, 103.
Dibenzyl Stilbene
47
Benzofuranoids (VH)
Simple benzofurans (VH1000), benzodifurans (VH2000) and isobenzofurans
(VH3000) (including phthalides) are listed here. Dimeric phthalides are
Diels–Alder adducts belonging to the Angeolide Group (VH3200). 2-
Phenylbenzofurans are probably derived biogenetically from stilbenes.
4 3 5 4 3
5
6 2 6 2
7 1 7 1
8
O O O
Benzofuran Benzodifuran
Benzo[b]furan, 9CI Benzo[1,2-b : 5,4-b′]furan, 9CI
5
4 3 O O
2O
6
7 1 O
O
Isobenzofuran
1(3H)-Isobenzofuranone, 9CI Angeolide
48
Benzopyranoids (VI)
1-Benzopyrans (VI0030)
Historically 1-benzopyrans have been known as chromans, chromanones,
chromones, and 2- and 3-chromenes but in DNP the simpler members are
named systematically. A large number of natural products, of both polyketide
and shikimate derivation, occur naturally. The coumarins are the largest class of
1-benzopyran derivatives. They are found mainly in higher plants. Most natural
coumarins are oxygenated at C-7; Umbelliferone (7-hydroxycoumarin) being
regarded as the structural and biogenetic parent of the more highly oxygenated
coumarins. Prenylation at carbon and oxygen is common in a large number of
coumarins. The prenyl groups found in coumarins exhibit the greatest number of
biogenetic modifications including cyclisation to dihydropyrans, pyrans,
dihydrofurans and furans. In the Type of Compound Index the very numerous
coumarins are subdivided into classes of manageable size according to their
oxygen substitution pattern (VI0100–VI7500), with separate sections for natural
products having additional rings; furo-1-benzopyrans (VI0050) and pyrano-1-
benzopyrans (VI0070).
O
4 5 4
5 3 3
6 6
7 7 1 2
8 1 2 8
O O
Chroman Chromanone
3,4-Dihydro- 2,3-Dihydro-
2H-1-benzopyran, 4H-1-benzopyran-
9CI 4-one, 9CI
O
5 4 5 4
6 3 6 3
7 12 7 12
8 8
O O
Chromone 2-Chromene
4H-1-Benzopyran-4-one, β-Chromene
9CI 4H-1-Benzopyran, 9CI
5 4 5 4
6 3 6 3
7 1 2 7
8 8 1 2
O O O
3-Chromene Coumarin
α-Chromene 2H-1-Benzopyran-2-one,
2H-1-Benzopyran, 9CI 9CI
Ellis, G.P. (ed.) (1977) Chromenes, Chromones and Chromanones, Wiley, New York.
Gill, M. (1993) in The Chemistry of Natural Products, 2nd edn (ed. R.H. Thomson),
Blackie, Glasgow, pp. 65.
Livingstone, R. (1977) in Rodd’s Chemistry of Carbon Compounds, Vol. IVE, Suppl.
1990.
Murray, R.D.H. et al. (1982) The Natural Coumarins, Wiley, Chichester.
Murray, R.D.H. (1991) Prog. Chem. Org. Nat. Prod., 58, 83.
Murray, R.D.H. (1997) Prog. Chem. Org. Nat. Prod., 72, 1.
2-Benzopyrans (VI9600)
Compared to the 1-benzopyrans, the 2-benzopyrans are less common. They are
normally of polyketide origin. The isochromene nucleus is found in fungal
49
metabolites such as Citrinin. Isocoumarins (VI9700) are the largest class of 2-
benzopyran derivatives.
6 5 4 4
3 5 3
6
7 8 2 7 2
1 O 8 1 O
Isochromene O
Isobenzopyran Isocoumarin
1H-2-Benzopyran, 9CI 1H-2-Benzopyran-1-one, 9CI
50
Flavonoids (VK)
The flavonoids are a large group of natural products which are widespread in
higher plants but also found in some lower plants including algae. The
anthocyanidins are responsible for flower colour in the majority of angiosperms,
but colourless flavonoids are also widespread and abundant. A variety of
biological functions is fulfilled by various members of the series, but many
metabolic and extracellular roles doubtless remain to be discovered.
Flavonoids fall into two major categories according to whether the central
heterocyclic ring is unsaturated or not. When unsaturation is present, as in
anthocyanins, flavones and flavonols, the molecule is planar (occasionally
distorted, e.g. by the substitution of the 2′-hydroxyl group in a 3-O-methyl
flavonol). Saturated flavonoids (flavanones, flavans) have one or more chiral
centres. Optical activity may also be present in flavonoids due to the presence of
glycosidic substituents.
Flavonoids can be classified according to their biosynthetic origin. Some
flavonoid types are both intermediates in biosynthesis as well as end-products,
which can accumulate in plant tissues. These include chalcones (the first formed
C15 structure derived from malonyl coenzyme A and p-coumaryl coenzyme A),
flavanones, flavanon-3-ols and flavan-3,4-diols. Other classes are only known as
end-products of biosynthesis, e.g. anthocyanins, flavones and flavonols. Two
further classes of flavonoid are those in which the 2-phenyl sidechain of
flavonoid isomerises to the 3-position (giving rise to isoflavones and related
isoflavonoids) and then to the 4-position (giving rise to the neoflavonoids).
Isoflavone
c
a d
chalcone flavanone flavone
a e
d
b
aurone flavanon-3-ol flavonol
b
flavan
51
such OH groups may carry a sugar substitution. Acylated O-glycosides are
known, where aromatic or aliphatic acids are linked through the 6-hydroxyl of a
glucose moiety. A special group of mainly flavone-based C-glycosides occurs in
plants. Sulfated conjugates are common in the flavone and flavonol series,
where the sulfation may be on a phenolic hydroxyl and/or on an aliphatic
hydroxyl of a glycoside moiety.
A fairly considerable number of C-glycosylated flavonoids occur naturally.
These are readily distinguished from O-glycosyl derivatives by their resistance
to acid hydrolysis. They commonly have one or two sugar residues directly
linked by a carbon-carbon bond at the C-1 of the sugar to the 6- or 8-position of
the flavone nucleus. Thus, the flavone Apigenin can occur with a glucose at C-6
and C-8 (Isovitexin) or at C-8 (Vitexin) or at both C-6 and C-8 (Vicenin 2).
Other apigenin C-glycosides are known where the carbon linked sugar is
arabinose, galactose or xylose or two of these monosaccharides.
C-Glycosides of flavones commonly occur both as such and with further
sugars O-glycosidically linked. These other glycosides readily lose their O-
linked sugar(s) on acid hydrolysis. Such O-glycosidic residues may be attached
either to a hydroxyl of the C-sugar or directly to one of the free phenolic
groups. Acylated C-glycosides have been described, e.g. the 2′′-p-coumarate of
Vitexin. Many flavone C-glycosides are known and they are widely distributed
throughout the plant kingdom. By contrast, C-glycosides of other classes of
flavonoid (e.g. flavonols, flavanones, isoflavones) are of rare occurrence.
In general there are two parallel systems of nomenclature, one based on trivial
names such as flavan and chalcone as the parent structure and the other based
on systematic chemical names, such as 3,4-Dihydro-2H-1-benzopyran (CA) for
flavan. The latter becomes cumbersome and easy to get wrong in cases of
polysubstitution. There are also two systems of ordering the substituents around
the flavan nucleus: one in which the A-and B-ring substituents precede C-ring
substituents (e.g. 3,5,7, 3′,4′-pentahydroxyflavone); and one in which the
substituents are ordered numerically (e.g. 3,3′, 4′, 5,7-pentahydroxyflavone).
There are additionally two conventions for drawing flavonoid formulae, with the
heterocyclic oxygen at the top and with the heterocyclic oxygen at the bottom.
In this Dictionary, the semisystematic flavan-type nomenclature is given
precedence, substituents are ordered numerically and structures are drawn with
52
the oxygen heterocyclic atom at the bottom. In the Type of Compound Index,
most of the main classes of flavonoid are subdivided according to the number of
oxygen substituents.
4
6 5 3
A
7 8 2
O 1′ 2′ 3′
C
6′ 5′ 4′
Flavan
3,4-Dihydro-2-phenyl-2H-1-benzopyran
(S)-form shown
Anthocyanidins (VK0010–VK0070)
Anthocyanidins are intensely coloured plant pigments found throughout
vascular plants (they are replaced by purple betalain (alkaloidal) pigments in
one order of higher plants, the Centrospermae or Caryophyllales). The flavylium
chromophore in e.g. Cyanidin is cationic, being associated in vivo with organic
acid anions. The sugar-free anthocyanidin aglycones are relatively few and vary
according to the number and position of hydroxy and methoxy substituents.
Structural complexity is associated with the sugar substituents that are present in
the water-soluble anthocyanins. The anthocyanins range from simple structures
such as cyanidin 3-glucoside (Chrysanthemin) to Ternatin A1, a delphinidin
derivative which is substituted by seven glucose, four p-coumaric acid and one
malonic acid moiety. Some third of all the known anthocyanins have malonic
acid (or other aliphatic dicarboxylic acid) residues linked through sugar and are
zwitterionic in their properties.
6 5 4
3
7 8 2
O 1′ 2′ 3′
+ C
6′ 5′ 4′
Flavylium (2-phenylbenzopyrylium)
53
The flavan-3-ols (or catechins) make up by far the largest class of monomeric
flavans. Two substances with the 3,3′,4′, 5,7-pentahydroxy substitution pattern,
namely Catechin and Epicatechin, are extremely widespread. Most flavan-3-
ols, such as Catechin, are of the 2R, 3S configuration. Those with the 2R, 3R
configuration are prefixed with ‘epi’, e.g. Epicatechin. Those with a 2S
configuration are distinguished by the enantio (ent-) prefix.
4 OH
6 5 3
A
7 8 2
O 1′ 2′ 3′
C
6′ 5′ 4′
Flavan-3-ol
3,4-Dihydro-2-phenyl-2H-1-benzopyran-3-ol
(2R,3R)-form shown
Flavan-3,4-diol
3,4-Dihydro-2-phenyl-2H-1-benzopyran-3,4-diol
Proanthocyanidins (VK1500)
Proanthocyanidin is the preferred name for condensed tannins (or flavolans), a
series of flavan-3-ol oligomers which are usually based on a C-C link from the
8-position of one flavan unit to the 4-position of a second unit. As with the
monomeric leucoanthocyanidins, they produce coloured anthocyanidins on
heating with mineral acid, but they have the additional property of binding to
protein. The best known proanthocyanidins are procyanidins, based on catechin
and/or epicatechin units, and oligomers up to the hexamer have now been found
in plants.
The interflavonoid linkage in proanthocyanidins is indicated in the same way
as for polysaccharides, the bond and its direction being contained in parentheses
(4→). The configuration of the interflavonoid bond at C-4 is indicated by the
IUPAC αβ nomenclature within the above parentheses. Thus two common
procyanidin dimers are described as Epicatechin-(4β→8)-catechin and ent-
Epicatechin-(4α→8)-epicatechin respectively. A considerable number of
doubly linked proanthocyanidins are known, where there is a second linkage
through C-2 to O-7. The naming of such compounds can be accommodated in
the same general way, e.g. one such compound is Epicatechin-(2β→7,4α→8)-
epicatechin. Many oligomeric proanthocyanidins with molecular sizes greater
than the hexamer, have been isolated from plants but their stereochemistries
have yet to be determined.
54
which two apigenin units are linked by a carbon-carbon bond from the 8-
position of one unit to the 3′′′ of the other. A range of O-methyl ethers of this
basic structure occur naturally. Biapigenins with other C-C linkages have been
discovered, where the linkage is 3′-3′′′, 3-8′′, 3-3′′′, 6-8′′, 8-8′′, 6-6′′, or 6-5′′′.
Linkage through a C-O-C bond may also occur, as in Hinokiflavone, where the
two apigenin units are linked at the 6 and 4′′′ positions.
Mixed biflavonoids are also possible, e.g. flavone-flavanone dimers, as well
as compounds based on two flavanone units (e.g. Rhusflavanone). The first
triflavonoid has been reported recently, based on three units of Luteolin
(3′,4′,5,7-tetrahydroxyflavone). Biflavonoids have a distinctive distribution
pattern. There are major occurrences in gymnosperms, mosses and ferns and a
more limited presence in some 15 angiosperm families.
Isoflavonoids (VK3000–VK3100)
Isoflavonoids are based on the 3-phenylchroman skeleton that is biogenetically
derived by an aryl migration from a flavanone precursor. They have a very
limited distribution in the plant kingdom and are almost entirely restricted to the
subfamily Papilionoideae of the Leguminosae. They are found very occasionally
in about 18 other angiosperm families and there are isolated occurrences in
mosses and gymnosperms. Another striking feature about the isoflavonoids is
their major presence in lipophilic plant extracts in the free state and the relative
rarity of glycosidic derivatives.
Some isoflavonoid isolations reported from microorganisms are almost
certainly spurious, and associated with contamination from the culture medium.
The largest class of isoflavonoids are the isoflavones (VK3000–VK3070).
There are simple structures such as Genistein (4′,5,7-trihydroxyisoflavone) but
also a wealth of prenylated derivatives. The prenyl sidechains may ring-close on
adjacent hydroxyl groups, giving rise to tetracyclic and pentacyclic compounds.
The related isoflavanones (VK3100), in which the 2,3-bond is reduced, are
much rarer than the isoflavones.
O 2′ 3′ 4′
1′ 5′
4 6′
6 5 3
7 8 2
O
Isoflavone
3-Phenyl-4H-1-benzopyran-4-one
55
OMe
O 2 3 OMe
1
4
11 12
10 12a
98 6a 5
7 6 O
O O
Rotenone
1,2,12,12a-Tetrahydro-8,9-dimethoxy-2-(1-methylethenyl)[1]
benzopyrano[3,4-b]furo[2,3-h][1]benzopyran-6(6aH)-one, 9CI
The numbering system most used by natural products scientists for Rotenone
is shown but other schemes have been used and it must be noted that the CA
scheme differs. Various numbering schemes have also been used for the cyclised
prenyl side-chain in Rotenone and similar compounds.
Pterocarpans (VK3400–VK3550)
Pterocarpans contain a tetracyclic ring system derived from the basic isoflavone
skeleton by an ether linkage between the 4- and 2′-positions. The systematic
numbering is distinctive for this particular carbon skeleton. The majority of
natural pterocarpans have been obtained from phytoalexin studies, so that in
general they possess antifungal activity. They are conveniently subdivided into
simple pterocarpan flavonoids, 6a-hydroxypterocarpan flavonoids and
pterocarpene flavonoids, in which unsaturation is introduced at the 6a–11a
position. The best known structure is Pisatin, a 6a-hydroxypterocarpan which is
the phytoalexin of the pea plant. Many isoprenylated pterocarpans have been
described and these substances constitute the second largest group of
isoflavonoids after the isoflavones. The commonly used numbering system
corresponds with the CA scheme.
10 9
O 8
11 7
11a
2 1 6a
34 5 6
O
Pterocarpan
6a,11a-Dihydro-6H-benzofuro[3,2-c][1]benzopyran, 9CI
Although pterocarpans have two chiral centres, only R,R and S,S
configurations are sterically possible. Most pterocarpan phytoalexins that have
been isolated are laevorotatory and have the 6aR, 11aR absolute configuration; a
few are dextrorotatory and can be assigned to the 6aS, 11aS series.
Isoflavans (VK3600–VK3700)
Isoflavans are another class of isoflavonoid which have been mainly isolated as
phytoalexins after fungal inoculation of plant tissues. They are also metabolites
of dietary isoflavones. Equol (4′,7-dihydroxyisoflavan) which has been isolated
from the urine of mammals, has estrogenic activity. The numbering system of
isoflavans is the same as that of the isoflavones.
3′ 4′
2′
1′
6′ 5′
5 4
6 3
7 2
8
O
Isoflavan
3,4-Dihydro-3-phenyl-2H-1-benzopyran
56
Coumestan flavonoids (VK3750)
One final group of isoflavonoids, numerically important in terms of numbers of
structures, are the coumestans. Like the isoflavans and many isoflavones, they
exhibit weak estrogenic activity in mammals. The simplest structure is
Coumestrol (7,9-dihydroxycoumestan) but a variety of prenylated derivatives
have also been characterised. The numbering system used is the same as in the
pterocarpan series and coincides with the CA systematic numbering.
O11 10
9
11a 8
7
2 1 6a
34 6
5
O O
Coumestan
6H-1-Benzofuro[3,2-c][1]benzopyran-6-one,9CI
Neoflavonoids (VK4000)
This term refers to a small group of C15 naturally occurring substances
structurally and biogenetically related to the flavonoids and isoflavonoids. They
have a limited distribution, occurring with isoflavonoids in the subfamily
Papilionoideae of the Leguminosae. Other families where they have been
encountered are the Guttiferae, Rubiaceae, Passifloraceae, Compositae and
Polypodiaceae.
There are three main subdivisions of structures: the 4-arylcoumarins, the
dalbergiones and the dalbergiquinols. Representative structures, all isolated from
Dalbergia species, are the ring-closed Dalbergin and the two related ring-
opened compounds, 4-Methoxydalbergione and Obtusaquinol. Prenylated
derivatives of the 4-arylcoumarins have been characterised in the Guttiferae.
57
O O
6 5
4
3 6 5
4
3
OH
A
7 8 2 7 8 2
O 1′ 2′ 3′ O 1′ 2′ 3′
C C
6′ 5′ 4′ 6′ 5′ 4′
Flavone Flavonol
2-Phenyl-4H-1-benzopyran-4-one 3-Hydroxy-2-phenyl-4H-1-benzopyran-4-one
3′ 2′ 1′
α
A
4′ 6′
5′ β 1 2 3
C
6 5 4
Chalcone
1,3-Diphenyl-2-propen-1-one, 9CI
Note that the numbering of the A ring is the same in both systems of
nomenclature, but the C ring is unprimed in the semitrivial chalcone system and
carries a double prime if systematic numbering is used (the α- and β-positions
becoming 2 and 3 respectively). The majority of chalcones have
hydroxy/methoxy substituents at the 2′,4,4′,6′-positions, and a significant
number of prenylated derivatives are known.
In dihydrochalcones, the double bond in the α-β-position is reduced and the
compounds are colourless. The numbering system is the same as in the chalcone
series. They are less common than chalcones and occur variously in higher
plants, ferns and liverworts.
Aurone
2-(Phenylmethylene)-3(2H)-benzofuranone, 9CI
58
Flavanones (VK6300–VK6380)
Flavanones are 2,3-dihydro-2-phenyl-4H-1-benzopyran-4-ones. The simplest
known natural flavanone is the 7-hydroxy derivative, while the commonest is
4′,5,7-trihydroxyflavanone (Naringenin). Flavanones are isomeric with
chalcones and arise biosynthetically from them by a reaction catalysed by an
isomerase. They have a centre of chirality at C-2 and usually occur in optically
active form with the 2S-configuration. They commonly occur as glycosides. A
variety of more complex derivatives with methyl and/or prenyl substituents has
been described. Flavanones have a wide occurrence in plants.
O
4
6 5 3
A
7 8 2
O 1′ 2′ 3′
C
6′ 5′ 4′
Flavanone
2,3-Dihydro-2-phenyl-4H-1-benzopyran-4-one, 9CI
(S)-form shown
Dihydroflavonols (VK6410–VK6470)
Dihydroflavonols can be described as 3-hydroxyflavanones or as flavanon-3-ols.
They are formed biosynthetically by oxidation at C-3 of flavanones, without
inversion at C-2, and are the immediate precursors by a further oxidation of the
flavonols. Dihydroflavonols have two chiral centres at C-2 and C-3; most
naturally occurring compounds possess the (2R,3R) stereochemistry.
Dihydroflavonols such as Dihydroquercetin have a wide occurrence in nature
being present in the free state in woody plant tissues. They also occur in
glycosidic combination in other plant parts.
O
4
OH
6 5 3
A
7 8 2
O 1′ 2′ 3′
C
6′ 5′ 4′
Dihydroflavonol
2,3-Dihydro-3-hydroxy-2-phenyl-4H-1-benzopyran-4-one, 9CI
(2R,3R)-form shown
59
Tannins (VM)
Plant polyphenols (vegetable tannins) are secondary metabolites widely
distributed in the plant kingdom. They are based upon two broad structural
themes:
(a) Condensed proanthocyanidins in which the fundamental structural unit is
the phenolic flavan-3-ol (catechin) nucleus.
(b) Galloyl and hexahydroxydiphenoyl esters and their derivatives.
These metabolites are almost invariably found as multiple esters of 3,4,5-
trihydroxybenzoic (gallic) acid with D-glucose and a great many can be envisaged
as derived from the key biosynthetic intermediate β-1,2,3,4,6-pentagalloyl-D-
glucose. Derivatives of hexahydroxydiphenic acid are assumed to be formed by
oxidative coupling of vicinal galloyl ester groups in a galloyl-D-glucose ester.
OH OH
HO OH O HO OH O
O O
–2H
O O O O
HO OH HO OH
OH OH
60
Pathway (B) Oxidative coupling, Pathway (D) Oxidative
Dehydrogenation, 4-6 and 3-3; coupling, ring opening
Oligomerisation by C-O coupling -open chain derivatives
ellagitannins ellagitannins
–[H]n
–[H]n
OG
6
4
GO O
G = Galloyl ester 2 OG
GO 1
3
OG
β-1,2,3,4,6-Pentagalloyl-D-glucose
D-glucopyranose4C1 conformation
COOH
GO 6
OG OG HO OH
1
O OH
4 3 2
OG OG
β-1,2,3,4,6-Pentagalloyl-D-glucose
D-glucopyranose1C4 conformation
β-1,2,3,4,6-pentagalloyl-D-glucose
–[H] intramolecular
C-C coupling
intramolecular
–[H]
C-O coupling
61
OH OH
HO OH HO OH OH
HO
OH OH
HO HO HO
OC OC CO
HO HO O
CO CO
O OH
(R)-Hexahydroxydiphenoyl (S)-Hexahydroxydiphenoyl OH
VM 6100 VM 6100 HO
OH HO O
HO OH
O
OH OH CO
HO
HO OH
OC HO
HO CO OC OH OH
Gallagyl
VM 7600
Flavogallonoyl
VM 7000, VM 7050
O
O
O O
L-Ascorbic acid OH O O
O OH
O O O HO HO OH OH
CO
O OC OH
HO OH CO
OH
OH O O
O O O O
O O
O HO OH O
O O
Dehydrohexahydroxy-
Elaeocarpusinoyl diphenoyl HO
VM 6300 VM 6200
O OH
OH
O
HO
OH
O O
O OC O OC
CO OH CO OH
HOOC OH HOOC OH
H
O O
HO HO
O O
Chebuloyl Dehydrochebuloyl
(VM 6500) (VM 6400)
OH
OH
O OH
CO
OC
O CO OH
OC
CO OH
OH
OH HOOC OH
H
O O
O HO
Brevifoloyl O
(VM 6600) Trilloyl
Figure 14. Ester derivatives of hexahydroxydiphenic acid in which one aromatic ring
has undergone hydrolytic cleavage.
62
OC HO OH HO OH
OH O CO HO OH
HO
HO OH O CO OC
OH
CO HO OH
(S)-Sanguisorboyl
Dehydrodigalloyl VM 6900
VM 6700
OC
HO OH HO OH
HO O O OH
OH HO OH HO
HO OH
CO OC OC OH HO OH
OH CO OC
Valoneoyl
VM 6800, VM 6820, VM 6850 (R)-Tergalloyl
VM 7300, VM 7350
OH OH
OC OH OC OH
OH OH
O O
HO OH OH OH HO OH
HO OH CO O OH
CO OC OH CO OC
(R)-Macaranoyl HO OH
VM 7200 (R)-Euphorbinoyl
VM 7500
Figure 15. Principal ester groups formed by intermolecular C-O oxidative coupling of
galloyl and hexahydroxydiphenoyl esters.
63
Lignans (VO)
The lignans are a group of plant phenols whose structures are determined by the
union of cinnamic monomers or their biogenetic equivalents. The lignan dimers
linked at β-positions in the side chain were first defined in 1936 by R.D.
Haworth and these are discussed first. This group of about 900 compounds has
since been extended to include some 500 neolignans, which are dimerised in
other ways, and higher oligomers of cinnamyl alcohols. Higher lignoid
polymers occur in wood as lignin and are degraded during papermaking
processes.
9′
7
8 8′
7 9 – MeO 7′
COO
3 2 1 9
8 HO
45 6
OMe
Cinnamate monomer
OH
Dihydroguaiaretic acid
O O
O 7′ 7 O
8′ 8
9′ 9
O
O
Hinokinin
CH2OH CH2OH
(RO) +
(RO)
–H
HO –e O
–
CH2OH CH2OH
(RO) – (RO)
O O –
64
not optically active. However, it is possible that in lignin internal compensation
occurs between the many chiral centres.
Dihydroguaiaretic acid and Hinokinin are representatives of major classes
which are sometimes loosely referred to as ‘acyclic’; others may be defined as
either carbocyclic or oxy-heterocyclic derivatives of these two types of parent.
The lignans are classified in the Type of Compound Index essentially
according to Figure 16 which illustrates the principal lignan types including the
unusual 7′,8-structure found in Magnosalicin.
The systematic nomenclature of such a structurally diverse, though
biogenetically related, group of compounds is of limited utility and easily
disguises structural similarities. The CA names of most lignans are given as
synonyms within the entries, but the entry name is usually either a trivial name
or a semisystematic name using the system originally proposed by Freudenberg
and Weinges which has been extended by Moss and now accepted by IUPAC.
This names lignans according to a ‘lign-’ scheme which more accurately reflects
their biosynthetic origin. The cinnamyl unit of higher priority is numbered from
1–9 while that of lower priority bears primed numbers.
For Dihydroguaiaretic acid the two cinnamyl residues are equivalent so that
no distinction need be made between them and the semi-systematic name is:
4,4′-dihydroxy-3,3′-dimethoxylignan. Note that under the CA system this
compound would be named as a phenol. In Hinokinin, the sequence rules
require that priority be given to the lactone carbonyl (C9) and this leads to the
lignan name 3,3′,4,4′-bis (methylenedioxy)lignan-9,9′-olide.
Natural lignans are optically active although a few meso-compounds are
known. Important physiological properties may be associated with a particular
absolute configuration, as for example with the antitumour agent
8 O
9
7
O
9.9′-Lignanolide
9
Furanoid lignans
9′ 8
9 9′ 8′ 7 9 7′
7′
8 8′ O 8′ 7′
8 8′
7 7′ 7 9′
O O
O
7,8′-Epoxylignans 7,9′-Epoxylignan
7,7′-Epoxylignan (VO0300)
(VO0280)
(VO0350)
9′ 7 2 8′ 7′
7′ 2′ 9′
9′
8′ 8 8′
7′ 9
2 8
O 7
9
65
Podophyllotoxin. The application of the Cahn-Ingold-Prelog sequence rules to
lignans needs to be done with care as apparent inversions of configuration
between closely related compounds owing to different substitution patterns are
common.
Inherent dissymmetry is shown by some semisynthetic derivatives of
Podophyllotoxin and in the dibenzocyclooctadienes. The absolute configuration
of the latter major group is securely based on the X-ray analysis of Gomisin D.
This is an atropoisomer with the S-configuration.
Neolignans (VO1500)
These compounds are also dimers of cinnamyl units but their structures are
obtained by coupling of mesomeric radicals other than the β-β link typical of
the lignans.
As the range of lignoids and their plant sources has widened so the distinction
between lignans and neolignans has become less significant. Thus the neo-
lignans were long – identified as more typical of plants of the family Lauraceae,
but in recent years they have been isolated from the Piperacae, Magnoliaceae,
Phytolaccacae, Rutaceae, Pinaceae and Berberidaceae amongst others. Further,
lignans and neolignans frequently occur in plants of the same family, for
example in Piper wightii. Significant developments were the isolation of the
classical lignan Megaceratonic acid from a non-vascular plant – the hornwort
Megaceros flagellaris, and of the pyranonyl hybrid Scapaniapyrone and its
analogue from the liverworts Scapania undulata and Jamesoniella autumnalis
respectively: at present only 5% of liverworts have been investigated.
The system of nomenclature illustrated above extends logically to include
both neolignans and oligomeric lignoids since all are produced by the union of
mesomeric cinnamate radicals. The structures (Figure 16) show the
permutations of radical couplings at 8,8′-positions in lignans and some of the
more varied combinations based on C-C and C-O linkages in neolignans are
shown in Figure 17.
9 9′
3 3′ 8′
3 1
Ar Ar 8′
Ar Ar
8′
O
1
8 3
4′
O Ar
8′ 4
8′
Ar Ar
Ar
3′ O
O
4 8′
3
8′ 7′ 1
1
O Ar O
Ar
66
former group are the lappaols including Lappaol A, with three linked cinnamate
residues. Manassantin B including four such residues is typical of the
dilignans.
Note the variation in bond type between the classical C – C bonding in
Dunnianol and that in Salvianolic acid, which is regarded as a member of this
group although on hydrolysis of the enol ether and ester linkages the lignoid
character will be lost altogether. A similar distinction needs to be drawn
between the dilignans Arctigenin E and Buddlenol E, where four residues may
be stably linked through C – C bonds or through a combination of C – C and
ether linkages, and Cannabisin G. The latter falls within the definition of a
dilignan yet on hydrolysis yields the arylethylamine and a ligna-dienoic acid.
Hybrid lignans
Here the lignoid includes a structure typical of another class of natural product:
with the increased sensitivity of modern isolation methods the list of both
members and known structural types is expanding. Silybin is a flavonolignan
and in 1968 was the first known hybrid to be described; the coumarinolignan
Cleomiscosin followed in 1979. Pseudotsuganol is illustrative of another type
of flavonolignan.
More recently the stilbenolignan Maackolin and the xantholignan Cadensin
G have been identified. Hybrid terpenes are typified by the C – C linked
Piperitylmagnolol and by the C – O linked Eudesmagnolol. In Chilianthin A
the lignan acid is esterified with an oleanyl triterpene. The hydrolysable
macrocycle Pelliatin is of interest structurally and also because it is a
constituent of a non-vascular plant, the liverwort Pellia epiphylla.
O
OH
HO
OH
HO O
HO
OH
O
OMe
H H
O
HO
OMe
Pseudotsuganol
Ayres, D.C. and Loike, J.D. (1990) Lignans, Chemical, Biological and Clinical
Properties, Cambridge Univ. Press, Cambridge.
Fengel, D. and Wegener, G. (1984) Wood: Chemistry, Ultrastructure and Reactions, de
Gyten, Berlin.
Freudenberg, K. and Weinges, K. (1961) Tetrahedron, 15, 115 (nomenclature).
Gottlieb, O.R. (1978) Prog. Chem. Org. Nat. Prod., 35, 1 (neolignans).
Gottlieb, O.R. and Yoshida, M. (1989) in Natural Products Extraneous to the
Lignocellulosic Cell Wall of Woody Plants (eds J.W. Rowe and C.H. Kirk)
Springer, Berlin (neolignans).
Haworth, R.D. (1936) Ann. Rep. Progr. Chem., 33, 266.
Lin, L-J. and Cordell, G.A. (1984) Chem. Commun., 160 (coumarinolignans).
MacRae, W.D. and Towers, G.H.N. (1984) Phytochemistry, 23, 1207 (activity).
Marston, A. and Hostettman, K. (1991) Nat. Prod. Rep., 8, 392 (sesquilignans,
dilignans).
Sakakibara, A. et al. (1987) Holzforschung, 41, 1.
Ward, R.S. (1993) Nat. Prod. Rep., 10, 1; 1995, 12, 183; 1997, 14, 43 (rev).
67
Polycyclic aromatic natural
products (VQ)
A large proportion of the polycyclic aromatic compounds encountered in nature
are quinonoid.
Quinone and quinonoid compounds are widely distributed and exist in all
living organisms, often playing an important role in redox systems.
The quinones isolated from higher plants are usually relatively simple and
frequently present as glycosides. The microbial quinones, on the other hand, are
often more complex and exhibit greater structural diversity as well as wider
variations in biological activity.
Benzoisochromanquinones (VQ3100)
These are of polyketide origin and are found in fungi.
O O
8 1
7 2
6 3
O
5 4
O O
Naphthoquinone Benzoisochromanquinone
1,4-Napthalenedione, 9CI
12 7
11 9 8
10
68
Turner, W.B. (1971) Fungal Metabolites, Academic Press, London.
Turner, W.B. et al. (1983) Fungal Metabolites II, Academic Press, London.
Weiss, U. et al. (1987) Prog. Chem. Org. Nat. Prod., 52, 1.
7 1
6 2
5 3
4
Indene Indan-1-spirocyclohexane
9,10-Anthraquinone Phenanthrene
9,10-Anthracenedione, 9CI
Anthracyclinones (VQ4300)
The anthracyclines produced by Streptomyces form a group of clinically useful
antitumour agents. They also show potent activity against gram-positive bacteria
but are generally too toxic to be of value. In addition to the 200 or so naturally
occurring metabolites, many semisynthetic anthracyclines have been developed
in the search for improved antitumour activity and lower toxicity. Biosynthesis
69
is from a decaketide precursor formed from nine acetates and one propanoate
unit.
Numbering of anthracycline derivatives is confusing. The IUPAC numbering
system is commonly used. However CA names them systematically as
substituted naphthacenediones and in these cases the numbering depends on the
hierarchy of the attached functional groups.
O COOR O
1 12 10
COR 4
9 8
4 5 6 7 1 12 10
O O O O
O COOR
11 1
2
8
6 4
O O
O O
Dibenzoquinone Dinaphthoquinone
Thomson, R.H. (1971) Naturally Occurring Quinones, 2nd edn. Academic Press,
London.
Thomson, R.H. (1987) Naturally Occurring Quinones III, Chapman & Hall, London.
70
1 2 3 1 2
3 5 4
9 4 8 6 3
8 5 7 4 7 2
7 6 6 5 8 9 1
Andrew, H.F. (1979) in Rodd’s Chemistry of Carbon Compounds, 2nd edn. (ed. S.
Coffey), Elsevier, Amsterdam, Vol. IIIH, 138.
Andrew, H.F. (1988) in Rodd’s Chemistry of Carbon Compounds, Suppl., (ed. M.F.
Ansell), Elsevier, Amsterdam, Vol IIIH, 27.
Cooke, R.G. et al. (1981) Prog. Chem. Org. Nat. Prod., 40, 153.
Turner, W.B. et al. (1983) Fungal Metabolites II, Academic Press, London.
Andrew, H.F. (1988) in Rodd’s Chemistry of Carbon Compounds, Suppl., (ed. M.F.
Ansell), Elsevier, Amsterdam, Vol. IIIH, 71.
Campbell, N. et al. (1979) in Rodd’s Chemistry of Carbon Compounds, 2nd edn. (ed. S.
Coffey), Elsevier, Amsterdam, Vol. IIIH, 211.
71
Terpenoids (VS)
Classification of terpenoids
The immense variety of structural types found in the terpenoids was rationalised
by the isoprene rule of Ruzicka. However, the number of exceptions to the
regular arrangement of isoprene units led to the biogenetic isoprene rule which
encompassed the possibility of rearrangements during biosynthesis. Terpenoids
are thus seen as being formed from linear arrangements of isoprene units
followed by various cyclisations and rearrangements of the carbon skeleton.
They can also be biosynthetically modified by the loss or addition of carbon
atoms. It is useful to classify terpenoids according to the number of isoprene
units from which they are biogenetically derived, even though some carbons
may have been added or lost. (This sometimes causes some uncertainty if it is
believed that more than five carbons have been lost; only a biosynthetic study
can resolve this issue. For example the irones (C15) are derived biosynthetically
from the iridial group (C31)).
The biogenetic isoprene rule implies the involvement of a branched five
carbon unit in the biosynthesis of terpenoids. Isoprene, although a natural
product, is not a precursor of the terpenoids. The biosynthetic origin of this five
carbon unit is well established. The pathway involves mevalonic acid which is
converted into isopentenyl diphosphate, the five-carbon precursor of the
terpenoids. However alternative biosynthetic pathways to isopentenyl
diphosphate are now becoming common.
HO
HOH2C COOH
Isoprene Mevalonic acid
O O
CH2O C O P OH
OH OH
Isopentenyl diphosphate
Nomenclature
The systems used for the nomenclature of terpenoids have evolved over a long
period. For many terpenoid classes more than one name has been proposed for
the carbon skeleton and in a large number of cases several numbering systems
are in use. DNP has used the most accepted numbering system for most skeletal
types. In cases for which no numbering system has been proposed or where
several are in use, preference has been given to the biogenetic system.
72
Many terpenoids are named as formal variants of steroid structures and their
nomenclature and numbering therefore follows on from that of steroids, which
is described more fully in a subsequent section.
Monoterpenoids (VS0100–VS1200)
Monoterpenoids have been isolated from the fragrant oils of many plants and
are important in the perfumery and flavour industries. Monoterpenoids are also
found in many marine organisms, where they are generally halogenated, and as
insect pheromones and defence secretions. The biosynthetic pathways of the
main classes of monoterpenes have been well studied.
7 5 3 1
8 6 4 2
Naylor, S. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 189.
73
Ochtodane monoterpenoids (VS0220)
The ochtodanes are also principally from marine organisms particularly
Ochtodes spp. and presumably arise by cyclisation of myrcene.
2
4
5 3 1
6 8
7
9 10
Myrcene Ochtodane
3-Ethyl-1,1-dimethylcyclohexane, 9CI
Naylor, S. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 189.
9
10 7 10 9
6 6
5 1 8 5 1
4 2 4 2 8
3 3 7
1-Ethyl-1,3-dimethyl- 1-Ethyl-2,4-dimethyl-
cyclohexane cyclohexane
Naylor, S. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 189.
11 7 11
6
6 4 4
5 3 5 3
7
9 2 10 9 2
8 1 O 1 O
8
10
74
Other cyclopentane monoterpenoids (VS0450)
This is a large group containing several carbon skeletons that probably arise
biogenetically by cleavage of bicyclic monoterpenoids.
1 8 1
6 2 9 6 2
5 3 5 3 9
4 4 8
o-Menthane 10
m-Menthane
7
1
6 2
5 3 O
4
8
10 9
p-Menthane, 8CI
1-Methyl-4-(1-methylethyl)- p-Menth-1-en-6-one
cyclohexane, 9CI
75
Cycloheptane monoterpenoid skeletons
Camphane Fenchane
1,7,7-Trimethylbicyclo- 1,3,3-Trimethylbicyclo-
[2.2.1]heptane, 9CI [2.2.1]heptane, 9CI
VS0800 VS0850
10
2
1
7 9 3
8
6
4
5
Pinane
2,6,6-Trimethylbicyclo[3.1.1]heptane, 9CI
VS0900
10 10
3 4
4 2 3 5
5 1 2 1
6 6
7 7
8
9 9 8
Carane Thujane
3,7,7-Trimethylbicyclo- 4-Methyl-1-(1-methylethyl)-
[4.1.0]heptane, 9CI bicyclo[3.1.0]hexane, 9CI
VS0950 VS1000
Pelter, A. et al. (1969) in Rodd’s Chemistry of Carbon Compounds, 2nd edn. Elsevier,
Amsterdam, IIC, 136.
76
Tricyclene
1,7,7-Trimethyltricyclo[2.2.1.02,6]heptane, 9CI
Sesquiterpenoids (VS1300–VS5320)
The sesquiterpenoids are C15 compounds formed by the assembly of three
isoprenoid units. They are found in many living systems but particularly in
higher plants. There is a large number of sesquiterpenoid carbon skeletons,
which all however arise from the common precursor, farnesyl pyrophosphate, by
various modes of cyclisations followed, in many cases, by skeletal
rearrangement.
1
11 9 7 5 3
12 10 8 6 4 2
Farnesane
2,6,10-Trimethyldodecane, 9CI
77
Miscellaneous cyclobutane and cyclopentane sesquiterpenoids
(VS1420, VS1430)
This group contains a variety of miscellaneous cyclobutane and cyclopentane
sesquiterpenoids
15 12 13
12 13
7
11 13 9 12
1 9 1 11
2 6 8 10 2 8 10 15
3 5 7 5 1
4 6 3
14 14 14 4 2
14
7 5
8 6 4
9 1 3
2
10 15
13 11
12
Bisabolane
1-(1,5-Dimethylhexyl)-4-methylcyclohexane, 9CI
78
14
14
7
7 1
1
8 6 2 8 6 2
9 5 3 9 5 3
4 4
10 15 10 15
11 11
12 13 12 13
Sesquicarane Sesquisabinane
3,7-Dimethyl-7-(4-methyl- 1-(1,5-Dimethylhexyl)-
pentyl)bicyclo- 4-methylbicyclo[3.1.0]-
[4.1.0]heptane, 9CI hexane, 9CI
15 13
Elemane
1-Ethyl-1-methyl-2,4-bis-
(1-methylethyl)cyclohexane, 9CI
Germacranes (VS1650–VS1700)
The numbering of the germacrane skeleton poses a problem since there is a
plane of symmetry through carbons 2 and 7. Germacranes are normally drawn
in a conventional way as shown below with H-7 in the α-configuration. Care
should be taken with the small number of germacranes with a double bond at C-
7 as the ring can be numbered in either direction. Germacranes frequently have
double bonds in the 1(10) and 4 positions. There have been proposals to give
different names to the skeletons with (1(10)Z, 4E) (melampolides) and (1(10)E,
4Z) (heliangolides) configurations. However this is confusing and in DNP all
compounds are named as germacranes. A further problem with the represen-
tation of germacranes arises from substituents at carbons drawn as reentrant
angles. Wherever possible germacranes should be drawn without substituents at
reentrant centres as in this Dictionary, and care should be exercised when
reading the literature.
The large germacrane group is divided into simple germacranes, that is those
without a lactone or furan ring (VS1650), 12,6-germacranolides (VS1660),
12,8-germacranolides (VS1670), furanogermacranes, nor- and homo-
germacranes (VS1680), secogermacranes (VS1690) and cyclogermacranes
(VS1700).
2
1
10
9
8
O
8
3
14
7 H 12
4 5 6
11
15 13
Germacrane Furanogermacrane
1,7-Dimethyl-4-(1-methyl- 4,5,6,7,8,9,10,11-Octahydro-
ethyl)cyclodecane, 9CI 3,6,10-trimethylcyclodeca[b]-
furan, 9CI
79
Brown, D.S. et al. (1992) Heterocycles, 34, 807.
Fischer, N.H. et al. (1979) Prog. Chem. Org. Nat. Prod., 38, 47.
Fischer, N.H. (1990) Recent Adv. Phytochem., 24, 161.
Bicyclogermacrane
3,7,11,11-Tetramethylbicyclo[8.1.0]undecane, 9CI
Humulane
1,1,4,8-Tetramethylcycloundecane, 9CI
Gonzaléz, A.G. et al., (1995), Prog. Chem. Org. Nat. Prod., 64, 1
13 9 8
11 10
7 14
1 2 6
3 45
15
Caryophyllane
2,6,10,10-Tetramethylbicyclo[7.2.0]undecane, 9CI
Bicyclohumulane
1,5,8,8-Tetramethylbicyclo[8.1.0]undecane, 9CI
80
Cuparane sesquiterpenoids (VS1750)
Cuparane is formed by cyclisation between carbons 6 and 11 of the bisabolane
skeleton and the numbering system used here takes account of this fact.
Cuparanes are found in liverworts, higher plants and marine organisms.
14 8 9
4 5
15 3 6 10
7 11
2 1
12 13
Cuparane
(Most have an aromatic ring and are named in CA
as substituted benzenes)
Cyclolaurane
1,2-Dimethyl-2-(4-methylcyclohexyl)bicyclo[3.1.0]hexane, 9CI
14 15
12
Herbertane
(Mostly named as substituted benzenes in 9CI)
13
Laurane
(Mostly named as substituted benzenes in 9CI)
81
H H
16
10 O
9 11 1 2
8 6 12 3
7 5
13
15 4
14
Trichothecane, 9CI
Eudesmane Dihydro-β-agarofuran
Decahydro-1,4a-dimethyl- 2,2,5,9-Tetramethyl-
7-(1-methylethyl)- 2H-3,9a-methano-
naphthalene, 9CI 1-benzoxepin, 9CI
10
15
Emmotin skeleton
82
Oppositane sesquiterpenoids (VS2020)
The oppositanes are 8(7 → 6)-abeoeudesmanes and are found in plants and
marine organisms.
14
1 9
2 10
8
3 5 6
4
12
7
15 11
13
Oppositane
Octahydro-3a,7-dimethyl-1-(2-methylpropyl)-1H-indene, 9CI
10
15
Farfugin skeleton
Gorgonane
Decahydro-1,4a-dimethyl-8-(1-methylethyl)naphthalene, 9CI
83
H H
1 9 O
2 10 8
12
3 5 7 11 12
4 6
14
15 13 13
Eremophilane Furanoeremophilane
Decahydro-1,8a-dimethyl- 4,4a,5,6,7,8,8a,9-Octahydro-
7-(1-methylethyl)- 3,4a,5-trimethylnaphtho-
naphthalene, 9CI [2,3-b]furan, 9CI
Pinder, A.R. (1977) Prog. Chem. Org. Nat. Prod., 34, 82.
Chiloscyphane
Octahydro-7,7a-dimethyl-1-(2-methylpropyl)-1H-indene, 9CI
Aristolane
Decahydro-1,1,7,7a-tetramethyl-1H-cyclopropa-
[a]naphthalene, 9CI
Nardosinane
Decahydro-1,8a-dimethyl-8-(1-methylethyl)naphthalene
9 2 13
1 3
8
7 6 4
5
10
11 12
Brasilane
Octahydro-1,6,6-trimethyl-4-(1-methylethyl)-1H-indene
84
Cacalol sesquiterpenoids (VS2180)
The cacalol sesquiterpenoids occur, inter alia, in Cacalia spp. and are
eremophilanes, typically with an aromatic ring B, in which carbon-14 has
further migrated to C-6.
1 9 8
2 10
5
3
4 6 7 11 12
15 14 13
Cacalol skeleton
15
13
Valerane
Decahydro-4a,8a-dimethyl-2-(1-methylethyl)naphthalene
15
14 15
85
14
H H
2 10
3 1 9
4 6 8
5 7
15
H H
11
13 12
Cadinane Muurolane
Decahydro-1,6-dimethyl-
4-(1-methylethyl)naphthalene, 9CI
H H
H H
Bulgarane Amorphane
Calamenene Cadalene
1 15
2 9 8 6
7 1
7 5 2
3 5 6 4 3
4 14
11
12 13
Alliacane Primnatriene
Octahydro-2,2,4-trimethyl-
7-(1-methylethyl)-
1H-indene, 9CI
2 10
1 9
3
5 6 8
7
15 4 11
12 13
Oplopane
1-Ethyloctahydro-4-methyl-7-(1-methylethyl)-1H-indene
86
14
2 10
3 1
9
4 6 8
5 12
15
7 11
13
11
15
12
1 9
2 10 8
3 5 7
4 6
13 14
Drimane
Decahydro-1,1,4a,5,6-pentamethylnaphthalene
11
15
12
1 9
2 10 8
3 5 7
4 6
14
13
Coloratane
Decahydro-1,2,4a,5,6-pentamethylnaphthalene
14
2 10
3 1 9
4 5 8
6 7
15
11
12
13
Xanthane
1-Butyl-2-methyl-5-(1-methylethyl)cycloheptane
87
14
2 10
3 1 9
4 5 8
6 7
15
11
12
13
Carabrane
7-Butyl-1-methyl-4-(1-methylethyl)bicyclo[4.1.0]heptane
2 10
3 1 9
4 5 8
6 7
15
11
12
13
Guaiane
Decahydro-1,4-dimethyl-7-(1-methylethyl)azulene, 9CI
Fischer, N.H. et al. (1979) Prog. Chem. Org. Nat. Prod., 38, 47.
Fischer, N.H. et al. (1990) Recent Adv. Phytochem., 24, 161.
2 10
3 1 9
4 5 8
6 7
15
11
12
13
Pseudoguaiane
Decahydro-4,8a-dimethyl-7-(1-methylethyl)azulene
Fischer, N.H. et al. (1974) Prog. Chem. Org. Nat. Prod., 38, 47.
2 10
3 1 9
4 5 8
6 7
15
11
13 12
Aromadendrane
Decahydro-1,1,4,7-tetramethyl-1H-cycloprop[e]azulene, 9CI
Gijsen, H.J.M. et al. (1995), Prog. Chem. Org. Nat. Prod., 64, 149
88
Cubebane and ivaxillarane sesquiterpenoids (VS2600, VS2620)
The small groups of cubebanes and ivaxillaranes are 1,6- and 8,10-
cycloguaianes respectively.
14 14
2 10 2
3 1 9 1 10 9
3
4 5 8 4 5 8
6 7 6 7
15 15
11 11
12 12
13 13
Cubebane Ivaxillarane
Octahydro-3,7-dimethyl-4-(1-methylethyl)-
1H-cyclopenta[1,3]cyclopropa[1,2]-
benzene, 9CI
14 14 14
2 10 2 2 10 9
3 1 12 9 3 1 10 9 3 1 11 12
11
4 5 4 5 13 128 4 5
13 11 8 13 8
6 7 6 7 6 7
15 15 15
2 10
1 9
3
4 5 6
8
7 12
15 11
13
Valerenane
Octahydro-1,4-dimethyl-7-(2-methylpropyl)-1H-indene
8
5 7
4 6 9
3 2 10
1 11
15 12
13
Africanane
Decahydro-3,3,5,7b-tetramethyl-1H-cycloprop[e]azulene
89
Lippifoliane and himachalane sesquiterpenoids (VS2760,
VS2780)
Many numbering systems have been used for the himachalane skeleton. The
farnesane system has been used here. The small group of lippifolianes are 7,9-
cyclohimachalanes.
14
4 5 7
6 8
3
15 2 1 9
11 10
13
12
Himachalane
Decahydro-2,5,9,9-tetramethyl-1H-benzocycloheptene, 9CI
4 5 7
6 8
3
15 2 1 9
11 10
13
12
Longipinane
2,6,6,9-Tetramethyltricyclo[5.4.0.02,8]undecane, 9CI
12 13
1 11
10
15
2 9
4 3
5 6 8
7
14
Longifolane
Decahydro-4,8,8,9-tetramethyl-1,4-methanoazulene, 9CI
14
8 9 1
7 10 7
6
6 11 3
1 5 15 5
2 3 13
4 12
15
Longibornane
Decahydro-4,5,5,8a-tetramethyl-1,4-methanoazulene, 9CI
90
Pinguisane sesquiterpenoids (VS3000)
The structures of several pinguisanes from liverworts have recently been
revised. Many of the trivial names in this group are confusing. The commonly
used numbering system is shown.
15
14
10
4 3
11 5 9
2
6 8 1
7
12 13
Pinguisane
5-Ethyloctahydro-1,3a,4,7a-tetramethyl-1H-indene
Thapsane Fukinane
Octahydro-1,2,3a,7,7,7a- Octahydro-2,3a,4-trimethyl-
hexamethyl-1H-indene 2-(1-methylethyl)-1H-indene
13 2
1 3
12
11 6 4 9
5 8
15 10
Picrotoxane
Octahydro-1,4,7a-trimethyl-5-(1-methylethyl)-1H-indene
Fischer, N.H. et al. (1979) Prog. Chem. Org. Nat. Prod., 38, 47.
91
15
2 10
3 1 9
4 5 8
6 7 14
12 11
13
Daucane
Decahydro-3a,6-dimethyl-1-(1-methylethyl)azulene
15
2 10
3 1 9
4 5 8
11 6 7
12
13 14
Isodaucane
Decahydro-3a,7-dimethyl-1-(1-methylethyl)azulene
14
7 8
10 12
6 9
11
5 2
1 13
4 3
15
Asteriscane
Decahydro-2,2,4,8-tetramethyl-1H-cyclopentacyclooctene, 9CI
92
Illudane and protoilludane sesquiterpenoids (VS3400, VS3420)
Although historically different numbering systems have been proposed for the
illudane skeleton and the related groups, the biogenetic farnesane numbering is
shown here.
14
8 14
9 10 13 8
7 9 10 13
11 7
5 6 2 11
3 1 12 6 2
3 1 12
4 5
4
15 15
Illudane Protoilludane
Decahydro-2′,2′,4′,6′-tetra- Decahydro-3,6,6,7b-tetra-
methylspiro[cyclopropane- methyl-1H-cyclobut[e]-
1,5′-[5H]indene], 9CI indene, 9CI
8 10 13
5 7 9
4 11
6 2
3 1 12
15
Sterpurane
Decahydro-2a,5,5,7-trimethyl-1H-cyclobut[f]-
indene, 9CI
Illudalane
5-Ethyloctahydro-2,2,4,6-tetramethyl-1H-indene, 9CI
8 7 8
9 10 13 9 10 13
7
5 11 5 6 11
6 2 2
3 1 12 4 1 12
3
4
15 15
Isolactarane Lactarane
14
14
8 7 8
9 10 13 9 10 13
7
11 5 11
6 2 2
3 1 12 6 1 12
3
5
4
4
15 15
Marasmane Merulane
93
Furodysin and furodysinin sesquiterpenoids (VS3550, VS3560)
A farnesane numbering system is used for the furodysin and the rearranged
furodysinin groups from Dysidea spp.
14 14
6 4 2 6 4
7 5 3 1 7 5 3
8 10 15 8 10 15 1
9 11 9 11 2
12 13 12 13
13
12
Botrydial skeleton
Octahydro-1,1,3,3,4,5-hexamethyl-1H-indene
Spirovetivane
6,10-Dimethyl-2-(1-methylethyl)spiro[4.5]decane, 9CI
Marshall, J.A. et al. (1974) Prog. Chem. Org. Nat. Prod., 31, 283.
14
5 6 10 9
15 4 1
3 2 7 8
11
12
13
Acorane
1,8-Dimethyl-4-(1-methylethyl)spiro[4.5]decane, 9CI
Marshall, J.A. et al. (1974) Prog. Chem. Org. Nat. Prod., 31, 283.
94
Chamigrane sesquiterpenoids (VS3800)
The chamigranes are a group of mainly marine natural products, mostly from
Laurencia and Aplysia spp. The numbering system is based on farnesane. There
are also some secochamigranes known (VS 3810).
12
13
10 11 5 4
9 6 3 15
8 7 1 2
14
Chamigrane
1,1,5,9-Tetramethylspiro[5.5]undecane, 9CI
1,2,5,9-Tetramethylspiro- 1,6-Dimethyl-8-(1-methyl-
[5.5]undecane ethyl)spiro[4.5]decane
12 12
Cedrane Isocedrane
Octahydro-3,6,8,8-tetra- Octahydro-3,4,8,8-tetra-
methyl-1H-3a,7-methano- methyl-1H-3a,7-methano-
azulene azulene
10 9 10 9
12 12
Zizaane Prezizaane
Octahydro-3,7,8,8-tetra- Octahydro-3,7,7,8-tetra-
methyl-1H-3a,6-methano- methyl-1H-3a,6-methano-
azulene, 9CI azulene
95
14
15
12 7
6 8
11
5 1
10 9 13
4 2
3
Clovane
Decahydro-1,1,7-trimethyl-3a,7-methano-
3aH-cyclopentacyclooctene, 9CI
13
12
5 4
7 6 15 14
3 9 8
8
9 10 2 10
1 7
11 1 11
2 6
4
3 5
12 15
14 13
Precapnellane Capnellane
Decahydro-1,5,8,8-tetra- Decahydro-3,3,4,7a-tetra-
methyl-1H-cyclopenta- methyl-1H-cyclopenta[a]-
cyclooctene pentalene, 9CI
14
14 1 1
8 7 8 7
2 6 13 15 2 6 13
4 4
15 3 5 3 5
10 9 10 9
11 12 11 12
96
Silphinane, silphiperfoliane and presilphiperfoliane sesquiterpenoids
(VS4450, VS4460, VS4470)
These three groups are related biogenetically. The absolute configurations have
not been established unambiguously.
15
14 15
10 11 9 10 10
15 9 11 9 11
1 14 1
8 7 8 7 14
2 6 2 6 13 1
4 4 8 7
3 5 13 3 5 2 6
4
3 5 13
12 12
12
Silphinane Silphiperfoliane
Presilphiperfoliane
Decahydro-1,4,4,5a-tetra- Decahydro-1,4,5,6a-tetra-
methylcyclopenta[c]- methylcyclopenta[c]-
pentalene pentalene
6 5
15 7 14
8
9 4
10 3 13
11 1 2
12
Isocomane
Decahydro-1,3a,4,6-tetramethylcyclopenta[c]pentalene
12
9
10 8
13 11
1 2 14
7
15 3 6
4 5
Panasinane
Decahydro-2,2,4a,8-tetramethylcyclobut[c]indene, 9CI
14 15
9
2
1 8
3 10 7
5 6
4
13 11
12
Modhephane
Tetrahydro-1,1,3,4-tetramethyl-1H,4H-propanopentalene
97
14 15
13
11 3
10 2
12 4
9 1
8 5
6
7
Quadrane
14 10 12
8 14 10 12
7 9 11 8
7 9 11
1 6
5 13 6
3 1
4 5 13
2 3
2 15 4
15
Campherenane α-Santalane
1,7-Dimethyl-7-(4-methyl- 2,3-Dimethyl-2-(4-methyl-
pentyl)bicyclo[2.2.1]- pentyl)tricyclo[2.2.1.02,6]-
heptane, 9CI heptane
1
15
4 2 12
3
6 7 9
5
11
8 10 13
14
β-Santalane
2,3-Dimethyl-2-(4-methylpentyl)bicyclo[2.2.1]heptane
4 4
5 3 5 3
2 2
14 8 1 14 8 1
6 6
7 15 7 15
13 13
12 12
Sativane Helminthosporane
Octahydro-4,8-dimethyl-7- 1,6,7,8-Tetramethyl-4-
(1-methylethyl)-1,4-methano- (1-methylethyl)bicyclo-
1H-indene [3.2.1]octane
98
14 14
8 8
9 7 9 7
13 10 13 10
1 6 1 6
11 3 5 5
11 3
4 15 4
2 2
12 15 12
Copacamphane Cyclocopacamphane
Octahydro-4,8-dimethyl-7- Octahydro-1,7a-dimethyl-5-
(1-methylethyl)-1,4-methano- (1-methylethyl)-1,2,4-
1H-indene, 9CI metheno-1H-indene, 9CI
11 3 11 3
2 4 2 4
5 13 5 13
12 12
10 1 6 10 1 6
8 7 8 7
9 14 9 14
15 15
Sinularane Cyclosinularane
Octahydro-7a,8-dimethyl-5- Octahydro-1,4-dimethyl-7-
(1-methylethyl)-1,4-methano- (1-methylethyl)-1,2,4-
1H-indene, 9CI metheno-1H-indene, 9CI
2 10
3 1 9
4 6 8
5 7
15
11
12 13
Copaane
1,3-Dimethyl-8-(1-methylethyl)tricyclo[4.4.0.02,7]decane, 9CI
Ishwarane
Decahydro-1,6,6a-trimethyl-1,2a-methano-
2aH-cyclopropa[b]naphthalene, 9CI
2 10 9
1 12
3 8
5 13 11
4 7
6
15
Rotundane
Decahydro-1,4,6-trimethyl-4,7-ethanoazulene, 9CI
99
Thujopsane sesquiterpenoids (VS5040)
Thujopsanes are found in higher plants whilst ent-thujopsanes are found in
liverworts. The biogenetic farnesane numbering system is used.
14
8 5
9 7 4
10 6 3
11
2 15
1
12 13
Thujopsane
Decahydro-2,4a,8,8-tetramethylcyclopropa[d]naphthalene, 9CI
2 1 10 9
3 8
5 6 7
4
11
15 13
12
Bourbonane
Decahydro-3a,6-dimethyl-1-(1-methylethyl)cyclobuta[1,3:3,4]-
dicyclopentene, 9CI
3 2 11 10
4 1 9
5 6 7 8
14 13
Gymnomitrane
Decahydro-3a,4,7,8a-tetramethyl-4,8-methanoazulene, 9CI
100
Diterpenoids (VS5350–VS7340)
The diterpenoids constitute a large group of compounds derived from
geranylgeranyl pyrophosphate. They are found in higher plants, fungi, insects
and marine organisms.
Hanson, J.R. (1971) Prog. Chem. Org. Nat. Prod., 29, 395.
Hanson, J.R. (1972) in Chemistry of Terpenes and Terpenoids, (ed. A.A. Newman)
Academic Press, London, p. 155.
Hanson, J.R. (1998) Nat. Prod. Rep., 15, 93.
Hill, R.A. (1993) in The Chemistry of Natural Products, 2nd edn (ed. R.H. Thomson),
Blackie, Glasgow, pp. 124.
West, C.A. (1981) in Biosynthesis of Isoprenoid Compounds, (eds J. W. Porter et al.)
Wiley, New York, Vol. 1, p. 375.
13 9 5 1
15 11 7 3
16 14 12 10 8 6 4 2
Phytane
2,6,10,14-Tetramethylhexadecane
9 13
1 7 11 15
2 6 8 10 12 14 16
3 5
4
20
Prenylbisabolane
1-Methyl-4-(1,5,9-trimethyldecyl)cyclohexane
16 17 19 20
7 11 15
1 9 13
2 6 8 10 12 14
3 5
4
18
10,15-Cyclophytane
1,1,3-Trimethyl-2-(3,7-dimethylnonyl)cyclohexane
101
12 14 12 14 12 14
13 13 13
11 11 11 15
15 20 15
20 17 H 17 20 17
1 9 16 1 9 16 1 9 16
2 10 8 2 10 8 2 10 8
3 5 7 3 5 7 3 5 7
4 6 4 6 4 6
19 18 19 18 18 19
19 18 19
18
Colensane ent-Clerodane
Octahydro-1,1,2,3a,5- Decahydro-1,2,4a,5-
pentamethyl-4-(3-methyl- tetramethyl-1-(3-methyl-
pentyl)-1H-indene pentyl)naphthalene, 9CI
12 15
11 13 17
20 H 14
1 9
2 10 8
3 5 7
4 6
19 18
Abietane
Tetradecahydro-1,1,4a-trimethyl-7-(1-methylethyl)-
phenanthrene, 9CI
San Feliciano, A. (1993), Planta Med., 59, 485.
102
16 16
15 15
12 12
11 13 17 11 13 17
20 14 20 14
1 9 1 9
2 10 8 2 10 8
3 5 7 3 5 7
4 6 4 6
19
19 18
18
13,16-Cycloabietane
19(4→3)-Abeo-13,16-cyclo-
abietane
15 15 17
17
12 12
11 13 16 11 13 16
20 14 20 14
1 9 1 9
2 10 8 2 10 8
3 5 7 3 5 7
4 6 4 6
19
19 18
18
17(15→16)-Abeoabietane
17(15→16),19(4→3)-
Tetradecahydro-1,1,4a-
Bisabeoabietane
trimethyl-7-propyl-
Tetradecahydro-1,2,4a-
phenanthrene
trimethyl-7-propyl-
phenanthrene
19 18
17
Totarane 19 18
Tetradecahydro-1,1,4a-
trimethyl-8-(1-methylethyl)- Nagilactone skeleton
phenanthrene
Bendall, J.G. et al. (1995) Aust. J. Chem., 48, 883.
19 18 19 18
Pimarane Rosane
7-Ethyltetradecahydro- 7-Ethyltetradecahydro-
Isopimarane
1,1,4a,7-tetramethyl- 1,1,4b,7-tetramethyl-
phenanthrene phenanthrene
103
17 16 16 16
17 17
12 12 12
11 13 15 11 13 11 13
H 9 14 20
15
H 15
14 9 14
1 1 9 1
2 10 20 8 2 10 8 2 10 20 8
3 5 7 3 5 7 3 5 7
4 6 4 6 4 6
19
18 19 19
18 18
Erythroxylane Parguarane Devadarane
7-Ethyltetradecahydro- 5-Ethyltetradecahydro- 8-Ethyltetradecahydro-
1,4b,7,10a-tetramethyl- 1a,5,7b-trimethyl-1H- 3a,8,10a-trimethyl-
phenanthrene cyclopropa[a]phenanthrene cyclopropa[j]phenanthrene
Cassane 19 18
7-Ethyltetradecahydro-
1,1,4a,8-tetramethyl- Vouacapane skeleton
phenanthrene
4
Cleistanthane 19
7-Ethyltetradecahydro- 18
1,4,4a,7-tetramethyl- Isocleistanthane skeleton
phenanthrene
16
12
11 13
20 17 14 12
11 13 16
2
1
10
9
8
15 20 17 14 15 O
1 9
3 5 7 2 10 8
4 6
3 5 7
4 6
19 18
19 18
Isocopalane
Tetradecahydro-1,1,4a,7,8,8a- Spongiane skeleton
hexamethylphenanthrene
104
Podocarpane diterpenoids (VS5980)
Miscellaneous podocarpane derivatives that cannot be easily classified are
collected in this section.
12 12
11 13 17 11 13 17
20 20
9 14 16 9 14 16
1 1
2 10 8 15 2 10 8 15
3 5
H 7 3 5 7
4 6 4 6
19 18
H 19 18
ent-Kaurane Phyllocladane
≡ Kaurane, 9CI ≡ 5α,9α,10β-Kaurane, 9CI
19 18
H
ent-Beyerane
105
15 12 17
17 16
12 11 13
16 11 13 20
20 9 H 14 9 14 15
1 1
2 10 8
3 5
2 10
5
H 8
7 3 7
4 6 4 6
18 19
H 18 19H
ent-Villanovane ent-Atisane
Tetradecahydro-4,4,8,11b- ≡ Atisane, 9CI
tetramethyl-9,11a-methano-
11aH-cyclohepta[a]-
naphthalene, 9CI
12 12
17 17
11 11 13
20 13 16 20 14
16
9 14
15 H 15
1 1 9
2 10 8 2 10 8
5
H
3 7 3 5 7
4 6 4 6
18 19
H 18 19
H
ent-Trachylobane ent-Helvifulvane
≡ Trachylobane, 9CI Tetradecahydro-4,4,7a,9b-
tetramethyl-8,8a-methano-
9aH-cyclopropa[b]-
phenanthrene, 9CI
19 18
Aphidicolane
4,4,17-Trimethyl-9,15-cyclo-C,18-dinor-
14,15-secoandrostane
Gibberellins (VS6200)
The gibberellins, important plant hormones, are based on the ent-gibberellane
skeleton. They are produced by higher plants and the rice plant infecting fungus
Gibberella fujikuroi. They have also been isolated from red and green algae.
The biosynthesis of the gibberellins has been well studied and it is clear that
they are derived from ent-kaurene. In CA they are named as derivatives of the
stereoparent gibbane, which is a C15 skeleton and has a completely different
numbering scheme. Many of the natural gibberellins are C19 norditerpenes.
20 H
1 11 4 5
2 10 9 12 3 4a 4b 6
3 5 8 13 2 10a 9a 7
4 6 14 1 10 11
H 15 16 H 9 8
19 18 7
17
106
Bearder, J.R. et al. (1977) Biochem. Soc. Trans. 5, 569 (rev).
Binks, R. et al. (1969) Phytochemistry, 8, 271 (ms).
Ceccarelli, N. et al. (1983) Phytochemistry 22, 2203 (biosynth).
Crosier, A. et al. (1970) Can. J. Bot., 48, 867 (biochem).
Evans, R. et al. (1970) J. Chem. Soc. C, 2601 (biosynth).
Evans, R. et al. (1975) J. Chem. Soc., Perkin Trans. 1, 1514 (cmr).
Hanson, J.R. (1965) J. Chem. Soc., 5036 (pmr).
Hanson, J.R. (1990) Nat. Prod. Rep., 7, 41 (rev).
Kamiya, Y. et al. (1983) Phytochemistry, 22, 681 (biosynth).
Lang, A. et al. (1970) Annu. Rev. Plant Physiol., 21, 537 (rev).
MacMillan, J. et al. (1970) Aspects Terpenoid Chem. Biochem., Proc. Phytochem. Soc.
Symp., 2nd, 153 (rev).
Mander, L.N. et al. (1988) Nat. Prod. Rep., 6, 541 (synth).
Mander, L.N. et al. (1992) Chem. Rev., 92, 573 (synth).
Phinney, B.O. et al. (1990) Recent Adv. Phytochem., 24, 203 (rev).
Takahashi, N. et al. (1969) Org. Mass Spectrom., 2, 711 (ms).
West, C. (1969) Biochem. J., 114, 3P (rev).
Yamaguchi, I. et al. (1975) J. Chem. Soc., Perkin Trans. 1, 992 (cmr).
Yamane, H. et al. (1977) Phytochemistry, 16, 831 (biosynth).
18
6 4 16
5
7 3 2
8
19 1
15
9 13
10 12 14 17
11
20
Cembrane
1,7,11-Trimethyl-4-(1-methylethyl)cyclotetradecane, 9CI
107
Rearranged cembrane diterpenoids (VS6420)
This group contains assorted, unusual macrocyclic diterpenoids including a
basmane derivative which is formally a (2,12 : 7,11) cyclised cembrane.
18
5 4 16
6 3
19 7 15
8 11 12 2 1 17
9 10 13 14
20
Basmane
20
4 6 7 8 11 9 8 7
3 5 9 12 10 6
20 18
2 14 12 10 13 1 3 5
1 13 11 14 2 4
15 15
16 17 16 17
Eunicellane Abestinane
16
12
11 13
15
9 14
1
2 10 8
3 5 7
4 6
17
18
19 20
Sphaerane
Tetradecahydro-5,8a,10a-trimethyl-1-(1-methylethyl)-
phenanthrene
15
16
14 2 4
13 1 3 5
12 10 9 6
11 7
8
20 19 17
18
Briarane
108
Dolabellane and modified dolabellane diterpenoids (VS6500, VS6510)
Dolabellanes are found in marine organisms and in liverworts. Several
numbering systems have been used in the literature. We have used the one
shown. The modified dolabellane group includes the neodolabellanes in which a
methyl has migrated from C-1 to C-11. A rare 3,9-cyclodolabellane is also
included in this group.
20 20
19 15 19
18 18
17 9 10 17 9 10
8 12 8 11 12
11
13 13
7 1 14 7 1 14
6 3 2 6 3 2
5 15 5
4 4
16 16
Dolabellane Neodolabellane
19
17
16
6 7
8 9
4 10
5 12
14 13 11
3
2 1 20
15
Dolastane
Tetradecahydro-3a,5,8a-trimethyl-1-(1-methylethyl)-
benz[f]azulene, 9CI
17
1 8
9 7
2
3 4 6
5 14
18 16
19 10 13
11 12
20
15
Cyathane
Tetradecahydro-3a,5a,8-trimethyl-1-(1-methylethyl)-
cyclohept[e]indene
Turner W.B. et al. (1983) Fungal Metabolites II, Academic Press, London.
109
20
17 19
18
6 5
4 3
16 7 2
10 1
8 9
14
11
13 12
15
Sphaeroane
Dodecahydro-3a,6,9-trimethyl-3-(1-methylethyl)-
benz[e]azulene, 9CI
17 17
16 15 16 15
13 12 3 13 12
18 H 11 H 11
2 14 18 2 14
3 1 10 4 1 10
4 20 20
5 7 9 5 7 9
6 8 6 8
19 19
Verrucosane Neoverrucosane
Tetradecahydro-3a,5a,7a-trimethyl-
1-(1-methylethyl)cyclopenta-
[a]cyclopropa[g]naphthalene, 9CI
17 17
15 16 15
16
18 13 12 13 12
H 11 H 11
3 2 14 3 2 14
1 10 1 10
4 20 18 4 20
7 9 7 9
5 6 8 5 6 8
19 19
Homoverrucosane Homoneoverrucosane
20
Casbane
3,7,11,15,15-Pentamethylbicyclo[12.1.0]pentadecane, 9CI
Evans, F.J. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 1.
110
Jatrophane and 9,13-cyclojatrophane diterpenoids (VS6610, VS6620)
Jatrophane is the parent skeleton of a group of macrocyclic diterpenoids from
Euphorbia species. Formally it can be derived from casbane by 6,10-cyclisation
and opening of the cyclopropane. The numbering system shown is used fairly
consistently in all the related diterpenoids of this series. Two examples of the
9,13-cyclojatrophane skeleton have been isolated.
20
14 13 19
1 15
16 2
3 4 17 12 11 18
10
5 6
9
7 8
Jatrophane
Evans, F.J. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 1.
20
14 13
1 12 19
15 17 11
16 2
3 4 9 10
5 6 18
7 8
Lathyrane
Tetradecahydro-1,1,3,6,9-pentamethyl-1H-cyclopenta[a]-
cyclopropa[f]cycloundecene
Evans, F.J. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 1.
12 13 15
18 16 17
11
14 15 18 12 13
11
9 14
1 8
10 17
19 2 7 9 8
3 4 1 10
5 6 19 2 7
3 4
5 6
20
20
Rhamnofolane
Daphnane
Evans, F.J. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 1.
111
widely in the Euphorbiaceae and are renowned for their tumour promoting and
irritant activity. The ingenane skeleton is derived by rearrangement of tigliane.
Ingenane esters also have irritant properties.
17
17 16
12 13 15
18
11 15 16 12 13 14
H 14 11
18
9 8 9 8
1 10 1 10
19 2 7 19 2 7
3 4 3 4
6
5 5 6
20 20
Tigliane Ingenane
Tetradecahydro-1,1,3,6,8- Dodecahydro-1,1,4,7,9-
pentamethyl-1H-cyclo- pentamethyl-1H-2,8a-
propa[3.4]benz[1,2-e]- methanocyclopenta[e]-
azulene, 9CI cyclopropa[e]cyclo-
decene, 9CI
Evans, F.J. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 1.
18 11 12 18 11 12
10 13 10 13
16
14 15 14
1 8 1 8 16
9 7 17 9 7
19 2 19 2 15
3 4 6 3 4 6
5 5
20 20 17
Jatropholane Crotofolane
Hexadecahydro-1,1,2,5,7- Tetradecahydro-2,5,10-
pentamethyl-6H-cyclohept- trimethyl-6-(1-methyl-
[1,2-e]indene ethyl)cyclohept[e]-
indene, 9CI
Evans, F.J. et al. (1983) Prog. Chem. Org. Nat. Prod., 44, 1.
16
Fusicoccane
Tetradecahydro-1,4,9a-trimethyl-7-
(1-methylethyl)dicyclopenta[a,d]-
cyclooctene, 9CI
Krasnopolskaya, L.M. (1994), J. Plant Growth Regulation, 13, 39.
Turner, W.B. et al. (1983) Fungal Metabolites II, Academic Press, London.
112
19 19
20 15 20
18 18
9 14 9 14
8 10 8 10
13 13
7 11 12 7 11 12
6 1 6 1
5 17 2 15 5 17 2
4 3 4 3
16 16
Valparane Mulinane
Tetradecahydro-5a,8,10b-trimethyl-3-
(1-methylethyl)cyclohept[e]indene, 9CI
12
3
10 4 5
2 6
9 8 7
1 19
13 16
11 18
14 15 17 20
Spatane
Decahydro-3a,6-dimethyl-1-(1,5-dimethylhexyl)cyclo-
buta[1,2:3,4]dicyclopentene, 9CI
Verticillane
4,8,12,15,15-Pentamethylbicyclo[9.3.1]pentadecane, 9CI
20 16 17 20
Taxane 11(15→1)-Abeotaxane
Tetradecahydro-4,9,12a,13,13-pentamethyl-
6,10-methanobenzocyclodecene, 9CI
113
Kingston, D.G.I et al. (1993) Prog. Chem. Org. Nat. Prod., 61, 1.
Swindell, C.S. (1993) in Studies in Natural Product Chemistry, Vol 12, (ed. Atta-ur-
Rahman), Elsevier, p. 170.
8 9 8 9
6 7 17 10 6 7 17 10
5 20 5 11 20
4 11 4
16 16
15 12 15 12
18 3 1 13 18 3 1 13
2 14 2 14
Trinervitane Kempane
Hexadecahydro- Hexadecahydro-
1,4,8,12-tetramethyl- 2a,7,10,10c-tetramethyl-
1,11-ethanocyclopenta- napth-[2,1,8-cde]-
cycloundecane, 9CI azulene, 9CI
16 17
15
16
2 18 2 18
14 1
17 14
3 15 1 3
20 12 4 20 12 4
11 11
13 5 5
13
6 10 8 6
10 9 8 9
7 7
19 19
Amphilectane Cycloamphilectane
Dodecahydro-1,4,7-trimethyl- Hexadecahydro-1,4,7,7-
3-(2-methylpropyl)-1H- tetramethylpyrene
phenalene, 9CI
16 2 18 17
14 3
15 1 15
20 12 4 18
16 14
17 13 5
11 2 3
1
10 8 6 20 4
9 12 13
7 11 5
10 8 6
19 9 7
Adociane
19
Hexadecahydro-1,2,5,8-
tetramethylpyrene, 9CI Neoamphilectane
114
Xenicane and xeniaphyllane diterpenoids (VS7100, VS7110, VS7150)
Xenicanes and xeniaphyllanes are marine natural products. Various nor-, seco-
and cyclo-xenicanes are listed in a separate section (VS7110). Xeniaphyllanes
are the diterpenoid equivalent of the caryophyllane skeleton. Xenicanes are
cleaved xeniaphyllanes.
16 17
14 15
15 13 14
16
12 13
11 12 20
4 20 2 3
5 4
10 6 18 11 1
17 3 5
7 10 9
2 6
8 8 7
18 1 9
19
19
Xeniaphyllane
Xenicane
2,6,10-Trimethyl-10-
1,2,6-Trimethyl-3-
(4-methylpentyl)-
(1,5-dimethylhexyl)-
bicyclo[7.2.0]undecane
cyclononane
2 3
8 7
1 4 20
9 10
6 5
11 18
12
13 14
17
15
16
Viscidane
1,8-Dimethyl-4-(1,5-dimethylhexyl)spiro[4.5]decane
14 13
17 15 8
20
16
18 9
10 7
6
19 1 5
11 2
3 4
12
Eremane
Decahydro-1,5,5-trimethyl-4-(4-methylpentyl)-
3a,6-ethano-3aH-indene
115
19
1 9
2 10 8
3 5 7 12 14 16
4 6 11 15
13
20 18 17
Prenyleudesmane
Decahydro-7-(1,5-methylhexyl)-1,4a-dimethylnaphthalene
1 9
2 10 8
3 5 19 7 12 14 16
4 6 11 15
13
20 18 17
Prenylgermacrane
4-(1,5-dimethylhexyl)-1,7-dimethylcyclodecane
5 4 3
6 2
7 9 18 1 16
8 10
13 15
11
19 12 14 17
20
Prenybicyclogermacrane
3,7,11-Trimethyl-11-(4-methylpentyl)bicyclo-
[8.1.0]undecane
7
8
6
9 1 5
11 2 4 15 17 20
10 3 13 18
16
12 14 19
Lobane
4-(1,5-dimethylhexyl)-1-ethyl-1-methyl-2-
(1-methylethyl)cyclohexane
18 14
10 9 10 9
2 2
1 1
8 3 8
3 5
4 5 4 7 19
6 7 16 6
11 12 13 14 15
11 16 18
17 15
12 17 20
19 20 13
116
Serrulatane and biflorane diterpenoids (VS7250)
The biflorane skeleton is found in marine organisms, insects and Eremophila
spp. The skeleton is an ‘extended’ cadinane. The serrulatane name is given to
the aromatic analogue. Unfortunately different numbering systems have been
given to serrulatanes and bifloranes.
20 19
1 8 10 2
2 9 7 9 1 3
3 10 6 8 6 4
4 5 7 5
19 20
11 11
12 18 13 12
13 14
14 15
15
16
17 16 17 18
Serrulatane Biflorane
Decahydro-4-(1,5-dimethyl-
hexyl)-1,6-dimethyl-
naphthalene
20
11 13
10 12 14
9 17 15
8 16
7 19
6
18
5
4
2
3 1
Decipiane
Decahydro-1,2,5-trimethyl-1-(4-methylpentyl)-
1H-cyclobuta[de]naphthalene, 9CI
11
13
12
1 9
2 10 8
3 5 7
4 6
14 15
16
17
18
19 20
Sacculatane
Decahydro-1,4a,5,6-tetramethyl-1-(4-methylpentyl)-
naphthalene
117
18 16
12 2
13 11 1 3
14 10 8 6 4
15 7 5
9
19 20 17
Obtusane
1,1,3-Trimethyl-2-[3-(4-methylcyclohexyl)butyl-
cyclohexane
1
2 9 8
3 7
5 6
4
11 12 13
10
17 14
16 15
20
19
Irieol skeleton
Octahydro-1-[(3,3-dimethylcyclohexyl)methyl]-
3a,7-dimethyl-1H-indene
12
Sphenolobane
Decahydro-1-(1,5-dimethylhexyl)-3a,6-dimethylazulene
Sesterterpenoids (VS7400–VS7580)
Sesterterpenoids are a small group of natural products that arise from five
isoprene units. Although sesterterpenoids strictly have 25 carbons, there are
many nor- and alkylated members. Also included here are the C21 acyclic
terpenoids although their biosynthetic relationship with the sesterterpenoids has
not been established with certainty. Sesterterpenoids are found in fungi, higher
plants, insects and marine organisms.
118
Acyclic and noracyclic sesterterpenoids (VS7400)
The acyclic sesterterpenoids arise by a head to tail fusion of isoprene units. The
accepted numbering system is used here. The noracyclic sesterterpenoids
(VS7410) are numbered in a similar way; however, a problem arises with the
symmetry of the C21 compounds as they may be numbered from either end. The
acyclic sesterterpenoids frequently contain furanoid rings.
21 22 23 24 25
1
19 17 15 13 11 9 7 5 3
20 18 16 14 12 10 8 6 4 2
21 20 23 24 25
24 25
7 8 9 11
6 23 10 12
5 1
4 2 14 13 21
3
17 19
15
22 16 18 20
Cericerane
1-(1,5-Dimethylhexyl)-4,8,12-trimethylcyclolotetradecane
12 18
13 14 15 16 17
11 19
25 22
1 9 21 20
2 10 8
3 5 7
4 6
24 23
119
24
24
12 14 16 18 13
12 11 23 22
25 11 13 17 10
15 19 14
22 16 9 1
15 17 8
1 9 7 5 3
21 20 6 4 2
2 10 8
3 5 7 25 18 20
4 6 19
23
24 21
23 16 18 25
15 17 19
21 8 9 10 24
7 14
13
11 12
6
5 2 1
4 3 22
20
21 20 4 3 22
Cheilanthane 20
4,4,8-Trimethyl-D(15),24-
dinor-13,17-secocholane, 9CI Ophiobolane
16 17 19 21
15 20
18
25 8 9
7 10 14 22
13
6 11 12
5 2 1
4 3 23
24
Ophiobolane, CA numbering
20
19
Scalarane
4,4,8,17,17a-Pentamethyl-D-homoandrostane, 9CI
120
Triterpenoids (VS7600–VS9450)
The triterpenoids constitute a large diverse group of natural products derived
from squalene or, in the case of 3β-hydroxytriterpenoids, the 3S-isomer of
squalene 2,3-epoxide. The conformation that all-trans-squalene 2,3-epoxide
adopts when the initial cyclisation takes place determines the stereochemistry of
the ring junctions in the triterpenoid produced. Thus cyclisation of the chair-
boat-chair-boat conformation gives the protostane cation and cyclisation of the
chair-chair-chair-boat conformation leads to the dammarane cation. The initially
formed cation intermediate may undergo a series of 1,2-hydride and methyl
migrations, commonly called backbone rearrangements, to give a variety of
skeletal types.
Squalene
O
Squalene 2,3-epoxide
HO
21 24 27
18 20
18
19 H 26
19 H
30
29 28
121
Connolly, J.D. et al. (1991) Methods Plant Biochem., 7, 331.
Connolly, J.D. et al. (1997) Nat. Prod. Rep., 14, 661.
Goodwin, T.W. (1981) in Biosynthesis of Isoprenoid Compounds, (eds J.W. Porter et al.)
Wiley, New York, Vol. 1, p. 443.
Hill, R.A. (1993) in The Chemistry of Natural Products, 2nd edn (ed. R.H. Thomson),
Blackie, Glasgow, pp. 131.
Mahato, S.B. et al. (1997) Phytochemistry, 44, 1185.
Spencer, T.A. (1994) Acc. Chem. Res., 27, 83.
The main tetracyclic triterpenoid skeletons have the steroid numbering for the
skeleton including the side chain and only the methyl groups will be numbered
in the structures that follow. As a general rule the methyls which migrate during
terpenoid biosynthesis retain their numbering.
CA names most tetracyclic triterpenoids as derivatives of the steroid
stereoparents. This has the disadvantage that some are assigned different names
from those commonly used. The CA names for some common skeletons are
listed below.
25 26 27
24
1 10
28 29 30
Squalane
2,6,10,15,19,23-Hexamethyltetracosane, 9CI
O O H
HO P O P OH2C
OH OH
Presqualene pyrophosphate
122
Botryococcene triterpenoids (VS7620)
The alga Botryococcus braunii produces a series of branched alkylated
isoprenoid hydrocarbons based on botryococcane. The names of individual
compounds indicate the number of carbons, e.g. C34-Botryococcene. Of the
several numbering systems that have been used, the one below is preferred.
Cyclised botryococcenes are also listed in this section.
27
23 24 26
22
1 3
25
28 29 30
C30-Botryococcane
10-Ethyl-2,6.10,13,17,21-hexamethyldocosane
H H
19 30 30
19
H 18 18
H
29 28 28
Protostane Fusidane
18
19 H
30
29 28
Lanostane
123
18
19
H
30
29 28
Cycloartane
18
H H
19 30
29 28
Cucurbitane
Lavie, D. et al. (1971) Prog. Chem. Org. Nat. Prod., 29, 307.
Miro, M (1995), Phytother. Res., 9, 159.
H
H
19 30
18
29 28
Dammarane
29 29 28
28
Euphane Tirucallane
124
Apotirucallane triterpenoids (VS8050)
Further rearrangement of the tirucallane skeleton, the apo-rearrangement,
affords the apotirucallane skeleton. Apotirucallanes are the notional parents of
the tetranortriterpenoids (limonoids) and the quassinoids.
18
19 30
29 28
Apotirucallane
Nortriterpenoids (VS8100–VS8130)
The tetranotriterpenoids (limonoids) are formed by loss of four terminal carbons
of the apotirucallane skeleton. The side chain is typically a β-substituted furan
although other oxidation levels are found to a lesser extent. A series of ring-
cleavages and rearrangements can lead to a wide range of structures. For
example cleavage of rings B and C may be followed by cyclisation to form a
bicyclo[3.3.1]nonanolide system. In the Type of Compound Index, these com-
pounds are presented in three groups – intact tetranortriterpenoids (VS8100),
ring cleaved derivatives (VS8120) and rearranged derivatives (VS8130). The
last group contains the bitter principles of the Cneoraceae, e.g. Cneorin C.
O O
18
O
19 30
8
O
O
30
O
29 28
COOMe
Tetranortriterpenoid A tetranortriterpenoid with
skeleton rings B and D cleaved
18
7
O
MeOOC 9
8
O O
5
30
3
O
A bicyclo[3.3.1]nonanolide tetranortriterpenoid
125
stereoparent; however the numbering system used by Chemical Abstracts differs
from the accepted system; the oxygen atom is numbered.
18 22 18
12 17 12
11 13 20 23 11 13
19 30 14 19 3014
1 9 21 1 9
2 10 8 15 2 10 8 15
3 5 3 5 7
H 16
7
4 6 16 4 6
O
28 28
21
12
11 13
19 2014
1 9
2 10 8 15
3 5 7 17 16
4 6
O
18
Picrasane, CA numbering
Kawada, K. et al. (1989) Org. Prep. Proced. Int., 21, 521 (synth).
Polonsky, J. (1973) Prog. Chem. Org. Nat. Prod., 30, 101; (1985) 47, 221.
H 28 20 22
12 18
11 13 17 19 21 29
25 26 14 16
1 9 15
2 10 8
27
3 5 7
4 6
24 23
Baccharane
D : B-Friedo-18,19-secolupane, 9CI
20
30 21
19
H 22
12 18
11 13 17 28
25 26 14 16
1 9 15
2 10 8
27
3 5 7
4 6
24 23
Lupane, 9CI
(23/24 substituents specified in CA as (4α)23- and
(4β)23- respectively)
126
Oleanane triterpenoids (VS8300)
Formation of a six-membered ring E from the baccharane precursor leads to the
oleanane group. Oleananes form the largest group of triterpenoids and occur
widely in the plant kingdom often as glycosides. The fairly numerous nor-,
seco- and abeooleananes are listed separately (VS8310).
30 29
20
19 21
18 22
12
11 13 17 28
25 26 14 H 16
1 9 15
2 10 8
27
3 5 7
4 6
24 23
Oleanane, 9CI
(23/24 substituents specified in CA as (4α)23- and
(4β)23- respectively; and 29/30 substituents as (20α)29
and (20β)29 respectively)
Agrawal, P.K. et al., (1992), Prog. Nuclear Magn. Reson. Spect., 24, 1.
Mahato, S.B. et al. (1988) Phytochemistry, 27, 3037.
Tschesche, R. et al. (1973) Prog. Chem. Org. Nat. Prod., 30, 461.
27 27
H 28 H 28
25 26 25
26
24 23 24 23
Taraxerane Multiflorane
D -Friedooleanane, 9CI D : C-Friedooleanane, 9CI
30 29 30 29
27 27
H 28 H 28
25 26 25 26
24
24 23 23
Glutinane Friedelane
D : B-Friedooleanane, 9CI D : A-Friedooleanane, 9CI
127
30 29
27
H
28
25 26
24
23
Pachysanane
16-Methyl-D : A-friedo-28-noroleanane, 9CI
29 29
H 28 H 28
25 26 25 26
27 27
24 23 24 23
30
29
27
25 H 28
26
24 23
Bauerane
D : C-Friedoursane, 9CI
29
H H 21
28
25 26 30
27
24 23
Hopane
A′-Neogammacerane, 9CI
128
Neohopane, fernane, adianane and filicane triterpenoids (VS8770,
VS8800, VS8850, VS8870)
Backbone rearrangement of the moretane skeleton leads in turn to the
neohopanes (neomotianes), fernanes, adiananes and filicanes.
29 29
28 26 28
25 26 H 30 25 H 30
27 27
24 23 24 23
Neohopane Fernane
B′ : A′-Neogammacerane, 9CI D : C-Friedo-B′ : A′-
neogammacerane, 9CI
29 29
27 28 27 28
22 22
H 30 H 30
25 26 25 26
24
24 23 23
Adianane Filicane
D : B-Friedo-B′ : A′- D : A-Friedo-B′ : A′-
neogammacerane, 9CI neogammacerane, 9CI
28
25 H 30 25 26
26 H 27
24 23 24 23
Aborinane Stictane
D : C-Friedo-B′ : A′- 21,21-Dimethyl-29,30-
neogammacerane, 9CI dinorgammacerane, 9CI
129
H H 30
28 29
25 26
27
24 23
Gammacerane, 9CI
H H 30
H H 30
28 29 28 29
25 26 25 27
26
27
24 23 24 23
Serratane Onocerane
C(14a)-Homo-27- 8,4-Secogammacerane, 9CI
norgammacerane, 9CI
12 16 20
11 14 18 22
13 15 17 19 21 30
25
1 9 26
2 10 8
3 5 7
4 6
24 23
Polypodane
11 12
19 30 13 15 17 22 24 26
1 9 14 16 20 23 25
2 10 8
H
3 5 7
4 6 18 21 27
29 28
Malabaricane
15-Methyl-D-homo-C,30-dinor-13,17a-secodammarane, 9CI
26
25 24
27 28 30
24 23
Podiodane
130
Iridal group norterpenoids (VS9350)
The iridals are constituents of Iris spp. which serve as the precursors of the
important perfumery chemicals, the irones. The numbering system of iridal is
based on that of squalene. The irones are also included in this section. The
numbering system of the irone skeleton is based on the carotenoids.
28 29 30
13 15 17 21 23
12 20
19 24
26 14 16 18 22
3 5 27
HOH2C 4 11
6 10 OH
2 7 9
OHC 8
1
25
Iridal
13 12
O
7 9
14 1
2 6 8 10
3 5
4
15
Irone
Tetraterpenoids (VS9400)
The tetraterpenes arise by head to head coupling of two
geranylgeranylpyrophosphate molecules.
Spurgeon, S.L. et al. (1981) in Biosynthesis of Isoprenoid Compounds, (eds J.W. Porter
et al.), Wiley, New York, Vol. 2, p. 1.
Carotenoids
These include the hydrocarbons (carotenes) and their oxygenated derivatives
(xanthophylls). Carotenoid nomenclature is based on a stem name, carotene, and
two Greek letters as prefixes to define the two end groups. The numbering
system and end groups are given below.
131
17 18 19 20
ψ,ψ-Carotene
16 17 16 17 16 17
17 18
1 1 1
3 2 6 2 6 2 6
1 5
2 3 5 3 5 3 4 5
16 4 6 4 4
18 18 18
ψ β γ ε
16 16
17 6
16
17 17
1 1 6 1 6
5 18 2 2
2 3 3
3
4 4 5 4 5
18 18
κ φ χ
Carotenoid end-groups
132
Miscellaneous terpenoids (VS9450–VS9910)
Megastigmane norterpenoids (VS9450)
This is a fairly large group of C13 compounds generally thought to be degraded
carotenoids or catabolites of abscisic acid.
11 12
7 9
1
2 6 8 10
3 5
4
13
Megastigmane
2-Butyl-1,1,3-trimethylcyclohexane
Apocarotenoids (VS9700)
Apocarotenoids are carotenoids in which the carbon skeleton has been shortened
by the formal removal of fragments from one or both ends. A locant is used to
indicate that all the molecule beyond the carbon with that locant has been
removed. It is not necessary to give a Greek-letter end group designation if the
apo-locant is greater than 5.
Polyterpenoids (VS9800)
This section includes the higher carotenoids and polyprenols with more than 40
carbons.
Meroterpenoids (VS9900)
Meroterpenoids are of mixed biogenesis containing terpenoid and non-terpenoid
derived fragments. This broad definition could include the vast number of
simple prenylated phenolics but is normally reserved for compounds where the
terpenoid fragment comprises a large part of the molecule. The polyprenylated
quinones and chromanols typified by the ubiquinones and tocopherols are
clearly of mixed biogenesis but the metabolites of Aspergillus ustus such as
Austin could be mistaken for sesterterpenoids. In fact these metabolites have
been shown to be derived from a sesquiterpenoid fragment and an aromatic
polyketide fragment.
O
MeO CH3
MeO
H
O 10
Ubiquinone 10
CH3
HO
H3C
CH3 O
α-Tocopherol
133
O O
AcO
OH
O
O
O O
Austin
O
OH
O O
O O
OH
OH
O
Hulupone
Adhumulone
134
Steroids (VT)
For general information on the biogenesis of steroids, see the preceding
terpenoid section.
The steroid structure is based on four carbocyclic rings arranged as in
cyclopenta[a]phenanthrene, which is normally fully or partially reduced so that
only limited unsaturation, if any, is present. The four steroid rings are labelled,
and their carbon atoms are numbered according to the universal convention
illustrated.
12 17
11 13
C 14 D 16
15
1 9
2 10 8
A B
3 5 7
4 6
The great majority of steroids also have one or two methyl groups present at
the bridgehead positions C-10 and C-13; the methyl carbon atoms are numbered
C-19 and C-18, respectively.
18
19
H H
H
5α-Androstane
The hydrogen atoms at C-8,9, and 14 are generally omitted from formulae if
they have the natural configurations shown here.
Any side-chain at C-17 is assumed to have the 17β-configuration unless
otherwise indicated. This is shown either by using a wedge bond or, where there
is any possibility of uncertainty owing to substitution at C-20, by drawing in the
C-17 α-hydrogen atom.
Configurations of substituents in the side chain were formerly also indicated
by α or β, according to a convention devised by Fieser, whereby the side-chain
is drawn in Fischer projection, with the highest numbered locant at the top.
135
H OH HO H
20α-form 20β-form
The Fieser convention for pregnan-20-ols
OH H
H OH
H H
H H
(E)-5α-Pregn-17(20)-ene (Z)-5α-Pregn-17(20)-ene
5α-Pregn-17(20)E-ene 5α-Pregan-17(20)Z-ene
136
Duax, W.L. and Norton, D.A. (eds) (1975) Atlas of Steroid Structure, Vol. 1; Vol. 2
(1984) Plenum, New York.
Fieser, L.F. and Fieser, M. (1959) Steroids, Reinhold, New York.
Goad, L.F. and Akihisa, T. (1997) Analysis of Sterols, Blackie, London.
Hanson, J.R. (1997) Nat. Prod. Rep., 14, 373.
Hill, R.A., Kirk, D.N., Makin, H.L.J. and Murphy, G.M. (1991) Dictionary of Steroids,
Chapman & Hall, London.
Makin, H.L.J. (ed.) (1984) Biochemistry of Steroid Hormones, 2nd edn, Blackwell,
Oxford
Nair, P.P. and Kritchevsky, D. (eds) (1971) The Bile Acids, Vol. 1; Vol. 2 (1973); Vol. 3
(1976); Setchell, K.D.R., Kritchevsky, D. and Nair, P.P. (eds) (1988) Vol. 4,
Plenum, New York.
Turner, A.B. (1993) in The Chemistry of Natural Products, 2nd edn (ed. R.H.
Thomson), Blackie, Glasgow, pp. 140.
Zeelen, F.J. (1990) Medicinal Chemistry of Steroids, Elsevier, Amsterdam.
Zeelen, F.J. (1994) Nat. Prod. Rep., 11, 607 (synth).
In the notes which follow, the carbon numbers used to classify the different
types of steroids refer only to those which constitute the parent steroid skeleton.
They do not include any carbon atoms which may be present as substituents
(e.g. 6-methyl), or in derivative groups such as the ethers or esters of steroid
alcohols. Within the Type of Compound Index, the steroid groups are arranged
in order of increasing carbon number, which may not correspond exactly with
the order in which they are discussed in the following sections.
3
5
4
For purposes of naming and illustrating estrogens the form (i) is preferred;
the natural estrogens are accordingly derivatives of estra-1,3,5(10)-triene. They
have a hydroxyl group at C-3, and hydroxyl or carbonyl at C-17. Some
metabolites have additional oxygen functions elsewhere.
The trivial names and abbreviations of Estrone (3-hydroxyestra-1,3,5(10)-
trien-17-one), Estradiol (estra-1,3,5(10)-triene-3,17β-diol) and Estriol (estra-
1,3,5(10)-triene-3,16α,17β-triol), are commonly used, especially in biochemical
and medical contexts. These trivial names are sometimes incorporated into those
of derivatives (e.g. 17α-Methylestradiol).
Equine estrogens, with unsaturation additionally in ring B, belong to the
estra-1,3,5(10),7-tetraene and estra-1,3,5,7,9-pentaene series. Note the
recommended change in numbering for unsaturation in the latter case, where it
is possible to avoid the need for a compound locant [5(10)].
137
O
H
HO
3-Hydroxyestra-1,3,5,7,9-pentaen-17-one
OH
17β-Hydroxyandrost-4-en-3-one (Testosterone)
Zhou, Z.X. et al. (1994) Recent. Prog. Hormone Res., 49, 249.
19-Norpregnane
O
Pregn-4-ene-3,20-dione (Progesterone)
138
Norcholan-23-oic acid (C23) and cholan-24-oic acid (C24) steroids
(VT0650, VT0800)
The largest single group in this class comprises the bile acids, the majority of
which are cholan-24-oic acids. The shorter form ‘cholanic acid’ has been widely
used but is not recommended.
Naturally occurring bile acids are mainly 5β-cholan-24-oic acids with
hydroxyl substitution at C-3, and variously at other sites. The orientation of the
hydroxyl groups is usually α-, and bile acids have well-defined polar and non-
polar regions. This amphipathic quality of bile acids is essential to their
physiological functions.
Bile acids are biosynthesised from cholesterol; primary bile acids directly so
(e.g. Cholic acid, Chenodeoxycholic acid); subsequent action by intestinal
bacteria yields secondary bile acids (e.g. Deoxycholic acid, Lithocholic acid).
COOH
HO
H
3α-Hydroxy-5β-cholan-24-oic acid (Lithocholic acid)
Omission of one of the side chain carbon atoms leads to the 24-nor series
(24-norcholan-23-oic acids); loss of two carbon atoms gives the 23,24-
dinorcholan-22-oic acids. The latter are sometimes named as pregnane-20-
carboxylic acids, requiring a sequence-rule descriptor of the C-20 configuration.
H
COOH
H
H
23,24-Dinor-5α-cholan-22-oic acid
5α-Pregnane-20S-carboxylic acid
Whenever cholane nomenclature is used, the side chain has the C-20
configuration which is illustrated above for Lithocholic acid (20R, in the
absence of substituents at or near C-20), unless the opposite is specified.
139
DNP. The formulae illustrated below also show the 17β and 20R configurations
which are implied in the absence of a contrary indication.
The naturally occurring compounds generally have a 20(22)-double bond and
are commonly called cardenolides, for example Digitoxigenin is 3β,14β-
dihydroxy-5β-card-20(22)-enolide. The cardenolide glycosides are of particular
interest because of their cardiac activity.
O O O O
23
21 H
22
H
H H
OH
HO
H H
5β-Cardanolide Digitoxigenin
H
H
H
5β-Bufa-20,22-dienolide
The sterols
The sterols comprise several major groups of steroids characterised by having a
hydroxyl group at C-3, normally in the β-configuration, and branching side
chains of from eight to ten or more carbon atoms at C-17. They occur widely
throughout the animal and particularly the plant kingdoms. They have both
structural roles, as membrane constituents, and a key place in the biosynthetic
sequences which lead to the steroid hormones and other biologically active
steroidal species.
The following sections detail the main features of the various parent
hydrocarbons which provide the structural basis and classification of the sterols.
140
Cholestane steroids (C27) (VT1050, VT1100)
The cholestane skeleton, which derives its name from the longest-known and
most familiar compound of its class, Cholesterol, can be regarded as the parent
from which almost all other sterols are derived. This is true structurally, if not
necessarily in terms of the detailed biosynthetic pathway.
H 22 24 26
21
20 23 25
17
H 27
H
H
Cholestane (5α- or 5β-Cholestanes)
Other classes of sterols are derived from cholestane by the addition of one or
more carbon atoms at side-chain positions, most commonly C-24 (see
ergostanes, stigmastanes, etc, below).
Several other steroid classes have structures based upon the C27 cholestane
framework, although this is not always immediately apparent from the formulae
as drawn. The ecdysteroids (insect moulting hormones) are highly oxygenated
cholestanes. Many plant products that are commonly classified as tetracyclic
triterpenes are cholestanes with altered stereochemistry and/or additional methyl
substitution in the ring system, notably at C-4, -8, or-14. The dividing line
between the sterols and the tetracyclic triterpenes is a matter mainly of origin
and custom. Spirostans, furostans, and many of the steroidal alkaloids have
structures which are formally derived from cholestanes by linking between two
side-chain sites, or between a side-chain and a skeletal carbon, via an oxygen
(epoxy) or nitrogen (epimino) bridge. Vitamin D3 and its analogues are 9,10-
secocholestanes. All of these classes are described separately below.
Alkylated cholestanes of many types occur widely in plants, fungi, and
marine organisms. The very large classes of 24-methylcholestanes (ergostanes
and campestanes) and 24-ethylcholestanes (stigmastanes and poriferastanes) are
sufficiently important that their parent hydrocarbons have been assigned these
special systematic names (not used in Chemical Abstracts however). They are
treated in separate sections below. The 4,4,14-trimethylcholestanes (lanostanes)
are covered in the preceding terpenoid section. Many alkylcholestane
derivatives, however, fall outside these major groups, and have not been
dignified by special class names. They are treated in DNP as derivatives of
cholestane. Others are homocholestanes, in which additional carbon atoms
lengthen the side-chain, rather than branching off it. Many of these unusual
sterols are best known by trivial names that reflect their biological origins.
The cholestan-26-oic (or 27-oic) acids (VT1100) form a small but significant
group of bile acids (see cholanes above).
141
H H
H H
24 24
20 20
17
H H
17
H H
Ergostane Campestane
For historical reasons, most of the compounds of these classes have become
known as ergostane (or ergosterol) derivatives, even though, according to
current nomenclature, some of them should strictly be named as campestanes. In
DNP, therefore, ergostane-based nomenclature is generally given precedence,
with campestane synonyms added where appropriate.
A further complication, firmly rooted in historical precedent, is the use of the
locant C-28 for the carbon atom of the 24-methyl group. The latest IUPAC-IUB
recommendation is that the locant C-28 be reserved for the 4α-methyl group in
lanostanes, and in other 4,4-dimethylsterols of terpenoid type, with C-29 and C-
30 allocated, respectively, to the 4β- and 14α-methyl groups.* The locant C-28
has therefore acquired two distinct meanings, according to context. In DNP the
C-24 methyl group in ergostanes and campestanes retains its original locant as
C-28, allowing the use of derivative names containing such expressions as
ergost-24(28)-ene (for 24-methylenecholestanes) or ergostan-28-ol (for 24-
hydroxymethylcholestanes). The cholestane-based synonyms favoured by
IUPAC-IUB are also given, where necessary, for clarity.
28
CH2OH
H H
24
H H
24-Methylenecholestane 24-Hydroxymethylcholestane
[Ergost-24(28)-ene] (Ergostan-28-ol)
* The current IUPAC-IUB-recommended locant number for the 24-methyl carbon is 241,
applicable, for example, in listing 13C nmr assignments, but not recommended for use as a locant
for unsaturation or further substitution. In such cases the entire C-24 substituent should be appro-
priately named, e.g. as a 24-methylene or 24-(hydroxymethyl) derivative of the cholestane series.
142
more widely used of the two names, a situation paralleling that described above
for ergostanes and campestanes. In the Fieser system, stigmastanes have the
24α configuration, and poriferastanes are 24β. Again the sequence rule is
now preferred, with 24R or 24S depending upon local substitution and/or
unsaturation. Chemical Abstracts indexes poriferastanes asg 24S-stigmastanes.
H H
H H
24 24
20 20
H 17
H
17
H H
Stigmastane Poriferastane
As with ergostanes, common usage over several decades has favoured the
locants C-28 and C-29 for the two ethyl carbon atoms, and these are used here.
The IUPAC-IUB recommendation is that the two ethyl carbon atoms be desig-
nated 241 and 242 whenever locants are needed. Synonyms based upon 24-ethyl-
cholestane, 24-ethylidenecholestane, or 24-vinylcholestane are given in DNP
where suitable.
H H
H H
24-Ethylcholestane 24-Ethylidenecholestane
H
24-Vinylcholestane
OH
OH
HO
OH
HO
H
O
α-Ecdysone
143
Grieneisen, M.L. (1994) Insect Biochem. Mol. Biol., 24, 115
Nakanishi, K. (1971) Pure Appl. Chem., 25, 167.
24 27
O
H H
H
H H
H
Furostan
27
21
H O 26 25
F
20 23
24
E
O
H H
H
H H
H
Spirostan
144
Withanolide and brassinolide steroids (C28) (VT1400)
The withanolides are a group of naturally occurring plant steroids with an
ergostane skeleton and a side-chain δ-lactone ring linking C-22 and C-26. The
lactone ring is usually unsaturated at C-24, and there is a high level of
oxygenation in the skeletal rings, frequently including a 2-en-1-one system and
a 5,6-epoxide. The configurations are as shown below and the configuration at
C-22 is usually R.
28
27
24
23 25
22 26
O O
H
Withanolide
HO HO
OH OH
H H
HO HO
OH
HO O HO
H H
O O
Brassinolide Castasterone
145
chains, frequent presence of cyclopropane/cyclopropene functionality in the side
chain, and wide variation in the steroid A–D ring skeleton, including many
examples of A-nor and 19-nor variants.
H H
H 24
22
20 23
17
H
Gorgostane
HO HO
7-Dehydrocholesterol Previtamin D3
(Cholesta-5,7-dien-3β-ol) (9,10-Secocholesta-5(10),6Z,8-trien-3S-ol)
146
8
7
6
3
HO HO
Ergocalciferol; Vitamin D2 Cholecalciferol; Vitamin D3
(9,10-Secocholesta-5Z,7E, (9,10-Secocholesta-5Z,7E,
10(19)22E-tetraen-3S-ol) 10(19)-trien-3S-ol)
147
Aminoacids and peptides (VV)
Aminoacids (VV0050–VV0140)
Protein α-aminoacids (VV0050)
The common α-aminoacids are characterised by the structure
RCH(NH2)COOH, where R is an aliphatic (including hydrogen), aromatic or
heterocyclic group. The exception is Proline, strictly an iminoacid, in which the
N atom is incorporated into a 5-membered pyrrolidine ring.
They are the primary products of nitrogen anabolism in plants, where they are
produced from ammonia (derived ab initio by nitrate reduction or nitrogen
fixation) by a process called the glutamate synthetase cycle. This produces
glutamate which is then transformed into the other aminoacids by a variety of
processes.
The aminoacids thus represent the most important nitrogenous component (in
terms of volume and accessibility) of the chiral pool produced by living
organisms and are of great importance in chiral synthesis.
Several hundred plant aminoacids are known. Of these, 20 only (known as
the primary protein aminoacids) are incorporated by all organisms into peptides
and proteins (not all of these 20 aminoacids can be biosynthesised by animals).
This protein synthesis occurs in the ribosomes by a process involving ribo-
nucleic acid (RNA), the nucleoside chain of which transmits the template
instructions of the DNA genetic material to the protein sequences, each primary
aminoacid in the chain being coded for by one or more nucleoside base triplets
or codons.
There is an IUPAC-IUB standard 3-letter code for each of the protein
aminoacids (as well as for the common nonprotein aminoacids). For ease of
computerised documentation of large peptide structures, one-letter codes have
more recently been introduced but these are not used in DNP as the full
structures of proteins and large peptides are not given in entries.
IUPAC-IUB
abbreviations
1. Alanine Ala A
2. Arginine Arg R
3. Asparagine Asn N
4. Aspartic acid Asp D
5. Cysteine Cys C
6. Glutamic acid Glu E
7. Glutamine Gln Q
8. Glycine Gly G
9. Histidine His H
10. Isoleucine Ile I
11. Leucine Leu L
12. Lysine Lys K
13. Methionine Met M
14. Phenylalanine Phe F
15. Proline Pro P
16. Serine Ser S
17. Threonine Thr T
18. Tryptophan Trp W
19. Tyrosine Tyr Y
20. Valine Val V
Aminoacids and their corresponding 3-letter and 1-letter codes
148
Various posttranslational protein aminoacids known as secondary aminoacids
may then arise in the protein by various processes such as conjugation of OH,
SH or NH groups, N-methylation or hydroxylation (especially to produce 4-
Hydroxyproline). A special case of posttranslational change is the reversible
oxidation of cysteine residues to produce the disulfide Cystine thus linking
different parts of the peptide chain by disulfide bridges as part of the secondary
structure of the protein.
With the exception of Glycine, all of the genetically coded protein
aminoacids are chiral and belong to the L-series. In all cases except Cysteine,
this corresponds to (S-) according to the Cahn-Ingold-Prelog convention. In
Cysteine the higher priority of the CH2SH group over the COOH group
means that L- corresponds to (R-).
COOH
H2N C H
CH2SH
L-(≡R-)-Cysteine
Hatanaka, S.I. et al. (1992) Prog. Chem. Org. Nat. Prod., 59, 1 (aminoacids from fungi).
Scannell, J.P. et al. (1974) in Chemistry and Biochemistry of Amino Acids Peptides and
Proteins, Dekker, New York (antimetabolites).
β-Aminoacids (VV0120)
A number of β-aminoacids occur naturally, especially in peptide hormones and
antibiotics. Of these the most widespread is β-Alanine.
Drey, C.N.C. (1985) in Chemistry and Biochemistry of the Amino Acids, (ed. G.C.
Barrett) Chapman & Hall.
149
Peptides (VV0150–VV0600)
Peptides are oligomers and polymers notionally derived from aminoacids by
condensation to produce amide linkages. The boundary between oligopeptides
and polypeptides is arbitrary and in DNP has been set at 10 aminoacid residues.
The configuration of aminoacid residues in polypeptides is assumed to be L-
when not indicated otherwise.
There is evidence that in higher organisms small peptides (hormones) can
arise only by cleavage of protein prohormones.
A large number of biologically-active atypical peptides have been isolated
from bacteria, actinomycetes and fungi. Structurally they represent an extremely
diverse group, encompassing those metabolites containing two or more
aminoacid residues linked by a peptide bond, but possessing some additional
features not characteristic of proteins. These may include unusual aminoacid
residues, protein aminoacids with the D-configuration or raised to a higher
oxidation level, or non-peptide linkages between residues (e.g. ester, lactone or
a γ-glutamyl amide). In addition the molecules may be linear or cyclic, contain
one or a combination of the above mentioned features, be modified by further
interactions between the side chains of amino-acid units within the peptide, or
conjugated with either lipids or sugar units.
Depsipeptides (VV0600)
Cyclic heterodetic peptides or peptide lactones are those in which one or more
of the peptide bonds have been replaced by ester linkages. Valinomycin and
related antibiotics, though of no clinical value, are important biochemical tools
in that they specifically complex with alkali metal ions. The Actinomycin
family possess two peptide lactones attached to a common phenoxazine system
and form stable complexes with DNA by intercalation; they are used clinically
in the treatment of child leukaemia.
150
Large peptides and proteins (VV1000, VV2000)
Entries are given in DNP for the majority of bioactive peptides secreted by
plants and animals for which reasonable structural information exists, including
many insect neuropeptides which are an active field of research. Entries are
presented for the most important non-enzyme proteins and for some enzymes,
but full structures are not given in individual entries, the structures where
known can be assessed via the cited references.
Bladon, C. (1993) in The Chemistry of Natural Products, 2nd edn (ed. R.H. Thomson)
Blackie, Glasgow, p. 183 (rev).
Fusetani, N. et al. (1993), Chem. Rev., 93, 1793–1805 (sponge peptides)
Gross, E. (ed.) (1983) The Peptides, Academic Press, New York (general).
Lipmann, F. (1980) Adv. Microbiol. Physiol., 21, 227 (biosynth).
Sammes, P.G. (1975) Prog. Chem. Org. Nat. Prod., 32, 51 (cyclodipeptides).
β-Lactams
Penicillins and cephalosporins (VV0700, VV0800)
These are by far the most important group of the β-lactam antibiotics.
The naturally-occurring penicillins are a closely related group of antibacterial
agents produced predominantly by fungi. They possess a common β-lactam-
thiazolidine fused system. The N-acyl side-chain, in which variation can occur,
is limited to a small number of aliphatic and aromatic groups.
The naturally-occurring cephalosporins which are produced predominantly by
Acremonium/Cephalosporium and Streptomyces spp. possess a common β-
lactam-dihydrothiazine fused system, but in this case the side-chain is limited to
an α-aminoadipoyl group.
All β-lactams act by inhibiting bacterial cell wall biosynthesis. They are, in
varying degrees, susceptible to the inactivating β-lactam enzymes present in
many pathogens. The penicillins and cephalosporins are biosynthetically related
151
to, and derived from, a common tripeptide precursor. The other groups appear to
be produced by alternative pathways involving either peptide or aminoacid
intermediates.
S S
N N
O O
β-Lactam-thiazoline β-Lactam-dihydrothiazine
Carbapenems (VV0900)
These are now a fairly substantial group of naturally-occurring bicyclic β-
lactams. In terms of chemical stability they are highly sensitive compounds but
nevertheless exhibit potent broad spectrum antibacterial activity. Due to the low
titre and difficulties with isolation from microbial sources the most promising
clinical candidate, Imipenem, is currently produced by total chemical synthesis.
N
O
Carbapenem nucleus
Nocardicin A
Clavams (VV0950)
The so-called clavams are predominantly produced by Streptomyces spp. The
most important member of this group, Clavulanic acid, although exhibiting
limited antibacterial properties is a potent β-lactamase inhibitor and is used
clinically in combination with semisynthetic penicillins. The other naturally-
occurring clavams, which have the opposite chirality at the ring junction, lack
antibacterial properties but demonstrate some antifungal activity.
H
O CH2OH
N
O
COOH
Clavulanic acid
Allen, J.D. et al. (1986) Adv. Intern. Med., 31, 119 (β-lactams).
Baldwin, J.E. (ed.) (1983) Tetrahedron Symposium in Print No. 10, 39 (penicillins).
Barrett, G.C. (ed.) (1985) Chemistry and Biochemistry of the Amino Acids, Chapman &
Hall, London.
Bentley, P.H. et al. (eds), (1992) Recent Adv. in the Chem. of β-Lactam Antibiotics,
Proc. 4th Int. Symp., RSC, London.
Brennan, J. (1986) in Amino Acids Peptides and Proteins, specialist periodical reports,
RSC, London, 17, 171.
152
Brown, A.G. et al. (eds) (1985) Recent Advances in Chemistry of β-lactam antibiotics,
RSC Special Publication no. 52.
Bycroft, B.W. et al. (1987) Biotechnology Handbook, Vol. 1, (ed. J. Peberdy) Plenum
Press, New York, pp. 113.
Casy, A.F. et al. (1989) J. Pharm. Biomed. Anal., 7, 1121 (ms).
Demain, A.L. et al. (ed.) (1983) Antibiotics containing the β-lactam structure,
Handbook of Experimental Pharmacology, p. 67.
Frydrych, C.H. (1991) Amino Acids Pept., 22, 294; (1992) 23, 249.
Jensen, S.E. (1986) CRC Crit. Rev. Biotechnol., CRC Press, Boca Raton, 3, 277
(biosynth).
Kleinkauf, H. et al. (eds) (1990) Biochem. of Pept. Antibiot. Recent Adv. in Biotechnol.
of β-Lactams and Microbial Bioactive Pept., De Gruyter, Berlin.
Morin, R.B. et al. (1982) Chemistry and Biology of β-lactam Antibiotics, Vols 1–3,
Academic Press, New York (general).
Ono, H. et al. (1990) Biochem. Pept. Antibiot., 131.
O’Sullivan, J. et al. (1986) in Biotechnology, Vol 4, (ed. H. Page) VCH, Weinheim,
Ger., p. 247.
Parker, W. et al. (1986) Adv. Appl. Microbiol., 31, 181 (monobactams).
Robinson, J.A. et al. (1985) Nat. Prod. Rep., 2, 293 (biosynth).
Rolinson, G.N. (1986) J. Antimicrob. Chemother., 17, 5.
Salton, M.R.J. et al. (eds) (1981) β-Lactam antibiotics: Mode of Action, new
developments and future prospects, Academic Press, New York.
Stachulski, A.V. (1989) Amino Acids Pept., 20, 249; (1990) 21, 248.
Walsh, T.F. (1988) Annu. Rep. Med. Chem., 23, 121.
Williamson, J.M. (1986) CRC Crit. Rev. Biotechnol., 4, 111 (biosynth).
Glycopeptides (VV3000)
The members of the Vancomycin family are the most significant within this
relatively small and structurally self-evident category of antibiotics. Their
activity is restricted to gram-positive organisms but they are particularly
effective against the so-called multiresistant streptococcal and staphylococcal
strains and for this reason have found significant clinical application.
153
Alkaloids (VX)
Alkaloids are a large group of nitrogen-containing secondary metabolites of
plant, microbial or animal origin. The term originally implied pharmacologically
active bases of plant origin, but the definition has subsequently been broadened
so that it is now generally considered to include the majority of nitrogen-
containing natural products with the exception of the simple aminoacids,
proteins and nitrogen-containing substances of polyketide origin such as the
aminoglycoside antibiotics. Basic properties may be weak or absent as in the
various types of amide alkaloids. The class of microbial alkaloid overlaps
considerably with that of the nitrogenous antibiotics, and substances such as the
cytochalasans which show antibiotic properties are in DNP classified as
alkaloidal, the definition being a matter of semantics.
Biogenetically and structurally the alkaloids are diverse and it is usual to
discuss them in terms of biogenetic origin rather than purely on the basis of
structural features. The organisation of alkaloid groups within the Type of
Compound Index follows the order given below.
H2NCH2CH2CH2CHCOOH N CH2COCH3
NH2 Me
Ornithine Hygrine
154
O O
HO
A
2
HO CH3
O O
Chromone Noreugenin
Tropane
8-Methyl-8-azabicyclo[3.2.1]octane, 9CI
Retronecine
Many, and perhaps the majority, of pyrrolizidine alkaloids occur in the plant
as N-oxides, the N-oxide function being lost during isolation.
155
O
O H
H
H HH
N H O O
H
Tuberostemonine
Pseudopelletierine Lupinine
All these structural types have their analogues among the pyrrolidine
alkaloids, and while it is tempting to assume biosynthesis from lysine it may not
in all cases be true; Coniine, for example, appears to be acetate-derived.
O
O N
O
Piperine
156
laburnum, and the closely-related sparteine group (note tricky stereochemistry
owing to twofold rotation-reflection axis). Penta- and hexa-cyclic bases are
found in Ormosia species; of these Ormosanine is representative.
13 NH H 17 15
7 5 7 C
N 14
5 6 8 11 4 6 D 13
4 11
3 A B 12
9 3 9
2 N 10 2 N 10
H
O
(+)-Cytisine (–)-Sparteine
OMe
MeO
H
MeO
Cryptopleurine
8
15
12 7
10 6
14 5
9 13
N 4 O
1 3
2
Lycopodine
157
MeO 17
HO OMe
MeO 18 14 21 17
21 13 20 18
20 14
24 12
10 11 1 13
OH 2 12
1 9 OH HO
2 N 3
H
11
3
H 5
H 4
N 10
4 6
HO H 5 9
H H
HO
Lythrancine Lythranidine
O
12
H 11
O
5 3 13
H
N 14
1
O
23
MeO
OMe
Decaline
N Me
Nicotine
158
Fodor, G. and Colasanti, B. (1985) The Pyridine and Piperidine Alkaloids: Chemistry
and Pharmacology, in Alkaloids: Chemical and Biological Perspectives, (ed. S.W.
Pelletier) Vol. 3, Wiley-Interscience, New York.
Leete, E. (1983) Biosynthesis and Metabolism of the Tobacco Alkaloids, in Pelletier,
Vol. 1.
Numerous alkaloids are derived from polyacetate precursors, together with one
or more amino acids. A few alkaloids, however, are almost entirely acetate-
derived. These include Coniine, from hemlock; this, perhaps surprisingly, is not
derived from lysine. Similarly, Pinidine, from Pinus sabiniana, is acetate-
derived. Other piperidine derivatives with side-chains at position 2, which may
be acetate-derived, although proof is at present lacking, include Nigrifactine
from Streptomyces species, Carpaine and Cassine.
N
H
Coniine
CH3
HO 6
1 NH
MeO
Ancistrocladine
159
to a macrocyclic ring of 11, 13 or 14 atoms. The macrocycle may be a
carbocycle, a lactone or a carbonate. In addition the isoindole ring carries either
a phenyl or an indolyl substituent at position 10; the latter group includes the
Chaetoglobosins.
18
O O O
13 23 13 22 13 20
O O O
8
O O 21
9 1 1 1
6 N 6 N 6 N
5 3 5 5
12
10 10 10
11
Lactone Carbonate Carbocycle
Binder, M. et al. (1973) Angew. Chem. Internat. Ed. Engl., 12, 370.
Cole, R.J. (1981) Toxic Fungal Metabolites, Academic Press, New York.
Dyke, H. et al. (1986) J. Chem. Soc. Chem. Commun., 1447 (synth).
Fodor, G. and Colasanti, V. (1985) The Pyridine and Piperidine Alkaloids: Chemistry
and Pharmacology, in Alkaloids: Chemical and Biological Perspectives, (ed. S.W.
Pelletier), Vol. 3, Wiley-Interscience, New York.
Jones, R.C.F. (1984) Nat. Prod. Rep., 1, 97.
Jones, T.H. and Blum, M.S. (1983) Arthropod Alkaloids: Distribution, Functions, and
Chemistry, in Pelletier, Vol. 1.
Pendse, G.S. (ed.) (1987) Recent Advances in Cytochalasans, Chapman & Hall,
London.
Turner, W.B. (1983) Fungal Metabolites II, Academic Press, New York.
COOMe
NHMe
OMe
Damascenine
160
Simple quinoline alkaloids (VX1480, VX1520, VX1540, VX1560,
VX1580)
These include Echinopsine and the phenethyl-quinoline, Cusparine.
O
4
5 3
6
7
8 1 2
N
Me
Echinopsine
In simple quinolone derivatives C-2 and C-3 are derived from acetate;
introduction of a prenyl group at C-3 followed by cyclisation then gives a
furanoquinoline, e.g. Platydesmine, or a pyranoquinoline alkaloid, e.g.
Flindersine. Interestingly, Dictamnine, from Dictamnus albus, appears to be
formed by loss of acetone from an oxidation product of Platydesmine.
OMe
5 4 3
6
7 2
8 1
N O
Dictamnine
AcO H
N
O H
N
N
O O
N H
Vasicine NOH
N
O
Tryptoquivaline
O
OMe
N O
Me
Acronycine
161
Acridone-coumarin alkaloid dimers (VX1690)
The isolation in 1990 of the Acrimarines represented the first examples of
acridone-coumarin dimers from natural sources. To date, some twenty
compounds have been reported from Citrus plant (Rutaceae) roots.
MeO O OH
DIMBOA
Saxton, J.E. (1973) in The Acridines, 2nd edn (ed. R.M. Acheson), Wiley-Interscience,
New York.
162
Alkaloids derived wholly or in part from phenylalanine
or tyrosine
This extremely large and varied category consists of several different structural
types, which range from simple β-phenylethylamine derivatives to the much
more complex structures exemplified by the alkaloids of the Amaryllidaceae and
the bisbenzylisoquinoline alkaloids. The isoquinoline derivatives themselves
consist of a large number of structural types, which can be divided into upwards
of 20 sub-groups.
O 12
H
3
2
8 14
6 N
7 15
Securinine
Phyllanthidine is presumably an oxidative transformation product of
Allosecurinine, the diastereoisomer of Securinine, which also occurs naturally.
163
Krane, B.D. et al. (1982) J. Nat. Prod., 45, 377.
Menachery, M.D. et al. (1986) J. Nat. Prod., 49, 745.
Phillipson, J.D., Roberts, M.F. and Zenk, M.H. (eds) (1985) The Chemistry and Biology
of Isoquinoline Alkaloids, Springer Verlag, Berlin.
Shamma, M. (1972) The Isoquinoline Alkaloids, Academic Press, New York.
Shamma, M. and Moniot, J.L. (1978) Isoquinoline Alkaloid Research, 1972–1977,
Plenum Press, New York.
Polycarpine
164
Bisbenzylisoquinoline alkaloids (VX2340–VX2430)
This very large group of alkaloids is composed of two benzylisoquinoline units
attached to each other by one, two, or three bonds. In most cases the units are
joined via ether linkages, but carbon-carbon bonds between the benzyl groups
are also known. The monomeric units involved are mainly hydroxylated or
methoxylated benzylisoquinolines, but aporphine units occur in more than 50
alkaloids, and a few alkaloids contain a proaporphine component. The alkaloids
may be subdivided into the five following major groups (the classification,
proposed by Shamma, contains at least 28 subgroups, all of which are composed
of unmodified benzylisoquinoline units. Dimeric alkaloids containing aporphine,
proaporphine, or otherwise modified benzylisoquinoline components are not
included here).
(a) Alkaloids containing aryl links only. (VX2340) The bark of Popowia
pisocarpa has yielded a group of seven alkaloids which contain a single
aromatic linkage between C-11 and C-11′. These include Pisopowetine and
Pisopowiaridine
(b) Alkaloids containing one ether link. (VX2360) This is the most
common structural type, the ether linkage being in most cases between C-11 and
C-12′, as in Dauricine.
4
5 OMe MeO 5′ 4′
6 6′
7 7′
MeN 1 8
OMe MeO 8′ 1′ NMe
H H
10 10′
11 11′
14 12 O 12′ 14′
13 13′
OH
Dauricine
However, other attachments are known, e.g. between C-11 and C-10′, as in
Vanuatine, between C-10 and C-7′, as in Malekulatine and Ambrinine, and
between C-11 and C-7′, as in Neferine.
(c) Alkaloids containing one aromatic link and one or two ether links.
(VX2370, VX2390) These alkaloids are mainly based on the Rodiasine and
6′,7-didemethoxy-6′,7-epoxyrodiasine skeleton, e.g. Tiliacorine.
5 OMe MeO 5′ 4′
18 6′ 18′
7′
MeN 1 8
OMe 8′ 1′ NMe
H O H
10 10′
14 12 12′ 14′
13 OMe HO 13′
Rodiasine, 9CI
(d) Alkaloids containing two ether links. (VX2380) The largest single sub-
group containing two ether linkages possesses the berbaman skeleton,
exemplified by Berbamine, which contains ether linkages between C-8 and C-
7′, and between C-11 and C-12′.
4 5 5′ 4′
6 6′
7 7′
HN 1 8 8′ 1′ NH
H O H
10 14′
11 13′
14 12 O 12′ 10′
13 11′
Berbaman, 9CI
165
Almost as large as the berbaman group is the oxyacanthan group, e.g.
Oxyacanthine, which contains ether linkages between C-8 and C-7′, and
between C-12 and C-13′. Smaller groups include the Thalicberan (C-8 to C-6′
and C-11 to C-12′), Thalidasan (C-8 to C-5′ and C-11 to C-12′), and Thalman
(C-7 to C-5′ and C-11 to C-12′) types.
5 6′ 5′
6
7 7′ 8′
HN 1 8 1′ NH
H O H
1011 O 13′
14′
14 12 12′ 10′
13 11′
Oxyacanthan, 9CI
All these types contain ether linkages between the benzyl rings and between
the aromatic rings of the tetrahydroisoquinoline component. The Tubocuraran
sub-group contains ether linkages between the benzyl ring of one unit and the
aromatic ring of the isoquinoline component of the other unit. Other linkages of
this kind between benzyl and isoquinoline rings are also known.
5 5′
6 O 6′
7 7′
HN 1 8 8′ 1′ NH
H H
10 14′
11 13′
14 12 O 12′ 10′
5 13 11′
6 6′ 5′
7 7′ 8′
HN 1 8 O 1′ NH Thalman, 9CI
H H
1011 14′
13′
14 12 O 12′ 10′ 6
5
13 11′ 7
8 1 NH
Thalicberan, 9CI O H
12′ 14 10
5 6′ 5′
6 13 11
HN 8
7 O 7′ 8′ NH
12
1 1′
H H H O
14′ 1′
1011
13′ HN 8′
14 12 O 12′ 10′
7′
6′
13 11′ 5′
(e) Alkaloids with three ether links. (VX2400) These include alkaloids with
6′,7-epoxyoxyacanthan (e.g. Trilobine), 7,8′-epoxyoxyacanthan, and 8,12′-
epoxytubocuraran skeletons.
(f) Benzylisoquinoline-Aporphine dimers. (VX2700) Some alkaloids
consist of a benzylisoquinoline unit attached to an aporphine unit, via a single
ether linkage. Of these, Thalicarpine is typical.
(g) Miscellaneous bisbenzylisoquinoline alkaloids include those containing
a dienone ring in one of the isoquinoline components (e.g. Repanduline), some
with an aporphine unit attached to a pavine component, e.g. Pennsylpavine,
those with degraded benzylisoquinoline units (e.g. Baluchistanamine) or with a
proaporphine unit (e.g. Epiberbivaldine), and Cancentrine, which is really a
combination of cularine and morphinan components.
166
Shamma, M. and Moniot, J.L. (1978) Isoquinoline Alkaloid Research, 1972–1977,
Plenum Press, New York.
OMe O
NMe
O
H O
N
CHO
Me
OMe HO
Secolucidine
167
Secocularine alkaloids (VX2450)
The secocularines can be divided into two sub-groups, namely B- and C-ring
secocularines. B-Ring secocularines, exemplified by Secocularine, Secocul-
aridine and Norsecocularine, are structurally related to phenanthrene alkaloids
derived from aporphines and are probably formed in vivo by Hofmann
degradation of cularines. C-Ring cularines, represented by Noyaine, constitute a
new type of alkaloid without counterpart among aporphinoids.
NMe2
HO MeO NMe
O O O
COOMe
MeO
O
NMe
O 14 H
8
MeO N
32
HO 31
O
MeO
Cancentrine
+
MeO NMe2
OH
Quettamine
168
MeO
7 Me
HO 13 N
+ OMe
H
OMe
Cryptaustoline
MeO 4 5 6
3 OMe
7 8
2
MeO 1 NR
b 10 9 OMe
12
a 11
MeO a
Pavine R = H
HO NMe Argemonine R = Me
OMe
OH
b
Reticuline
O 4 5
3 6 7 8 OMe
2
O
1 12 NMe 10 9 OH
11
Amurensine
(Isopavine type)
HO
OR
MeO 3 4
2 5
1
HO 6a NMe
7 H Aporphine skeleton
12
MeO 11 8
10 9
O
Orientalinone
169
Aporphine alkaloids (VX2610, VX2620, VX2640, VX2700, VX2750,
VX2780, VX2800, VX2820, VX2840, VX6820, VX6840)
This large group of alkaloids simply contains the tetracyclic ring system formed
by phenol oxidative coupling of a benzylisoquinoline precursor. The structural
variations include:
(a) the simple aporphines, exemplified by Glaucine;
MeO 4
3 5
2
1
MeO 6a NMe
7
H
11
10
MeO 9 8
OMe
Glaucine
(b) dehydro derivatives of (a), in which the double bond is generally between
C-6a and C-7 in N-methyl compounds and between C-6a and N in apo
compounds;
(c) miscellaneous oxidative derivatives of (a), mostly with a hydroxy or
methoxy function at C-7, or two at C-4 and C-7;
(d) those with an aromatic isoquinoline ring and a carbonyl group at C-7, e.g.
Liriodenine, the so-called oxoaporphines;
(e) miscellaneous aporphinoids. Included in this group are Telezonine,
duguenaine-type aporphinoids, ring A quinonoid aporphinoids (e.g.
Pancoridine), oxoisoaporphines (e.g. Menisporphine), azafluoranthenes (e.g.
Rufescine), diazafluoranthenes (e.g. Eupolauridine), 1-azaoxoaporphinoids
(e.g. Sampangine), azahomoaporphines (e.g. Dragabine), so-called catechol
dioxygenase oxidized aporphinoids (e.g. Andesine, Chiloenine,
Santiagonamine), tropoloisoquinolines (e.g. Imerubrine) and Cleistopholine-
and Onychine- type alkaloids.
(f) compounds in which the heteroring has opened to give phenanthrene
derivatives, with the CH2CH2NR1R2 chain still present, e.g. Taspine;
(g) compounds derived from (f) which have lost C-5, mostly containing a
five-membered lactam ring (the aristolactams, e.g. Cepharanone A). The class
even includes some members in which nitrogen has been oxidised to a nitro
group, e.g. Aristolochic acid A.
O
O 2 1
3
4 NH
O 10
9
5
7 8
Cepharanone A
Although the aristolochic acids and aristolactams are non-basic they are still
classified as alkaloids since their respective skeletons bear a distinct
resemblance to that of the aporphines.
(h) a group of dimeric aporphinoid alkaloids exemplified by the aporphine-
benzylisoquinoline dimers, e.g. Thalifaberine, the proaporphine-
benzylisoquinoline dimers (e.g. Pakistanamine), and the Hernandaline-type
and Coyhaiquine-type alkaloids. The two latter types are, respectively,
oxidation products of the aporphine-benzylisoquinolines and proaporphine-
benzylisoquinolines.
A new addition to this class of compound are the proaporphine-tryptamine
dimers. These heptacyclic alkaloids, found in Roemeria hybrida (Papaveraceae)
170
and Phoebe grandis (Lauraceae), are probably derived biogenetically by a
Mannich-type condensation of a ketonic tetrahydroproaporphine with a
tryptamine analogue. Roemeridine is a typical example.
To this listing must be added a small but significant group of bisaporphines.
The majority of these dimers are bonded through a carbon-to-carbon linkage at
C-7 and C-7′ (e.g. Urabaine), although examples of C8-C8′ coupled
bisaporphines [e.g. (8, 8′-R)- and (8, 8′-S)-Bisisocorydine] and oxygen-bonded
dimers (e.g. 11,8′-O-Bisisocorydine, Dehatriphine) have recently been
isolated.
Cavé, A., Leboeuf, M. and Waterman, P.G. (1987) in Alkaloids: Chemical and
Biological Perspectives, (ed. S.W. Pelletier), Vol. 5, Wiley-Interscience, New York.
Gözler, B. et al. (1990) J. Nat. Prod., 53, 675 (aporphine dimers).
Guinaudeau, H. et al., Lloydia, (1975) 38, 275; J. Nat. Prod., (1979) 42, 133, 325;
(1983) 46, 761; (1984) 47, 565; (1988) 51, 389, 1025; (1994), 57, 1033
Jackman, L.M. et al. (1979) J. Nat. Prod., 42, 437.
Mix, D.B. et al. (1982) J. Nat. Prod., 45, 657 (aristolochic acids and aristolactams).
Shamma, M. and Guinaudeau, H. (1984) Tetrahedron, 40, 4795.
HO 2
1
12 10
O4 13 9
H
14
5 NMe
15 H
6 16
8
HO
Morphine
Blaskó, G. and Cordell, G.A. (1988) Heterocycles, 27, 1269.
171
morphine alkaloids, although there is clearly an important divergence in the
later stages.
Tyrosine
OH OH
MeO MeO MeO
HO
NR NR NR
MeO OH MeO OH MeO OH
OH
O O
OMe
MeO 2
3 4 MeO 3
2 1
1 6 4 11
5 MeO 10
12
NMe 16
13 9
9 14
13
8 5 NMe
12 10 6 8
MeO 11 OMe O
7 OMe
OMe
Protostephanine
Hasubanonine
MeO 4 5
6
MeO 1 N
8
H 13
OMe
12
11 OMe
Tetrahydropalmatine
O NMe
MeO H CH2OH
2′ OMe
OMe
Macrantaline
172
Structure determination in this series, i.e. the correct location of
substituents on the protoberberine ring system, has been a matter of some
difficulty and it is possible that some of the assignments currently given in DNP
will prove to be incorrect.
O OMe
2′ 5′ OMe
OMe 3′ 4′
OMe
Narceine
α-Hydrastine
173
O O 1
18 17
+
NMe
O NMe O 4
6 H
8 H 15
O
O O 12
8 11
O HO
O
O
Rhoeadine
(CA numbering)
+ +
NMe NMe
–
O O
4 5
6
1 14 NMe
O 13 8
12 9
174
O
O
R
N
O
O
Tetrahydrocoptisine R = H
Tetrahydrocorysamine R = Me
CHO
O
R
O
NMe
O
O
HO 12
O
1
11
R 13
4 O
14
10
H
7 6 5 NMe
O
O
Chelidonine R = H
Corynoline R = Me
5 4
6
14 1
12 13
11
10 7
9 8 N
175
HO
HO NMe
COOH COOH
RO +
RO NH2
OR
OH
Phenethylisoquinolines
RO 2
3 4
5 RO 12 11
1 RO 18
RO NMe NMe 10
RO 15
RO H RO 9
N 8
13 7
8 5 7
12 4 6
1110 9
3 2 1
O RO OH RO
R
Homoproaporphine Autumnaline Homoerythrina
type type type
HO
RO 3 4
5
RO 12 11 2
RO 1 10 1 H 6
2 NMe
3 16 17 RO
RO NMe RO 4 9 6a
RO
13 NMe 7
RO 14 15 H
5
R 12
8
8
11
6 7
R 10 9
MeO O RO
O R
Homomorphine type Homoaporphine type
RO RO 3 4 5
6
2
RO RO 1 7
NMe NHAc
RO RO
12 8
11
10 9
+
RO O O
RO
Colchicine type
Figure 18.
176
Androcymbine arises by phenol oxidative coupling, probably of Autumnaline,
by a process analogous to that involved in the biosynthesis of the morphine
alkaloids.
Lumicolchicines are the product of u.v. irradiation of Colchicine, and while
they have been reported to occur naturally, they could be regarded as artifacts.
MeO
NHMe
HO
OMe
HO OMe
Jerusalemine
177
HO HO
MeO HN MeO N
MeO
MeO O
OH
Erysodienone
O 12
16
15 9
O N 5 N
13
6 7
3 1 H
2
MeO
Erythraline Erythrinan, 9CI
HO NH2
6
5 OH
H 1 2
9 8 N 7 OH
6′
5′ 1′
2′
HO 3′
Norbelladine
MeO
MeO NMe
H
O
O
Cryptostyline I
178
function in Haemanthidine followed by a redox reaction and cyclisation of the
oxygen at C-6 on to position 11 (Haemanthidine numbering) then affords the
Tazettine skeleton.
A further possibility is the migration of C-18 in the haemanthidine skeleton to
position 11, which gives rise to the ring system present in Pancracine and
Montanine.
6
5 OH
H 1 2 5 4 OH
9 8 N 7 OH O 12
6′
1′ 12a 2
2′
HO MeO 6 12b 1
11
8
9
N
Me
Galanthamine (9CI)
16
OMe
2
4 MeO
1 H 17 1 11
12b N Me 4 OH
O 12
6 5 O 10
19 13
15 12a 17 18
OH H 12
13 7 15 14 5
O
9 8 O O 7 6 N
Tazettine OH
Haemanthidine
OH
OH
O OH
H
O NH
O
Narciclasine
HO 3′
1′
6′ 9 8 7
N 1
2 OH
H 6
OH
3
OH
HO MeN H 12
1 3 17 4
15 H 12 16 5
O 11 MeO 11
13
14 16 4 15
H 6 5 14
H6
13
O
8 7 N MeO 8 7 O
OH OH
Lycorine Lycorenine
(Galanthan group, 9CI)
179
(d) Narciclasine alkaloids
This small group, exemplified by Narciclasine, also stems from Norbelladine,
and appears to be formed via Crinine (but not 3-Epicrinine) by loss of the two
carbon bridge and appropriate oxidations.
It should be noted that several of the Amaryllidaceae alkaloids occur in
enantiomeric forms.
OMe
OMe
2′
3 4
5
2
1 6
N 7
O
Me H
Mesembrine
HO COOH
HO
NH2 HO NAc
OHC H H
H OGlc
OGlc
H
H O
O MeOOC
MeOOC
Ipecoside
Secologanin
MeO 8 7 MeO
6
5 N
MeO 11 11b N MeO
H H
1 3
2
H H
α
H
CHO
HN2′ 1′ 8′ OMe
Protoemetine
5′ 5′ OMe
Emetine
Openshaw, H.T. (1970) in Chemistry of the Alkaloids, (ed. S.W. Pelletier), Van Nostrand
Reinhold, New York.
Wiegrebe, W. et al. (1984) J. Nat. Prod., 47, 397.
180
Phenanthroindolizidine and phenanthroquinolizidine alkaloids
(VX3700, VX3760)
These alkaloids are derived from two molecules of phenylalanine or tyrosine,
together with, presumably, ornithine (→ Tylophorine group) or lysine (→
Cryptopleurine group).
OMe OMe
MeO 2 1 MeO
3
4 H
14 13
9 N 11 N
5
6
7 8
MeO MeO
OMe
Tylophorine Cryptopleurine
N
H
N N
Gramine H Me
Chimonanthine
181
MeNHCOO
8
N N
H
Me Me
Physostigmine
Chakraborty, D.P. (1977) Prog. Chem. Org. Nat. Prod., Vol. 34.
N
H
Echinulin
182
5 4 11 1
6 3 10 2
7 2 9 3
8 9 1 N 8 N
N N
H CH3 4
6
5
Harman O
Canthin-6-one
N O
N
H H
MeN
Evodiamine
The alkaloids of Evodia rutaecarpa, e.g. Evodiamine, which are also derived
from anthranilic acid (q.v.), may also be included in this group.
Finally, several alkaloids from Picrasma quassioides are bis-β-carbolines, e.g.
Picrasidine M.
H H
4 8 N
3 9 10 18
5
6 2 12 14
7 1 11 13
N H
H H
16 15
Aristoteline
Borreria alkaloids
These alkaloids also contain a regular terpenoid unit, as in Borrecapine, from
B. capitata. Borreline, from the same plant, contains a degraded monoterpene
unit. Borreverine, from B. verticillata, contains two tryptamine units and a
monoterpenoid component, and may be prepared by dimerisation of Borrerine.
183
7
8 NH
9 H
10 5
4
H
12 2
13
N
H
Ergoline, 9CI
A few other metabolites, which may also be included in this group, are the
result of skeletal rearrangement. Clavicipitic acid is one such compound;
another is α-Cyclopiazonic acid. However, whereas the former is definitely a
product of prenyl-tryptophan metabolism the latter, from Penicillium cyclopium
is not, since it appears to arise from reaction of a tryptophan-acetoacetate unit
with prenyl pyrophosphate.
10′ 9′
H 11′ 8′
O N
H 12′
H
6′
HN 3′ N 5′
18
7
8 NMe
9
10 H
4
12 2
13
N
H
Ergotaman, 9CI
184
COOH
NH2 3 NH
NH N
H H H
OGlc
OHC
H H
OGlc O
MeOOC
H
O
MeOOC Strictosidine
Secologanin
9 7
10 6
11 2
N O N O
N N
H H H
H
20 O
O
OH
O OGlc
Camptothecin Rubescine
185
Strictosidine
9 6
10 8 7 5
11
12 1 2 4
N N
N 3 21 N
H H 14 20 H H H
15 H
19
H 18 H
16
17 MeOOC CHOMe
Corynan, 9CI Corynantheine
6
10 5
11
3 N N
N 21 N
H H 14 H H H H
13
19
H 16
H
17 O O
MeOOC
Oxayohimban, 9CI Ajmalicine
Anhydrovobasindiol
17 6 5
O 14 16
15 O NMe
3
4 3 15 21
5
9
7 6 NMe 20
12 1 2
21
N H 14
20
N O H O
H 19
18
Gelsemine
186
Yohimbinoid alkaloids (VX5040)
The Yohimbine alkaloids contain a carbocyclic ring E formed by C-17 to C-18
bond formation in a corynantheine precursor. As in the corynantheine-ajmalicine
group stereoisomerism at all asymmetric centres except C-15 is known.
6
10 5
11 2
1 N4
N 3 21
H 14 H
H 15
20
H 18
17
Yohimban, 9CI
18 18
187
COOMe
6 5
16 NMe
15
20
N 3 14
H H
19
O
Ervatamine
H N
Cinchonamine Quinine
Uskokovic, M.R. and Grethe, G. (1976) in MTP Series 2, Vol. 9, Alkaloids, (ed. K.
Wiesner), Butterworth, London.
188
6 5
H
N
21
9 14
2 15 20 H
12 1 16
N 19
H H 17 H
H 18
Curan (9CI)
H H
17 18
N Nb
1 16 3
7 15 20 10 7 14 21
2
3 8 H 14 21 11 2 15 20
4 16
N 13 aN
H H H H
10
11 12 22 23 17
22
H 19
23 18
O O
Strychnidine, 9CI
(CA numbering) (biogenetic numbering)
N N
N N
H
H COOMe MeOOC CHO
Akuammicine Preakuammicine
+
Corynanthe 7 N
precursor 21 3
N
H
MeOOC
CHO
3
5 N 18
6 21 19
7 20 14
10 H 15
11 2
16
N H
H H
Condylofolan, 9CI
(biogenetic numbering)
189
suggests an alternative biogenetic route to the one given below. Extensive modi-
fication of this skeleton appears to have occurred in the formation of Gonio-
mine, which can be postulated to be formed by ring-opening and epoxidation of
an indolenine related to Condylocarpine followed by N-1 to C-19 bonding:
H N H N
N O O
H H2N O H
H
N
O
HN OH
Goniomine
N
N
N N
HOH2C COOMe H
COOMe
Secodine
N
H
N
H
MeOOC
Andranginine
190
8 3
10 N 7 5 N 14
11 19 6 6 21 15
7 20 18
12 21
15 5 10
16 2
H 4 20 11 2
H 17 19
3 16
N N
H H H
Aspidospermidine Aspidospermidine
(9CI numbering) (biogenetic numbering
used in DNP)
(c) alkaloids containing an ether or lactone bridge between C-18 and C-21;
(d) alkaloids containing an ether or lactone bridge between C-18 and C-15;
(e) alkaloids containing a lactone ring between C-18 and C-17, and a dihydro-
1, 4-oxazine ring between N-1 and C-12, as in Obscurinervidine:
(f) alkaloids containing an additional bond between C-18 and C-2, as in
Venalstonine;
3
5 N 14
6 H15
21
10
11 2 19
12 18
N
H 17
16
COOMe
Venalstonine
(biogenetic numbering)
N
H
20
H
2 19
N
H
H COOMe
Vindolinine
(h) the Quebrachamine group, which are derived by fission of the 7,21 bond.
These may have lost the C-16 methoxycarbonyl group (e.g. Quebrachamine)
or it may have been retained, as in Vincadine:
5
6
3
N 14
9
21 15
2 20
12 1
N 16 17
H
H COOMe
Vincadine
191
Kopsane alkaloids (VX5560)
The skeleton of the kopsane group of alkaloids is simply formed by attachment
of C-22 (the methoxycarbonyl carbon) of Venalstonine to C-6, as in Kopsine.
Skeletal variations include the alternative acyloin structure, as in Fruticosine, in
which C-22 is attached to C-17.
5 3
O N H 14
6 15
9 21 20
12 1 2
16 17
N
MeOOC OH
19
18
Kopsine
19
7 N 21
20 18
2 3 15
N 16 17 14
H
H
15,20-Dihydrocleavamine
192
N
COOMe
N
H
6 5
9
1 2 3 N
12
N 21 20
H 18
14
MeOOC 17
15
(–)-Coronaridine
(biogenetic numbering)
8 7
12 19
N
13 20
18 5
N 2 4 21
H H 3 H
1
Ibogamine, 9CI
(CA numbering)
193
Stemmadenine
+ OX
N
+ +
N NMe
+ +
N N N
H H H
MeOOC CH2OH MeOOC CH2
OH
H
O
18
CH3 H2C
+ +
9 19
20
21 N NMe NMe
15 14 3
12 1 2 16 +
N N N
H 17CH H H
3
Ellipticine
(biogenetic numbering)
Figure 19.
6
+ 5 N
N
Stemmadenine +
loss of C–5
N N
H H H
MeOOC CH2OH MeOOC CH2OH
Vallesamine
loss of C–6
Me H Me
+
N N
3 N
21 20
14
15
N N N
H H H H
H
Uleine
Apparicine
Figure 20.
194
(d) The Schizozygine group, which contain an additional bond between C-2
and C-18;
(e) Andrangine and Vallesamidine, in which C-21 has simply migrated to
C-2.
195
H
N
O
N N
O
4′ 5′
MeO
NHMe
Staurosporine
Gribble, G.W. and Berthel, S.J. (1993) Stud. Nat. Prod. Chem., 12, 365
(indolocarbazoles).
Lounasmaa, M. and Nemes, A. (1982) Tetrahedron, 38, 223.
Terpenoid alkaloids
(excluding those involving tyrosine or tryptophan)
15
16
Dendrobane, 9CI
(b) a group of indolizidine bases exemplified by Crepidamine and
Crepidine, from D. crepidatum. These are probably not terpenoid in origin, and
may be derived from shikimic acid, acetate, and ornithine.
196
Nuphar alkaloids (VX6360)
The Nuphar (water-lily) alkaloids contain a normal sesquiterpene carbon
skeleton, and can be divided into three main sub-groups:
(a) the furylpiperidine derivatives, e.g. Nuphamine;
(b) the furylquinolizidine derivatives, e.g. Deoxynupharidine;
H
1 9
2 10 8
3 7
4 N 6
O
Deoxynupharidine
There are also a few miscellaneous bases which are based on variants of the
above major sub-groups.
Dihydroagarofuran
197
16 COOCH2CH2NMe2
COOH
15
12 H
H H
H 14
H
3 H 7
4 6 HO O
HO O H
H MeOOC
19
Cassaic acid Erythrophlamine
12 13 O
17 HH 14 16 H
1 10 9 15
2 11 H
NH 8 HN
3 5 7 H
4 6
H H
19
(b) A few alkaloids belong to the heteratisane group, formed from the
aconitane framework by oxidative fission of the 13,14 bond.
(c) The second major group are the C20 alkaloids, based on atidane. Few
alkaloids, as it happens, are based on the parent ring system, since many skeletal
variations are known;
13
17
12
20 16
14 20 14
9 15
1
2 10 8
NH H 7 N H
3 5
4 6 6
H H
19 18
(d) The hetisane group, in which additional rings are introduced into the
atidane ring system by formation of 14,20 and N,6 bonds.
(e) A small group of atidane 7,20 cyclic ethers, as in Ajaconine.
(f) A small group of bases in which an additional carbocyclic ring is
introduced by attachment of C-7 to C-20, as in Denudatine.
(g) Complex hetisane derivatives, e.g. Delnudine, in which further
modification of the ring system has occurred. In the case of Delnudine this has
involved the contraction of ring C.
(h) A group of dimeric atisines, exemplified by Staphisine.
(i) The third major group of alkaloids is based on the C20 veatchine skeleton,
as in Cuauchichicine.
(j) 7,20-Cycloveatchine bases, e.g. Lucidusculine.
(k) 14,20-Cycloveatchine bases, e.g. Anopterine.
(l) Miscellaneous bases.
198
Benn, M.H. and Jacyno, J.M. (1983) in Alkaloids: Chemical and Biological
Perspectives, (ed. S.W. Pelletier), Vol. 1, Wiley-Interscience.
Pelletier, S.W. and Page, S.W. (1976) in MTP Series 2, Vol. 9, Alkaloids, (ed. K.
Wiesner), Butterworths, London.
Pelletier, S.W., Mody, N.V., Joshi, B.S. and Schramm, L.C. (1984) in Pelletier, Vol. 2.
Wiesner, K. (1985) Tetrahedron, 41, 497.
Olivoretin group
These, such as Olivoretin D (Teleocidin B), are metabolites of
Streptoverticillium olivoreticuli, and show pronounced vesicant activity.
Teleocidin A1 is clearly terpenoid in origin, and so presumably are the other
teleocidins.
199
O
23
23
O 10
21 H 21 8
7 12
17 7 8 10 17 H
2 25
H 3 1 3 2
N 13
4 N
14
16 16
6 6
15 15
O
2
O O
6
OH NH
Steroidal alkaloids
This very large group may be divided into nine subgroups. For further
information on steroid structure and biosynthesis, see the Steroid section above.
A
3
Conanine, 9CI
200
Steroidal alkaloids (spirosolane and solanidine type) (VX6660,
VX6680, VX6720, VX6740)
Alkaloids in which a cholestane side-chain has been converted into:
(a) a piperidine ring, to give the secosolanidane skeleton;
(b) a bicyclic system containing a piperidine and a tetrahydrofuran ring to
give the spirosolane skeleton;
H N H
O H N
H H
H
N
H
H
HO
Procevine
N
H H H
H E H N
H
H H H
H H
H H
H Veratraman, 9CI
Cevane, 9CI
(g) the veratraman group, in which ring E of Cevane has been opened.
201
All the alkaloids have a nitrogen function at C-3 and/or C-20, which may be
unmethylated, partially methylated, or fully methylated.
The suffix letters used in the nomenclature of this group indicate the degree
of methylation of the nitrogen atoms:
Substitution at N-3 Substitution at N-20
——————————— ———————————
R1 R2 R3 R4
A Me Me Me Me
B Me Me Me H
C H Me Me Me
D H Me Me H
E Me Me H H
F H H Me Me
G Me H H H
H H H H Me
I H H H H
K Me Me – –
L – – Me Me
M Me H – –
N – – Me H
O H H – –
P – – H H
In DNP the entries for these alkaloids are organised under the (usually
unknown) unsubstituted parents of the I, O, or P type.
202
while the oxazoles of the Gramineae and Rutaceae arise from the chorismic
acid-phenylalanine pathway.
Spermidine Spermine
203
Condensation of either Spermidine or Spermine with one or two cinnamic acid
units, or with an unbranched carbon chain, gives the skeleton of these alkaloids.
Aside from these aliphatic amines, therefore, phenylalanine or tyrosine, and long-
chain fatty acids are involved. The biosynthesis clearly also involves phenol
coupling processes in certain cases, e.g. Codonocarpine.
The spermidine alkaloids can be subdivided into four sub-groups: (a) simple
diamides, e.g. Maytenine; (b) medium-ring compounds involving one cinnamic
acid unit in the ring, e.g. Celabenzine; (c) medium-ring compounds involving
two cinnamic acid units in the ring, e.g. Codonocarpine, Lunaridine; (d)
medium ring compounds containing a C10 to C16 unbranched carbon chain in
the ring, e.g. Cannabisativine; (e) derivatives of spermidine lengthened by one
methylene group. This small group of amides of Solamine, so-called
homospermidine alkaloids, was isolated from plants of the family Solanaceae.
Solapalmitine is representative of this class.
H3C(CH2)14CON (CH2CH2CH2CH2NMe2)2
Solapalmitine
Ph O H
N
HN
N
COPh
Celabenzine
The spermine-derived alkaloids may be divided into three sub-groups: (a) those
in which the two terminal putrescine chains form an eight-membered ring with a
cinnamic or a C8 or C10 unbranched carbon chain, e.g. Homaline; (b)
Pithecolobine in which the one large ring involves a C12 chain; (c) two alkaloids
in which two medium rings are formed with two cinnamic acid units.
O O
NH
HN HN
O
NMe2
Frangulanine
204
Amanita alkaloids (VX7120)
The toxins of the European death cap mushroom Amanita phalloides and other
A. spp. constitute an even more complex group of macrocyclic peptides, mostly
containing sulphur. These include the amatoxins (e.g. α-Amanitin, β-
Amanitin), the phallotoxins (e.g. Phalloidin) and the virotoxins (e.g. Viroidin).
The fly agaric (A. muscaria) also contains the low molecular weight compound
Muscarine.
Purines (VX7300)
Purines are involved along with pyrimidines as bases in DNA and RNA. These
and other purines may be divided into;
(a) the ubiquitous, well-known oxypurines, exemplified by Caffeine;
(b) derivatives of adenine, e.g. the plant hormone, Zeatin;
(c) miscellaneous.
O
Me
N
MeN
O N N
Me
Caffeine
205
Polypyrroles (VY)
The polypyrroles (tetrapyrroles) are a numerically limited class of natural
products that are mostly strictly functional. The main examples are haems,
chlorophylls, bilins and Vitamin B12. All types of organism use tetrapyrroles of
one or more of these classes and all the functional tetrapyrroles derive from one
common tetrapyrrolic intermediate, Uroporphyrinogen III (Uro’gen III).
Uro’gen III is derived entirely from eight molecules of 5-Aminolaevulinic acid
(ALA) by the action of three enzymes, via Porphobilinogen (PBG) and
Hydroxymethylbilane (HMB) as intermediates. A particularly important feature
in Uro’gen III is the fact that ring D has been inverted and so the acetate and
propionate side-chains are not in the same order as on the other three pyrrolic
rings, A to C. This feature can be found in virtually all naturally occurring
tetrapyrroles. Some organisms, however, have a low activity of the enzyme
uro’gen III synthase (as occurs in the human disease, congenital erythropoietic
porphyria). In these cases non-enzymic cyclisation of HMB occurs to give
Uro’gen I, which has the regular alternating pattern of the acetate and
propionate side-chains, and a number of derived type I porphyrins can be
isolated from these organisms.
The main system of nomenclature used in DNP is that recommended by the
IUPAC-IUB Joint Commission on Biochemical Nomenclature. For the cyclic
tetrapyrroles this is based on the porphyrin with the carbon atoms numbered 1
to 20 and the nitrogen atoms numbered 21 to 24. This has superseded the older
‘Fischer’ numbering which numbered only the eight β-positions of the five-
membered pyrrole rings and labelled the four bridging meso-carbons α, β, γ
and δ.
3 α
2 4 5 6 7
1
2 3
21 22 8 4
1 N N 9 N N
20
H 10 δ
H β
19 H 11 H
24N N N N
18 23 12 8 5
16 14
17 15 13 7 γ 6
HOOC
COOH
COOH HOOC 5 COOH
COOH 4
HOOC
A B
1 N N
20
H H
10
H H
O N N N
NH2 NH2 H D C
14
ALA PBG HOOC 15 COOH
HOOC COOH
Uro'gen III
206
HOOC
COOH
HOOC COOH
A B
N N
H H
HO
H H
N N
D C
HOOC COOH
HOOC
COOH
HMB
2 3 4 78 12 13 17 18
1 5
6 9 10 11 14 15 16 24 19
21 22 23
N N N N
H
Bilin; IUPAC-IUB numbering
R2 R3
R1 R4
N N
H
H
N N
R8 R5
R7 R6
Type R1 R2 R3 R4 R5 R6 R7 R8
I A P A P A P A P
II A P P A A P P A
III A P A P A P P A
IV A P P A P A A P
The substitution patterns for uroporphyrins I to IV (A = CH2COOH, P = CH2CH2COOH)
207
Type R1 R2 R3 R4 R5 R6 R7 R8
I Me V Me V Me P Me P
II Me P Me V Me P Me V
III Me V V Me Me P P Me
IV Me P V Me Me V P Me
V Me P V Me Me P V Me
VI Me P Me P Me V V Me
VII Me P Me V Me P V Me
VIII Me V Me P Me P V Me
IX Me V Me V Me P P Me
X Me V Me P Me V P Me
XI Me P Me V Me V P Me
XII Me V V Me P Me Me P
XIII V Me Me V Me P P Me
XIV Me P V Me V Me Me P
XV Me P V Me P Me Me V
The substitution patterns for protoporphyrins I to XV (V = CH=CH2, P = CH2CH2COOH)
208
In plants the same oxidative cleavage of haem leads to the photoresponsive
pigment Phytochromobilin and, in algae, to the light-harvesting pigments such
as Phycocyanobilin. Both of these are found in vivo covalently attached to
proteins by thioether links.
209
isolation procedure introduces a cyanide ligand (Cyanocobalamin) but water,
hydroxide, or nitrite are other possibilities. The two biologically active forms of
Vitamin B12 both have Co-C bonds: Coenzyme B12 has an axial adenosyl group
and is the cofactor for a number of enzymic rearrangement reactions and
Methylcobalamin has a methyl as the axial group and is involved in enzymic
methyl-transfer reactions.
A number of close relatives of vitamin B12 have been found in various
anaerobic bacteria. These have the dimethylbenzimidazoyl group of the
nucleotide loop replaced by other substituted benzimidazoyl groups or by
purines or even by simple phenoxy groups. The latter type of group cannot
provide the lower ligand to the cobalt ion.
Geoporphyrins (VY0940)
A wide range of tetrapyrrole derivatives have been found in sedimentary
deposits derived from organic matter, such as crude oil, oil shales and lignite.
These porphyrins have undergone various degradative reactions: generally the
macrocycle is at the stable porphyrin oxidation level but the side chains are
fully reduced and very often decarboxylated; they very often occur as nickel or
vanadyl complexes. Thus one of the most common geoporphyrins is
Deoxophylloerythroetioporporphyrin (DPEP), which could well be derived
from degradation of Chlorophyll a. The arrangement of the substituents on the
geoporphyrins gives a clue to their origin and thus the type of organic material
which gave rise to the deposit. For example, geoporphyrins that lack a
substituent at C-7 are assumed to derive from Chlorophyll b or related
compounds, whereas those in which a cyclisation onto the C-17 side-chain has
occurred (such as examples with a methylated five-membered ring) may derive
from the Chlorophyll c family. Other geoporphyrins have been isolated which
have the additional carbon atoms on the C-8 and C-12 side-chains characteristic
of Bacteriochlorophylls c, d and e. The types of reaction undergone by the
porphyrins also give useful information about the conditions that they have
experienced over the history of the deposit.
Blanche F. et al. (1995) Angew. Chem., Int. Ed. Engl., 34, 383 (biosynth).
Dolphin, D. (ed.) (1978–79) The Porphyrins, Vols 1–7, Academic Press, New York.
Dolphin, D. (ed.) (1982) B12, Vols 1–2, Wiley, New York.
Jordan, P.M. (ed.) (1991) Biosynthesis of Tetrapyrroles, Elsevier, Amsterdam.
Leeper, F.J. (1985) Nat. Prod. Rep., 2, 19, 561; (1987) 4, 441; (1989) 6, 171 (biosynth).
Scheer, H. (ed.) (1991) Chlorophylls, CRC Press, Boca Raton.
Smith, K.M. (ed.) (1975) Porphyrins and Metalloporphyrins, Elsevier, Amsterdam.
Zagalak, B. and Friedrich, W. (eds) (1979) Vitamin B12, de Gruyter, Berlin.
The Biosynthesis of the Tetrapyrrole Pigments (1994) Ciba Foundation Symposium ,
180, Wiley, Chichester.
210