Академический Документы
Профессиональный Документы
Культура Документы
Executive Summary
Synthetic Biology summarizes efforts directed at the synthesis of complex biological systems to
obtain useful novel phenotypes based on the exploitation of well-characterized, orthogonal, and re-
utilizable building blocks. It aims at recruiting design structures well established in classic
engineering disciplines, such as a hierarchy of abstractions, system boundaries, standardized
interfaces and protocols, and separation of manufacturing and design, for biotechnology. Synthetic
Biology generally entertains the notion that successful design of a biological system from scratch – as
opposed to the current practice of adapting a potentially poorly understood system over and over
again towards a specific goal – is the ultimate proof of fundamental understanding and concomitantly
the most powerful way to advance biotechnology to the level required to deal with today’s challenging
problems.
First achievements include the design and implementation of synthetic genetic circuits, the
design of novel biochemical pathways, and the de novo synthesis of a bacterial genome. However, it is
clearly the ultimate aspiration of the field to extend the mastery of biological engineering to systems
complex enough to deal with problems such as the design of novel therapeutic treatments, the
production of liquid transportation fuels, or the efficient manufacturing of biopharmaceuticals and
fine, bulk, or fuel chemicals.
Synthetic Biology is a field in its infancy, but as it desires to adopt structures that have
resulted extremely successful in industrial history, the potential benefits of the field are enormous.
The success of Synthetic Biology will depend on a number of challenging scientific and technical
questions, but two points stand out: [i] Synthetic Biology needs to prove that the concept of
orthogonal parts and subsystems is viable beyond the few domains for which it has been established
at the moment (in particular the field of genetic circuits and the “Registry of Standard Biological
Parts” at the MIT); and [ii] Synthetic Biology is in need of a technological infrastructure that a)
provides access to cheap but accurate non template-driven (de novo) DNA synthesis that can cope
with the requirements of system-level engineering and b) allows installing and maintaining an
“Institute of Biological Standards” that provides the base for implementing the Synthetic Biology
principles across the biotechnological community.
-2-
Synthetic Biology – Enginering in Biotechnology
Contents Page
Executive summary 2
1. Synthetic Biology in context 4
3. Summary 55
4. Acknowledgements 56
5. Useful websites 56
6. Abbreviations 58
7. References 59
-3-
Synthetic Biology – Enginering in Biotechnology
Since the advent of genetic engineering with the production of recombinant somatostatin and
insulin by Genentech in the late 1970s, biotechnology has tried to exploit cells and cellular
components rationally to achieve ever more complex tasks, primarily in the areas of health and
chemistry, and more recently also in the energy sector. The manufacturing of biopharmaceuticals has
advanced from simple peptide hormones (the disulfide-bridges of which had to be introduced
afterwards) to the manufacturing of monoclonal antibodies complete with specific glycosylation
patterns. The manufacturing of small pharmaceutical molecules has advanced from an increase in
overproduction of natural products (such as the penicillins) to the assembly of entire novel pathways
(e.g. for artemisinic acid production [1]), and so has the manufacturing of chemicals (e.g. from
overproducing intermediates of the citric acid cycle such as citrate to compounds alien to most
microbial metabolisms such as 1,3-propanediol and indigo [2, 3]).
The same can be said for other sectors that apply the tools of molecular biology, such as gene
therapy [4], tissue engineering [5, 6], diagnostics [7], and many others.
However, despite the numerous successes of the biotechnology industry, major challenges
remain. The chemical industry might need to replace a substantial part of their raw material base for
a substantial part of its products – can biotechnology come up reliably with robust solutions for novel
production routes in a suitable timeframe and with reasonable development costs? The same goes for
our energy industry – can biotechnology play a significant part in the de-carbonization of our energy
systems, preferably without provoking major opposition on environmental or ethical issues? The list
of questions could easily be extended for example into the field of gene therapy or any other field in
which biotechnology plays already or could play in the future a major role. In fact, against these
challenges, the successes of biotechnology appear only as something as the first steps of an industry
that has managed to mount a couple of convincing cases but needs to do much more to adopt the
potentially overwhelming role it might theoretically be well placed to assume.
The reason for this lies of course in the complexity of biological systems, which makes it
difficult to reliably engineer them and essentially converts every industrial development project into a
research project that needs to cope with unexpected fundamental hurdles or completely new insights
into the biological system. Still, the challenges we face will need to be addressed to a large extent in
these complex biological systems – cells. Cells represent a multitude of different organic molecules
with a multitude of interactions, most of them highly non-linear, and thus prone to emergent
properties. Of course, advances in Systems Biology allow us to delineate an ever larger number of
these interactions in considerable detail (see below), but it is sobering to realize that even the
dynamics of arguably the best understood single pathway in metabolism, glycolysis, in one of the best
studied model organisms, Saccharomyces cerevisiae, can still not be satisfactorily explained in terms
of its constituting members [8]. Overexpressing one recombinant gene in Escherichia coli triggers
-4-
Synthetic Biology – Enginering in Biotechnology
hundreds of other genes to change their transcriptional status, and only for some of them it is clear
what might motivate the cell to do so [9]. Clearly, our understanding of biological systems is still far
from complete.
This prompts two questions: [i] how is it possible that against such a complex background it is
possible at all to successfully introduce complex artificial functions (such as assembling a novel
pathway for the production of a complex small molecule) and [ii] are we using the right concepts to
deal with complexity in biotechnology?
For the first question, I guess it is safe to argue that the successes in recombinant
biotechnology are primarily the result of a non-straightforward process in which a basic idea was put
to the test and modified and expanded for so long until every (unexpected) difficulty along the way
was taken care off to such an extent that the resulting process was still economical (I restrict the
argument to biotechnological manufacturing processes here, but it is easy to make the equivalent
argument for other areas as well). Or to put it in different words, I guess it is safe to argue that
today’s successfully operating fermentation processes employing cells that had to undergo
substantial genetic engineering are not the result of a targeted planning effort of a group of people
that had a clear idea of the problems they were facing and in addition a clear idea of the tools that
they had at their disposal to work around such problems in a reasonable amount of time. Or, to
paraphrase the key point once more, one could argue that the implication of essentially any novel
biotechnological process today primarily remains a research project, as pointed out above. The
existing examples illustrate that it is possible to be successful with such research projects, but the
effectively small number of examples (relative to the number of, e.g. novel chemical processes that
have been introduced over the same time) suggests also that such research endeavors remain difficult
and costly.
For the second question, it might be helpful to compare the current “design processes” in
biotechnology with design processes in more mature fields, such as classical engineering disciplines
(e.g. mechanical engineering or electrical engineering) in which the success rate of design efforts is by
orders of magnitude better than in biotechnology, and then to try to understand whether there is a
chance that we could recruit some of the key practices to biotechnology in the future.
This comparison has been at the heart of the emerging field of Synthetic Biology, and
consequently it has been treated in a number of very instructive papers over the last years [10-18]. In
summary, five points have been identified that are crucial in engineering but by and large absent
from biotechnology:
-5-
Synthetic Biology – Enginering in Biotechnology
-6-
Synthetic Biology – Enginering in Biotechnology
duplication of its information store. Introducing and expressing a single recombinant gene into E. coli
changes the expression patterns of hundreds of genes [9], indicating the degree of connection
between various cellular functions. On the other hand, nature provides ample examples of
orthogonality, for example in eukaryotic cells that have available organelles to carry our specialized
tasks or in higher organisms that have organs dedicated to specific functions. However, it is probably
safe to say that consciously engineering orthogonality into biological systems to help the design
process has so far played no role in biotechnology.
The third point, the hierarchy of abstraction, reflects the assembly of complex non-biological
systems from orthogonal sub-systems. If it is possible to separate the overall system into meaningful
subsystems, which in turn might be once more separated into meaningful subsystems, and so on,
then the design task can be distributed over several levels of detail at the same time (provided that
the transitions from one level to the next/one sub-system to the next are defined in advance). This
has two major implications – the development time will be much reduced by parallel advances and
every level of system development can be addressed by individuals who are specialists for this specific
level of detail, so the overall quality of the design improves. To adopt an example from the design of
integrated electric circuits, diodes, transistors, resistors, and capacitors are required to fabricate
electronic logic gates (AND, OR, NOT, NAND, NOR), which in turn are the elements to make
processors (e.g. for a printer). The term “transistor” describes a function (controlling a large electrical
current/voltage with a small electrical current/voltage), but it also hides detailed information which
is in most cases not relevant for the person that uses the transistor to build an AND gate. The AND
gate in turn describes a function (an output signal is obtained if two input signals are received
concomitantly) and can be implemented in various forms composed of various circuits elements,
which influence its performance, but the details of how this was achieved are not relevant for the
designer of the processor. In other words, at every level of the design process, detailed information is
hidden under abstract descriptions. The higher the level, the more information is hidden.
Again in contrast, the description of biological systems is usually strongly focused on the
molecular level, and formalized, abstract or functionalized descriptions in the sense discussed above
are rare. A frequently used analogy here is that the equivalent to programming cells today (writing
down a specific recombinant DNA sequence) is programming a computer in binary code instead of
using one or the other very powerful specialized user-friendly software. The advent of Systems Biology
is about to establish the idea of functional modules in the cell [21], but clearly we are nowhere near
the sophistication with which we are familiar in classical engineering disciplines when it comes to
structuring a biological engineering approach. Even worse, in essentially all instances our adoption of
functional terms (e.g. “promoter”) reflects a qualitative understanding of a higher-level function, but it
hides the fact that detailed quantitative information on the specifics of this function are not or only
partly available, should we be interested in the details of the next level of the hierarchy (which
proteins bind to regulate activity – how strong is the binding- what is the importance of the
-7-
Synthetic Biology – Enginering in Biotechnology
nucleotides between the -35 and the -10 region – what is the promoter clearance rate under which
circumstances?)
As already mentioned, the hierarchy of abstraction requires that the transitions between the
different levels are properly specified at the beginning, so that the various levels can integrate
seamlessly in the end. To continue the example from above, to be able to assemble a processor from
the various logic gates, it will be helpful if the gates are cascadable – the electronic currents leaving
one gate have to be in the proper range to be recognized by the next gate, and the two gates need to
be physically connected to each other, for which it is helpful if the number and size of output pins of
the first gate is compatible to the number and size of the receiving sockets in the next gate. In
addition, if these transitions are made according to more general specifications, it should be possible
to integrate (potentially superior) parts from other manufacturers. In other words, the transitions
should be standardized. In fact, standardization needs to encompass many more elements of the
design process: For example, measurement protocols need to be standardized so that practices
become comparable and can be made available to everybody involved in the design process. The same
is true for the documentation of system components and the storage of this information in databases.
Only sufficient compliance with these standards ensures that a designed element of the system has a
high chance of re-utilization.
Biotechnology is characterized by the absence of standards. As orthogonality in cells is at best
an emerging concept, no clear ideas on how to standardize information exchange between such
elements exist. In terms of standardizing experimental protocols, it is only now that with the advent
of Systems Biology certain standards on data quality and measurement techniques start to produce
an impact [22] in the face of the need to handle ever larger sets of experimental data delivered by the
experimental “-omics” technologies. But the impact of the lack of standards in experimental
techniques goes much further: a simple discussion about the relative strength of two different
promoters between two scientists from two different labs will quickly reveal that the discussion is
pointless, because in the corresponding experiments they have used different E. coli strains on
different media after having placed the promoter on replicons with different copy numbers per cell to
drive expression of two different reporter genes, of which one encoded a protein tagged for accelerated
turnover. In summary, it is very difficult to answer basic questions that can be expected to be
important for designing biological systems, and even if comparable experiments exist, it almost
always requires an extensive search through the available scientific literature to locate the
information.
The fifth key difference between current engineering practice and “biological engineering” that
was identified is the separation of design and manufacturing. Put differently, the design of a car is
done by a different group of people than the car’s assembly at the assembly line, and the assembly is
actually requires comparatively little effort. The different groups of people also have different
-8-
Synthetic Biology – Enginering in Biotechnology
qualifications and have received different training – they are specialists, and can afford to be because
the design was so detailed that the manufacturing can be confidently expected to give no problems.
On the other hand, the major part of the time an average PhD student in molecular
biotechnology spends at the bench, is still spent on manipulating DNA – isolating a gene from the
genomic DNA of a specific strain, pruning it of specific restriction sites inside the gene, adapting the
ends to cloning in a specific expression vector, adapting codon usage, playing with RNA and protein
half-life, etc. In other words, the manufacturing of the system is still a major part of the research
project, and in fact in many cases it is still a research project on its own.
Against this background, Synthetic Biology, in the definition adopted for this report, aims at
providing the conceptual and technological fundamentals to implement orthogonality, a hierarchy of
abstraction, standardization, and separation of design and manufacturing, into biotechnology. In other
words, it aims at providing the prerequisites of operating biotechnology as a “true” engineering
discipline. This implies aiming [i] at making biological systems amenable to large-scale rational
manipulations and [ii] at providing the technological infrastructure to implement such a large scale
changes. In this effort, Synthetic Biology depends on a substantial increase of the knowledge base in
biotechnology, which is expected from Systems Biology.
Of the points discussed above, six main challenges arise – two scientific, two technological, one
organizational, and one societal challenge.
One key scientific challenge of Synthetic Biology will be to implement the concept of
orthogonality in complex biological systems. Rational, forward-oriented engineering on a considerable
scale for example in cells will only be possible if we can limit the impact of our modifications on the
cellular background into which we want to implement our instructions. Failing here, success in large-
scale cellular manipulations will necessarily remain a matter of trial and error, because it appears
presumptuous to assume that we will understand cellular complexity to such an extent that we can
comprehensively integrate it into the design process.
In addition, orthogonality will be a fundamental prerequisite for the application of the hierarchy
of abstraction. The latter depends on our ability to define meaningful sub-systems that can be
worked on and later combined. This makes only sense if the number of interactions between the
different subsystems is limited, ideally to the few interactions that are desired and eventually
standardized.
-9-
Synthetic Biology – Enginering in Biotechnology
The second key scientific challenge will be how to deal with evolution. Though this question has
so far hardly played a role in the Synthetic Biology discussion, it is intrinsically connected to the
long-term vision of Synthetic Biology: because evolution is a permanent source of change in living
systems, it will always represent an obstacle when it comes to preserving the integrity of artificial
designs in the long run. In the short term, the problem might not be particularly dramatic – after all,
we have learned how to deal with unstable but highly efficient production strains over the
approximately 70 generations it takes to fill the large scale reactor starting from a single cell from the
cell bank. However, if the ideas of standardization, abstraction, and orthogonality can be effectively
implemented, it is easy to foresee that strains undergo various generations of modifications, and it
will be important to be sure that the strain that receives the final modification is still the one that
received the first generation of modifications.
One technological challenge of Synthetic Biology is clearly to adapt our current protocols for
strain manipulation to the drastic change in synthetic scope that will be required. The ambition to
use ever more complex catalysts for biotechnology will inevitably lead to the requirement to realize
ever more voluminous synthetic tasks: on the one hand, our ambition to adapt cellular systems to
concepts such as orthogonality will require large-scale adaptations of existing cells, which is
currently only possible by working through complex lengthy laboratory protocols. On the other hand,
the DNA-encoding of ever larger sets of instructions will require the synthesis of ever longer DNA
fragments. At the same time, we will need to adapt our data acquisition technologies to the
requirements of an engineering discipline with respect to accuracy, reproducibility, and amount of
labor to invest per measurement. It should be clear that it will be impossible to simply linearly scale
up our current efforts in the construction of strains for biotechnology if the vision is that at least
dozens of gene adaptations are required for any given new strain generation. It should be as obvious
that by transforming biotechnology from a research-driven discovery science into an engineering
activity, the reliable determination of fundamental data under pre-defined circumstances will become
much more important and has to become much more routine, and we will require the corresponding
technology to accomplish this.
A second technological challenge is to make available the tools of the trade – the “parts” and
“devices” from which complex systems can be assembled rationally and the strains and cell lines into
which these assemblies can be implemented. This is of course intimately connected to solving the
problem of orthogonality. An additional aspect here is to provide suitable modes of assembly – it
would be helpful if we could develop parts in such a way that they can be easily combined - in fact
behave modular.
Organizationally, much of the impact of Synthetic Biology will depend on whether it is possible
to gather a critical mass of scientists to adopt engineering-inspired standards and standard operating
procedures, for example for the measurement of variables of central importance in Synthetic Biology
(e.g., promoter clearance rate of recombinant promoters) and its suitable subsequent documentation
- 10 -
Synthetic Biology – Enginering in Biotechnology
for example in a centralized data repository. Beyond the experimental arena, this request for a critical
mass needs to be extended to the adoption of certain strategies to deal with questions of ownership –
it will be difficult to motivate a lively use of central repositories, for example, if the user has to go
through several layers of patent research before using.
Finally, Synthetic Biology will inevitably touch on questions regarding its ethical boundaries
and aspects of biosafety (the safety of scientists and the public from the application of Synthetic
Biology in the laboratory) and biosecurity (the security of people against military or terroristic
exploitation of the technology). The notion of making life “engineer-able at the discretion of the
engineer” will undoubtedly trigger the need for communicating at least the broader aims of this
scientific community. Efforts to define a “minimal-life chassis” (see below) will most probably trigger
ethical concerns about who has the authority to make such definitions. Certain aspects of the debate
on the safety of genetic engineering from the 1980s and 90s will be re-visited once large-scale
manipulation of living systems becomes a reality and most probably there will be a need to develop
strategies on how to use the novel technologies of Synthetic Biology to increase biosafety. And finally
the potential for military abuse of specific technological developments needs to be assessed.
In the following, I will discuss work that has been addressing the challenges for Synthetic
Biology mentioned above. It is important to point out that much of the work that is described here
was not carried out under the label of Synthetic Biology. Rather – next to some of the “hallmark-
experiments” of Synthetic Biology - I have collected work that indicates that the ambitious goals of
Synthetic Biology might indeed be reached at some point in the future.
2.1.1. Integration of unnatural amino acids into proteins (Schultz-lab, Scripps, San Diego)
Protein synthesis at the ribosome is usually limited to a set of twenty amino acids, the
proteinogenic amino acids. Even though the cell has developed a variety of strategies to extend the
diversity of protein composition post-translationally (e.g. by glycosylation, phosphorylation, or
- 11 -
Synthetic Biology – Enginering in Biotechnology
chemical modification of selected amino acids), there is considerable interest in extending the canon
of proteinogenic amino acids beyond twenty, for example to overcome limitations in structural
analysis of proteins or provide precise spots for posttranslational modifications to facilitate any
chemical manipulation of biologically produced proteins [23].
The specificity in protein synthesis is maintained at various points throughout the process:
First, an amino acid is attached by a synthetase to a corresponding empty tRNA. Each amino acid is
recognized by a specific synthetase (interface 1) and the resulting aa-enzyme (aa for aminoacyl)
complex discharges its amino acid onto a specific (set of) tRNAs (interface 2). Finally, the anticodon of
the charged tRNA interacts with the mRNA at the ribosome (interface 3). Consequently, to introduce a
novel, unnatural amino acid into cellular protein synthesis, there are at least the interactions at three
different interfaces to consider. Each of these interfaces requires consideration of two times two
specific questions:
Interface 1:
a1) The unnatural amino acid must be recognized by one empty synthetase …
a2) … but it must not fit into any other synthetase.
b1) The synthetase must recognize the new amino acid, but….
b2) … it must nor recognize any other amino acid.
Interface 2:
c1) The empty tRNA must be recognized and charged by the charged aa- synthetase complex….
c2) … but it must not be recognized nor charged by any other charged aa-synthetase complex.
d1) The charged aa-synthetase complex must recognize and charge the empty tRNA….
d2) … but it must not recognize nor charge any other empty tRNA.
Interface 3:
e1) The codon on the mRNA must recognize the anticodon on the charged aa-tRNA…
e2) … but it must not recognize any other aa-tRNA.
f1) The charged aa-tRNA must recognize the codon on the mRNA…
f2) … but it must nor recognize any other codon.
Clearly, not all 12 requirements really need to be fulfilled for a system to work efficient – even in
the wild-type system a tRNA can recognize several codons in some cases, all of which code for the
same amino acid (“wobble”), and some requirements are equivalent (a1/b1, c1/d1, e1/f1). But for the
introduction of an unnatural amino acid into the process of protein synthesis, ideally there should be
a completely orthogonal system where each of the three interfaces is specific enough to ensure
meeting all of the 9 unique requirements. As it becomes clear from the structure of each pair of
requirements, each requirement in a pair can be enforced by a combination of positive (“must
recognize”) and negative selection (“must not recognize”). Consequently, by undergoing at most 6
experimental pairs of positive and negative selection, it should be possible to select a system that can
- 12 -
Synthetic Biology – Enginering in Biotechnology
integrate an unnatural amino acid without interfering with the remaining protein synthesis
machinery.
This strategy has been realized by the group of P. Schultz in San Diego [23]. They used a
tyrosine-transferring tRNA from Methanococcus jannaschii and changed the anticodon such that it
would recognize the amber STOP codon UAG on an E. coli mRNA (interface 3). Next, they established
two selection strategies:
To engineer interface 2, they put a library of the engineered tRNA into an E. coli strain
harboring a gene encoding a toxic gene product (barnase). The gene contained two amber STOP
codons. Expressing a library of variants of the engineered tRNA in this strain eliminated all those
tRNA variants that could be charged by any of the E. coli synthetases (the charged tRNA variants
would suppress the amber STOP-codons in the barnase gene, leading to cell death – negative
selection). In a second step, those library members in strains that survived step 1 (= could not be
charged by the E. coli machinery), were recovered and transferred into an E. coli strain that contained
the recombinant Tyr-tRNA-synthetase from M. jannaschii together with an -lactamase resistance
gene inactivated by another amber STOP codon. Expressing the variants of the tRNA library in the
presence of ampicillin allowed only those strains to survive that harbored variants that could be
charged by the recombinant synthetase, because only those were able to suppress the STOP codon in
the resistance gene (positive selection).
In a next step, the amino acid-specificity of the recombinant synthetase was engineered
(interface 1), again by a combination of negative and positive selection. A library of synthetase-
variants was transferred into an E. coli strain that contained an antibiotic resistance gene with an
amber mutation. The library was then grown in the presence of the antibiotic to select for those
variants that could suppress the mutation, presumably because a synthetase variant was able to
charge the cognate tRNA, either with the unnatural amino acid from the medium or with one of the
natural amino acids. To eliminate variants that suppressed with natural amino acids, the survivors of
the positive selection were transferred into a strain with a mutated barnase gene (see above).
Expressing this library in the absence of the unnatural amino acid eliminated all synthetase variants
that charged natural amino acids onto the cognate tRNA.
With this experimental system, requirements a1, b1/b2, c1/c2, d1, e1, and f1 could be covered.
Assuming further that requirement f2 is guaranteed by the selection of a “wobble”-free tRNA,
insertion of unnatural amino acids could be engineered to meet 6 or the 9 unique requirements with
this system. In this specific case, requirement e2 is also covered in a manner of speaking – a
suppressed STOP codon is by definition only recognized by the suppressor tRNA. However, the STOP
codon appears in a variety of other genes as well, where it actually should preferably maintain its old
function, so the system allows not to properly select against the “natural” function of the amber
STOP-codon.
- 13 -
Synthetic Biology – Enginering in Biotechnology
Nevertheless, the system has been used very successfully to engineer the insertion of unnatural
acids into proteins [23]. This gives a clear indication that the power of directed evolution can be
recruited to help the development of orthogonal systems once it is possible to design the proper
selection systems.
- 14 -
Synthetic Biology – Enginering in Biotechnology
- 15 -
Synthetic Biology – Enginering in Biotechnology
possible, considerable advance has been made in designing orthogonal ZFP domains for 48 of the 64
possible triplets by rational and phage display methods and up to 6 of these ZFP domains have been
combined to produce selective binders with a recognition sequence of 18 bp [36].
However, pushing orthogonality to such extremes also revealed current limitations: for example,
while a designed binder did bind to the designed sequence, they also bound with less affinity to
similar sequences [37]. Such limitations not withstanding, ZFPs have found widespread use [32], in
particular in the area of genome editing, where the ZFP domains are coupled to the DNA restricting
domain of specific restriction endonucleases and thus can be used to produce targeted DNA double
strand cuts. These cuts trigger in turn the exchange of chromosomal genes for externally provided
genes, an attractive proposition for gene therapy [38].
Next to the ZFPs, the frequent orthogonality between protein domains has also been exploited
in a number of alternative systems, preferably to re-program signaling pathways in cells [39, 40].
- 16 -
Synthetic Biology – Enginering in Biotechnology
As discussed, a variety of concepts exist how to realize orthogonality in biological system, and
from my point of view it is likely that some of these strategies will be successful in the end – in
particular, when orthogonality becomes selectable, as suggested in the example of in vivo
introduction of unnatural amino acids. This particular experimental system has also been shown to
be rather robust, as many different unnatural amino acids have been inserted into proteins following
this strategy [23]. But even here it is clear that the system is not fully orthogonal – most importantly,
it would be necessary to prune the genome of the exploited cells of all other uses of the codon that is
used to encode the novel amino acid. It is also clear that many interactions might simply remain
undetected, because we do not bother to look or simply would not know how to look. These hidden
interactions might or might not turn out to be important in the long run, when an existing orthogonal
system is made part of more complex design. After all, it is completely unclear to what degree of
completeness we have to insist on orthogonality for the various schemes of cellular reorganization. In
summary, the field is only at its beginning, but at an important beginning. In the words of Sismour
and Benner: “Ultimately, Synthetic Biology succeeds or fails as an engineering discipline depending
on where the independence approximations become useful in the continuum between the atomic and
macroscopic worlds.” [12]
- 17 -
Synthetic Biology – Enginering in Biotechnology
then provide means to rapidly return to the previous state (essentially, DNA sequencing and DNA
synthesis – see below in the technical sections).
Available Synthetic Biology prototypes are so small that they do not drastically interfere with
the fitness of the cell, and none of the applications has been so rewarding that long-term experiments
regarding phenotype-stability were performed. Only one experiment has addressed this problem to
some extent, however in an exceptional experimental context: a population-density regulation was
shown to be stably maintained in a chemostat over appr. 50 generations, but the reactor volume (and
therefore the population pool to generate critical mutations) was only 16 nL (roughly 6 orders of
magnitude less than typical small-scale chemostat experiments), which makes comparison to
traditional data difficult [45].
However, as will be discussed below, the ambition of synthetic biologists regarding the scope of
their synthetic systems is rapidly changing – synthetic genomes are becoming available [46, 47], and
it is easy to predict that evolutionary pressure will become an important point on Synthetic Biology’s
agenda.
- 18 -
Synthetic Biology – Enginering in Biotechnology
- 19 -
Synthetic Biology – Enginering in Biotechnology
to be changed from the plant A. annua to that of e.g. the Gram-negative bacterium E. coli. Next, the
gene needs to be integrated into the regulatory structure of the new pathway – it might for example
be part of an operon requiring tight transcriptional and translational coupling to the genes in front
and afterwards. Furthermore, we might want to fine-tune the amount of enzyme available relative to
the other pathway-members by influencing e.g. the efficiency of the ribosome binding site, the half-
life of the corresponding section of the mRNA (by introducing specific secondary structures, see
below) or of the protein itself (by adding specific tags to the protein, also see below). Alternatively, the
gene might need to receive its own regulatory structure, including promoter and transcriptional
terminator. Finally, in order to allow the rapid insertion of improved variants of the gene into the
operon, it might be desirable to have the gene flanked by unique restriction sites, while at the same
time internal restriction sites might have to be eliminated.
To go comprehensively through all these modifications, step by step, with the methods
described above is so laborious that there is just no way that this can be done for more than a few
genes in any given project. To be truly able to modify large stretches of DNA and adapt them to our
specific requirements, we need to switch completely to de novo, non template-dependent DNA
synthesis methods (Fig. 1). Only this change can give the power to implement comprehensively
engineered DNA sequences on a significant scale into novel biological systems.
Even though this switch is desirable and absolutely vital for Synthetic Biology, it is not easy. In
order to design DNA sequences de novo, we need to rely on our capabilities to synthesize DNA
chemically – without the requirement for a template. In this process, a single stranded DNA molecule
is built up nucleotide by nucleotide and the nucleotide sequence is only determined by the sequence
of reactions the operator desires (Fig. 2). Such chemically produced stretches of DNA
(oligonucleotides or “oligos”) are typically between 20 and 100 bp long. The technology as such is
established and is an instrumental requirement for example for PCR reactions. But, as already
indicated above, synthetic oligonucleotides are short, and to make up meaningful novel DNA
sequences they need to be assembled into larger and larger molecules. The assembly of these short
oligonucleotides into (ultimately) DNA sequences of genome size is one of the technologies at the
heart of Synthetic Biology.
- 20 -
Synthetic Biology – Enginering in Biotechnology
A B
1. Bioinformatics-supported design of 1’600 oligonucleotides
- 21 -
Synthetic Biology – Enginering in Biotechnology
steps for the required number of times, the oligonucleotide is finally cleaved off the column and
treated with ammonium hydroxide at high temperature to remove all remaining protecting groups.
In order to make this process amenable to miniaturization and to produce many different
nucleotide sequences in a confined space, the deprotection of the 5’OH-group was made sensitive to
light. By producing suitable masks that direct the light only to certain parts of a solid-phase
synthesis array (photolithography), only these parts of the array are prepared for extension in the
next round of adding phosphoramidites. This can be achieved by replacing the acid-labile DMT group
by the photo-labile -methyl-6-nitropiperonyloxycarbonyl (MeNPoc) protective group. This technology
allows the concomitant preparation of several thousand oligonucleotides on one solid support (in
hybridization arrays, up to 1 million features per cm2 are possible).
The photolithography approach has been developed further: In order to eliminate the time-
consuming and expensive utilization of photolithographic masks, a system with “micro-mirrors” has
been developed (Nimblegen, Febit). Here, the light pattern is produced by rapidly adjustable high
contrast video displays. The German company Febit claims that with this procedure, they can
produce up to 500’000 oligonucleotides per day and chip.
A couple of properties are common to all three technologies: One crucial parameter in the oligo-
synthesis is the coupling efficiency, which is a function of the completeness with which deblocking
and chain-extension proceed. If in every step 99% of all started oligonucleotide chains are extended
(see Fig. 2 from B to C), only 60% of all chains contain all nucleotides after 50 steps. This has
considerable implications for the scale at which the synthesis needs to be started. Furthermore, also
the capping does not proceed with total efficiency. Taken together, a significant fraction of the
molecules on the solid phase has a different length from the intended oligo.
Next to deletions, also chemical modifications play a role: for example, the phosphoramidites
that are used for chain extension are not completely pure. Then, oligonucleotide syntheses are prone
to depurination (in particular under the acidic conditions during DMT removal), during which an
adenine or a guanine can be hydrolysed off the sugar-phosphate backbone, leaving a free hydroxyl
group.
These two lines of errors lead to a considerable percentage of wrong oligonucleotides in any
given oligonucleotide mixture coming out of a synthesizer. The exact percentage is difficult to
estimate and is also be a function of the specific supplier that is used and the implemented quality
control criteria. It certainly is a function of the required oligonucleotide length, which should be
obvious from the various error sources mentioned above. The various reports in the literature that
describe the assembly of larger DNA fragments from oligonucleotides typically use oligonucleotides of
around 50 bp [47, 53] as a compromise between the desire to produce long oligonucleotides to
facilitate assembly and short oligonucleotides to minimize the number of errors due to chemical
synthesis.
- 22 -
Synthetic Biology – Enginering in Biotechnology
Next to ongoing efforts to improve the synthetic procedures and thus reduce the frequency of
error introduction, several methods have been developed to identify errors introduced in synthesis
and eliminate the corresponding DNA molecules. They are based on enzymatic and physical methods.
The physical methods focus on the exploitation of size differences and the disturbances in the
hybridization of error-containing complementary oligonucleotides. Polyacrylamide gel electrophoresis
(PAGE) is for example easily sensitive enough to separate oligonucleotides of no more than one
nucleotide difference. Therefore, subjecting oligonucleotides to a PAGE-purification can substantially
reduce the number of erroneous oligonucleotides at the start of the experiment [47]. The same can be
achieved by preparative HPLC, which is a standard technology if high-quality oligonucleotides are
required.
Alternatively, hybridization under stringent conditions can be used to identify mismatches.
Perfect complementarity between two DNA strands leads to a maximum number of possible hydrogen
bonds between the two DNA strands and thus a higher temperature is required to separate the
molecules again (“melting temperature”). This has been applied to reduce the error rate with light-
directed, chip-based oligonucleotide synthesis [54]. A first chip was used to produce the
oligonucleotides for DNA-fragment assembly (“construction oligos”). These oligonucleotides were
eventually released from the first chip and then hybridized under stringent conditions to sets of
complementary oligonucleotides that had been synthesized on a second chip. Ideally, those
oligonucleotides with errors in their sequence should find no identical match in the set of correction
oligonucleotides and be lost in a washing step because of the resulting decreased melting
temperature. Obviously, this procedure requires that the all construction oligonucleotides are
carefully designed before the start to have approximately the same melting temperature. The result
was encouraging: the authors reported 1 error in a sequence of 1’400 bp. Though this is still far too
high for any large scale synthesis approach, it is by a factor of 3 better than standard solid phase
approaches [53].
The enzymatic methods include enzymes that can detect DNA structures that are typical when
erroneous DNA molecules hybridize. For example, endonuclease VII of phage T4 can identify apurinic
sites in DNA and then restricts both strands of the DNA molecule close to lesion site [55]. This leads
to shorter fragments which can again be separated from the correct oligonucleotides (see above).
Alternatively, E. coli’s MutHLS proteins can be used to detect mismatches and insertions/deletions in
double stranded DNA. If such errors are present, fragments are cleaved at GATC sites [56] and the
remaining correct fragments can again be isolated by size selection. This way, the error rate in DNA
fragments produced from chemically synthesized oligonucleotides could be reduced by an order of
magnitude [56].
Even when the error rate in the produced oligonucleotides can be reduced by (a combination of)
the methods mentioned above, the produced larger DNA fragments require sequencing after assembly
- 23 -
Synthetic Biology – Enginering in Biotechnology
[57] to confirm that the desired sequence has been achieved. Of course, this is a very laborious and
time-consuming way of error-correction.
H H H H H H H H
O H H+ O H spont., 99% O H I2 , H2 O O H
-
P OCE P OCE P OCE O P O
N N O Base O Base
N
O O
N N H H H H
H H H H
O H O H
R R
Fig. 2: Phosphporamidite-procedure for DNA synthesis in 3’- to 5’-direction: Commercially available
nucleoside-3’-phosphoramidites (A) are added to a growing oligonucleotide chain and exposed to weak
acid. This leads immediately to the formation of the reactive intermediate B. Within 1 min, B has formed
a new phosphite linkage with the oligonucleotide chain attached to the solid support (C). C can be
oxidized with iodine/water to D. CE: 2-Cyanoethyl; DMT: Dimethoxytrityl. Amine groups of bases are
also protected (e.g. benzoyl)
Fig. 3: Dependence of the overall yield (OY) of an oligonucleotide synthesis on the number of steps
and the average yield (AY), which gives the percentage of extended oligonucleotide chains per step.
- 24 -
Synthetic Biology – Enginering in Biotechnology
5’ 3’
1 2 3 4
Oligonucleotide design
5 6 7 8
5’
1 2
Hybridization
5 6
5’
1 2
1st extension
5 6
5’
1
Hybridization
6
5’
1
2nd extension
6
Fig. 4: Polymerase cycling assembly (PCA): extension of oligonucleotides (40-70 bp) to larger
fragments (appr. 600 bp) by repeated melting and polymerase-based extension. Only the first two
cycles are shown.
- 25 -
Synthetic Biology – Enginering in Biotechnology
molecules per feature translating into picomolar concentrations or lower after release into solution
[54]). This is typically not enough for the subsequent stages of assembling oligonucleotides, so that a
DNA-amplification step needs to be introduced. This amplification step needs to be done on all
oligonucleotides at the same time, putting high requirements on the design of the oligonucleotides
that need to be amplified. Essentially, the oligonucleotides obtain standard linkers on both ends that
serve as hybridization targets for the PCR. However, PCR is not error free itself and thus can
contribute to the error rate in the oligonucleotide set. Furthermore, it is unlikely that indeed all
oligonucleotides can be faithfully amplified with a comparable efficiency, leading to potentially
pronounced imbalances between the amounts of oligonucleotides in a sample (or even the absence of
specific oligonucleotides).
After such an amplification step, the assembly of oligonucleotides to larger fragments was
performed by a variant of the PCA reaction mentioned above, the polymerase assembly multiplexing
(PAM) reaction. It was applied on a pool of oligonucleotides that represented the genes for 21
ribosomal proteins. However, rather than combining these 21 genes into one large DNA fragment
immediately, the authors added terminal primers that allowed amplifying only a specific sub-set of
oligonucleotides. Repeating this process, they obtained each of the 21 genes, but each from a
different reaction. In a second round of PAM reactions, they could then recombine the 21 genes with
a novel set of primers into one 14.6 kbp DNA fragment with all the genes.
- 26 -
Synthetic Biology – Enginering in Biotechnology
- 27 -
Synthetic Biology – Enginering in Biotechnology
could not be stored in E. coli (for unknown reasons) and were performed in S. cerevisiae instead. As
S. cerevisiae is very proficient in homologous recombination, transformation into S. cerevisiae of even
long sequences of interest together with a yeast-centromer-containing sequence can lead to intact
circular recombinant constructs if sufficiently large regions of overlap are available (transformation
assisted recombination, TAR) [63]. This TAR-cloning approach was used to produce the D and E
assemblies, the latter representing the complete M. genitalium genome reassembled in a YAC together
with the required yeast-specific sequences [48]. It should be noted that the E-assembly was done
directly with the 4 C-assemblies.
It is remarkable that the number of errors introduced during the assembly of the 5 – 7 kbp
fragments into a 580 kbp genome using the various in vitro and in vivo methods pointed out above
seemed to be rather small, even though the exact error rate remains unclear. However, it is clear that
the errors could be traced back to various sources – errors due to providing wrong sequences to the
commercial suppliers, errors in the cassettes obtained in return, and errors acquired during
propagation of assemblies in E. coli. Crucial here is the error management that has to be applied –
the authors of the M. genitalium genome assembly study went through a series of comprehensive
sequence determinations – in total, the genome was sequenced about 4 times at various stages of the
assembly process, excluding oversampling and the sequencing of the 5 – 7 kbp cassettes that was
done at the commercial suppliers [48]. Whenever errors in the sequence were encountered, the
corresponding assembly had to be carefully repaired or re-made.
The M. genitalium genome is currently the largest synthetic genome that has been assembled
from entirely chemically produced oligonucleotides. However, other approaches were followed to
assemble large segments of natural DNA up to entire genomes in bacterial hosts along with the
original chromosome of the host. These attempts have focused on Bacillus subtilis as host because
this bacterium is well-known for its ability to acquire DNA from the environment. In a process called
“inchworm cloning”, the entire genome of the photosynthetic bacterium Synechocystis PCC6803 was
transferred piece by piece into various sites on the chromosome of B. subtilis 168 [64]. The process
relied on double homologous recombination for transfer of the Synechocystis genome to predesigned
points in the B. subtilis genome. At 4 different positions on the B. subtilis genome, landing pads of
about two times 5 kbp of Synechocystis DNA were placed into which about 30 kbp of Synechocystis
DNA could be recombined. Then, another 5 kbp of Synechocystis DNA were placed in front of the
growing section (again by double homologous recombination) and the insertion of another 30 kbp
could take place, and so on. This way, the entire Synechocystis genome was transferred to the 4
different locations on the B. subtilis genome. This rather laborious method has been facilitated
somewhat to allow the assembly of large pieces of DNA from rather short fragments, generated for
example by PCR, on dedicated B. subtilis genome vectors [65]. This way, small genomes up to 134.5
kbp (rice chloroplast genome) could be successfully cloned and maintained.
- 28 -
Synthetic Biology – Enginering in Biotechnology
- 29 -
Synthetic Biology – Enginering in Biotechnology
Fig. 5: Development over time of the number of transistors per chip (“Moore’s law”), DNA
sequencing capacity, and DNA synthesis capacity, put off against the synthesis rate of E. coli’s DNA
polymerase III. Taken from [66].
informative to emphasize the exponential pace which DNA synthesis capacity has developed over the
last years (Fig. 5). It is safe to assume that the costs of de novo synthesized DNA fragments will be
reduced dramatically over the next couple of years, so it can be expected that the DNA synthesis is
likely to transform the working routines of the standard molecular biology laboratory in a similar
fashion to the introduction of PCR.
- 30 -
Synthetic Biology – Enginering in Biotechnology
Current application areas of lab-on-a-chip microfluidics nowadays are DNA analysis (PCR
reactors, gel electrophoresis, hybridization arrays), biochemical mix- and read analysis, protein
purification, quantification, and crystallization, cell handling and monitoring, and chemical
synthesis.
- 31 -
Synthetic Biology – Enginering in Biotechnology
electrophoretic separation of biomolecules [67]. Additionally, the surface of the microchannels allows
applying electroosmotic transport phenomena to valvelessly manipulate liquids on the chip.
Channels usually connect micro-chambers, one of the most versatile building blocks and
typically with a volume between 1 nL and 10 L. Micro-chambers can serve as a volume for a variety
of chemical reactions such as labeling, incubation, crystallization, or extraction. They can be
combined with filters or retention structures to keep particles inside the chamber and to prevent
them from entering the downstream system.
As laminar flow is predominant in microfluidic structures due to the small characteristic
dimensions of the channels, mixing generally relies on diffusion. To shorten mixing times, a number
of mixing blocks have been developed, including lamination and binary mixers. Due to the short
diffusion lengths in microfluidic systems, complete mixing can be easily achieved over times on the
order of 10 ms.
While the above-mentioned building blocks are passive, there are also a number of active
building blocks available for microfluidic devices. Fluid control can be achieved by integrated valves
and peristaltic pumps, often controlled by pneumatic actuation. Resistive heaters and miniaturized
Peltier elements allow for heating and cooling with rapid temperature transients. Cells can be
disrupted by electroporation or by ultrasonic cavitation.
A crucial task in a lab-on-chip system is the detection of a chemical species due to the small
detection volumes. Because of its sensitivity and versatility, fluorescence is the prevalent detection
method used especially with biomolecules. The detection optics and electronics are generally not
integrated into the microchip but rather form a part of the external measurement setup. However,
optical lenses and fibers aligned with the detection chambers have been integrated onto chips to
improve the sensitivity. Other detection modes can require on-chip electrodes, for instance for
conductivity, amperometric or voltametric detection. These electrodes are formed by gold or platinum
thin layers, which are deposited and structured in the same ways as the microfluidic channels.
In summary, the available structures suggest that it should be easily possible to transfer a
multitude of laboratory processes to smaller scale and open them thus up for miniaturization and
parallelization.
- 32 -
Synthetic Biology – Enginering in Biotechnology
patterned photolithographically. Then the channels are chemically etched into the substrate where
the protection layer has been opened and the etched grooves are closed by bonding a suitable cover
plate to the substrate after the protection layer is chemically removed. This method allows glass and
fused silica (quartz) to be used, which due to their excellent chemical properties can serve as reaction
vessels for rather aggressive chemicals.
A relatively new method is the fabrication of microfluidic components based on the replication
techniques of polymers. Replication is advantageous for rapid prototyping as well as cost-efficient
mass-production of microfluidic devices in polymers using a micro-machined master structure. In
general, all replication methods allow inherently parallel pattern generation on a large number of
substrates using a single mold. Additionally, commercial nano-imprinting and -embossing systems
are available for routinely patterning in the sub-100-nm range.. For the fabrication of the mold,
etched silicon substrates, lathed metal sheets, and thick photoresist structures are used. For the
elastomer, the commercially available poly(dimethylsiloxane) (PDMS) is mostly used because of its
good optical properties and self-sealing capacity [68]. In particular the optical properties make this
material highly suitable in the manufacturing of microstructures for molecular biology purposes
(fluorescence measurements are possible), even though the material has also been found to be prone
to biofilm formation, which can interfere with cultivation techniques [45].
2.3.2.3.Applications of microfluidics
Plenty of processes relevant for molecular and ultimately for Synthetic Biology have already
been integrated into microfluidic devices: DNA ligation [69] (Fig. 6), continuous [45] and multiplex cell
culture devices that allow the composing of different media compositions [70], amplification of large
sections from single cell genomes for sequencing [71], highly parallelized measurements of affinity
constants [72], and RNA and DNA purification from cell samples [73]. Just to give a flavor of the
diverse applications, a few interesting examples are mentioned in more detail:
Valves
Fig. 6: Close-up of the microfluidic chip for a single ligation procedure [69].
- 33 -
Synthetic Biology – Enginering in Biotechnology
60°C
77°C
95°C
Next, the actual PCR process was integrated into a microfluidic device which could also
accomplish the mixing. With highly parallelized chips, it is far from trivial to obtain the correct liquid
composition and to have it to enter a specific channel. By implementing pneumatically actuated
pumps and valves, reagents (DNA polymerase, primers) could be precisely distributed for example
from a single inlet to hundreds of reaction chambers. There, they were actively mixed with different
samples to form a matrix of up to 400 different reactions of 3 nL volume. This matrix approach
dramatically reduced the separate fluid handling steps from 1200 for a conventional PCR to only 41
to load the microfluidic chip. At the same time, the on-chip distribution of reagents allowed
distributing a single 2 L aliquot of polymerase over all 400 reactions (Fig. 8) [76].
- 34 -
Synthetic Biology – Enginering in Biotechnology
Fig. 8: Schematic diagram of a PCR chip comprising 400 distinct reactions in a single device [76].
inside the channels from the center of the disc towards its rim. Valving is achieved in this case with,
e.g., burst valves or hydrophobic patches. At initially low angular speeds the liquid does not have the
necessary energy to break through these valves; only as the disc’s rotational speed is increased do
they burst through [77]. The advantages of CD-type microfluidic systems are inherently parallel
processing and analysis, cheap and highly developed fabrication techniques, optical readout similar
to a CD player and flow control qua design. This has been exploited for example for highly parallelized
bio-affinity assays (www.gyros.com): A CD with 120 microfabricated units consisting of sample and
reagent inlets, volume definition chambers and packed beds of 10 nL volume with particles such as
phenyldextran-coated polystyrene or agarose beads was used to capture proteins (Fig. 9). Next, a
labeled secondary recognition molecule was applied. While spinning the CD, the secondary label that
bound to the analyte on the column was quantitatively assessed by laser-induced fluorescence
detection [78].
- 35 -
Synthetic Biology – Enginering in Biotechnology
g force
single cells integrated compartments for cell lysis, mRNA isolation, cDNA production and purification,
and product collection [79], and was used for gene expression profiling of human embryonic stem
cells [80]. A more detailed view of the integration can be obtained from Fig. 10. A single cell is
captured for lysis in a channel, exposed to lysis buffer, and the resulting mixture is pushed through a
section with beads containing oligo-dT nucleotides which can hybridize to the polyadenylated mRNA
molecules in the cell and capture them this way. Next, the captured mRNA is washed and reverse
transcribed into cDNA by exposing the beads to the appropriate buffer with the required polymerase.
The formed cDNA remains hybridized to the beads and can be recovered from the recovery wells. The
procedure was demonstrated to have an mRNA to cDNA efficiency of approx. 50% (5-fold better than
the equivalent reaction in a standard reaction vessel) and a minimum of 4 mRNA molecules could be
detected in the device.
The applications discussed above clearly indicate that microfluidics based systems are indeed
highly suitable to carry out even complex multi-step processes on a miniaturized scale that that is
highly amenable to parallelization. Even though many of the examples currently reproduce only
single steps and avoid true integration of many steps on one chip – which would be eventually
desired to truly reduce the workload in activities such as comprehensive genome modification or
genome re-synthesis or repeated analyses – it is clear that only within microfluidics we can find the
technologies to adapt the separate steps required for synthesizing a biological system and analyzing
decisive parameters repetitively to the appropriate scale. The broad scope of functional elements for
- 36 -
Synthetic Biology – Enginering in Biotechnology
Fig. 10: Integration of cell capture (insert 2, brown, containing a single captured cell in green), cell
lysis (insert 1, left half, lysis buffer in yellow), capture of polyadenylated RNA on oligo-dT beads (insert
3, purple, upper half of figure), and recovery in collection wells of the beads after cDNA production of the
captured mRNA (insert 4, right lower corner). The beads were recovered by cutting the wells off the chip.
Pump valves in green, separation valves in black, flow channels in yellow, control channels in red,
collection and waste channels in blue
microfluidic systems augurs well for the integration of many different functions on one chip, be they
geared towards biological system synthesis or parallelized analytics.
- 37 -
Synthetic Biology – Enginering in Biotechnology
the number of outlets in circuit one matches the number of inlets in circuit two and the voltage
changes that are produced in circuit one can be perfectly read into circuit two, which turn produces
voltage changes that can be read out by circuit three, etc. In other words, there is one common signal
carrier that is passed in a standardized form from circuit to circuit and ensures that the three
circuits can interact.
2.4.1. Modularity in DNA-based parts and devices – the Registry of Standard Biological Parts
One way to transfer this concept to biological systems is to argue that a “RNA polymerase flow
along the DNA” (equivalent to “electron flow through the conductor”) can be such a signal carrier and
parts are suitably organized DNA sequences. Sets of genetic parts (“devices” in Fig. 11) can be
organized such that they can simply be linearly assembled, with one element driving transcription for
the next element, which in turn drives transcription for the third element, and so on.
external
signal
A)
reg. protein reporter
higher-oder
device
system
B)
Fig. 11: Modularity in biological systems: A) several parts (ribosome binding sites, regulatory
genes (positive (pos.) and negative (neg.), reporter genes, transcriptional terminators, and promoters) are
assembled into devices. To guarantee seamless assembly, devices are designed such that usually their
output, a promoter clearance rate (also known as “PoPS”, polymerases per second), is between a lower
and an upper limit. The devices can be assembled to higher-order devices (e.g. a ring-oscillator, Fig. 13
[81]) and ultimately into systems.
- 38 -
Synthetic Biology – Enginering in Biotechnology
This scheme, which is in particular entertained by the Synthetic Biology community at the MIT
(Cambridge, Mass.), can take the analogy with the electrical circuit design very far: genetic “parts”
(genes, ribosome binding sites, terminators, promoters, etc) can be assembled into devices (inverters,
amplifiers), these devices into devices of higher order function (oscillators, logic gates), and so on. The
specific design of the devices (polymerases in, polymerases out) ensures simple assembly of pre-
manufactured devices, which in principle should be well amenable to automation (it has to be
mentioned, though, that this specific design concept has been developed already earlier, see e.g. [82,
83]).
Having automation in mind, parts design and assembly procedure - parts to devices to systems
– have to respect certain rules so that automation can proceed seamlessly ([84] – the procedure is
shown in Fig. 12). Together, these concepts represent one of the pillars of the “Registry of standard
biological parts” at MIT (http://parts.mit.edu). This “registry” is at the moment a physical and
computational resource that harbors several thousands parts and devices that have been produced
in the course of student design competitions (see below). The idea behind is to provide a central
infrastructure assembled according to specific “standards” that ultimately will make a major
contribution to the modularity and the “engineer-ability” of biological systems. It also paves the way
to the important concept of “re-utilization”: Once a part or device has been properly engineered and
characterized, it would be easiest if its nomenclature, documentation, and physical location is clear
to the entire community and access is easy (the inherent IP problems are addressed further below).
As – eventually – everybody in the community is supposed to use the same set of standards, the
part/device could be used without further engineering, all relevant information could be obtained (in
compact, again standardized form) from the registry (instead of by lengthy literature searches), and
improved parts/devices could be returned to the registry.
An important additional element of the registry is that it tries to implement elements of the
hierarchy of abstraction and in this way makes the activity of biological engineering more and more
accessible to people with a strong engineering but poor biology background. Ideally, the process is
supposed to work as this: a DNA sequence (engineered according to the specifications outlined above)
is deposited in the registry and its engineering-relevant properties are documented. Depending on the
type of sequence, it is assigned an icon. Then, assembling circuits becomes thus an issue of linearly
assembling icons, and the task of converting a design into DNA sequence can be performed by the
computer (and then physically by a DNA synthesis provider).
Ideally, the behavior of the assembled device should be predictable from the properties of the
assembled elements. This is of course at the moment impossible – the required quantitative data is
simply not available in the vast majority of cases. To illustrate these points: While RT-PCR allows a
quantitative (though laborious) access to determine promoter clearance rate, there is no easy-to-apply
and general method to connect a RBS sequence to the rate of initiation of polypeptide-chains at the
ribosome. And there is no way to predict the half-life of an mRNA beyond some general observations
- 39 -
Synthetic Biology – Enginering in Biotechnology
(“stable secondary structures – probably long half-life”). Consequently, it is difficult to predict even
the steady-state protein concentration in a cell.
Still, should these data be available in sufficient quality at some point in the future, the format
of the registry would allow our huge knowledge in systems engineering, acquired in completely
different fields such as electrical engineering, to be used for the robust design of for example
biological circuits. The molecular complexity of the various processes would have been hidden under
icons that have the proper documentation attached including a number of crucial parameters that
can be fed into suitable mathematical models which allow predicting the device behavior. To use a
frequently quoted metaphor in the community: “biological engineering is in the pre-ohmic age: when
people did not know better, it was important to know which type of wire in which length from which
supplier to use to achieve a specific effect in a circuit. Today all we need to know is the resistivity,
measured in Ohm” (Randy Rettberg).
A)
B)
Fig. 12: Seamless part assembly: A) The standard biobrick sequence interface. B) An insert
(pruned of any internal EcoRI, NotI, XbaI, SpeI, PstI sites) but with the EcoRI/XbaI part of the sequence
interface at its 5’-end and the SpeI/PstI part at its 3’-end is digested with EcoRI and SpeI. The insert is
inserted into EcoRI/XbaI digested vector, the compatible overhanging ends of SpeI and XbaI ligate but
the restriction sites are not recreated. This way, the SpeI/PstI sequence at the 3’-end remains intact
and unique and so remains the EcoRI/XbaI sequence at the 5’-end. The larger fragment that results
from this operation can be treated in exactly the same way as before the next time round. A similar
approach is applied when the insert has to be inserted to the right of the SpeI site – the vector is then
cut with SpeI/PstI and the insert with XbaI/PstI.
- 40 -
Synthetic Biology – Enginering in Biotechnology
LacI
OFF ON ON OFF
PL PL
lacI cI gfp lacI cI gfp
Ptrc-2 Ptrc-2
CI GFP
fluorescence
B)
PL::tetO1
TetR* CI* LacI* gfp*
tetO1 GFP*
tetR* cI* lacI* PL
Plac01 PL::tetO1 PR
fluorescence
Fig. 13: Genetic circuits. A) Toggle switch in E. coli. When the system is in the stage of the left
panel, the Ptrc-promoter is on (no LacI present). This leads to the production of CI (and green fluorescent
protein, GFP, the read-out signal). CI turns off the PL promoter, which prevents LacI from being produced
– the system state is maintained. Note that this situation does not require the continued presence of
IPTG once the state is established. A brief heating of the system (in the absence of IPTG) reverses the
situation: The temperature sensitive CI protein decays, PL is turned on, LacI gets produced, turns of Ptrc,
CI is no longer produced, the system state has switched. Again, only a short signal (heat pulse) is
required to establish the state. This state can be switched again by adding IPTG at 37oC. B)
Repressilator in E. coli constructed from 3 repressor/promoter sets: The TetR repressor turns off the
PL::tetO1 promoter. This prevents the production of the CI repressor, which n turn allows the formation
of LacI, which in turn turns off production of TetR, which in turn allows the production of CI, and so on.
The system state is read out from a second plasmid, where a gfp gene is transcribed from the PL::tetO1.
In order to provide a suitable system dynamics, tagged repressor proteins are employed that are
degraded much more quickly than non-tagged original versions. C) The programming of spatiotemporal
patterns in E. coli. Left: sender cell, produces acylhomoserine lactone (AHL). The AHL diffuses out of the
cells and establishes a concentration gradient. At high AHL concentrations, the receiver cell starts to
produce LacI and CI (again the star indicates tagged versions). The LacI repressor suppresses the
formation of GFP protein. At intermediate concentrations, less LacI is produced from PluxR. Therefore,
GFP is produced because the CI repressor, though also produced at a lower rate, is still available at a
sufficient concentration to turn off PR::O12 and thus to prevent LacI production from this promoter. At low
AHL concentrations, no LacI is produced from PluxR and no CI is produced – and as a consequence, LacI
gets produced from PR::O12, which turns off Plac and suppresses again GFP production. Therefore, GFP
production is limited to the regions of intermediate AHL concentration – a medium pass filter.
- 41 -
Synthetic Biology – Enginering in Biotechnology
- 42 -
Synthetic Biology – Enginering in Biotechnology
- 43 -
Synthetic Biology – Enginering in Biotechnology
suggested by the evolutionary screen were diverse, involving mutations in the RBS, the start of the
gene, and the end of the gene, contributing to lower protein levels or reduced dimerization. A second
round of mutagenesis in variants that that had their first mutation inside the cI gene revealed
proteins with attractive properties for the circuit and mutations that tended to weaken either the
DNA-protein interactions of the repressor or the protein-protein dimerization profile. Both
experiments clearly demonstrate that the recruitment of directed evolution is a highly feasible
approach to improve the dynamic properties of specific parts in genetic circuits.
The careful adjustment of component levels will most likely become more important the more
complex the system to assemble becomes, for example the assembly of a multi-gene metabolic
pathway. The laboratory of Jay Keasling at UC Berkeley has assembled a pathway to produce a
precursor for the antimalarial artemisinin in E. coli [1, 96, 97] and S. cerevisiae [52]. The entire novel
pathway reaches from acetyl-CoA to artemisinic acid and involves 11 genes assembled from E. coli, S.
cerevisiae, and Artemisia annua, a plant that naturally produces artemisinin. As illustrated in more
detail in Fig. 14, a variety of the discussed tools were employed in the assembly of the various
sections of the pathway. In particular, the group exploited the RNA tools already discussed above to
fine-tune the relative amounts of enzymes required to convert acetyl-CoA to mevalonate. This way,
accumulation of a toxic intermediate could be avoided and strain productivity considerably improved
[97].
- 44 -
Synthetic Biology – Enginering in Biotechnology
part of the antibiotic erythromycine). To shuttle growing ketide chains between modules (and
modules on different proteins), the polyketides are attached to sufficiently long polypeptide linkers.
Researchers at Kosan Bioscicence exploited that highly regulated structure [99]: They re-synthesized
the DNA sequences for the various domains of 8 PKS clusters after codon-optimization and
introduced restriction sites at the DNA sequences equivalent to the domain boundaries and the
polypeptide linkers. The restriction sites were selected such that that they only minimally interfered
with “consensus” amino acid sequences derived from comparing the corresponding amino acid
sequences of 14 PKS clusters, and that the domain-DNA sequences could be transferred easily
between sequences. When then the sequences for a loading module, an extender and 2 modules for
linkers are assembled onto one plasmid and the sequences for one linker, one extender, and one
thioesterase gene onto another plasmid, all items to produce a triketide should be in place. By
exchanging various sequences in these plasmids for homologous variants from other PKS clusters, a
154-membered combinatorial library was produced, and just under half of the variants were proven
functional by confirmation of production of triketides, indicating that the system indeed shows a
remarkable – though not total – degree of modularity. Respecting additional rules that reduced the
modularity of the system (by enforcing the joint utilization of specific modules to ensure active
interfaces) could even increase the fraction of functional combinations [100, 101].
However, even though modularity can be achieved even to considerable extents for a few protein
systems, there is at the moment no single unifying concept recognizable that would allow viewing at
least protein assembly as comprehensively modular. As far as I can see, no other groups of molecules
have been investigated for modularity in the context of Synthetic Biology. Therefore, for the time
being modularity will be limited to the assembly of DNA-parts.
- 45 -
Synthetic Biology – Enginering in Biotechnology
MevT - operon
Para
O AtoB O O
HmgS OH
O tHmgR OH
HO2C HO2C
SCoA SCoA SCoA OH
Plac
pBBR-derivative
erg12 erg8 MVD1 idi ispA
ERG8
MVD1 Idi
OPP IspA
OH ERG12 OH
HO2C
HO2C
OPP OPP
OH OPP
Ptrc P lacUV5::Ptac
p15A-derivative ads pBR322-derivative p450 cpr
H H H
H H H
HO
OPP
O
FPP amorphadiene amorphadiene artemisinic acid
Fig. 14: Engineering an artificial pathway for the production of artmisinic acid, a precursor of the
antimalarial artemisinin, into E. coli. Denoted are the operon structures of the various genes, the origin
of the plasmid vector on which the different operons are located, the promoters from which expression is
driven, and the manner in which the various synthetic and PCR-derived genes were connected (arrows
indicate tight coupling of genes by placing a strong Shine-Dalgarno site a few base pairs behind the
stop-codon of the gene before, stem-loop structures indicate the utilization of tunable intergenic regions
(TIGRs). Please note that production of artemsininc acid has never been reported for a strain containing
a combination of these specific plasmid. Variants of the first three operons were used to produce
amorphadiene in E. coli, while the fourth operon was used in combination with a plasmid pAM92 (the
specifics of which are unclear, but presumably it contains the three first operons) in E. coli DH1 to
produce artemsinic acid. Information assembled from Chang et al. Nature Chem Biol 5:274; [97], and
Martin et al., Nature Biotech 21:796. DXP: 1-deoxy-D-xylulose-5-phosphate; IPP: isopentenyl
pyrophosphate; DMAPP: dimethylallyl pyrophosphate; FPP: farnesyl pyrophosphate; HMG: 3-hydroxy-3-
methylglutarate.
- 46 -
Synthetic Biology – Enginering in Biotechnology
A minimal genome has been defined as the smallest possible group of genes that would be
sufficient to sustain a functioning cellular life form under the most favorable conditions imaginable,
that is, in the presence of a full complement of essential nutrients and the absence of environmental
stress [102]. Such a genome would have a couple of very attractive aspects for Synthetic Biology.
First, it would help understand cellular function by re-building it and learning from the differences.
Here, a minimal genome would represent an experimentally confirmed reference point that indicated
the minimum set of functions required for independent reproduction. Any more complex phenotype
would need to be re-designed and implemented into such a “minimal cell” and could thus be studied.
In fact, even the important elements of the minimal gene set could be studied, until not only the set
of genes as such, but also aspects of interaction or physical organization have been fully understood.
Second, such a genome would be attractive to system designers because it would help in
rationally re-designing cells to serve as “chassis” for implementation of biological system designs. For
example, the information for more complex phenotypes could be re-implemented, the impact of the
additional interactions studied at an “-omics”-level, and undesired interactions eliminated by
adapting the responsible proteins (eliminating cross-reactivity for regulatory proteins, eliminating
undesired allosteric interactions between metabolites and proteins, etc.). Then, the next trait could be
re-established and the same set of analyses could be performed. This way, the enormous task of
adapting a bacterial cell to engineering specifications could be reduced into a sequence of
(theoretically) manageable steps. Furthermore, viewed with an emphasis on metabolic engineering,
the reduction of the cellular metabolic network to a core would allow the re-building of rather specific
metabolic networks without the complication of alternative pathways, which might be helpful in the
rapid design of bacterial overproducers of chemical compounds.
Of course, the reduction of more complex genomes might lead to losses in cellular “robustness”
(defined as the ability to maintain cellular function despite external and internal perturbations [103])
- for example, redundant metabolic functions would most probably a prime candidate for elimination
in a minimal or reduced genome, but they are at the same time a determinant of robustness,
guarding against failure of single functions. Obviously, we need to understand the trade-offs that will
be involved between cellular reduction and robust function under a set of specific environmental
conditions. On the other hand, novel biological systems will be designed – at least in the immediate
future – to function under highly unnatural, potentially rather constant environmental conditions
(such as a well mixed continuous bioreactor, free of any other competition than mutants), and it
remains to be seen what the actual requirement for robustness is in such systems.
Two important questions are then how big a minimal genome might be and how a minimal
genome could be achieved.
- 47 -
Synthetic Biology – Enginering in Biotechnology
Regarding the first question, a couple of theoretical and practical studies have tried to address
the question. On the theoretical side, comparisons of gene sets in fully sequenced bacterial genomes
allow to identify genes that are common to all or at least to a substantial subset of available genome
sequences. Such genes represent good candidates for membership in the “minimal genomic set”. On
the other hand, specific essential features can be realized differently in different strains, arguing that
a more appropriate to look for a minimal set of functions (rather than genes) [102]. Still, current
estimates place the number of genes based on genome comparisons at around 250 genes [102].
Comparing genomes from bacterial endosymbionts (which can benefit from rather constant
environments as they live inside higher cells) and exploiting also experimental data derived from
transposon mutagenesis studies on small genomes (see below), an alternative estimate suggested a
minimal genome to consist of even fewer genes: 206 (Tab. 2 [104]).
On the experimental side, a comprehensive transposon mutagenesis study of M. genitalium, the
bacterium with the smallest genome that can still grow independently, identified 382 genes that
could not be disrupted by transposon insertion [105, 106]. In fact, the number of genes required for
independent growth might be even higher, because until today no study exists in at least two one of
the “dispensable” genes from the organism was eliminated at the same time. Consequently, it might
well be that some of the genes identified as dispensable might only be dispensable as long as their
function can be complemented by another “dispensable” gene. A similar study with Haemophilus
influenzae identified approximately 670 indispensable genes. A combined
computational/experimental study with B. subtilis placed the number of genes essential for growth
on rich media at 271 [107], while a knock-out study for E. coli growing on rich medium reported 303
genes which could not be eliminated. In summary, the available data indicate the number of genes
required for a minimal genome for an independently growing bacterium is between 250 and 400.
The second important question is how such a minimal genome, or at least a reduced genome,
might actually be constructed. Here, all experimental approaches have so far followed the same
strategy of reducing an existing genome, eventually to the point where no additional genes can be
removed any longer. This approach was followed for example for E. coli and B. subtilis and up to 30%
of the genome was reduced (Tab. 3). Interestingly, the impact of these reductions on the physiology of
the strains was very small. One obvious beneficial aspect was the removal of transposon and IS-
element sequences from the E. coli genome, which apparently enhances the reliability of E. coli as a
host for recombinant genes [108]. The same basic idea is also behind the transposon studies with
genomes that are already small, for example for M. genitalium [105, 106].
However, with the chemical synthesis of the entire M. genitalium genome there is now an
additional way available to generate a minimal genome. Rather than reduction of a large genome, one
can could consider the construction of various artificial chromosomes based on the various
hypotheses mentioned above and transplant such a chromosome into a cell. As discussed, most of
- 48 -
Synthetic Biology – Enginering in Biotechnology
- 49 -
Synthetic Biology – Enginering in Biotechnology
these steps have already been achieved by researchers of the J. Craig Venter Institute [46, 48] and it
appears to be only a question of time when this approach will indeed become experimentally feasible.
Next to the scientific and technological challenges, a successful Synthetic Biology also requires
a change in the mind-set of the involved community, on several issues. As already discussed above,
resources like the “Registry of Standard Biological Parts” require a commitment from the community
to adhere to certain standards, for the structure of parts as well as for the measurement protocols to
characterize them. The development and implementation of such suitable standards is the goal of the
“Biobricks foundation” (http://bbf.openwetware.org/).
It is encouraging to note that the Systems Biology community is already heavily involved in this
sort of discussion and has in fact already produced a number of suggestions for standards, for
example on microarray data [119] and biochemical models [120, 121].
- 50 -
Synthetic Biology – Enginering in Biotechnology
a
Axenic: culture with only one bacterial strain growing
- 51 -
Synthetic Biology – Enginering in Biotechnology
Another point that the community has to devote some attention to is education. Synthetic
Biology recruits students and researchers from what one might think of the “two opposite poles in
science and technology” – phenomenological biology on the one side and mathematics- and modeling-
driven engineering. Consequently, communication is not trivial and traditionally, very different
groups of students are addressed. Currently, first careful steps are taken in the direction of
implementing Synthetic Biology into masters programs (usually Systems Biology-oriented, see MSc
program in Systems and Synthetic Biology at ParisTech).
However, as a discipline oriented to the design of tangible outcomes, Synthetic Biology has an
extremely attractive teaching proposition that is usually absent from more fundamentally oriented
other biological disciplines: It works towards the design of tangible biological systems. In other words,
students can be part of an “engineering project”, at the end of which there should be “the system”,
whatever that might be (at the moment usually one or the other form of fluorescing bacteria). This
proposition is exploited by the international student competition “iGEM”. Every year, student teams
(typically MSc and BSc, but also PhD students) work over the summer break on a biological design
project. Such projects rely typically on interdisciplinary student teams recruited from engineering
and natural science backgrounds, which go through the various steps in biological design (project
definition, modeling, design space exploration, implementation, debugging) and finally present the
project in a large meeting in November at the MIT in Boston/Mass. The competition reaches teams all
over the world (incl. China, Japan, India, Australia, and Mexico next to North America and Europe)
and has drawn 12 teams from Europe in 2007 (www.igem.org).
Synthetic Biology develops after the fierce debates on genetic engineering in the late 1980s and
1990s and in the face of continuing unrest over genetically modified plants (at least in Europe).
Therefore, it is only prudent to reflect - and where possible to anticipate - the potential debates that
Synthetic Biology is likely to trigger. In fact, the debate on the place of Synthetic Biology in society
has been an intrinsic part of the development of Synthetic Biology nearly from the very beginning.
Already the “Synthetic Biology 2.0” conference in Berkely (Calif.) in 2006, the second international
conference of the community, dedicated entire sessions to the various societal aspects of Synthetic
Biology (http://pbd.lbl.gov/sbconf/), and so has “Synthetic Biology 3.0” (2007 in Zürich,
http://www.syntheticbiology3.ethz.ch/). By then, it was already the focus of critical attention for a
substantial set of NGOs (http://www.etcgroup.org/en/materials/publications.html?id=11), which
requested that the development of Synthetic Biology should happen in a broader societal context. In
fact, Synthetic Biology has already drawn rather comprehensive critique from NGOs [125].
- 52 -
Synthetic Biology – Enginering in Biotechnology
Overall, it is probably safe to say that the Synthetic Biology community has been very quick to
embrace public debate, and this early embrace has already produced a rich variety of materials on
some, though not all, aspects that merit a deeper discussion [126, 127].
b Danger: a theoretically possible bad outcome of a technology; risk: likelihood that the bad outcome
materialises
c the arising of novel and coherent structures, patterns and properties during the process of self-
organization in complex systems (J. Goldstein (1999), “Emergence as a Construct: History and
Issues”, Emergence: Complexity and Organization 1: 49-72
- 53 -
Synthetic Biology – Enginering in Biotechnology
In addition, large scale manipulations might allow addressing biosafety concerns much more
effectively: substantial re-design of a genome allows also to thoroughly interfere with the viability of
manipulated cells: it is for example easy to imagine how complete metabolic pathways are simply left
out of re-designed cells (rather than single genes inactivated), reliably enforcing the external supply of
specific nutrients.
b) Regarding the implementation of orthogonality: Where orthogonality requires expanding
cellular systems (because specific functions have to be implemented twice, for example once for each
orthogonal subsystem), it is highly likely that such systems will depend on the highly regulated
environment of the laboratory to prosper, because those cellular systems are unlikely to be more
competitive in the natural environment. But what if a cell uses a completely new chemical alphabet
for encoding genetic information? Or, perhaps somewhat closer to today’s technical possibilities: what
are the chances of survival of a bacterium that has undergone a re-assignment of two codons to
unnatural amino acids and whose codon usage has been adapted genome-wide? On the one hand,
such a bacterium would most probably contribute to biosafety because transferring DNA would be
rather pointless for the bacterium (no other bacterium could read the code). On the other hand, how
could a bacteriophage adapted to nature’s classical code attack a bacterial cell that uses a different
code? After all, it could no longer produce functional proteins based on its own genetic information.
Would that (to any notable extent) upset the “normal” processes that regulate the survival of cells or
higher organisms in the environment? In my view, this question remains largely open and should be
pursued further – in particular as there is probably still considerable time before we will be able to
construct truly “orthogonal” living cells.
- 54 -
Synthetic Biology – Enginering in Biotechnology
simple in the future for any ill-intending individual to equip him- or herself with the agents required
to spread diseases at will.
Even though the reality is much more complex (acknowledging that it is still a major piece of
research to assemble kilobases of correct DNA sequence, let alone convert this sequence into a
biologically viable entity), this specific scenario is taken seriously [130, 131] and has spurred
considerable activities, in particular in the US [132, 133], but also in Europe [134]. Specifically, a
number of policy options were identified (such as DNA-sequence providers need to screen orders to
detect suspicious sequence requests, owners of DNA synthesizers must register their machines and
be licensed, and the competences of Institutional Biosafety Committees need to be expanded [133]),
and some of these options have already been endorsed by DNA sequence providers [135].
3. Summary
Synthetic Biology has emerged as an initiative that wants to fundamentally address the
properties of living systems that make them difficult to engineer rationally. Though there is no
guarantee that this ambition might be successful at the end, the focus on designing biological
systems is already producing considerable advances in our understanding of biology and our
- 55 -
Synthetic Biology – Enginering in Biotechnology
concepts for biotechnology. In my view, considerable success in the major fields of Synthetic Biology –
orthogonality, coping with evolution, de novo DNA synthesis, an automated synthetic laboratory
infrastructure, rational design of parts, devices, and eventually systems – is a prerequisite for
biotechnology to turn into the truly pervasive successful industry that it has been predicted to
become many times over the last 35 years.
If successful, Synthetic Biology will re-emphasize the role that biotechnology can play in the
future development of our society, and this will most probably intensify the discussions around the
potential safety-, security-, environmental, and ethical implications of this discipline. The scientific
community needs to be prepared for this.
4. Acknowledgments
The concepts discussed in this report are the result of the ideas of and discussions with many
different people. I am especially grateful to Randy Rettberg, Drew Endy, Tom Knight, Ron Weiss,
George Church, Victor de Lorenzo, Luis Serrano, Vitor Martins dos Santos, Markus Schmidt, Jürgen
Pleiss, Matthias Heinemann, Jörg Stelling, and Martin Fussenegger.
5. Useful websites
- 56 -
Synthetic Biology – Enginering in Biotechnology
DNA synthesis:
www.febit.com/
www.nimblegen.com/
www.microsynth.ch/ (Switzerland, mainly oligonucleotides, synthesis of large DNA fragments
together with Sloning)
www.geneart.com (Germany)
www.entelechon.com/ (Germany)
www.sloning.de/ (Germany)
www.DNA20.com (USA)
www.blueheronbio.com/ (USA)
www.codondevices.com/ (USA)
- 57 -
Synthetic Biology – Enginering in Biotechnology
6. Abbreviations:
- 58 -
Synthetic Biology – Enginering in Biotechnology
7. References
1. Chang, M.C.Y., et al., 2007. Engineering Escherichia coli for production of functionalized
terpenoids using plant P450s. Nature Chemical Biology 3: 274-277.
2. Bialy, H., 1997. Biotechnology, bioremidiation, and blue genes. Nature Biotechnology 15: 110.
3. Nakamura, C.E. and G.M. Whited, 2003. Metabolic engineering for the microbial production of
1,3- propanediol. Current Opinion in Biotechnology 14: 454-459.
4. Flotte, T.R., 2007. Gene therapy: The first two decades and the current state-of-the-art.
Journal of Cellular Physiology 213: 301-305
5. Kimelman, N., et al., 2007. Gene- and stem cell–based therapeutics for bone regeneration and
repair. Tissue Engineering 13: 1135-1150.
6. Huang, J., C.W.P. Foo, and D.L. Kaplan, 2007. Biosynthesis and applications of silk-like and
collagen-like proteins. Journal of Macromolecular Sciencew, Part C: Polymer Reviews 47: 29–
62.
7. Johnson, C.J., et al., 2008. Proteomics, nanotechnology and molecular diagnostics.
Proteomics 8: 715-730.
8. Teusink, B., et al., 2000. Can yeast glycolysis be understood in terms of in vitro kinetics of the
constituent enzymes? Testing biochemistry. European Journal of Biochemistry 267: 5313-
5329.
9. Haddadin, F.T. and S.W. Harcum, 2005. Transcriptome profiles for high-cell-density
recombinant and wild-type Escherichia coli. Biotechnology and Bioengineering 90: 127-153.
10. Andrianantoandro, E., et al., 2006. Synthetic biology: new engineering rules for an emerging
discipline. Molecular Systems Biology: 2006.0028.
11. Arkin, A.P. and D.A. Flethcer, 2006. Fast, cheap, and somewhat in control. Genome Biology 7:
114.
12. Benner, S.A. and A.M. Sismour, 2005. Synthetic Biology. Nature Reviews Genetics 6: 533-543.
13. Brent, R., 2004. A partnership between biology and engineering. Nature Biotechnology 22:
1211-1214.
14. Endy, D., 2005. Foundations for engineering biology. Nature 438: 449-453.
15. Hasty, J., D. McMillen, and J.J. Collins, 2002. Engineered gene circuits. Nature 420: 224-30.
16. McDaniel, R. and R. Weiss, 2005. Advances in synthetic biology: on the path from prototypes
to applications. Current Opinion in Biotechnology 16: 476-483.
17. Heinemann, M. and S. Panke, 2006. Synthetic biology – putting engineering into biology.
Bioinformatics 22: 2790-2799.
18. Pleiss, J., 2006. The promise of synthetic biology. Applied Microbiology and Biotechnology 73:
735-739.
19. Fleischmann, R.D., et al., 1995. Whole-genome random sequencing and assembly of
Haemophilus influenzae Rd. Science 269: 496-512.
20. Karp, P.D., et al., 2007. Multidimensional annotation of the Escherichia coli K-12 genome.
Nucleic Acids Research 35: 7577–7590.
21. Hartwell, L.H., et al., 1999. From molecular to modular cell biology. Nature 402: C47-C52.
22. Klipp, E., et al., 2007. Systems biology standards - the community speaks. Nature
Biotechnology 25: 390-391.
23. Wang, L., J. Xie, and P.G. Schultz, 2006. Expanding the genetic code. Annu. Rev. Biophys.
Biomol. Struct. 35: 225–249.
24. Chin, J.W., 2006. Modular approaches to expanding the functions of living matter. Nature
Chemical Biology 2: 304-311.
25. Rackham, O. and J.W. Chin, 2005. A network of orthogonal ribosome-mRNA pairs. Nature
Chemical Biology 1: 159-166.
26. Isaacs, F.J., et al., 2004. Engineered riboregulators enable post-transcriptional control of gene
expression. 22: 841-847.
27. Chan, L.Y., S. Kosuri, and D. Endy, 2005. Refactoring bacteriophage T7. Molecular Systems
Biology 1: msb4100025-E1-msb4100025-E10.
- 59 -
Synthetic Biology – Enginering in Biotechnology
28. Bayer, T.S. and C.D. Smolke, 2005. Programmable ligand-controlled riboregulators of
eukaryotic gene expression. Nature Biotechnology 23: 337-43. Epub: 2005 Feb 20.
29. Dhanasekaran, M., S. Negi, and Y. Sugiura, 2006. Designer zinc finger proteins: tools for
creating artificial DNA-binding functional proteins. Accounts of Chemical Research 39: 45-52.
30. Beltran, A., et al., 2006. Interrogating genomes with combinatorial artificial transcription
factor libraries: asking zinc finger questions. Assay and Drug Development Technologies 4:
317-331.
31. Blancafort, P., D.J. Segal, and C.F. Barbas III, 2004. Designing transcription factor
architectures for drug discovery. Molecular Pharmacology 66: 1361-1371.
32. Jamieson, A.C., J.C. Miller, and C.O. Pabo, 2003. Drug discovery with engineered zinc-finger
proteins. Nature Reviews Drug Discovery 2: 361-368.
33. Kaiser, J., 2005. Putting the fingers on gene repair. Science 310: 1894-11896.
34. Papworth, M., P. Kolasinska, and M. Minczuk, 2006. Designer zinc-finger proteins and their
applications. Gene 366: 27-38.
35. Porteus, M.H. and D. Carroll, 2005. Gene targeting using zinc finger nucleases. Nature
Biotechnology 23: 967-973.
36. Segal, D.J., et al., 2003. Evaluation of a Modular Strategy for the Construction of Novel
Polydactyl Zinc Finger DNA-Binding Proteins. Biochemistry 42: 2137-2148.
37. Urnov, F.D., 2005. Highly efficient endogenous human gene crrection using designed zinc-
finger nucleases. Nature 435: 646-651.
38. Lombardo, A., et al., 2007. Gene editing in human stemm cells using zinc finger nucleases
and integrase-defective lentiviral vector delivery. Nature Biotechnology 25: 1298-1306.
39. Howard, P.L., et al., 2003. Redirecting tyrosine kinase signaling to an apoptotic caspase
pathway through chimeric adaptor proteins. Proceedings of the National Academy of Sciences
of the United States of America 100: 11267-11272.
40. Yeh, B.J. and W.A. Lim, 2007. Synthetic biology: lessons from the history of synthetic organic
chemistry. Nature Chemical Biology 3: 521-525.
41. Kaminski, P.-A. and P. Marliere, Method for the in vivo modification of the synthesis activity of
a metabolite by means of the modification of a gene the activitiy of which is not theoriginal
activity 2004, Institut Pasteur.
42. Sismour, A.M., et al., 2004. PCR amplification of DNA containing non-standard base pairs by
variants of reverse transcriptase from Human Immunodeficiency Virus-1. Nucleic Acids
Research 32: 728-35. Epub: 2004 Feb 02.
43. Bishop, A., et al., 2000. Unnatural ligands for engineered proteins: New tools for chemical
genetics. Annual Review of Biophysics and Biomolecular Structure 29: 577–606.
44. Filipovska, A. and O. Rackham, 2008. Building a parallel metabolism within the cell. ACS
Chemical Biology 3: 51-63.
45. Balagadde, F.K., et al., 2005. Long-term monitoring of bacteria undergoing programmed
population control in a microchemostat. Science 309: 137-140.
46. Lartigue, C., et al., 2007. Genome transplantation in bacteria: changing one species to
another. Science 317: 632-638.
47. Smith, H.O., et al., 2003. Generating a synthetic genome by whole genome assembly: X174
bacteriophage from synthetic oligonucleotides. Proceedings of the National Academy of
Sciences of the United States of America 100: 15440-15445.
48. Gibson, D.G., et al., 2008. Complete chemical synthesis, assembly, and cloning of a
Mcyoplasma genitalium genome. Science in press.
49. Kunkel, T.A., 2004. DNA replication fidelity. Journal of Biological Chemistry 279: 16895–
16898.
50. Sanger, F., S. Nicklen, and A.R. Coulson, 1977. DNA sequencing with chain-terminating
inhibitors. Proceedings of the National Academy of Sciences of the United States of America 74:
5463-5467.
51. Margulies, M., et al., 2005. Genome sequencing in microfabricated high-density picolitre
reactors. Nature 437: 376-380.
52. Ro, D.K., et al., 2006. Production of the antimalarial drug precursor artemisinic acid in
engineered yeast. Nature 440: 940-943.
- 60 -
Synthetic Biology – Enginering in Biotechnology
53. Kodumal, S.J., et al., 2004. Total synthesis of long DNA sequences: synthesis of a contiguous
32-kb polyketide synthase gene cluster. Proceedings of the National Academy of Sciences of the
United States of America 101: 15573-15578.
54. Tian, J., et al., 2004. Accurate multiplex gene synthesis from programmable DNA microchips.
Nature 432: 1050-1054.
55. Greger, B. and B. Kemper, 1998. An apyrimidinic site kinks DNA and triggers incision by
endonuclease VII of phage T4. Nucleic Acids Research 26: 4432-4438.
56. Smith, J. and P. Modrich, 1997. Removal of polymerase-produced mutant sequences from
PCR products. Proceedings of the National Academy of Sciences of the United States of America
94: 6847-6850.
57. Cello, J., A.V. Paul, and E. Wimmer, 2002. Chemical synthesis of poliovirus cDNA: Generation
of infectious virus in the absence of natural template. Science 297: 1016-1018.
58. Agarwal, K.L., et al., 1970. Total synthesis of the gene for an alanine transfer ribonucleic acid
from yeast. Nature 227: 27-34.
59. Itakura, K., et al., 1977. Expression in Escherichia coli of a chemically synthesized gene for the
hormone somatostatin. Science 198: 1056-1063.
60. Khorana, H.G., 1979. Total synthesis of a gene. Science 203: 614-625.
61. Tumpey, T.M., et al., 2005. Characterization of the reconstructed 1918 Spanish influenza
pandemic virus Science 310: 77-80.
62. Kobasa, D., et al., 2007. Aberrant innate immune response in lethal infection of macaques
with the 1918 influenza virus. Nature 445: 319-323.
63. Larionov, V., et al., 1996. Specific cloning of human DNA as yeast artificial chromosomes by
transformation-associated recombination. Proceedings of the National Academy of Sciences of
the United States of America 93: 491-496.
64. Itaya, M., et al., 2005. Combining two genomes in one cell: Stable cloning of the Synechocystis
PCC6803 genome in the Bacillus subtilis 168 genome. Proceedings of the National Academy of
Sciences of the United States of America 102: 15971-15976.
65. Itaya, M., et al., 2008. Bottom-up genome assembly using the Bacillus subtilis genome vector.
Nature Methods 5: 41-43.
66. Carlson, R., 2003. The pace and proliferation of biological technologies. Biosecurity and
Bioterrorism 1: 203-14.
67. von Heeren, F., et al., 1996. Micellar electrokinetic chromatography separations and analyses
of biological samples on a cyclic planar microstructure. Analytical Chemistry 68: 2044-2053.
68. Lichtenberg, J.B.H. and H. Baltes, Nanofluidics - structures and devices, in Enabling
Technologies for MEMS and Nanodevices, B. H., et al., Editors. 2004, Wiley-VCH. p. 319-356.
69. Hong, J.W., et al., 2006. Molecular biology on a microfluidic chip. Journal of Physics-
Condensed Matter 18: S691-S701.
70. Gomez-Sjöberg, R., et al., 2007. Versatile, fully automated, microfluidic cell culture system.
Analytical Chemistry 79: 8557-8563.
71. Marcy, Y., et al., 2007. Dissecting biological ‘‘dark matter’’ with single-cell genetic analysis of
rare and uncultivated TM7 microbes from the human mouth. Proceedings of the National
Academy of Sciences of the United States of America 104: 11889–11894.
72. Maerkl, S.J. and S.R. Quake, 2007. A systems approach to measuring the binding energy
landscapes of transcription factors. Science 315: 233-237.
73. Hong, J.W., et al., 2004. A nanoliter-scale nucleic acid processor with parallel architecture.
Nature Biotechnology 22: 435-439.
74. Northrup M.A., C.R.M., Watson R.T. DNA amplification with a microfabricated reaction
chamber. in Proceedings of the 7th international conference on solid state sensors and
actuators. 1993. Yokohama, Japan.
75. Kopp, M.U., A.J. de Mello, and A. Manz, 1998. Chemical amplification: Continuous-flow PCR
on a chip. Science 280: 1046-1048.
76. Liu, J., C. Hansen, and S.R. Quake, 2003. Solving the "world-to-chip" interface problem with
a microfluidic matrix. Analytical Chemistry 75: 4718-4723.
77. Geschke, O., H. Klank, and P. Tellermann, Microsystem Engineering of Lab-on-a-Chip Devices.
2004, Weiheim: Wiley-VCH.
- 61 -
Synthetic Biology – Enginering in Biotechnology
78. Engström J., I.M., Ekstrand G., Eckersten A., Dérand H., Lindman S., Andersson P.; Gyros
AB, Uppsala, Sweden. Quantitative bioaffinity assays of crude protein mixtures performed at
nanoliter scale in a CD microlaboratory. in HPCE 2002. HPCE 2002, Stockholm, Sweden.
79. Marcus, J.S., W.F. Anderson, and S.R. Quake, 2006. Microfluidic single-cell mRNA isolation
and analysis. Analytical Chemistry 78: 3084-3089.
80. Zhong, J.F., et al., 2008. A microfluidic processor for gene expression profiling of single
human embryonic stem cells. Lab on a Chip 8: 68–74.
81. Elowitz, M.B. and S. Leibler, 2000. A synthetic oscillatory network of transcriptional
regulators. Nature 403: 335-338.
82. de Lorenzo, V., et al., 1993. Engineering of alkyl- and haloaromatic-responsive gene
expression with mini-transposons containing regulated promoters of biodegradative pathways
of Pseudomonas. Gene 130: 41-46.
83. Herrero, M., et al., 1993. A T7 RNA polymerase-based system for the construction of
Pseudomonas strains with phenotypes dependent on TOL-meta pathway effectors. Gene 134:
103-106.
84. Knight, T., Idempotent vector design for standard assembly of biobricks, in MIT Synthetic
Biology Working Group Technical Reports. 2003, MIT Artificial Intelligence Laboratory. p. doi:
1721.1/21168.
85. Sprinzak, D. and M.B. Elowitz, 2005. Reconstruction of genetic circuits. Nature 438: 443-448.
86. Gardner, T.S., C.R. Cantor, and J.J. Collins, 2000. Construction of a genetic toggle switch in
Escherichia coli. Nature 403: 339-342.
87. Elowitz, M.B., et al., 2002. Stochastic gene expression in a single cell. Science 297: 1183-
1186.
88. Rosenfeld, N., et al., 2005. Gene regulation at the single-cell level. Science 307: 1962-1965.
89. Basu, S., et al., 2004. Spatiotemporal control of gene expression with pulse-generating
networks. Proceedings of the National Academy of Sciences of the United States of America 101:
6355-6360.
90. Kramer, B.P. and M. Fussenegger, 2005. Hysteresis in a synthetic mammalian gene network.
Proceedings of the National Academy of Sciences of the United States of America 102: 9517-
9522.
91. Rosenfeld, N., et al., 2007. Accurate prediction of gene feedback circuit behavior from
component properties. Molecular Systems Biology 3: 143.
92. Khlebnikov, A., et al., 2001. Homogeneous expression of the PBAD promoter in Escherichia coli
by constitutive expression of the low-affinity high-capacity AraE transporter. Microbiology
147: 3241-3247.
93. Ozbudak, E.M., et al., 2004. Multistability in the lactose utilization network of Escherichia
coli. Nature 427: 737-740.
94. Anderson, J.C., et al., 2006. Environmentally controlled invasion of cancer cells by engineered
bacteria. Journal of Molecular Biology 355: 619-627.
95. Yokobayashi, Y., R. Weiss, and F.H. Arnold, 2002. Directed evolution of a genetic circuit.
Proceedings of the National Academy of Sciences of the United States of America 99: 16587-
16591.
96. Martin, V.J.J., et al., 2003. Engineering a mevalonate pathway in Escherichia coli for
production of terpenoids. Nature Biotechnology 21: 796-802.
97. Pfleger, B.F., et al., 2006. Combinatorial engineering of intergenic regions in operons tunes
expression of multiple genes. Nature Biotechnology 24: 1027-1032.
98. Park, S.-Y., A. Zarrinpar, and W.A. Lim, 2003. Rewiring MAP kinase pathways using
alternative scaffold assembly mechanisms. Science 299: 1061-1064.
99. Menzella, H.G., et al., 2005. Combinatorial polyketide biosynthesis by de novo design and
rearrangement of modular polyketide synthase genes. Nature Biotechnology 23: 1171-1176.
100. Chandran, S.S., et al., 2006. Activating hybrid modular interfaces in synthetic polyketide
synthases by cassette replacement of ketosynthase domains. Chemistry & Biology 13: 469–
474.
101. Menzella, H.G., J.R. Carney, and D.V. Santi, 2007. Rational design and assembly of synthetic
trimodular polyketide synthases. Chemistry & Biology 14: 143–151.
- 62 -
Synthetic Biology – Enginering in Biotechnology
102. Koonin, E.V., 2000. How many genes can make a cell: The minimal-gene-set concept. Annual
Review of Genomics and Human Genetics 1: 99-116.
103. Kitano, H., 2004. Biological robustness. Nature Reviews Genetics 5: 826-837.
104. Gil, R., et al., 2004. Determination of the core of a minimal bacterial gene set. Microbiology
and Molecular Biology Reviews 68: 518-537.
105. Hutchison III, C.A., et al., 1999. Global transposon mutagenesis and a minimal Mycoplasma
genome. Science 86: 2165-2169.
106. Glass, J.I., et al., 2006. Essential genes of a minimal bacterium. Proceedings of the National
Academy of Sciences of the United States of America 103: 425-430.
107. Kobayashi, K., et al., 2003. Essential Bacillus subtilis genes. Proceedings of the National
Academy of Sciences of the United States of America 100: 4678–4683.
108. Posfai, G., et al., 2006. Emergent properties of reduced-genome Escherichia coli. Science 312:
1044-1046.
109. Glass, J.I., et al., Minimal bacterial genome, U. PTO, Editor. 2007, J. Craig Venter Institute.
110. Yu, B.J., et al., 2002. Minimization of the Escherichia coli genome using a Tn5-targeted
Cre/loxP excision system. Nature Biotechnology 20: 1018-1023.
111. Goryshin, I.Y., et al., 2003. Chromosomal Deletion Formation System Based on Tn5 Double
Transposition: Use For Making Minimal Genomes and Essential Gene Analysis. Genome
Research 13: 644-653.
112. Hashimoto, M., et al., 2005. Cell size and nucleoid organization of engineered Escherichia coli
cells with a reduced genome. Molecular Microbiology 55: 137-149.
113. Kolisnychenko, V., et al., 2002. Engineering a reduced Escherichia coli genome. Genome
Research 12: 640-647.
114. Sharma, S.S., et al., 2007. Expression of two recombinant chloramphenicol acetyltransferase
variants in highly reduced genome Escherichia coli strains. Biotechnology and Bioengineering
98: 1056-1070.
115. Sharma, S.S., F.R. Blattner, and S.W. Harcum, 2007. Recombinant protein production in an
Escherichia coli reduced genome strain. Metabolic Engineering 9: 133-141.
116. Mizoguchi, H., H. Mori, and T. Fujio, 2007. Escherichia coli minimum genome factory.
Biotechnology and Applied Biochemistry 46: 157-167.
117. Westers, H., et al., 2003. Genome engineering reveals large dispensable regions in Bacillus
subtilis. Molecular Biology and Evolution 20: 2076-2090.
118. Ara, K., et al., 2007. Bacillus minimum genome factory: effective utilization of microbial
genome information. Biotechnology and Applied Biochemistry 46: 169–178.
119. Brazma, A., et al., 2001. Minimum information about a microarray experiment (MIAME)-
toward standards for microarray data. Nature Genetics 29: 365 - 371.
120. Hucka, M., et al., 2003. The systems biology markup language (SBML): a medium for
representation and exchange of biochemical network models. Bioinformatics 19: 524-531.
121. Le Novere, N., et al., 2005. Minimum information requested in the annotation of biochemical
models (MIRIAM) Nature Biotechnology 23: 1509 - 1515.
122. Henkel, J. and S. Maurer, 2007. The economics of synthetic biology. Molecular Systems
Biology 3: 117.
123. Rai, A. and J. Boyle, 2007. Synthetic Biology:Caught between property rights, the public
domain, and the commons. PLOS Biology 5: e58.
124. Blattner, F.R., et al., Bacteria with reduced genome, PTO, Editor. 2003, Wisconsin Alumni
Research Foundation.
125. ETC_Group, Extreme genetic engineering. 2007. p.
http://www.etcgroup.org/en/materials/publications.html?id=602.
126. de Vriend, H., Constructing life: Early social reflections on the emerging field of synthetic
biology. 2006, The Rathenau Institute. p.
http://www.rathenauinstituut.com//showpage.asp?steID=2&item=1288&searching=construc
ting%20life.
127. Tucker, J.B. and R.A. Zilinskas, 2006. The promise and perils of synthetic biology. The New
Atlantis: 25-45.
- 63 -
Synthetic Biology – Enginering in Biotechnology
128. Kramer, B.P., et al., 2004. An engineered epigenetic transgene switch in mammalian cells.
Nature Biotechnology 22: 867-870.
129. Randerson, J., Revealed: the lax laws that could allow assembly of deadly virus DNA, in The
Guardian. 2006. p. http://www.guardian.co.uk/world/2006/jun/14/terrorism.topstories3.
130. Agency, C.I., The darker bioweapons future. 2003, Central Intelligence Agency. p.
http://www.fas.org/irp/cia/product/bw1103.pdf.
131. Brent, R., 2006. Power and responsibility.
132. Fink, G., et al., Biotechnology research in an age of terrorism: confronting the dual use
dilemma. 2003, Committee on Research Standards and Practices to Prevent the Destructive
Application of Biotechnology - National Research Council of the National Academies.
133. Garfinkel, M.S., et al., Synthetic Genomics - Options for Governance. 2007.
134. Kelle, A., Synthetic Biology & Biosecurity Awareness In Europe, in Bradford Science and
Technology Reports. 2007, University of Bath.
135. Bügl, H., et al., 2007. DNA synthesis and biological security. Nature Biotechnology 25: 627-
629.
136. Cho, M.K., et al., 1999. Ethical considerations in synthesizing a minimal genome. Science
286: 2087-2090.
- 64 -