Академический Документы
Профессиональный Документы
Культура Документы
DNA microarray gene expression data analysis has provided new insights into gene function, dis-
ease pathophysiology, disease classification, and drug development. Biclustering in gene expression
data is a subset of the genes demonstrating consistent patterns over a subset of the conditions. The
proposed work finds the significant biclusters in large expression data using a novel optimization
technique called stellar-mass black hole optimization (SBO). This optimization algorithm is inspired
from the property of the relentless pull of a black holes gravity that is present in the Universe.
The proposed work is tested on benchmark optimization test functions and gene expression bench-
mark datasets, and the results are compared with swarm intelligence techniques such as particle
swarm optimization (PSO), and cuckoo search (CK). The experimental results show that the SBO
outperforms PSO and CK.
INTRODUCTION
Microarray technology attracts keen interest among the scientific
community and in industry. Through an analysis of gene expression data,
it is possible to learn the behavioral patterns of genes such as similar
behavior (Lockhart and Winzeler 2000). These patterns are used in the
medical domain to aid in more accurate diagnoses, treatment planning,
and drug discovery. DNA microarray enables researchers to analyze the
expression of many genes in a single experiment rapidly and in an efficient
manner. After the number of preprocessing steps, the low-level analysis of
a microarray can be represented as a numerical matrix. In this matrix, the
rows represent different genes and the columns represent experimental
conditions (Androulakis, Yang, and Almon 2007). Each element of the
matrix represents the expression level of a gene under a specific condition,
FIGURE 1 Gene expression data matrix. Mark Reimers/Exploratory Analysis. Reproduced by permis-
sion of Mark Reimers/Exploratory Analysis. Permission to reuse must be obtained from the rightsholder.
BICLUSTERING
Biclustering is a powerful analytical tool for the biologist. Biclustering
methods perform clustering in two dimensions simultaneously. That means
clustering methods derive a global model and biclustering algorithms pro-
duce a local model. When clustering is applied to gene expression data,
genes as well as conditions can be clustered. However, each gene in a
bicluster is selected using only a subset of the conditions, and each condition
in a bicluster is selected using only a subset of the genes. Biclustering, thus,
performs simultaneous clustering of both rows and columns of the gene
expression matrix instead of clustering these two dimensions separately.
In short, unlike clustering algorithms, biclustering algorithms can identify
groups of genes that show similar activity patterns under a specific subset of
the experimental conditions. Biclustering is used when one or more of the
following situations apply (Madeira and Oliveira 2004):
356 R. Balamurugan et al.
1. A single gene may participate in multiple pathways that may or may not
be coactive under all conditions.
2. Only a small set of the genes participate in a cellular process of interest.
3. An interesting cellular process is active only in a subset of the conditions.
Structure of Biclusters
Several types of biclusters have been described and categorized in the
literature, depending on the pattern exhibited by the genes across the exper-
imental conditions (Mukhopadhyay, Maulik, and Bandyopadhyay 2010).
Madeira and Oliveira (2004) have identified four major groups of structures
inside the submatrices:
aij =
aij = + i + j or aij = i j
Problem Statement
The gene expression data can be represented as N M matrix A of
real numbers. Let G be a set of genes, C a set of conditions, and A(G, C)
the expression matrix, where G = {1,2, . . . , m} and C = {1,2, . . . , n}. The
element GExi,j of A(G, C) represents the expression level of gene i under
SBO for Biclustering Gene Expression Data 357
PSO together with crowding distance as the nearest neighbor search strategy,
which speeds up the convergence to the Pareto front and also guarantees
diversity of solutions. Using PSO has the problems of dependency on initial
point and parameters, difficulty in finding their optimal design parameters,
and the stochastic characteristic of the final outputs. Coelho, de Franca,
and Zuben (2009) presented an immune-inspired algorithm for biclustering
based on the concepts of clonal selection and immune network theories
adopted in the original aiNet algorithm. It explores a multipopulation aspect
by evolving several subpopulations that will be stimulated to explore distinct
regions of the search space. Nevertheless, they consider only two different
objectives: MSR and volume; it might return trivial biclusters.
Ayadi, Elloumi, and Hao (2012) proposed as a pattern-driven neigh-
borhood search (PDNS) approach to the biclustering problem. It utilizes
fast greedy algorithms to generate diversified initial biclusters of reasonable
quality and a randomized perturbation strategy. This has the drawback that
it results in highly overlapping gene sets. Huang et al. (2012) proposed a
new biclustering algorithm based on the use of an evolutionary approach
(EA) together with hierarchical clustering. The parallel computing tech-
nology would be of great help to speed up the traditional EA framework,
but its disadvantage is that it extracts a small volume of biclusters. Roy,
Bhattacharyya, and Kalita (2013) introduced a CoBi: a pattern-based coreg-
ulated biclustering of gene expression data. It is used mainly for grouping
both positively and negatively regulated genes from microarray expression
data. However, it returns small size biclusters for large MSR value. Maatouk
et al. (2014) proposed an evolutionary biclustering algorithm based on the
new crossover method (EBACross). In order to avoid overlapping biclusters
and to increase the diversification of biclusters, a mutation operator used.
Nevertheless, it possibly takes a very long time on large inputs. Ayadi and
Hao (2014) presented a memetic biclustering algorithm (MBA) to discover
the negative correlation biclusters. It is used mainly for grouping both
positively and negatively regulated genes from microarray expression data.
Extracting small biclusters and quality of biclusters depending on threshold
values are the main issues of this approach.
that marks the point of no return. In 1915, the presence of black holes
was predicted through Einsteins theory of general relativity. In 1967, John
Wheeler, an American theoretical physicist, applied the term black hole
for the collapsed objects (Luminet 1998). A black hole is an object that is
so compact, its gravitational force prevents anything, including light, from
escaping. There are so many black holes in the universe that it is impossible
to count them. A black hole is born when an object becomes unable to with-
stand the compressing force of its own gravity. They form whenever massive
but otherwise normal stars die. The black holes cannot be seen but can be
detected by material being attracted to and falling into them.
The hole is called black because it absorbs all the light that hits the
horizon, reflecting nothing, just as a perfect blackbody in thermodynamics.
In this way, astronomers have identified and measured the mass of many
black holes in the universe through careful observations of the sky. All matter
in a black hole is squeezed into a region of infinitely small volume, called the
central singularity. The event horizon is an imaginary sphere that measures
how close to the singularity an object can safely get. Once an object has
passed the event horizon, it becomes impossible to escape: the object will
be drawn in by the black holes gravitational pull and will be squashed into
the singularity. The size of the event horizon is proportional to the mass of
the black hole. According to theory, there might be three different types of
black holes depending on their mass: Stellar, Supermassive, and Miniature as
shown in Figure 3. These black holes would have been formed in different
ways. Supermassive black holes, which can have a mass equivalent to billions
of suns, likely exist in the centers of most galaxies. The miniature black hole
would have a mass much smaller than that of a sun. The miniature black
holes could have formed shortly after the big bang.
(a) Stellar-Mass Black Hole (b) Supermassive Black Hole (c) Miniature Black Hole
NASA/CXC/M.Weiss. Reproduced by NASA/CfA/HSCfA. Reproduced by NASA/JPL-caltech. Reproduced by
permission of NASA/CXC/M.Weiss. permission of NASA/CfA/HSCfA. permission of NASA/JPL-caltech.
Permission to reuse must be obtained Permission to reuse must be obtained Permission to reuse must be obtained
from the rightsholder. from the rightsholder. from the rightsholder.
Methodology
Based on the Stephen Hawking emission and absorption process, when
the black hole absorbs matter, the mass of the black hole is increased; during
emission its mass is reduced. Stellar-mass black holes can grow by pulling gas
of a companion star that orbits around it. The diet of known black holes con-
sists mostly of gas and dust, which fill the otherwise empty space throughout
the universe. Black holes can also consume material torn from nearby stars.
In fact, the most massive black holes can swallow stars whole. They can also
grow by colliding and merging with other black holes. A black hole gathers
any mass, then it grows in mass and slightly in size. The mass decreases via
a complicated process called black hole evaporation, or Hawking Radiation,
named for Stephen Hawking. Hawking radiation is black body radiation that
is predicted to be released by black holes due to quantum effects near the
event horizon.
The stellar-mass black hole has the following properties:
cumulative changes that occur in a population over time. Table 1 shows the
comparison of social strategies with SBO.
The perpetual mystery of the black holes is that they appear to exist
on radically different mass scales. There are the countless black holes that
are left over from massive stars. These stellar-mass black holes are peppered
throughout the Universe and are generally 10 to 24 times as massive as the
sun. Scientists estimate that there are as many as ten million to a billion
such black holes in the Milky Way alone. For the optimization problem, the
initial population of black holes with m dimensions is considered as a can-
didate solution for the problem. Each black hole is assigned a fitness value
according to the solution of the problem.
Evolution
The distance from the surface (or event horizon) of the black hole to
the center (or singularity) is called the Schwarzschild radius. This radius is
the size that any object would be if compressed enough to form a black hole;
it varies from only centimeters across to several kilometers across (Chaisson
and McMillan 2008). The Schwarzschild radius increases when the mass of
the original object increases but the actual radius does not. It is interesting
to mention that the irreducible mass is related to the area A of the event
horizon as shown below:
A
M = .
16
One way to grow a more massive black hole is for a seed black hole in
a dense galactic nucleus to swallow up gas and normal stars. It keeps on
growing by absorbing mass from its surroundings. By absorbing other stars
and merging with other black holes, supermassive black holes of millions of
solar masses may form. If two black holes merge together, then their event
horizons are contiguous. Figure 4 shows a growth of black hole and collision
of two black holes (Luminet 1998).
SBO for Biclustering Gene Expression Data 363
fi
i = . (4)
N
fi
i =I
The most peculiar behavior that a black hole possesses is its ability to
radiate energy and through this process lose mass in the previously men-
tioned Hawking radiation process (Hamilton 1998). Hawkings radiation
theory of black holes also finds that the energy lost from a black hole is
inversely proportional to its mass. Larger mass black holes have low tem-
peratures and, therefore, dont radiate much energy. They also radiate at
low frequencies (e.g., radio or microwave frequencies). Smaller mass black
holes emit more energy; hence, they have higher temperatures and higher
frequency for their energy emission. Finally, the subsequent implication of
the inverse thermal/mass relationship is that a black hole will suffer from
thermal radiation runaway effects as its mass gets smaller toward the end of
its life, meaning that it will lose mass faster, the smaller it becomes. The black
hole with the lowest mass has the highest radiation. The radiation level i of
an individual black hole is given in Equation (5):
364 R. Balamurugan et al.
i = 1 i (5)
The mass of the black hole is changed based on the absorption and emis-
sion level. The mass of the black hole is updated using the following
Equation (6):
A spinning black hole drags space around with it and allows matter to
orbit closer to it. By measuring the spin of distant black holes, researchers
discover important clues about how these objects grow over time. If black
holes grow mainly from collisions and mergers between galaxies, they should
accumulate material in a stable disk, and the steady supply of new mate-
rial from the disk should lead to rapidly spinning black holes. In contrast,
if black holes grow through many small accretion episodes, they will accu-
mulate material from random directions. Stellar-mass black holes can grow
by pulling gas of a companion star that orbits around it. Based on the
characteristics, the growth of ith black hole in dth dimension is given in
Equation (7):
i bi,d (t) + i+1 bi+1 ,d (t) i=1
ai,d = i1 bi1, d (t) + i bi,d (t) + i+1 bi+1, d (t) 1< i < N
i1 bi1 ,d (t) + i bi,d (t) i =N
(7)
Smaller black holes have more powerful radiation than larger ones. The
emissivity of a black hole is measured by the proportion of the actual emitted
radiance from a black hole, which is given in Equation (8):
for evaporation is simple: the black hole radiates, which causes the mass
to decrease; as the mass decreases, radiation increases; as the radiation
increases, the black hole evaporates more quickly and finally the black hole
is no more (Gottesman 1997). The black holes with very poor fitness value
will be removed from the population.
lowest fitness values will be replaced by new black holes. It causes diver-
sity in the solution. If the two black holes are close, then the merging of
the black holes forms a new black hole, avoiding premature convergence.
The population size is fixed. So the new black holes are generated for those
left out.
Biclustering Representation
Each bicluster is encoded as a black hole in the universe. Size m + n
is the fixed length of every black hole, where m and n are the number of
genes and conditions of the microarray dataset, respectively. The first m bits
represent m genes and the following n bits represent n conditions. Each
bicluster is represented by a fixed-sized binary string called a black hole, with
a bit string of genes attached to another bit string for conditions. The black
hole represents a candidate solution for the optimal bicluster generation
problem. A bit is set to one if the corresponding gene and/or condition
is present in the bicluster, otherwise it is reset to zero. Figure 5 shows an
encoded representation of a bicluster.
The SBO works well for continuous optimization problems, so the indi-
vidual dimension of a black hole is represented by a real number. For binary
SBO, the following sigmoid function (Tasgetiren and Liang 2007) is used to
scale the velocities between 0 and 1, which is then used for converting them
to the binary values. That is,
1
sigmoid(bid ) = . (10)
1 + e bid
Figure 6 depicts the representation of the black hole and its mapping
to a bicluster. The new position of the particle is updated based on
Equation (11):
1, if U (0, 1) < sigmoid(bid )
bid = . (11)
0, otherwise
Fitness Function
Biclusters with coherent values are biologically more relevant than
biclusters with constant values. Hence, in this work, biclusters with coher-
ent values are identified. In order to identify the degree of coherence,
MSR score or Hscore is used. The MSR problem is proposed by Cheng
and Church (2000) for identifying biclusters. Let the gene expression data
matrix A have M rows and N columns where a cell aij is a real value that
represents the expression level of gene i under the condition j. Matrix A is
defined by its set of rows, R = {r 1 , r 2 , . . . , rM } and its set of columns C =
{c 1 , c 2 , . . . , cN }. Given a matrix, biclustering finds submatrices that are sub-
groups of genes and subgroups of conditions, where the genes exhibit highly
correlated behavior for every condition. Given a data matrix A, the goal is
to find a set of biclusters such that each bicluster exhibits some similar char-
acteristics. Let AIJ = (I, J ) represent a submatrix of A where I R and J
C. AIJ contains only the elements aij belonging to the submatrix with a set of
rows I and a set of columns J . The concept of bicluster was introduced by
Cheng and Church (2000) to find correlated subsets of genes and a subset
of conditions.
Let aiJ denote the mean of the ith row of the bicluster (I, J ), aIj the
mean of the jth column of (I, J ), and aIJ the mean of all the elements in the
bicluster. As given more formally,
1
a I, j = ai,j
J
jJ
1
a i, J = ai,j
|I |
iI
1
a I, J = ai,j
|I | J iI jJ
The difference between the actual value of aij and its expected value
predicted from its row, column, and bicluster mean are given by the residue
of an element. It also reveals its degree of coherence with the other entries
368 R. Balamurugan et al.
The lowest score of H (I,J ) is 0, which indicates that the gene expres-
sion levels vary in harmony. This includes the trivial or constant biclusters
where there is no variation. These trivial biclusters might not be fascinat-
ing but need to be discovered and masked so more interesting ones can be
found. The gene variance may be a complementary score to discard trivial
biclusters. The gene variance can be represented in Equation (13) as follows:
1
Varr I , J = vr (i)
|I |
iI
(13)
1 2
r (i) = ai, j ai, J .
J
jJ
to be minimized. In this way, the smaller the residue and the larger the gene
variance are, the smaller the fitness value, in other words, the better the
quality of that bicluster is. The final objective of the SBO algorithm is to
minimize the fitness.
FIGURE 7 Intersection and union of given two biclusters BCi and BCj .
where
FIGURE 8 Finding the overlapping between the biclusters from BC1 to BC4 .
371
372 R. Balamurugan et al.
The bold font values represent the optimum values achieved by the algorithms.
The bold font values represent the optimum values achieved by the algorithms.
TABLE 5 Standard Deviation (Mean) Optimum Values of the SBO, CK, and PSO
The bold font values represent the optimum values achieved by the algorithms.
TABLE 6 Mean Iteration of the Function Evaluation Values of the SBO, CK, and PSO
The bold font values represent the optimum values achieved by the algorithms.
Datasets
The proposed algorithms for the bicluster problems are coded in
MATLAB and run on an Intel i3 3.7 GHz. The biclustering algorithm
has been applied to four well-known datasets in order to study its per-
formance: the yeast Saccharomyces cerevisiae stress expression data (Gasch
et al. 2000), Arabidopsis thaliana expression data (Bleuler, Prelic, and Zitzler
2004), yeast Saccharomyces cerevisiae cell cycle expression data (Cho et al.
1998), and rat CNS expression data (Wen et al. 1998). The first Gasch
yeast is the Saccharomyces cerevisiae with 2993 genes and 173 conditions.
The second one, Arabidopsis thaliana expression data, contain 734 genes
and 69 conditions. The third dataset, yeast Saccharomyces cerevisiae cell cycle
expression, contains 2884 genes and 17 experimental conditions. The rat
CNS dataset has a set of 112 genes under 9 conditions. Through empir-
ical analysis, the population size and the number of iterations are set
as 20 and 200 respectively. Figures 912 show the fitness value obtained
for Saccharomyces cerevisiae expression data, Arabidopsis thaliana expression
data, yeast Saccharomyces cerevisiae cell cycle expression data, and rat CNS
expression data, respectively. The PSO has premature convergence due to
stagnation. For all the datasets, the proposed SBO outperforms the PSO and
CK algorithms because intensification and diversification is involved with the
SBO algorithm.
374 R. Balamurugan et al.
FIGURE 9 Plot of number of iterations versus fitness value for yeast stress data.
FIGURE 10 Plot of number of iterations versus fitness value for Arabidopsis thaliana data.
FIGURE 11 Plot of number of iterations versus fitness value for yeast cell cycle data.
FIGURE 12 Plot of number of iterations versus fitness value for rat CNS data.
of MSR score to the size of the formed bicluster. Tables 710 show sample
experimental results obtained for Saccharomyces cerevisiae expression data,
Arabidopsis thaliana expression data, yeast Saccharomyces cerevisiae cell cycle
expression data, and rat CNS expression data, respectively, and the biclusters
are chosen randomly from 20 biclusters. Clearly, Figure 13 shows a sample
bicluster of size 10 5 for Arabidopsis thaliana data.
FIGURE 13 Plot of sample biclusters of size 10 5 for Arabidopsis thaliana expression data.
where G and r are the number of genes associated with the given category in
the gene expression matrix and in the bicluster, respectively, while M and m
are the number of genes in the gene expression matrix and in the biclusters,
respectively. The p-value is the probability that the genes are selected into
the cluster by random. If the p-value is smaller, then the match will be better.
The annotations of genes for three ontologies including biological process,
cellular component, and molecular function are obtained.
Bicluster No. of
No. Genes Process Function Component
identify the characteristics that the genes might have in common. Table 11
lists the significant shared GO terms used to describe the set of genes in
each bicluster for the process, function, and component ontologies. Only
the most significant terms are shown. For example, to the bicluster BC1 ,
the genes are mainly involved in binding activity. The tuple (n = 490, p =
7.82 108 ) represents that out of 1500 genes in bicluster BC1 , 490 genes
belong to binding activity function, and the statistical significance is given by
the p-value of p = 7.82 108 . In Figure 14, the biological network of the
bicluster with 10 genes, it is clear that the false discovery rate (FDR) is very
low and it is zero on many occasions. Further, the corresponding p-value is
very small (p = 0.000562), which shows that there is a very low probability
to obtain the gene cluster in random. These results mean that the proposed
SBO biclustering approach can find biologically meaningful biclusters.
FIGURE 14 Gene ontology biological process of yeast stress data with 10 genes.
Enrichment
GO Number Ontology Description Score Genes p-value
CONCLUSIONS
In this work, a novel algorithm, stellar-mass black hole optimization
(SBO), is proposed for biclustering gene expression data. This optimization
algorithm is deduced from the properties of a stellar-mass black hole. It is a
380 R. Balamurugan et al.
FUNDING
This work is supported by the research grant from Department of
Biotechnology (DBT), Government of India, New Delhi, Grant No. BT/
PR3741/BID/382/2011.
REFERENCES
Androulakis, I. P., E. Yang, and R. R. Almon. 2007. Analysis of time-series gene expression data: Methods,
challenges, and opportunities. Annual Review of Biomedical Engineering 9(2):205228.
Ayadi, W., M. Elloumi, and J. Hao. 2012. Pattern-driven neighborhood search for biclustering microarray
data. BMC Bioinformatics 13(7): S11.
Ayadi, W., and J. K. Hao. 2014 A memetic algorithm for discovering negative correlation biclusters of dna
microarray data. Neurocomputing 145:1422.
Ben-Dor, A., B. Chor, R. Karp, and Z. Yakhini. 2003. Discovering local structure in gene expression data:
The order-preserving submatrix problem. Journal of Computational Biology 10(4):373384.
Bergmann, S., J. Ihmels, and N. Barkai. 2003. Iterative signature algorithm for the analysis of large-scale
gene expression data. Physical Review E 67:118.
Bleuler, S., A. Prelic, and E. Zitzler. 2004. An EA framework for biclustering of gene expression data. In
Proceedings of the congress of IEEE on evolutionary computation 166173. IEEE.
Chaisson, E., and S. McMillan. 2008. Astronomy today: Stars and galaxies (5th ed,). UK: Benjamin
Cummings/Pearson.
Cheng, Y., and G. M. Church. 2000. Biclustering of expression data. In Proceedings of the 8th international
conference on intelligent systems for molecular biology 8:93103. San Diego, CA, USA: AAAI Press.
Cho, R. J., M. J. Campbell, E. A. Winzeler, L. Steinmetz, A. Conway, L. Wodicka, T. G. Wolfsberg, A. E.
Gabrielian, D. Landsman, and D. J. Lockhart. 1998. A genome-wide transcriptional analysis of the
mitotic cell cycle. Molecular Cell 2:6573.
Civicioglu, P., and E. Besdok. 2011. A conceptual comparison of the cuckoo-search, particle swarm opti-
mization, differential evolution and artificial bee colony algorithms. Artificial Intelligence Review
doi:10.1007/s10462-011-9276-0.
Coelho, G. P., F. O. de Franca, and F. J. V. Zuben. 2009. Multi-objective biclustering: When non-dominated
solutions are not enough. Journal of Mathematical Modelling and Algorithms 8(2):175202.
SBO for Biclustering Gene Expression Data 381
DiMaggio, P., S. McAllister, C. Floudas, X. Feng, J. Rabinowitz, and H. Rabitz. 2008. Biclustering via
optimal re-ordering of data matrices in systems biology: Rigorous methods and comparative studies.
BMC Bioinformatics 9(1):458467.
Divina, F., and J. S. Aguilar-Ruiz. 2006. Biclustering of expression data with evolutionary computation.
IEEE Transactions on Knowledge Data Engineering 18(5):590602.
Gasch, A. P., P. T. Spellman, C. M. Kao, O. Carmel-Harel, M. B. Eisen, G. Storz, D. Botstein, and P.
O. Brown. 2000. Genomic expression programs in the response of yeast cells to environmental
changes. Molecular Biology of the Cell 11:42414257.
Gottesman, D. 1997. Stabilizer codes and quantum error correction. quant-ph/9705052 (PhD thesis,
California Institute of Technology, Pasadena, CA, USA).
Hamilton, A. J. S. 1998. The evolving universe. Dordrecht, The Netherlands: Kluwer Academic.
Hartigan, J. A. 1972. Direct clustering of a data matrix. Journal of the American Statistical Association
67(337):123129.
Huang, Q., D. Tao, X. Li, and A.W.C. Liew. 2012. Parallelized evolutionary learning for detection of
biclusters in gene expression data. IEEE/ACM Transactions on Computational Biology and Bioinformatics
9(1):560570.
Jaccard, P. 1912. The distribution of the flora in the alpine zone. New Phytologist 11(2):3750.
Karaboga, D., and B. Akay. 2009. A comparative study of artificial bee colony algorithm. Applied
Mathematics and Computation 214(12):108132
Liu, C. L. 1968. Introduction to combinatorial mathematics. New York, NY: McGraw-Hill.
Liu, J., Z. Li, X. Hu, and Y. Chen. 2009. Biclustering of microarray data with mospo based on crowding
distance. BMC Bioinformatics 10(4):S9.
Liu, X., and L. Wang. 2007. Computing the maximum similarity bi-clusters of gene expression data. BMC
Bioinformatics 23(1):5056.
Lockhart, D. J., and E. A. Winzeler. 2000. Genomics, gene expression and DNA arrays. Nature
405:827836.
Luminet, J. P. 1998. Black holes: A general introduction. In Black holes: Theory and observation, Lecture
Notes in Physics 514:334. Berlin, Heidelberg: Springer-Verlag.
Maatouk, O., W. Ayadi, H. Bouziri, and B. Duval. 2014. Evolutionary algorithm based on new crossover for
the biclustering of gene expression data. In Pattern Recognition in Bioinformatics, LNCS 8626:4859.
Switzerland: Springer International.
Madeira, S. C., and A. L. Oliveira. 2004. Biclustering algorithms for biological data analysis: A survey.
IEEE/ACM Transactions on Computational Biology and Bioinformatics 1(1):2445.
Mitra, S., and H. Banka. 2006. Multi-objective evolutionary biclustering of gene expression data. Pattern
Recognition 39(12):24642477.
Mukhopadhyay, A., U. Maulik, and S. Bandyopadhyay. 2010. On biclustering of gene expression data.
Current Bioinformatics. 5:204216.
Murali, T., and S. Kasif. 2003. Extracting conserved gene expression motifs from gene expression data.
Pacific Symposium on Biocomputing 8:7788.
Roy, S., D. K. Bhattacharyya, and J. K. Kalita. 2013. CoBi: Pattern based co-regulated biclustering of gene
expression data. Pattern Recognition 34(14):16691678.
Tanay, A., R. Sharan, and R. Shamir. 2002. Discovering statistically significant biclusters in gene
expression data. BMC Bioinformatics 18:136144.
Tasgetiren, M. F., and Y. C. Liang. 2007. A binary particle swarm optimization algorithm for lot sizing
problem. Journal of Economic and Social Research 5(2):120.
Wen, X., S. Fuhrman, G. S. Michaels, D. B. Carr, S. Smith, J. L. Barker, and R. Somogyi. 1998. Large-
scale temporal gene expression mapping of central nervous system development. Proceedings of the
National Academy of Sciences 95(1):334339.
Yang, X. S., and S. Deb. 2009. Cuckoo search via Levy flights. In Proceedings of the IEEE world congress on
nature & biologically inspired computing 210214. IEEE Publications.
Copyright of Applied Artificial Intelligence is the property of Taylor & Francis Ltd and its
content may not be copied or emailed to multiple sites or posted to a listserv without the
copyright holder's express written permission. However, users may print, download, or email
articles for individual use.