1
|
Shang Z, Hao JK, Ma F. A double-decomposition based parallel exact algorithm for the feedback length minimization problem. PeerJ Comput Sci 2023; 9:e1597. [PMID: 37869465 PMCID: PMC10588706 DOI: 10.7717/peerj-cs.1597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Accepted: 08/28/2023] [Indexed: 10/24/2023]
Abstract
Product development projects usually contain many interrelated activities with complex information dependences, which induce activity rework, project delay and cost overrun. To reduce negative impacts, scheduling interrelated activities in an appropriate sequence is an important issue for project managers. This study develops a double-decomposition based parallel branch-and-prune algorithm, to determine the optimal activity sequence that minimizes the total feedback length (FLMP). This algorithm decomposes FLMP from two perspectives, which enables the use of all available computing resources to solve subproblems concurrently. In addition, we propose a result-compression strategy and a hash-address strategy to enhance this algorithm. Experimental results indicate that our algorithm can find the optimal sequence for FLMP up to 27 activities within 1 h, and outperforms state of the art exact algorithms.
Collapse
Affiliation(s)
- Zhen Shang
- School of Economics and Management, Chang’an University, Xi’an, China
| | | | - Fei Ma
- School of Economics and Management, Chang’an University, Xi’an, China
| |
Collapse
|
2
|
Sun W, Hao JK, Wu Z, Li W, Wu Q. Dynamic thresholding search for the feedback vertex set problem. PeerJ Comput Sci 2023; 9:e1245. [PMID: 37346631 PMCID: PMC10280643 DOI: 10.7717/peerj-cs.1245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 01/17/2023] [Indexed: 06/23/2023]
Abstract
Given a directed graph G = (V, E), a feedback vertex set is a vertex subset C whose removal makes the graph G acyclic. The feedback vertex set problem is to find the subset C* whose cardinality is the minimum. As a general model, this problem has a variety of applications. However, the problem is known to be NP-hard, and thus computationally challenging. To solve this difficult problem, this article develops an iterated dynamic thresholding search algorithm, which features a combination of local optimization, dynamic thresholding search, and perturbation. Computational experiments on 101 benchmark graphs from various sources demonstrate the advantage of the algorithm compared with the state-of-the-art algorithms, by reporting record-breaking best solutions for 24 graphs, equally best results for 75 graphs, and worse best results for only two graphs. We also study how the key components of the algorithm affect its performance of the algorithm.
Collapse
Affiliation(s)
- Wen Sun
- School of Cyber Science and Engineering, Southeast University, Nanjing, China
| | | | - Zihao Wu
- School of Cyber Science and Engineering, Southeast University, Nanjing, China
| | - Wenlong Li
- School of Cyber Science and Engineering, Southeast University, Nanjing, China
| | - Qinghua Wu
- School of Management, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
3
|
Abstract
The clique partitioning problem (CPP) of an edge-weighted complete graph is to partition the vertex set V into k disjoint subsets such that the sum of the edge weights within all cliques induced by the subsets is as large as possible. The problem has a number of practical applications in areas, such as data mining, engineering, and bioinformatics, and is, however, computationally challenging. To solve this NP-hard problem, we propose the first evolutionary algorithm that combines a dedicated merge-divide crossover operator to generate offspring solutions and an effective simulated annealing-based local optimization procedure to find high-quality local optima. The extensive experiments on three sets of 94 benchmark instances (including two sets of 63 classical benchmark instances and one new set of 31 large benchmark) show a remarkable performance of the proposed approach compared to the state-of-the-art methods. We analyze the key algorithmic ingredients to shed light on their impacts on the performance of the algorithm. The algorithm and its available source code can benefit people working on practical problems related to CPP.
Collapse
|
4
|
Lu Y, Hao JK, Wu Q. Solving the clustered traveling salesman problem via traveling salesman problem methods. PeerJ Comput Sci 2022; 8:e972. [PMID: 35721414 PMCID: PMC9202618 DOI: 10.7717/peerj-cs.972] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 04/13/2022] [Indexed: 06/15/2023]
Abstract
The Clustered Traveling Salesman Problem (CTSP) is a variant of the popular Traveling Salesman Problem (TSP) arising from a number of real-life applications. In this work, we explore a transformation approach that solves the CTSP by converting it to the well-studied TSP. For this purpose, we first investigate a technique to convert a CTSP instance to a TSP and then apply powerful TSP solvers (including exact and heuristic solvers) to solve the resulting TSP instance. We want to answer the following questions: How do state-of-the-art TSP solvers perform on clustered instances converted from the CTSP? Do state-of-the-art TSP solvers compete well with the best performing methods specifically designed for the CTSP? For this purpose, we present intensive computational experiments on various benchmark instances to draw conclusions.
Collapse
Affiliation(s)
- Yongliang Lu
- School of Economics and Management, Fuzhou University, Fuzhou, China
| | | | - Qinghua Wu
- School of Management, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
5
|
|
6
|
|
7
|
|
8
|
|
9
|
|
10
|
|
11
|
|
12
|
|
13
|
Abstract
Critical node problems (CNPs) involve finding a set of critical nodes from a graph whose removal results in optimizing a predefined measure over the residual graph. As useful models for a variety of practical applications, these problems are computationally challenging. In this paper, we study the classic CNP and introduce an effective memetic algorithm for solving CNP. The proposed algorithm combines a double backbone-based crossover operator (to generate promising offspring solutions), a component-based neighborhood search procedure (to find high-quality local optima), and a rank-based pool updating strategy (to guarantee a healthy population). Extensive evaluations on 42 synthetic and real-world benchmark instances show that the proposed algorithm discovers 24 new upper bounds and matches 15 previous best-known bounds. We also demonstrate the relevance of our algorithm for effectively solving a variant of the classic CNP, called the cardinality-constrained CNP. Finally, we investigate the usefulness of each key algorithmic component.
Collapse
|
14
|
|
15
|
|
16
|
|
17
|
|
18
|
|
19
|
Lee JH, Zhao XM, Yoon I, Lee JY, Kwon NH, Wang YY, Lee KM, Lee MJ, Kim J, Moon HG, In Y, Hao JK, Park KM, Noh DY, Han W, Kim S. Integrative analysis of mutational and transcriptional profiles reveals driver mutations of metastatic breast cancers. Cell Discov 2016; 2:16025. [PMID: 27625789 PMCID: PMC5004232 DOI: 10.1038/celldisc.2016.25] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2016] [Accepted: 06/21/2016] [Indexed: 12/11/2022] Open
Abstract
Despite the explosion in the numbers of cancer genomic studies, metastasis is still the major cause of cancer mortality. In breast cancer, approximately one-fifth of metastatic patients survive 5 years. Therefore, detecting the patients at a high risk of developing distant metastasis at first diagnosis is critical for effective treatment strategy. We hereby present a novel systems biology approach to identify driver mutations escalating the risk of metastasis based on both exome and RNA sequencing of our collected 78 normal-paired breast cancers. Unlike driver mutations occurring commonly in cancers as reported in the literature, the mutations detected here are relatively rare mutations occurring in less than half metastatic samples. By supposing that the driver mutations should affect the metastasis gene signatures, we develop a novel computational pipeline to identify the driver mutations that affect transcription factors regulating metastasis gene signatures. We identify driver mutations in ADPGK, NUP93, PCGF6, PKP2 and SLC22A5, which are verified to enhance cancer cell migration and prompt metastasis with in vitro experiments. The discovered somatic mutations may be helpful for identifying patients who are likely to develop distant metastasis.
Collapse
Affiliation(s)
- Ji-Hyun Lee
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University, Seoul, Republic of Korea; Research Institute of Pharmaceutical Sciences, College of Pharmacy, Seoul National University, Seoul, Republic of Korea
| | - Xing-Ming Zhao
- Department of Computer Science and Technology, Tongji University , Shanghai, China
| | - Ina Yoon
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Jin Young Lee
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Nam Hoon Kwon
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Yin-Ying Wang
- Department of Computer Science and Technology, Tongji University , Shanghai, China
| | - Kyung-Min Lee
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Min-Joo Lee
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Jisun Kim
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Hyeong-Gon Moon
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Yongho In
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Jin-Kao Hao
- LERIA, University of Angers , Angers, France
| | - Kyung-Mii Park
- Research Institute of Pharmaceutical Sciences, College of Pharmacy, Seoul National University , Seoul, Republic of Korea
| | - Dong-Young Noh
- Department of Surgery, Seoul National University College of Medicine , Seoul, Republic of Korea
| | - Wonshik Han
- Department of Surgery, Seoul National University College of Medicine, Seoul, Republic of Korea; Cancer Research Institute, Seoul National University, Seoul, Republic of Korea
| | - Sunghoon Kim
- Medicinal Bioconvergence Research Center, College of Pharmacy, Seoul National University, Seoul, Republic of Korea; Department of Molecular Medicine and Biopharmaceutical Sciences, Seoul National University, Seoul, Republic of Korea
| |
Collapse
|
20
|
|
21
|
|
22
|
|
23
|
|
24
|
Zhang X, Zhao J, Hao JK, Zhao XM, Chen L. Conditional mutual inclusive information enables accurate quantification of associations in gene regulatory networks. Nucleic Acids Res 2015; 43:e31. [PMID: 25539927 PMCID: PMC4357691 DOI: 10.1093/nar/gku1315] [Citation(s) in RCA: 83] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 12/03/2014] [Accepted: 12/05/2014] [Indexed: 11/13/2022] Open
Abstract
Mutual information (MI), a quantity describing the nonlinear dependence between two random variables, has been widely used to construct gene regulatory networks (GRNs). Despite its good performance, MI cannot separate the direct regulations from indirect ones among genes. Although the conditional mutual information (CMI) is able to identify the direct regulations, it generally underestimates the regulation strength, i.e. it may result in false negatives when inferring gene regulations. In this work, to overcome the problems, we propose a novel concept, namely conditional mutual inclusive information (CMI2), to describe the regulations between genes. Furthermore, with CMI2, we develop a new approach, namely CMI2NI (CMI2-based network inference), for reverse-engineering GRNs. In CMI2NI, CMI2 is used to quantify the mutual information between two genes given a third one through calculating the Kullback-Leibler divergence between the postulated distributions of including and excluding the edge between the two genes. The benchmark results on the GRNs from DREAM challenge as well as the SOS DNA repair network in Escherichia coli demonstrate the superior performance of CMI2NI. Specifically, even for gene expression data with small sample size, CMI2NI can not only infer the correct topology of the regulation networks but also accurately quantify the regulation strength between genes. As a case study, CMI2NI was also used to reconstruct cancer-specific GRNs using gene expression data from The Cancer Genome Atlas (TCGA). CMI2NI is freely accessible at http://www.comp-sysbio.org/cmi2ni.
Collapse
Affiliation(s)
- Xiujun Zhang
- Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China Department of Mathematics, Xinyang Normal University, Xinyang 464000, China School of Chemical and Biomedical Engineering, Nanyang Technological University, Singapore 637459, Singapore
| | - Juan Zhao
- Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China
| | - Jin-Kao Hao
- LERIA, Department of Computer Science, University of Angers, Angers 49045, France
| | - Xing-Ming Zhao
- Department of Computer Science, School of Electronics and Information Engineering, Tongji University, Shanghai 201804, China
| | - Luonan Chen
- Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China Collaborative Research Center for Innovative Mathematical Modelling, Institute of Industrial Science, University of Tokyo, Tokyo 153-8505, Japan
| |
Collapse
|
25
|
Zhao XM, Liu KQ, Zhu G, He F, Duval B, Richer JM, Huang DS, Jiang CJ, Hao JK, Chen L. Identifying cancer-related microRNAs based on gene expression data. Bioinformatics 2014; 31:1226-34. [DOI: 10.1093/bioinformatics/btu811] [Citation(s) in RCA: 70] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Accepted: 12/04/2014] [Indexed: 12/19/2022] Open
|
26
|
|
27
|
Zhao XM, Ngom A, Hao JK. Pattern recognition in bioinformatics. Neurocomputing 2014. [DOI: 10.1016/j.neucom.2014.06.035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
28
|
Tessier D, Laroum S, Duval B, Rath EM, Church WB, Hao JK. In silico evaluation of the influence of the translocon on partitioning of membrane segments. BMC Bioinformatics 2014; 15:156. [PMID: 24885988 PMCID: PMC4035737 DOI: 10.1186/1471-2105-15-156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2013] [Accepted: 05/14/2014] [Indexed: 11/10/2022] Open
Abstract
Background The locations of the TM segments inside the membrane proteins are the consequence of a cascade of several events: the localizing of the nascent chain to the membrane, its insertion through the translocon, and the conformation adopted to reach its stable state inside the lipid bilayer. Even though the hydrophobic h-region of signal peptides and a typical TM segment are both composed of mostly hydrophobic side chains, the translocon has the ability to determine whether a given segment is to be inserted into the membrane. Our goal is to acquire robust biological insights into the influence of the translocon on membrane insertion of helices, obtained from the in silico discrimination between signal peptides and transmembrane segments of bitopic proteins. Therefore, by exploiting this subtle difference, we produce an optimized scale that evaluates the tendency of each amino acid to form sequences destined for membrane insertion by the translocon. Results The learning phase of our approach is conducted on carefully chosen data and easily converges on an optimal solution called the PMIscale (Potential Membrane Insertion scale). Our study leads to two striking results. Firstly, with a very simple sliding-window prediction method, PMIscale enables an efficient discrimination between signal peptides and signal anchors. Secondly, PMIscale is also able to identify TM segments and to localize them within protein sequences. Conclusions Despite its simplicity, the localization method based on PMIscale nearly attains the highest level of TM topography prediction accuracy as the current state-of-the-art prediction methods. These observations confirm the prominent role of the translocon in the localization of TM segments and suggest several biological hypotheses about the physical properties of the translocon.
Collapse
Affiliation(s)
- Dominique Tessier
- INRA, UR1268 Biopolymères Interactions et Assemblages, Nantes F-44316, France.
| | | | | | | | | | | |
Collapse
|
29
|
|
30
|
|
31
|
Zhang X, Liu K, Liu ZP, Duval B, Richer JM, Zhao XM, Hao JK, Chen L. NARROMI: a noise and redundancy reduction technique improves accuracy of gene regulatory network inference. Bioinformatics 2012; 29:106-13. [DOI: 10.1093/bioinformatics/bts619] [Citation(s) in RCA: 107] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
|
32
|
|
33
|
Liu KQ, Liu ZP, Hao JK, Chen L, Zhao XM. Identifying dysregulated pathways in cancers from pathway interaction networks. BMC Bioinformatics 2012; 13:126. [PMID: 22676414 PMCID: PMC3443452 DOI: 10.1186/1471-2105-13-126] [Citation(s) in RCA: 100] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2011] [Accepted: 05/21/2012] [Indexed: 12/04/2022] Open
Abstract
Background Cancers, a group of multifactorial complex diseases, are generally caused by mutation of multiple genes or dysregulation of pathways. Identifying biomarkers that can characterize cancers would help to understand and diagnose cancers. Traditional computational methods that detect genes differentially expressed between cancer and normal samples fail to work due to small sample size and independent assumption among genes. On the other hand, genes work in concert to perform their functions. Therefore, it is expected that dysregulated pathways will serve as better biomarkers compared with single genes. Results In this paper, we propose a novel approach to identify dysregulated pathways in cancer based on a pathway interaction network. Our contribution is three-fold. Firstly, we present a new method to construct pathway interaction network based on gene expression, protein-protein interactions and cellular pathways. Secondly, the identification of dysregulated pathways in cancer is treated as a feature selection problem, which is biologically reasonable and easy to interpret. Thirdly, the dysregulated pathways are identified as subnetworks from the pathway interaction networks, where the subnetworks characterize very well the functional dependency or crosstalk between pathways. The benchmarking results on several distinct cancer datasets demonstrate that our method can obtain more reliable and accurate results compared with existing state of the art methods. Further functional analysis and independent literature evidence also confirm that our identified potential pathogenic pathways are biologically reasonable, indicating the effectiveness of our method. Conclusions Dysregulated pathways can serve as better biomarkers compared with single genes. In this work, by utilizing pathway interaction networks and gene expression data, we propose a novel approach that effectively identifies dysregulated pathways, which can not only be used as biomarkers to diagnose cancers but also serve as potential drug targets in the future.
Collapse
Affiliation(s)
- Ke-Qin Liu
- Institute of Systems Biology, Shanghai University, Shanghai 200444, China
| | | | | | | | | |
Collapse
|
34
|
Abstract
BACKGROUND Biclustering aims at finding subgroups of genes that show highly correlated behaviors across a subgroup of conditions. Biclustering is a very useful tool for mining microarray data and has various practical applications. From a computational point of view, biclustering is a highly combinatorial search problem and can be solved with optimization methods. RESULTS We describe a stochastic pattern-driven neighborhood search algorithm for the biclustering problem. Starting from an initial bicluster, the proposed method improves progressively the quality of the bicluster by adjusting some genes and conditions. The adjustments are based on the quality of each gene and condition with respect to the bicluster and the initial data matrix. The performance of the method was evaluated on two well-known microarray datasets (Yeast cell cycle and Saccharomyces cerevisiae), showing that it is able to obtain statistically and biologically significant biclusters. The proposed method was also compared with six reference methods from the literature. CONCLUSIONS The proposed method is computationally fast and can be applied to discover significant biclusters. It can also used to effectively improve the quality of existing biclusters provided by other biclustering methods.
Collapse
Affiliation(s)
- Wassim Ayadi
- LERIA, Université d'Angers, 2 Boulevard Lavoisier, 49045 Angers Cedex 01, France
- LaTICE, Higher School of Sciences and Technologies of Tunis, 5 Avenue Taha Hussein, B. P. : 56, Bab Menara, 1008 Tunis, University of Tunis, Tunisia
| | - Mourad Elloumi
- LaTICE, Higher School of Sciences and Technologies of Tunis, 5 Avenue Taha Hussein, B. P. : 56, Bab Menara, 1008 Tunis, University of Tunis, Tunisia
| | - Jin-Kao Hao
- LERIA, Université d'Angers, 2 Boulevard Lavoisier, 49045 Angers Cedex 01, France
| |
Collapse
|
35
|
|
36
|
Zhang X, Zhao XM, He K, Lu L, Cao Y, Liu J, Hao JK, Liu ZP, Chen L. Inferring gene regulatory networks from gene expression data by path consistency algorithm based on conditional mutual information. ACTA ACUST UNITED AC 2011; 28:98-104. [PMID: 22088843 DOI: 10.1093/bioinformatics/btr626] [Citation(s) in RCA: 165] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
MOTIVATION Reconstruction of gene regulatory networks (GRNs), which explicitly represent the causality of developmental or regulatory process, is of utmost interest and has become a challenging computational problem for understanding the complex regulatory mechanisms in cellular systems. However, all existing methods of inferring GRNs from gene expression profiles have their strengths and weaknesses. In particular, many properties of GRNs, such as topology sparseness and non-linear dependence, are generally in regulation mechanism but seldom are taken into account simultaneously in one computational method. RESULTS In this work, we present a novel method for inferring GRNs from gene expression data considering the non-linear dependence and topological structure of GRNs by employing path consistency algorithm (PCA) based on conditional mutual information (CMI). In this algorithm, the conditional dependence between a pair of genes is represented by the CMI between them. With the general hypothesis of Gaussian distribution underlying gene expression data, CMI between a pair of genes is computed by a concise formula involving the covariance matrices of the related gene expression profiles. The method is validated on the benchmark GRNs from the DREAM challenge and the widely used SOS DNA repair network in Escherichia coli. The cross-validation results confirmed the effectiveness of our method (PCA-CMI), which outperforms significantly other previous methods. Besides its high accuracy, our method is able to distinguish direct (or causal) interactions from indirect associations. AVAILABILITY All the source data and code are available at: http://csb.shu.edu.cn/subweb/grn.htm. CONTACT lnchen@sibs.ac.cn; zpliu@sibs.ac.cn SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiujun Zhang
- Institute of Systems Biology, Shanghai University, Shanghai 200444, China
| | | | | | | | | | | | | | | | | |
Collapse
|
37
|
|
38
|
Hamiez JP, Hao JK, Glover FW. A Study of Tabu Search for Coloring Random 3-Colorable Graphs Around the Phase Transition. International Journal of Applied Metaheuristic Computing 2010. [DOI: 10.4018/jamc.2010100101] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The authors present an experimental investigation of tabu search (TS) to solve the 3-coloring problem (3-COL). Computational results reveal that a basic TS algorithm is able to find proper 3-colorings for random 3-colorable graphs with up to 11000 vertices and beyond when instances follow the uniform or equipartite well-known models, and up to 1500 vertices for the hardest class of flat graphs. This study also validates and reinforces some existing phase transition thresholds for 3-COL.
Collapse
|
39
|
Abstract
This paper discusses a particular “packing” problem, namely the two dimensional strip packing problem, where a finite set of objects have to be located in a strip of fixed width and infinite height. The variant studied considers regular items, rectangular to be precise, that must be packed without overlap, not allowing rotations. The objective is to minimize the height of the resulting packing. In this regard, the authors present a local search algorithm based on the well-known tabu search metaheuristic. Two important components of the presented tabu search strategy are reinforced in attempting to include problem knowledge. The fitness function incorporates a measure related to the empty spaces, while the diversification relies on a set of historically “frozen” objects. The resulting reinforced tabu search approach is evaluated on a set of well-known hard benchmark instances and compared with state-of-the-art algorithms.
Collapse
|
40
|
Ayadi W, Elloumi M, Hao JK. A biclustering algorithm based on a bicluster enumeration tree: application to DNA microarray data. BioData Min 2009; 2:9. [PMID: 20015398 PMCID: PMC2804695 DOI: 10.1186/1756-0381-2-9] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2009] [Accepted: 12/16/2009] [Indexed: 12/03/2022] Open
Abstract
BACKGROUND In a number of domains, like in DNA microarray data analysis, we need to cluster simultaneously rows (genes) and columns (conditions) of a data matrix to identify groups of rows coherent with groups of columns. This kind of clustering is called biclustering. Biclustering algorithms are extensively used in DNA microarray data analysis. More effective biclustering algorithms are highly desirable and needed. METHODS We introduce BiMine, a new enumeration algorithm for biclustering of DNA microarray data. The proposed algorithm is based on three original features. First, BiMine relies on a new evaluation function called Average Spearman's rho (ASR). Second, BiMine uses a new tree structure, called Bicluster Enumeration Tree (BET), to represent the different biclusters discovered during the enumeration process. Third, to avoid the combinatorial explosion of the search tree, BiMine introduces a parametric rule that allows the enumeration process to cut tree branches that cannot lead to good biclusters. RESULTS The performance of the proposed algorithm is assessed using both synthetic and real DNA microarray data. The experimental results show that BiMine competes well with several other biclustering methods. Moreover, we test the biological significance using a gene annotation web-tool to show that our proposed method is able to produce biologically relevant biclusters. The software is available upon request from the authors to academic users.
Collapse
Affiliation(s)
- Wassim Ayadi
- UTIC, Higher School of Sciences and Technologies of Tunis, 1008 Tunis, Tunisia
- LERIA, Université d'Angers, 2 Boulevard Lavoisier, 49045 Angers, France
| | - Mourad Elloumi
- UTIC, Higher School of Sciences and Technologies of Tunis, 1008 Tunis, Tunisia
| | - Jin-Kao Hao
- LERIA, Université d'Angers, 2 Boulevard Lavoisier, 49045 Angers, France
| |
Collapse
|
41
|
Affiliation(s)
- Béatrice Duval
- University of Angers, 2 Boulevard Lavoisier, 49045 Angers Cedex 01, France
| | | |
Collapse
|
42
|
Abstract
Gene subset selection is essential for classification and analysis of microarray data. However, gene selection is known to be a very difficult task since gene expression data not only have high dimensionalities, but also contain redundant information and noises. To cope with these difficulties, this paper introduces a fuzzy logic based pre-processing approach composed of two main steps. First, we use fuzzy inference rules to transform the gene expression levels of a given dataset into fuzzy values. Then we apply a similarity relation to these fuzzy values to define fuzzy equivalence groups, each group containing strongly similar genes. Dimension reduction is achieved by considering for each group of similar genes a single representative based on mutual information. To assess the usefulness of this approach, extensive experimentations were carried out on three well-known public datasets with a combined classification model using three statistic filters and three classifiers.
Collapse
|
43
|
Richer JM, Goëffon A, Hao JK. A Memetic Algorithm for Phylogenetic Reconstruction with Maximum Parsimony. Evolutionary Computation, Machine Learning and Data Mining in Bioinformatics 2009. [DOI: 10.1007/978-3-642-01184-9_15] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/13/2023]
|
44
|
Abstract
The Maximum Parsimony (MP) problem aims at reconstructing a phylogenetic tree from DNA sequences while minimizing the number of genetic transformations. To solve this NP-complete problem, heuristic methods have been developed, often based on local search. In this article, we focus on the influence of the neighborhood relations. After analyzing the advantages and drawbacks of the well-known Nearest Neighbor Interchange (NNI), Subtree Pruning Regrafting (SPR) and Tree-Bisection-Reconnection (TBR) neighborhoods, we introduce the concept of Progressive Neighborhood (PN) which consists in constraining progressively the size of the neighborhood as the search advances. We empirically show that applied to the Maximum Parsimony problem, this progressive neighborhood turns out to be more efficient and robust than the classic neighborhoods using a descent algorithm. Indeed, it allows to find better solutions with a smaller number of iterations or trees evaluated.
Collapse
|
45
|
Abstract
This paper presents GASAT, a hybrid algorithm for the satisfiability problem (SAT). The main feature of GASAT is that it includes a recombination stage based on a specific crossover and a tabu search stage. We have conducted experiments to evaluate the different components of GASAT and to compare its overall performance with state-of-the-art SAT algorithms. These experiments show that GASAT provides very competitive results.
Collapse
|
46
|
|
47
|
Zhu ZX, Yan ZQ, Yu SZ, Zhang RX, Wang JY, Liu YM, Hao JK, Zhang XL, Yu SL, He QN, Meng ZW. Studies on the phenomenon of latent propagated sensation along the channels. I. The discovery of a latent PSC and a preliminary study of its skin electrical conductance. Am J Chin Med 1981; 9:216-24. [PMID: 7053021 DOI: 10.1142/s0192415x81000299] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
By combining electrical impulse stimuli with mechanical ones, we are able to trace out a new line of feeling, coincident with the classical channel course. Impulse stimulation was carried out at the Jing point of the Large Intestine or Stomach channel, namely Shangyan or Lidui. With a small rubber nipple, light taps were applied on the skin along the lines perpendicular to the channel and crossing over the acupuncture points; a specific propagational numb feeling at the points of the channel could be found. By linking up these points of specific feeling, an imaginary line which is exactly the classical Large Intestine or Stomach channel can be traced out. This line is called the "latent propagational sensation line along the channels" because, unless through tapping, no prominent sensation of propagation could be felt. Employing an impulsive electrical generator and an all-wave commutating circuit linked to a micro-ammeter, the skin conductance was measured over the latent PSC of the Large Intestine channel lying between the wrist and 5 cm above the elbow joint. Results were compared to those locations of 1 cm apart from the channel course, i.e. the control sites devoid of acupuncture points or channels. At most acupuncture points or any site of the channel course on all of the 10 subjects under examination, there was greater electrical conductance maxima than there was at control sites. This fact indicates that not only the acupuncture points, but the entire course of latent PSC are also of higher electrical conductance.
Collapse
|