Hybrid Genetic Algorithm For Maximum Clique Problem: Volume 2, Issue 4, April 2013
Hybrid Genetic Algorithm For Maximum Clique Problem: Volume 2, Issue 4, April 2013
Hybrid Genetic Algorithm For Maximum Clique Problem: Volume 2, Issue 4, April 2013
ABSTRACT
Maximum clique problem is one of the most important NP-hard problems which find its applications in numerous fields ranging from networking to the determination of the structure of a protein molecule. The present work carries forward one of our earlier works and examines the effect of variation of parameters for achieving optimization. The results obtained have been presented in the paper and are extremely encouraging. The premise used in the work is Genetic Algorithms, owing to their exceptional searching capabilities. The technique can be extended to many unsolvable problems and can be used in many applications from loop determination and circuit solving.
Keywords: Genetic Algorithms, NP Hard Problems, Maximum clique problem, Searching techniques.
1. INTRODUCTION
The maximum clique problem refers to finding out the maximum clique in the graph G= (V,E), where V represents the set of vertices and E= (x,y): x,y V. Finding out a clique, in a graph, is one of the original NP hard problems. It is not possible to craft a P algorithm for the problem. The problem is immensely important as it is used in error correcting codes and tilling geometry problems. It has recently been established that the matching of 3-D structure is important in the modeling of a protein structure. Since, maximum clique problem helps in recognition of cliques, therefore the problem reduction approach may help us to map the maximum clique problem to the determination of protein structure. The premise used to obtain solutions of the problem, in the work, is Genetic Algorithms. Genetic algorithms are known for their ability to find out the best possible solution from amongst a given set, if optimization is to be achieved. The present work correlates maximum clique problem with Genetic Algorithms to find an optimal solution. The work is an extension of one of our earlier works [3]. In our earlier work the crossover rate was taken as 2% and mutation rate as 0.5%. In the present work, the parameters have been varied and the earlier algorithm has been modified. The results obtained are much better than earlier results. In order to instill confidence in the proposed technique an extensive literature review has been carried out to find the gaps in the existing methodologies. The proposed technique has been implemented in C#, .net frame work 4.0 and gives encouraging results. The rest of the paper has been organized as follows. Section 2 of the paper presents the literature review. Section 3 explains the Genetic Algorithms. Section 4 presents the proposed technique. Section 4 discusses the experimental verification and finally, section 5 discusses the conclusions and future scope.
2. LITERATURE REVIEW
An extensive literature review has been carried out in order to find the gaps in the exiting technique and to propose the new technique. The literature review has been carried out in accordance with guidelines proposed by Kitchenham [23]. It may be stated that the papers of high quality journals have been selected and as far as possible the review has been unbiased. An extensive review of the existing techniques forms the foundation stone of a good research. Therefore an extra effort has been made in order to carry out the appraisal. The results of the review have been summarized in Table 1. Table 1: Review of Maximum Clique Problem
Page 421
Web Site: www.ijaiem.org Email: [email protected], [email protected] Volume 2, Issue 4, April 2013 ISSN 2319 - 4847
Author Name David R. Wood Reference No. 1 Technique Branch and Bound algorithm has been applied for finding vertex coloring. Heuristic have been used to determine lower and upper bound Randomly generated graphs with some heuristic functions. Genetic Algorithms Genetic Algorithms Polynomial time algorithm using NP completeness by inherently intractable problems. Heuristics based neural network. Parallelization of heuristic algorithm based on neural network. Enumerative and exact algorithms using heuristics search. The maximum clique problem is solvable in the polynomial time of O(n2+d).CLIQUES that generates all maximal cliques in a depth-first way in O(3n/3)-time. Bron-Kerbosch algorithm for maximum cliques. Maximum independent set of circle graphs Bron-Kerbosch algorithm for finding maximum cliques. Worst-case time complexity is O(3 n/3)for an n-vertex. Two algorithms for enumerating all maximal cliques. One runs with O(M(n)) time delay and in O(n2) space and the other runs with O(4) time delay and in O(n +m) space. Cliques for generating all maximal cliques. Random graphs and DIMACS benchmark graphs for finding maximum clique in an arbitrary graph. Non deterministic solution and artificial intelligence based solution using genetic algorithm. Efficient partition based algorithm with limited memory. Geometric structure of the circular arcs.
Panos M. Pardalos Harsh Bhasin, Rohan Mahajan Harsh Bhasin, Gitanjali Ahuja Michael R. Garey and David S. Johnson Roberto Cruj, Nancy Lopez R.Cruz and N. Lopez P.M. Pardalos and J. Xue Etsuji Tomita, Hiroaki Nakanishi
2 3 4 5 6 7 8 9
Coen Bron, Joep Kerbosch F. Gavril Etsuji Tomita, Akira Tanaka, Haruhisa Takahashi K. Makino, T. Uno
10 11 12
13
T. Nakagawa, E. Tomita Etsuji Tomita, Toshikatsu Kameda Harsh Bhasin, Nishant Gupta James Cheng, Yiping Ke Bhattacharya B.K,Kaller D
14 15 16 17 20
3. GENETIC ALGORITHM
Genetic algorithms (Gas) are used for optimization problem. GAs is currently being used in disciplines ranging from in engineering to social science. The current work uses a version of simple genetic algorithm. The three operators are mutation, crossover and replication. The use of GAs in optimization problem has been extensively explored by Goldberg. He applied genetic algorithm in natural gas pipeline and control the flow is governed by transition equation. The flow depends on the difference in pressure. It may be noted that the crossover and mutation operator cannot be applied in an arbitrary manner. This point was also observed by Goldberg. He chose crossover rate as 1% and mutation rate as 0.1% in the problems stated above. For applying Gas, encoding is done. The problem at hand is depicted a set of chromosomes. After the encoding the two operator crossover and mutation is applied. Crossover is applied to amalgamate the feature of the existing population whereas mutation is applied to break the local maxima. Broadly GAs can be described by the following steps. 1 Population Generation:-A population of chromosomes is generated. The population is generally binary. 2 Crossover: - From amongst the above chromosomes, two chromosomes are selected and a crossover point is taken with the help of which the new population is generated. This is referred to as one point crossover. However two point
Page 422
Web Site: www.ijaiem.org Email: [email protected], [email protected] Volume 2, Issue 4, April 2013 ISSN 2319 - 4847
crossover and multipoint crossover are also prevalent. 3 Mutation: - Mutation refers to the flipping of any random bit of random chromosomes to generate a new set of chromosomes. 4 Reproduction: - Reproduction operator helps us to replicate the fit chromosome again and again so as to increase the average fitness of the population.
5. PROPOSED WORK
Step 1: An initial population consisting of m chromosomes is taken. Each chromosome has n cells where n is the number of vertices in graph. The population is binary. Step 2: Each chromosome consist of 0s and 1s.The chromosomes are then mapped to the vertices of the graph. Step 3: Each chromosome is assigned an index based on the above step. The index will be more if the number of vertices selected is less and the sub graph is fully connected. We did not take into account the second condition in our previous work. Thus leads to misleading results. Step 4: For each chromosome, fitness value is calculated according to formula 1/ (1+e multiplied by a constant. The constant is determined by regression analysis. Step 5: A threshold is decided which is taken half the range of indices obtained in step 3. Step 6: Crossover and mutation are performed and new chromosomes are subjected to step 3 and 4. Here the difference from our previous work is that the crossover rate is taken as 2% and the mutation rate as 0.5% for optimal results. Step 7: Reproduction is carried out with Roulette wheel Selection. Step 8: The above process can be repeated and the results can be compared with previous iteration in order to improve solution. Here, the previous results are stored and act as a guided weight for the next step. The process is depicted in the following diagram.
Page 423
Web Site: www.ijaiem.org Email: [email protected], [email protected] Volume 2, Issue 4, April 2013 ISSN 2319 - 4847 6. RESULT AND CONCLUSION
In the above work, it has been again established that, applying the above technique results in the crafting of a set of solutions that provides almost certain answer to the problem. The Vertex Cover is an NP Hard problem. The method proposed has been explained in one of our earlier works and is verified again by varying the parameters of Genetic Algorithms. The algorithm has now been tested sufficient number of times. The above method has also undergone regress testing. We can now safely state that the above novel approach, that uses the blend of nature along with technical novelty, is capable of producing requisite results. It may be stated here that, this technique is being used to solve other NP hard problems as well.
References
[1] David R. Wood, An Algorithm for Finding a Maximum Clique in a Graph, Operations Research Letters, Vol. 21 , pp. 211-217 [2] Panos M. Pardalos, A exact algorithm for the Maximum clique problem, Operations Research Letters, Vol. 9, pp 375-382, 1990. [3] Harsh Bhasin, R. Mahajan, Genetic Algorithm Based Solution to Maximum Clique Problem, International Journal of Computer Science and Engineering, Vol. 4, 2012. [4] Harsh Bhasin, Gitanjali Ahuja, Harnessing Genetic Algorithm for Vertex Cover Problem, International Journal of Computer Science and Engineering, Vol. 4, 2012. [5] Michael R. Garey and David S. Johnson, Computers and Intractability: A Guide to the Theory of NPCompleteness, W.H. Freeman,1979 [6] Roberto Cruj, Nancy Lopez, A parallelization of a heuristic for the maximum Clique problem, Journal of Computing Sciences in Colleges, Vol. 17 , pp 62-70, 2001. [7] R.Cruz and N. Lopez, A Neural Network Approach to Solve the Maximum Clique Problem, In Proceedings of the V European Congress on Intelligent Techniques and Soft-Computing - EUFIT, Vol. 1, pp. 465-470, 1997. [8] P.M. Pardalos and J. Xue, The Maximum Clique Problem, Journal of Global Optimization, Vol. 4, pp. 301-328, 1994. [9] Etsuji Tomita, Hiroaki Nakanishi Polynomial Time solvability of the Maximum Clique problem, In Proceeding ECC'09 Proceedings of the 3rd international conference on European computing conference, pp 203-208. [10] Coen Bron, Joep Kerbosch, Finding all cliques in an undirected graph, Communications of the ACM, Vol. 16, pp 575-577, 1973. [11] F. Gavril, Algorithms for a maximum clique and a maximum independent set of a circle graph, Networks,Vol. 3, pp. 261-273, 1973. [12] Etsuji Tomita, Akira Tanaka, Haruhisa Takahashi, The worst-case time complexity for generating all maximal cliques and computational experiment Journal Theoretical Computer Science, Computing and combinatorics, Vol. 363 , 2006. [13] K. Makino, T. Uno, New algorithms for enumerating all maximal cliques, Lecture Notes in Computer Science, Vol. 3111, pp. 260-272, 2004. [14] T. Nakagawa, E. Tomita, Experimental comparisons of algorithms for generating all maximal cliques, Technical Report of IPSJ, MPS-52, pp. 41-44, 2004 [15] Etsuji Tomita, Toshikatsu Kameda, An efficient Branch and Bound algorithm for finding a maximum clique with computational experiment, Journal of Global Optimization Vol. 37, pp. 95-111, 2007. [16] Harsh Bhasin, Nishant Gupta, Randomized Algorithm Approach for Solving PCP, International Journal Of Computer Science and Engineering, Vol. 4, 2012. [17] James Cheng, Yiping Ke, Fast Algorithms for Maximal Clique Enumeration with Limited Memory, ACM 9781-4503-1462-6, 2012. [18] Harsh Bhasin, Neha Singla, Modified Genetic Algorithms Based Solution to Subset Sum Problem, International Journal of Advanced Research in Artificial Intelligence, Vol. 1, No. 1, 2012. [19] Harsh Bhasin, Neha Singla, Genetic based Algorithm for N Puzzle Problem, International Journal of Computer Applications, Vol. 51, No.22, 2012. [20] Bhattacharya B.K,Kaller D, An O(m+nlogn) Algorithm for the Maximum clique problem in circular arc graphs, Journal of Algorithms, Volume 25 ,pp 336 358, 2007. [21] Harsh Bhasin, Nakul Arora, Cryptography using Genetic Algorithms, In Reliability Infocom Technology and Optimization, Faridabad, 2010. [22] Harsh Bhasin and Neha Singla, Harnessing Cellular Automata and Genetic Algorithms to Solve Travelling Salesman Problem, International Conference on Information, Computing and Telecommunications (ICICT), conference proceedings, pp 72 77, 2012.
Page 424
Web Site: www.ijaiem.org Email: [email protected], [email protected] Volume 2, Issue 4, April 2013 ISSN 2319 - 4847
[23] Kithenham, B. A., Systematic review in software engineering: where we are and where we should be going, In Proceedings of the 2nd international workshop on Evidential assessment of software technologies, Pages 1-2, 2012.
Page 425