Authors
Gita Shah, Annappa and K.C.Shet, National Institute of Technology, India
Abstract
Cloud has been a computational and storage solution for many data centric organizations. The problem today those organizations are facing from the cloud is in data searching in an efficient manner. A framework is required to distribute the work of searching and fetching from thousands of computers. The data in HDFS is scattered and needs lots of time to retrieve. The major idea is to design a web server in the map phase using the jetty web server which will give a fast and efficient way of searching data in MapReduce paradigm. For real time processing on Hadoop, a searchable mechanism is implemented in HDFS by creating a multilevel index in web server with multi-level index keys. The web server uses to handle traffic throughput. By web clustering technology we can improve the application performance.To keep the work down, the load balancer should automatically be able to distribute load to the newly added nodes in the server.
Keywords
Compute Cloud, Hadoop, MapReduce, load balancing, Web server.