Research Article |
Fault Tolerance Techniques in Heterogeneous Mobile Distributed Computing System- A Review
Author(s): Madhav Gupta and Dr. Ruchi Singla
Published In : International Journal of Electrical and Electronics Research (IJEER) Volume 3, issue 2
Publisher : FOREX Publication
Published : 30 june 2015
e-ISSN : 2347-470X
Page(s) : 9-14
Abstract
In Distributed computing system, Fault tolerance is an important issue because if the system fails then whole execution of a tasks stop. Fault tolerance is that asset of a system which provides the service to perform well still in case of any faults. A task applied on the real time distributed system must be feasible and reliable. The real time distributed systems for instance grid networks, robotics, air traffic control systems, etc. exceedingly depends on time. A single error in real time distributed system can cause a whole system failure, if not detected accurately and recovered at the proper time. Fault-tolerance is the key method which is often used to provide continue reliability in these systems. By applying extra hardware like processors, resource, communication links hardware fault tolerance can be achieved. A fault perhaps will occur for numerous reasons in distributed computing system such as failure of network, hardware or software failure etc.This paper defines various terminologies like failure, fault, faulty environment, fault tolerance, candidate node, redundancy, etc and explains fundamental concepts linked to fault tolerance in distributed systems. There are a lot of issues in distributed Computing system such as Emergent resource sharing, transparency, dependability, Complex mappings, concurrency, Fault tolerance etc. In this paper we focussed on the different fault tolerant approaches and fault tolerant terminologies used in distributed computing environment.
Keywords: Distributed System
, Reliability
, Fault tolerance
, Faulty environment
.
Madhav Gupta*, Student (M.Tech Scholar, ECE), Chandigarh Engineering College, Mohali, India; Email: madhavgupta890@gmail.com
Dr. Ruchi Singla , Professor and Head, Department of ECE, Chandigarh Engineering College, Mohali, India; Email: ruchisingla@yahoo.com
-
[1] “Distributed Computing Principles, Algorithms, and Systems” by Ajay D. Kshemkalyani (University of Illinois at Chicago) and Mukesh Singhal (University of Kentucky, Lexington) © Cambridge University Press 2008, www.cambridge.org.
-
[2] Vinod Kumar Yadav, Mahendra Pratap Yadav and Dharmendra Kumar Yadav, “Reliable Task Allocation in Heterogeneous Distributed System with Random Node Failure”, International Conference of Computing Science, Vol 61, pp: 187-192, 2012.
-
[3] Rajwinder Singh and Mayank Dave, Senior Member, “Antecedence Graph Approach to Check pointing for Fault Tolerance in Mobile Agent Systems”, IEEE transactions on computers, vol. 62, no. 2, February 2013.
-
[4] Z. Li, X. Liu, W. Ren, and L. Xie, “Distributed tracking control for linear multiagent systems with a leader of bounded unknown input,” IEEE Transactions on Automatic Control, vol. 58, no. 2, pp. 518-523,2013.
-
[5] Zibin Zheng and Michael R. Lyu, “Selecting an Optimal Fault Tolerance Strategy for Reliable Service-Oriented Systems with Local and Global Constraints” IEEE TRANSACTIONS ON COMPUTERS, VOL. 64, NO. 1, JANUARY 2015.
-
[6] Jinho Ahn, “Lightweight Fault-tolerance Mechanism for Distributed Mobile Agent-based Monitoring” IEEE International Conference on Mobile Agent-based monitoring, 2009.
-
[7] Parmeet Kaur Jaggi and Awadhesh Kumar Singh, “Adaptive Checkpointing for Fault Tolerance in an Autonomous Mobile Computing Grid, IEEE, 2014.
-
[8] Zhe Wang, Naftaly and H. Minsky,” Fault Tolerance in Heterogeneous Distributed Systems” 10th IEEE International Conference on Collaborative Computing: Networking, Applications and Work sharing (CollaborateCom 2014).
-
[9] Pritee Parwekar and Parmeet Kaur, “Fuzzy Rule based Checkpointing Arrangement for Fault Tolerance in Mobile Grids”, IEEE2014.
-
[10] Rajwinder Singh, Mayank Dave, “Using Host Criticalities for Fault Tolerance in Mobile Agent Systems, 2nd IEEE International Conference on Parallel Distributed and Grid Computing, Vol 2 pp: 67-72, Jun 2012.
-
[11] Bahi, Jacques, Couturier, Raphael and Vernier, Flavien. Synchronous distributed load balancing on dynamic networks, Journal of Parallel and Distributed Computing, Elsevier Inc., Vol. 65, Issue 11, 1397 – 1405, 2010.
-
[12]Pradeep K. Sinha Distributed Operating Systems Concepts and Design, PHI Learning Private Limited, New Delhi- 110001, 2010.