IJCTE 2010 Vol.2(5): 701-705 ISSN: 1793-8201
DOI: 10.7763/IJCTE.2010.V2.228

A Transition from Traditional Checkpointing towards Multi-Agent based Approaches

Gerard McKee, Blesson Varghese and Vassil Alexandrov
Abstract—Middleware for parallel computing systems incorporates checkpointing to achieve fault tolerance. Most traditional checkpointing approaches tend to be less dynamic in large-scale parallel computing environments. Hence, there arises a need for an adaptive and dynamic approach. The work reported in this paper proposes a multi-agent based approach for fault tolerance. Five resources that contribute towards the infrastructure of the proposed approach are considered, namely the executed problem, the parallel computing platform, the middleware, the hardware abstraction and the agents. The approach is implemented on a computer cluster, and experimental results are presented to validate its feasibility and its contribution towards enhancing fault tolerance.

Index Terms—middleware approach, multi-agent, fault tolerance, parallel computing systems.

  Gerard McKee is Senior Lecturer in Networked Robotics, School of Systems Engineering, University of Reading, Whiteknights Campus, Reading, Berkshire, United Kingdom, RG6 6AY, email: g.t.mckee@reading.ac.uk.
  Blesson Varghese is a PhD candidate with the Active Robotics Laboratory, School of Systems Engineering, University of Reading, Whiteknights Campus, Reading, Berkshire, United Kingdom, RG6 6AY, email: b.varghese@student.reading.ac.uk.
  Vassil Alexandrov is Professor in Computational Science, School of Systems Engineering, University of Reading, Whiteknights Campus, Reading, Berkshire, United Kingdom, RG6 6AY, email: v.n.alexandrov@reading.ac.uk.

Cite: Gerard McKee, Blesson Varghese and Vassil Alexandrov, "A Transition from Traditional Checkpointing towards Multi-Agent based Approaches," International Journal of Computer Theory and Engineering, vol. 2, no. 5, pp. 701-705, 2010.