DSpace Repository

ADAPTIVE CHECKPOINTING BASED FAULT TOLERANCE IN GRID ENVIRONMENT

Show simple item record

dc.contributor.author Upadhyay, Neeraj
dc.date.accessioned 2014-09-27T05:24:06Z
dc.date.available 2014-09-27T05:24:06Z
dc.date.issued 2012
dc.identifier M.Tech en_US
dc.identifier.uri http://hdl.handle.net/123456789/2241
dc.guide Misra, Manoj
dc.description.abstract Grid systems differ from traditional distributed systems in terms of their large scale, heterogeneity and dynamism. These factors contributes towards higher number of fault occurrences as large scale causes lower values of Mean Time To Failure (MTTF), heterogeneity results in interaction faults (protocol incompatibilities) between communicating disparate nodes and dynamism implies dynamically varying resource availability due to resources autonomously entering and leaving the grid and thus effecting the jobs running on them. Another factor that increases probability of failure of applications is that applications running on grid are long running computations taking days to finish. Traditional approaches for tolerating faults in distributed systems include checkpointing and replication. Incorporating fault tolerance in scheduling algorithms is one of the approaches for handling faults in grid environment. Genetic Algorithms and Ant Colony Optimization are a popular class of meta-heuristic algorithms used for grid scheduling. This work designs heuristics for adaptive checkpointing based on fault information about resources. These heuristics have been incorporated in GA and ACO. Other adaptive checkpointing techniques developed focuses on online adaption of checkpoint interval based on MTBF, last failure time and fault indexes of resources. Performance comparison of adaptive checkpointing with periodic checkpointing techniques have been performed using simulated Grid environment for wide range of scenarios such as temporally and spatially correlated failures, real failure traces and real workload traces. Adaptive checkpointing techniques are found to give superior performance compared to periodic checkpointing. en_US
dc.language.iso en en_US
dc.subject FAULT TOLERANCE en_US
dc.subject HETEROGENEITY AND DYNAMISIM en_US
dc.subject ADAPTIVE CHECKPOINTING en_US
dc.subject ELECTRONICS AND COMPUTER ENGINEERING en_US
dc.title ADAPTIVE CHECKPOINTING BASED FAULT TOLERANCE IN GRID ENVIRONMENT en_US
dc.type M.Tech Dessertation en_US
dc.accession.number G219994 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record