Please use this identifier to cite or link to this item:
http://localhost:8081/jspui/handle/123456789/9843
Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Jangam, Prasad | - |
dc.date.accessioned | 2014-11-20T12:06:17Z | - |
dc.date.available | 2014-11-20T12:06:17Z | - |
dc.date.issued | 2004 | - |
dc.identifier | M.Tech | en_US |
dc.identifier.uri | http://hdl.handle.net/123456789/9843 | - |
dc.guide | Joshi, Ramesh Chandra | - |
dc.description.abstract | With the advent of Parallel processors and super computing technology the concept of "fault-tolerant computing" has emerged as an important issue. There are many Checkpoint/Restart Utilities available for sequential processors. But for large scientific computations which will take many hours for their execution, the obvious option is going for a parallel machine or super computer for fast execution. There is no such Checkpoint/Restart Utility available for programs running on parallel machine-ANUPAM super computer Developed by BARC. This Dissertation Work is an attempt to build an efficient Checkpoint/Restart Utility for parallel programs running on ANUPAM super computer. This is implemented using the concept of kernel level checkpointing and ZCF checkpointing schemes.- Optimization Techniques are used to minimize the checkpointing cost at each node of the parallel processor. A new checkpointing algorithm is designed and implemented which can minimize the checkpointing cost along with maintaining a consistent global checkpoint from which the parallel application can be restarted when a fault occurs. The Checkpointing scheme proposed takes communication induced checkpoints along with local checkpoints that are taken depending on the change in state size of the process. Finding a consistent global checkpoint is also easy in this scheme, which is an open question in ZCF schemes. This is implemented on Linux-8 operating system with kernel version 2.4.18 which is the official operating system used on ANUPAM. The language in which it is developed is | en_US |
dc.language.iso | en | en_US |
dc.subject | ELECTRONICS AND COMPUTER ENGINEERING | en_US |
dc.subject | CHECKPOINT/RESTART UTILITY | en_US |
dc.subject | PARALLEL PROGRAMS | en_US |
dc.subject | ANUPAM - SUPER COMPUTER | en_US |
dc.title | CHECKPOINT/RESTART UTILITY FOR PARALLEL PROGRAMS ON ANUPAM - SUPER COMPUTER | en_US |
dc.type | M.Tech Dessertation | en_US |
dc.accession.number | G11792 | en_US |
Appears in Collections: | MASTERS' THESES (E & C) |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
ECDG11792.pdf | 3.72 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.