Please use this identifier to cite or link to this item: http://localhost:8081/xmlui/handle/123456789/9843
Full metadata record
DC FieldValueLanguage
dc.contributor.authorJangam, Prasad-
dc.date.accessioned2014-11-20T12:06:17Z-
dc.date.available2014-11-20T12:06:17Z-
dc.date.issued2004-
dc.identifierM.Techen_US
dc.identifier.urihttp://hdl.handle.net/123456789/9843-
dc.guideJoshi, Ramesh Chandra-
dc.description.abstractWith the advent of Parallel processors and super computing technology the concept of "fault-tolerant computing" has emerged as an important issue. There are many Checkpoint/Restart Utilities available for sequential processors. But for large scientific computations which will take many hours for their execution, the obvious option is going for a parallel machine or super computer for fast execution. There is no such Checkpoint/Restart Utility available for programs running on parallel machine-ANUPAM super computer Developed by BARC. This Dissertation Work is an attempt to build an efficient Checkpoint/Restart Utility for parallel programs running on ANUPAM super computer. This is implemented using the concept of kernel level checkpointing and ZCF checkpointing schemes.- Optimization Techniques are used to minimize the checkpointing cost at each node of the parallel processor. A new checkpointing algorithm is designed and implemented which can minimize the checkpointing cost along with maintaining a consistent global checkpoint from which the parallel application can be restarted when a fault occurs. The Checkpointing scheme proposed takes communication induced checkpoints along with local checkpoints that are taken depending on the change in state size of the process. Finding a consistent global checkpoint is also easy in this scheme, which is an open question in ZCF schemes. This is implemented on Linux-8 operating system with kernel version 2.4.18 which is the official operating system used on ANUPAM. The language in which it is developed isen_US
dc.language.isoenen_US
dc.subjectELECTRONICS AND COMPUTER ENGINEERINGen_US
dc.subjectCHECKPOINT/RESTART UTILITYen_US
dc.subjectPARALLEL PROGRAMSen_US
dc.subjectANUPAM - SUPER COMPUTERen_US
dc.titleCHECKPOINT/RESTART UTILITY FOR PARALLEL PROGRAMS ON ANUPAM - SUPER COMPUTERen_US
dc.typeM.Tech Dessertationen_US
dc.accession.numberG11792en_US
Appears in Collections:MASTERS' THESES (E & C)

Files in This Item:
File Description SizeFormat 
ECDG11792.pdf3.72 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.