DSpace Repository

PARALLELIZING SUFFIX ARRAY ON CUD � FOR BIOINFORMATICS APPLICATIONS

Show simple item record

dc.contributor.author Sahu, Laxmi Kant
dc.date.accessioned 2014-11-30T05:57:17Z
dc.date.available 2014-11-30T05:57:17Z
dc.date.issued 2010
dc.identifier M.Tech en_US
dc.identifier.uri http://hdl.handle.net/123456789/12189
dc.guide Niyogi, Rajdeep
dc.description.abstract Suffix trees have been applied to fundamental string problems such as finding the longest repeated substring, finding all squares or repetitions in a string, computing substring statistics, approximate string matching and string comparison. They have also been used to address other types of problems such as text compression, compressing assembly code, inverted indices and analyzing genetic sequences. Suffix array is a simpler and compact alternative to the suffix tree; suffix sorting is the fundamental building block in suffix array construction process. Suffix array widely used in field of bioinformatics and text processing. The repeat structure of genomic DNA is considered an essential mechanism for evolution and other fundamental biological functions. Any kind of repeats finding problems are always deemed as one of the prerequisites for genome sequencing and analysis, and among these problems exact repeat finding is the first step for most other repeats finding problems. This work depicts the parallel implementation of suffix array and then their applications in bioinformatics such as exact repeats finding, tandem repeats finding, exact string matching etc. the parallel implementation of suffix array algorithm on a GPU using the Compute Unified Device Architecture (CUDA) platform, both from NVIDIA Corporation. CUDA is a parallel computing architecture. It is a middle-ware. compute engine which exposes the power of NVIDIA Graphics Processing Units to software developers through industry standard programming language. The thread level parallel code block provides an efficient primitive for building a high performance suffix array construction program and many other applications. The parallel version runs much faster than any serial implementation on CPU for the large size of input data elements. The parallel algorithm can also be easily adapted for similar type of problems with little modification en_US
dc.language.iso en en_US
dc.subject ELECTRONICS AND COMPUTER ENGINEERING en_US
dc.subject PARALLELIZING en_US
dc.subject BIOINFORMATICS en_US
dc.subject APPLICATIONS en_US
dc.title PARALLELIZING SUFFIX ARRAY ON CUD � FOR BIOINFORMATICS APPLICATIONS en_US
dc.type M.Tech Dessertation en_US
dc.accession.number G20089 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record