Please use this identifier to cite or link to this item:
http://localhost:8081/jspui/handle/123456789/21154| Title: | Using Information Theoretic Concepts to Understand Autoencoders |
| Authors: | Ashish |
| Issue Date: | May-2022 |
| Publisher: | IIT Roorkee |
| Abstract: | Deep neural networks currently require theory - building approaches for analysis, despite their extensive usage in practical uses. We present a new information-theoretic model to understand learning dynamics and building autoencoders, a type of deep learning architecture that resembles a communication channel. " By generalizing the information plane to any cost function and analyzing the roles and dynamics of various levels using layer-wise information quantity, we emphasize the importance of mutual information in measuring knowledge from data. " Using the data processing inequality, we also suggest and scientifically illustrate three essential hypotheses relating to the layer-wise flow of information and intrinsic dimensionality of the bottleneck layer for mean square error training and the recognition of a data-controlled bifurcation point in the information plane. Our findings have a direct influence on the best design of autoencoders, alternative feedforward training approaches, and even the generalization problem. |
| URI: | http://localhost:8081/jspui/handle/123456789/21154 |
| Research Supervisor/ Guide: | Kumar, Sanjeev |
| metadata.dc.type: | Dissertations |
| Appears in Collections: | MASTERS' THESES (Maths) |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| 20616005_Ashish.pdf | 795.03 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.
