AI · Neutral · Lil'Log (Lilian Weng) · Sep 28 · 6/10
Anatomize Deep Learning with Information Theory
Professor Naftali Tishby applied information theory to analyze how deep neural networks (DNNs) train, using the Information Bottleneck (IB) method to propose a new learning bound for DNNs. His research identified two distinct phases in DNN training: the network first learns to represent the input data so as to minimize generalization error, then compresses that representation by forgetting details that are irrelevant to the output.
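The compression idea can be made concrete with a small numerical sketch. It assumes the usual IB Lagrangian, I(X;T) - β·I(T;Y), which this summary does not spell out, and uses a toy discrete distribution. The function names `mutual_information` and `ib_objective`, the toy data, and the choice β = 2 are all illustrative assumptions, not taken from the original post.

```python
import numpy as np

def mutual_information(joint):
    """Mutual information I(A;B) in bits for a joint probability table p(a, b)."""
    joint = np.asarray(joint, dtype=float)
    pa = joint.sum(axis=1, keepdims=True)   # marginal p(a), shape (|A|, 1)
    pb = joint.sum(axis=0, keepdims=True)   # marginal p(b), shape (1, |B|)
    mask = joint > 0                        # skip zero cells (0 * log 0 = 0)
    return float(np.sum(joint[mask] * np.log2(joint[mask] / (pa @ pb)[mask])))

def ib_objective(p_xy, p_t_given_x, beta):
    """Assumed IB Lagrangian I(X;T) - beta * I(T;Y) for an encoder p(t|x)."""
    p_xy = np.asarray(p_xy, dtype=float)
    enc = np.asarray(p_t_given_x, dtype=float)  # shape (|X|, |T|), rows sum to 1
    p_x = p_xy.sum(axis=1)
    p_xt = p_x[:, None] * enc                   # p(x, t) = p(x) p(t|x)
    p_ty = enc.T @ p_xy                         # p(t, y), using the Markov chain Y - X - T
    return mutual_information(p_xt) - beta * mutual_information(p_ty)

# Toy data: X is uniform over 4 values, and Y depends only on whether x >= 2.
p_xy = np.array([[0.25, 0.0], [0.25, 0.0], [0.0, 0.25], [0.0, 0.25]])

identity_encoder = np.eye(4)                                  # T keeps every detail of X
compressed_encoder = np.array([[1, 0], [1, 0], [0, 1], [0, 1]], dtype=float)  # T keeps only x >= 2

full = ib_objective(p_xy, identity_encoder, beta=2.0)
compressed = ib_objective(p_xy, compressed_encoder, beta=2.0)
```

In this toy case both encoders retain the same information about Y, but the compressed encoder discards the part of X that Y does not depend on, so it scores lower (better) under the assumed objective.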