Much recent deep learning research uses very deep neural networks with huge numbers of parameters. This yields strong expressive power, but it also brings issues such as overfitting to the training data, increased memory burden, and excessive computation. In this paper, we propose an expectation-maximization (EM) method that learns the group structure of deep neural networks under a group regularization principle to resolve these issues. Our method clusters the neurons in a layer according to how they are connected to the neurons in the next layer, using a mixture model, and clusters the neurons in the next layer according to which group in the current layer they are most strongly connected to. The EM procedure uses a Gaussian mixture model to keep the most salient connections and remove the rest, yielding a grouped weight matrix in block-diagonal form. We further refine the method to cluster the kernels of convolutional neural networks (CNNs): we define a representative value for each kernel and build a representative matrix; the matrix is then grouped, and kernels are pruned based on the group structure of the representative matrix. In experiments, we applied our method to fully connected networks, 1-dimensional CNNs, and 2-dimensional CNNs, and compared it with baseline deep neural networks on the MNIST, CIFAR-10, and United States groundwater datasets with respect to the number of parameters and classification and regression accuracy. We show that our method reduces the number of parameters significantly without loss of accuracy and outperforms the baseline models.
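The full method in the paper is more involved, but the core grouping idea can be sketched in a few lines. The toy sketch below is an assumption-laden simplification: it uses EM for a spherical, equal-variance Gaussian mixture (a stand-in for the paper's full GMM) to cluster the rows of a weight matrix (the outgoing connections of each current-layer neuron), assigns each column (next-layer neuron) to the row group it is most strongly connected to, and zeroes every weight outside its group's block. The names `group_weights` and `k` are illustrative, not from the paper.

```python
import math


def _init_means(W, k):
    # Deterministic farthest-point initialization: start from the first
    # row, then repeatedly pick the row farthest from all chosen means.
    means = [list(W[0])]
    while len(means) < k:
        def dist_to_chosen(row):
            return min(sum((a - b) ** 2 for a, b in zip(row, mu))
                       for mu in means)
        means.append(list(max(W, key=dist_to_chosen)))
    return means


def group_weights(W, k, iters=50):
    """Toy sketch: cluster rows of W with EM for a spherical Gaussian
    mixture, assign each column to its most strongly connected row
    group, and keep only within-group weights (block structure)."""
    m, n = len(W), len(W[0])
    means = _init_means(W, k)
    for _ in range(iters):
        # E-step: responsibilities under unit-variance spherical Gaussians.
        resp = []
        for row in W:
            d = [sum((a - b) ** 2 for a, b in zip(row, mu)) for mu in means]
            lo = min(d)  # shift log-weights for numerical stability
            w = [math.exp(-(di - lo) / 2.0) for di in d]
            s = sum(w)
            resp.append([wi / s for wi in w])
        # M-step: means become responsibility-weighted averages of the rows.
        for j in range(k):
            tot = sum(r[j] for r in resp)
            means[j] = [sum(r[j] * row[t] for r, row in zip(resp, W)) / tot
                        for t in range(n)]
    row_group = [max(range(k), key=lambda j: r[j]) for r in resp]
    # Each next-layer neuron joins the group it is most strongly wired to.
    col_group = []
    for t in range(n):
        strength = [0.0] * k
        for i in range(m):
            strength[row_group[i]] += abs(W[i][t])
        col_group.append(max(range(k), key=lambda j: strength[j]))
    # Keep only within-group weights: block-diagonal after permuting
    # rows and columns by group.
    G = [[W[i][t] if row_group[i] == col_group[t] else 0.0
          for t in range(n)] for i in range(m)]
    return G, row_group, col_group
```

On a weight matrix with two nearly block-diagonal groups, the sketch recovers the two row groups, assigns columns accordingly, and zeroes the weak cross-group connections, which is the pruning effect the abstract describes.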