Maintained by Difan Deng and Marius Lindauer.
The following list covers papers related to neural architecture search. It is by no means complete. If a paper is missing from the list, please let us know.
Please note that although NAS methods steadily improve, the quality of empirical evaluations in this field is still lagging behind that of other areas in machine learning, AI, and optimization. We would therefore like to share some best practices for empirical evaluations of NAS methods, which we believe will facilitate sustained and measurable progress in the field. If you are interested in a teaser, please read our blog post or jump directly to our checklist.
Transformers have gained increasing popularity across different domains. For a comprehensive list of papers focusing on Neural Architecture Search for Transformer-based search spaces, the awesome-transformer-search repo is all you need.
2022
Huynh, Lam; Rahtu, Esa; Matas, Jiri; Heikkilä, Janne
Fast Neural Architecture Search for Lightweight Dense Prediction Networks Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-01994,
title = {Fast Neural Architecture Search for Lightweight Dense Prediction Networks},
author = {Lam Huynh and Esa Rahtu and Jiri Matas and Janne Heikkilä},
url = {https://doi.org/10.48550/arXiv.2203.01994},
doi = {10.48550/arXiv.2203.01994},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.01994},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Lin, Ke; A, Yong; Gan, Zhuoxin; Jiang, Yingying
WPNAS: Neural Architecture Search by jointly using Weight Sharing and Predictor Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-02086,
title = {WPNAS: Neural Architecture Search by jointly using Weight Sharing and Predictor},
author = {Ke Lin and Yong A and Zhuoxin Gan and Yingying Jiang},
url = {https://doi.org/10.48550/arXiv.2203.02086},
doi = {10.48550/arXiv.2203.02086},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.02086},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Chen, Xuehui; Niu, Xin; Jiang, Jingfei; Pan, Hengyue; Dong, Peijie; Wei, Zimian
Influence of Initialization and Modularization on the Performance of Network Morphism-Based Neural Architecture Search Proceedings Article
In: Yao, Jian; Xiao, Yang; You, Peng; Sun, Guang (Ed.): The International Conference on Image, Vision and Intelligent Systems (ICIVIS 2021), pp. 875–887, Springer Singapore, Singapore, 2022, ISBN: 978-981-16-6963-7.
@inproceedings{10.1007/978-981-16-6963-7_77,
title = {Influence of Initialization and Modularization on the Performance of Network Morphism-Based Neural Architecture Search},
author = {Xuehui Chen and Xin Niu and Jingfei Jiang and Hengyue Pan and Peijie Dong and Zimian Wei},
editor = {Jian Yao and Yang Xiao and Peng You and Guang Sun},
url = {https://link.springer.com/chapter/10.1007/978-981-16-6963-7_77},
isbn = {978-981-16-6963-7},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {The International Conference on Image, Vision and Intelligent Systems (ICIVIS 2021)},
pages = {875--887},
publisher = {Springer Singapore},
address = {Singapore},
abstract = {Neural Architecture Search (NAS), the process of automatic network architecture design, has enabled remarkable progress over the last years on Computer Vision tasks. In this paper, we propose a novel and efficient NAS framework based on network morphism to further improve the performance of NAS algorithms. Firstly, we design four modular structures termed RBNC block, CBNR block, BNRC block and RCBN block which correspond to four initial neural network architectures and four modular network morphism methods. Each block is composed of a ReLU layer, a Batch-Norm layer and a convolutional layer. Then we introduce network morphism to correlate different modular structures for constructing network architectures. Moreover, we study the influence of different initial neural network architectures and modular network morphism methods on the performance of network morphism-based NAS algorithms through comparative experiments and ablation experiments. Finally, we find that the network morphism-based NAS algorithm that uses CBNR block for initialization and modularization is the best method to improve performance. Our proposed method achieves a test accuracy of 95.84% on CIFAR-10 with least parameters (only 2.72 M) and fewer search costs (2 GPU-days) for network architecture search.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
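The abstract above enumerates four modular layer orderings, each a permutation of ReLU, BatchNorm, and convolution (RBNC, CBNR, BNRC, RCBN). A minimal sketch of how such blocks can be generated from an ordering, using simple 1-D NumPy stand-ins for the layers — the smoothing kernel and parameter-free normalization are illustrative, not the paper's actual operators:

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def batch_norm(x, eps=1e-5):
    # parameter-free normalization, standing in for a learned BatchNorm layer
    return (x - x.mean()) / np.sqrt(x.var() + eps)

def conv(x):
    # fixed 3-tap smoothing kernel, standing in for a learned convolution
    return np.convolve(x, np.array([0.25, 0.5, 0.25]), mode="same")

LAYERS = {"R": relu, "BN": batch_norm, "C": conv}

# the four layer orderings named in the abstract (R=ReLU, BN=BatchNorm, C=Conv)
ORDERS = {
    "RBNC": ["R", "BN", "C"],
    "CBNR": ["C", "BN", "R"],
    "BNRC": ["BN", "R", "C"],
    "RCBN": ["R", "C", "BN"],
}

def make_block(name):
    """Compose the layers of one modular block in the given order."""
    def block(x):
        for token in ORDERS[name]:
            x = LAYERS[token](x)
        return x
    return block
```

Under this framing, the paper's search amounts to picking which of the four block factories to use for initialization and for each morphism step.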
Sun, Jialiang; Jiang, Tingsong; Li, Chao; Zhou, Weien; Zhang, Xiaoya; Yao, Wen; Chen, Xiaoqian
Searching for Robust Neural Architectures via Comprehensive and Reliable Evaluation Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-03128,
title = {Searching for Robust Neural Architectures via Comprehensive and Reliable Evaluation},
author = {Jialiang Sun and Tingsong Jiang and Chao Li and Weien Zhou and Xiaoya Zhang and Wen Yao and Xiaoqian Chen},
url = {https://doi.org/10.48550/arXiv.2203.03128},
doi = {10.48550/arXiv.2203.03128},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.03128},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Chebykin, Alexander; Alderliesten, Tanja; Bosman, Peter A. N.
Evolutionary Neural Cascade Search across Supernetworks Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-04011,
title = {Evolutionary Neural Cascade Search across Supernetworks},
author = {Alexander Chebykin and Tanja Alderliesten and Peter A. N. Bosman},
url = {https://doi.org/10.48550/arXiv.2203.04011},
doi = {10.48550/arXiv.2203.04011},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.04011},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xiao, Yuhan; Sun, Shang; Liao, TaoLin
Parameter search-based scaling network for self-supervised depth Proceedings Article
In: Mohiddin, Md Khaja; Chen, Siting; EL-Zoghdy, Said Fathy (Ed.): Third International Conference on Electronics and Communication; Network and Computer Technology (ECNCT 2021), pp. 463–467, International Society for Optics and Photonics SPIE, 2022.
@inproceedings{10.1117/12.2629190,
title = {Parameter search-based scaling network for self-supervised depth},
author = {Yuhan Xiao and Shang Sun and TaoLin Liao},
editor = {Md Khaja Mohiddin and Siting Chen and Said Fathy EL-Zoghdy},
url = {https://doi.org/10.1117/12.2629190},
doi = {10.1117/12.2629190},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Third International Conference on Electronics and Communication; Network and Computer Technology (ECNCT 2021)},
volume = {12167},
pages = {463--467},
publisher = {SPIE},
organization = {International Society for Optics and Photonics},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Javaheripi, Mojan; Shah, Shital; Mukherjee, Subhabrata; Religa, Tomasz L.; Mendes, Caio C. T.; Rosa, Gustavo H.; Bubeck, Sébastien; Koushanfar, Farinaz; Dey, Debadeepta
LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-02094,
title = {LiteTransformerSearch: Training-free On-device Search for Efficient Autoregressive Language Models},
author = {Mojan Javaheripi and Shital Shah and Subhabrata Mukherjee and Tomasz L. Religa and Caio C. T. Mendes and Gustavo H. Rosa and Sébastien Bubeck and Farinaz Koushanfar and Debadeepta Dey},
url = {https://doi.org/10.48550/arXiv.2203.02094},
doi = {10.48550/arXiv.2203.02094},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.02094},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Lopes, Vasco; Alexandre, Luís A.
Towards Less Constrained Macro-Neural Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-05508,
title = {Towards Less Constrained Macro-Neural Architecture Search},
author = {Vasco Lopes and Luís A. Alexandre},
url = {https://doi.org/10.48550/arXiv.2203.05508},
doi = {10.48550/arXiv.2203.05508},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.05508},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Wu, Xixin; Hu, Shoukang; Wu, Zhiyong; Liu, Xunying; Meng, Helen
Neural Architecture Search for Speech Emotion Recognition Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-16928,
title = {Neural Architecture Search for Speech Emotion Recognition},
author = {Xixin Wu and Shoukang Hu and Zhiyong Wu and Xunying Liu and Helen Meng},
url = {https://doi.org/10.48550/arXiv.2203.16928},
doi = {10.48550/arXiv.2203.16928},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.16928},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Wei, Zimian; Pan, Hengyue; Niu, Xin; Dong, Peijie; Li, Dongsheng
UENAS: A Unified Evolution-based NAS Framework Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-04300,
title = {UENAS: A Unified Evolution-based NAS Framework},
author = {Zimian Wei and Hengyue Pan and Xin Niu and Peijie Dong and Dongsheng Li},
url = {https://doi.org/10.48550/arXiv.2203.04300},
doi = {10.48550/arXiv.2203.04300},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.04300},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xiang, Tiange; Zhang, Chaoyi; Wang, Xinyi; Song, Yang; Liu, Dongnan; Huang, Heng; Cai, Weidong
Towards bi-directional skip connections in encoder-decoder architectures and beyond Journal Article
In: Medical Image Analysis, vol. 78, pp. 102420, 2022, ISSN: 1361-8415.
@article{XIANG2022102420,
title = {Towards bi-directional skip connections in encoder-decoder architectures and beyond},
author = {Tiange Xiang and Chaoyi Zhang and Xinyi Wang and Yang Song and Dongnan Liu and Heng Huang and Weidong Cai},
url = {https://www.sciencedirect.com/science/article/pii/S1361841522000718},
doi = {10.1016/j.media.2022.102420},
issn = {1361-8415},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Medical Image Analysis},
volume = {78},
pages = {102420},
abstract = {U-Net, as an encoder-decoder architecture with forward skip connections, has achieved promising results in various medical image analysis tasks. Many recent approaches have also extended U-Net with more complex building blocks, which typically increase the number of network parameters considerably. Such complexity makes the inference stage highly inefficient for clinical applications. Towards an effective yet economic segmentation network design, in this work, we propose backward skip connections that bring decoded features back to the encoder. Our design can be jointly adopted with forward skip connections in any encoder-decoder architecture forming a recurrence structure without introducing extra parameters. With the backward skip connections, we propose a U-Net based network family, namely Bi-directional O-shape networks, which set new benchmarks on multiple public medical imaging segmentation datasets. On the other hand, with the most plain architecture (BiO-Net), network computations inevitably increase along with the pre-set recurrence time. We have thus studied the deficiency bottleneck of such recurrent design and propose a novel two-phase Neural Architecture Search (NAS) algorithm, namely BiX-NAS, to search for the best multi-scale bi-directional skip connections. The ineffective skip connections are then discarded to reduce computational costs and speed up network inference. The finally searched BiX-Net yields the least network complexity and outperforms other state-of-the-art counterparts by large margins. We evaluate our methods on both 2D and 3D segmentation tasks in a total of six datasets. Extensive ablation studies have also been conducted to provide a comprehensive analysis for our proposed methods.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
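The BiO-Net abstract above pairs forward skips (encoder to decoder) with backward skips (decoder back to the encoder) in a weight-shared recurrence, so extra iterations add no parameters. A toy numerical sketch of that recurrence, with dense `tanh` layers standing in for the paper's convolutional encoder and decoder — all names and dimensions here are illustrative:

```python
import numpy as np

rng = np.random.default_rng(0)
DIM = 6

# shared weights: iterating the O-shape reuses them, adding no parameters
W_ENC = rng.standard_normal((DIM, DIM)) * 0.3
W_DEC = rng.standard_normal((DIM, DIM)) * 0.3

def bio_net(x, recurrences=2):
    backward = np.zeros_like(x)                  # backward skip: decoder -> encoder
    dec = np.zeros_like(x)
    for _ in range(recurrences):
        enc = np.tanh(W_ENC @ (x + backward))    # encoder sees input plus backward skip
        dec = np.tanh(W_DEC @ enc) + enc         # forward skip: encoder -> decoder
        backward = dec                           # feed decoded features back
    return dec
```

The follow-up BiX-NAS step described in the abstract would then search over which of these skip connections to keep at each scale and iteration.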
Sengar, Neha; Singh, Akriti; Yadav, Saumya; Dutta, Malay Kishore
Automated System for Face-Mask Detection Using Convolutional Neural Network Proceedings Article
In: Giri, Debasis; Choo, Kim-Kwang Raymond; Ponnusamy, Saminathan; Meng, Weizhi; Akleylek, Sedat; Maity, Santi Prasad (Ed.): Proceedings of the Seventh International Conference on Mathematics and Computing, pp. 373–380, Springer Singapore, Singapore, 2022, ISBN: 978-981-16-6890-6.
@inproceedings{10.1007/978-981-16-6890-6_28,
title = {Automated System for Face-Mask Detection Using Convolutional Neural Network},
author = {Neha Sengar and Akriti Singh and Saumya Yadav and Malay Kishore Dutta},
editor = {Debasis Giri and Kim-Kwang Raymond Choo and Saminathan Ponnusamy and Weizhi Meng and Sedat Akleylek and Santi Prasad Maity},
url = {https://link.springer.com/chapter/10.1007/978-981-16-6890-6_28},
isbn = {978-981-16-6890-6},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the Seventh International Conference on Mathematics and Computing},
pages = {373--380},
publisher = {Springer Singapore},
address = {Singapore},
abstract = {Coronavirus Disease 2019 (COVID-19) pandemic is affecting the health of the global population severely. It is one of the deadliest diseases in history and has severely affected all the countries. The only way to prevent the spread of corona is to cover faces and follow social distancing norms until a vaccine is developed. The face mask is effective in blocking the droplets that contain the COVID-19 virus. Hence, it is necessary to wear a face mask as a precautionary measure against it. In the proposed work, the face mask detection model is generated using an optimized neural network architecture for performing the classification task (mask or no mask). For training and model assessment, a dataset of 8695 images has been taken from four different sources. The model achieves a validation accuracy of 99.52%.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Girish, Sharath; Dey, Debadeepta; Joshi, Neel; Vineet, Vibhav; Shah, Shital; Mendes, Caio Cesar Teodoro; Shrivastava, Abhinav; Song, Yale
One Network Doesn't Rule Them All: Moving Beyond Handcrafted Architectures in Self-Supervised Learning Technical Report
2022.
@techreport{girish2022one,
title = {One Network Doesn't Rule Them All: Moving Beyond Handcrafted Architectures in Self-Supervised Learning},
author = {Sharath Girish and Debadeepta Dey and Neel Joshi and Vibhav Vineet and Shital Shah and Caio Cesar Teodoro Mendes and Abhinav Shrivastava and Yale Song},
url = {https://arxiv.org/abs/2203.08130},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {arXiv preprint arXiv:2203.08130},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Li, Zi; Li, Ziyang; Liu, Risheng; Luo, Zhongxuan; Fan, Xin
Automated Learning for Deformable Medical Image Registration by Jointly Optimizing Network Architectures and Objective Functions Technical Report
2022.
@techreport{li2022automated,
title = {Automated Learning for Deformable Medical Image Registration by Jointly Optimizing Network Architectures and Objective Functions},
author = {Zi Li and Ziyang Li and Risheng Liu and Zhongxuan Luo and Xin Fan},
url = {https://arxiv.org/abs/2203.06810},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {arXiv preprint arXiv:2203.06810},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Wang, Haoxiang; Wang, Yite; Sun, Ruoyu; Li, Bo
Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-09137,
title = {Global Convergence of MAML and Theory-Inspired Neural Architecture Search for Few-Shot Learning},
author = {Haoxiang Wang and Yite Wang and Ruoyu Sun and Bo Li},
url = {https://doi.org/10.48550/arXiv.2203.09137},
doi = {10.48550/arXiv.2203.09137},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.09137},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Shi, Jiachen; Zhou, Guoqiang; Bao, Shudi; Shen, Jun
Multi-SelfGAN: A Self-Guiding Neural Architecture Search Method for Generative Adversarial Networks with Multi-Controllers Journal Article
In: IEEE Transactions on Cognitive and Developmental Systems, pp. 1-1, 2022.
@article{9737565,
title = {Multi-SelfGAN: A Self-Guiding Neural Architecture Search Method for Generative Adversarial Networks with Multi-Controllers},
author = {Jiachen Shi and Guoqiang Zhou and Shudi Bao and Jun Shen},
url = {https://ieeexplore.ieee.org/abstract/document/9737565},
doi = {10.1109/TCDS.2022.3160475},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Cognitive and Developmental Systems},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Dong, Junwei; Hou, Boyu; Feng, Liang; Tang, Huajin; Tan, Kay Chen; Ong, Yew-Soon
A Cell-Based Fast Memetic Algorithm for Automated Convolutional Neural Architecture Design Journal Article
In: IEEE Transactions on Neural Networks and Learning Systems, pp. 1-14, 2022.
@article{9737315,
title = {A Cell-Based Fast Memetic Algorithm for Automated Convolutional Neural Architecture Design},
author = {Junwei Dong and Boyu Hou and Liang Feng and Huajin Tang and Kay Chen Tan and Yew-Soon Ong},
url = {https://ieeexplore.ieee.org/abstract/document/9737315},
doi = {10.1109/TNNLS.2022.3155230},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Neural Networks and Learning Systems},
pages = {1-14},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Mi, Jian-Xun; Feng, Jie; Huang, Ke-Yang
Designing efficient convolutional neural network structure: A survey Journal Article
In: Neurocomputing, vol. 489, pp. 139-156, 2022, ISSN: 0925-2312.
@article{MI2022139,
title = {Designing efficient convolutional neural network structure: A survey},
author = {Jian-Xun Mi and Jie Feng and Ke-Yang Huang},
url = {https://www.sciencedirect.com/science/article/pii/S0925231222003162},
doi = {10.1016/j.neucom.2021.08.158},
issn = {0925-2312},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Neurocomputing},
volume = {489},
pages = {139-156},
abstract = {As a powerful machine learning method, deep learning has attracted the attention of numerous researchers. While exploring a high-performance neural network model, the floating-point operations of a neural network model are also increasing. In recent years, many researchers have noticed that efficiency is also one of important indicators to measure the property of neural network models. Obviously, the efficient neural network model is more helpful to deploy on mobile and embedded devices. Therefore, the efficient neural network model becomes a hot research spot. In this paper, we review the methods related to the structural design of efficient convolution neural networks in recent years. According to the characteristics of these methods, we divide them into three kinds of methods: model pruning, efficient architecture, and neural architecture search. Detailed analyses of each method are presented to demonstrate their advantages and disadvantages. Then, we comprehensively compare them in detail and propose many suggestions about the design of the efficient convolution neural network model structure. Inspired by these suggestions, we built a new efficient neural network model, SharedNet. And the SharedNet obtains the best accuracy of manually-designed efficient CNN models on the ImageNet dataset.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Blumberg, Stefano B.; Lin, Hongxiang; Grussu, Francesco; Zhou, Yukun; Figini, Matteo; Alexander, Daniel C.
Progressive Subsampling for Oversampled Data - Application to Quantitative MRI Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-09268,
title = {Progressive Subsampling for Oversampled Data - Application to Quantitative MRI},
author = {Stefano B. Blumberg and Hongxiang Lin and Francesco Grussu and Yukun Zhou and Matteo Figini and Daniel C. Alexander},
url = {https://doi.org/10.48550/arXiv.2203.09268},
doi = {10.48550/arXiv.2203.09268},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.09268},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Vo-Ho, Viet-Khoa; Yamazaki, Kashu; Hoang, Hieu; Tran, Minh-Triet; Le, Ngan
Meta-Learning of NAS for Few-shot Learning in Medical Image Applications Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-08951,
title = {Meta-Learning of NAS for Few-shot Learning in Medical Image Applications},
author = {Viet-Khoa Vo-Ho and Kashu Yamazaki and Hieu Hoang and Minh-Triet Tran and Ngan Le},
url = {https://doi.org/10.48550/arXiv.2203.08951},
doi = {10.48550/arXiv.2203.08951},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.08951},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Chang, Qing; Peng, Junran; Xie, Lingxi; Sun, Jiajun; Yin, Haoran; Tian, Qi; Zhang, Zhaoxiang
DATA: Domain-Aware and Task-Aware Self-supervised Learning Proceedings Article
In: CVPR2022, 2022.
@inproceedings{DBLP:journals/corr/abs-2203-09041,
title = {DATA: Domain-Aware and Task-Aware Self-supervised Learning},
author = {Qing Chang and Junran Peng and Lingxi Xie and Jiajun Sun and Haoran Yin and Qi Tian and Zhaoxiang Zhang},
url = {https://openaccess.thecvf.com/content/CVPR2022/papers/Chang_DATA_Domain-Aware_and_Task-Aware_Self-Supervised_Learning_CVPR_2022_paper.pdf},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {CVPR2022},
journal = {CoRR},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Lu, Zhenyu; Liang, Shaoyang; Yang, Qiang; Du, Bo
Evolving Block-Based Convolutional Neural Network for Hyperspectral Image Classification Journal Article
In: IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-21, 2022.
@article{9737511,
title = {Evolving Block-Based Convolutional Neural Network for Hyperspectral Image Classification},
author = {Zhenyu Lu and Shaoyang Liang and Qiang Yang and Bo Du},
url = {https://ieeexplore.ieee.org/abstract/document/9737511},
doi = {10.1109/TGRS.2022.3160513},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Geoscience and Remote Sensing},
volume = {60},
pages = {1-21},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Lukasik, Jovita; Jung, Steffen; Keuper, Margret
Learning Where To Look - Generative NAS is Surprisingly Efficient Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-08734,
title = {Learning Where To Look - Generative NAS is Surprisingly Efficient},
author = {Jovita Lukasik and Steffen Jung and Margret Keuper},
url = {https://doi.org/10.48550/arXiv.2203.08734},
doi = {10.48550/arXiv.2203.08734},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.08734},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Yan, Chenqian; Zhang, Yuge; Zhang, Quanlu; Yang, Yaming; Jiang, Xinyang; Yang, Yuqing; Wang, Baoyuan
Privacy-preserving Online AutoML for Domain-Specific Face Detection Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-08399,
title = {Privacy-preserving Online AutoML for Domain-Specific Face Detection},
author = {Chenqian Yan and Yuge Zhang and Quanlu Zhang and Yaming Yang and Xinyang Jiang and Yuqing Yang and Baoyuan Wang},
url = {https://doi.org/10.48550/arXiv.2203.08399},
doi = {10.48550/arXiv.2203.08399},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.08399},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Yang, Sen; Yang, Wankou; Cui, Zhen
Searching part-specific neural fabrics for human pose estimation Journal Article
In: Pattern Recognition, vol. 128, pp. 108652, 2022, ISSN: 0031-3203.
@article{YANG2022108652,
title = {Searching part-specific neural fabrics for human pose estimation},
author = {Sen Yang and Wankou Yang and Zhen Cui},
url = {https://www.sciencedirect.com/science/article/pii/S0031320322001339},
doi = {10.1016/j.patcog.2022.108652},
issn = {0031-3203},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Pattern Recognition},
volume = {128},
pages = {108652},
abstract = {Neural architecture search (NAS) has emerged in many domains to jointly learn the architectures and weights of neural networks. The core spirit behind NAS is to automatically search neural architectures for target tasks with better performance-efficiency trade-offs. However, existing approaches emphasize on only searching a single architecture with less human intervention to replace a human-designed neural network, yet making the search process almost independent of the domain knowledge. In this paper, we aim to apply NAS for human pose estimation and we ask: when NAS meets this localization task, can the articulated human body structure help to search better task-specific architectures? To this end, we first design a new neural architecture search space, Cell-based Neural Fabric (CNF), to learn micro as well as macro neural architecture using a differentiable search strategy. Then, by viewing locating human parts as multiple disentangled prediction sub-tasks, we exploit the compositionality of human body structure as guidance to search multiple part-specific CNFs specialized for different human parts. After the search, all these part-specific neural fabrics have been tailored with distinct micro and macro architecture parameters. The results show that such knowledge-guided NAS-based model outperforms a hand-crafted part-based baseline model, and the resulting multiple part-specific architectures gain significant performance improvement against a single NAS-based architecture for the whole body. The experiments on MPII and COCO datasets show that our models (code is available at https://github.com/yangsenius/PoseNFS) achieve comparable performance against the state-of-the-art methods while being relatively lightweight.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhang, Haichao; Hao, Kuangrong; Pedrycz, Witold; Gao, Lei; Tang, Xue-Song; Wei, Bing
Vision Transformer with Convolutions Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-10435,
title = {Vision Transformer with Convolutions Architecture Search},
author = {Haichao Zhang and Kuangrong Hao and Witold Pedrycz and Lei Gao and Xue-Song Tang and Bing Wei},
url = {https://doi.org/10.48550/arXiv.2203.10435},
doi = {10.48550/arXiv.2203.10435},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.10435},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Hu, Yiming; Wang, Xingang; Gu, Qingyi
PWSNAS: Powering Weight Sharing NAS With General Search Space Shrinking Framework Journal Article
In: IEEE Transactions on Neural Networks and Learning Systems, pp. 1-14, 2022.
@article{9739130,
title = {PWSNAS: Powering Weight Sharing NAS With General Search Space Shrinking Framework},
author = {Yiming Hu and Xingang Wang and Qingyi Gu},
url = {https://ieeexplore.ieee.org/abstract/document/9739130},
doi = {10.1109/TNNLS.2022.3156373},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Neural Networks and Learning Systems},
pages = {1-14},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Wang, Xiaoxing; Lin, Jiale; Yan, Junchi; Zhao, Juanping; Yang, Xiaokang
EAutoDet: Efficient Architecture Search for Object Detection Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-10747,
title = {EAutoDet: Efficient Architecture Search for Object Detection},
author = {Xiaoxing Wang and Jiale Lin and Junchi Yan and Juanping Zhao and Xiaokang Yang},
url = {https://doi.org/10.48550/arXiv.2203.10747},
doi = {10.48550/arXiv.2203.10747},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.10747},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Habibian, Amirhossein; Yahia, Haitam Ben; Abati, Davide; Gavves, Efstratios; Porikli, Fatih
Delta Distillation for Efficient Video Processing Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-09594,
title = {Delta Distillation for Efficient Video Processing},
author = {Amirhossein Habibian and Haitam Ben Yahia and Davide Abati and Efstratios Gavves and Fatih Porikli},
url = {https://doi.org/10.48550/arXiv.2203.09594},
doi = {10.48550/arXiv.2203.09594},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.09594},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Arora, Parul; Jalali, Seyed Mohammad Jafar; Ahmadian, Sajad; Panigrahi, Bijaya Ketan; Suganthan, Pn; Khosravi, Abbas
Probabilistic Wind Power Forecasting Using Optimised Deep Auto-Regressive Recurrent Neural Networks Journal Article
In: IEEE Transactions on Industrial Informatics, pp. 1-1, 2022.
@article{9739990,
title = {Probabilistic Wind Power Forecasting Using Optimised Deep Auto-Regressive Recurrent Neural Networks},
author = {Parul Arora and Seyed Mohammad Jafar Jalali and Sajad Ahmadian and Bijaya Ketan Panigrahi and Pn Suganthan and Abbas Khosravi},
url = {https://ieeexplore.ieee.org/abstract/document/9739990},
doi = {10.1109/TII.2022.3160696},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Industrial Informatics},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Yüzügüler, Ahmet Caner; Dimitriadis, Nikolaos; Frossard, Pascal
U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search Technical Report
2022.
@techreport{yuzuguler2022u,
title = {U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search},
author = {Ahmet Caner Yüzügüler and Nikolaos Dimitriadis and Pascal Frossard},
url = {https://arxiv.org/abs/2203.12412},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {arXiv preprint arXiv:2203.12412},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xie, Yirong; Chen, Hong; Ma, Yongjie; Xu, Yang
Automated design of CNN architecture based on efficient evolutionary search Journal Article
In: Neurocomputing, vol. 491, pp. 160-171, 2022, ISSN: 0925-2312.
@article{XIE2022160,
title = {Automated design of CNN architecture based on efficient evolutionary search},
author = {Yirong Xie and Hong Chen and Yongjie Ma and Yang Xu},
url = {https://www.sciencedirect.com/science/article/pii/S092523122200340X},
doi = {10.1016/j.neucom.2022.03.046},
issn = {0925-2312},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Neurocomputing},
volume = {491},
pages = {160-171},
abstract = {Evolutionary Neural Architecture Search (ENAS) is a promising method for the automated design of deep network architecture, which has attracted extensive attention in the field of automated machine learning. However, the existing ENAS methods often need a lot of computing resources to design CNN architecture automatically. In order to achieve efficient and automated design of CNNs, this paper focuses on two aspects to improve efficiency. On the one hand, efficient CNN-based building blocks are introduced to ensure the effectiveness of the generated architectures and a triplet attention mechanism is incorporated into the architectures to further improve the classification performance. On the other hand, a random forest-based performance predictor is used in the fitness evaluation to reduce the amount of computation required to train each individual from scratch. Experimental results show that the proposed algorithm can significantly reduce the computational resources required and achieve competitive classification performance on the CIFAR dataset. Also, the architecture designed for the traffic sign recognition task exceeds the accuracy of manual expert design.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Benmeziane, Hadjer; Ouarnoughi, Hamza; Maghraoui, Kaoutar El; Niar, Smail
Real-Time Style Transfer with Efficient Vision Transformers Proceedings Article
In: Proceedings of the 5th International Workshop on Edge Systems, Analytics and Networking, pp. 31–36, Association for Computing Machinery, Rennes, France, 2022, ISBN: 9781450392532.
@inproceedings{10.1145/3517206.3526271,
title = {Real-Time Style Transfer with Efficient Vision Transformers},
author = {Hadjer Benmeziane and Hamza Ouarnoughi and Kaoutar El Maghraoui and Smail Niar},
url = {https://doi.org/10.1145/3517206.3526271},
doi = {10.1145/3517206.3526271},
isbn = {9781450392532},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 5th International Workshop on Edge Systems, Analytics and Networking},
pages = {31–36},
publisher = {Association for Computing Machinery},
address = {Rennes, France},
series = {EdgeSys '22},
abstract = {Style Transfer aims at transferring the artistic style from a reference image to a content image. While Deep Learning (DL) has achieved state-of-the-art Style Transfer performance using Convolutional Neural Networks (CNNs), its real-time application still requires powerful hardware such as GPU-accelerated systems. This paper leverages transformer-based models to accelerate real-time Style Transfer on mobile and embedded hardware platforms. We designed a Neural Architecture Search (NAS) algorithm dedicated to vision transformers to find the best set of architecture hyperparameters that maximizes the Style Transfer performance, expressed in frames per second (FPS). Our approach has been evaluated and validated on the Xiaomi Redmi 7 mobile phone and the Raspberry Pi 3 platform. Experimental evaluation shows that our approach achieves 3.5× and 2.1× speedups compared to CNN-based Style Transfer models and Transformer-based models, respectively.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Rajesh, Chilukamari; Kumar, Sushil
An evolutionary block based network for medical image denoising using Differential Evolution Journal Article
In: Applied Soft Computing, vol. 121, pp. 108776, 2022, ISSN: 1568-4946.
@article{RAJESH2022108776,
title = {An evolutionary block based network for medical image denoising using Differential Evolution},
author = {Chilukamari Rajesh and Sushil Kumar},
url = {https://www.sciencedirect.com/science/article/pii/S1568494622002022},
doi = {10.1016/j.asoc.2022.108776},
issn = {1568-4946},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Applied Soft Computing},
volume = {121},
pages = {108776},
abstract = {Image denoising is a key component in several computer vision and image processing operations due to unavoidable noise in the image generation process. For medical image processing, deep convolutional neural networks (CNNs) give state-of-the-art performance. However, network structures are manually constructed for specific tasks and require several trials to tune a large number of hyperparameters, so constructing a network can take a long time. Additionally, the fittest hyperparameters, which may suit source data properties such as noise characteristics, cannot easily be transferred to target data. Realistic noise in medical images is generally mixed, complex, and unpredictable, which makes it difficult to design an efficient denoising network. In this paper, we develop a Differential Evolution (DE) based automatic network evolution model to optimize network architectures and hyperparameters by exploring the fittest parameters. Furthermore, we adopt a transfer learning technique to accelerate the training process. The proposed evolutionary algorithm is flexible and finds promising network architectures using well-known building blocks, including residual and dense blocks. Finally, the proposed model was evaluated on four different medical image datasets. The results obtained at different noise levels show the potential of the proposed model, named DEvoNet, for identifying the optimal parameters to develop a high-performance denoising network structure.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhou, Qinqin; Sheng, Kekai; Zheng, Xiawu; Li, Ke; Sun, Xing; Tian, Yonghong; Chen, Jie; Ji, Rongrong
Training-free Transformer Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-12217,
title = {Training-free Transformer Architecture Search},
author = {Qinqin Zhou and Kekai Sheng and Xiawu Zheng and Ke Li and Xing Sun and Yonghong Tian and Jie Chen and Rongrong Ji},
url = {https://doi.org/10.48550/arXiv.2203.12217},
doi = {10.48550/arXiv.2203.12217},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.12217},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Mok, Jisoo; Na, Byunggook; Kim, Ji-Hoon; Han, Dongyoon; Yoon, Sungroh
Demystifying the Neural Tangent Kernel from a Practical Perspective: Can it be trusted for Neural Architecture Search without training? Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-14577,
title = {Demystifying the Neural Tangent Kernel from a Practical Perspective: Can it be trusted for Neural Architecture Search without training?},
author = {Jisoo Mok and Byunggook Na and Ji-Hoon Kim and Dongyoon Han and Sungroh Yoon},
url = {https://doi.org/10.48550/arXiv.2203.14577},
doi = {10.48550/arXiv.2203.14577},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.14577},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Das, Mayukh; Singh, Brijraj; Chheda, Harsh Kanti; Sharma, Pawan; NS, Pradeep
AutoCoMet: Smart Neural Architecture Search via Co-Regulated Shaping Reinforcement Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-15408,
title = {AutoCoMet: Smart Neural Architecture Search via Co-Regulated Shaping Reinforcement},
author = {Mayukh Das and Brijraj Singh and Harsh Kanti Chheda and Pawan Sharma and Pradeep NS},
url = {https://doi.org/10.48550/arXiv.2203.15408},
doi = {10.48550/arXiv.2203.15408},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.15408},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Yang, Jin; Huang, Yingying; Jiang, Guangxin; Chen, Ying
An Intelligent End-to-End Neural Architecture Search Framework for Electricity Forecasting Model Development Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-13563,
title = {An Intelligent End-to-End Neural Architecture Search Framework for Electricity Forecasting Model Development},
author = {Jin Yang and Yingying Huang and Guangxin Jiang and Ying Chen},
url = {https://doi.org/10.48550/arXiv.2203.13563},
doi = {10.48550/arXiv.2203.13563},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.13563},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Sun, Haiyang; Lian, Zheng; Liu, Bin; Li, Ying; Sun, Licai; Cai, Cong; Tao, Jianhua; Wang, Meng; Cheng, Yuan
EmotionNAS: Two-stream Architecture Search for Speech Emotion Recognition Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-13617,
title = {EmotionNAS: Two-stream Architecture Search for Speech Emotion Recognition},
author = {Haiyang Sun and Zheng Lian and Bin Liu and Ying Li and Licai Sun and Cong Cai and Jianhua Tao and Meng Wang and Yuan Cheng},
url = {https://doi.org/10.48550/arXiv.2203.13617},
doi = {10.48550/arXiv.2203.13617},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.13617},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Lu, Bingqian; Yan, Zheyu; Shi, Yiyu; Ren, Shaolei
A Semi-Decoupled Approach to Fast and Optimal Hardware-Software Co-Design of Neural Accelerators Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-13921,
title = {A Semi-Decoupled Approach to Fast and Optimal Hardware-Software Co-Design of Neural Accelerators},
author = {Bingqian Lu and Zheyu Yan and Yiyu Shi and Shaolei Ren},
url = {https://doi.org/10.48550/arXiv.2203.13921},
doi = {10.48550/arXiv.2203.13921},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.13921},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
M., Abishai Ebenezer; Arya, Arti
An Atypical Metaheuristic Approach to Recognize an Optimal Architecture of a Neural Network Proceedings Article
In: Proceedings of the 14th International Conference on Agents and Artificial Intelligence, ICAART 2022, Volume 3, pp. 917–925, SCITEPRESS, 2022.
@inproceedings{DBLP:conf/icaart/MA22,
title = {An Atypical Metaheuristic Approach to Recognize an Optimal Architecture of a Neural Network},
author = {Abishai Ebenezer M. and Arti Arya},
editor = {Ana Paula Rocha and Luc Steels and H. Jaap Herik},
url = {https://doi.org/10.5220/0010951600003116},
doi = {10.5220/0010951600003116},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 14th International Conference on Agents and Artificial Intelligence, ICAART 2022, Volume 3, Online Streaming, February 3-5, 2022},
pages = {917--925},
publisher = {SCITEPRESS},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Wang, Chunnan; Chen, Xingyu; Wu, Chengyue; Wang, Hongzhi
AutoTS: Automatic Time Series Forecasting Model Design Based on Two-Stage Pruning Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-14169,
title = {AutoTS: Automatic Time Series Forecasting Model Design Based on Two-Stage Pruning},
author = {Chunnan Wang and Xingyu Chen and Chengyue Wu and Hongzhi Wang},
url = {https://doi.org/10.48550/arXiv.2203.14169},
doi = {10.48550/arXiv.2203.14169},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.14169},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Zaman, Khalid; Sun, Zhaoyun; Shah, Sayyed Mudassar; Shoaib, Muhammad; Pei, Lili; Hussain, Altaf
Driver Emotions Recognition Based on Improved Faster R-CNN and Neural Architectural Search Network Journal Article
In: Symmetry, vol. 14, no. 4, pp. 687, 2022.
@article{DBLP:journals/symmetry/ZamanSSSPH22,
title = {Driver Emotions Recognition Based on Improved Faster R-CNN and Neural Architectural Search Network},
author = {Khalid Zaman and Zhaoyun Sun and Sayyed Mudassar Shah and Muhammad Shoaib and Lili Pei and Altaf Hussain},
url = {https://doi.org/10.3390/sym14040687},
doi = {10.3390/sym14040687},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Symmetry},
volume = {14},
number = {4},
pages = {687},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zheng, Ruiqi; Qu, Liang; Cui, Bin; Shi, Yuhui; Yin, Hongzhi
AutoML for Deep Recommender Systems: A Survey Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-13922,
title = {AutoML for Deep Recommender Systems: A Survey},
author = {Ruiqi Zheng and Liang Qu and Bin Cui and Yuhui Shi and Hongzhi Yin},
url = {https://doi.org/10.48550/arXiv.2203.13922},
doi = {10.48550/arXiv.2203.13922},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.13922},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Raychaudhuri, Dripta S.; Suh, Yumin; Schulter, Samuel; Yu, Xiang; Faraki, Masoud; Roy-Chowdhury, Amit K.; Chandraker, Manmohan
Controllable Dynamic Multi-Task Architectures Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-14949,
title = {Controllable Dynamic Multi-Task Architectures},
author = {Dripta S. Raychaudhuri and Yumin Suh and Samuel Schulter and Xiang Yu and Masoud Faraki and Amit K. Roy-Chowdhury and Manmohan Chandraker},
url = {https://doi.org/10.48550/arXiv.2203.14949},
doi = {10.48550/arXiv.2203.14949},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.14949},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Wang, Rui; Bai, Qibing; Ao, Junyi; Zhou, Long; Xiong, Zhixiang; Wei, Zhihua; Zhang, Yu; Ko, Tom; Li, Haizhou
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-15610,
title = {LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT},
author = {Rui Wang and Qibing Bai and Junyi Ao and Long Zhou and Zhixiang Xiong and Zhihua Wei and Yu Zhang and Tom Ko and Haizhou Li},
url = {https://doi.org/10.48550/arXiv.2203.15610},
doi = {10.48550/arXiv.2203.15610},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.15610},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Chang, Yangyang; Sobelman, Gerald E.
Lightweight CNN Frameworks and their Optimization using Evolutionary Algorithms Proceedings Article
In: 2022 International Electrical Engineering Congress (iEECON), pp. 1-4, 2022.
@inproceedings{9741692,
title = {Lightweight CNN Frameworks and their Optimization using Evolutionary Algorithms},
author = {Yangyang Chang and Gerald E. Sobelman},
url = {https://ieeexplore.ieee.org/abstract/document/9741692},
doi = {10.1109/iEECON53204.2022.9741692},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {2022 International Electrical Engineering Congress (iEECON)},
pages = {1-4},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Park, Gunju; Yi, Youngmin
CondNAS: Neural Architecture Search for Conditional CNNs Journal Article
In: Electronics, vol. 11, no. 7, 2022, ISSN: 2079-9292.
@article{electronics11071101,
title = {CondNAS: Neural Architecture Search for Conditional CNNs},
author = {Gunju Park and Youngmin Yi},
url = {https://www.mdpi.com/2079-9292/11/7/1101},
doi = {10.3390/electronics11071101},
issn = {2079-9292},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Electronics},
volume = {11},
number = {7},
abstract = {As deep learning has become prevalent and adopted in various application domains, the need for efficient convolutional neural network (CNN) inference on diverse target platforms has increased. To address the need, a neural architecture search (NAS) technique called once-for-all, or OFA, which aims to efficiently find the optimal CNN architecture for the given target platform using a genetic algorithm (GA), has recently been proposed. Meanwhile, a conditional CNN architecture, which allows early exits with auxiliary classifiers in the middle of a network to achieve efficient inference without accuracy loss or with negligible loss, has been proposed. In this paper, we propose a NAS technique for the conditional CNN architecture, CondNAS, which efficiently finds a near-optimal conditional CNN architecture for the target platform using GA. By attaching auxiliary classifiers through adaptive pooling, OFA’s SuperNet is successfully extended, such that it incorporates the various conditional CNN sub-networks. In addition, we devise machine learning-based prediction models for the accuracy and latency of an arbitrary conditional CNN, which are used in the GA of CondNAS to efficiently explore the large search space. The experimental results show that the conditional CNNs from CondNAS are 2.52× and 1.75× faster than the CNNs from OFA for the Galaxy Note10+ GPU and CPU, respectively.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhou, Qinghua; Gorban, Alexander N.; Mirkes, Evgeny M.; Bac, Jonathan; Zinovyev, Andrei Yu.; Tyukin, Ivan Yu.
Quasi-orthogonality and intrinsic dimensions as measures of learning and generalisation Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-16687,
title = {Quasi-orthogonality and intrinsic dimensions as measures of learning and generalisation},
author = {Qinghua Zhou and Alexander N. Gorban and Evgeny M. Mirkes and Jonathan Bac and Andrei Yu. Zinovyev and Ivan Yu. Tyukin},
url = {https://doi.org/10.48550/arXiv.2203.16687},
doi = {10.48550/arXiv.2203.16687},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.16687},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Li, Yawei
Towards Efficient Deep Neural Networks PhD Thesis
ETH Zurich, 2022.
@phdthesis{20.500.11850/540498,
title = {Towards Efficient Deep Neural Networks},
author = {Yawei Li},
doi = {10.3929/ethz-b-000540498},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
publisher = {ETH Zurich},
address = {Zurich},
school = {ETH Zurich},
abstract = {Computational efficiency is an essential factor that influences the applicability of computer vision algorithms. Although deep neural networks have reached state-of-the-art performance in a variety of computer vision tasks, deep learning based solutions suffer from several efficiency-related problems. First, the overparameterization of deep neural networks results in models with millions of parameters, which lowers the parameter efficiency of the designed networks. To store the parameters and intermediate feature maps during the computation, a large device memory footprint is required. Secondly, the massive computation in deep neural networks slows down their training and inference. This limits the application of deep neural networks to latency-demanding scenarios and low-end devices. Thirdly, the massive computation consumes a significant amount of energy, which leaves a large carbon footprint of deep learning models. The aim of this thesis is to improve the computational efficiency of current deep neural networks. This problem is tackled from three perspectives: neural network compression, neural architecture optimization, and computational procedure optimization. In the first part of the thesis, we reduce the model complexity of neural networks by network compression techniques including filter decomposition and filter pruning. The basic assumption for filter decomposition is that the ensemble of filters in deep neural networks constitutes an overcomplete set. Instead of using the original filters directly during the computation, they can be approximated by a linear combination of a set of basis filters. The contribution of this thesis is to provide a unified analysis of previous filter decomposition methods. On the other hand, a differentiable filter pruning method is proposed. To achieve differentiability, the layers of neural networks are reparameterized by a meta network. Sparsity regularization is applied to the input of the meta network, i.e. latent vectors. Optimizing with the introduced regularization leads to an automatic network pruning method. Additionally, a joint analysis of filter decomposition and filter pruning is presented from the perspective of compact tensor approximation. The hinge of the two techniques is the introduced sparsity-inducing matrix. By simply changing the way the group sparsity regularization is enforced on the matrix, the two techniques can be derived accordingly. Secondly, we try to improve the performance of a baseline network by a fine-grained neural architecture optimization method. Unlike network compression methods, the aim of this method is to improve the prediction accuracy of neural networks while reducing their model complexity at the same time. Achieving the two targets simultaneously makes the problem more challenging. In addition, a nearly cost-free constraint is enforced during the architecture optimization, which differs from current neural architecture search methods with bulky computation. This can be regarded as another efficiency-improving technique. Thirdly, we optimize the computational procedure of graph neural networks. By mathematically analyzing the operations in graph neural networks, two methods are proposed to improve the computational efficiency. The first method simplifies neighbor querying in graph neural networks, while the second involves shuffling the order of the graph feature gathering and feature extraction operations. To summarize, this thesis contributes to multiple aspects of improving the computational efficiency of neural networks during the optimization, training, and test phases.},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}