Maintained by Difan Deng and Marius Lindauer.
The following list considers papers related to neural architecture search. It is by no means complete. If a paper is missing from the list, please let us know.
Please note that although NAS methods steadily improve, the quality of empirical evaluations in this field is still lagging behind compared to other areas of machine learning, AI, and optimization. We would therefore like to share some best practices for empirical evaluations of NAS methods, which we believe will facilitate sustained and measurable progress in the field. If you are interested in a teaser, please read our blog post or jump directly to our checklist.
Transformers have gained increasing popularity in different domains. For a comprehensive list of papers focusing on Neural Architecture Search for Transformer-based search spaces, the awesome-transformer-search repo is all you need.
2022
Ahn, Saehyun; Chang, Jung-Woo; Yoon, Hyeon-Seok; Kang, Suk-Ju
TouchNAS: Efficient Touch Detection Model Design Methodology for Resource-Constrained Devices Journal Article
In: IEEE Sensors Journal, pp. 1-1, 2022.
@article{9695433,
title = {TouchNAS: Efficient Touch Detection Model Design Methodology for Resource-Constrained Devices},
author = {Saehyun Ahn and Jung-Woo Chang and Hyeon-Seok Yoon and Suk-Ju Kang},
url = {https://ieeexplore.ieee.org/abstract/document/9695433},
doi = {10.1109/JSEN.2022.3147469},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Sensors Journal},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Hassantabar, Shayan; Dai, Xiaoliang; Jha, Niraj K.
CURIOUS: Efficient Neural Architecture Search Based on a Performance Predictor and Evolutionary Search Journal Article
In: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, pp. 1-1, 2022.
@article{9698855,
title = {CURIOUS: Efficient Neural Architecture Search Based on a Performance Predictor and Evolutionary Search},
author = {Shayan Hassantabar and Xiaoliang Dai and Niraj K. Jha},
url = {https://ieeexplore.ieee.org/abstract/document/9698855},
doi = {10.1109/TCAD.2022.3148202},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Li, Wei; Wen, Shiping; Shi, Kaibo; Yang, Yin; Huang, Tingwen
Neural Architecture Search with a Lightweight Transformer for Text-to-Image Synthesis Journal Article
In: IEEE Transactions on Network Science and Engineering, pp. 1-1, 2022.
@article{9699403,
title = {Neural Architecture Search with a Lightweight Transformer for Text-to-Image Synthesis},
author = {Wei Li and Shiping Wen and Kaibo Shi and Yin Yang and Tingwen Huang},
url = {https://ieeexplore.ieee.org/abstract/document/9699403},
doi = {10.1109/TNSE.2022.3147787},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Network Science and Engineering},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Pauletto, Loïc; Amini, Massih-Reza; Winckler, Nicolas
Self Semi Supervised Neural Architecture Search for Semantic Segmentation Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2201-12646,
title = {Self Semi Supervised Neural Architecture Search for Semantic Segmentation},
author = {Loïc Pauletto and Massih-Reza Amini and Nicolas Winckler},
url = {https://arxiv.org/abs/2201.12646},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2201.12646},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xu, Dongkuan; Mukherjee, Subhabrata; Liu, Xiaodong; Dey, Debadeepta; Wang, Wenhui; Zhang, Xiang; Awadallah, Ahmed Hassan; Gao, Jianfeng
AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2201-12507,
title = {AutoDistil: Few-shot Task-agnostic Neural Architecture Search for Distilling Large Language Models},
author = {Dongkuan Xu and Subhabrata Mukherjee and Xiaodong Liu and Debadeepta Dey and Wenhui Wang and Xiang Zhang and Ahmed Hassan Awadallah and Jianfeng Gao},
url = {https://arxiv.org/abs/2201.12507},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2201.12507},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Wei, Hui; Lee, Feifei; Hu, Chunyan; Chen, Qiu
MOO-DNAS: Efficient Neural Network Design via Differentiable Architecture Search Based on Multi-Objective Optimization Journal Article
In: IEEE Access, vol. 10, pp. 14195-14207, 2022.
@article{9698215,
title = {MOO-DNAS: Efficient Neural Network Design via Differentiable Architecture Search Based on Multi-Objective Optimization},
author = {Hui Wei and Feifei Lee and Chunyan Hu and Qiu Chen},
url = {https://ieeexplore.ieee.org/abstract/document/9698215},
doi = {10.1109/ACCESS.2022.3148323},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Access},
volume = {10},
pages = {14195-14207},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Guo, Bicheng; He, Shibo; Chen, Tao; Chen, Jiming; Ye, Peng
Neural Architecture Ranker Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2201-12725,
title = {Neural Architecture Ranker},
author = {Bicheng Guo and Shibo He and Tao Chen and Jiming Chen and Peng Ye},
url = {https://arxiv.org/abs/2201.12725},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2201.12725},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Na, Byunggook; Mok, Jisoo; Park, Seongsik; Lee, Dongjin; Choe, Hyeokjun; Yoon, Sungroh
AutoSNN: Towards Energy-Efficient Spiking Neural Networks Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2201-12738,
title = {AutoSNN: Towards Energy-Efficient Spiking Neural Networks},
author = {Byunggook Na and Jisoo Mok and Seongsik Park and Dongjin Lee and Hyeokjun Choe and Sungroh Yoon},
url = {https://arxiv.org/abs/2201.12738},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2201.12738},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Rguibi, Zakaria; Hajami, Abdelmajid; Dya, Zitouni
Automatic Searching of Deep Neural Networks for Medical Imaging Diagnostic Proceedings Article
In: Saidi, Rajaa; Bhiri, Brahim El; Maleh, Yassine; Mosallam, Ayman; Essaaidi, Mohammed (Ed.): Advanced Technologies for Humanity, pp. 129–140, Springer International Publishing, Cham, 2022, ISBN: 978-3-030-94188-8.
@inproceedings{10.1007/978-3-030-94188-8_13,
title = {Automatic Searching of Deep Neural Networks for Medical Imaging Diagnostic},
author = {Zakaria Rguibi and Abdelmajid Hajami and Zitouni Dya},
editor = {Rajaa Saidi and Brahim El Bhiri and Yassine Maleh and Ayman Mosallam and Mohammed Essaaidi},
isbn = {978-3-030-94188-8},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Advanced Technologies for Humanity},
pages = {129--140},
publisher = {Springer International Publishing},
address = {Cham},
abstract = {Medical imaging diagnosis is the most common assistive method for helping physicians diagnose patient diseases using different imaging test modalities. However, imbalanced data is one of the biggest challenges in the field of medical imaging. To advance this field, this work proposes a framework that can be used to find optimal DNN architectures for databases with imbalanced datasets. In our paper, we present a framework for automatic deep neural network search for medical imaging diagnosis.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Cardoso, Rui P.; Hart, Emma; Kurka, David Burth; Pitt, Jeremy V.
Augmenting Novelty Search with a Surrogate Model to Engineer Meta-Diversity in Ensembles of Classifiers Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2201-12896,
title = {Augmenting Novelty Search with a Surrogate Model to Engineer Meta-Diversity in Ensembles of Classifiers},
author = {Rui P. Cardoso and Emma Hart and David Burth Kurka and Jeremy V. Pitt},
url = {https://arxiv.org/abs/2201.12896},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2201.12896},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Mehta, Yash; White, Colin; Zela, Arber; Krishnakumar, Arjun; Zabergja, Guri; Moradian, Shakiba; Safari, Mahmoud; Yu, Kaicheng; Hutter, Frank
NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy Proceedings Article
In: ICLR 2022, 2022.
@inproceedings{DBLP:journals/corr/abs-2201-13396,
title = {NAS-Bench-Suite: NAS Evaluation is (Now) Surprisingly Easy},
author = {Yash Mehta and Colin White and Arber Zela and Arjun Krishnakumar and Guri Zabergja and Shakiba Moradian and Mahmoud Safari and Kaicheng Yu and Frank Hutter},
url = {https://arxiv.org/abs/2201.13396},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {ICLR 2022},
journal = {CoRR},
volume = {abs/2201.13396},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Kang, Ziyang; Wang, Shiying; Wang, Lei; Li, Shiming; Qu, Lianhua; Xu, Weixia
Hardware-aware liquid state machine generation for 2D/3D Network-on-Chip platforms Journal Article
In: Journal of Systems Architecture, vol. 124, pp. 102429, 2022, ISSN: 1383-7621.
@article{KANG2022102429,
title = {Hardware-aware liquid state machine generation for 2D/3D Network-on-Chip platforms},
author = {Ziyang Kang and Shiying Wang and Lei Wang and Shiming Li and Lianhua Qu and Weixia Xu},
url = {https://www.sciencedirect.com/science/article/pii/S1383762122000297},
doi = {10.1016/j.sysarc.2022.102429},
issn = {1383-7621},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Journal of Systems Architecture},
volume = {124},
pages = {102429},
abstract = {The liquid state machine (LSM) is a spiking neural network (SNN) that is usually mapped offline to an NoC-based neuromorphic processor to perform a specific task. The creation of these LSM models does not consider the structure of the Network on Chip (NoC), which results in heavy communication pressure on the NoC. This paper proposes a hardware-aware generation framework for LSM networks that considers the spatial distribution of neurons in the NoC; it is the first LSM generation work to incorporate the characteristics of the NoC. The framework also adopts a heuristic algorithm to search the hyperparameters of the LSM networks, achieving state-of-the-art accuracy, and reduces the spikes generated by the LSM models. It keeps communication between neurons within cores as much as possible, which effectively reduces the communication between cores and improves the performance of the NoC: it reduces the traffic flow, lowers the average latency, improves the throughput, and shortens the total running time.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Hu, Xing; Liang, Ling; Chen, Xiaobing; Deng, Lei; Ji, Yu; Ding, Yufei; Du, Zidong; Guo, Qi; Sherwood, Tim; Xie, Yuan
A Systematic View of Model Leakage Risks in Deep Neural Network Systems Journal Article
In: IEEE Transactions on Computers, pp. 1-1, 2022.
@article{9705069,
title = {A Systematic View of Model Leakage Risks in Deep Neural Network Systems},
author = {Xing Hu and Ling Liang and Xiaobing Chen and Lei Deng and Yu Ji and Yufei Ding and Zidong Du and Qi Guo and Tim Sherwood and Yuan Xie},
url = {https://ieeexplore.ieee.org/abstract/document/9705069},
doi = {10.1109/TC.2022.3148235},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Computers},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Dushatskiy, Arkadiy; Alderliesten, Tanja; Bosman, Peter A. N.
Heed the Noise in Performance Evaluations in Neural Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-02078,
title = {Heed the Noise in Performance Evaluations in Neural Architecture Search},
author = {Arkadiy Dushatskiy and Tanja Alderliesten and Peter A. N. Bosman},
url = {https://arxiv.org/abs/2202.02078},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.02078},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Cho, Hyunghun; Shin, Jungwook; Rhee, Wonjong
B2EA: An Evolutionary Algorithm Assisted by Two Bayesian Optimization Modules for Neural Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-03005,
title = {B2EA: An Evolutionary Algorithm Assisted by Two Bayesian Optimization Modules for Neural Architecture Search},
author = {Hyunghun Cho and Jungwook Shin and Wonjong Rhee},
url = {https://arxiv.org/abs/2202.03005},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.03005},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Cho, Minsu; Joshi, Ameya; Garg, Siddharth; Reagen, Brandon; Hegde, Chinmay
Selective Network Linearization for Efficient Private Inference Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-02340,
title = {Selective Network Linearization for Efficient Private Inference},
author = {Minsu Cho and Ameya Joshi and Siddharth Garg and Brandon Reagen and Chinmay Hegde},
url = {https://arxiv.org/abs/2202.02340},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.02340},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Mazumder, Arnab Neelim; Mohsenin, Tinoosh
A Fast Network Exploration Strategy to Profile Low Energy Consumption for Keyword Spotting Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-02361,
title = {A Fast Network Exploration Strategy to Profile Low Energy Consumption for Keyword Spotting},
author = {Arnab Neelim Mazumder and Tinoosh Mohsenin},
url = {https://arxiv.org/abs/2202.02361},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.02361},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Hassanzadeh, Tahereh; Essam, Daryl; Sarker, Ruhul
EvoDCNN: An Evolutionary Deep Convolutional Neural Network for Image Classification Journal Article
In: Neurocomputing, 2022, ISSN: 0925-2312.
@article{HASSANZADEH2022,
title = {EvoDCNN: An Evolutionary Deep Convolutional Neural Network for Image Classification},
author = {Tahereh Hassanzadeh and Daryl Essam and Ruhul Sarker},
url = {https://www.sciencedirect.com/science/article/pii/S092523122200145X},
doi = {10.1016/j.neucom.2022.02.003},
issn = {0925-2312},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Neurocomputing},
abstract = {Developing Deep Convolutional Neural Networks (DCNNs) for image classification is a complicated task that needs considerable effort and knowledge. By employing an evolutionary computation approach, one can automatically generate the network models. However, neuroevolution is computationally expensive and in some cases needs hundreds of GPU days for training. Therefore, there is a need to find optimal neuroevolutionary models with minimal computation. In this paper, utilising a Genetic Algorithm (GA), we introduce EvoDCNN, a block-based evolutionary model for developing deep convolutional networks for image classification. By using the proposed fixed-length encoding model, we can generate variable-length networks with high accuracy while using less computation. Through a straightforward evolutionary framework, the proposed model is able to establish small networks with high classification accuracy. Eight datasets (CIFAR10, MNIST, and six versions of EMNIST, including balanced and unbalanced datasets) are used for evaluating the proposed model. In a comprehensive evaluation against many previous works, we outperformed the previous state-of-the-art classification accuracy on five of the datasets.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
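A note on the fixed-length encoding idea in the abstract above: a genome of constant length can still decode to networks of varying depth if individual genes can be switched off. The following Python sketch illustrates that mechanism under assumed gene semantics; the block layout, filter choices, and activation rate are illustrative, not EvoDCNN's actual encoding.

import random

GENOME_LENGTH = 8                      # fixed number of block genes
FILTER_CHOICES = [16, 32, 64, 128]

def random_genome():
    # Each gene is (active?, filter count); inactive genes are skipped
    # at decode time, so the decoded depth varies per individual.
    return [(random.random() < 0.7, random.choice(FILTER_CHOICES))
            for _ in range(GENOME_LENGTH)]

def decode(genome):
    # Fixed-length genome -> variable-length list of conv blocks.
    return [f"Conv3x3({filters})-BN-ReLU"
            for active, filters in genome if active]

genome = random_genome()
print(decode(genome))                  # depth depends on the active genes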
Rabczuk, Timon; Guo, Hongwei; Zhuang, Xiaoying; Chen, Pengwan; Alajlan, Naif
Stochastic deep collocation method based on neural architecture search and transfer learning for heterogeneous porous media Journal Article
In: Engineering with Computers, vol. 2022, pp. 1–26, 2022.
@article{RabczukGuoZhuangetal.2022,
title = {Stochastic deep collocation method based on neural architecture search and transfer learning for heterogeneous porous media},
author = {Timon Rabczuk and Hongwei Guo and Xiaoying Zhuang and Pengwan Chen and Naif Alajlan},
url = {https://link.springer.com/article/10.1007/s00366-021-01586-2},
doi = {10.1007/s00366-021-01586-2},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Engineering with Computers},
volume = {2022},
pages = {1--26},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Lee, Jemin; Yu, Misun; Kwon, Yongin; Kim, Taeho
Quantune: Post-training Quantization of Convolutional Neural Networks using Extreme Gradient Boosting for Fast Deployment Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-05048,
title = {Quantune: Post-training Quantization of Convolutional Neural Networks using Extreme Gradient Boosting for Fast Deployment},
author = {Jemin Lee and Misun Yu and Yongin Kwon and Taeho Kim},
url = {https://arxiv.org/abs/2202.05048},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.05048},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Wang, Duo; Zhao, Yiren; Shumailov, Ilia; Mullins, Robert D.
Model Architecture Adaption for Bayesian Neural Networks Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-04392,
title = {Model Architecture Adaption for Bayesian Neural Networks},
author = {Duo Wang and Yiren Zhao and Ilia Shumailov and Robert D. Mullins},
url = {https://arxiv.org/abs/2202.04392},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.04392},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Elsken, Thomas; Zela, Arber; Metzen, Jan Hendrik; Staffler, Benedikt; Brox, Thomas; Valada, Abhinav; Hutter, Frank
Neural Architecture Search for Dense Prediction Tasks in Computer Vision Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-07242,
title = {Neural Architecture Search for Dense Prediction Tasks in Computer Vision},
author = {Thomas Elsken and Arber Zela and Jan Hendrik Metzen and Benedikt Staffler and Thomas Brox and Abhinav Valada and Frank Hutter},
url = {https://arxiv.org/abs/2202.07242},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.07242},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Sun, Junding; Yao, Chong; Liu, Jie; Liu, Weifan; Yu, Zekuan
GNAS-U2Net: A new optic cup and optic disc segmentation architecture with genetic neural architecture search Journal Article
In: IEEE Signal Processing Letters, pp. 1-1, 2022.
@article{9713989,
title = {GNAS-U2Net: A new optic cup and optic disc segmentation architecture with genetic neural architecture search},
author = {Junding Sun and Chong Yao and Jie Liu and Weifan Liu and Zekuan Yu},
url = {https://ieeexplore.ieee.org/abstract/document/9713989},
doi = {10.1109/LSP.2022.3151549},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Signal Processing Letters},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Speckhard, Daniel T.; Misiunas, Karolis; Perel, Sagi; Zhu, Tenghui; Carlile, Simon; Slaney, Malcolm
Neural Architecture Search for Energy Efficient Always-on Audio Models Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-05397,
title = {Neural Architecture Search for Energy Efficient Always-on Audio Models},
author = {Daniel T. Speckhard and Karolis Misiunas and Sagi Perel and Tenghui Zhu and Simon Carlile and Malcolm Slaney},
url = {https://arxiv.org/abs/2202.05397},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.05397},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Jia, Liang; Tian, Ye; Zhang, Junguo
Domain-Aware Neural Architecture Search for Classifying Animals in Camera Trap Images Journal Article
In: Animals, vol. 12, no. 4, 2022, ISSN: 2076-2615.
@article{ani12040437,
title = {Domain-Aware Neural Architecture Search for Classifying Animals in Camera Trap Images},
author = {Liang Jia and Ye Tian and Junguo Zhang},
url = {https://www.mdpi.com/2076-2615/12/4/437},
doi = {10.3390/ani12040437},
issn = {2076-2615},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Animals},
volume = {12},
number = {4},
abstract = {Camera traps provide a feasible way for ecological researchers to observe wildlife, and they often produce millions of images of diverse species requiring classification. This classification can be automated via edge devices installed with convolutional neural networks, but networks may need to be customized per device because edge devices are highly heterogeneous and resource-limited. This can be addressed by a neural architecture search capable of automatically designing networks. However, search methods are usually developed based on benchmark datasets differing widely from camera trap images in many aspects including data distributions and aspect ratios. Therefore, we designed a novel search method conducted directly on camera trap images with lowered resolutions and maintained aspect ratios; the search is guided by a loss function whose hyperparameter is theoretically derived for finding lightweight networks. The search was applied to two datasets and led to lightweight networks tested on an edge device named NVIDIA Jetson X2. The resulting accuracies were competitive in comparison. In conclusion, researchers without knowledge of designing networks can obtain networks optimized for edge devices and thus establish or expand surveillance areas in a cost-effective way.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Yang, Junhuan; Sheng, Yi; Zhang, Sizhe; Wang, Ruixuan; Foreman, Kenneth; Paige, Mikell; Jiao, Xun; Jiang, Weiwen; Yang, Lei
Automated Architecture Search for Brain-inspired Hyperdimensional Computing Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-05827,
title = {Automated Architecture Search for Brain-inspired Hyperdimensional Computing},
author = {Junhuan Yang and Yi Sheng and Sizhe Zhang and Ruixuan Wang and Kenneth Foreman and Mikell Paige and Xun Jiao and Weiwen Jiang and Lei Yang},
url = {https://arxiv.org/abs/2202.05827},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.05827},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Huang, Mingqiang; Liu, Yucen; Cheng, Quan; Yang, Shuxin; Li, Kai; Luo, Junyi; Yang, Zhengke; Li, Qiufeng; Yu, Hao; Man, Changhai
A High Throughput Multi-Bit-Width 3D Systolic Accelerator for NAS Optimized Deep Neural Networks on FPGA Proceedings Article
In: Proceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, pp. 50, Association for Computing Machinery, Virtual Event, USA, 2022, ISBN: 9781450391498.
@inproceedings{10.1145/3490422.3502343,
title = {A High Throughput Multi-Bit-Width 3D Systolic Accelerator for NAS Optimized Deep Neural Networks on FPGA},
author = {Mingqiang Huang and Yucen Liu and Quan Cheng and Shuxin Yang and Kai Li and Junyi Luo and Zhengke Yang and Qiufeng Li and Hao Yu and Changhai Man},
url = {https://doi.org/10.1145/3490422.3502343},
doi = {10.1145/3490422.3502343},
isbn = {9781450391498},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays},
pages = {50},
publisher = {Association for Computing Machinery},
address = {Virtual Event, USA},
series = {FPGA '22},
abstract = {A neural architecture search (NAS) optimized multi-bit-width convolutional neural network (CNN) maintains the balance between network performance and efficiency, suggesting a promising method for accurate yet energy-efficient edge computing. In this work, we propose a high-throughput three-dimensional (3D) systolic accelerator for NAS-optimized CNNs, in which the input feature matrix, weight matrix and output feature matrix are delivered vertically, horizontally and perpendicularly through the systolic array, respectively. With the 3D systolic data flow, both the processing time and the logic resource consumption can be reduced compared to the classical non-stationary systolic array. In addition, a Booth-based multi-bit-width (INT2/4/8) multiply-add-accumulation (MAC) unit is developed within the 3D systolic accelerator. Deployed on the Xilinx ZCU102 FPGA platform, the peak performance of the convolutional layer reaches as high as 2775 GOPS for INT2, 1650 GOPS for INT4, and 816 GOPS for INT8, respectively. The average performance when accelerating the full NAS VGG16 network is 647 GOPS.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Teso-Fz-Betoño, Daniel; Zulueta, Ekaitz; Sanchez-Chica, Ander; Fernandez-Gamiz, Unai; Teso-Fz-Betoño, Adrian; Lopez-Guede, Jose Manuel
Neural architecture search for the estimation of relative positioning of the autonomous mobile robot Journal Article
In: Logic Journal of the IGPL, 2022, ISSN: 1367-0751, (jzac030).
@article{10.1093/jigpal/jzac030,
title = {Neural architecture search for the estimation of relative positioning of the autonomous mobile robot},
author = {Daniel Teso-Fz-Betoño and Ekaitz Zulueta and Ander Sanchez-Chica and Unai Fernandez-Gamiz and Adrian Teso-Fz-Betoño and Jose Manuel Lopez-Guede},
url = {https://doi.org/10.1093/jigpal/jzac030},
doi = {10.1093/jigpal/jzac030},
issn = {1367-0751},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Logic Journal of the IGPL},
abstract = {In the present work, an artificial neural network (ANN) will be developed to estimate the relative rotation and translation of the autonomous mobile robot (AMR). The ANN will work as an iterative closest point method, which is commonly used with the singular value decomposition algorithm. This development will provide better resolution for a relative positioning technique that is essential for AMR localization. The ANN requires a specific architecture, although in the current work a neural architecture search will be adapted to select the best ANN for estimating the relative motion. At the end, these ANNs will be compared with conventional algorithms to verify the performance of adopting an intelligent method for relative positioning estimation.},
note = {jzac030},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Chen, Jiamin; Gao, Jianliang; Chen, Yibo; Oloulade, Babatounde MOCTARD; Lyu, Tengfei; Li, Zhao
Auto-GNAS: A Parallel Graph Neural Architecture Search Framework Journal Article
In: IEEE Transactions on Parallel and Distributed Systems, pp. 1-1, 2022.
@article{9714826,
title = {Auto-GNAS: A Parallel Graph Neural Architecture Search Framework},
author = {Jiamin Chen and Jianliang Gao and Yibo Chen and Babatounde MOCTARD Oloulade and Tengfei Lyu and Zhao Li},
url = {https://ieeexplore.ieee.org/abstract/document/9714826},
doi = {10.1109/TPDS.2022.3151895},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Parallel and Distributed Systems},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Kim, Youngkee; Yun, Won Joon; Lee, Youn Kyu; Kim, Joongheon
Two-Stage Architectural Fine-Tuning with Neural Architecture Search using Early-Stopping in Image Classification Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-08604,
title = {Two-Stage Architectural Fine-Tuning with Neural Architecture Search using Early-Stopping in Image Classification},
author = {Youngkee Kim and Won Joon Yun and Youn Kyu Lee and Joongheon Kim},
url = {https://arxiv.org/abs/2202.08604},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.08604},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Kim, Jae Kwan; Ahn, Wonbin; Park, Sangin; Lee, Soo-Hong; Kim, Laehyun
Early Prediction of Sepsis Onset Using Neural Architecture Search Based on Genetic Algorithms Journal Article
In: International Journal of Environmental Research and Public Health, vol. 19, no. 4, 2022, ISSN: 1660-4601.
@article{ijerph19042349,
title = {Early Prediction of Sepsis Onset Using Neural Architecture Search Based on Genetic Algorithms},
author = {Jae Kwan Kim and Wonbin Ahn and Sangin Park and Soo-Hong Lee and Laehyun Kim},
url = {https://www.mdpi.com/1660-4601/19/4/2349},
doi = {10.3390/ijerph19042349},
issn = {1660-4601},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {International Journal of Environmental Research and Public Health},
volume = {19},
number = {4},
abstract = {Sepsis is a life-threatening condition with a high mortality rate. Early prediction and treatment are the most effective strategies for increasing survival rates. This paper proposes a neural architecture search (NAS) model to predict the onset of sepsis with a low computational cost and high search performance by applying a genetic algorithm (GA). The proposed model shares the weights of all possible connection nodes internally within the neural network. Externally, the search cost is reduced through the weight-sharing effect between the genotypes of the GA. A predictive analysis was performed using the Medical Information Mart for Intensive Care III (MIMIC-III), a medical time-series dataset, with the primary objective of predicting sepsis onset 3 h before occurrence. In addition, experiments were conducted under various prediction times (0-12 h) for comparison. The proposed model exhibited an area under the receiver operating characteristic curve (AUROC) score of 0.94 (95% CI: 0.92-0.96) for 3 h, which is 0.31-0.26 higher than the scores obtained using the Sequential Organ Failure Assessment (SOFA), quick SOFA (qSOFA), and Simplified Acute Physiology Score (SAPS) II scoring systems. Furthermore, the proposed model exhibited a 12% improvement in the AUROC value over a simple model based on the long short-term memory neural network. Additionally, it is not only optimally searchable for sepsis onset prediction, but also outperforms conventional models with similar predictive purposes and datasets. Notably, it is sufficiently robust to shape changes in the input data and has less structural dependence.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Dai, Liuyao; Cheng, Quan; Wang, Yuhang; Huang, Gengbin; Zhou, Junzhuo; Li, Kai; Mao, Wei; Yu, Hao
An Energy-Efficient Bit-Split-and-Combination Systolic Accelerator for NAS-Based Multi-Precision Convolution Neural Networks Proceedings Article
In: 2022 27th Asia and South Pacific Design Automation Conference (ASP-DAC), pp. 448-453, 2022.
@inproceedings{9712509,
title = {An Energy-Efficient Bit-Split-and-Combination Systolic Accelerator for NAS-Based Multi-Precision Convolution Neural Networks},
author = {Liuyao Dai and Quan Cheng and Yuhang Wang and Gengbin Huang and Junzhuo Zhou and Kai Li and Wei Mao and Hao Yu},
url = {https://ieeexplore.ieee.org/abstract/document/9712509},
doi = {10.1109/ASP-DAC52403.2022.9712509},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {2022 27th Asia and South Pacific Design Automation Conference (ASP-DAC)},
pages = {448-453},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Lee, Jooyeon; Park, Junsang; Lee, Seunghyun; Kung, Jaeha
Implication of Optimizing NPU Dataflows on Neural Architecture Search for Mobile Devices Journal Article
In: ACM Trans. Des. Autom. Electron. Syst., 2022, ISSN: 1084-4309, (Just Accepted).
@article{10.1145/3513085,
title = {Implication of Optimizing NPU Dataflows on Neural Architecture Search for Mobile Devices},
author = {Jooyeon Lee and Junsang Park and Seunghyun Lee and Jaeha Kung},
url = {https://doi.org/10.1145/3513085},
doi = {10.1145/3513085},
issn = {1084-4309},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {ACM Trans. Des. Autom. Electron. Syst.},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
abstract = {Recent advances in deep learning have made it possible to implement artificial intelligence in mobile devices. Many studies have put a lot of effort into developing lightweight deep learning models optimized for mobile devices. To overcome the performance limitations of manually designed deep learning models, an automated search algorithm, called neural architecture search (NAS), has been proposed. However, studies on the effect of the hardware architecture of the mobile device on the performance of NAS have been less explored. In this paper, we show the importance of optimizing a hardware architecture, namely the NPU dataflow, when searching for a more accurate yet fast deep learning model. To do so, we first implement an optimization framework, named FlowOptimizer, for generating the best possible NPU dataflow for a given deep learning operator. Then, we utilize this framework during the latency-aware NAS to find the model with the highest accuracy satisfying the latency constraint. As a result, we show that the model searched with FlowOptimizer outperforms the models searched with NVDLA and Eyeriss by 87.1% and 92.3% on average, respectively, with better accuracy on a proxy dataset. We also show that the searched model can be transferred to a larger model to classify a more complex image dataset, i.e., ImageNet, achieving 0.2%/5.4% higher Top-1/Top-5 accuracy compared to MobileNetV2-1.0 with 3.6× lower latency.},
note = {Just Accepted},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
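The latency-aware selection step described in the abstract above, picking the most accurate candidate whose latency satisfies the constraint, reduces to a filter-and-argmax. A toy Python sketch with invented candidate records and numbers; this is not FlowOptimizer's actual interface:

# Toy latency-constrained model selection; all records are invented.
candidates = [
    {"name": "net_a", "accuracy": 0.74, "latency_ms": 12.0},
    {"name": "net_b", "accuracy": 0.76, "latency_ms": 25.0},
    {"name": "net_c", "accuracy": 0.75, "latency_ms": 18.0},
]

LATENCY_BUDGET_MS = 20.0
feasible = [c for c in candidates if c["latency_ms"] <= LATENCY_BUDGET_MS]
best = max(feasible, key=lambda c: c["accuracy"])
print(best["name"])   # net_c: the most accurate model within the budget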
Zhang, Chunhui; Yuan, Xiaoming; Zhang, Qianyun; Zhu, Guangxu; Cheng, Lei; Zhang, Ning
Towards Tailored Models on Private AIoT Devices: Federated Direct Neural Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-11490,
title = {Towards Tailored Models on Private AIoT Devices: Federated Direct Neural Architecture Search},
author = {Chunhui Zhang and Xiaoming Yuan and Qianyun Zhang and Guangxu Zhu and Lei Cheng and Ning Zhang},
url = {https://arxiv.org/abs/2202.11490},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.11490},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Bosma, Martijn M. A.; Dushatskiy, Arkadiy; Grewal, Monika; Alderliesten, Tanja; Bosman, Peter A. N.
Mixed-Block Neural Architecture Search for Medical Image Segmentation Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-11401,
title = {Mixed-Block Neural Architecture Search for Medical Image Segmentation},
author = {Martijn M. A. Bosma and Arkadiy Dushatskiy and Monika Grewal and Tanja Alderliesten and Peter A. N. Bosman},
url = {https://arxiv.org/abs/2202.11401},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.11401},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Sheng, Yi; Yang, Junhuan; Wu, Yawen; Mao, Kevin; Shi, Yiyu; Hu, Jingtong; Jiang, Weiwen; Yang, Lei
The Larger The Fairer? Small Neural Networks Can Achieve Fairness for Edge Devices Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-11317,
title = {The Larger The Fairer? Small Neural Networks Can Achieve Fairness for Edge Devices},
author = {Yi Sheng and Junhuan Yang and Yawen Wu and Kevin Mao and Yiyu Shi and Jingtong Hu and Weiwen Jiang and Lei Yang},
url = {https://arxiv.org/abs/2202.11317},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.11317},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Zhao, Shixiong; Li, Fanxin; Chen, Xusheng; Shen, Tianxiang; Chen, Li; Wang, Sen; Zhang, Nicholas; Li, Cheng; Cui, Heming
NASPipe: High Performance and Reproducible Pipeline Parallel Supernet Training via Causal Synchronous Parallelism Proceedings Article
In: Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, pp. 374–387, Association for Computing Machinery, Lausanne, Switzerland, 2022, ISBN: 9781450392051.
@inproceedings{10.1145/3503222.3507735,
title = {NASPipe: High Performance and Reproducible Pipeline Parallel Supernet Training via Causal Synchronous Parallelism},
author = {Shixiong Zhao and Fanxin Li and Xusheng Chen and Tianxiang Shen and Li Chen and Sen Wang and Nicholas Zhang and Cheng Li and Heming Cui},
url = {https://doi.org/10.1145/3503222.3507735},
doi = {10.1145/3503222.3507735},
isbn = {9781450392051},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 27th ACM International Conference on Architectural Support for Programming Languages and Operating Systems},
pages = {374–387},
publisher = {Association for Computing Machinery},
address = {Lausanne, Switzerland},
series = {ASPLOS 2022},
abstract = {Supernet training, a prevalent and important paradigm in Neural Architecture Search, embeds the whole DNN architecture search space into one monolithic supernet, iteratively activates a subset of the supernet (i.e., a subnet) for fitting each batch of data, and searches a high-quality subnet which meets specific requirements. Although training subnets in parallel on multiple GPUs is desirable for acceleration, there inherently exists a race hazard that concurrent subnets may access the same DNN layers. Existing systems support neither efficiently parallelizing subnets’ training executions, nor resolving the race hazard deterministically, leading to unreproducible training procedures and potentially non-trivial accuracy loss. We present NASPipe, the first high-performance and reproducible distributed supernet training system via causal synchronous parallel (CSP) pipeline scheduling abstraction: NASPipe partitions a supernet across GPUs and concurrently executes multiple generated sub-tasks (subnets) in a pipelined manner; meanwhile, it oversees the correlations between the subnets and deterministically resolves any causal dependency caused by subnets’ layer sharing. To obtain high performance, NASPipe’s CSP scheduler exploits the fact that the larger a supernet spans, the fewer dependencies manifest between chronologically close subnets; therefore, it aggressively schedules the subnets with larger chronological orders into execution, only if they are not causally dependent on unfinished precedent subnets. Moreover, to relieve the excessive GPU memory burden for holding the whole supernet’s parameters, NASPipe uses a context switch technique that stashes the whole supernet in CPU memory, precisely predicts the subnets’ schedule, and pre-fetches/evicts a subnet before/after its execution. The evaluation shows that NASPipe is the only system that retains supernet training reproducibility, while achieving a comparable and even higher performance (up to 7.8X) compared to three recent pipeline training systems (e.g., GPipe).},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
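The causal dependency rule described in the NASPipe abstract above, where a chronologically later subnet may be scheduled early only if it shares no layers with an unfinished earlier subnet, can be sketched in a few lines of Python. All data structures below are invented for illustration; this is not NASPipe's scheduler:

def ready_subnets(subnets, finished):
    # subnets: chronological order -> set of supernet layer ids it uses.
    ready = []
    for order in sorted(subnets):
        if order in finished:
            continue
        # Blocked if any unfinished earlier subnet shares a layer.
        blocked = any(earlier not in finished
                      and subnets[earlier] & subnets[order]
                      for earlier in subnets if earlier < order)
        if not blocked:
            ready.append(order)
    return ready

subnets = {0: {"A", "B"}, 1: {"C"}, 2: {"B", "D"}}
print(ready_subnets(subnets, finished=set()))   # [0, 1]; subnet 2 waits on 0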
Zhang, Wentao; Shen, Yu; Lin, Zheyu; Li, Yang; Li, Xiaosen; Ouyang, Wen; Tao, Yangyu; Yang, Zhi; Cui, Bin
PaSca: a Graph Neural Architecture Search System under the Scalable Paradigm Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-00638,
title = {PaSca: a Graph Neural Architecture Search System under the Scalable Paradigm},
author = {Wentao Zhang and Yu Shen and Zheyu Lin and Yang Li and Xiaosen Li and Wen Ouyang and Yangyu Tao and Zhi Yang and Bin Cui},
url = {https://doi.org/10.48550/arXiv.2203.00638},
doi = {10.48550/arXiv.2203.00638},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.00638},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Zhang, Tianning; Ang, Yee Sin; Li, Erping; Kee, Chun Yun; Ang, L. K.
SUTD-PRCM Dataset and Neural Architecture Search Approach for Complex Metasurface Design Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-00002,
title = {SUTD-PRCM Dataset and Neural Architecture Search Approach for Complex Metasurface Design},
author = {Tianning Zhang and Yee Sin Ang and Erping Li and Chun Yun Kee and L. K. Ang},
url = {https://doi.org/10.48550/arXiv.2203.00002},
doi = {10.48550/arXiv.2203.00002},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.00002},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Huang, Yongdong; Li, Yuanzhan; Cao, Xulong; Zhang, Siyu; Cai, Shen; Lu, Ting; Liu, Yuqi
An Efficient End-to-End 3D Model Reconstruction based on Neural Architecture Search Technical Report
2022.
@techreport{HuangNAS,
title = {An Efficient End-to-End 3D Model Reconstruction based on Neural Architecture Search},
author = {Yongdong Huang and Yuanzhan Li and Xulong Cao and Siyu Zhang and Shen Cai and Ting Lu and Yuqi Liu},
url = {https://arxiv.org/abs/2202.13313},
doi = {10.48550/ARXIV.2202.13313},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
publisher = {arXiv},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Seong, Jaeho; Lee, Chaehyun; Han, Dong Seog
Neural Architecture Search for Real-Time Driver Behavior Recognition Proceedings Article
In: 2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), pp. 104-108, 2022.
@inproceedings{9722706,
title = {Neural Architecture Search for Real-Time Driver Behavior Recognition},
author = {Jaeho Seong and Chaehyun Lee and Dong Seog Han},
url = {https://ieeexplore.ieee.org/abstract/document/9722706},
doi = {10.1109/ICAIIC54071.2022.9722706},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC)},
pages = {104-108},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Cummings, Daniel; Sridhar, Sharath Nittur; Sarah, Anthony; Szankin, Maciej
Accelerating Neural Architecture Exploration Across Modalities Using Genetic Algorithms Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-12934,
title = {Accelerating Neural Architecture Exploration Across Modalities Using Genetic Algorithms},
author = {Daniel Cummings and Sharath Nittur Sridhar and Anthony Sarah and Maciej Szankin},
url = {https://arxiv.org/abs/2202.12934},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.12934},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Zhao, Yaqin; Feng, Liqi; Tang, Jiaxi; Zhao, Wenxuan; Ding, Zhipeng; Li, Ao; Zheng, Zhaoxiang
Automatically recognizing four-legged animal behaviors to enhance welfare using spatial temporal graph convolutional networks Journal Article
In: Applied Animal Behaviour Science, vol. 249, pp. 105594, 2022, ISSN: 0168-1591.
@article{ZHAO2022105594,
title = {Automatically recognizing four-legged animal behaviors to enhance welfare using spatial temporal graph convolutional networks},
author = {Yaqin Zhao and Liqi Feng and Jiaxi Tang and Wenxuan Zhao and Zhipeng Ding and Ao Li and Zhaoxiang Zheng},
url = {https://www.sciencedirect.com/science/article/pii/S0168159122000521},
doi = {10.1016/j.applanim.2022.105594},
issn = {0168-1591},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Applied Animal Behaviour Science},
volume = {249},
pages = {105594},
abstract = {Automatically recognizing animal behaviors in zoos and in national natural reserves can provide valuable insight into their welfare for facilitating scientific decision-making processes in animal management. Due to the difficulty of capturing massive amounts of animal video footage, a few existing methods have identified the behaviors of several different animal species in static images, but little is known about video-based animal behavior recognition. An animal's behavior is carried out in consecutive frames rather than in a single image; thus, image-based animal behavior recognition methods have low recognition accuracy. To address this dilemma, we not only construct the first skeleton-based dynamic multispecies dataset (Animal-Skeleton) but also propose a novel scheme that automatically designs the best spatial-temporal graph convolutional network (GCN) architecture with neural architecture search (NAS) to perform animal behavior recognition, named Animal-Nas for short. This is the first time that GCNs with NAS have been introduced into the animal behavior recognition task. To alleviate the trial-and-error cost of manually designing the network structure, we turn to NAS and design a novel search space with graph-based cells. Furthermore, we adopt a differentiable architecture search strategy to automatically search the cost-efficient spatial-temporal graph convolutional network structure. To evaluate the performance of the proposed model, we conduct extensive experiments on the Animal-Skeleton dataset from three perspectives: model accuracy, parameter amount and stability. The results show that our model can achieve state-of-the-art performance with fewer parameters.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
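The "differentiable architecture search strategy" mentioned in the abstract above is, in DARTS-style methods, a softmax-weighted mixture of candidate operations whose weights are trained jointly with the network by gradient descent. A generic PyTorch sketch of that mixing step; this is not the paper's code, and the candidate operations are placeholders:

import torch
import torch.nn as nn

class MixedOp(nn.Module):
    def __init__(self, ops):
        super().__init__()
        self.ops = nn.ModuleList(ops)
        # One learnable architecture weight per candidate operation.
        self.alpha = nn.Parameter(torch.zeros(len(ops)))

    def forward(self, x):
        weights = torch.softmax(self.alpha, dim=0)
        return sum(w * op(x) for w, op in zip(weights, self.ops))

mixed = MixedOp([nn.Conv1d(4, 4, 3, padding=1),
                 nn.Conv1d(4, 4, 5, padding=2),
                 nn.Identity()])
x = torch.randn(2, 4, 10)
print(mixed(x).shape)   # alphas train jointly with the op weights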
Sarah, Anthony; Cummings, Daniel; Sridhar, Sharath Nittur; Sundaresan, Sairam; Szankin, Maciej; Webb, Tristan; Munoz, J. Pablo
A Hardware-Aware System for Accelerating Deep Neural Network Optimization Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2202-12954,
title = {A Hardware-Aware System for Accelerating Deep Neural Network Optimization},
author = {Anthony Sarah and Daniel Cummings and Sharath Nittur Sridhar and Sairam Sundaresan and Maciej Szankin and Tristan Webb and J. Pablo Munoz},
url = {https://arxiv.org/abs/2202.12954},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2202.12954},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Sinha, Nilotpal; Chen, Kuan-Wen
Neural Architecture Search using Progressive Evolution Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-01559,
title = {Neural Architecture Search using Progressive Evolution},
author = {Nilotpal Sinha and Kuan-Wen Chen},
url = {https://doi.org/10.48550/arXiv.2203.01559},
doi = {10.48550/arXiv.2203.01559},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.01559},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Ye, Peng; Li, Baopu; Li, Yikang; Chen, Tao; Fan, Jiayuan; Ouyang, Wanli
β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search Proceedings Article
In: CVPR 2022, 2022.
@inproceedings{DBLP:journals/corr/abs-2203-01665,
title = {(beta)-DARTS: Beta-Decay Regularization for Differentiable Architecture Search},
author = {Peng Ye and Baopu Li and Yikang Li and Tao Chen and Jiayuan Fan and Wanli Ouyang},
url = {https://openaccess.thecvf.com/content/CVPR2022/papers/Ye_b-DARTS_Beta-Decay_Regularization_for_Differentiable_Architecture_Search_CVPR_2022_paper.pdf},
doi = {10.48550/arXiv.2203.01665},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {CVPR2022},
journal = {CoRR},
volume = {abs/2203.01665},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Szwarcman, Daniela; Civitarese, Daniel; Vellasco, Marley
Quantum-inspired evolutionary algorithm applied to neural architecture search Journal Article
In: Applied Soft Computing, vol. 120, pp. 108674, 2022, ISSN: 1568-4946.
@article{SZWARCMAN2022108674,
title = {Quantum-inspired evolutionary algorithm applied to neural architecture search},
author = {Daniela Szwarcman and Daniel Civitarese and Marley Vellasco},
url = {https://www.sciencedirect.com/science/article/pii/S1568494622001478},
doi = {10.1016/j.asoc.2022.108674},
issn = {1568-4946},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Applied Soft Computing},
volume = {120},
pages = {108674},
abstract = {The success of machine learning models over the last few years is mostly related to the significant progress of deep neural networks. These powerful and flexible models can even surpass human-level performance in tasks such as image recognition and strategy games. However, experts need to spend considerable time and resources to design the network structure. The demand for new architectures drives interest in automating this design process. Researchers have proposed new algorithms to address the neural architecture search (NAS) problem, including efforts to reduce the high computational cost of such methods. A common approach to improve efficiency is to reduce the search space with the help of expert knowledge, searching for cells rather than entire networks. Motivated by the faster convergence promoted by quantum-inspired evolutionary methods, the Q-NAS algorithm was proposed to address the NAS problem without relying on cell search. In this work, we consolidate Q-NAS, adding a new penalization feature, enhancing its retraining scheme, and also investigating more challenging search spaces than before. On CIFAR-10, we reached 93.85% test accuracy in 67 GPU days, considering the addition of an early-stopping mechanism. We also applied Q-NAS to CIFAR-100, without modifying the parameters, and our best accuracy was 74.23%, which is comparable to ResNet164. The enhancements and results presented in this work show that Q-NAS can automatically generate network architectures that outperform hand-designed models for CIFAR-10 and CIFAR-100. Also, compared to other NAS methods, Q-NAS results are promising regarding the balance between performance, runtime efficiency, and automation. We believe that our results enrich the discussion on this balance, considering alternatives to the cell search approach.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
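Quantum-inspired evolutionary algorithms such as the one underlying Q-NAS keep a probability distribution per gene, "observe" (sample) it to obtain concrete candidates, and shift probability mass toward the best observation. A generic Python sketch of that loop, with an invented operation set and a stand-in fitness function; this is not the Q-NAS implementation:

import random

OPS = ["conv3x3", "conv5x5", "maxpool", "identity"]

def observe(dists):
    # Collapse each gene's distribution into a concrete choice.
    return [random.choices(OPS, weights=d)[0] for d in dists]

def update(dists, best, lr=0.1):
    # Nudge each gene's distribution toward the best observation.
    for dist, op in zip(dists, best):
        dist[OPS.index(op)] += lr
        total = sum(dist)
        dist[:] = [w / total for w in dist]

dists = [[1.0] * len(OPS) for _ in range(4)]       # uniform start
fitness = lambda arch: arch.count("conv3x3")       # stand-in evaluator
for _ in range(20):
    population = [observe(dists) for _ in range(8)]
    update(dists, max(population, key=fitness))
print(observe(dists))                              # now biased toward conv3x3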
Huynh, Lam; Rahtu, Esa; Matas, Jiri; Heikkilä, Janne
Fast Neural Architecture Search for Lightweight Dense Prediction Networks Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-01994,
title = {Fast Neural Architecture Search for Lightweight Dense Prediction Networks},
author = {Lam Huynh and Esa Rahtu and Jiri Matas and Janne Heikkilä},
url = {https://doi.org/10.48550/arXiv.2203.01994},
doi = {10.48550/arXiv.2203.01994},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.01994},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Lin, Ke; A, Yong; Gan, Zhuoxin; Jiang, Yingying
WPNAS: Neural Architecture Search by jointly using Weight Sharing and Predictor Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2203-02086,
title = {WPNAS: Neural Architecture Search by jointly using Weight Sharing and Predictor},
author = {Ke Lin and Yong A and Zhuoxin Gan and Yingying Jiang},
url = {https://doi.org/10.48550/arXiv.2203.02086},
doi = {10.48550/arXiv.2203.02086},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2203.02086},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Chen, Xuehui; Niu, Xin; Jiang, Jingfei; Pan, Hengyue; Dong, Peijie; Wei, Zimian
Influence of Initialization and Modularization on the Performance of Network Morphism-Based Neural Architecture Search Proceedings Article
In: Yao, Jian; Xiao, Yang; You, Peng; Sun, Guang (Ed.): The International Conference on Image, Vision and Intelligent Systems (ICIVIS 2021), pp. 875–887, Springer Singapore, Singapore, 2022, ISBN: 978-981-16-6963-7.
@inproceedings{10.1007/978-981-16-6963-7_77,
title = {Influence of Initialization and Modularization on the Performance of Network Morphism-Based Neural Architecture Search},
author = {Xuehui Chen and Xin Niu and Jingfei Jiang and Hengyue Pan and Peijie Dong and Zimian Wei},
editor = {Jian Yao and Yang Xiao and Peng You and Guang Sun},
url = {https://link.springer.com/chapter/10.1007/978-981-16-6963-7_77},
isbn = {978-981-16-6963-7},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {The International Conference on Image, Vision and Intelligent Systems (ICIVIS 2021)},
pages = {875--887},
publisher = {Springer Singapore},
address = {Singapore},
abstract = {Neural Architecture Search (NAS), the process of automatic network architecture design, has enabled remarkable progress on computer vision tasks over the last few years. In this paper, we propose a novel and efficient NAS framework based on network morphism to further improve the performance of NAS algorithms. Firstly, we design four modular structures termed RBNC block, CBNR block, BNRC block and RCBN block, which correspond to four initial neural network architectures and four modular network morphism methods. Each block is composed of a ReLU layer, a Batch-Norm layer and a convolutional layer. Then we introduce network morphism to correlate different modular structures for constructing network architectures. Moreover, we study the influence of different initial neural network architectures and modular network morphism methods on the performance of network morphism-based NAS algorithms through comparative experiments and ablation experiments. Finally, we find that the network morphism-based NAS algorithm that uses the CBNR block for initialization and modularization is the best method to improve performance. Our proposed method achieves a test accuracy of 95.84% on CIFAR-10 with the fewest parameters (only 2.72 M) and a lower search cost (2 GPU-days) for network architecture search.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
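The four block types named in the abstract above are permutations of the same three layers (R = ReLU, BN = Batch-Norm, C = Conv). A minimal PyTorch sketch of those orderings, with made-up channel sizes; the paper's network-morphism machinery is not reproduced here:

import torch
import torch.nn as nn

def make_block(order, in_ch=16, out_ch=16):
    # BatchNorm placed before the conv normalizes in_ch channels;
    # placed after the conv it normalizes out_ch channels.
    conv_pos = order.index("C")
    layers = []
    for pos, tag in enumerate(order):
        if tag == "R":
            layers.append(nn.ReLU())
        elif tag == "BN":
            layers.append(nn.BatchNorm2d(in_ch if pos < conv_pos else out_ch))
        else:  # "C"
            layers.append(nn.Conv2d(in_ch, out_ch, 3, padding=1))
    return nn.Sequential(*layers)

blocks = {"RBNC": ("R", "BN", "C"), "CBNR": ("C", "BN", "R"),
          "BNRC": ("BN", "R", "C"), "RCBN": ("R", "C", "BN")}
x = torch.randn(1, 16, 8, 8)
for name, order in blocks.items():
    print(name, make_block(order)(x).shape)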