AutoML | Literature on Neural Architecture Search

@article{s21020444,

title = {Efficient Resource-Aware Convolutional Neural Architecture Search for Edge Computing with Pareto-Bayesian Optimization},

author = {Zhao Yang and Shengbing Zhang and Ruxu Li and Chuxi Li and Miao Wang and Danghui Wang and Meng Zhang},

url = {https://www.mdpi.com/1424-8220/21/2/444},

doi = {10.3390/s21020444},

issn = {1424-8220},

year  = {2021},

date = {2021-01-01},

journal = {Sensors},

volume = {21},

number = {2},

abstract = {With the development of deep learning technologies and edge computing, the combination of them can make artificial intelligence ubiquitous. Due to the constrained computation resources of the edge device, the research in the field of on-device deep learning not only focuses on the model accuracy but also on the model efficiency, for example, inference latency. There are many attempts to optimize the existing deep learning models for the purpose of deploying them on the edge devices that meet specific application requirements while maintaining high accuracy. Such work not only requires professional knowledge but also needs a lot of experiments, which limits the customization of neural networks for varied devices and application scenarios. In order to reduce the human intervention in designing and optimizing the neural network structure, multi-objective neural architecture search methods that can automatically search for neural networks featured with high accuracy and can satisfy certain hardware performance requirements are proposed. However, the current methods commonly set accuracy and inference latency as the performance indicator during the search process, and sample numerous network structures to obtain the required neural network. Lacking regulation to the search direction with the search objectives will generate a large number of useless networks during the search process, which influences the search efficiency to a great extent. Therefore, in this paper, an efficient resource-aware search method is proposed. Firstly, the network inference consumption profiling model for any specific device is established, and it can help us directly obtain the resource consumption of each operation in the network structure and the inference latency of the entire sampled network. Next, on the basis of the Bayesian search, a resource-aware Pareto Bayesian search is proposed. Accuracy and inference latency are set as the constraints to regulate the search direction. 
With a clearer search direction, the overall search efficiency will be improved. Furthermore, cell-based structure and lightweight operation are applied to optimize the search space for further enhancing the search efficiency. The experimental results demonstrate that with our method, the inference latency of the searched network structure is reduced by 94.71% without sacrificing accuracy. At the same time, the search efficiency increased by 18.18%.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}
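
As a rough illustration of the Pareto-based selection over accuracy and inference latency mentioned in the abstract above, the following Python sketch filters sampled candidates down to the non-dominated ones. It is not the authors' method: the candidate values and the names `dominates` and `pareto_front` are hypothetical, and the Bayesian surrogate and the device latency profiling model are left out.

```python
from typing import List, Tuple

# Each candidate is (accuracy, latency_ms). All values are made up for illustration.
Candidate = Tuple[float, float]


def dominates(a: Candidate, b: Candidate) -> bool:
    """Return True if a is no worse than b on both objectives and better on one.

    Accuracy is maximized, latency is minimized.
    """
    no_worse = a[0] >= b[0] and a[1] <= b[1]
    strictly_better = a[0] > b[0] or a[1] < b[1]
    return no_worse and strictly_better


def pareto_front(candidates: List[Candidate]) -> List[Candidate]:
    """Keep only candidates that no other candidate dominates (input order kept)."""
    return [c for c in candidates
            if not any(dominates(other, c) for other in candidates)]


# Hypothetical sampled architectures: (accuracy, latency in ms).
candidates = [(0.92, 40.0), (0.90, 25.0), (0.88, 30.0), (0.95, 80.0), (0.90, 50.0)]
front = pareto_front(candidates)
print(front)
```

A search loop could use such a filter to discard dominated samples early, so that later sampling concentrates on the accuracy/latency trade-off curve.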

1570.

Weng, Yu; Chen, Zehua; Zhou, Tianbao

Improved differentiable neural architecture search for single image super-resolution Journal Article

In: Peer-to-Peer Networking and Applications, 2021.

1569.

Liu, Jia; Jin, Yaochu

Multi-objective Search of Robust Neural Architectures against Multiple Types of Adversarial Attacks Technical Report

2021.

1568.

Wu, Yan; Huang, Zhiwu; Kumar, Suryansh; Sukthanker, Rhea Sanjay; Timofte, Radu; Gool, Luc Van

Trilevel Neural Architecture Search for Efficient Single Image Super-Resolution Technical Report

2021.

1567.

Alparslan, Yigit; Moyer, Ethan Jacob; Isozaki, Isamu Mclean; Schwartz, Daniel; Dunlop, Adam; Dave, Shesh; Kim, Edward

Towards Searching Efficient and Accurate Neural Network Architectures in Binary Classification Problems Technical Report

2021.

1566.

Lee, Sanghyeop; Kim, Junyeob; Kang, Hyeon; Kang, Do-Young; Park, Jangsik

Genetic Algorithm Based Deep Learning Neural Network Structure and Hyperparameter Optimization Journal Article

In: Applied Sciences, vol. 11, no. 2, 2021, ISSN: 2076-3417.

1565.

Jiang, Hanliang; Shen, Fuhao; Gao, Fei; Han, Weidong

Learning efficient, explainable and discriminative representations for pulmonary nodules classification Journal Article

In: Pattern Recognition, vol. 113, pp. 107825, 2021, ISSN: 0031-3203.

1564.

Alparslan, Yigit; Moyer, Ethan Jacob; Kim, Edward

Evaluating Online and Offline Accuracy Traversal Algorithms for k-Complete Neural Network Architectures Journal Article

In: CoRR, vol. abs/2101.06518, 2021.

1563.

Vaccaro, Lorenzo; Sansonetti, Giuseppe; Micarelli, Alessandro

An Empirical Review of Automated Machine Learning Journal Article

In: Computers, vol. 10, no. 1, 2021, ISSN: 2073-431X.

1562.

Li, Qing; Wu, Xia; Liu, Tianming

Differentiable Neural Architecture Search for Optimal Spatial/Temporal Brain Function Network Decomposition Journal Article

In: Medical Image Analysis, pp. 101974, 2021, ISSN: 1361-8415.

@article{LI2021101974,

title = {Differentiable Neural Architecture Search for Optimal Spatial/Temporal Brain Function Network Decomposition},

author = {Qing Li and Xia Wu and Tianming Liu},

url = {https://www.sciencedirect.com/science/article/pii/S1361841521000207},

doi = {10.1016/j.media.2021.101974},

issn = {1361-8415},

year  = {2021},

date = {2021-01-01},

journal = {Medical Image Analysis},

pages = {101974},

abstract = {It has been a key topic to decompose the brain's spatial/temporal function networks from 4D functional magnetic resonance imaging (fMRI) data. With the advantages of robust and meaningful brain pattern extraction, deep neural networks have been shown to be more powerful and flexible in fMRI data modeling than other traditional methods. However, the challenge of designing neural network architecture for high-dimensional and complex fMRI data has also been realized recently. In this paper, we propose a new spatial/temporal differentiable neural architecture search algorithm (ST-DARTS) for optimal brain network decomposition. The core idea of ST-DARTS is to optimize the inner cell structure of the vanilla recurrent neural network (RNN) in order to effectively decompose spatial/temporal brain function networks from fMRI data. Based on the evaluations on all seven fMRI tasks in the Human Connectome Project (HCP) dataset, the ST-DARTS model is shown to perform promisingly, both spatially (i.e., it can recognize the most stimuli-correlated spatial brain network activation that is very similar to the benchmark) and temporally (i.e., its temporal activity is highly positively correlated with the task-design). To further improve the efficiency of the ST-DARTS model, we introduce a flexible early-stopping mechanism, named ST-DARTS+, which further improves experimental results significantly. To the best of our knowledge, the proposed ST-DARTS and ST-DARTS+ models are among the early efforts in optimally decomposing spatial/temporal brain function networks from fMRI data with neural architecture search strategy and they demonstrate great promise.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

1561.

Song, Xingyou; Choromanski, Krzysztof; Parker-Holder, Jack; Tang, Yunhao; Peng, Daiyi; Jain, Deepali; Gao, Wenbo; Pacchiano, Aldo; Sarlós, Tamás; Yang, Yuxiang

ES-ENAS: Combining Evolution Strategies with Neural Architecture Search at No Extra Cost for Reinforcement Learning Technical Report

2021.

1560.

Wu, Haiwei; Zhou, Jiantao

GIID-Net: Generalizable Image Inpainting Detection via Neural Architecture Search and Attention Technical Report

2021.

1559.

Fu, Chaoyou; Hu, Yibo; Wu, Xiang; Shi, Hailin; Mei, Tao; He, Ran

CM-NAS: Rethinking Cross-Modality Neural Architectures for Visible-Infrared Person Re-Identification Technical Report

2021.

1558.

Abdelfattah, Mohamed S; Mehrotra, Abhinav; Dudziak, Lukasz; Lane, Nicholas D

Zero-Cost Proxies for Lightweight NAS Technical Report

2021.

1557.

Benmeziane, Hadjer; Maghraoui, Kaoutar El; Ouarnoughi, Hamza; Niar, Smaïl; Wistuba, Martin; Wang, Naigang

A Comprehensive Survey on Hardware-Aware Neural Architecture Search Technical Report

2021.

1556.

He, Xin; Wang, Shihao; Ying, Guohao; Zhang, Jiyong; Chu, Xiaowen

Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans Technical Report

2021.

1555.

Zhao, Jiakun; Zhang, Ruifeng; Zhou, Zheng; Chen, Si; Jin, Ju; Liu, Qingfang

A Neural Architecture Search Method Based on Gradient Descent for Remaining Useful Life Estimation Journal Article

In: Neurocomputing, 2021, ISSN: 0925-2312.

1554.

Kertész, Gábor; Szénási, Sándor; Vámossy, Zoltán

Comparative analysis of image projection-based descriptors in Siamese neural networks Journal Article

In: Advances in Engineering Software, vol. 154, pp. 102963, 2021, ISSN: 0965-9978.

1553.

Zhang, Xuanyang; Hou, Pengfei; Zhang, Xiangyu; Sun, Jian

Neural Architecture Search with Random Labels Technical Report

2021.

1552.

Liang, Xinle; Liu, Yang; Luo, Jiahuan; He, Yuanqin; Chen, Tianjian; Yang, Qiang

Self-supervised Cross-silo Federated Neural Architecture Search Technical Report

2021.

1551.

Yang, Yibo; You, Shan; Li, Hongyang; Wang, Fei; Qian, Chen; Lin, Zhouchen

Towards Improving the Consistency, Efficiency, and Flexibility of Differentiable Neural Architecture Search Technical Report

2021.

1550.

Lu, Longfei; Lyu, Bo

Reducing energy consumption of Neural Architecture Search: An inference latency prediction framework Journal Article

In: Sustainable Cities and Society, vol. 67, pp. 102747, 2021, ISSN: 2210-6707.

@article{LU2021102747,

title = {Reducing energy consumption of Neural Architecture Search: An inference latency prediction framework},

author = {Longfei Lu and Bo Lyu},

url = {https://www.sciencedirect.com/science/article/pii/S221067072100041X},

doi = {10.1016/j.scs.2021.102747},

issn = {2210-6707},

year  = {2021},

date = {2021-01-01},

journal = {Sustainable Cities and Society},

volume = {67},

pages = {102747},

abstract = {Benefiting from the success of NAS (Neural Architecture Search) in deep learning, humans can hopefully be released from the tremendous labor of manually tuning structures and hyper-parameters. However, the success of NAS comes at the cost of much more computational resource consumption, thousands of times more computational power than ordinary training of manually designed models, especially for resource-aware multi-objective NAS, which must be serialized as a sequential loop of sampling, training, deployment, and inference. Recent research has shown that deep learning leads to huge energy consumption and CO2 emissions (training a Transformer can emit as much CO2 as five cars in their lifetimes, Strubell et al. (2019)). Aiming to alleviate this issue, we propose an end-to-end inference latency prediction framework to empower the NAS process with a direct resource-aware efficiency indicator. Namely, we first propose the end-to-end latency prediction framework, which can predict latency quickly and accurately based on a dataset we collected ourselves. We then show experimentally that, with the encoding scheme we designed, our best model, the LSTM-GBDT Latency Predictor (LGLP), achieves an excellent result of 0.9349 MSE, 0.5249 MAE, 0.9842 R2, and 0.9925 corrcoef. In other words, our limited dataset and encoding scheme already provide a precise knowledge representation of this large search space. By equipping NAS with the proposed framework, taking NEMO for example, it will save 1588 kWh⋅PUE energy, 1515 pounds CO2 emissions, and \$3176 cloud compute cost of AWS. Since NAS is now widely exploited in research or industry applications, this will bring incalculable benefits to society and the environment.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

1549.

Pinos, Michal; Mrazek, Vojtech; Sekanina, Lukáš

Evolutionary Neural Architecture Search Supporting Approximate Multipliers Technical Report

2021.

1548.

Lyu, B; Yuan, H; Lu, L; Zhang, Y

Resource-constrained Neural Architecture Search on Edge Devices Journal Article

In: IEEE Transactions on Network Science and Engineering, pp. 1-1, 2021.

1547.

Gomez-Rosero, Santiago; Capretz, Miriam A M; Mir, Syed

Transfer Learning by Similarity Centred Architecture Evolution for Multiple Residential Load Forecasting Journal Article

In: Smart Cities, vol. 4, no. 1, pp. 217–240, 2021, ISSN: 2624-6511.

@article{smartcities4010014,

title = {Transfer Learning by Similarity Centred Architecture Evolution for Multiple Residential Load Forecasting},

author = {Santiago Gomez-Rosero and Miriam A M Capretz and Syed Mir},

url = {https://www.mdpi.com/2624-6511/4/1/14},

doi = {10.3390/smartcities4010014},

issn = {2624-6511},

year  = {2021},

date = {2021-01-01},

journal = {Smart Cities},

volume = {4},

number = {1},

pages = {217--240},

abstract = {The development from traditional low voltage grids to smart systems has become extensive and adopted worldwide. Expanding the demand response program to cover the residential sector raises a wide range of challenges. Short term load forecasting for residential consumers in a neighbourhood could lead to a better understanding of low voltage consumption behaviour. Nevertheless, users with similar characteristics can present diversity in consumption patterns. Consequently, transfer learning methods have become a useful tool to tackle differences among residential time series. This paper proposes a method combining evolutionary algorithms for neural architecture search with transfer learning to perform short term load forecasting in a neighbourhood with multiple household load consumption. The approach centres its efforts on neural architecture search using evolutionary algorithms. The neural architecture evolution process retains the patterns of the centre-most house, and later the architecture weights are adjusted for each house in a multihouse set from a neighbourhood. In addition, a sensitivity analysis was conducted to ensure model performance. Experimental results on a large dataset containing hourly load consumption for ten houses in London, Ontario showed that the proposed approach performs better than the compared techniques. Moreover, the proposed method achieves an average accuracy 3.17 points higher than the state-of-the-art LSTM one shot method.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}

1546.

Lin, Ming; Wang, Pichao; Sun, Zhenhong; Chen, Hesen; Sun, Xiuyu; Qian, Qi; Li, Hao; Jin, Rong

Zen-NAS: A Zero-Shot NAS for High-Performance Deep Image Recognition Technical Report

2021.

1545.

Odema, Mohanad; Rashid, Nafiul; Faruque, Mohammad Abdullah Al

Energy-Aware Design Methodology for Myocardial Infarction Detection on Low-Power Wearable Devices Proceedings Article

In: Proceedings of the 26th Asia and South Pacific Design Automation Conference, pp. 621–626, Association for Computing Machinery, Tokyo, Japan, 2021, ISBN: 9781450379991.

@inproceedings{10.1145/3394885.3431513,

title = {Energy-Aware Design Methodology for Myocardial Infarction Detection on Low-Power Wearable Devices},

author = {Mohanad Odema and Nafiul Rashid and Mohammad Abdullah Al Faruque},

url = {https://doi.org/10.1145/3394885.3431513},

doi = {10.1145/3394885.3431513},

isbn = {9781450379991},

year  = {2021},

date = {2021-01-01},

booktitle = {Proceedings of the 26th Asia and South Pacific Design Automation Conference},

pages = {621--626},

publisher = {Association for Computing Machinery},

address = {Tokyo, Japan},

series = {ASPDAC '21},

abstract = {Myocardial Infarction (MI) is a heart disease that damages the heart muscle and requires immediate treatment. Its silent and recurrent nature necessitates real-time continuous monitoring of patients. Nowadays, wearable devices are smart enough to perform on-device processing of heartbeat segments and report any irregularities in them. However, the small form factor of wearable devices imposes resource constraints and requires energy-efficient solutions to satisfy them. In this paper, we propose a design methodology to automate the design space exploration of neural network architectures for MI detection. This methodology incorporates Neural Architecture Search (NAS) using Multi-Objective Bayesian Optimization (MOBO) to render Pareto optimal architectural models. These models minimize both detection error and energy consumption on the target device. The design space is inspired by Binary Convolutional Neural Networks (BCNNs) suited for mobile health applications with limited resources. The models' performance is validated using the PTB diagnostic ECG database from PhysioNet. Moreover, energy-related measurements are directly obtained from the target device in a typical hardware-in-the-loop fashion. Finally, we benchmark our models against other related works. One model exceeds state-of-the-art accuracy on wearable devices (reaching 91.22%), whereas others trade off some accuracy to reduce their energy consumption (by a factor reaching 8.26x).},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

1544.

Liang, Shuang; Tang, Changcheng; Ning, Xuefei; Zeng, Shulin; Yu, Jincheng; Wang, Yu; Guo, Kaiyuan; Yang, Diange; Lu, Tianyi; Yang, Huazhong

Efficient Computing Platform Design for Autonomous Driving Systems Proceedings Article

In: Proceedings of the 26th Asia and South Pacific Design Automation Conference, pp. 734–741, Association for Computing Machinery, Tokyo, Japan, 2021, ISBN: 9781450379991.

1543.

Jie, R; Gao, J

Differentiable Neural Architecture Search for High-Dimensional Time Series Forecasting Journal Article

In: IEEE Access, vol. 9, pp. 20922–20932, 2021.

1542.

Lang, Sebastian; Reggelin, Tobias; Schmidt, Johann; Müller, Marcel; Nahhas, Abdulrahman

NeuroEvolution of augmenting topologies for solving a two-stage hybrid flow shop scheduling problem: A comparison of different solution strategies Journal Article

In: Expert Systems with Applications, vol. 172, pp. 114666, 2021, ISSN: 0957-4174.

1541.

Luo, Renqian; Tan, Xu; Wang, Rui; Qin, Tao; Li, Jinzhu; Zhao, Sheng; Chen, Enhong; Liu, Tie-Yan

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search Technical Report

2021.

1540.

Li, Xiaohan; Xie, Ziyan; Lai, Taotao; Zhao, Fusheng; Xu, Haiyin; Chen, Riqing

NAS-WFPN: Neural Architecture Search Weighted Feature Pyramid Networks for Object Detection Proceedings Article

In: Wang, Guojun; Chen, Bing; Li, Wei; Pietro, Roberto Di; Yan, Xuefeng; Han, Hao (Ed.): Security, Privacy, and Anonymity in Computation, Communication, and Storage, pp. 384–394, Springer International Publishing, Cham, 2021, ISBN: 978-3-030-68884-4.

1539.

Fu, Xianya; Li, Wenrui; Chen, Qiurui; Zhang, Lianyi; Yang, Kai; Qing, Duzheng; Wang, Rui

NASIL: Neural Network Architecture Searching for Incremental Learning in Image Classification Proceedings Article

In: Ning, Li; Chau, Vincent; Lau, Francis (Ed.): Parallel Architectures, Algorithms and Programming, pp. 68–80, Springer Singapore, Singapore, 2021, ISBN: 978-981-16-0010-4.

@inproceedings{10.1007/978-981-16-0010-4_7,

title = {NASIL: Neural Network Architecture Searching for Incremental Learning in Image Classification},

author = {Xianya Fu and Wenrui Li and Qiurui Chen and Lianyi Zhang and Kai Yang and Duzheng Qing and Rui Wang},

editor = {Li Ning and Vincent Chau and Francis Lau},

url = {https://link.springer.com/chapter/10.1007/978-981-16-0010-4_7},

isbn = {978-981-16-0010-4},

year  = {2021},

date = {2021-01-01},

booktitle = {Parallel Architectures, Algorithms and Programming},

pages = {68--80},

publisher = {Springer Singapore},

address = {Singapore},

abstract = {``Catastrophic forgetting'' and scalability of tasks are two major challenges of incremental learning. Both issues are related to the insufficient capacity of the machine learning model and to insufficiently trained weights as the number of tasks increases. In this paper, we try to figure out the impact of the neural network architecture on the performance of incremental learning in the case of image classification. As tasks are added, we propose to use neural network architecture searching (NAS) to find a structure that fits the new tasks collection better. We build a NAS environment with reinforcement learning as the searching strategy and a Long Short-Term Memory network as the controller network. Computation operation and connecting previous nodes are selected for each layer in the search phase. Each time a new group of tasks is added, the neural network architecture is searched and reorganized according to the training data set. To speed up the searching, we design a parameter sharing mechanism, in which the same building blocks in each layer share a group of parameters. We also introduce the quantified-parameter building blocks into the NAS, to identify the best candidate during each round of searching. We test our solution on the CIFAR-100 data set: the average accuracy outperforms the current representative solutions (LwEMC, iCaRL, GANIL) by 24.92%, 5.62%, and 3.6%, respectively, and the more tasks are added, the better our solution performs.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

1538.

Lou, Xiaoxuan; Guo, Shangwei; Zhang, Tianwei; Zhang, Yinqian; Liu, Yang

When NAS Meets Watermarking: Ownership Verification of DNN Models via Cache Side Channels Technical Report

2021.

1537.

Wang, W; Zhu, L

Reliable Network Search Based on Evolutionary Algorithm Proceedings Article

In: 2021 International Conference on Computer, Control and Robotics (ICCCR), pp. 279-282, 2021.

1536.

Li, Sheng; Tan, Mingxing; Pang, Ruoming; Li, Andrew; Cheng, Liqun; Le, Quoc; Jouppi, Norman P

Searching for Fast Model Families on Datacenter Accelerators Technical Report

2021.

1535.

He, Xin; Zhao, Kaiyong; Chu, Xiaowen

AutoML: A Survey of the State-of-the-Art Journal Article

In: Knowledge-Based Systems, vol. 212, pp. 106622, 2021.

1534.

Liu, Peidong; Zhang, Gengwei; Wang, Bochao; Xu, Hang; Liang, Xiaodan; Jiang, Yong; Li, Zhenguo

Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search Technical Report

2021.

1533.

Su, Xiu; You, Shan; Huang, Tao; Wang, Fei; Qian, Chen; Zhang, Changshui; Xu, Chang

Locally Free Weight Sharing for Network Width Search Technical Report

2021.

1532.

Sun, Y; Sun, X; Fang, Y; Yen, G G; Liu, Y

A Novel Training Protocol for Performance Predictors of Evolutionary Neural Architecture Search Algorithms Journal Article

In: IEEE Transactions on Evolutionary Computation, pp. 1-1, 2021.

@article{9336721,

title = {A Novel Training Protocol for Performance Predictors of Evolutionary Neural Architecture Search Algorithms},

author = {Y Sun and X Sun and Y Fang and G G Yen and Y Liu},

url = {https://ieeexplore.ieee.org/document/9336721},

doi = {10.1109/TEVC.2021.3055076},

year  = {2021},

date = {2021-01-01},

journal = {IEEE Transactions on Evolutionary Computation},

pages = {1-1},

abstract = {Evolutionary Neural Architecture Search (ENAS) can automatically design the architectures of Deep Neural Networks (DNNs) using evolutionary computation algorithms. However, most ENAS algorithms require intensive computational resources, which are not necessarily available to interested users. Performance predictors are a type of regression model that can assist in accomplishing the search without exerting much computational resource. Although various performance predictors have been designed, they employ the same training protocol to build the regression models: 1) sampling a set of DNNs with performance as the training dataset, 2) training the model with the mean square error criterion, and 3) predicting the performance of DNNs newly generated during the ENAS. In this paper, we point out, through intuitive and illustrative examples, that the three steps constituting the training protocol are not well thought out. Furthermore, we propose a new training protocol to address these issues, consisting of designing a pairwise ranking indicator to construct the training target, proposing to use the logistic regression to fit the training samples, and developing a differential method to build the training instances. To verify the effectiveness of the proposed training protocol, four widely used regression models in the field of machine learning have been chosen to perform the comparisons on two benchmark datasets. The experimental results of all the comparisons demonstrate that the proposed training protocol can significantly improve the performance prediction accuracy against the traditional training protocols.},

keywords = {},

pubstate = {published},

tppubtype = {article}

}
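
The pairwise training idea outlined in the abstract above (a ranking target, logistic regression, and training instances built from differences) might look roughly like the Python sketch below. It is only a toy reading of the abstract under stated assumptions, not the authors' protocol: the three-dimensional architecture encodings, the linear ground-truth score `true_w`, and the helper names `make_pairs`, `fit_logistic`, and `ranks_higher` are all hypothetical.

```python
import math
import random
from itertools import permutations


def make_pairs(features, scores):
    """Build training instances from ordered pairs: (x_i - x_j, 1 if i beats j else 0)."""
    pairs = []
    for i, j in permutations(range(len(scores)), 2):
        if scores[i] != scores[j]:
            diff = [a - b for a, b in zip(features[i], features[j])]
            pairs.append((diff, 1.0 if scores[i] > scores[j] else 0.0))
    return pairs


def fit_logistic(pairs, lr=0.5, steps=300):
    """Fit logistic regression (no bias term) with plain batch gradient descent."""
    dim = len(pairs[0][0])
    w = [0.0] * dim
    for _ in range(steps):
        grad = [0.0] * dim
        for diff, label in pairs:
            z = sum(wk * dk for wk, dk in zip(w, diff))
            p = 1.0 / (1.0 + math.exp(-z))
            for k in range(dim):
                grad[k] += (p - label) * diff[k]
        w = [wk - lr * gk / len(pairs) for wk, gk in zip(w, grad)]
    return w


def ranks_higher(w, a, b):
    """Predict whether architecture a should outperform architecture b."""
    return sum(wk * (ak - bk) for wk, ak, bk in zip(w, a, b)) > 0


# Hypothetical data: 12 random encodings scored by an assumed linear rule.
random.seed(0)
features = [[random.random() for _ in range(3)] for _ in range(12)]
true_w = [2.0, -1.0, 0.5]
scores = [sum(t * x for t, x in zip(true_w, f)) for f in features]

pairs = make_pairs(features, scores)
w = fit_logistic(pairs)
correct = sum(
    ranks_higher(w, features[i], features[j]) == (scores[i] > scores[j])
    for i, j in permutations(range(len(scores)), 2)
)
pair_accuracy = correct / (len(scores) * (len(scores) - 1))
print(round(pair_accuracy, 3))
```

The predictor here only has to decide which of two architectures is better, which is all an evolutionary selection step needs; it never estimates an absolute accuracy value.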

1531.

Turner, Jack; Crowley, Elliot J; O'Boyle, Michael F P

Neural Architecture Search as Program Transformation Exploration Technical Report

2021.

1530.

Lopes, Vasco; Alirezazadeh, Saeid; Alexandre, Luís A

EPE-NAS: Efficient Performance Estimation Without Training for Neural Architecture Search Technical Report

2021.

1529.

Yan, Shen; Song, Kaiqiang; Liu, Fei; Zhang, Mi

CATE: Computation-aware Neural Architecture Encoding with Transformers Technical Report

2021.

1528.

Calisto, Maria Baldeon G; Lai-Yuen, Susana K

EMONAS: efficient multiobjective neural architecture search framework for 3D medical image segmentation Proceedings Article

In: Išgum, Ivana; Landman, Bennett A (Ed.): Medical Imaging 2021: Image Processing, pp. 22–34, International Society for Optics and Photonics SPIE, 2021.

@inproceedings{10.1117/12.2577088,

title = {EMONAS: efficient multiobjective neural architecture search framework for 3D medical image segmentation},

author = {Maria Baldeon G Calisto and Susana K Lai-Yuen},

editor = {Ivana Išgum and Bennett A Landman},

url = {https://doi.org/10.1117/12.2577088},

doi = {10.1117/12.2577088},

year  = {2021},

date = {2021-01-01},

booktitle = {Medical Imaging 2021: Image Processing},

volume = {11596},

pages = {22--34},

publisher = {SPIE},

organization = {International Society for Optics and Photonics},

abstract = {Deep learning plays a critical role in medical image segmentation. Nevertheless, manually designing a neural network for a specific segmentation problem is a very difficult and time-consuming task due to the massive hyperparameter search space, long training time and large volumetric data. Therefore, most designed networks are highly complex, task specific and over-parametrized. Recently, multiobjective neural architecture search (NAS) methods have been proposed to automate the design of accurate and efficient segmentation architectures. However, they only search for either the macro- or micro-structure of the architecture, do not use the information produced during the optimization process to increase the efficiency of the search, and do not consider the volumetric nature of medical images. In this work, we propose EMONAS, an Efficient MultiObjective Neural Architecture Search framework for 3D medical image segmentation. EMONAS is composed of a search space that considers both the macro- and micro-structure of the architecture, and a surrogate-assisted multiobjective evolutionary based algorithm that efficiently searches for the best hyperparameters using a Random Forest surrogate and guiding selection probabilities. EMONAS is evaluated on the task of cardiac segmentation from the ACDC MICCAI challenge. The architecture found is ranked within the top 10 submissions in all evaluation metrics, performing better or comparable to other approaches while reducing the search time by more than 50% and having considerably fewer number of parameters.},

keywords = {},

pubstate = {published},

tppubtype = {inproceedings}

}

1527.

Robbiano, Luca; Rahman, Muhammad Rameez Ur; Galasso, Fabio; Caputo, Barbara; Carlucci, Fabio Maria

Adversarial Branch Architecture Search for Unsupervised Domain Adaptation Technical Report

2021.
