Maintained by Difan Deng and Marius Lindauer.
The following list considers papers related to neural architecture search. It is by no means complete. If a paper is missing from the list, please let us know.
Please note that although NAS methods steadily improve, the quality of empirical evaluations in this field still lags behind that of other areas in machine learning, AI, and optimization. We would therefore like to share some best practices for empirical evaluations of NAS methods, which we believe will facilitate sustained and measurable progress in the field. If you are interested in a teaser, please read our blog post or jump directly to our checklist.
Transformers have gained increasing popularity in many domains. For a comprehensive list of papers on neural architecture search for Transformer-based spaces, the awesome-transformer-search repo is all you need.
2023
Eisenbach, Markus; Lübberstedt, Jannik; Aganian, Dustin; Gross, Horst-Michael
A Little Bit Attention Is All You Need for Person Re-Identification Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2302-14574,
title = {A Little Bit Attention Is All You Need for Person Re-Identification},
author = {Markus Eisenbach and Jannik Lübberstedt and Dustin Aganian and Horst-Michael Gross},
url = {https://doi.org/10.48550/arXiv.2302.14574},
doi = {10.48550/arXiv.2302.14574},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2302.14574},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Hasana, Md. Mehedi; Ibrahim, Muhammad; Ali, Md. Sawkat
Speeding Up EfficientNet: Selecting Update Blocks of Convolutional Neural Networks using Genetic Algorithm in Transfer Learning Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-00261,
title = {Speeding Up EfficientNet: Selecting Update Blocks of Convolutional Neural Networks using Genetic Algorithm in Transfer Learning},
author = {Md. Mehedi Hasana and Muhammad Ibrahim and Md. Sawkat Ali},
url = {https://doi.org/10.48550/arXiv.2303.00261},
doi = {10.48550/arXiv.2303.00261},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.00261},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Lin, Shih-Ping; Wang, Sheng-De
SGAS-es: Avoiding Performance Collapse by Sequential Greedy Architecture Search with the Early Stopping Indicator Proceedings Article
In: Arai, Kohei (Ed.): Advances in Information and Communication, pp. 135–154, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-28073-3.
@inproceedings{10.1007/978-3-031-28073-3_10,
title = {SGAS-es: Avoiding Performance Collapse by Sequential Greedy Architecture Search with the Early Stopping Indicator},
author = {Shih-Ping Lin and Sheng-De Wang},
editor = {Kohei" Ärai},
url = {https://link.springer.com/chapter/10.1007/978-3-031-28073-3_10},
isbn = {978-3-031-28073-3},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Advances in Information and Communication},
pages = {135--154},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {Sequential Greedy Architecture Search (SGAS) reduces the discretization loss of Differentiable Architecture Search (DARTS). However, we observed that SGAS may lead to unstable searched results as DARTS. We referred to this problem as the cascade performance collapse issue. Therefore, we proposed Sequential Greedy Architecture Search with the Early Stopping Indicator (SGAS-es). We adopted the early stopping mechanism in each phase of SGAS to stabilize searched results and further improve the searching ability. The early stopping mechanism is based on the relation among Flat Minima, the largest eigenvalue of the Hessian matrix of the loss function, and performance collapse. We devised a mathematical derivation to show the relation between Flat Minima and the largest eigenvalue. The moving averaged largest eigenvalue is used as an early stopping indicator. Finally, we used NAS-Bench-201 and Fashion-MNIST to confirm the performance and stability of SGAS-es. Moreover, we used EMNIST-Balanced to verify the transferability of searched results. These experiments show that SGAS-es is a robust method and can derive the architecture with good performance and transferability.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
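The early-stopping signal described in the abstract above can be prototyped in a few lines of PyTorch. A minimal sketch, assuming access to the model's validation loss and parameter list; the power-iteration routine, window size, and stopping ratio below are illustrative choices, not the authors' implementation:

import torch

def largest_hessian_eigenvalue(loss, params, iters=20):
    # Power iteration with Hessian-vector products (Pearlmutter trick):
    # estimates the largest eigenvalue without materializing the Hessian.
    grads = torch.autograd.grad(loss, params, create_graph=True)
    v = [torch.randn_like(p) for p in params]
    eig = 0.0
    for _ in range(iters):
        norm = torch.sqrt(sum((x * x).sum() for x in v)) + 1e-12
        v = [x / norm for x in v]
        hv = torch.autograd.grad(grads, params, grad_outputs=v, retain_graph=True)
        eig = sum((h * x).sum() for h, x in zip(hv, v)).item()
        v = [h.detach() for h in hv]
    return eig

class EarlyStopIndicator:
    # Moving-averaged largest eigenvalue; report a stop once the smoothed
    # value has grown past `ratio` times its best (lowest) level so far.
    def __init__(self, window=5, ratio=1.3):
        self.raw, self.smoothed = [], []
        self.window, self.ratio = window, ratio

    def should_stop(self, eig):
        self.raw.append(eig)
        tail = self.raw[-self.window:]
        self.smoothed.append(sum(tail) / len(tail))
        return self.smoothed[-1] > self.ratio * min(self.smoothed)

Calling largest_hessian_eigenvalue once per search epoch and feeding the result to should_stop reproduces the flavor of the indicator: a sharply rising eigenvalue is the abstract's proxy for leaving a flat minimum and approaching performance collapse.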
Lu, Shun; Hu, Yu; Yang, Longxing; Sun, Zihao; Mei, Jilin; Tan, Jianchao; Song, Chengru
PA&DA: Jointly Sampling PAth and DAta for Consistent NAS Conference
In: CVPR 2023, 2023.
@conference{DBLP:journals/corr/abs-2302-14772,
title = {PA&DA: Jointly Sampling PAth and DAta for Consistent NAS},
author = {Shun Lu and Yu Hu and Longxing Yang and Zihao Sun and Jilin Mei and Jianchao Tan and Chengru Song},
url = {https://doi.org/10.48550/arXiv.2302.14772},
doi = {10.48550/arXiv.2302.14772},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {CVPR2023},
journal = {CoRR},
volume = {abs/2302.14772},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
Njor, Emil; Madsen, Jan; Fafoutis, Xenofon
Data Aware Neural Architecture Search Proceedings Article
In: Proceedings of tinyML Research Symposium, 2023, (tinyML Research Symposium’23 ; Conference date: 27-03-2023 Through 27-03-2023).
@inproceedings{40cb879dbb9a4fd9bd92ad27b617056f,
title = {Data Aware Neural Architecture Search},
author = {Emil Njor and Jan Madsen and Xenofon Fafoutis},
url = {https://orbit.dtu.dk/en/publications/data-aware-neural-architecture-search},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Proceedings of tinyML Research Symposium},
abstract = {Neural Architecture Search (NAS) is a popular tool for automatically generating Neural Network (NN) architectures. In early NAS works, these tools typically optimized NN architectures for a single metric, such as accuracy. However, in the case of resource constrained Machine Learning, one single metric is not enough to evaluate a NN architecture. For example, a NN model achieving a high accuracy is not useful if it does not fit inside the flash memory of a given system. Therefore, recent works on NAS for resource constrained systems have investigated various approaches to optimize for multiple metrics. In this paper, we propose that, on top of these approaches, it could be beneficial for NAS optimization of resource constrained systems to also consider input data granularity. We name such a system “Data Aware NAS”, and we provide experimental evidence of its benefits by comparing it to traditional NAS.},
note = {tinyML Research Symposium’23 ; Conference date: 27-03-2023 Through 27-03-2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
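The core idea of the abstract above, treating input-data granularity as a search dimension alongside the architecture, fits in a few lines. A minimal sketch, assuming a user-supplied train_and_eval(arch, granularity) returning accuracy and model size; the flash-budget constraint mirrors the abstract's example:

import itertools

def data_aware_search(granularities, arch_space, train_and_eval, flash_limit):
    # Joint search over (granularity, architecture) pairs; candidates that
    # do not fit in the device's flash are rejected outright, since a high
    # accuracy is useless if the model cannot be deployed.
    best, best_acc = None, 0.0
    for g, arch in itertools.product(granularities, arch_space):
        acc, size_bytes = train_and_eval(arch, g)
        if size_bytes <= flash_limit and acc > best_acc:
            best, best_acc = (g, arch), acc
    return best, best_acc

Exhaustive iteration is only for illustration; any multi-objective NAS strategy can enumerate the same joint space.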
Wang, Kun; Han, Ling; Li, Liangzhi
A decoupled search deep network framework for high-resolution remote sensing image classification Journal Article
In: Remote Sensing Letters, vol. 14, no. 3, pp. 243-253, 2023.
@article{doi:10.1080/2150704X.2023.2185110,
title = {A decoupled search deep network framework for high-resolution remote sensing image classification},
author = {Kun Wang and Ling Han and Liangzhi Li},
url = {https://doi.org/10.1080/2150704X.2023.2185110},
doi = {10.1080/2150704X.2023.2185110},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Remote Sensing Letters},
volume = {14},
number = {3},
pages = {243-253},
publisher = {Taylor & Francis},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Xue, Yu; Chen, Chen; Słowik, Adam
Neural Architecture Search Based on A Multi-objective Evolutionary Algorithm With Probability Stack Journal Article
In: IEEE Transactions on Evolutionary Computation, pp. 1-1, 2023.
@article{10059145,
title = {Neural Architecture Search Based on A Multi-objective Evolutionary Algorithm With Probability Stack},
author = {Yu Xue and Chen Chen and Adam Słowik},
doi = {10.1109/TEVC.2023.3252612},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Evolutionary Computation},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Cao, Chunhong; Xiang, Han; Song, Wei; Yi, Hongbo; Xiao, Fen; Gao, Xieping
Lightweight Multiscale Neural Architecture Search With Spectral–Spatial Attention for Hyperspectral Image Classification Journal Article
In: IEEE Transactions on Geoscience and Remote Sensing, vol. 61, pp. 1-15, 2023.
@article{10061276,
title = {Lightweight Multiscale Neural Architecture Search With Spectral–Spatial Attention for Hyperspectral Image Classification},
author = {Chunhong Cao and Han Xiang and Wei Song and Hongbo Yi and Fen Xiao and Xieping Gao},
doi = {10.1109/TGRS.2023.3253247},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Geoscience and Remote Sensing},
volume = {61},
pages = {1-15},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Kus, Zeki; Akkan, Can; Gülcü, Ayla
Novel Surrogate Measures Based on a Similarity Network for Neural Architecture Search Journal Article
In: IEEE Access, vol. 11, pp. 22596–22613, 2023.
@article{DBLP:journals/access/KusAG23,
title = {Novel Surrogate Measures Based on a Similarity Network for Neural Architecture Search},
author = {Zeki Kus and Can Akkan and Ayla Gülcü},
url = {https://doi.org/10.1109/ACCESS.2023.3252887},
doi = {10.1109/ACCESS.2023.3252887},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Access},
volume = {11},
pages = {22596--22613},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Cereda, Elia; Crupi, Luca; Risso, Matteo; Burrello, Alessio; Benini, Luca; Giusti, Alessandro; Pagliari, Daniele Jahier; Palossi, Daniele
Deep Neural Network Architecture Search for Accurate Visual Pose Estimation aboard Nano-UAVs Proceedings Article
In: IEEE ICRA 2023, 2023.
@inproceedings{DBLP:journals/corr/abs-2303-01931,
title = {Deep Neural Network Architecture Search for Accurate Visual Pose Estimation aboard Nano-UAVs},
author = {Elia Cereda and Luca Crupi and Matteo Risso and Alessio Burrello and Luca Benini and Alessandro Giusti and Daniele Jahier Pagliari and Daniele Palossi},
url = {https://doi.org/10.48550/arXiv.2303.01931},
doi = {10.48550/arXiv.2303.01931},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {IEEE ICRA 2023},
journal = {CoRR},
volume = {abs/2303.01931},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Ke, Songyu; Pan, Zheyi; He, Tianfu; Liang, Yuxuan; Zhang, Junbo; Zheng, Yu
AutoSTG+: An automatic framework to discover the optimal network for spatio-temporal graph prediction Journal Article
In: Artificial Intelligence, vol. 318, pp. 103899, 2023, ISSN: 0004-3702.
@article{KE2023103899,
title = {AutoSTG+: An automatic framework to discover the optimal network for spatio-temporal graph prediction},
author = {Songyu Ke and Zheyi Pan and Tianfu He and Yuxuan Liang and Junbo Zhang and Yu Zheng},
url = {https://www.sciencedirect.com/science/article/pii/S0004370223000450},
doi = {10.1016/j.artint.2023.103899},
issn = {0004-3702},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Artificial Intelligence},
volume = {318},
pages = {103899},
abstract = {Spatio-temporal graphs (STGs) are important structures to describe urban sensory data, e.g., traffic speed and air quality. Predicting over spatio-temporal graphs enables many essential applications in intelligent cities, such as traffic management and environment analysis. Recently, many deep learning models have been proposed for spatio-temporal graph prediction and achieved significant results. However, manually designing neural networks requires rich domain knowledge and heavy expert efforts, making it impractical for real-world deployments. Therefore, we study automated neural architecture search for spatio-temporal graphs, which meets three challenges: 1) how to define search space for capturing complex spatio-temporal correlations; 2) how to jointly model the explicit and implicit relationships between nodes of an STG; and 3) how to learn network weight parameters related to meta graphs of STGs. To tackle these challenges, we propose a novel neural architecture search framework, entitled AutoSTG+, for automated spatio-temporal graph prediction. In our AutoSTG+, spatial graph convolution and temporal convolution operations are adopted in the search space of AutoSTG+ to capture complex spatio-temporal correlations. Besides, we propose to employ the meta-learning technique to learn the adjacency matrices of spatial graph convolution layers and kernels of temporal convolution layers from the meta knowledge of meta graphs. And specifically, such meta-knowledge is learned by graph meta-knowledge learners, which iteratively aggregate knowledge on the attributed graphs and the similarity graphs. Finally, extensive experiments have been conducted on multiple real-world datasets to demonstrate that AutoSTG+ can find effective network architectures and achieve up to about 20% relative improvements compared to human-designed networks.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
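To make the abstract's search space concrete: a differentiable cell can mix the two operation families it names, spatial graph convolution and temporal convolution, under learnable architecture weights. A toy sketch only; the meta-learning of adjacency matrices and temporal kernels, which is the paper's actual contribution, is omitted entirely:

import torch
import torch.nn as nn

class STSearchCell(nn.Module):
    # Mixes a temporal 1-D convolution and a spatial graph convolution
    # with softmax-normalized architecture weights `alpha`.
    def __init__(self, channels):
        super().__init__()
        self.temporal = nn.Conv2d(channels, channels, (1, 3), padding=(0, 1))
        self.spatial = nn.Linear(channels, channels)
        self.alpha = nn.Parameter(torch.zeros(2))

    def forward(self, x, adj):
        # x: (batch, channels, nodes, time); adj: (nodes, nodes)
        w = torch.softmax(self.alpha, dim=0)
        t = self.temporal(x)
        s = torch.einsum("bcnt,nm->bcmt", x, adj)
        s = self.spatial(s.permute(0, 2, 3, 1)).permute(0, 3, 1, 2)
        return w[0] * t + w[1] * s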
Brown, Austin; Gupta, Maanak; Abdelsalam, Mahmoud
Automated Machine Learning for Deep Learning based Malware Detection Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-01679,
title = {Automated Machine Learning for Deep Learning based Malware Detection},
author = {Austin Brown and Maanak Gupta and Mahmoud Abdelsalam},
url = {https://doi.org/10.48550/arXiv.2303.01679},
doi = {10.48550/arXiv.2303.01679},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.01679},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Santana, Roberto; Hidalgo-Cenalmor, Ivan; Garciarena, Unai; Mendiburu, Alexander; Lozano, José Antonio
Neuroevolutionary algorithms driven by neuron coverage metrics for semi-supervised classification Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-02801,
title = {Neuroevolutionary algorithms driven by neuron coverage metrics for semi-supervised classification},
author = {Roberto Santana and Ivan Hidalgo-Cenalmor and Unai Garciarena and Alexander Mendiburu and José Antonio Lozano},
url = {https://doi.org/10.48550/arXiv.2303.02801},
doi = {10.48550/arXiv.2303.02801},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.02801},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Lee, Jong-Ryul; Moon, Yong-Hyuk
Bespoke: A Block-Level Neural Network Optimization Framework for Low-Cost Deployment Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-01913,
title = {Bespoke: A Block-Level Neural Network Optimization Framework for Low-Cost Deployment},
author = {Jong-Ryul Lee and Yong-Hyuk Moon},
url = {https://doi.org/10.48550/arXiv.2303.01913},
doi = {10.48550/arXiv.2303.01913},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.01913},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Shen, Xuan; Wang, Yaohua; Lin, Ming; Huang, Yilun; Tang, Hao; Sun, Xiuyu; Wang, Yanzhi
DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-02165,
title = {DeepMAD: Mathematical Architecture Design for Deep Convolutional Neural Network},
author = {Xuan Shen and Yaohua Wang and Ming Lin and Yilun Huang and Hao Tang and Xiuyu Sun and Yanzhi Wang},
url = {https://doi.org/10.48550/arXiv.2303.02165},
doi = {10.48550/arXiv.2303.02165},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.02165},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Ali, Mohamed Nabih; Paissan, Francesco; Falavigna, Daniele; Brutti, Alessio
Scaling strategies for on-device low-complexity source separation with Conv-Tasnet Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-03005,
title = {Scaling strategies for on-device low-complexity source separation with Conv-Tasnet},
author = {Mohamed Nabih Ali and Francesco Paissan and Daniele Falavigna and Alessio Brutti},
url = {https://doi.org/10.48550/arXiv.2303.03005},
doi = {10.48550/arXiv.2303.03005},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.03005},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Chitty-Venkata, Krishna Teja; Emani, Murali; Vishwanath, Venkatram; Somani, Arun K.
Neural Architecture Search Benchmarks: Insights and Survey Journal Article
In: IEEE Access, vol. 11, pp. 25217–25236, 2023.
@article{DBLP:journals/access/ChittyVenkataEVS23,
title = {Neural Architecture Search Benchmarks: Insights and Survey},
author = {Krishna Teja Chitty-Venkata and Murali Emani and Venkatram Vishwanath and Arun K. Somani},
url = {https://doi.org/10.1109/ACCESS.2023.3253818},
doi = {10.1109/ACCESS.2023.3253818},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Access},
volume = {11},
pages = {25217--25236},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Qin, Dalin; Wang, Chenxi; Wen, Qingsong; Chen, Weiqi; Sun, Liang; Wang, Yi
Personalized Federated DARTS for Electricity Load Forecasting of Individual Buildings Journal Article
In: IEEE Transactions on Smart Grid, pp. 1-1, 2023.
@article{10063999,
title = {Personalized Federated DARTS for Electricity Load Forecasting of Individual Buildings},
author = {Dalin Qin and Chenxi Wang and Qingsong Wen and Weiqi Chen and Liang Sun and Yi Wang},
doi = {10.1109/TSG.2023.3253855},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Smart Grid},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Alaiad, Ahmad; Migdady, Aya; Al-Khatib, Ra’ed M.; Alzoubi, Omar; Zitar, Raed Abu; Abualigah, Laith
Autokeras Approach: A Robust Automated Deep Learning Network for Diagnosis Disease Cases in Medical Images Journal Article
In: Journal of Imaging, vol. 9, no. 3, 2023, ISSN: 2313-433X.
@article{jimaging9030064,
title = {Autokeras Approach: A Robust Automated Deep Learning Network for Diagnosis Disease Cases in Medical Images},
author = {Ahmad Alaiad and Aya Migdady and Ra’ed M. Al-Khatib and Omar Alzoubi and Raed Abu Zitar and Laith Abualigah},
url = {https://www.mdpi.com/2313-433X/9/3/64},
doi = {10.3390/jimaging9030064},
issn = {2313-433X},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Journal of Imaging},
volume = {9},
number = {3},
abstract = {Automated deep learning is promising in artificial intelligence (AI). However, a few applications of automated deep learning networks have been made in the clinical medical fields. Therefore, we studied the application of an open-source automated deep learning framework, Autokeras, for detecting smear blood images infected with malaria parasites. Autokeras is able to identify the optimal neural network to perform the classification task. Hence, the robustness of the adopted model is due to it not needing any prior knowledge from deep learning. In contrast, the traditional deep neural network methods still require more construction to identify the best convolutional neural network (CNN). The dataset used in this study consisted of 27,558 blood smear images. A comparative process proved the superiority of our proposed approach over other traditional neural networks. The evaluation results of our proposed model achieved high efficiency with impressive accuracy, reaching 95.6% when compared with previous competitive models.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
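The AutoKeras workflow the abstract describes is a genuinely short API. A sketch on MNIST (the malaria blood-smear dataset from the paper is not bundled with Keras, so MNIST stands in; max_trials and epochs are kept small for illustration):

import autokeras as ak
from tensorflow.keras.datasets import mnist

(x_train, y_train), (x_test, y_test) = mnist.load_data()

# AutoKeras searches for a CNN automatically; max_trials bounds the
# number of candidate architectures it trains and compares.
clf = ak.ImageClassifier(max_trials=3, overwrite=True)
clf.fit(x_train, y_train, epochs=5)
print(clf.evaluate(x_test, y_test))

best_model = clf.export_model()  # the winning architecture as a Keras model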
Wang, Zhipeng; Liu, Rengkui; Gao, Yi; Tang, Yuanjie
Metro Track Geometry Defect Identification Model Based on Car-Body Vibration Data and Differentiable Architecture Search Journal Article
In: Applied Sciences, vol. 13, no. 6, 2023, ISSN: 2076-3417.
@article{app13063457,
title = {Metro Track Geometry Defect Identification Model Based on Car-Body Vibration Data and Differentiable Architecture Search},
author = {Zhipeng Wang and Rengkui Liu and Yi Gao and Yuanjie Tang},
url = {https://www.mdpi.com/2076-3417/13/6/3457},
doi = {10.3390/app13063457},
issn = {2076-3417},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Applied Sciences},
volume = {13},
number = {6},
abstract = {Efficient and low-cost modes for detecting metro track geometry defects (TGDs) are essential for condition-prediction-based preventive maintenance, which can help improve the safety of metro operations and reduce the maintenance cost of metro tracks. Compared with the traditional TGD detection method that utilizes the track geometry car, the method that uses a portable detector to acquire the car-body vibration data (CVD) can be used on an ordinary in-service train without occupying the metro schedule line, thereby improving efficiency and reducing the cost. A convolutional neural network-based identification model for TGD, built on a differentiable architecture search, is proposed in this study to employ only the CVD acquired by a portable detector for integrated identification of the type and severity level of TGDs. Second, the random oversampling method is introduced, and a strategy for applying this method is proposed to improve the poor training effect of the model caused by the natural class-imbalance problem arising from the TGD dataset. Subsequently, a comprehensive performance-evaluation metric (track geometry defect F-score) is designed by considering the actual management needs of the metro infrastructure. Finally, a case study is conducted using actual field data collected from Beijing Subway to validate the proposed model.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
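The random-oversampling step mentioned in the abstract above is simple to state in code. A naive sketch that replicates minority-class samples until every class matches the majority count; the paper's application strategy for the TGD dataset is more involved:

import random
from collections import defaultdict

def random_oversample(samples, labels):
    # Group by class, then duplicate randomly chosen minority-class samples
    # until all classes reach the majority-class size.
    by_class = defaultdict(list)
    for s, y in zip(samples, labels):
        by_class[y].append(s)
    target = max(len(group) for group in by_class.values())
    out = []
    for y, group in by_class.items():
        out.extend((s, y) for s in group)
        out.extend((random.choice(group), y) for _ in range(target - len(group)))
    random.shuffle(out)
    return out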
Mecharbat, Lotfi Abdelkrim; Benmeziane, Hadjer; Ouranoughi, Hamza; Niar, Smaïl
HyT-NAS: Hybrid Transformers Neural Architecture Search for Edge Devices Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-04440b,
title = {HyT-NAS: Hybrid Transformers Neural Architecture Search for Edge Devices},
author = {Lotfi Abdelkrim Mecharbat and Hadjer Benmeziane and Hamza Ouranoughi and Smaïl Niar},
url = {https://doi.org/10.48550/arXiv.2303.04440},
doi = {10.48550/arXiv.2303.04440},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.04440},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Wendlinger, Lorenz; Granitzer, Michael; Fellicious, Christofer
Pooling Graph Convolutional Networks for Structural Performance Prediction Proceedings Article
In: Nicosia, Giuseppe; Ojha, Varun; Malfa, Emanuele La; Malfa, Gabriele La; Pardalos, Panos; Fatta, Giuseppe Di; Giuffrida, Giovanni; Umeton, Renato (Ed.): Machine Learning, Optimization, and Data Science, pp. 1–16, Springer Nature Switzerland, Cham, 2023, ISBN: 978-3-031-25891-6.
@inproceedings{10.1007/978-3-031-25891-6_1,
title = {Pooling Graph Convolutional Networks for Structural Performance Prediction},
author = {Lorenz Wendlinger and Michael Granitzer and Christofer Fellicious},
editor = {Giuseppe Nicosia and Varun Ojha and Emanuele La Malfa and Gabriele La Malfa and Panos Pardalos and Giuseppe Di Fatta and Giovanni Giuffrida and Renato Umeton},
url = {https://link.springer.com/chapter/10.1007/978-3-031-25891-6_1},
isbn = {978-3-031-25891-6},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Machine Learning, Optimization, and Data Science},
pages = {1--16},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {Neural Architecture Search can help in finding high-performance task specific neural network architectures. However, the training of architectures that is required for fitness computation can be prohibitively expensive. Employing surrogate models as performance predictors can reduce or remove the need for these costly evaluations. We present a deep graph learning approach that achieves state-of-the-art performance in multiple NAS performance prediction benchmarks. In contrast to other methods, this model is purely supervised, which can be a methodologic advantage, as it does not rely on unlabeled instances sampled from complex search spaces.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
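The surrogate idea in the abstract above, a purely supervised regressor from architecture encodings to accuracy, can be prototyped without the paper's pooling GCN. A minimal stand-in, assuming flat feature encodings of architectures (the real model operates on graphs):

import torch
import torch.nn as nn

class SurrogateRegressor(nn.Module):
    # MLP stand-in for the pooling GCN: maps an architecture encoding
    # to a predicted validation accuracy.
    def __init__(self, in_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(in_dim, 128), nn.ReLU(),
                                 nn.Linear(128, 1))

    def forward(self, x):
        return self.net(x).squeeze(-1)

def fit_surrogate(encodings, accuracies, epochs=200, lr=1e-3):
    # Supervised training only -- no unlabeled instances sampled from the
    # search space are needed, which is the methodological point made above.
    model = SurrogateRegressor(encodings.shape[1])
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        opt.zero_grad()
        loss = nn.functional.mse_loss(model(encodings), accuracies)
        loss.backward()
        opt.step()
    return model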
Wen, Hao; Li, Yuanchun; Zhang, Zunshuai; Jiang, Shiqi; Ye, Xiaozhou; Ouyang, Ye; Zhang, Ya-Qin; Liu, Yunxin
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-07129,
title = {AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments},
author = {Hao Wen and Yuanchun Li and Zunshuai Zhang and Shiqi Jiang and Xiaozhou Ye and Ye Ouyang and Ya-Qin Zhang and Yunxin Liu},
url = {https://doi.org/10.48550/arXiv.2303.07129},
doi = {10.48550/arXiv.2303.07129},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.07129},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Gu, Jianyang; Wang, Kai; Luo, Hao; Chen, Chen; Jiang, Wei; Fang, Yuqiang; Zhang, Shanghang; You, Yang; Zhao, Jian
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-07065,
title = {MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID},
author = {Jianyang Gu and Kai Wang and Hao Luo and Chen Chen and Wei Jiang and Yuqiang Fang and Shanghang Zhang and Yang You and Jian Zhao},
url = {https://doi.org/10.48550/arXiv.2303.07065},
doi = {10.48550/arXiv.2303.07065},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.07065},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Choi, Wonhyeok; Im, Sunghoon
Dynamic Neural Network for Multi-Task Learning Searching across Diverse Network Topologies Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-06856,
title = {Dynamic Neural Network for Multi-Task Learning Searching across Diverse Network Topologies},
author = {Wonhyeok Choi and Sunghoon Im},
url = {https://doi.org/10.48550/arXiv.2303.06856},
doi = {10.48550/arXiv.2303.06856},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.06856},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Zhang, Li Lyna; Wang, Xudong; Xu, Jiahang; Zhang, Quanlu; Wang, Yujing; Yang, Yuqing; Zheng, Ningxin; Cao, Ting; Yang, Mao
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-08308,
title = {SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference},
author = {Li Lyna Zhang and Xudong Wang and Jiahang Xu and Quanlu Zhang and Yujing Wang and Yuqing Yang and Ningxin Zheng and Ting Cao and Mao Yang},
url = {https://doi.org/10.48550/arXiv.2303.08308},
doi = {10.48550/arXiv.2303.08308},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.08308},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Yang, Yuguang; Pan, Yu; Yin, Jingjing; Han, Jiangyu; Ma, Lei; Lu, Heng
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-08636,
title = {HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism},
author = {Yuguang Yang and Yu Pan and Jingjing Yin and Jiangyu Han and Lei Ma and Heng Lu},
url = {https://doi.org/10.48550/arXiv.2303.08636},
doi = {10.48550/arXiv.2303.08636},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.08636},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Savadikar, Chinmay; Dai, Michelle; Wu, Tianfu
Learning to Grow Artificial Hippocampi in Vision Transformers for Resilient Lifelong Learning Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-08250,
title = {Learning to Grow Artificial Hippocampi in Vision Transformers for Resilient Lifelong Learning},
author = {Chinmay Savadikar and Michelle Dai and Tianfu Wu},
url = {https://doi.org/10.48550/arXiv.2303.08250},
doi = {10.48550/arXiv.2303.08250},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.08250},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Hamid, Saad; Wan, Xingchen; Jørgensen, Martin; Ru, Binxin; Osborne, Michael A.
Bayesian Quadrature for Neural Ensemble Search Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-08874,
title = {Bayesian Quadrature for Neural Ensemble Search},
author = {Saad Hamid and Xingchen Wan and Martin Jørgensen and Binxin Ru and Michael A. Osborne},
url = {https://doi.org/10.48550/arXiv.2303.08874},
doi = {10.48550/arXiv.2303.08874},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.08874},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Lyu, Bo; Wen, Shiping; Yang, Yin; Chang, Xiaojun; Sun, Junwei; Chen, Yiran; Huang, Tingwen
Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks Journal Article
In: IEEE Transactions on Neural Networks and Learning Systems, pp. 1-10, 2023.
@article{10075408,
title = {Designing Efficient Bit-Level Sparsity-Tolerant Memristive Networks},
author = {Bo Lyu and Shiping Wen and Yin Yang and Xiaojun Chang and Junwei Sun and Yiran Chen and Tingwen Huang},
doi = {10.1109/TNNLS.2023.3250437},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Neural Networks and Learning Systems},
pages = {1-10},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Montes, Charles; Morehouse, Todd; Kasilingam, Dayalan; Zhou, Ruolin
Optimized CNN Auto-Generator Using GA With Stopping Criterion: Design and a Use Case Proceedings Article
In: 2023 IEEE 20th Consumer Communications & Networking Conference (CCNC), pp. 1009-1014, 2023.
@inproceedings{10060860,
title = {Optimized CNN Auto-Generator Using GA With Stopping Criterion: Design and a Use Case},
author = {Charles Montes and Todd Morehouse and Dayalan Kasilingam and Ruolin Zhou},
doi = {10.1109/CCNC51644.2023.10060860},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 IEEE 20th Consumer Communications & Networking Conference (CCNC)},
pages = {1009-1014},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Trivedi, Aashka; Udagawa, Takuma; Merler, Michele; Panda, Rameswar; El-Kurdi, Yousef; Bhattacharjee, Bishwaranjan
Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-09639,
title = {Neural Architecture Search for Effective Teacher-Student Knowledge Transfer in Language Models},
author = {Aashka Trivedi and Takuma Udagawa and Michele Merler and Rameswar Panda and Yousef El-Kurdi and Bishwaranjan Bhattacharjee},
url = {https://doi.org/10.48550/arXiv.2303.09639},
doi = {10.48550/arXiv.2303.09639},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.09639},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Li, Chaojian; Chen, Wenwan; Yuan, Jiayi; Lin, Yingyan; Sabharwal, Ashutosh
ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-10727,
title = {ERSAM: Neural Architecture Search For Energy-Efficient and Real-Time Social Ambiance Measurement},
author = {Chaojian Li and Wenwan Chen and Jiayi Yuan and Yingyan Lin and Ashutosh Sabharwal},
url = {https://doi.org/10.48550/arXiv.2303.10727},
doi = {10.48550/arXiv.2303.10727},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.10727},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Zhou, Ao; Yang, Jianlei; Qi, Yingjie; Shi, Yumeng; Qiao, Tong; Zhao, Weisheng; Hu, Chunming
Hardware-Aware Graph Neural Network Automated Design for Edge Computing Platforms Journal Article
In: CoRR, vol. abs/2303.10875, 2023.
@article{DBLP:journals/corr/abs-2303-10875,
title = {Hardware-Aware Graph Neural Network Automated Design for Edge Computing Platforms},
author = {Ao Zhou and Jianlei Yang and Yingjie Qi and Yumeng Shi and Tong Qiao and Weisheng Zhao and Chunming Hu},
url = {https://doi.org/10.48550/arXiv.2303.10875},
doi = {10.48550/arXiv.2303.10875},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.10875},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Lin, Guan-Ting; Tang, Qingming; Kao, Chieh-Chi; Rozgic, Viktor; Wang, Chao
Weight-sharing Supernet for Searching Specialized Acoustic Event Classification Networks Across Device Constraints Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-10351,
title = {Weight-sharing Supernet for Searching Specialized Acoustic Event Classification Networks Across Device Constraints},
author = {Guan-Ting Lin and Qingming Tang and Chieh-Chi Kao and Viktor Rozgic and Chao Wang},
url = {https://doi.org/10.48550/arXiv.2303.10351},
doi = {10.48550/arXiv.2303.10351},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.10351},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Tang, Chen; Zhang, Li Lyna; Jiang, Huiqiang; Xu, Jiahang; Cao, Ting; Zhang, Quanlu; Yang, Yuqing; Wang, Zhi; Yang, Mao
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-09730,
title = {ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices},
author = {Chen Tang and Li Lyna Zhang and Huiqiang Jiang and Jiahang Xu and Ting Cao and Quanlu Zhang and Yuqing Yang and Zhi Wang and Mao Yang},
url = {https://doi.org/10.48550/arXiv.2303.09730},
doi = {10.48550/arXiv.2303.09730},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.09730},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Shi, Huihong; You, Haoran; Wang, Zhongfeng; Lin, Yingyan
NASA+: Neural Architecture Search and Acceleration for Multiplication-Reduced Hybrid Networks Journal Article
In: IEEE Transactions on Circuits and Systems I: Regular Papers, pp. 1-14, 2023.
@article{10078392,
title = {NASA+: Neural Architecture Search and Acceleration for Multiplication-Reduced Hybrid Networks},
author = {Huihong Shi and Haoran You and Zhongfeng Wang and Yingyan Lin},
url = {https://ieeexplore.ieee.org/abstract/document/10078392},
doi = {10.1109/TCSI.2023.3256700},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Circuits and Systems I: Regular Papers},
pages = {1-14},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Li, Sheng; Andersen, Garrett; Chen, Tao; Cheng, Liqun; Grady, Julian; Huang, Da; Le, Quoc V.; Li, Andrew; Li, Xin; Li, Yang; Liang, Chen; Lu, Yifeng; Ni, Yun; Pang, Ruoming; Tan, Mingxing; Wicke, Martin; Wu, Gang; Zhu, Shengqi; Ranganathan, Parthasarathy; Jouppi, Norman P.
Hyperscale Hardware Optimized Neural Architecture Search Proceedings Article
In: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3, pp. 343–358, Association for Computing Machinery, Vancouver, BC, Canada, 2023, ISBN: 9781450399180.
@inproceedings{10.1145/3582016.3582049,
title = {Hyperscale Hardware Optimized Neural Architecture Search},
author = {Sheng Li and Garrett Andersen and Tao Chen and Liqun Cheng and Julian Grady and Da Huang and Quoc V. Le and Andrew Li and Xin Li and Yang Li and Chen Liang and Yifeng Lu and Yun Ni and Ruoming Pang and Mingxing Tan and Martin Wicke and Gang Wu and Shengqi Zhu and Parthasarathy Ranganathan and Norman P. Jouppi},
url = {https://doi.org/10.1145/3582016.3582049},
doi = {10.1145/3582016.3582049},
isbn = {9781450399180},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 3},
pages = {343–358},
publisher = {Association for Computing Machinery},
address = {Vancouver, BC, Canada},
series = {ASPLOS 2023},
abstract = {Recent advances in machine learning have leveraged dramatic increases in computational power, a trend expected to continue in the future. This paper introduces the first Hyperscale Hardware Optimized Neural Architecture Search (H2O-NAS) to automatically design accurate and performant machine learning models tailored to the underlying hardware architecture. H2O-NAS consists of three key components: a new massively parallel “one-shot” search algorithm with intelligent weight sharing, which can scale to search spaces of O(10^280) and handle large volumes of production traffic; hardware-optimized search spaces for diverse ML models on heterogeneous hardware; and a novel two-phase hybrid performance model and a multi-objective reward function optimized for large-scale deployments. H2O-NAS has been implemented around state-of-the-art machine learning models (e.g. convolutional models, vision transformers, and deep learning recommendation models) and deployed at zettaflop scale in production. Our results demonstrate significant improvements in performance (22% ∼ 56%) and energy efficiency (17% ∼ 25%) at same or better quality. Our solution is designed for large-scale deployment, streamlining privacy and security processes and reducing manual overhead. This facilitates a smooth and automated transition from research to production.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
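The abstract above mentions a multi-objective reward optimized for large-scale deployments; the exact function is not given, but the standard shape of such rewards is a soft latency penalty on accuracy (MnasNet-style). A hedged sketch, with target_ms and beta as invented illustrative values:

def multi_objective_reward(accuracy, latency_ms, target_ms=5.0, beta=-0.07):
    # Accuracy scaled by a soft-exponential latency penalty: candidates
    # near the target latency are barely penalized, slow ones sharply so.
    # Illustrative only; H2O-NAS's two-phase hybrid model is not public.
    return accuracy * (latency_ms / target_ms) ** beta

A search controller then simply maximizes this scalar, folding the accuracy/hardware-cost trade-off into a single objective.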
Yin, Shibai; Hu, Shuhao; Wang, Yibin; Yang, Yee-Hong
High-order Adams Network (HIAN) for image dehazing Journal Article
In: Applied Soft Computing, vol. 139, pp. 110204, 2023, ISSN: 1568-4946.
@article{YIN2023110204,
title = {High-order Adams Network (HIAN) for image dehazing},
author = {Shibai Yin and Shuhao Hu and Yibin Wang and Yee-Hong Yang},
url = {https://www.sciencedirect.com/science/article/pii/S1568494623002223},
doi = {10.1016/j.asoc.2023.110204},
issn = {1568-4946},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Applied Soft Computing},
volume = {139},
pages = {110204},
abstract = {Convolutional Neural Networks (CNN) are widely used in image dehazing. However, existing network frameworks are built based on manual design from practical experience, lacking interpretable result or theoretical guidelines. Recently, residual networks are regarded as the explicit Euler forward approximation of the ODE (Ordinary Differential Equation), and several ODE-inspired networks are proposed based on the low-order explicit Euler schemes. However, on the issues of system stability and training convergence, high-order Implicit Adams Predictor–Corrector (IAPC) methods have proven to be better than low-order explicit Euler methods. Hence, we extend the IAPC method to the High-order Implicit Adams Network (HIAN). To do so, we design a series of Implicit Adams Predictor–Corrector Blocks (IABs) based on the high-order IAPC methods, all of which give better stability and accuracy than the ones designed using the low-order Euler methods. Given that, we further propose the Implicit Adams Predictor–Corrector Module (IAM) by combining the Non-local Sparse Attention (NSA) and Attention Feature Fusion (AFF) with stacked IABs where the NSA explores the mutual-correlation among intermediate features with low computation cost via a sparse constraint, while the AFF fuses intermediate features by reweighting the features from stacked IABs adaptively. Moreover, because manual network design with IABs limits dehazing performance, the Neural Architecture Search (NAS) is used to find an optimal architecture automatically. This resulting design not only is interpretable for image dehazing but also provides a reliable guideline on future network designs. The experiments demonstrate that the proposed method outperforms most existing methods on both synthetic and real images.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
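The abstract above builds on the view of residual blocks as explicit Euler steps of an ODE, x_{t+1} = x_t + f(x_t), and replaces them with higher-order Adams predictor-corrector steps. A minimal second-order sketch (Adams-Bashforth predictor, trapezoidal Adams-Moulton corrector); the paper's IAB blocks with attention and feature fusion are richer than this:

import torch.nn as nn

class AdamsPCBlock(nn.Module):
    # f is any learnable sub-network (e.g. a small conv stack); h is the
    # ODE step size.
    def __init__(self, f, h=1.0):
        super().__init__()
        self.f, self.h = f, h

    def forward(self, x, f_prev):
        f_cur = self.f(x)
        # Predictor (2-step Adams-Bashforth): x* = x + h(3/2 f_t - 1/2 f_{t-1})
        x_pred = x + self.h * (1.5 * f_cur - 0.5 * f_prev)
        # Corrector (Adams-Moulton, trapezoidal): x_{t+1} = x + h/2 (f(x*) + f_t)
        x_next = x + 0.5 * self.h * (self.f(x_pred) + f_cur)
        return x_next, f_cur

Chaining such blocks and searching over their composition with NAS is the paper's recipe for an interpretable dehazing backbone.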
Zhou, Yuan; Wang, Haiyang; Huo, Shuwei; Wang, Boyu
Hierarchical full-attention neural architecture search based on search space compression Journal Article
In: Knowledge-Based Systems, vol. 269, pp. 110507, 2023, ISSN: 0950-7051.
@article{ZHOU2023110507,
title = {Hierarchical full-attention neural architecture search based on search space compression},
author = {Yuan Zhou and Haiyang Wang and Shuwei Huo and Boyu Wang},
url = {https://www.sciencedirect.com/science/article/pii/S0950705123002575},
doi = {10.1016/j.knosys.2023.110507},
issn = {0950-7051},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Knowledge-Based Systems},
volume = {269},
pages = {110507},
abstract = {Neural architecture search (NAS) has significantly advanced the automatic design of convolutional neural architectures. However, it is challenging to directly extend existing NAS methods to attention networks because of the uniform structure of the search space and the lack of long-range feature extraction. To address these issues, we construct a hierarchical search space that allows various attention operations to be adopted for different layers of a network. To reduce the complexity of the search, a low-cost search space compression method is proposed to automatically remove the unpromising candidate operations for each layer. Furthermore, we propose a novel search strategy combining a self-supervised search with a supervised one to simultaneously capture long-range and short-range dependencies. To verify the effectiveness of the proposed methods, we conduct extensive experiments on various learning tasks, including image classification, fine-grained image recognition, and zero-shot image retrieval. The empirical results show strong evidence that our method is capable of discovering high-performance full-attention architectures while guaranteeing the required search efficiency.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
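The search-space compression step from the abstract above reduces, per layer, the candidate attention operations to the most promising few before the expensive search begins. A minimal sketch, assuming a low-cost proxy score_fn(layer, op) supplied by the caller:

def compress_search_space(layer_candidates, score_fn, keep=2):
    # For each layer, rank candidate attention ops by a cheap proxy score
    # and keep only the top `keep`, shrinking the hierarchical space.
    return [sorted(ops, key=lambda op: score_fn(layer, op), reverse=True)[:keep]
            for layer, ops in enumerate(layer_candidates)]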
Huang, Jia-Cheng; Zeng, Guo-Qiang; Geng, Guang-Gang; Weng, Jian; Lu, Kang-Di
SOPA-GA-CNN: Synchronous optimisation of parameters and architectures by genetic algorithms with convolutional neural network blocks for securing Industrial Internet-of-Things Journal Article
In: IET Cyber-Systems and Robotics, vol. 5, no. 1, pp. e12085, 2023.
@article{https://doi.org/10.1049/csy2.12085,
title = {SOPA-GA-CNN: Synchronous optimisation of parameters and architectures by genetic algorithms with convolutional neural network blocks for securing Industrial Internet-of-Things},
author = {Jia-Cheng Huang and Guo-Qiang Zeng and Guang-Gang Geng and Jian Weng and Kang-Di Lu},
url = {https://ietresearch.onlinelibrary.wiley.com/doi/abs/10.1049/csy2.12085},
doi = {10.1049/csy2.12085},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IET Cyber-Systems and Robotics},
volume = {5},
number = {1},
pages = {e12085},
abstract = {In recent years, deep learning has been applied to a variety of scenarios in Industrial Internet of Things (IIoT), including enhancing the security of IIoT. However, the existing deep learning methods utilised in IIoT security are manually designed by heavily relying on the experience of the designers. The authors have made the first contribution concerning the joint optimisation of neural architecture search and hyper-parameters optimisation for securing IIoT. A novel automated deep learning method called synchronous optimisation of parameters and architectures by GA with CNN blocks (SOPA-GA-CNN) is proposed to synchronously optimise the hyperparameters and block-based architectures in convolutional neural networks (CNNs) by genetic algorithms (GA) for the intrusion detection issue of IIoT. An efficient hybrid encoding strategy and the corresponding GA-based evolutionary operations are designed to characterise and evolve both the hyperparameters, including batch size, learning rate, weight optimiser and weight regularisation, and the architectures, such as the block-based network topology and the parameters of each CNN block. The experimental results on five intrusion detection datasets in IIoT, including secure water treatment, water distribution, Gas Pipeline, Botnet in Internet of Things and Power System Attack Dataset, have demonstrated the superiority of the proposed SOPA-GA-CNN to the state-of-the-art manually designed models and neuron-evolutionary methods in terms of accuracy, precision, recall, F1-score, and the number of parameters of the deep learning models.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
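The hybrid encoding from the abstract above, hyperparameters and a block-based architecture in one genome, is easy to sketch. The choice sets and mutation rate below are invented for illustration and are not the paper's configuration:

import random

HYPER_CHOICES = {
    "batch_size": [32, 64, 128],
    "learning_rate": [1e-2, 1e-3, 1e-4],
    "optimizer": ["sgd", "adam"],
    "weight_decay": [0.0, 1e-4, 1e-3],
}
BLOCK_CHOICES = ["conv3x3", "conv5x5", "pool", "skip"]

def random_chromosome(n_blocks=6):
    # One genome carries both halves of the joint search problem.
    hypers = {k: random.choice(v) for k, v in HYPER_CHOICES.items()}
    blocks = [random.choice(BLOCK_CHOICES) for _ in range(n_blocks)]
    return hypers, blocks

def mutate(chromosome, p=0.1):
    # Point mutation resamples each gene independently with probability p.
    hypers, blocks = chromosome
    hypers = {k: (random.choice(HYPER_CHOICES[k]) if random.random() < p else v)
              for k, v in hypers.items()}
    blocks = [random.choice(BLOCK_CHOICES) if random.random() < p else b
              for b in blocks]
    return hypers, blocks

Fitness evaluation (training each decoded CNN on an intrusion-detection dataset) plugs into any standard GA loop with selection and crossover.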
Sakuma, Yuiko; Ishii, Masato; Narihira, Takuya
DetOFA: Efficient Training of Once-for-All Networks for Object Detection by Using Pre-trained Supernet and Path Filter Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-13121,
title = {DetOFA: Efficient Training of Once-for-All Networks for Object Detection by Using Pre-trained Supernet and Path Filter},
author = {Yuiko Sakuma and Masato Ishii and Takuya Narihira},
url = {https://doi.org/10.48550/arXiv.2303.13121},
doi = {10.48550/arXiv.2303.13121},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.13121},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xu, Ying; Cheng, Long; Cai, Xuyi; Ma, Xiaohan; Chen, Weiwei; Zhang, Lei; Wang, Ying
Efficient Supernet Training Using Path Parallelism Proceedings Article
In: 2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA), pp. 1249-1261, 2023.
@inproceedings{10071099,
title = {Efficient Supernet Training Using Path Parallelism},
author = {Ying Xu and Long Cheng and Xuyi Cai and Xiaohan Ma and Weiwei Chen and Lei Zhang and Ying Wang},
doi = {10.1109/HPCA56546.2023.10071099},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA)},
pages = {1249-1261},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Verma, Monu; Mandal, Murari; Reddy, M. Satish Kumar; Meedimale, Yashwanth Reddy; Vipparthi, Santosh Kumar
Efficient Neural Architecture Search for Emotion Recognition Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-13653,
title = {Efficient Neural Architecture Search for Emotion Recognition},
author = {Monu Verma and Murari Mandal and M. Satish Kumar Reddy and Yashwanth Reddy Meedimale and Santosh Kumar Vipparthi},
url = {https://doi.org/10.48550/arXiv.2303.13653},
doi = {10.48550/arXiv.2303.13653},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.13653},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Ito, Rafael C.; Zuben, Fernando J. Von
OFA²: A Multi-Objective Perspective for the Once-for-All Neural Architecture Search Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-13683,
title = {OFA$^2$: A Multi-Objective Perspective for the Once-for-All Neural Architecture Search},
author = {Rafael C. Ito and Fernando J. Von Zuben},
url = {https://doi.org/10.48550/arXiv.2303.13683},
doi = {10.48550/arXiv.2303.13683},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.13683},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Benmeziane, Hadjer; Ounnoughene, Amine Ziad; Hamzaoui, Imane; Bouhadjar, Younes
Skip Connections in Spiking Neural Networks: An Analysis of Their Effect on Network Training Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-13563,
title = {Skip Connections in Spiking Neural Networks: An Analysis of Their Effect on Network Training},
author = {Hadjer Benmeziane and Amine Ziad Ounnoughene and Imane Hamzaoui and Younes Bouhadjar},
url = {https://doi.org/10.48550/arXiv.2303.13563},
doi = {10.48550/arXiv.2303.13563},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.13563},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xiong, Zhuoran; Amein, Marihan; Therrien, Olivier; Gross, Warren J.; Meyer, Brett H.
FMAS: Fast Multi-Objective SuperNet Architecture Search for Semantic Segmentation Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-16322,
title = {FMAS: Fast Multi-Objective SuperNet Architecture Search for Semantic Segmentation},
author = {Zhuoran Xiong and Marihan Amein and Olivier Therrien and Warren J. Gross and Brett H. Meyer},
url = {https://doi.org/10.48550/arXiv.2303.16322},
doi = {10.48550/arXiv.2303.16322},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.16322},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Dong, Peijie; Li, Lujun; Wei, Zimian
DisWOT: Student Architecture Search for Distillation WithOut Training Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-15678,
title = {DisWOT: Student Architecture Search for Distillation WithOut Training},
author = {Peijie Dong and Lujun Li and Zimian Wei},
url = {https://doi.org/10.48550/arXiv.2303.15678},
doi = {10.48550/arXiv.2303.15678},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.15678},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Naumov, A.; Melnikov, Artem; Abronin, V.; Oxanichenko, F.; Izmailov, K.; Pflitsch, Markus; Melnikov, A.; Perelshtein, Michael
Tetra-AML: Automatic Machine Learning via Tensor Networks Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2303-16214,
title = {Tetra-AML: Automatic Machine Learning via Tensor Networks},
author = {A. Naumov and Artem Melnikov and V. Abronin and F. Oxanichenko and K. Izmailov and Markus Pflitsch and A. Melnikov and Michael Perelshtein},
url = {https://doi.org/10.48550/arXiv.2303.16214},
doi = {10.48550/arXiv.2303.16214},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2303.16214},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}