Maintained by Difan Deng and Marius Lindauer.
The following list considers papers related to neural architecture search. It is by no means complete. If you miss a paper on the list, please let us know.
Please note that although NAS methods steadily improve, the quality of empirical evaluations in this field is still lagging behind compared to other areas in machine learning, AI and optimization. We would therefore like to share some best practices for empirical evaluations of NAS methods, which we believe will facilitate sustained and measurable progress in the field. If you are interested in a teaser, please read our blog post or directly jump to our checklist.
Transformers have gained increasing popularity in different domains. For a comprehensive list of papers focusing on Neural Architecture Search for Transformer-Based spaces, the awesome-transformer-search repo is all you need.
2022
Lu, Qing; Xu, Xiaowei; Dong, Shunjie; Hao, Cong; Yang, Lei; Zhuo, Cheng; Shi, Yiyu
RT-DNAS: Real-time Constrained Differentiable Neural Architecture Search for 3D Cardiac Cine MRI Segmentation Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-04682,
title = {{RT-DNAS}: Real-time Constrained Differentiable Neural Architecture Search for {3D} Cardiac Cine {MRI} Segmentation},
author = {Qing Lu and Xiaowei Xu and Shunjie Dong and Cong Hao and Lei Yang and Cheng Zhuo and Yiyu Shi},
url = {https://doi.org/10.48550/arXiv.2206.04682},
doi = {10.48550/arXiv.2206.04682},
institution = {arXiv},
eprint = {2206.04682},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.04682},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Liu, Jiawei; Zhang, Kaiyu; Hu, Weitai; Yang, Qing
Improve Ranking Correlation of Super-net through Training Scheme from One-shot NAS to Few-shot NAS Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-05896,
title = {Improve Ranking Correlation of Super-net through Training Scheme from One-shot {NAS} to Few-shot {NAS}},
author = {Jiawei Liu and Kaiyu Zhang and Weitai Hu and Qing Yang},
url = {https://doi.org/10.48550/arXiv.2206.05896},
doi = {10.48550/arXiv.2206.05896},
institution = {arXiv},
eprint = {2206.05896},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.05896},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Eslahi, Samira Vafay; Tao, Jian; Ji, Jim
ERNAS: An Evolutionary Neural Architecture Search for Magnetic Resonance Image Reconstructions Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-07280,
title = {{ERNAS}: An Evolutionary Neural Architecture Search for Magnetic Resonance Image Reconstructions},
author = {Samira Vafay Eslahi and Jian Tao and Jim Ji},
url = {https://doi.org/10.48550/arXiv.2206.07280},
doi = {10.48550/arXiv.2206.07280},
institution = {arXiv},
eprint = {2206.07280},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.07280},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Yang, Yingguang; Yang, Renyu; Li, Yangyang; Cui, Kai; Yang, Zhiqin; Wang, Yue; Xu, Jie; Xie, Haiyong
RoSGAS: Adaptive Social Bot Detection with Reinforced Self-Supervised GNN Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-06757,
title = {{RoSGAS}: Adaptive Social Bot Detection with Reinforced Self-Supervised {GNN} Architecture Search},
author = {Yingguang Yang and Renyu Yang and Yangyang Li and Kai Cui and Zhiqin Yang and Yue Wang and Jie Xu and Haiyong Xie},
url = {https://doi.org/10.48550/arXiv.2206.06757},
doi = {10.48550/arXiv.2206.06757},
institution = {arXiv},
eprint = {2206.06757},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.06757},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Ren, Xuhong; Chen, Jianlang; Juefei-Xu, Felix; Xue, Wanli; Guo, Qing; Ma, Lei; Zhao, Jianjun; Chen, Shengyong
DARTSRepair: Core-failure-set guided DARTS for network robustness to common corruptions Journal Article
In: Pattern Recognition, vol. 131, pp. 108864, 2022, ISSN: 0031-3203.
@article{REN2022108864,
title = {{DARTSRepair}: Core-failure-set guided {DARTS} for network robustness to common corruptions},
author = {Xuhong Ren and Jianlang Chen and Felix Juefei-Xu and Wanli Xue and Qing Guo and Lei Ma and Jianjun Zhao and Shengyong Chen},
url = {https://www.sciencedirect.com/science/article/pii/S0031320322003454},
doi = {10.1016/j.patcog.2022.108864},
issn = {0031-3203},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Pattern Recognition},
volume = {131},
pages = {108864},
abstract = {Network architecture search (NAS), in particular the differentiable architecture search (DARTS) method, has shown a great power to learn excellent model architectures on the specific dataset of interest. In contrast to using a fixed dataset, in this work, we focus on a different but important scenario for NAS: how to refine a deployed network’s model architecture to enhance its robustness with the guidance of a few collected and misclassified examples that are degraded by some real-world unknown corruptions having a specific pattern (e.g., noise, blur, etc.). To this end, we first conduct an empirical study to validate that the model architectures can be definitely related to the corruption patterns. Surprisingly, by just adding a few corrupted and misclassified examples (e.g., $10^3$ examples) to the clean training dataset (e.g., $5.0\times 10^4$ examples), we can refine the model architecture and enhance the robustness significantly. To make it more practical, the key problem, i.e., how to select the proper failure examples for the effective NAS guidance, should be carefully investigated. Then, we propose a novel core-failure-set guided DARTS that embeds a K-center-greedy algorithm for DARTS to select suitable corrupted failure examples to refine the model architecture. We use our method for DARTS-refined DNNs on the clean as well as 15 corruptions with the guidance of four specific real-world corruptions. Compared with the state-of-the-art NAS as well as data-augmentation-based enhancement methods, our final method can achieve higher accuracy on both corrupted datasets and the original clean dataset. On some of the corruption patterns, we can achieve as high as over 45% absolute accuracy improvements.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Dervisi, Foteini; Kyriakides, George; Margaritis, Konstantinos
Evaluating Acceleration Techniques for Genetic Neural Architecture Search Proceedings Article
In: Iliadis, Lazaros; Jayne, Chrisina; Tefas, Anastasios; Pimenidis, Elias (Ed.): Engineering Applications of Neural Networks, pp. 3–14, Springer International Publishing, Cham, 2022, ISBN: 978-3-031-08223-8.
@inproceedings{10.1007/978-3-031-08223-8_1,
title = {Evaluating Acceleration Techniques for Genetic Neural Architecture Search},
author = {Foteini Dervisi and George Kyriakides and Konstantinos Margaritis},
editor = {Lazaros Iliadis and Chrisina Jayne and Anastasios Tefas and Elias Pimenidis},
url = {https://doi.org/10.1007/978-3-031-08223-8_1},
doi = {10.1007/978-3-031-08223-8_1},
isbn = {978-3-031-08223-8},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Engineering Applications of Neural Networks},
pages = {3--14},
publisher = {Springer International Publishing},
address = {Cham},
abstract = {The increase in the available data and computational power has led to the rapid evolution of the field of deep learning over the last few years. However, the success of deep learning methods relies on making appropriate neural architecture choices, which is not a straightforward task and usually requires a time-consuming trial-and-error procedure. Neural architecture search is the process of automating the design of neural network architectures capable of performing well on specific tasks. It is a field that has emerged in order to address the problem of designing efficient neural architectures and is gaining popularity due to the rapid evolution of deep learning, which has led to an increasing need for the discovery of high-performing neural architectures. This paper focuses on evolutionary neural architecture search, which is an efficient but also time-consuming and computationally expensive neural architecture search approach, and aims to pave the way for speeding up such algorithms by assessing the effect of acceleration methods on the overall performance of the neural architecture search procedure as well as on the produced architectures.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Zhang, Wentao; Lin, Zheyu; Shen, Yu; Li, Yang; Yang, Zhi; Cui, Bin
DFG-NAS: Deep and Flexible Graph Neural Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-08582,
title = {{DFG-NAS}: Deep and Flexible Graph Neural Architecture Search},
author = {Wentao Zhang and Zheyu Lin and Yu Shen and Yang Li and Zhi Yang and Bin Cui},
url = {https://doi.org/10.48550/arXiv.2206.08582},
doi = {10.48550/arXiv.2206.08582},
institution = {arXiv},
eprint = {2206.08582},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.08582},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xue, Yu; Qin, Jiafeng
Partial Connection Based on Channel Attention for Differentiable Neural Architecture Search Journal Article
In: IEEE Transactions on Industrial Informatics, pp. 1-10, 2022.
@article{9802692,
title = {Partial Connection Based on Channel Attention for Differentiable Neural Architecture Search},
author = {Yu Xue and Jiafeng Qin},
url = {https://ieeexplore.ieee.org/abstract/document/9802692},
doi = {10.1109/TII.2022.3184700},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Industrial Informatics},
pages = {1--10},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Qin, Yijian; Zhang, Ziwei; Wang, Xin; Zhang, Zeyang; Zhu, Wenwu
NAS-Bench-Graph: Benchmarking Graph Neural Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-09166,
title = {{NAS-Bench-Graph}: Benchmarking Graph Neural Architecture Search},
author = {Yijian Qin and Ziwei Zhang and Xin Wang and Zeyang Zhang and Wenwu Zhu},
url = {https://doi.org/10.48550/arXiv.2206.09166},
doi = {10.48550/arXiv.2206.09166},
institution = {arXiv},
eprint = {2206.09166},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.09166},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Jung, Harim; Oh, Myeong-Seok; Yang, Cheoljong; Lee, Seong-Whan
Neural Architecture Adaptation for Object Detection by Searching Channel Dimensions and Mapping Pre-trained Parameters Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-08509,
title = {Neural Architecture Adaptation for Object Detection by Searching Channel Dimensions and Mapping Pre-trained Parameters},
author = {Harim Jung and Myeong-Seok Oh and Cheoljong Yang and Seong-Whan Lee},
url = {https://doi.org/10.48550/arXiv.2206.08509},
doi = {10.48550/arXiv.2206.08509},
institution = {arXiv},
eprint = {2206.08509},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.08509},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Risso, Matteo; Burrello, Alessio; Benini, Luca; Macii, Enrico; Poncino, Massimo; Pagliari, Daniele Jahier
Channel-wise Mixed-precision Assignment for DNN Inference on Constrained Edge Nodes Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-08852,
title = {Channel-wise Mixed-precision Assignment for {DNN} Inference on Constrained Edge Nodes},
author = {Matteo Risso and Alessio Burrello and Luca Benini and Enrico Macii and Massimo Poncino and Daniele Jahier Pagliari},
url = {https://doi.org/10.48550/arXiv.2206.08852},
doi = {10.48550/arXiv.2206.08852},
institution = {arXiv},
eprint = {2206.08852},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.08852},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Hasan, Noha W.; Saudi, Ali S.; Khalil, Mahmoud I.; Abbas, Hazem M.
A Genetic Algorithm Approach to Automate Architecture Design for Acoustic Scene Classification Journal Article
In: IEEE Transactions on Evolutionary Computation, pp. 1-1, 2022.
@article{9803192,
title = {A Genetic Algorithm Approach to Automate Architecture Design for Acoustic Scene Classification},
author = {Noha W. Hasan and Ali S. Saudi and Mahmoud I. Khalil and Hazem M. Abbas},
doi = {10.1109/TEVC.2022.3185543},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Evolutionary Computation},
pages = {1--1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Lou, Xiaoxuan; Guo, Shangwei; Li, Jiwei; Zhang, Tianwei
Ownership Verification of DNN Architectures via Hardware Cache Side Channels Journal Article
In: IEEE Transactions on Circuits and Systems for Video Technology, pp. 1-1, 2022.
@article{9801864,
title = {Ownership Verification of {DNN} Architectures via Hardware Cache Side Channels},
author = {Xiaoxuan Lou and Shangwei Guo and Jiwei Li and Tianwei Zhang},
url = {https://ieeexplore.ieee.org/abstract/document/9801864},
doi = {10.1109/TCSVT.2022.3184644},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Circuits and Systems for Video Technology},
pages = {1--1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Perego, Riccardo; Candelieri, Antonio; Archetti, Francesco; Pau, Danilo
AutoTinyML for microcontrollers: Dealing with black-box deployability Journal Article
In: Expert Systems with Applications, vol. 207, pp. 117876, 2022, ISSN: 0957-4174.
@article{PEREGO2022117876,
title = {{AutoTinyML} for microcontrollers: Dealing with black-box deployability},
author = {Riccardo Perego and Antonio Candelieri and Francesco Archetti and Danilo Pau},
url = {https://www.sciencedirect.com/science/article/pii/S0957417422011289},
doi = {10.1016/j.eswa.2022.117876},
issn = {0957-4174},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Expert Systems with Applications},
volume = {207},
pages = {117876},
abstract = {While many companies are currently leveraging on Cloud, data centres and specialized hardware (e.g., GPUs and TPUs) to train very accurate Machine Learning models, the need to deploy and run these models on tiny devices is emerging as the most relevant challenge, with a massive untapped market. Although Automated Machine Learning and Neural Architecture Search frameworks are successfully used to find accurate models by trying a small number of alternatives, they are typically performed on large computational platforms and they cannot directly deal with deployability, leading to an accurate model which could result undeployable on a tiny device. To bridge the gap between these two worlds, we present an approach extending these frameworks to include the constraints related to the limited hardware resources of the tiny device which the trained model has to run on. Experimental results on two benchmark classification tasks and two microcontrollers prove that our AutoTinyML framework can efficiently identify models which are both accurate and deployable, in case accepting a reasonable reduction in accuracy compared to a significant reduction in hardware usages, without applying any quantization techniques of the model.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Gridin, Ivan
One-Shot Neural Architecture Search Book Chapter
In: Automated Deep Learning Using Neural Network Intelligence: Develop and Design PyTorch and TensorFlow Models Using Python, pp. 257–318, Apress, Berkeley, CA, 2022, ISBN: 978-1-4842-8149-9.
@inbook{Gridin2022,
title = {One-Shot Neural Architecture Search},
author = {Ivan Gridin},
url = {https://doi.org/10.1007/978-1-4842-8149-9_5},
doi = {10.1007/978-1-4842-8149-9_5},
isbn = {978-1-4842-8149-9},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Automated Deep Learning Using Neural Network Intelligence: Develop and Design PyTorch and TensorFlow Models Using Python},
pages = {257--318},
publisher = {Apress},
address = {Berkeley, CA},
abstract = {In the previous chapter, we explored Multi-trial Neural Architecture Search, which is a very promising approach. And the reader might wonder why Multi-trial NAS is called that way. Are there any other non-Multi-trial NAS approaches, and is it really possible to search for the optimal neural network architecture in some other way without trying it? It looks pretty natural that the only way to find the optimal solution is to try different elements in the search space. In fact, it turns out that this is not entirely true. There is an approach that allows you to find the best architecture by training some Supernet. And this approach is called One-shot Neural Architecture Search. As the name ``one-shot'' implies, this approach involves only one try or shot. Of course, this ``shot'' is much longer than single neural network training, but nevertheless, it saves a lot of time. In this chapter, we will study what One-shot NAS is and how to design architectures for this approach. We will examine two popular One-shot algorithms: Efficient Neural Architecture Search via Parameter Sharing (ENAS) and Differentiable Architecture Search (DARTS). Of course, we will apply these algorithms to solve practical problems.},
keywords = {},
pubstate = {published},
tppubtype = {inbook}
}
Gridin, Ivan
Multi-trial Neural Architecture Search Book Chapter
In: Automated Deep Learning Using Neural Network Intelligence: Develop and Design PyTorch and TensorFlow Models Using Python, pp. 185–256, Apress, Berkeley, CA, 2022, ISBN: 978-1-4842-8149-9.
@inbook{Gridin2022b,
title = {Multi-trial Neural Architecture Search},
author = {Ivan Gridin},
url = {https://doi.org/10.1007/978-1-4842-8149-9_4},
doi = {10.1007/978-1-4842-8149-9_4},
isbn = {978-1-4842-8149-9},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Automated Deep Learning Using Neural Network Intelligence: Develop and Design PyTorch and TensorFlow Models Using Python},
pages = {185--256},
publisher = {Apress},
address = {Berkeley, CA},
abstract = {And now we come to the most exciting part of this book. As we noted at the end of the last chapter, HPO methods are pretty limited for automating the search for the optimal deep learning models, but Neural Architecture Search (NAS) dispels these limits. This chapter focuses on NAS, one of the most promising areas of automated deep learning. Automatic Neural Architecture Search is increasingly important in finding appropriate deep learning models. Recent researches have proven the NAS effectiveness and found some models that could beat manually tuned ones. NAS is a fairly young discipline in machine learning. It took shape as a separate discipline in 2018. Since then, it has made a significant breakthrough in automating neural network architecture construction that solves a specific problem. The most manual design of neural networks can be replaced by automated architecture search soon, so this area is very up and coming for all data scientists. NAS produced many top computer vision architectures. Architectures like NASNet, EfficientNet, and MobileNet are the result of automated Neural Architecture Search.},
keywords = {},
pubstate = {published},
tppubtype = {inbook}
}
Dudziak, Lukasz; Laskaridis, Stefanos; Fernández-Marqués, Javier
FedorAS: Federated Architecture Search under system heterogeneity Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-11239,
title = {{FedorAS}: Federated Architecture Search under system heterogeneity},
author = {Lukasz Dudziak and Stefanos Laskaridis and Javier Fernández-Marqués},
url = {https://doi.org/10.48550/arXiv.2206.11239},
doi = {10.48550/arXiv.2206.11239},
institution = {arXiv},
eprint = {2206.11239},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.11239},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Duan, Fenxia; Cao, Chunhong; Gao, Xieping
SA-NAS-BFNR: Spatiotemporal Attention Neural Architecture Search for Task-Based Brain Functional Network Representation Proceedings Article
In: Proceedings of the 2022 International Conference on Multimedia Retrieval, pp. 661–667, Association for Computing Machinery, Newark, NJ, USA, 2022, ISBN: 9781450392389.
@inproceedings{10.1145/3512527.3531421,
title = {{SA-NAS-BFNR}: Spatiotemporal Attention Neural Architecture Search for Task-Based Brain Functional Network Representation},
author = {Fenxia Duan and Chunhong Cao and Xieping Gao},
url = {https://doi.org/10.1145/3512527.3531421},
doi = {10.1145/3512527.3531421},
isbn = {9781450392389},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 2022 International Conference on Multimedia Retrieval},
pages = {661--667},
publisher = {Association for Computing Machinery},
address = {Newark, NJ, USA},
series = {ICMR '22},
abstract = {The spatiotemporal representation of task-based brain functional networks is a key topic in functional magnetic resonance image (fMRI) research. At present, deep learning has been more powerful and flexible in brain functional network research than traditional methods. However, the dominant deep learning models failed in capturing the long-distance dependency (LDD) in task-based fMRI images (tfMRI) due to the time correlation among different task stimuli, the nature between temporal and spatial dimensions, which resulting in inaccurate brain pattern extraction. To address this issue, this paper proposes a spatiotemporal attention neural architecture search (NAS) model for task-based brain functional networks representation (SA-NAS-BFNR), where attention mechanism and gate recurrent unit (GRU) are integrated into a novel framework and GRU structure is searched by the differentiable neural architecture search. This model can not only achieve meaningful brain functional networks (BFNs) by addressing the LDD, but also simplify the existing recurrent structure models in tfMRI. Experiments show that the proposed model is capable of improving the fitting ability between time series and task stimulus sequence, and extracting the BFNs effectively as well.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Chitty-Venkata, Krishna Teja; Emani, Murali; Vishwanath, Venkatram; Somani, Arun K.
Efficient Design Space Exploration for Sparse Mixed Precision Neural Architectures Proceedings Article
In: Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing, pp. 265–276, Association for Computing Machinery, Minneapolis, MN, USA, 2022, ISBN: 9781450391993.
@inproceedings{10.1145/3502181.3531463,
title = {Efficient Design Space Exploration for Sparse Mixed Precision Neural Architectures},
author = {Krishna Teja Chitty-Venkata and Murali Emani and Venkatram Vishwanath and Arun K. Somani},
url = {https://doi.org/10.1145/3502181.3531463},
doi = {10.1145/3502181.3531463},
isbn = {9781450391993},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 31st International Symposium on High-Performance Parallel and Distributed Computing},
pages = {265--276},
publisher = {Association for Computing Machinery},
address = {Minneapolis, MN, USA},
series = {HPDC '22},
abstract = {Pruning and Quantization are two effective Deep Neural Network (DNN) compression methods for efficient inference on various hardware platforms. Pruning refers to removing unimportant weights or nodes, whereas Quantization converts the floating-point parameters to low-bit fixed integer representation. The pruned and low precision models result in smaller and faster inference models on hardware platforms with almost the same accuracy as the unoptimized network. Tensor Cores in Nvidia Ampere 100 (A100) GPU supports (1) 2:4 fine-grained sparse pruning where 2 out of every 4 elements are pruned, and (2) traditional dense multiplication to achieve a good accuracy and performance trade-off. The A100 Tensor Core also takes advantage of 1-bit, 4-bit, and 8-bit multiplication to speed up the inference of a model. Hence, finding the right matrix type (dense or 2:4 sparse) along with the precision for each layer becomes a combinatorial problem. Neural Architecture Search (NAS) can alleviate such problems by automating the architecture design process instead of a brute-force search. In this paper, we propose (i) Mixed Sparse and Precision Search (MSPS), a NAS framework to search for efficient sparse and mixed-precision quantized model within the predefined search space and fixed backbone neural network (Eg. ResNet50), and (ii) Architecture, Sparse and Precision Search (ASPS) to jointly search for kernel size and number of filters, and sparse-precision combination of each layer. We illustrate the effectiveness of our methods targeting A100 Tensor Core on Nvidia GPUs by searching efficient sparse-mixed precision networks on ResNet50 and achieving better accuracy-latency trade-off models compared to the manually designed Uniform Sparse Int8 networks.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Gesmundo, Andrea; Dean, Jeff
muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2205-10937,
title = {{muNet}: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems},
author = {Andrea Gesmundo and Jeff Dean},
url = {https://doi.org/10.48550/arXiv.2205.10937},
doi = {10.48550/arXiv.2205.10937},
institution = {arXiv},
eprint = {2205.10937},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2205.10937},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Han, Zhu; Hong, Danfeng; Gao, Lianru; Zhang, Bing; Huang, Min; Chanussot, Jocelyn
AutoNAS: Automatic Neural Architecture Search for Hyperspectral Unmixing Journal Article
In: IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-14, 2022.
@article{9807268,
title = {{AutoNAS}: Automatic Neural Architecture Search for Hyperspectral Unmixing},
author = {Zhu Han and Danfeng Hong and Lianru Gao and Bing Zhang and Min Huang and Jocelyn Chanussot},
url = {https://ieeexplore.ieee.org/abstract/document/9807268},
doi = {10.1109/TGRS.2022.3186480},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Geoscience and Remote Sensing},
volume = {60},
pages = {1--14},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Seng, Jonas; Prasad, Pooja; Dhami, Devendra Singh; Kersting, Kristian
HANF: Hyperparameter And Neural Architecture Search in Federated Learning Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-12342,
title = {{HANF}: Hyperparameter And Neural Architecture Search in Federated Learning},
author = {Jonas Seng and Pooja Prasad and Devendra Singh Dhami and Kristian Kersting},
url = {https://doi.org/10.48550/arXiv.2206.12342},
doi = {10.48550/arXiv.2206.12342},
institution = {arXiv},
eprint = {2206.12342},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.12342},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Yu, Yanjiang; Zhang, Puyang; Zhang, Kaihao; Luo, Wenhan; Li, Changsheng; Yuan, Ye; Wang, Guoren
Multi-Prior Learning via Neural Architecture Search for Blind Face Restoration Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-13962,
title = {Multi-Prior Learning via Neural Architecture Search for Blind Face Restoration},
author = {Yanjiang Yu and Puyang Zhang and Kaihao Zhang and Wenhan Luo and Changsheng Li and Ye Yuan and Guoren Wang},
url = {https://doi.org/10.48550/arXiv.2206.13962},
doi = {10.48550/arXiv.2206.13962},
institution = {arXiv},
eprint = {2206.13962},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.13962},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Dong, Peijie; Niu, Xin; Li, Lujun; Xie, Linzhen; Zou, Wenbin; Ye, Tian; Wei, Zimian; Pan, Hengyue
Prior-Guided One-shot Neural Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2206-13329,
title = {Prior-Guided One-shot Neural Architecture Search},
author = {Peijie Dong and Xin Niu and Lujun Li and Linzhen Xie and Wenbin Zou and Tian Ye and Zimian Wei and Hengyue Pan},
url = {https://doi.org/10.48550/arXiv.2206.13329},
doi = {10.48550/arXiv.2206.13329},
institution = {arXiv},
eprint = {2206.13329},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.13329},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Benmeziane, Hadjer; Niar, Smail; Ouarnoughi, Hamza; Maghraoui, Kaoutar El
Pareto Rank Surrogate Model for Hardware-aware Neural Architecture Search Proceedings Article
In: 2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp. 267-276, 2022.
@inproceedings{9804643,
title = {Pareto Rank Surrogate Model for Hardware-aware Neural Architecture Search},
author = {Hadjer Benmeziane and Smail Niar and Hamza Ouarnoughi and Kaoutar El Maghraoui},
doi = {10.1109/ISPASS55109.2022.00040},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)},
pages = {267--276},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Wang, Tianzi; Deng, Jiajun; Geng, Mengzhe; Ye, Zi; Hu, Shoukang; Wang, Yi; Cui, Mingyu; Jin, Zengrui; Liu, Xunying; Meng, Helen
Conformer Based Elderly Speech Recognition System for Alzheimer's Disease Detection Journal Article
In: CoRR, vol. abs/2206.13232, 2022.
@article{DBLP:journals/corr/abs-2206-13232,
title = {Conformer Based Elderly Speech Recognition System for {Alzheimer's} Disease Detection},
author = {Tianzi Wang and Jiajun Deng and Mengzhe Geng and Zi Ye and Shoukang Hu and Yi Wang and Mingyu Cui and Zengrui Jin and Xunying Liu and Helen Meng},
url = {https://doi.org/10.48550/arXiv.2206.13232},
doi = {10.48550/arXiv.2206.13232},
eprint = {2206.13232},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2206.13232},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Shen, Junge; Cao, Bin; Zhang, Chi; Wang, Ruxin; Wang, Qi
Remote Sensing Scene Classification Based on Attention-Enabled Progressively Searching Journal Article
In: IEEE Transactions on Geoscience and Remote Sensing, pp. 1-1, 2022.
@article{9807377,
title = {Remote Sensing Scene Classification Based on Attention-Enabled Progressively Searching},
author = {Junge Shen and Bin Cao and Chi Zhang and Ruxin Wang and Qi Wang},
url = {https://ieeexplore.ieee.org/abstract/document/9807377},
doi = {10.1109/TGRS.2022.3186588},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Geoscience and Remote Sensing},
pages = {1--1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Dahouda, Mwamba Kasongo; Joe, Inwhee
Neural Architecture Search Net-based Feature Extraction with Modular Neural Network for Image Classification of Copper/Cobalt Raw Minerals Journal Article
In: IEEE Access, pp. 1-1, 2022.
@article{9810927,
title = {Neural Architecture Search Net-based Feature Extraction with Modular Neural Network for Image Classification of Copper/Cobalt Raw Minerals},
author = {Mwamba Kasongo Dahouda and Inwhee Joe},
doi = {10.1109/ACCESS.2022.3187420},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Access},
pages = {1--1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Lin, Zhiwei; Liang, Tingting; Xiao, Taihong; Wang, Yongtao; Tang, Zhi; Yang, Ming-Hsuan
FlowNAS: Neural Architecture Search for Optical Flow Estimation Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2207-01271,
title = {{FlowNAS}: Neural Architecture Search for Optical Flow Estimation},
author = {Zhiwei Lin and Tingting Liang and Taihong Xiao and Yongtao Wang and Zhi Tang and Ming-Hsuan Yang},
url = {https://doi.org/10.48550/arXiv.2207.01271},
doi = {10.48550/arXiv.2207.01271},
institution = {arXiv},
eprint = {2207.01271},
eprinttype = {arXiv},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2207.01271},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Xie, Xiangning; Liu, Yuqiao; Sun, Yanan; Zhang, Mengjie; Tan, Kay Chen
Architecture Augmentation for Performance Predictor Based on Graph Isomorphism Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2207-00987,
title = {Architecture Augmentation for Performance Predictor Based on Graph Isomorphism},
author = {Xiangning Xie and Yuqiao Liu and Yanan Sun and Mengjie Zhang and Kay Chen Tan},
url = {https://doi.org/10.48550/arXiv.2207.00987},
doi = {10.48550/arXiv.2207.00987},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2207.00987},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Chen, Jingfan; Zhu, Guanghui; Hou, Haojun; Yuan, Chunfeng; Huang, Yihua
AutoGSR: Neural Architecture Search for Graph-Based Session Recommendation Proceedings Article
In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 1694–1704, Association for Computing Machinery, Madrid, Spain, 2022, ISBN: 9781450387323.
@inproceedings{10.1145/3477495.3531940,
title = {AutoGSR: Neural Architecture Search for Graph-Based Session Recommendation},
author = {Jingfan Chen and Guanghui Zhu and Haojun Hou and Chunfeng Yuan and Yihua Huang},
url = {https://doi.org/10.1145/3477495.3531940},
doi = {10.1145/3477495.3531940},
isbn = {9781450387323},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {1694--1704},
publisher = {Association for Computing Machinery},
address = {Madrid, Spain},
series = {SIGIR '22},
abstract = {Session-based recommendation aims to predict next click action (e.g., item) of anonymous users based on a fixed number of previous actions. Recently, Graph Neural Networks (GNNs) have shown superior performance in various applications. Inspired by the success of GNNs, tremendous endeavors have been devoted to introduce GNNs into session-based recommendation and have achieved significant results. Nevertheless, due to the highly diverse types of potential information in sessions, existing GNNs-based methods perform differently on different session datasets, leading to the need for efficient design of neural networks adapted to various session recommendation scenarios. To address this problem, we propose Automated neural architecture search for Graph-based Session Recommendation, namely AutoGSR, a framework that provides a practical and general solution to automatically find the optimal GNNs-based session recommendation model. In AutoGSR, we propose two novel GNN operations to build an expressive and compact search space. Building upon the search space, we employ a differentiable search algorithm to search for the optimal graph neural architecture. Furthermore, to consider all types of session information together, we propose to learn the item meta knowledge, which acts as a priori knowledge for guiding the optimization of final session representations. Comprehensive experiments on three real-world datasets demonstrate that AutoGSR is able to find effective neural architectures and achieve state-of-the-art results. To the best of our knowledge, we are the first to study the neural architecture search for the session-based recommendation.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Singh, Anuraj; Nair, Haritha
A Neural Architecture Search for Automated Multimodal Learning Journal Article
In: Expert Systems with Applications, vol. 207, pp. 118051, 2022, ISSN: 0957-4174.
@article{SINGH2022118051,
title = {A Neural Architecture Search for Automated Multimodal Learning},
author = {Anuraj Singh and Haritha Nair},
url = {https://www.sciencedirect.com/science/article/pii/S0957417422012581},
doi = {10.1016/j.eswa.2022.118051},
issn = {0957-4174},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Expert Systems with Applications},
volume = {207},
pages = {118051},
abstract = {The boom of artificial intelligence in the past decade is owed to the research and development of deep learning and moreover, that of accessible deep learning. But the goal of Artificial General Intelligence (AGI) cannot be achieved by having application-specific, parameter sensitive neural networks that need to be defined and tuned for every use case. General intelligence also involves understanding different types of data, rather than having dedicated models for each functionality. Thus both automating machine learning while also giving importance to generalizing over multiple modalities has great potential to help move AGI research forward. We propose a generalizable algorithm-Multimodal Neural Architecture Search (MNAS) which can work on multiple modalities and perform architecture search in order to create neural networks that enable classification on multiple types of data for multiclass outputs. The work automates the development of a fusion architecture by building upon existing literature of multimodal learning and neural architecture search. The controller network which predicts the architecture has been designed such that it works on a reward model where the reward is dependent on accuracies of individual networks corresponding to each modality involved. The work shows good results with accuracy comparable to both unimodal classification on same data and manually created multimodal architectures wherein the experiments are performed on multiclass classification problem of image and text modalities. It also uses a shared parameter search graph ensuring that the computational complexity is less compared to several other neural architecture search algorithms.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Theodorakos, Konstantinos; Agudelo, Oscar Mauricio; Schreurs, Joachim; Suykens, Johan A. K.; Moor, Bart De
Island Transpeciation: A Co-Evolutionary Neural Architecture Search, applied to country-scale air-quality forecasting Journal Article
In: IEEE Transactions on Evolutionary Computation, pp. 1-1, 2022.
@article{9820773,
title = {Island Transpeciation: A Co-Evolutionary Neural Architecture Search, applied to country-scale air-quality forecasting},
author = {Konstantinos Theodorakos and Oscar Mauricio Agudelo and Joachim Schreurs and Johan A. K. Suykens and Bart {De Moor}},
url = {https://ieeexplore.ieee.org/abstract/document/9820773},
doi = {10.1109/TEVC.2022.3189500},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Evolutionary Computation},
pages = {1--1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhu, Guanghui; Cheng, Feng; Lian, Defu; Yuan, Chunfeng; Huang, Yihua
NAS-CTR: Efficient Neural Architecture Search for Click-Through Rate Prediction Proceedings Article
In: Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp. 332–342, Association for Computing Machinery, Madrid, Spain, 2022, ISBN: 9781450387323.
@inproceedings{10.1145/3477495.3532030,
title = {NAS-CTR: Efficient Neural Architecture Search for Click-Through Rate Prediction},
author = {Guanghui Zhu and Feng Cheng and Defu Lian and Chunfeng Yuan and Yihua Huang},
url = {https://doi.org/10.1145/3477495.3532030},
doi = {10.1145/3477495.3532030},
isbn = {9781450387323},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 45th International ACM SIGIR Conference on Research and Development in Information Retrieval},
pages = {332--342},
publisher = {Association for Computing Machinery},
address = {Madrid, Spain},
series = {SIGIR '22},
abstract = {Click-Through Rate (CTR) prediction has been widely used in many machine learning tasks such as online advertising and personalization recommendation. Unfortunately, given a domain-specific dataset, searching effective feature interaction operations and combinations from a huge candidate space requires significant expert experience and computational costs. Recently, Neural Architecture Search (NAS) has achieved great success in discovering high-quality network architectures automatically. However, due to the diversity of feature interaction operations and combinations, the existing NAS-based work that treats the architecture search as a black-box optimization problem over a discrete search space suffers from low efficiency. Therefore, it is essential to explore a more efficient architecture search method. To achieve this goal, we propose NAS-CTR, a differentiable neural architecture search approach for CTR prediction. First, we design a novel and expressive architecture search space and a continuous relaxation scheme to make the search space differentiable. Second, we formulate the architecture search for CTR prediction as a joint optimization problem with discrete constraints on architectures and leverage proximal iteration to solve the constrained optimization problem. Additionally, a straightforward yet effective method is proposed to eliminate the aggregation of skip connections. Extensive experimental results reveal that NAS-CTR can outperform the SOTA human-crafted architectures and other NAS-based methods in both test accuracy and search efficiency.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Luo, Xiangzhong; Liu, Di; Kong, Hao; Huai, Shuo; Chen, Hui; Liu, Weichen
SurgeNAS: A Comprehensive Surgery on Hardware-Aware Differentiable Neural Architecture Search Journal Article
In: IEEE Transactions on Computers, pp. 1-14, 2022.
@article{9817049,
title = {SurgeNAS: A Comprehensive Surgery on Hardware-Aware Differentiable Neural Architecture Search},
author = {Xiangzhong Luo and Di Liu and Hao Kong and Shuo Huai and Hui Chen and Weichen Liu},
url = {https://ieeexplore.ieee.org/abstract/document/9817049},
doi = {10.1109/TC.2022.3188175},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Computers},
pages = {1--14},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Cai, Lei; Fu, Yuli; Huo, Wanliang; Xiang, Youjun; Zhu, Tao; Zhang, Ying; Zeng, Huanqiang
Multi-scale Attentive Image De-raining Networks via Neural Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2207-00728,
title = {Multi-scale Attentive Image De-raining Networks via Neural Architecture Search},
author = {Lei Cai and Yuli Fu and Wanliang Huo and Youjun Xiang and Tao Zhu and Ying Zhang and Huanqiang Zeng},
url = {https://doi.org/10.48550/arXiv.2207.00728},
doi = {10.48550/arXiv.2207.00728},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2207.00728},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Sun, Xuewei; Li, Guohou; Qu, Peixin; Xie, Xiwang; Pan, Xipeng; Zhang, Weidong
Research on plant disease identification based on CNN Journal Article
In: Cognitive Robotics, 2022, ISSN: 2667-2413.
@article{SUN2022,
title = {Research on plant disease identification based on CNN},
author = {Xuewei Sun and Guohou Li and Peixin Qu and Xiwang Xie and Xipeng Pan and Weidong Zhang},
url = {https://www.sciencedirect.com/science/article/pii/S2667241322000143},
doi = {10.1016/j.cogr.2022.07.001},
issn = {2667-2413},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Cognitive Robotics},
abstract = {Traditional digital image processing methods extract disease features manually, which have low efficiency and low recognition accuracy. To solve this problem, In this paper, we propose a convolutional neural network architecture FL-EfficientNet (Focal loss EfficientNet), which is used for multi-category identification of plant disease images. Firstly, through the Neural Architecture Search technology, the network width, network depth, and image resolution are adaptively adjusted according to a group of composite coefficients, to improve the balance of network dimension and model stability; Secondly, the valuable features in the disease image are extracted by introducing the moving flip bottleneck convolution and attention mechanism; Finally, the Focal loss function is used to replace the traditional Cross-Entropy loss function, to improve the ability of the network model to focus on the samples that are not easy to identify. The experiment uses the public data set new plant diseases dataset (NPDD) and compares it with ResNet50, DenseNet169, and EfficientNet. The experimental results show that the accuracy of FL-EfficientNet in identifying 10 diseases of 5 kinds of crops is 99.72%, which is better than the above comparison network. At the same time, FL-EfficientNet has the fastest convergence speed, and the training time of 15 epochs is 4.7 h.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhang, Wentao; Lin, Zheyu; Shen, Yu; Li, Yang; Yang, Zhi; Cui, Bin
DFG-NAS: Deep and Flexible Graph Neural Architecture Search Proceedings Article
In: Proceedings of the 39th International Conference on Machine Learning, 2022.
@inproceedings{DBLP:journals/corr/abs-2206-08582b,
title = {DFG-NAS: Deep and Flexible Graph Neural Architecture Search},
author = {Wentao Zhang and Zheyu Lin and Yu Shen and Yang Li and Zhi Yang and Bin Cui},
url = {https://proceedings.mlr.press/v162/zhang22s/zhang22s.pdf},
doi = {10.48550/arXiv.2206.08582},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the 39th International Conference on Machine Learning},
journal = {CoRR},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Sun, Zihao; Hu, Yu; Lu, Shun; Yang, Longxing; Mei, Jilin; Han, Yinhe; Li, Xiaowei
AGNAS: Attention-Guided Micro and Macro-Architecture Search Proceedings Article
In: Chaudhuri, Kamalika; Jegelka, Stefanie; Song, Le; Szepesvári, Csaba; Niu, Gang; Sabato, Sivan (Ed.): International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, pp. 20777–20789, PMLR, 2022.
@inproceedings{DBLP:conf/icml/Sun0LYMHL22,
title = {AGNAS: Attention-Guided Micro and Macro-Architecture Search},
author = {Zihao Sun and Yu Hu and Shun Lu and Longxing Yang and Jilin Mei and Yinhe Han and Xiaowei Li},
editor = {Kamalika Chaudhuri and Stefanie Jegelka and Le Song and Csaba Szepesvári and Gang Niu and Sivan Sabato},
url = {https://proceedings.mlr.press/v162/sun22a.html},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {International Conference on Machine Learning, ICML 2022, 17-23 July
2022, Baltimore, Maryland, USA},
volume = {162},
pages = {20777--20789},
publisher = {PMLR},
series = {Proceedings of Machine Learning Research},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Greenwood, Bryson; McDonnell, Tyler
Surrogate-Assisted Neuroevolution Proceedings Article
In: Proceedings of the Genetic and Evolutionary Computation Conference, pp. 1048–1056, Association for Computing Machinery, Boston, Massachusetts, 2022, ISBN: 9781450392372.
@inproceedings{10.1145/3512290.3528703,
title = {Surrogate-Assisted Neuroevolution},
author = {Bryson Greenwood and Tyler McDonnell},
url = {https://doi.org/10.1145/3512290.3528703},
doi = {10.1145/3512290.3528703},
isbn = {9781450392372},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {Proceedings of the Genetic and Evolutionary Computation Conference},
pages = {1048--1056},
publisher = {Association for Computing Machinery},
address = {Boston, Massachusetts},
series = {GECCO '22},
abstract = {Though Neuroevolution (NE) and Neural Architecture Search (NAS) have emerged as techniques for automating the design of neural networks, they are expensive and time consuming: they require training many neural networks and have largely resisted the benefits of surrogate-based optimization approaches, as it is difficult to model the performance of variable network architectures. We propose a novel and general framework for surrogate-assisted search of neural architectures consisting of two components: (1) an algorithm which leverages grammars to generate tensor representations of variable neural network topologies; and an evolutionary algorithm which employs a surrogate model to expedite architecture search using active learning. We demonstrate that our model can produce accurate performance predictions for unseen architectures, realizing a 5x reduction in the total compute required for search while improving asymptotic performance. We also illustrate that the surrogate models are transferable to new domains via a real-world transfer learning case study using industrial time series data.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Yang, Longxing; Hu, Yu; Lu, Shun; Sun, Zihao; Mei, Jilin; Han, Yinhe; Li, Xiaowei
Searching for BurgerFormer with Micro-Meso-Macro Space Design Proceedings Article
In: Chaudhuri, Kamalika; Jegelka, Stefanie; Song, Le; Szepesvári, Csaba; Niu, Gang; Sabato, Sivan (Ed.): International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, pp. 25055–25069, PMLR, 2022.
@inproceedings{DBLP:conf/icml/Yang0LSMHL22,
title = {Searching for BurgerFormer with Micro-Meso-Macro Space Design},
author = {Longxing Yang and Yu Hu and Shun Lu and Zihao Sun and Jilin Mei and Yinhe Han and Xiaowei Li},
editor = {Kamalika Chaudhuri and Stefanie Jegelka and Le Song and Csaba Szepesvári and Gang Niu and Sivan Sabato},
url = {https://proceedings.mlr.press/v162/yang22f.html},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {International Conference on Machine Learning, ICML 2022, 17-23 July
2022, Baltimore, Maryland, USA},
volume = {162},
pages = {25055--25069},
publisher = {PMLR},
series = {Proceedings of Machine Learning Research},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Wu, Yanling; Tang, Baoping; Deng, Lei; Li, Qikang
Distillation-enhanced fast neural architecture search method for edge-side fault diagnosis of wind turbine gearboxes Journal Article
In: Expert Systems with Applications, vol. 208, pp. 118049, 2022, ISSN: 0957-4174.
@article{WU2022118049,
title = {Distillation-enhanced fast neural architecture search method for edge-side fault diagnosis of wind turbine gearboxes},
author = {Yanling Wu and Baoping Tang and Lei Deng and Qikang Li},
url = {https://www.sciencedirect.com/science/article/pii/S0957417422012593},
doi = {10.1016/j.eswa.2022.118049},
issn = {0957-4174},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Expert Systems with Applications},
volume = {208},
pages = {118049},
abstract = {Deep learning methods have been widely applied for fault diagnosis of wind turbine gearboxes. However, a new model requires experts to be empirically handcrafted, which is time consuming and labor-intensive. In addition, excessive attention is paid to diagnostic accuracy, and manual models often have high complexity, making their deployment in edge devices difficult. Accordingly, a novel method based on a distillation-enhanced fast neural architecture search is proposed for edge-side fault diagnosis of wind turbine gearboxes. First, a multibranch parallel fast neural architecture search framework is designed to build diagnosis models quickly and automatically. Meanwhile, an automatic distillation technology is proposed to empower the fast neural architecture search framework so that the searched model can achieve a balance between lightweight and high diagnostic accuracy to meet the lightweight deployment requirements for edge devices. The feasibility and effectiveness of the proposed method were verified using a gearbox dataset from a drivetrain diagnostics simulator (DDS) and measured data from a wind farm.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Sapkota, Suman; Bhattarai, Binod
Noisy Heuristics NAS: A Network Morphism based Neural Architecture Search using Heuristics Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2207-04467,
title = {Noisy Heuristics NAS: A Network Morphism based Neural Architecture Search using Heuristics},
author = {Suman Sapkota and Binod Bhattarai},
url = {https://doi.org/10.48550/arXiv.2207.04467},
doi = {10.48550/arXiv.2207.04467},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2207.04467},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Garcia-Garcia, Cosijopii; Escalante, Hugo Jair; Morales-Reyes, Alicia
CGP-NAS: Real-based solutions encoding for multi-objective evolutionary neural architecture search Proceedings Article
In: GECCO 2022, 2022.
@inproceedings{garcia2022cgp,
title = {CGP-NAS: Real-based solutions encoding for multi-objective evolutionary neural architecture search},
author = {Cosijopii Garcia-Garcia and Hugo Jair Escalante and Alicia Morales-Reyes},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {GECCO 2022},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Rodrigues, Nuno M.; Malan, Katherine M.; Ochoa, Gabriela; Vanneschi, Leonardo; Silva, Sara
Fitness landscape analysis of convolutional neural network architectures for image classification Journal Article
In: Information Sciences, 2022, ISSN: 0020-0255.
@article{RODRIGUES2022,
title = {Fitness landscape analysis of convolutional neural network architectures for image classification},
author = {Nuno M. Rodrigues and Katherine M. Malan and Gabriela Ochoa and Leonardo Vanneschi and Sara Silva},
url = {https://www.sciencedirect.com/science/article/pii/S0020025522007290},
doi = {10.1016/j.ins.2022.07.040},
issn = {0020-0255},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {Information Sciences},
abstract = {The global structure of the hyperparameter spaces of neural networks is not well understood and it is therefore not clear which hyperparameter search algorithm will be most effective. In this paper we analyze the landscapes of convolutional neural network architecture search spaces to provide insight into appropriate search algorithms for these spaces. Using a classical fitness landscape analysis approach (fitness distance correlation) and a more recent tool (local optima networks) we study the global structure of these spaces. Our analysis on six image classification datasets reveals that the landscapes are multi-modal, but with relatively few local optima from which it is not hard to escape with a simple perturbation operator. This led us to explore the performance of iterated local search, which we found to more effectively search the training landscapes than three evolutionary algorithm variants. Evolutionary algorithms, however, outperformed iterated local search in terms of generalization on problems with larger discrepancies between the training and testing landscapes.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
You, Haoran; Li, Baopu; Sun, Zhanyi; Ouyang, Xu; Lin, Yingyan
SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning Proceedings Article
In: ECCV 2022, 2022.
@inproceedings{DBLP:journals/corr/abs-2207-03677,
title = {SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning},
author = {Haoran You and Baopu Li and Zhanyi Sun and Xu Ouyang and Yingyan Lin},
url = {https://doi.org/10.48550/arXiv.2207.03677},
doi = {10.48550/arXiv.2207.03677},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {ECCV 2022},
journal = {CoRR},
volume = {abs/2207.03677},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Liu, Li; Cai, Xing; Li, Ge; Li, Thomas H
DKNAS: A Practical Deep Keypoint Extraction Framework Based on Neural Architecture Search Proceedings Article
In: 2022 International Conference on Robotics and Automation (ICRA), pp. 5643-5649, 2022.
@inproceedings{9812101,
title = {DKNAS: A Practical Deep Keypoint Extraction Framework Based on Neural Architecture Search},
author = {Li Liu and Xing Cai and Ge Li and Thomas H. Li},
url = {https://ieeexplore.ieee.org/abstract/document/9812101},
doi = {10.1109/ICRA46639.2022.9812101},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {2022 International Conference on Robotics and Automation (ICRA)},
pages = {5643--5649},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
You, Haoran; Li, Baopu; Sun, Zhanyi; Ouyang, Xu; Lin, Yingyan
SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2207-03677b,
title = {SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning},
author = {Haoran You and Baopu Li and Zhanyi Sun and Xu Ouyang and Yingyan Lin},
url = {https://doi.org/10.48550/arXiv.2207.03677},
doi = {10.48550/arXiv.2207.03677},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2207.03677},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Shen, Haozhe; Li, Wenjun; Li, Yilin; Yue, Keqiang; Li, Ruixue; Li, Yuhang
Research on Compression of Teacher Guidance Network Use Global Differential Computing Neural Architecture Search Proceedings Article
In: 2022 5th International Conference on Artificial Intelligence and Big Data (ICAIBD), pp. 526-531, 2022.
@inproceedings{9820338,
title = {Research on Compression of Teacher Guidance Network Use Global Differential Computing Neural Architecture Search},
author = {Haozhe Shen and Wenjun Li and Yilin Li and Keqiang Yue and Ruixue Li and Yuhang Li},
url = {https://ieeexplore.ieee.org/abstract/document/9820338},
doi = {10.1109/ICAIBD55127.2022.9820338},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {2022 5th International Conference on Artificial Intelligence and Big Data (ICAIBD)},
pages = {526--531},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Cavagnero, Niccolò; Robbiano, Luca; Caputo, Barbara; Averta, Giuseppe
FreeREA: Training-Free Evolution-based Architecture Search Technical Report
2022.
@techreport{DBLP:journals/corr/abs-2207-05135,
title = {FreeREA: Training-Free Evolution-based Architecture Search},
author = {Niccolò Cavagnero and Luca Robbiano and Barbara Caputo and Giuseppe Averta},
url = {https://doi.org/10.48550/arXiv.2207.05135},
doi = {10.48550/arXiv.2207.05135},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {CoRR},
volume = {abs/2207.05135},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}