Maintained by Difan Deng and Marius Lindauer.
The following list collects papers related to neural architecture search. It is by no means complete. If you notice a paper missing from the list, please let us know.
Please note that although NAS methods steadily improve, the quality of empirical evaluations in this field still lags behind that of other areas in machine learning, AI, and optimization. We would therefore like to share some best practices for the empirical evaluation of NAS methods, which we believe will facilitate sustained and measurable progress in the field. If you are interested in a teaser, please read our blog post or jump directly to our checklist.
Transformers have gained increasing popularity across domains. For a comprehensive list of papers focusing on neural architecture search for Transformer-based search spaces, the awesome-transformer-search repo is all you need.
2023
Liang, Jason; Shahrzad, Hormoz; Miikkulainen, Risto
Asynchronous Evolution of Deep Neural Network Architectures Technical Report
2023.
@techreport{liang2023asynchronous,
title = {Asynchronous Evolution of Deep Neural Network Architectures},
author = {Jason Liang and Hormoz Shahrzad and Risto Miikkulainen},
url = {https://arxiv.org/pdf/2308.04102},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Patcharabumrung, Praiwan; Jewajinda, Yutana; Praditwong, Kata
Effects of Genetic Operators on Neural Architecture Search Using Multi-Objective Genetic Algorithm Proceedings Article
In: 2023 20th International Joint Conference on Computer Science and Software Engineering (JCSSE), pp. 61-66, 2023.
@inproceedings{10201969,
title = {Effects of Genetic Operators on Neural Architecture Search Using Multi-Objective Genetic Algorithm},
author = {Praiwan Patcharabumrung and Yutana Jewajinda and Kata Praditwong},
url = {https://ieeexplore.ieee.org/abstract/document/10201969},
doi = {10.1109/JCSSE58229.2023.10201969},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 20th International Joint Conference on Computer Science and Software Engineering (JCSSE)},
pages = {61-66},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Ozaeta, Mark Anthony A.; Fajardo, Arnel C.; Brazas, Felimon P.; Cantal, Jed Allan M.
Seagrass Classification Using Differentiable Architecture Search Proceedings Article
In: 2023 20th International Joint Conference on Computer Science and Software Engineering (JCSSE), pp. 123-128, 2023.
@inproceedings{10202072,
title = {Seagrass Classification Using Differentiable Architecture Search},
author = {Mark Anthony A. Ozaeta and Arnel C. Fajardo and Felimon P. Brazas and Jed Allan M. Cantal},
url = {https://ieeexplore.ieee.org/abstract/document/10202072},
doi = {10.1109/JCSSE58229.2023.10202072},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 20th International Joint Conference on Computer Science and Software Engineering (JCSSE)},
pages = {123-128},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Dong, Wenqian; Kestor, Gokcen; Li, Dong
Auto-HPCnet: An Automatic Framework to Build Neural Network-based Surrogate for High-Performance Computing Applications Proceedings Article
In: Butt, Ali Raza; Mi, Ningfang; Chard, Kyle (Ed.): Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2023, Orlando, FL, USA, June 16-23, 2023, pp. 31–44, ACM, 2023.
@inproceedings{DBLP:conf/hpdc/DongK023,
title = {Auto-HPCnet: An Automatic Framework to Build Neural Network-based Surrogate for High-Performance Computing Applications},
author = {Wenqian Dong and Gokcen Kestor and Dong Li},
editor = {Ali Raza Butt and Ningfang Mi and Kyle Chard},
url = {https://doi.org/10.1145/3588195.3592985},
doi = {10.1145/3588195.3592985},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Proceedings of the 32nd International Symposium on High-Performance Parallel and Distributed Computing, HPDC 2023, Orlando, FL, USA, June 16-23, 2023},
pages = {31–44},
publisher = {ACM},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Maashi, Mashael S.; Alamro, Hayam; Mohsen, Heba; Negm, Noha; Mohammed, Gouse Pasha; Ahmed, Noura Abdelaziz; Ibrahim, Sara Saadeldeen; Alsaid, Mohamed Ibrahim
Modeling of Reptile Search Algorithm With Deep Learning Approach for Copy Move Image Forgery Detection Journal Article
In: IEEE Access, vol. 11, pp. 87297–87304, 2023.
@article{DBLP:journals/access/MaashiAMNMAIA23,
title = {Modeling of Reptile Search Algorithm With Deep Learning Approach for Copy Move Image Forgery Detection},
author = {Mashael S. Maashi and Hayam Alamro and Heba Mohsen and Noha Negm and Gouse Pasha Mohammed and Noura Abdelaziz Ahmed and Sara Saadeldeen Ibrahim and Mohamed Ibrahim Alsaid},
url = {https://doi.org/10.1109/ACCESS.2023.3304237},
doi = {10.1109/ACCESS.2023.3304237},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Access},
volume = {11},
pages = {87297–87304},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Huang, Yuxuan; Zhang, Xixi; Wang, Yu; Jiao, Donglai; Gui, Guan; Ohtsuki, Tomoaki
NASEI: Neural Architecture Search-Based Specific Emitter Identification Method Proceedings Article
In: 2023 IEEE 97th Vehicular Technology Conference (VTC2023-Spring), pp. 1-5, 2023.
@inproceedings{10199409,
title = {NASEI: Neural Architecture Search-Based Specific Emitter Identification Method},
author = {Yuxuan Huang and Xixi Zhang and Yu Wang and Donglai Jiao and Guan Gui and Tomoaki Ohtsuki},
url = {https://ieeexplore.ieee.org/abstract/document/10199409},
doi = {10.1109/VTC2023-Spring57618.2023.10199409},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 IEEE 97th Vehicular Technology Conference (VTC2023-Spring)},
pages = {1-5},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Kapoor, Rahul; Pillay, Nelishia
Iterative Structure-Based Genetic Programming for Neural Architecture Search Proceedings Article
In: Proceedings of the Companion Conference on Genetic and Evolutionary Computation, pp. 595–598, Association for Computing Machinery, Lisbon, Portugal, 2023, ISBN: 9798400701207.
@inproceedings{10.1145/3583133.3590759,
title = {Iterative Structure-Based Genetic Programming for Neural Architecture Search},
author = {Rahul Kapoor and Nelishia Pillay},
url = {https://doi.org/10.1145/3583133.3590759},
doi = {10.1145/3583133.3590759},
isbn = {9798400701207},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Proceedings of the Companion Conference on Genetic and Evolutionary Computation},
pages = {595–598},
publisher = {Association for Computing Machinery},
address = {Lisbon, Portugal},
series = {GECCO '23 Companion},
abstract = {In this paper we present an iterative structure-based genetic programming algorithm for neural architecture search. Canonical genetic programming uses a fitness function to determine where to move the search to in the program space. This research investigates using the structure of the syntax trees, representing different areas of the program space, in addition to the fitness function to direct the search. The structure is used to avoid areas of the search that previously led to local optima both globally (exploration) and locally (exploitation). The proposed approach is evaluated for image classification and video shorts creation. The iterative structure-based approach was found to produce better results than canonical genetic programming for both problem domains, with a slight reduction in computational cost. The approach also produced better results than genetic algorithms which are traditionally used for neural architecture search.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Klein, Aaron; Golebiowski, Jacek; Ma, Xingchen; Perrone, Valerio; Archambeau, Cédric
Structural pruning of large language models via neural architecture search Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{Klein2023,
title = {Structural pruning of large language models via neural architecture search},
author = {Aaron Klein and Jacek Golebiowski and Xingchen Ma and Valerio Perrone and Cédric Archambeau},
url = {https://www.amazon.science/publications/structural-pruning-of-large-language-models-via-neural-architecture-search},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Huang, Junhao; Xue, Bing; Sun, Yanan; Zhang, Mengjie
Multi-Objective Evolutionary Search of Compact Convolutional Neural Networks with Training-Free Estimation Proceedings Article
In: Proceedings of the Companion Conference on Genetic and Evolutionary Computation, pp. 655–658, Association for Computing Machinery, Lisbon, Portugal, 2023, ISBN: 9798400701207.
@inproceedings{10.1145/3583133.3590535,
title = {Multi-Objective Evolutionary Search of Compact Convolutional Neural Networks with Training-Free Estimation},
author = {Junhao Huang and Bing Xue and Yanan Sun and Mengjie Zhang},
url = {https://doi.org/10.1145/3583133.3590535},
doi = {10.1145/3583133.3590535},
isbn = {9798400701207},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Proceedings of the Companion Conference on Genetic and Evolutionary Computation},
pages = {655–658},
publisher = {Association for Computing Machinery},
address = {Lisbon, Portugal},
series = {GECCO '23 Companion},
abstract = {With the increasing demand of deploying convolutional neural networks (CNNs) on resource-constrained devices, designing high-performance and lightweight architectures has become a main challenge for neural architecture search (NAS). This paper develops an evolutionary multi-objective optimization framework to explore CNNs with different compactness in a flexible way. A multi-scale convolutional module is developed to enhance the feature learning capability. To further improve the architecture search efficiency, a low-cost metric based on neural tangent kernel is leveraged to estimate the trainability of CNNs instead of performing an expensive training process. Experiments are carried out on CIFAR-10 and CIFAR-100, to verify the effectiveness of the proposed method. Compared with the state-of-the-art algorithms, the proposed method discovers architectures with a smaller number of parameters and competitive classification performance using only up to 0.2 GPU days, showing a better trade-off between accuracy and model complexity.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Yan, Xueming; Huang, Han; Jin, Yaochu; Chen, Liang; Liang, Zhanning; Hao, Zhifeng
Neural Architecture Search via Multi-Hashing Embedding and Graph Tensor Networks for Multilingual Text Classification Journal Article
In: IEEE Transactions on Emerging Topics in Computational Intelligence, pp. 1-14, 2023.
@article{10218728,
title = {Neural Architecture Search via Multi-Hashing Embedding and Graph Tensor Networks for Multilingual Text Classification},
author = {Xueming Yan and Han Huang and Yaochu Jin and Liang Chen and Zhanning Liang and Zhifeng Hao},
url = {https://ieeexplore.ieee.org/abstract/document/10218728},
doi = {10.1109/TETCI.2023.3301774},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Emerging Topics in Computational Intelligence},
pages = {1-14},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Cheng, Quan; Huang, Mingqiang; Man, Changhai; Shen, Ao; Dai, Liuyao; Yu, Hao; Hashimoto, Masanori
Reliability Exploration of System-on-Chip With Multi-Bit-Width Accelerator for Multi-Precision Deep Neural Networks Journal Article
In: IEEE Transactions on Circuits and Systems I: Regular Papers, pp. 1-14, 2023.
@article{10220121,
title = {Reliability Exploration of System-on-Chip With Multi-Bit-Width Accelerator for Multi-Precision Deep Neural Networks},
author = {Quan Cheng and Mingqiang Huang and Changhai Man and Ao Shen and Liuyao Dai and Hao Yu and Masanori Hashimoto},
doi = {10.1109/TCSI.2023.3300899},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Circuits and Systems I: Regular Papers},
pages = {1-14},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Burghoff, Julian; Rottmann, Matthias; Conta, Jill; Schoenen, Sebastian; Witte, Andreas; Gottschalk, Hanno
ResBuilder: Automated Learning of Depth with Residual Structures Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2308-08504,
title = {ResBuilder: Automated Learning of Depth with Residual Structures},
author = {Julian Burghoff and Matthias Rottmann and Jill Conta and Sebastian Schoenen and Andreas Witte and Hanno Gottschalk},
url = {https://doi.org/10.48550/arXiv.2308.08504},
doi = {10.48550/arXiv.2308.08504},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2308.08504},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Kuş, Zeki; Kiraz, Berna; Göksu, Tuğçe Koçak; Aydın, Musa; Özkan, Esra; Vural, Atay; Kiraz, Alper; Can, Burhanettin
Differential evolution-based neural architecture search for brain vessel segmentation Journal Article
In: Engineering Science and Technology, an International Journal, vol. 46, pp. 101502, 2023, ISSN: 2215-0986.
@article{KUS2023101502,
title = {Differential evolution-based neural architecture search for brain vessel segmentation},
author = {Zeki Kuş and Berna Kiraz and Tuğçe Koçak Göksu and Musa Aydın and Esra Özkan and Atay Vural and Alper Kiraz and Burhanettin Can},
url = {https://www.sciencedirect.com/science/article/pii/S2215098623001805},
doi = {10.1016/j.jestch.2023.101502},
issn = {2215-0986},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Engineering Science and Technology, an International Journal},
volume = {46},
pages = {101502},
abstract = {Brain vasculature analysis is critical in developing novel treatment targets for neurodegenerative diseases. Such an accurate analysis cannot be performed manually but requires a semi-automated or fully-automated approach. Deep learning methods have recently proven indispensable for the automated segmentation and analysis of medical images. However, optimizing a deep learning network architecture is another challenge. Manually selecting deep learning network architectures and tuning their hyper-parameters requires a lot of expertise and effort. To solve this problem, neural architecture search (NAS) approaches that explore more efficient network architectures with high segmentation performance have been proposed in the literature. This study introduces differential evolution-based NAS approaches in which a novel search space is proposed for brain vessel segmentation. We select two architectures that are frequently used for medical image segmentation, i.e. U-Net and Attention U-Net, as baselines for NAS optimizations. The conventional differential evolution and the opposition-based differential evolution with novel search space are employed as search methods in NAS. Furthermore, we perform ablation studies and evaluate the effects of specific loss functions, model pruning, threshold selection and generalization performance on the proposed models. The experiments are conducted on two datasets providing 335 single-channel 8-bit gray-scale images. These datasets are a public volumetric cerebrovascular system dataset (vesseINN) and our own dataset called KUVESG. The proposed NAS approaches, namely UNAS-Net and Attention UNAS-Net architectures, yield better segmentation performance in terms of different segmentation metrics. More specifically, UNAS-Net with differential evolution reveals high dice score/sensitivity values of 79.57/81.48, respectively. Moreover, they provide shorter inference times by a factor of 9.15 than the baseline methods.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Wang, Li; Xie, Tao; Zhang, Xinyu; Jiang, Zhiqiang; Yang, Linqi; Zhang, Haoming; Li, Xiaoyu; Ren, Yilong; Yu, Haiyang; Li, Jun; Liu, Huaping
Auto-Points: Automatic Learning for Point Cloud Analysis with Neural Architecture Search Journal Article
In: IEEE Transactions on Multimedia, pp. 1-16, 2023.
@article{10223431,
title = {Auto-Points: Automatic Learning for Point Cloud Analysis with Neural Architecture Search},
author = {Li Wang and Tao Xie and Xinyu Zhang and Zhiqiang Jiang and Linqi Yang and Haoming Zhang and Xiaoyu Li and Yilong Ren and Haiyang Yu and Jun Li and Huaping Liu},
doi = {10.1109/TMM.2023.3304892},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Multimedia},
pages = {1-16},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Aach, Marcel; Inanc, Eray; Sarma, Rakesh; Riedel, Morris; Lintermann, Andreas
Optimal Resource Allocation for Early Stopping-based Neural Architecture Search Methods Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{aach2023optimal,
title = {Optimal Resource Allocation for Early Stopping-based Neural Architecture Search Methods},
author = {Marcel Aach and Eray Inanc and Rakesh Sarma and Morris Riedel and Andreas Lintermann},
url = {https://openreview.net/forum?id=lmtNt--6dw},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Xue, Yu; Lu, Changchang; Neri, Ferrante; Qin, Jiafeng
Improved Differentiable Architecture Search With Multi-Stage Progressive Partial Channel Connections Journal Article
In: IEEE Transactions on Emerging Topics in Computational Intelligence, pp. 1-12, 2023.
@article{10223437,
title = {Improved Differentiable Architecture Search With Multi-Stage Progressive Partial Channel Connections},
author = {Yu Xue and Changchang Lu and Ferrante Neri and Jiafeng Qin},
doi = {10.1109/TETCI.2023.3301395},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Emerging Topics in Computational Intelligence},
pages = {1-12},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhang, Xixi; Chen, Xiaofeng; Wang, Yu; Gui, Guan; Adebisi, Bamidele; Sari, Hikmet; Adachi, Fumiyuki
Lightweight Automatic Modulation Classification via Progressive Differentiable Architecture Search Journal Article
In: IEEE Transactions on Cognitive Communications and Networking, pp. 1-1, 2023.
@article{10224342,
title = {Lightweight Automatic Modulation Classification via Progressive Differentiable Architecture Search},
author = {Xixi Zhang and Xiaofeng Chen and Yu Wang and Guan Gui and Bamidele Adebisi and Hikmet Sari and Fumiyuki Adachi},
doi = {10.1109/TCCN.2023.3306391},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Cognitive Communications and Networking},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Cheng, Ke; Xi, Ning; Liu, Ximeng; Zhu, Xinghui; Gao, Haichang; Zhang, Zhiwei; Shen, Yulong
Private Inference for Deep Neural Networks: A Secure, Adaptive, and Efficient Realization Journal Article
In: IEEE Transactions on Computers, pp. 1-13, 2023.
@article{10224651,
title = {Private Inference for Deep Neural Networks: A Secure, Adaptive, and Efficient Realization},
author = {Ke Cheng and Ning Xi and Ximeng Liu and Xinghui Zhu and Haichang Gao and Zhiwei Zhang and Yulong Shen},
doi = {10.1109/TC.2023.3305754},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Computers},
pages = {1-13},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Chowdhury, Anjir Ahmed; Mahmud, S. M. Hasan; Hoque, Khadija Kubra Shahjalal; Ahmed, Kawsar; Bui, Francis M.; Lio, Pietro; Moni, Mohammad Ali; Al-Zahrani, Fahad Ahmed
StackFBAs: Detection of fetal brain abnormalities using CNN with stacking strategy from MRI images Journal Article
In: Journal of King Saud University - Computer and Information Sciences, vol. 35, no. 8, pp. 101647, 2023, ISSN: 1319-1578.
@article{CHOWDHURY2023101647b,
title = {StackFBAs: Detection of fetal brain abnormalities using CNN with stacking strategy from MRI images},
author = {Anjir Ahmed Chowdhury and S. M. Hasan Mahmud and Khadija Kubra Shahjalal Hoque and Kawsar Ahmed and Francis M. Bui and Pietro Lio and Mohammad Ali Moni and Fahad Ahmed Al-Zahrani},
url = {https://www.sciencedirect.com/science/article/pii/S131915782300201X},
doi = {10.1016/j.jksuci.2023.101647},
issn = {1319-1578},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Journal of King Saud University - Computer and Information Sciences},
volume = {35},
number = {8},
pages = {101647},
abstract = {Predicting fetal brain abnormalities (FBAs) is an urgent global problem, as nearly three of every thousand women are pregnant with neurological abnormalities. Therefore, early detection of FBAs using deep learning (DL) can help to enhance the planning and quality of diagnosis and treatment for pregnant women. Most of the research papers focused on brain abnormalities of newborns and premature infants, but fewer studies concentrated on fetuses. This study proposed a deep learning-CNN-based framework named StackFBAs that utilized the stacking strategy to classify fetus brain abnormalities more accurately using MRI images at an early stage. We considered the Greedy-based Neural architecture search (NAS) method to identify the best CNN architectures to solve this problem utilizing brain MRI images. A total of 94 CNN architectures were generated from the NAS method, and the best 5 CNN models were selected to build the baseline models. Subsequently, the probabilistic scores of these baseline models were combined to construct the final meta-model (KNN) utilizing the stacking strategy. The experimental results demonstrated that StackFBAs outperform pre-trained CNN Models (e.g., VGG16, VGG19, ResNet50, DenseNet121, and ResNet152) with transfer learning (TL) and existing models with the 5-fold cross-validation tests. StackFBAs achieved an overall accuracy of 80%, an F1-score of 78%, 76% sensitivity, and a specificity of 78%. Moreover, we employed the federated learning technique that protects sensitive fetal MRI data, combines results, and finds common patterns from many users, making the model more robust for the privacy and security of user-sensitive data. We believe that our novel framework could be used as a helpful tool for detecting brain abnormalities at an early stage.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Chen, Wuyang; Huang, Wei; Wang, Zhangyang
“No Free Lunch” in Neural Architectures? A Joint Analysis of Expressivity, Convergence, and Generalization Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{chen2023no,
title = {“No Free Lunch” in Neural Architectures? A Joint Analysis of Expressivity, Convergence, and Generalization},
author = {Wuyang Chen and Wei Huang and Zhangyang Wang},
url = {https://openreview.net/forum?id=EMys3eIDJ2},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Aichberger, Lukas; Klambauer, Günter
ELENAS: Elementary Neural Architecture Search Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{aichberger2023elenas,
title = {ELENAS: Elementary Neural Architecture Search},
author = {Lukas Aichberger and Günter Klambauer},
url = {https://openreview.net/forum?id=1tZY0La5GFRp},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Chen, Shiming; Chen, Shihuang; Hou, Wenjin; Ding, Weiping; You, Xinge
EGANS: Evolutionary Generative Adversarial Network Search for Zero-Shot Learning Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2308-09915,
title = {EGANS: Evolutionary Generative Adversarial Network Search for Zero-Shot Learning},
author = {Shiming Chen and Shihuang Chen and Wenjin Hou and Weiping Ding and Xinge You},
url = {https://doi.org/10.48550/arXiv.2308.09915},
doi = {10.48550/arXiv.2308.09915},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2308.09915},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Meyer-Lee, Gabriel; Cheney, Nick
On the selection of neural architectures from a supernet Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{meyer-lee2023on,
title = {On the selection of neural architectures from a supernet},
author = {Gabriel Meyer-Lee and Nick Cheney},
url = {https://openreview.net/forum?id=XM_v85teqN},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Lukasik, Jovita; Geiping, Jonas; Moeller, Michael; Keuper, Margret
Differentiable Architecture Search: a One-Shot Method? Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{lukasik2023differentiable,
title = {Differentiable Architecture Search: a One-Shot Method?},
author = {Jovita Lukasik and Jonas Geiping and Michael Moeller and Margret Keuper},
url = {https://openreview.net/forum?id=LV-5kHj-uV5},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Roshtkhari, Mehraveh Javan; Toews, Matthew; Pedersoli, Marco
Balanced Mixture of Supernets for Learning the CNN Pooling Architecture Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{roshtkhari2023balanced,
title = {Balanced Mixture of Supernets for Learning the CNN Pooling Architecture},
author = {Mehraveh Javan Roshtkhari and Matthew Toews and Marco Pedersoli},
url = {https://openreview.net/forum?id=8-8k3okjpY},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Dimanov, Daniel; Singleton, Colin; Rostami, Shahin; Balaguer-Ballester, Emili
MEOW - Multi-Objective Evolutionary Weapon Detection Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{dimanov2023meow,
title = {MEOW - Multi-Objective Evolutionary Weapon Detection},
author = {Daniel Dimanov and Colin Singleton and Shahin Rostami and Emili Balaguer-Ballester},
url = {https://openreview.net/forum?id=Eyzx7rDo-JNh},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Carmichael, Zachariah J; Moon, Tim; Jacobs, Sam Ade
Learning Debuggable Models Through Multi-Objective NAS Proceedings Article
In: AutoML Conference 2023, 2023.
@inproceedings{carmichael2023learning,
title = {Learning Debuggable Models Through Multi-Objective NAS},
author = {Zachariah J Carmichael and Tim Moon and Sam Ade Jacobs},
url = {https://openreview.net/forum?id=AwL9ZZOPVlN},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Yoshihama, Yutaka; Yadani, Kenichi; Isobe, Shota
Hardware-Aware Zero-Shot Neural Architecture Search Proceedings Article
In: 2023 18th International Conference on Machine Vision and Applications (MVA), pp. 1-5, 2023.
@inproceedings{10216205,
title = {Hardware-Aware Zero-Shot Neural Architecture Search},
author = {Yutaka Yoshihama and Kenichi Yadani and Shota Isobe},
url = {https://ieeexplore.ieee.org/abstract/document/10216205},
doi = {10.23919/MVA57639.2023.10216205},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 18th International Conference on Machine Vision and Applications (MVA)},
pages = {1-5},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Zhang, Ruohan; Jiao, Licheng; Wang, Dan; Liu, Fang; Liu, Xu; Yang, Shuyuan
A Fast Evolutionary Knowledge Transfer Search for Multiscale Deep Neural Architecture Journal Article
In: IEEE Transactions on Neural Networks and Learning Systems, pp. 1-15, 2023.
@article{10227743,
title = {A Fast Evolutionary Knowledge Transfer Search for Multiscale Deep Neural Architecture},
author = {Ruohan Zhang and Licheng Jiao and Dan Wang and Fang Liu and Xu Liu and Shuyuan Yang},
url = {https://ieeexplore.ieee.org/abstract/document/10227743},
doi = {10.1109/TNNLS.2023.3304291},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Transactions on Neural Networks and Learning Systems},
pages = {1-15},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Zhang, Rui; Gao, Mei-Rong; Zhang, Peng-Yun; Zhang, Yong-Mei; Fu, Liu-Hu; Chai, Yan-Feng
Research on an ultrasonic detection method for weld defects based on neural network architecture search Journal Article
In: Measurement, vol. 221, pp. 113483, 2023, ISSN: 0263-2241.
@article{ZHANG2023113483,
title = {Research on an ultrasonic detection method for weld defects based on neural network architecture search},
author = {Rui Zhang and Mei-Rong Gao and Peng-Yun Zhang and Yong-Mei Zhang and Liu-Hu Fu and Yan-Feng Chai},
url = {https://www.sciencedirect.com/science/article/pii/S0263224123010473},
doi = {10.1016/j.measurement.2023.113483},
issn = {0263-2241},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Measurement},
volume = {221},
pages = {113483},
abstract = {In order to further reduce the subjectivity of network design and improve the ability of model feature extraction, an ultrasonic detection method for weld defects based on neural network architecture search is proposed. Through the designed multi-level and multi-branch search space and an untrained architecture search and evaluation method, an efficient defect classification network was automatically constructed to complete the task of weld defect classification. Experiments were carried out on a self-constructed data set, and compared with the manually designed model, the classification accuracy of defect types reached 95.26% when the number of parameters was only 7.3 M. Compared with the model constructed using neural network architecture search, the proposed method can reduce the searching time to 8.29% of the baseline model while weighing multiple conflicting objectives, which proved the efficiency and effectiveness of the proposed method.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Luo, Erjing; Huang, Haitong; Liu, Cheng; Li, Guoyu; Yang, Bing; Wang, Ying; Li, Huawei; Li, Xiaowei
DeepBurning-MixQ: An Open Source Mixed-Precision Neural Network Accelerator Design Framework for FPGAs Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2308-11334,
title = {DeepBurning-MixQ: An Open Source Mixed-Precision Neural Network Accelerator Design Framework for FPGAs},
author = {Erjing Luo and Haitong Huang and Cheng Liu and Guoyu Li and Bing Yang and Ying Wang and Huawei Li and Xiaowei Li},
url = {https://doi.org/10.48550/arXiv.2308.11334},
doi = {10.48550/arXiv.2308.11334},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2308.11334},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Deng, Xinchi; Shi, Han; Huang, Runhui; Li, Changlin; Xu, Hang; Han, Jianhua; Kwok, James T.; Zhao, Shen; Zhang, Wei; Liang, Xiaodan
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training Journal Article
In: CoRR, vol. abs/2308.11331, 2023.
@article{DBLP:journals/corr/abs-2308-11331,
title = {GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training},
author = {Xinchi Deng and Han Shi and Runhui Huang and Changlin Li and Hang Xu and Jianhua Han and James T. Kwok and Shen Zhao and Wei Zhang and Xiaodan Liang},
url = {https://doi.org/10.48550/arXiv.2308.11331},
doi = {10.48550/arXiv.2308.11331},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2308.11331},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Wu, Jiong; Fan, Yong
HNAS-Reg: Hierarchical Neural Architecture Search for Deformable Medical Image Registration Proceedings Article
In: 20th IEEE International Symposium on Biomedical Imaging, ISBI 2023, Cartagena, Colombia, April 18-21, 2023, pp. 1–4, IEEE, 2023.
@inproceedings{DBLP:conf/isbi/WuF23,
title = {HNAS-Reg: Hierarchical Neural Architecture Search for Deformable Medical Image Registration},
author = {Jiong Wu and Yong Fan},
url = {https://doi.org/10.1109/ISBI53787.2023.10230534},
doi = {10.1109/ISBI53787.2023.10230534},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {20th IEEE International Symposium on Biomedical Imaging, ISBI 2023, Cartagena, Colombia, April 18-21, 2023},
pages = {1–4},
publisher = {IEEE},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Zhang, Dongran; Luo, Gang; Li, Jun
Traffic Spatial-Temporal Prediction Based on Neural Architecture Search Proceedings Article
In: Proceedings of the 18th International Symposium on Spatial and Temporal Data, pp. 21–30, Association for Computing Machinery, Calgary, AB, Canada, 2023, ISBN: 9798400708992.
@inproceedings{10.1145/3609956.3609962,
title = {Traffic Spatial-Temporal Prediction Based on Neural Architecture Search},
author = {Dongran Zhang and Gang Luo and Jun Li},
url = {https://doi.org/10.1145/3609956.3609962},
doi = {10.1145/3609956.3609962},
isbn = {9798400708992},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Proceedings of the 18th International Symposium on Spatial and Temporal Data},
pages = {21–30},
publisher = {Association for Computing Machinery},
address = {Calgary, AB, Canada},
series = {SSTD '23},
abstract = {Traffic spatial-temporal prediction is essential for intelligent transportation systems. However, the current approach relies heavily on expert knowledge and time-consuming manual modeling. Neural architecture search can build models adaptively, but it is rarely used for traffic spatial-temporal prediction, nor is it designed specifically for traffic spatial-temporal feature. In response to the above problems, we propose neural architecture search spatial-temporal prediction (NASST), which is a method to automatically generate a traffic spatial-temporal prediction network by performing a differentiable neural network architecture search in an optimized search space. First, we adopt a differentiable neural architecture search method to continuously relax the discrete traffic spatial-temporal prediction model architecture search, and adopt a fusion strategy of comprehensive concatenate and addition (CA) to achieve efficient neural architecture search. Second, we optimize the search space and introduce a series of classic traffic spatial-temporal feature extraction modules, which are more in line with the architectural requirements of traffic spatial-temporal prediction network. Finally, our model is validated on two public traffic datasets and achieves the best predictions. Compared with traditional manual modeling methods, our method can realize the automatic search of high-precision predictive model architectures, which improves the modeling efficiency.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Meyer-Lee, Gabriel; Cheney, Nick
Evaluating supernets for neural architecture search Proceedings Article
In: AutoML Conference 2023 (Workshop), 2023.
@inproceedings{meyer-lee2023evaluating,
title = {Evaluating supernets for neural architecture search},
author = {Gabriel Meyer-Lee and Nick Cheney},
url = {https://openreview.net/forum?id=13BSG9cwvu},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023 (Workshop)},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Deng, Wei-ming; Li, Chun-chun; Yang, Tie-jun
Object detection for nameplate based on neural architecture search Journal Article
In: Journal of Graphics, vol. 44, no. 4, pp. 718-727, 2023.
@article{DENGWei-ming_718,
title = {Object detection for nameplate based on neural architecture search},
author = {Wei-ming Deng and Chun-chun Li and Tie-jun Yang},
doi = {10.11996/JG.j.2095-302X.2023040718},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Journal of Graphics},
volume = {44},
number = {4},
pages = {718-727},
publisher = {Journal of Graphics},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Fu, Jian; Wang, Qifeng
Study of DNN Network Architecture Search for Robot Vision Proceedings Article
In: 2023 International Conference on Advanced Robotics and Mechatronics (ICARM), pp. 366-372, 2023.
@inproceedings{10218405,
title = {Study of DNN Network Architecture Search for Robot Vision},
author = {Jian Fu and Qifeng Wang},
doi = {10.1109/ICARM58088.2023.10218405},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 International Conference on Advanced Robotics and Mechatronics (ICARM)},
pages = {366-372},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Xiang, Lichuan; Dudziak, Łukasz; Mehrotra, Abhinav; Abdelfattah, Mohamed S; Lane, Nicholas Donald; Wen, Hongkai
Generating Neural Network Architectures with Conditional Graph Normalizing Flows Proceedings Article
In: AutoML Conference 2023 (Workshop), 2023.
@inproceedings{xiang2023generating,
title = {Generating Neural Network Architectures with Conditional Graph Normalizing Flows},
author = {Lichuan Xiang and Łukasz Dudziak and Abhinav Mehrotra and Mohamed S Abdelfattah and Nicholas Donald Lane and Hongkai Wen},
url = {https://openreview.net/forum?id=pwilwGCwPQ},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {AutoML Conference 2023 (Workshop)},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Ding, Chenchen; Ren, Hongwei; Guo, Zhiru; Bi, Minjie; Man, Changhai; Wang, Tingting; Li, Shuwei; Luo, Shaobo; Zhang, Rumin; Yu, Hao
TT-LCD: Tensorized-Transformer based Loop Closure Detection for Robotic Visual SLAM on Edge Proceedings Article
In: 2023 International Conference on Advanced Robotics and Mechatronics (ICARM), pp. 166-172, 2023.
@inproceedings{10218828,
title = {TT-LCD: Tensorized-Transformer based Loop Closure Detection for Robotic Visual SLAM on Edge},
author = {Chenchen Ding and Hongwei Ren and Zhiru Guo and Minjie Bi and Changhai Man and Tingting Wang and Shuwei Li and Shaobo Luo and Rumin Zhang and Hao Yu},
doi = {10.1109/ICARM58088.2023.10218828},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 International Conference on Advanced Robotics and Mechatronics (ICARM)},
pages = {166-172},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Wu, Ruoyou; Li, Cheng; Zou, Juan; Wang, Shanshan
Generalizable Learning Reconstruction for Accelerating MR Imaging via Federated Neural Architecture Search Technical Report
2023.
@techreport{wu2023generalizable,
title = {Generalizable Learning Reconstruction for Accelerating MR Imaging via Federated Neural Architecture Search},
author = {Ruoyou Wu and Cheng Li and Juan Zou and Shanshan Wang},
url = {https://arxiv.org/abs/2308.13995},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Kuş, Zeki; Kiraz, Berna
BaDENAS: Bayesian Based Neural Architecture Search for Retinal Vessel Segmentation Proceedings Article
In: 2023 31st Signal Processing and Communications Applications Conference (SIU), pp. 1-4, 2023.
@inproceedings{10223862,
title = {BaDENAS: Bayesian Based Neural Architecture Search for Retinal Vessel Segmentation},
author = {Zeki Kuş and Berna Kiraz},
url = {https://ieeexplore.ieee.org/abstract/document/10223862},
doi = {10.1109/SIU59756.2023.10223862},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 31st Signal Processing and Communications Applications Conference (SIU)},
pages = {1-4},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Selg, Hardi; Jenihhin, Maksim; Ellervee, Peeter; Raik, Jaan
ML-Based Online Design Error Localization for RISC-V Implementations Proceedings Article
In: 2023 IEEE 29th International Symposium on On-Line Testing and Robust System Design (IOLTS), pp. 1-7, 2023.
@inproceedings{10224864,
title = {ML-Based Online Design Error Localization for RISC-V Implementations},
author = {Hardi Selg and Maksim Jenihhin and Peeter Ellervee and Jaan Raik},
url = {https://ieeexplore.ieee.org/abstract/document/10224864},
doi = {10.1109/IOLTS59296.2023.10224864},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {2023 IEEE 29th International Symposium on On-Line Testing and Robust System Design (IOLTS)},
pages = {1-7},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Sasaki, Yuya
Efficient and Explainable Graph Neural Architecture Search via Monte-Carlo Tree Search Technical Report
2023.
@techreport{sasaki2023efficient,
title = {Efficient and Explainable Graph Neural Architecture Search via Monte-Carlo Tree Search},
author = {Yuya Sasaki},
url = {https://arxiv.org/abs/2308.15734},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Sridhar, Sharath Nittur; Kundu, Souvik; Sundaresan, Sairam; Szankin, Maciej; Sarah, Anthony
InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning Technical Report
2023.
@techreport{sridhar2023instatune,
title = {InstaTune: Instantaneous Neural Architecture Search During Fine-Tuning},
author = {Sharath Nittur Sridhar and Souvik Kundu and Sairam Sundaresan and Maciej Szankin and Anthony Sarah},
url = {https://arxiv.org/abs/2308.15609},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Gao, Jianliang; He, Changlong; Chen, Jiamin; Li, Qiutong; Wang, Yili
Decoupled Graph Neural Architecture Search with Variable Propagation Operation and Appropriate Depth Proceedings Article
In: Proceedings of the 35th International Conference on Scientific and Statistical Database Management, Association for Computing Machinery, Los Angeles, CA, USA, 2023, ISBN: 9798400707469.
@inproceedings{10.1145/3603719.3603729,
title = {Decoupled Graph Neural Architecture Search with Variable Propagation Operation and Appropriate Depth},
author = {Jianliang Gao and Changlong He and Jiamin Chen and Qiutong Li and Yili Wang},
url = {https://doi.org/10.1145/3603719.3603729},
doi = {10.1145/3603719.3603729},
isbn = {9798400707469},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {Proceedings of the 35th International Conference on Scientific and Statistical Database Management},
publisher = {Association for Computing Machinery},
address = {Los Angeles, CA, USA},
series = {SSDBM '23},
abstract = {To alleviate the over-smoothing problem caused by deep graph neural networks, decoupled graph neural networks (DGNNs) are proposed. DGNNs decouple the graph neural network into two atomic operations, the propagation (P) operation and the transformation (T) operation. Since manually designing the architecture of DGNNs is a time-consuming and expert-dependent process, the DF-GNAS method is designed, which can automatically construct the architecture of DGNNs with fixed propagation operation and deep layers. The propagation operation is a key process for DGNNs to aggregate graph structure information. However, DF-GNAS automatically designs DGNN architecture using fixed propagation operation for different graph structures will cause performance loss. Meanwhile, DF-GNAS designs deep DGNNs for graphs with simple distributions, which may lead to overfitting problems. To solve the above challenges, we propose the Decoupled Graph Neural Architecture Search with Variable Propagation Operation and Appropriate Depth (DGNAS-PD) method. In DGNAS-PD, we design a DGNN operation space with variable efficient propagation operations in order to better aggregate information on different graph structures. We build an effective genetic search strategy to adaptively design appropriate DGNN depths instead of deep DGNNs for the graph with simple distributions in DGNAS-PD. The experiments on five real-world graphs show that DGNAS-PD outperforms state-of-art baseline methods.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Li, Jialin; Cao, Xuan; Chen, Renxiang; Zhang, Xia; Huang, Xianzhen; Qu, Yongzhi
Graph neural network architecture search for rotating machinery fault diagnosis based on reinforcement learning Journal Article
In: Mechanical Systems and Signal Processing, vol. 202, pp. 110701, 2023, ISSN: 0888-3270.
@article{LI2023110701,
title = {Graph neural network architecture search for rotating machinery fault diagnosis based on reinforcement learning},
author = {Jialin Li and Xuan Cao and Renxiang Chen and Xia Zhang and Xianzhen Huang and Yongzhi Qu},
url = {https://www.sciencedirect.com/science/article/pii/S088832702300609X},
doi = {10.1016/j.ymssp.2023.110701},
issn = {0888-3270},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Mechanical Systems and Signal Processing},
volume = {202},
pages = {110701},
abstract = {In order to improve the accuracy of fault diagnosis, researchers are constantly trying to develop new diagnostic models. However, limited by the inherent thinking of human beings, it has always been difficult to build a pioneering architecture for rotating machinery fault diagnosis. In order to solve this problem, this paper uses reinforcement learning algorithm based on adjacency matrix to carry out network architecture search (NAS) of rotating machinery fault diagnosis model. A reinforcement learning agent for deep deterministic policy gradient (DDPG) is developed based on actor–critic neural networks. The observation state of reinforcement learning is used to develop the graph neural network (GNN) diagnosis model, and the diagnosis accuracy is fed back to the agent as a reward for updating the reinforcement learning parameters. The MFPT bearing fault datasets and the developed gear pitting fault experimental data are used to validate the proposed network architecture search method based on reinforcement learning (RL-NAS). The proposed method is proved to be practical and effective in various aspects such as fault diagnosis ability, search space, search efficiency and multi-working condition performance.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Dai, Haixing
Brain-inspired Approaches for Advancing Artificial Intelligence PhD Thesis
University of Georgia, 2023.
@phdthesis{DaiHaixing2023BAfA,
title = {Brain-inspired Approaches for Advancing Artificial Intelligence},
author = {Haixing Dai},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
pages = {197},
school = {University of Georgia},
abstract = {Deep learning has experienced rapid growth and garnered significant attention in recent decades. Simultaneously, neuroscience has remained a challenging and enigmatic field of study. Inspired by the structure and function of the brain, researchers have developed increasingly powerful and sophisticated deep learning models that have achieved remarkable performance in various domains, including computer vision, natural language processing, and medical image analysis. These brain-inspired models have revolutionized the field of artificial intelligence, enabling breakthroughs in tasks such as image recognition, language understanding, and disease diagnosis. In turn, the application of these advanced deep learning models has provided valuable insights into the inner workings of the human brain, revealing temporal and spatial functional brain networks. The symbiotic relationship between artificial intelligence and neuroscience is evident, as they continuously inform and complement each other's progress. This dissertation presents novel frameworks that integrate deep learning and knowledge from brain science. This research aims to gain insights into the brain and refine deep learning models through brain-inspired principles. The dissertation first discusses how deep learning has been applied to study the brain, focusing on areas such as modeling cortical folding patterns, hierarchical brain structures, and spatial-temporal brain networks. It then discusses how artificial neural networks have drawn inspiration from the brain, using examples like convolutional neural networks, attention mechanisms, and language models. The dissertation’s main contributions are several computational frameworks integrating brain-inspired insights. These include a graph representation neural architecture search method to optimize recurrent neural networks for analyzing spatiotemporal brain networks, a hierarchical semantic tree concept whitening model to disentangle concept representations for image classification, a twin-transformer framework to study gyri and sulci in the cortex, a core-periphery guided vision transformer, and methods leveraging language models to generate data and analyze health narratives. Overall, this dissertation explores how we can understand the brain better using deep learning and ultimately build more efficient, robust, and interpretable artificial neural networks inspired by the brain.},
keywords = {},
pubstate = {published},
tppubtype = {phdthesis}
}
Shariatzadeh, Seyed Mahdi; Fathy, Mahmood; Berangi, Reza
Multi-objective single-shot neural architecture search via efficient convolutional filters Journal Article
In: Electronics Letters, vol. 59, no. 17, pp. e12939, 2023.
@article{https://doi.org/10.1049/ell2.12939,
title = {Multi-objective single-shot neural architecture search via efficient convolutional filters},
author = {Seyed Mahdi Shariatzadeh and Mahmood Fathy and Reza Berangi},
url = {https://ietresearch.onlinelibrary.wiley.com/doi/abs/10.1049/ell2.12939},
doi = {10.1049/ell2.12939},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Electronics Letters},
volume = {59},
number = {17},
pages = {e12939},
abstract = {This paper presents a novel approach for fast neural architecture search (NAS) in Convolutional Neural Networks (CNNs) for end-to-end License Plate Recognition (LPR). The authors propose a one-shot schema that considers the efficiency of different convolutional filters to create a search space for more efficient architectures on vector processing cores. The authors’ approach utilizes a super-network for LPR using Connectionist-Temporal-Cost (CTC) and ranks the importance of filters to generate a fine-grain list of architectures. These architectures are evaluated in a multi-objective manner, resulting in several Pareto-optimal architectures with different computational costs and validation errors. Rather than using a single complicated building block for all layers, the authors’ method allows each stage to select a custom building block with fewer or more operations. The authors show that their super-network is flexible to calculate filters of any required size and stride in each stage while keeping it efficient by the structural pruning. The authors’ experiments, which were performed on Iranian LPR, demonstrate that this method produces a variety of fast and efficient CNNs. Furthermore, the authors discuss the potential of this method for use in other areas of CNN application.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Le, Minh; Nguyen, Nhan; Luong, Ngoc Hoang
Efficacy of Neural Prediction-Based NAS for Zero-Shot NAS Paradigm Technical Report
2023.
@techreport{DBLP:journals/corr/abs-2308-16775,
title = {Efficacy of Neural Prediction-Based NAS for Zero-Shot NAS Paradigm},
author = {Minh Le and Nhan Nguyen and Ngoc Hoang Luong},
url = {https://doi.org/10.48550/arXiv.2308.16775},
doi = {10.48550/arXiv.2308.16775},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {CoRR},
volume = {abs/2308.16775},
keywords = {},
pubstate = {published},
tppubtype = {techreport}
}
Odema, Mohanad; Faruque, Mohammad Abdullah Al
PrivyNAS: Privacy-Aware Neural Architecture Search for Split Computing in Edge-Cloud Systems Journal Article
In: IEEE Internet of Things Journal, pp. 1-1, 2023.
@article{10239258,
title = {PrivyNAS: Privacy-Aware Neural Architecture Search for Split Computing in Edge-Cloud Systems},
author = {Mohanad Odema and Mohammad Abdullah Al Faruque},
url = {https://ieeexplore.ieee.org/document/10239258},
doi = {10.1109/JIOT.2023.3311761},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Internet of Things Journal},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}