RMIT University
Browse

PRE-NAS: Evolutionary Neural Architecture Search With Predictor

journal contribution
posted on 2024-11-03, 09:12 authored by Yameng Peng, Andy SongAndy Song, Victor CiesielskiVictor Ciesielski, Haytham AbokelaHaytham Abokela, Xiaojun ChangXiaojun Chang
Neural architecture search (NAS) aims to automate architecture engineering in neural networks. This often requires a high computational overhead to evaluate a number of candidate networks from the set of all possible networks in the search space. Prediction of the performance of a network can alleviate this high computational overhead by mitigating the need for evaluating every candidate network. Developing such a predictor typically requires a large number of evaluated architectures which may be difficult to obtain. We address this challenge by proposing a novel evolutionary-based NAS strategy, predictor-assisted evolutionary NAS (PRE-NAS) which can perform well even with an extremely small number of evaluated architectures. PRE-NAS leverages new evolutionary search strategies and integrates high-fidelity weight inheritance over generations. Unlike one-shot strategies, which may suffer from bias in the evaluation due to weight sharing, offspring candidates in PRE-NAS are topologically homogeneous. This circumvents bias and leads to more accurate predictions. Extensive experiments on the NAS-Bench-201 and DARTS search spaces show that PRE-NAS can outperform state-of-the-art NAS methods. With only a single GPU searching for 0.6 days, a competitive architecture can be found by PRE-NAS which achieves 2.40% and 24% test error rates on CIFAR-10 and ImageNet, respectively.

History

Journal

IEEE Transactions on Evolutionary Computation

Volume

27

Issue

1

Start page

26

End page

36

Total pages

11

Publisher

IEEE

Place published

United States

Language

English

Copyright

© 2022 IEEE

Former Identifier

2006122602

Esploro creation date

2023-06-07

Usage metrics

    Scholarly Works

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC