- Title
- Fast automatic optimisation of CNN architectures for image classification using genetic algorithm
- Creator
- Bakhshi, Ali; Noman, Nasimul; Chen, Zhiyong; Zamani, Mohsen; Chalup, Stephan
- Relation
- 2019 IEEE Congress on Evolutionary Computation (CEC). Proceedings of the 2019 IEEE Congress on Evolutionary Computation (Wellington, NZ 10-13 June, 2019) p. 1283-1290
- Publisher Link
- http://dx.doi.org/10.1109/CEC.2019.8790197
- Publisher
- Institute of Electrical and Electronics Engineers (IEEE)
- Resource Type
- conference paper
- Date
- 2019
- Description
- Convolutional Neural Networks (CNNs) are currently the most prominent deep neural network models and have been used with great success for image classification and other applications. The performance of CNNs depends on their architecture and hyperparameter settings. Early CNN models like LeNet and AlexNet were manually designed by experienced researchers. The empirical design and optimisation of a new CNN architecture require a lot of expertise and can be very time-consuming. In this paper, we propose a genetic algorithm that can, for a given image processing task, efficiently explore a defined space of potentially suitable CNN architectures and simultaneously optimise their hyperparameters. We named this fast automatic optimisation model fast-CNN and employed it to find competitive CNN architectures for image classification on CIFAR10. In a series of comparative simulation experiments we could demonstrate that the network designed by fast-CNN achieved nearly as good accuracy as some of the other best network models available but fast-CNN took significantly less time to evolve. The trained fast-CNN network model also generalised well to CIFAR100.
- Subject
- neural networks; genetic algorithms; computer architecture; sociology; statistics; biological cells; Australia
- Identifier
- http://hdl.handle.net/1959.13/1406891
- Identifier
- uon:35666
- Identifier
- ISBN:9781728121536
- Rights
- © 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
- Language
- eng
- Full Text
- Reviewed
File | Description | Size | Format
---|---|---|---
ATTACHMENT02 | Author final version | 238 KB | Adobe Acrobat PDF
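
The Description field says that a genetic algorithm explores a defined space of CNN architectures while optimising hyperparameters at the same time. As a rough illustration of that general idea only, the sketch below evolves a small population of candidate configurations. It is not the fast-CNN method from the paper: the search space, the `surrogate_fitness` placeholder, and all function names and parameter values are assumptions made for this example. A real run would replace the placeholder with training each candidate network (for example on CIFAR10) and returning its validation accuracy.

```python
import random

# Hypothetical search space: the record does not list the paper's actual
# architecture choices or hyperparameter ranges.
SEARCH_SPACE = {
    "num_conv_layers": [2, 3, 4, 5, 6],
    "filters": [16, 32, 64, 128],
    "kernel_size": [3, 5],
    "learning_rate": [0.1, 0.01, 0.001],
}


def random_genome(rng):
    """Sample one candidate architecture/hyperparameter configuration."""
    return {key: rng.choice(options) for key, options in SEARCH_SPACE.items()}


def surrogate_fitness(genome):
    """Placeholder score (assumption), higher is better.

    Stands in for the expensive step of training a CNN and measuring
    validation accuracy; it simply prefers one arbitrary configuration.
    """
    score = 0.0
    score -= abs(genome["num_conv_layers"] - 4)
    score -= abs(genome["filters"] - 64) / 64
    score -= 0.5 * (genome["kernel_size"] != 3)
    score -= abs(genome["learning_rate"] - 0.01) * 10
    return score


def crossover(parent_a, parent_b, rng):
    """Uniform crossover: each gene is copied from one parent at random."""
    return {key: rng.choice([parent_a[key], parent_b[key]]) for key in SEARCH_SPACE}


def mutate(genome, rng, rate=0.2):
    """Resample each gene from the search space with probability `rate`."""
    return {
        key: (rng.choice(SEARCH_SPACE[key]) if rng.random() < rate else value)
        for key, value in genome.items()
    }


def evolve(pop_size=12, generations=15, elite=2, seed=0):
    """Run a simple elitist genetic algorithm over the search space."""
    rng = random.Random(seed)
    population = [random_genome(rng) for _ in range(pop_size)]
    history = []  # best fitness seen at the start of each generation
    for _ in range(generations):
        population.sort(key=surrogate_fitness, reverse=True)
        history.append(surrogate_fitness(population[0]))
        parents = population[: pop_size // 2]  # truncation selection
        children = [
            mutate(crossover(rng.choice(parents), rng.choice(parents), rng), rng)
            for _ in range(pop_size - elite)
        ]
        # Elitism: the top candidates survive unchanged.
        population = population[:elite] + children
    best = max(population, key=surrogate_fitness)
    return best, history


if __name__ == "__main__":
    best, history = evolve()
    print("best configuration:", best)
    print("best score per generation:", history)
```

Because the top candidates are carried over unchanged, the best score cannot decrease from one generation to the next. In a real architecture search almost all of the cost lies in the fitness evaluation, so the time needed to evolve a network depends mainly on how cheaply each candidate can be scored.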