Activation function cyclically switchable convolutional neural network model

View article
PeerJ Computer Science

Main article text

 

Introduction

  1. A new model structure called AFCS-CNN has been proposed, which enables cyclical switching of AF.

  2. Unlike the studies in the literature, instead of an AF proposal, the ability to switch the AF with another AF during neural network training has been adopted.

  3. The concept of cyclic AF switching strategy during model training has been introduced.

  4. A first was achieved by designing a model structure that had not been tried before, thanks to instant AF switches during neural network training.

  5. The algorithm of the proposed model structure is designed to allow easy integration of all convolutional neural network (CNN) models into the structure.

  6. Training with the proposed model structure has provided superior success in many problems compared to training with fixed AFs.

  7. A state-of-the-art success has been achieved with the proposed model structure in plant seedling classification.

Materials and Methods

Dataset and preprocessing

Model training parameters and performance metrics

AFCS-CNN model structure

Results and discussion

Ablation studies

Stage 1: Determination of CNN models to be used in the AFCS-CNN model structure

Stage 2: Determination of list_AF array and p0 parameter value

Stage 3: Determination of the p1 parameter value

Expansion experiments

Expansion experiments with plant seedling dataset

Expansion experiments with APTOS 2019 Blindness Detection dataset

Discussion

Conclusion

Supplemental Information

Ablation studies performed with Cifar-10 dataset.

DOI: 10.7717/peerj-cs.2756/supp-2

Trainings performed on V2 Plant Seedling dataset with AFCS-CNN (VGG16) model.

DOI: 10.7717/peerj-cs.2756/supp-3

Trainings performed on V2 Plant Seedling dataset with AFCS-CNN (VGG19) model.

DOI: 10.7717/peerj-cs.2756/supp-4

Trainings performed on V2 Plant Seedling dataset with AFCS-CNN (DenseNet121) model.

DOI: 10.7717/peerj-cs.2756/supp-5

Trainings performed on V2 Plant Seedling dataset with AFCS-CNN (DenseNet169) model.

DOI: 10.7717/peerj-cs.2756/supp-6

Trainings performed on V2 Plant Seedling dataset with AFCS-CNN (EfficientNetV2B0) model.

DOI: 10.7717/peerj-cs.2756/supp-7

Trainings performed on V2 Plant Seedling dataset with AFCS-CNN (EfficientNetV2B1) model.

DOI: 10.7717/peerj-cs.2756/supp-8

Trainings performed on V2 Plant Seedling dataset with AFCS-CNN (ConvNeXtTiny) model.

DOI: 10.7717/peerj-cs.2756/supp-9

Trainings performed on V2 Plant Seedling dataset with AFCS-CNN (ConvNeXtSmall) model.

DOI: 10.7717/peerj-cs.2756/supp-10

Trainings performed on APTOS 2019 Blindness Detection dataset with AFCS-CNN (VGG16) model.

DOI: 10.7717/peerj-cs.2756/supp-11

Trainings performed on APTOS 2019 Blindness Detection dataset with AFCS-CNN (VGG19) model.

DOI: 10.7717/peerj-cs.2756/supp-12

Trainings performed on APTOS 2019 Blindness Detection dataset with AFCS-CNN (DenseNet121) model.

DOI: 10.7717/peerj-cs.2756/supp-13

Trainings performed on APTOS 2019 Blindness Detection dataset with AFCS-CNN (DenseNet169) model.

DOI: 10.7717/peerj-cs.2756/supp-14

Trainings performed on APTOS 2019 Blindness Detection dataset with AFCS-CNN (EfficientNetV2B0) model.

DOI: 10.7717/peerj-cs.2756/supp-15

Trainings performed on APTOS 2019 Blindness Detection dataset with AFCS-CNN (EfficientNetV2B1) model.

DOI: 10.7717/peerj-cs.2756/supp-16

Trainings performed on APTOS 2019 Blindness Detection dataset with AFCS-CNN (ConvNeXtTiny) model.

DOI: 10.7717/peerj-cs.2756/supp-17

Trainings performed on APTOS 2019 Blindness Detection dataset with AFCS-CNN (ConvNeXtSmall) model.

DOI: 10.7717/peerj-cs.2756/supp-18

Additional Information and Declarations

Competing Interests

Author Contributions

Data Availability

Funding

The authors received no funding for this work.

382 Visitors 327 Views 18 Downloads