Introduction

In the realm of renewable energy, images play a pivotal role across various tasks, including energy prediction, fault detection, and power consumption management1,2,3,4. Accurate analysis of solar and other renewable energy imagery enables engineers to optimize energy production, identify anomalies in energy systems, and improve overall efficiency5,6,7. These tasks are essential for advancing sustainable energy solutions and ensuring the reliability of renewable energy systems8.

Despite the critical importance of imagery in these applications, a significant limitation is the availability of high-quality datasets. The scarcity of labeled data poses challenges for effectively training machine learning models9,10. As a result, many researchers and practitioners are hindered in developing robust solutions that can be generalized well in real-world scenarios. This limitation underscores the need for innovative approaches to augment existing datasets and enhance model performance11.

Generative adversarial networks (GANs) have emerged as a powerful tool for data augmentation, enabling the generation of synthetic images that can complement limited datasets. By training on existing data and generating new, realistic samples, GANs can help improve the robustness and accuracy of models used in renewable energy applications. This capability is particularly valuable in instances where data collection is expensive or time-consuming12,13,14.

However, GANs are not without their challenges. Issues such as mode collapse, where the generator produces a limited variety of outputs, and pixel integrity, which refers to maintaining the quality and coherence of the generated images, are significant obstacles. These problems can adversely affect the performance of models that rely on augmented data, leading to suboptimal results in critical applications like fault detection and energy prediction15,16,17. To address these challenges, our approach leverages a novel metaheuristic method to maintain pixel integrity in augmented images. Drawing inspiration from biological systems, specifically the behavior of the pancreas, a new loss function is introduced to promote pixel coherence while enhancing the diversity of the generated samples.

This methodology integrates seamlessly with our GAN architecture, which includes an identity block to stabilize training and ensure consistency in the generated outputs. Mode collapse is a critical issue that often arises during GAN training. This occurs when the generator produces a limited set of outputs, failing to capture the full diversity of the training data. This phenomenon restricts the variability of the generated samples and compromises the overall performance of the models that depend on this synthetic data for training. In the context of renewable energy applications, mode collapse can lead to insufficient representations of diverse scenarios, ultimately affecting tasks such as fault detection and energy prediction.

To mitigate the effects of mode collapse, our proposed architecture incorporates an identity block within the GAN framework. This identity block is a stabilizing mechanism that allows the network to maintain the essential features of the input data while facilitating a smoother gradient flow during training. By preserving the key characteristics of the generated images, the identity block enhances the generator’s ability to produce a more diverse range of outputs. This approach helps combat mode collapse and contributes to improved pixel integrity, ensuring that the augmented images are both varied and coherent. As a result, the inclusion of the identity block plays a vital role in enhancing our GAN architecture’s overall robustness and effectiveness in renewable energy applications.

The proposed methodology introduces a pancreas-inspired metaheuristic loss function to address the challenge of maintaining pixel integrity in the generated images. This approach draws inspiration from the pancreatic regulatory mechanisms, which maintain homeostasis by adjusting insulin levels to control blood sugar.

In biological systems, the pancreas functions as a feedback control system. When blood sugar levels rise, the pancreas releases insulin, which facilitates glucose uptake by cells, thereby lowering blood sugar levels. Conversely, when blood sugar drops, insulin secretion decreases, allowing for glucose release into the bloodstream. This dynamic modulation ensures that blood sugar levels remain within a healthy range, demonstrating a self-regulating mechanism that continuously adapts to changes in the internal environment.

In the context of generative modeling, the generator in a GAN can be viewed as analogous to the pancreas. Its goal is to produce high-quality images that accurately represent the underlying data distribution. However, during training, the generator may struggle with mode collapse, where it produces a limited variety of outputs. This lack of diversity is akin to a failure in the pancreas’ ability to regulate insulin levels effectively, leading to imbalances in output.

The pancreas-inspired metaheuristic loss function stands apart from other biological-inspired optimization techniques, such as genetic algorithms and swarm intelligence, by employing a dynamic feedback mechanism that continuously adjusts to the generator’s performance during training. While traditional approaches often focus on population-based strategies, which evolve solutions over generations without real-time adaptability, the pancreas-inspired method emphasizes maintaining pixel-level integrity and diversity in generated images through immediate corrective actions. This not only fosters a more responsive adaptation to the training dynamics but also ensures that the outputs remain coherent and high-quality. Moreover, unlike many biological metaheuristics that optimize a singular objective, the pancreas-inspired approach integrates multiple facets of image generation—specifically, pixel integrity and diversity—into a holistic framework. This comprehensive strategy allows for a more nuanced optimization process, ultimately enhancing the robustness and effectiveness of the generative model in complex tasks like image synthesis.

This study investigates the potential of GANs and data augmentation techniques to enhance energy output prediction models in renewable energy resources (RER). The goal is to demonstrate how these innovations can transcend existing technological limits, achieving superior benchmarks in performance, variance, and reliability in industry applications. Key distinctions arise in the implementation of these features compared to current methodologies, particularly in the sensitivity of generative models to architectural design and parameter optimization. The proposed modifications are significant enough that, without them, generating models of equivalent caliber would be nearly impossible. While both augmentations are critical for success, challenges in convergence persist, necessitating careful consideration of their integration. This study aims to showcase the effectiveness of the redesigned model with these enhancements, emphasizing the contributions of the novel loss function and the overall advancements relative to existing technologies.

The contribution points are as follows:

  1. Development of a new metaheuristic loss function inspired by the pancreas, focusing on maintaining pixel integrity while promoting diversity in augmented images.

  2. Introduction of a new dual loss function built on this pancreas-inspired metaheuristic loss, combining pixel-integrity preservation with diversity promotion in the augmented images.

  3. Implementation of strategies within the GAN architecture that effectively mitigate mode collapse, ensuring a broader representation of the data in the generated samples and improving pixel integrity in the augmented images, which yields the higher output fidelity required for tasks such as fault detection and energy prediction.

  4. The novel GAN architecture based on the dual loss function outperforms other GAN architectures in mode collapse mitigation, image diversity, and generated image quality.

  5. The new GAN architecture based on the dual loss function helps different downstream architectures improve their performance in segmentation and detection, and may be applied in the future to a wide range of real-world problems, especially in renewable energy.

  6. Evidence of enhanced accuracy and reliability in detection tasks when the generated images are used, supporting the methodology’s practical application in renewable energy systems.

Novelty aspects of the pancreas-inspired loss function and how it differs from other biological techniques:

  1. Pancreas-Inspired Metaheuristic Function: Introduction of a novel loss function inspired by the intelligent behavior of the pancreas, specifically its regulatory mechanisms for maintaining homeostasis. This approach emphasizes maintaining pixel integrity while promoting diversity in the generated images, setting it apart from traditional techniques.

  2. Dynamic Adaptation: Unlike other biological techniques that may rely on fixed rules or patterns, the pancreas-inspired function adapts to variations in the training data, allowing more responsive adjustments during GAN training. This adaptability enhances the generator’s ability to produce more diverse outputs.

  3. Focus on Integrity and Diversity: While many biologically inspired methods prioritize either integrity or diversity, the proposed pancreas-inspired function effectively balances both objectives. This dual focus helps resolve common data augmentation issues, such as mode collapse and pixel coherence, offering a more holistic solution.

  4. Biological Relevance: The pancreas was chosen as an inspiration because of its critical role in regulating biological processes through feedback mechanisms. This provides a unique framework not typically explored in existing GAN methodologies, which often draw inspiration from simpler biological concepts.

  5. Enhanced Learning Efficiency: Our approach mimics the pancreas’ ability to manage complex interactions, allowing for more efficient learning and the generation of high-quality images. This contrasts with other biological techniques that may not effectively capture such dynamic interactions, leading to less optimal training outcomes.

This paper is organized into several key sections to provide a comprehensive understanding of our proposed methodology and its implications for renewable energy applications. Following this introduction, Sect. 2 reviews the relevant literature on GANs, highlighting existing challenges such as mode collapse and dataset limitations. Section 3 details the architecture of the proposed Penca-GAN, including the integration of the identity block and the pancreas-inspired metaheuristic loss function. Section 4 presents the experimental setup, including the datasets used and the evaluation metrics employed to assess the performance of our model. Section 5 discusses the experimental results, demonstrating the effectiveness of the proposed approach in enhancing image diversity and integrity. Finally, Sect. 6 concludes the paper, summarizing the key findings and offering suggestions for future research directions. This structured approach ensures that readers can easily follow the progression of our research and understand the significance of our contributions to the field.

Related work

Generative Adversarial Networks (GANs) have emerged as a pivotal technology for image augmentation, offering a powerful framework for synthesizing high-quality, realistic data. The foundational work by Brophy et al.18 established the adversarial training paradigm, where a generator and discriminator are trained in tandem, inspiring a plethora of subsequent advancements. This adversarial setup has proven to be highly effective in addressing diverse challenges within image generation and augmentation.

The versatility of GANs has led to their widespread adoption across various domains, including image synthesis, style transfer, and data augmentation19,20. Moreover, GANs are increasingly utilized in cybersecurity applications, such as intrusion detection, steganography, password cracking, and anomaly detection, demonstrating their potential in addressing evolving security challenges21,22,23,24.

To enhance the controllability and specificity of the generated outputs, Conditional GANs (cGANs)25 were introduced. By conditioning the generation process on auxiliary information, such as class labels, cGANs enable targeted attribute generation, significantly increasing the relevance of synthetic data in various tasks. In the realm of image-to-image translation, Pix2Pix26 leveraged a cGAN architecture to transform images from one domain to another, relying on paired training data. However, the reliance on paired data limited its applicability in scenarios with scarce paired datasets. CycleGAN27 addressed this limitation by enabling unpaired image-to-image translation, facilitating transformations across domains without requiring corresponding image pairs.

Advancements in high-resolution image generation and style control were realized with the introduction of StyleGAN28 and its successors. These models allow for the disentanglement of high-level attributes and style information, producing highly detailed and realistic images. Complementing this, super-resolution GANs (SRGANs)29 focus on generating high-resolution images from low-resolution inputs, augmenting datasets with enhanced imagery. Further advancements in image super-resolution have been highlighted in recent studies30,31,32,33, demonstrating the transformative potential of GAN-based frameworks for various imaging applications.

For improved interpretability and feature manipulation, InfoGAN34 maximizes the mutual information between the generated images and latent variables, enabling structured and interpretable outputs. Augmented GANs (AugGANs)35,36 integrate data augmentation techniques directly into the GAN architecture, enhancing training and sample diversity.

Addressing training instability, Wasserstein GANs (WGANs)37 utilize the Wasserstein distance as a loss function, providing a more stable training environment. WGAN-GP builds upon this by incorporating a gradient penalty to enforce Lipschitz continuity. Self-Attention GANs (SAGANs)38 introduce self-attention mechanisms to capture long-range dependencies, while Progressive Growing GANs (ProGANs)39 employ a progressive training strategy to improve stability and quality. Boundary Equilibrium GANs (BEGANs)40 balance generator and discriminator training for enhanced image quality and diversity.

Specialized applications of GANs include Semi-Supervised GANs, which leverage both labeled and unlabeled data for high-quality generation, and Cycle Consistency GANs, which enforce consistency across different domains. Colorful Image GANs (CiGANs) focus on colorizing grayscale images, serving as a valuable augmentation tool.

Despite these advancements, several research gaps remain. Many models, such as Pix2Pix and CycleGAN, rely on large-scale datasets, limiting their applicability in data-scarce domains. Interpretability and controllability of the generated outputs remain challenges, with latent space complexity hindering user control. Training stability continues to be a concern, requiring robust training techniques. There is also a need for GANs that can learn from weakly supervised or noisy data, produce diverse outputs, and seamlessly integrate multiple augmentation techniques.

The Self-Attention GAN (SAGAN)38 introduces self-attention mechanisms, allowing the model to capture long-range dependencies within images, which results in more coherent and diverse outputs. Similarly, the Progressive Growing GAN (ProGAN)39 uses a progressive training strategy that gradually increases the complexity of the generated images, significantly improving both stability and quality during training.

In cases where labeled data is scarce, the Semi-Supervised GAN leverages labeled and unlabeled data, enabling the generation of high-quality samples while effectively using the limited labeled data available. The Cycle Consistency GAN emphasizes the importance of consistency across different domains, reinforcing the reliability of the generated images.

The Boundary Equilibrium GAN (BEGAN)40 introduces a boundary equilibrium approach that effectively balances the generator and discriminator training, resulting in enhanced quality and diversity of the generated images. For specialized applications, the Colorful Image GAN (CiGAN) focuses on generating color images from grayscale inputs, serving as a valuable augmentation tool for datasets that lack color information. Table 1 shows recent research on GAN-based approaches for renewable energy applications and their limitations.

Table 1 Recent research on GAN-Based approaches for renewable energy applications.

Research gap

There is a lack of literature focused on the following research aspects:

  (1) GANs such as cGAN, fcGAN, DCGAN, and RNN-based GANs are rarely reported in renewable energy applications. There is a lack of mathematical demonstration and analytical discussion of new labeled GAN architecture-based approaches in renewable energy for complex optimization under Machine Learning (ML) and Internet of Things (IoT) interconnection scenarios. Currently, no widely used method covers and compares these renewable energy problems while considering a new labeled GAN architecture applied to ML and IoT interconnections.

  (2) Industrial engineering system optimization, particularly for renewable energy solutions, is still in its infancy. Existing methods and technologies are few or completely unavailable, leading to a lack of comparisons. Researchers have not yet validated the developed technology with several case studies or real-time practical scenarios, and most current work does not address the compatibility constraints of the input training power systems.

According to the state of the art in the overall literature, the GAN technique, especially recent advanced deep GANs, is a very effective way to learn a generative model of the data distribution, and it has been used successfully in different applications. However, reports in renewable energy applications such as droop parameters, model predictive control realization in renewable generation solutions, and predictive variable-time-step methods are rare, particularly in the IoT context, and the latest review papers on IoT-based power energy systems did not explore these modern methods. As stated in the previous section, a GAN works on four fundamentals and requires ever more samples, essentially driven by the output solution of a single optimization algorithm. Although the recently proposed Panca metaheuristic can handle such large power-based variable dimensions, its GAN-based optimization properties have not yet been well investigated. Thus, the reviewed methods all have shortcomings and are insufficient to solve and optimize the two problems; the stated industrial issues and fields are therefore addressed in this manuscript.

Research questions

The core of the problem and the research questions of the study are framed in the context of the existing literature to ensure that the identified gap is clearly positioned. This review revealed that the novelty of the proposed Penca-GAN lies in two key aspects: the formulation of the loss function using the system identification model for a discrete range of density, and the development of the network architecture incorporating the identity block in the generator for image smoothing.

This research explicitly states the study’s central research question: how can a Dual-GAN with Identity Blocks and Pancreas-Inspired Loss improve the optimization of renewable energy? Generating power scenarios is crucial for renewable energy systems; however, current practices rely on static models. Dynamic models require access to real-time data and are typically proprietary, limiting their broad application. The proposed study addresses these challenges by developing a publicly accessible Dual-GAN that uses historical net power data to simulate future power scenarios.

Through Penca-GAN, advancements to the state of the art are made in two respects. Consequently, the following research questions are formulated:

Research Question 1: Does the proposed Penca-GAN have the efficacy to solve for the optimized parameters of the honeycomb design representing the use of solar energy in Jeddah, Saudi Arabia? If so, how does the AI-based approach compare to benchmark optimization techniques?

Research Question 2: Can the performance of the proposed Penca-GAN be generalized to modeling curb appeal? How does Penca-GAN compare to state-of-the-art GAN and deep learning models?

Methodology

The proposed methodology for discriminating between real and fake images consists of several key components working together to achieve the desired objective. At the system’s core is the Discriminator module, which is responsible for classifying input images as either genuine or forged. The system employs two distinct input sources to provide the discriminator with a diverse and comprehensive set of training data. The first is the Energy-Image Generator, which takes the original input image and generates a series of “energy images” that capture various visual features and characteristics of the image. These energy images form a multifaceted input, allowing the discriminator to learn from a richer set of image representations.

Complementing the real image data, the system also includes a Fake Images block, which generates synthetic or manipulated images designed to mimic the properties of the genuine images. By exposing the discriminator to both real and fake samples during training, the system ensures that the model learns to identify the subtle yet critical differences between authentic and forged visual data.

To further enhance the discriminator’s ability to discern real from fake, the methodology incorporates a specialized loss function, the Penca-loss (Loss II), which checks pixel integrity. This loss function evaluates the consistency and plausibility of the individual pixels within the input images, providing a granular assessment of image authenticity. By minimizing the Penca-loss, the system guides the discriminator to prioritize the detection of pixel-level manipulations, which are often indicative of image forgery.

The overall Loss I block combines the Penca-loss function with other relevant metrics, such as classification accuracy or adversarial Loss, to create a comprehensive loss function that the system aims to minimize during training. This holistic loss function allows the discriminator to optimize its performance in distinguishing between real and fake images, leveraging the various sources of information and feedback provided by the different components of the methodology. By integrating the Energy-Image Generator, Fake Images, Discriminator, Penca-loss function, and the overall Loss I, the proposed methodology creates a powerful and nuanced system capable of accurately identifying forged or manipulated images, with potential applications in areas such as image authentication, quality control, and digital forensics.

Figure 1 shows the block diagram of the methodology. The first component is the Energy-Image Generator. This module takes the input image and generates a set of “energy images” that capture the original image’s different visual features and characteristics. The energy images represent the input image in ways that highlight specific aspects, such as textures, edges, or color distributions. By generating these energy images, the system can extract a more comprehensive set of features from the input data, which can aid in the discrimination between real and fake images. Next, the Fake Images block is responsible for generating synthetic or manipulated images that mimic the characteristics of the real images. These fake images are created using various techniques, such as GANs or other image synthesis methods. The purpose of including these fake images in the training process is to expose the discriminator model to a diverse set of real and artificial examples so that it can learn to distinguish between them effectively.

The discriminator is the core component of the system. This module takes both real and fake images as inputs and is trained to classify them as either genuine or forged. Through the training process, the discriminator learns to identify the subtle differences between real and fake images, developing a robust understanding of the visual cues and characteristics that distinguish authentic images from manipulated ones.

The Penca-loss function (Loss II), which checks pixel integrity, is a specialized loss function that measures the integrity and consistency of individual pixels within the images. It helps the system assess the fidelity and authenticity of the images by evaluating the coherence and plausibility of the pixel-level information. By minimizing this loss, the system can better identify instances where the pixel-level characteristics have been altered or manipulated, which is a key indicator of a fake or forged image.

The Loss I block represents the overall loss function that the system minimizes during training. This loss function combines the Penca-loss with other relevant metrics, such as classification accuracy or adversarial loss, to comprehensively assess the system’s performance. The goal is to train the discriminator model to minimize this loss, ultimately enhancing its ability to distinguish between real and fake images. The Real Sample block represents the authentic, real-world images that are fed into the system alongside the fake images generated earlier. These real samples serve as the ground truth for the training process, allowing the discriminator to learn the characteristics and patterns of the genuine images. The “+” block combines the Penca-loss and the real-sample term to produce the overall Loss I, which the system minimizes during training. This integration of the loss function and the real sample data allows the system to effectively optimize the discriminator’s performance in distinguishing between real and fake images.
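The exact composition of Loss I is specified by the equations given later; as a minimal sketch of how such a combined objective could be assembled in PyTorch, the snippet below pairs a standard adversarial (binary cross-entropy) term with a hypothetical penca_loss callable and an illustrative weighting factor lambda_penca, both of which are assumptions rather than the paper's exact formulation.

```python
import torch
import torch.nn.functional as F

def discriminator_step(discriminator, real_images, fake_images, penca_loss, lambda_penca=1.0):
    """Hedged sketch of the combined objective ("Loss I").

    `penca_loss` is assumed to be a callable implementing the pixel-integrity
    term ("Loss II", Eq. 1); its exact form and the weighting `lambda_penca`
    are illustrative assumptions.
    """
    # Adversarial (real/fake) classification term.
    real_logits = discriminator(real_images)
    fake_logits = discriminator(fake_images.detach())
    adv_loss = (
        F.binary_cross_entropy_with_logits(real_logits, torch.ones_like(real_logits))
        + F.binary_cross_entropy_with_logits(fake_logits, torch.zeros_like(fake_logits))
    )

    # Pixel-integrity term ("Loss II") evaluated on the generated batch.
    integrity_loss = penca_loss(real_images, fake_images)

    # "Loss I": adversarial term plus the weighted Penca term.
    return adv_loss + lambda_penca * integrity_loss
```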

Fig. 1 Methodology block diagram.

Dataset description

The methodology uses three distinct datasets to enhance the robustness and applicability of the image discrimination system. The first dataset focuses on solar panels47, comprising a diverse collection of images showcasing various solar panel installations in real-world environments. This dataset is categorized into six distinct classes: Physical Damage, Electrical Damage, Snow Cover, Cleanliness, Dust Accumulation, and Bird Droppings. Each class contains 69 images, resulting in a total of 414 images, all captured at a resolution of 1024 × 768 pixels. The acquisition process involved capturing images in various environments, including residential rooftops, commercial solar farms, and remote locations, under different weather conditions such as sunny, cloudy, and rainy, ensuring a comprehensive dataset. The class distribution is balanced, with each category represented equally, which is crucial for reducing potential bias in the classification model. The preprocessing steps included normalization of pixel values to a standard range, as well as augmentation techniques to enhance model robustness. The dataset was split into training (70%), validation (15%), and test (15%) sets, ensuring effective model evaluation and training. Figure 2 shows samples of the dataset.
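A minimal sketch of the split and preprocessing described above is shown below, assuming a hypothetical folder layout data/solar_panels with one sub-folder per fault class; the working resolution and the specific augmentation choices are illustrative assumptions, not the exact pipeline used in the paper.

```python
import torch
from torchvision import datasets, transforms

# Normalization to a standard range plus light augmentation, as described above.
transform = transforms.Compose([
    transforms.Resize((256, 256)),          # working resolution is an assumption
    transforms.RandomHorizontalFlip(),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5]),
])

# Hypothetical folder layout: one sub-folder per fault class.
full_set = datasets.ImageFolder("data/solar_panels", transform=transform)

# 70 / 15 / 15 split into training, validation, and test sets.
n_total = len(full_set)
n_train = int(0.70 * n_total)
n_val = int(0.15 * n_total)
n_test = n_total - n_train - n_val
train_set, val_set, test_set = torch.utils.data.random_split(
    full_set, [n_train, n_val, n_test], generator=torch.Generator().manual_seed(0)
)
```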

The second dataset consists of sky images48, specifically curated to represent solar radiation conditions relevant to solar energy generation. This dataset encompasses high-resolution images (1920 × 1080 pixels) depicting various atmospheric phenomena, categorized into six classes: Clear Skies, Partly Cloudy, Overcast, Sunny, Cloudy with Sunbeams, and Stormy Weather. Each class contains 100 images, leading to a total of 600 images. Images were captured across various geographic locations at different times of the day and under various weather conditions to ensure comprehensive representation of sky states. The class distribution is as follows: 100 images for Clear Skies, 100 for Partly Cloudy, 100 for Overcast, 100 for Sunny, 100 for Cloudy with Sunbeams, and 100 for Stormy Weather. Preprocessing steps involved resizing for uniformity and color normalization to adjust for variations in lighting conditions. Additional augmentation techniques, such as brightness adjustment and contrast enhancement, were applied to improve model training effectiveness. By analyzing these images, the system can better assess the authenticity of solar panel images and their relevance to solar radiation data.

The third dataset focuses on wind turbines49, featuring images illustrating both onshore and offshore wind turbine setups. This dataset includes six distinct classes representing common fault types: Blade Damage, Gearbox Failure, Electrical Issues, Structural Damage, Bearing Failure, and Control System Malfunction. Each class contains 150 images, resulting in a total of 900 images captured at a resolution of 1280 × 720 pixels. Images were taken during various operational scenarios, including normal operation and fault conditions, captured during different times of the day and under various weather conditions to reflect real-world variability. The class distribution includes 150 images for each fault type, ensuring that the model is exposed to a balanced representation of each category. Preprocessing steps included noise reduction and contrast adjustment to enhance image clarity, alongside augmentation techniques such as rotation and cropping to artificially increase the dataset size. These three datasets provide a rich and varied foundation for training the image discrimination system, ensuring its effectiveness in distinguishing real from fake images across different renewable energy contexts. This comprehensive approach ultimately contributes to improved monitoring and management of solar and wind energy resources, with the availability of these datasets clearly stated to facilitate reproducibility and further research efforts. Figure 3 shows the distribution of each class for the three different datasets.

Fig. 2 Samples of the solar panel faults dataset.

GANs with an identity block

In the proposed methodology, the Generative Adversarial Network (GAN) plays a crucial role in generating and processing energy images, particularly by incorporating an identity block. This architecture enhances the model’s ability to maintain important data characteristics while producing high-quality synthetic images.

Fig. 3 Distribution of classes for the datasets used.

The GAN consists of two primary components: The Generator and the Discriminator. The Generator is responsible for creating synthetic images, while the discriminator evaluates the authenticity of both the real and generated images. In this methodology, the Generator is specifically designed to produce energy images that capture the essential features of the original input images, such as textures, patterns, and relevant visual information. The identity block within the Generator serves as a critical mechanism for preserving the identity of the input image during the transformation process. This block allows the Generator to retain key features and spatial relationships, ensuring that the generated energy images closely resemble the original images. By incorporating skip connections, the identity block facilitates the flow of information, enabling the model to effectively combine low- and high-level features. This architecture helps to mitigate the risk of losing important details that could aid in the discriminator’s ability to distinguish between real and fake images.

The processing of the energy images begins with the input image being passed through the Generator. As the image is transformed into an energy representation, the identity block ensures that essential features are preserved while enhancing other visual aspects relevant to the task. The resulting energy images are then fed into the discriminator, which evaluates their authenticity alongside real images from the training datasets.

The GAN architecture employed in this methodology is designed to effectively generate energy images while preserving the essential features from the input data. This architecture consists of two primary components: The Generator and the Discriminator, each serving distinct but complementary roles in image generation and discrimination. The Generator is structured to transform input images into energy images that encapsulate key visual features. Central to this design is the inclusion of an identity block, which employs skip connections to facilitate the seamless flow of information between layers. This allows the Generator to retain critical details from the original input while enhancing other features relevant to the task. By maintaining the identity of the input image, the Generator ensures that the generated energy images closely resemble their authentic counterparts, thereby improving the quality and reliability of the synthetic outputs.

Multiple convolutional layers are employed in the Generator to extract features from the input images. These layers progressively downsample the image, capturing a range of textures and patterns. The identity block intervenes at various points in the architecture, enabling the model to combine low-level features with higher-level abstractions. This results in energy images that reflect the original image’s characteristics and emphasize the relevant visual information necessary for effective discrimination.

The discriminator, on the other hand, evaluates the authenticity of both the real and generated images. It employs a series of convolutional layers that progressively downsample the input images, extracting hierarchical features that help distinguish genuine images from synthetic ones. The discriminator’s architecture is designed to be robust, allowing it to learn the subtle differences between real and fake images effectively. It outputs a probability score indicating whether an input image is real or generated.

Together, the Generator and Discriminator create a dynamic adversarial training process. The Generator learns to produce increasingly realistic energy images, while the discriminator continuously improves its ability to detect forgeries. This interplay fosters a cycle of improvement, where both components enhance each other’s performance over time. The architecture is optimized through a comprehensive loss function that combines the Penca-loss for pixel integrity with other relevant metrics, ensuring that the system focuses on generating high-quality images and maintaining their authenticity. Overall, with its identity block, this GAN architecture provides a robust framework for generating and processing energy images in the context of renewable energy applications.

The architecture of the IGAN generator, as detailed in Table 2, consists of a sequence of layers designed to transform noise input into a high-resolution image. The initial layer reshapes a noise vector of size 100 × 1 × 1 into a tensor of size 4 × 1 × 1. This is followed by a series of transposed convolution layers, which progressively increase the spatial dimensions of the feature maps while reducing the number of channels. For instance, the first transposed convolution layer expands the tensor to 1024 × 4 × 4, using learnable weights W. Batch normalization and ReLU activation are applied to stabilize training and introduce non-linearity, respectively. This pattern continues through multiple layers, with the output size gradually increasing to 3 × 64 × 64, corresponding to the final generated image. The last layer employs a Tanh activation function to produce pixel values in the range [−1, 1], resulting in the final output labeled Fake_Image. The architecture is visually represented in Fig. 4, which illustrates the data flow through the generator.

Conversely, the architecture of the IGAN discriminator, outlined in Table 3, operates as a binary classifier to distinguish between real and generated images. The input to the discriminator is either a real image or a fake image of size 3 × 64 × 64. The model employs a series of convolutional layers, starting with a strided convolution that reduces the spatial dimensions while increasing the depth of the feature maps. Notably, Leaky ReLU activations are used after each convolutional layer to allow a small, non-zero gradient when the unit is not active. The architecture concludes with a reshaping layer that transforms the output into a binary classification, indicating whether the input image is real or fake. This dual architecture, combining generator and discriminator networks, is crucial for the effective training of GANs and enhances the model’s ability to generate high-quality images, as depicted in Fig. 5.
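The following PyTorch sketch mirrors the layer progression described above (100-dimensional noise expanded to 1024 × 4 × 4 and upsampled to a 3 × 64 × 64 Tanh output, with a strided-convolution, Leaky-ReLU discriminator). The intermediate channel widths are assumptions, since Tables 2 and 3 are not reproduced here.

```python
import torch
import torch.nn as nn

class IGANGenerator(nn.Module):
    """DCGAN-style sketch of the generator in Table 2: 100-d noise -> 3x64x64 image.
    Intermediate channel widths (512, 256, 128) are assumptions; the text only
    specifies the 1024x4x4 expansion and the 3x64x64 Tanh output."""
    def __init__(self, z_dim: int = 100):
        super().__init__()
        self.net = nn.Sequential(
            nn.ConvTranspose2d(z_dim, 1024, kernel_size=4, stride=1, padding=0),  # -> 1024x4x4
            nn.BatchNorm2d(1024), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(1024, 512, 4, 2, 1),  # -> 512x8x8
            nn.BatchNorm2d(512), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(512, 256, 4, 2, 1),   # -> 256x16x16
            nn.BatchNorm2d(256), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(256, 128, 4, 2, 1),   # -> 128x32x32
            nn.BatchNorm2d(128), nn.ReLU(inplace=True),
            nn.ConvTranspose2d(128, 3, 4, 2, 1),     # -> 3x64x64
            nn.Tanh(),                               # pixel values in [-1, 1]
        )

    def forward(self, z: torch.Tensor) -> torch.Tensor:
        return self.net(z.view(z.size(0), -1, 1, 1))


class IGANDiscriminator(nn.Module):
    """Sketch of the discriminator in Table 3: 3x64x64 image -> real/fake logit."""
    def __init__(self):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(3, 64, 4, 2, 1), nn.LeakyReLU(0.2, inplace=True),    # -> 64x32x32
            nn.Conv2d(64, 128, 4, 2, 1), nn.LeakyReLU(0.2, inplace=True),  # -> 128x16x16
            nn.Conv2d(128, 256, 4, 2, 1), nn.LeakyReLU(0.2, inplace=True), # -> 256x8x8
            nn.Conv2d(256, 1, 8, 1, 0),                                    # -> 1x1x1 logit
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.net(x).view(x.size(0), 1)
```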

Table 2 The architecture of IGAN generator.
Table 3 The architecture of IGAN discriminator.
Fig. 4
figure 4

IGAN generator architecture.

Fig. 5
figure 5

IGAN discriminator architecture.

Architecture of the identity block

The Identity Block is critical in the proposed GAN methodology for renewable energy applications. This architectural component is designed to help maintain the essential features and characteristics of the input images during the generation process, contributing to the overall stability and quality of the synthetic samples.

The Identity Block comprises a repeating pattern of three key layers: a 1 × 1 convolutional layer, a batch normalization layer, and a ReLU activation function. This combination allows the model to preserve important spatial and contextual information from the input while also applying non-linear transformations to enhance the generator’s learning capabilities.
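A minimal sketch of this block is given below, assuming the three-fold repetition of the 1 × 1 convolution, batch normalization, and ReLU pattern and an additive skip connection; the channel count is left as a parameter and the additive form of the skip is an assumption.

```python
import torch
import torch.nn as nn

class IdentityBlock(nn.Module):
    """Sketch of the identity block: three repeats of (1x1 conv -> BN -> ReLU)
    wrapped in an additive skip connection that preserves the input features."""
    def __init__(self, channels: int):
        super().__init__()
        layers = []
        for _ in range(3):
            layers += [
                nn.Conv2d(channels, channels, kernel_size=1),
                nn.BatchNorm2d(channels),
                nn.ReLU(inplace=True),
            ]
        self.body = nn.Sequential(*layers)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Skip connection: the block's output is added back to its input,
        # preserving spatial and contextual information from earlier layers.
        return x + self.body(x)
```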

The proposed methodology can address common issues such as mode collapse and pixel integrity by integrating the identity block within the GAN architecture. The Identity Block acts as a stabilizing mechanism, ensuring that the generated images retain the visual characteristics and diversity required for effective training and deployment in renewable energy tasks. The strategic placement of the Identity Block allows the generator to learn a more robust and coherent mapping from the input to the output space. This, in turn, improves the fidelity of the generated samples, making them more suitable for downstream applications like fault detection, energy prediction, and consumption analysis.

Pancreas-inspired metaheuristic loss function

The proposed methodology introduces a novel pancreas-inspired metaheuristic loss function to address the challenge of maintaining pixel integrity in the generated images. This unique approach draws inspiration from the pancreas’s intelligent behavior, which regulates insulin levels to maintain homeostasis in the human body.

Similar to how the pancreas adjusts insulin production to keep blood sugar levels within a healthy range, the pancreas-inspired loss function aims to preserve the coherence and consistency of the generated pixels in the synthetic images. By modeling this biological regulatory mechanism, the loss function encourages the generator to produce samples that exhibit a harmonious balance of pixel-level attributes, ensuring that the generated images remain visually coherent and realistic.

The key insight behind this biologically inspired approach is the recognition that the pancreas operates as an intelligent, self-regulating system that continuously monitors and adapts its output to maintain a stable internal environment. The generator in the GAN architecture can be viewed as an analogous system responsible for producing high-quality images that faithfully represent the underlying data distribution. Just as the pancreas uses feedback loops to modulate insulin secretion, the pancreas-inspired loss function provides a similar control mechanism to guide the generator toward generating images with pixel-level integrity.

Incorporating this metaheuristic loss function into the overall GAN framework addresses common issues such as mode collapse and pixel degradation. By incentivizing the generator to preserve the spatial and contextual relationships between pixels, the loss function helps ensure that the generated samples are diverse and maintain a high degree of visual fidelity. This, in turn, enhances the utility of the synthetic data for downstream applications in the renewable energy domain, where tasks like fault detection and energy prediction rely heavily on the quality and consistency of the input imagery.

The unique aspect of this pancreas-inspired approach lies in its ability to leverage biological principles to tackle the challenges inherent to generative modeling. Drawing inspiration from the regulatory mechanisms observed in natural systems, the proposed methodology introduces a novel and effective way to maintain pixel integrity, ultimately improving the overall performance and robustness of the GAN architecture in renewable energy applications. Algorithm 1 shows the main steps of the pancreas-inspired meta-heuristic loss function.

The Penca-GAN model establishes a profound biomimetic relationship with pancreatic function through its novel loss mechanism. In biological systems, the pancreas maintains glucose homeostasis via a complex feedback loop where beta cells continuously monitor blood glucose concentrations and modulate insulin secretion accordingly. This adaptive regulatory system exhibits remarkable precision in maintaining optimal physiological parameters despite environmental fluctuations. Penca-GAN’s loss function mathematically emulates this biological process by implementing a dynamic regulatory mechanism that monitors pixel-level coherence across generated images. Just as pancreatic beta cells increase insulin production when blood glucose rises above optimal thresholds, the Penca-GAN loss function applies stronger corrective penalties when pixel distributions deviate from natural image statistics. Conversely, when pixel relationships maintain coherence (analogous to normal glucose levels), the regulatory pressure decreases. This biomimetic approach enables a form of “pixel homeostasis” where the generator learns to maintain balanced spatial and contextual relationships between neighboring pixels. The pancreatic insulin regulation system employs multiple signaling pathways with varying temporal dynamics - some responding rapidly to acute changes while others modulate long-term adaptive responses. Similarly, Penca-GAN integrates multi-scale feedback mechanisms operating at different hierarchical levels of the image structure, from local pixel neighborhoods to global compositional elements, mirroring the hierarchical regulatory networks found in biological pancreatic function.

The Pixel Integrity Loss (\(L_{\text{integrity}}\)) ensures that the generated images maintain consistency with the real images at the pixel level. If the difference between a real and a generated pixel exceeds a threshold \(\epsilon\), a penalty is applied for that pixel, as shown in Eq. (1), where \(N\) is the total number of pixels, \(\Delta p_i\) is the pixel difference between the real and generated images at pixel \(i\), \(\mathbb{I}[\Delta p_i > \epsilon]\) is an indicator function that applies the penalty only if \(\Delta p_i\) exceeds the threshold \(\epsilon\), and \(\text{insulin}_{\text{rate}}\) is a dynamic factor controlling the penalty, analogous to insulin regulation in the pancreas.

$$L_{\text{integrity}} = \sum_{i=1}^{N} \text{insulin}_{\text{rate}} \times \left(\Delta p_i\right)^2 \times \mathbb{I}\left[\Delta p_i > \epsilon\right]$$
(1)

Diversity Loss encourages the model to generate diverse images, reducing the risk of mode collapse by comparing the variance of the generated images to a target diversity level, as given in Eq. (2). Here \(L_{\text{diversity}}\) is the diversity loss, which measures how well the generated images meet the specified target diversity level. The \(\max(0,\cdot)\) function ensures that the diversity loss remains non-negative; if the computed value is negative, indicating that the generated images already meet or exceed the target diversity, the loss is set to zero. \(\text{Var}(\text{generated}_{\text{images}})\) is the variance across the pixel values of the generated images, and \(\text{diversity}_{\text{target}}\) is the desired level of diversity that the generated images should achieve, serving as a benchmark for evaluation. The loss is positive only when the variance is below this target, encouraging more diverse outputs.

$$L_{\text{diversity}} = \max\left(0,\ \text{diversity}_{\text{target}} - \text{Var}\left(\text{generated}_{\text{images}}\right)\right)$$
(2)

The total fitness function is the negative of the sum of the pixel integrity and diversity losses. The model’s goal is to minimize this fitness function.

$$\text{Fitness} = -\left(L_{\text{integrity}} + L_{\text{diversity}}\right)$$
(3)
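A minimal PyTorch sketch of Eqs. (1)–(3) follows; pairing real and generated pixels element-wise and computing the variance over the whole generated batch are assumptions about how \(\Delta p_i\) and \(\text{Var}(\text{generated}_{\text{images}})\) are evaluated.

```python
import torch

def pixel_integrity_loss(real: torch.Tensor, fake: torch.Tensor,
                         insulin_rate: float, epsilon: float) -> torch.Tensor:
    """Eq. (1): penalize squared pixel differences that exceed the threshold
    epsilon, scaled by the dynamic insulin_rate factor."""
    delta = (real - fake).abs()
    mask = (delta > epsilon).float()          # indicator I[Δp_i > ε]
    return (insulin_rate * delta.pow(2) * mask).sum()

def diversity_loss(fake: torch.Tensor, diversity_target: float) -> torch.Tensor:
    """Eq. (2): positive only when the variance of the generated batch falls
    below the target diversity level."""
    return torch.clamp(diversity_target - fake.var(), min=0.0)

def fitness(real, fake, insulin_rate, epsilon, diversity_target) -> torch.Tensor:
    """Eq. (3): negative sum of the two losses."""
    return -(pixel_integrity_loss(real, fake, insulin_rate, epsilon)
             + diversity_loss(fake, diversity_target))
```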

Penca-GAN distinguishes itself as an efficient generative adversarial network by leveraging a novel metaheuristic inspired by the pancreas, enhancing training stability and output diversity. Unlike traditional metaheuristic algorithms that often focus solely on optimizing a specific objective function, Penca-GAN incorporates a holistic approach that addresses multiple facets of image generation. This includes maintaining pixel integrity while simultaneously promoting diversity in the generated samples. The unique design of the Penca-GAN facilitates the effective mitigation of mode collapse, a common challenge in standard GAN implementations. By integrating these elements, Penca-GAN achieves superior image quality and demonstrates robustness across varied datasets, making it a more versatile and effective solution compared to other metaheuristic-inspired algorithms. Its empirical validation through comprehensive performance metrics further underscores its capabilities, setting a new benchmark in generative modeling.

Algorithm 1 Main steps of the pancreas-inspired metaheuristic loss function.
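Algorithm 1 itself is not reproduced here; the sketch below illustrates one plausible reading of its feedback step, in which the dynamic insulin_rate factor is raised when many pixels deviate beyond \(\epsilon\) and relaxed otherwise, echoing the insulin analogy described above. The step size, bounds, and the 50% deviation threshold are illustrative assumptions.

```python
import torch

def update_insulin_rate(insulin_rate: float, real: torch.Tensor, fake: torch.Tensor,
                        epsilon: float, step: float = 0.05,
                        rate_min: float = 0.1, rate_max: float = 10.0) -> float:
    """Hedged sketch of the feedback step implied by Algorithm 1: raise the
    regulatory pressure when a large fraction of pixels deviate beyond epsilon,
    relax it when pixel relationships stay coherent."""
    deviation_fraction = ((real - fake).abs() > epsilon).float().mean().item()
    if deviation_fraction > 0.5:   # "blood sugar too high" -> secrete more insulin
        insulin_rate = min(insulin_rate * (1.0 + step), rate_max)
    else:                          # coherent pixels -> lower the regulatory pressure
        insulin_rate = max(insulin_rate * (1.0 - step), rate_min)
    return insulin_rate
```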

Experimental results and discussion

The experimental evaluation employs three distinct datasets: sky, solar panel, and wind turbine images. Our proposed model’s performance is compared against eleven baseline architectures: Wasserstein GAN (WGAN), Cycle-GAN, StyleGAN, Conditional GAN (cGAN), Progressive GAN (ProGAN), Self-Attention GAN (SAGAN), 8-connected Pixel Identity GAN with Neutrosophic (ECP-IGANN), GAN with identity blocks inspired by menstrual cycle behavior for missing pixel imputation (MCI-GAN), GAN-based sperm-inspired pixel imputation (GSIP), Latent Diffusion Models (LDM), and Augmented GAN (AugGAN). This section provides a comprehensive comparison of the image generation quality, model stability, and overall performance across the different architectures. Additionally, the effectiveness of mode collapse mitigation is assessed for each of the compared models on the datasets, demonstrating how the proposed model addresses this common issue in generative tasks across diverse domains.

An ablation study was conducted to investigate the contributions of the individual components of our proposed model. This study isolated and evaluated the impact of various model features on performance across the three datasets: sky, solar panel, and wind turbine images.

The experimental results also present a case study on detecting fault damage in solar panels before and after applying our architecture, using seven detection models: Vision Transformers (ViT)50, MobileNet-v351, YOLO-v652, Inception-v353, ResNet-10154, SPF-Net55, and VGG-1656. A second case study addresses wind turbine fault detection.

Diversity of the augmented images

This subsection compares our architecture with the other GAN architectures in terms of the diversity of the augmented images using FID and IS, computed with Eqs. (4) and (5), respectively57,58. The term \(\mu_r\) represents the mean of the real images’ feature representations, while \(\mu_g\) denotes the mean of the generated images’ feature representations. The expression \(\lVert \mu_r - \mu_g \rVert^2\) calculates the squared Euclidean distance between these two means, providing a measure of how closely the generated images resemble the real images. The term \(\Sigma_r\) is the covariance matrix of the real images and \(\Sigma_g\) is the covariance matrix of the generated images; the expression \(2(\Sigma_r \Sigma_g)^{1/2}\) accounts for the interaction between the distributions of the real and generated images, and \(\mathrm{Tr}(\cdot)\) denotes the trace, which aggregates the covariance difference into the overall distance metric. Together, these components quantify the similarity between the distributions of the real and generated images.

The notation \(\mathbb{E}_{x \sim G}\) indicates the expectation taken over the distribution of the generated images \(G\). The term \(D_{KL}\left(p(y \mid x)\,\|\,p(y)\right)\) represents the Kullback-Leibler divergence between the conditional label distribution \(p(y \mid x)\), which indicates the probability of the label \(y\) given an image \(x\), and the marginal distribution \(p(y)\), which represents the overall probability of the labels. This divergence measures the amount of information lost when approximating the true distribution \(p(y)\) with \(p(y \mid x)\). The exponential function \(\exp(\cdot)\) transforms the expected Kullback-Leibler divergence into a score that reflects both the diversity and quality of the generated images, with higher scores indicating better performance. Together, these equations provide valuable metrics for evaluating the quality and diversity of the generated images in generative models.

$$FID = \lVert \mu_r - \mu_g \rVert^2 + \mathrm{Tr}\left(\Sigma_r + \Sigma_g - 2\left(\Sigma_r \Sigma_g\right)^{1/2}\right)$$
(4)
$$IS(G) = \exp\left(\mathbb{E}_{x \sim G}\left[D_{KL}\left(p(y \mid x)\,\|\,p(y)\right)\right]\right)$$
(5)
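A minimal sketch of how Eqs. (4) and (5) can be computed once Inception-v3 features and class probabilities are available; the feature-extraction step itself is assumed and omitted.

```python
import numpy as np
from scipy import linalg

def fid(mu_r, sigma_r, mu_g, sigma_g):
    """Eq. (4): Fréchet Inception Distance from the feature means and covariances
    of the real and generated images."""
    diff = mu_r - mu_g
    covmean, _ = linalg.sqrtm(sigma_r @ sigma_g, disp=False)
    covmean = covmean.real                      # discard tiny imaginary parts
    return diff @ diff + np.trace(sigma_r + sigma_g - 2.0 * covmean)

def inception_score(p_yx, eps=1e-12):
    """Eq. (5): IS from the class-probability matrix p(y|x) of the generated
    images (one row per image); the marginal p(y) is the row average."""
    p_y = p_yx.mean(axis=0, keepdims=True)
    kl = (p_yx * (np.log(p_yx + eps) - np.log(p_y + eps))).sum(axis=1)
    return float(np.exp(kl.mean()))
```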

The comparison of various GANs based on their Fréchet Inception Distance (FID) scores across three different datasets is detailed in Table 4. The results indicate that Penca-GAN achieved the lowest FID scores across all datasets, with values of 164.45 for sky images, 113.54 for solar images, and 109.34 for wind turbine images. This underscores Penca-GAN’s effectiveness in generating high-fidelity synthetic imagery, as lower FID scores indicate closer alignment with the real data distribution. In contrast, other models such as cGAN and SAGAN exhibited the highest FID scores, with values of 254.37 for solar images and 243.43 for sky images, respectively, suggesting significant shortcomings in their ability to generate realistic images for these datasets. The variation in performance is notable, particularly with traditional models such as AugGAN and ProGAN, which also showed higher FID scores, indicating greater distances from the real data distributions. Figure 6 visually illustrates these performance comparisons, emphasizing the superior performance of the Penca-GAN in generating images across all evaluated categories. The trend in the data clearly demonstrates the advancements in GAN architecture, with newer models like Penca-GAN significantly outperforming their predecessors in terms of image quality and realism.

The hyperparameters used in the various image augmentation GANs are critical for optimizing performance across the different architectures. For example, AugGAN is configured with a learning rate of 0.0002 and a batch size of 16–32, utilizing the Adam optimizer with specific beta values (β₁ = 0.5, β₂ = 0.999) and an adversarial loss combined with task-specific objectives. This model features a latent space dimension of 100–256 and is trained for 200 epochs, employing augmentation techniques such as rotation, flipping, and color jittering. In contrast, SAGAN operates with a lower learning rate of 0.0001 and a larger batch size of 32–64, leveraging hinge loss and self-attention modules to enhance feature representation, also trained for 200 epochs. ProGAN employs a decreasing learning rate starting at 0.001 and uses progressive growing, allowing for training on lower-resolution images before gradually increasing resolution, while cGAN is designed with a learning rate of 0.0002 and a batch size of 64–128, focusing on binary cross-entropy and L1 loss functions, also trained for 200 epochs. StyleGAN features a learning rate of 0.001, with a more complex architecture that includes a mapping network and synthesis network, trained with 25,000 images per resolution. LDM (Latent Diffusion) is unique, with a very low learning rate range of 0.00001 to 0.0001, a batch size of 32–256, and a significantly higher training duration of 1 million to 2 million steps, emphasizing its thorough training process.

Penca-GAN stands out with its perceptual and content-aware loss terms built around the pancreas-inspired pixel-integrity and diversity objectives, and is trained with a learning rate of 0.0001 and a batch size of 24 for 200 epochs. Each model’s specific hyperparameters reflect a tailored approach to balancing learning efficiency, stability, and the quality of the generated images, ultimately contributing to advancements in image augmentation through GANs. Penca-GAN achieves promising results compared to ECP-IGANN59, MCI-GAN60, and GSIP-GAN61.
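For concreteness, the Penca-GAN settings quoted above could be collected into a configuration such as the following; the entries marked as assumed are illustrative values not reported in the text.

```python
# Hypothetical training configuration mirroring the Penca-GAN settings quoted above.
penca_gan_config = {
    "learning_rate": 1e-4,
    "batch_size": 24,
    "epochs": 200,
    "optimizer": "adam",
    "betas": (0.5, 0.999),        # assumed; the text quotes these values for AugGAN
    "latent_dim": 100,            # matches the generator input in Table 2
    "epsilon": 0.1,               # pixel-integrity threshold ε (assumed value)
    "diversity_target": 0.05,     # target variance for Eq. (2) (assumed value)
    "insulin_rate_init": 1.0,     # initial dynamic penalty factor (assumed value)
}
```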

Table 4 Fid-based comparison of GAN architectures across three different datasets.
Fig. 6 Performance comparison of GAN architectures across different datasets in terms of FID.

The performance of the generative models can be further analyzed through the descriptive statistics of the Fréchet Inception Distance (FID) scores presented in Table 5. The mean FID scores indicate that sky images had the highest average score at 198.1, suggesting a greater distance from the real data distribution compared to the other image types. Solar images followed with a mean FID score of 145.78, while wind turbine images had the lowest mean at 133.81. The standard deviations for these scores reflect variability, with solar images showing the highest variability (35.98) and wind turbine images the lowest (24.03). The range of scores also highlights significant disparities, particularly in sky images (89.92) versus wind turbine images (64.19), indicating that the generative models struggled more with accurately producing sky imagery.

Table 6 provides the results of the Bonferroni post hoc tests applied to the FID scores, revealing significant differences between the image types. The mean differences indicate that sky images had a notably higher FID score compared to both solar images (52.33) and wind turbine images (64.29), with p-values less than 0.001, underscoring the challenges in generating realistic sky images. The comparison between the solar and wind turbine images showed a smaller but statistically significant difference (11.97) with a p-value of 0.021. The confidence intervals for these differences reinforce the findings, indicating that the variations in the FID scores are not only statistically significant but also meaningful in assessing the quality of the generated images across the evaluated datasets.
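
The exact statistical pipeline is not reproduced here, but a minimal sketch of pairwise comparisons with a Bonferroni correction, assuming independent-samples t-tests over per-model FID scores grouped by image type, looks as follows; the grouping keys are hypothetical labels.

```python
from itertools import combinations
import numpy as np
from scipy import stats

def bonferroni_pairwise(groups: dict) -> dict:
    """Pairwise t-tests with Bonferroni-adjusted p-values.
    `groups` maps an image type (e.g. 'sky', 'solar', 'wind') to its FID scores."""
    pairs = list(combinations(groups, 2))
    n_tests = len(pairs)                      # correction factor
    results = {}
    for a, b in pairs:
        _, p = stats.ttest_ind(groups[a], groups[b])
        results[(a, b)] = {
            "mean_diff": float(np.mean(groups[a]) - np.mean(groups[b])),
            "p_adjusted": min(1.0, p * n_tests),
        }
    return results
```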

Table 5 Descriptive statistics of the FID score.
Table 6 Bonferroni post-hoc-tests of the FID score.

The performance of various GANs in generating images across three different datasets is summarized in Table 7, which presents the Inception Scores (IS) for sky images, solar images, and wind turbine images. The results indicate that Penca-GAN achieved the highest IS across all datasets, with scores of 71.43 for sky images, 87.65 for solar images, and 90.32 for wind turbine images. This positions Penca-GAN as a leading architecture in terms of generating high-quality synthetic imagery. Other models, such as LDM, GSIP-GAN, and MCI-GAN, also performed well, with LDM scoring 68.83, 89.65, and 91.19 for the respective datasets. Notably, the traditional models, including AugGAN and SAGAN, exhibited lower IS scores, with maximum values of 74.54 and 77.45, respectively, indicating their limitations in generating high-fidelity images compared to more advanced architectures. The progressive improvement in scores from models like cGAN (60.45 for sky images) to WGAN (69.56 for sky images) and Cycle-GAN (63.43 for sky images) illustrates the evolving capabilities of GAN architectures to produce more realistic and coherent outputs. Figure 7 visually represents these performance comparisons, clearly illustrating the upward trend in Inception Scores as newer models are introduced and culminating in the superior performance of Penca-GAN, further emphasizing its effectiveness in generating high-quality synthetic imagery across all tested datasets.
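
For completeness, the sketch below shows the standard single-split form of the Inception Score, computed from the class probabilities an Inception classifier assigns to generated images. Whether the study used split averaging or a particular classifier checkpoint is not specified, so this is a reference implementation only.

```python
import numpy as np

def inception_score(probs: np.ndarray, eps: float = 1e-12) -> float:
    """Single-split Inception Score from softmax outputs of shape (n_images, n_classes)."""
    p_y = probs.mean(axis=0, keepdims=True)                    # marginal class distribution
    kl = probs * (np.log(probs + eps) - np.log(p_y + eps))     # KL(p(y|x) || p(y)) per image
    return float(np.exp(kl.sum(axis=1).mean()))
```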

Table 7 IS-based comparison of GAN architectures across three different datasets.
Fig. 7

Performance comparison of GAN architectures across different datasets in terms of IS.

The performance of the generative models across different image types can be quantitatively assessed using various metrics, as detailed in Tables 8 and 9. Table 8 presents the descriptive statistics of the Inception Score (IS) for the sky, solar, and wind turbine images. The mean IS values indicate that wind turbine images had the highest average score at 83.3, followed by solar images at 78.94 and sky images at 62.69. The standard deviations were relatively consistent, with solar images exhibiting the highest variability (8.89) compared with wind turbine images (4.99) and sky images (7.5). The minimum and maximum IS values further illustrate the range of performance, with wind turbine images showing the narrowest range (16.65) compared with sky images (23.56) and solar images (28.89). The means, combined with their respective standard deviations, suggest that while all image types showed a degree of variability, the wind turbine images were generated with the highest overall fidelity.

Table 9 summarizes the results of the Bonferroni post hoc tests applied to the Inception Score (IS) comparisons between image types. The mean differences indicate statistically significant variances, particularly between sky images and both solar images (−16.25) and wind turbine images (−20.6), with p-values less than 0.001, highlighting the substantial differences in quality. The comparison between the solar and wind turbine images showed a smaller yet significant difference (−4.36) with a p-value of 0.017. The confidence intervals further reinforce these findings, suggesting that the differences in generative model performance are not only statistically significant but also practically meaningful across the evaluated datasets.

Table 8 Descriptive statistics of the IS.
Table 9 Bonferroni post-hoc-tests of IS.

Table 10 presents the results of an ablation test comparing various GAN models based on their FID and IS across three datasets: sky images, solar images, and wind turbine images. Regarding FID, the standard GAN model exhibited relatively high scores, with values of 230.43 for sky images, 190.43 for solar images, and 179.65 for wind turbine images, indicating a lower quality of generated images. Notably, incorporating an identity block and a metaheuristic loss function significantly improved performance, with the FID scores dropping to 175.65 and 174.54, respectively. However, the proposed model demonstrated the best results, achieving FID scores of 164.45, 113.54, and 109.34, highlighting its effectiveness in generating higher-quality images.

Similarly, in the IS evaluation in Table 11, the baseline GAN scored 54.54 for sky images, 64.65 for solar images, and 78.56 for wind turbine images. Integrating the identity block and the metaheuristic loss function improved the IS scores to 66.65 and 64.54, respectively. Yet again, the proposed model excelled, with IS scores of 71.43 for sky images, 87.65 for solar images, and 90.32 for wind turbine images, reinforcing its capacity to produce diverse and realistic images across all datasets. These findings underscore the significant enhancements achieved by the proposed model, highlighting its superior performance compared to the baseline and modified architectures.

Table 10 FID ablation test results.
Table 11 IS ablation test results.

Mode collapse mitigation results

Table 12 presents the results of evaluating mode collapse mitigation across various GAN architectures on three datasets: sky images, solar images, and wind turbine images. Mode collapse was assessed by measuring the diversity of the generated outputs using two key metrics: the IS and the FID. The Inception Score evaluates the quality and diversity of the generated images by analyzing the class probabilities assigned to the generated samples by an Inception model, while the Fréchet Inception Distance quantifies the distance between the distributions of real and generated images in feature space, providing insight into how similar the generated outputs are to the real data. Specifically, dataset-specific thresholds were established on IS and FID to assess the quality, diversity, and realism of the generated images; for example, the threshold is 60 for sky images and 80 for wind turbine images.

The results of the comparative analysis across the generative models for sky images, solar images, and wind turbine images reveal notable performance distinctions based on this threshold evaluation. AugGAN, SAGAN, ProGAN, and StyleGAN consistently received "Fail" (F) ratings across all datasets, indicating their inadequacy in generating high-quality synthetic images for these specific applications. In contrast, Cycle-GAN showed mixed performance, with "True" (T) ratings for solar and wind turbine images but a failure to generate satisfactory sky images. WGAN and Penca-GAN both excelled, achieving "True" ratings across all three datasets, showcasing their robustness and effectiveness in generating high-quality synthetic imagery. Other models, including Style-GAN, ECP-IGANN, MCI-GAN, GSIP-GAN, and latent diffusion models (LDM), also exhibited strong performance with "True" ratings across the board. This analysis underscores the superiority of Penca-GAN and other advanced models in meeting the thresholds for high-quality image synthesis in renewable energy applications, highlighting their potential for practical implementation in the field. The results in Table 12 are derived from these IS and FID threshold values.
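
The pass/fail ratings in Table 12 follow from dataset-specific cut-offs on IS and FID; a minimal sketch of that decision rule is shown below. Interpreting the quoted thresholds of 60 (sky) and 80 (wind turbines) as IS cut-offs is an assumption, as is the FID cut-off used in the example.

```python
def mode_collapse_check(is_score: float, fid_score: float,
                        is_threshold: float, fid_threshold: float) -> str:
    """Return 'T' if the model clears both thresholds (higher IS, lower FID), else 'F'."""
    return "T" if (is_score >= is_threshold and fid_score <= fid_threshold) else "F"

# Example: Penca-GAN on sky images (IS 71.43, FID 164.45) with an assumed FID cut-off of 200.
print(mode_collapse_check(71.43, 164.45, is_threshold=60, fid_threshold=200))  # -> 'T'
```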

Table 12 Mode collapse mitigation results.

Case study

This subsection of the paper presents a case study focused on detecting faults and damage in solar panels and the object detection of wind turbines. This study systematically compares the detection outcomes obtained before and following the implementation of the Penca-GAN architecture. By analyzing these results, the efficacy of the Penca-GAN model in enhancing the fault detection capabilities in these renewable energy systems is evaluated, providing insights into its potential advantages over traditional detection methods.

Solar panel fault detection

Table 13 presents the fault detection performance metrics for various models applied to solar panels before the implementation of the Penca-GAN. The metrics include accuracy, sensitivity, specificity, precision, recall, and F1-score, providing a comprehensive evaluation of each model's effectiveness. Among the models, YOLOv6 achieved the highest accuracy at 85.92%, closely followed by Vision Transformers (ViT) at 85.43%. Both models demonstrated strong sensitivity, with YOLOv6 at 84.93% and ViT at 85.36%, indicating their ability to correctly identify faults. The specificity metric, which measures the true negative rate, was also noteworthy, with ViT leading at 84.01%. In terms of precision, ViT and YOLOv6 performed well, with scores of 84.54% and 84.96%, respectively. Overall, while all models showed varying degrees of effectiveness, the results highlight YOLOv6 and ViT as the most promising for fault detection in solar panels before the advancements introduced by Penca-GAN. Figure 8 visually represents these performance metrics, offering a clear comparative analysis of each model's capabilities.
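
All of the detection metrics reported in Tables 13, 14, 15, and 16 reduce to confusion-matrix counts; the short sketch below states those definitions explicitly so the reported percentages can be interpreted consistently. The counts passed in are hypothetical inputs, not values from the study.

```python
def detection_metrics(tp: int, fp: int, tn: int, fn: int) -> dict:
    """Binary fault-detection metrics from confusion-matrix counts."""
    accuracy    = (tp + tn) / (tp + fp + tn + fn)
    sensitivity = tp / (tp + fn)            # recall / true-positive rate
    specificity = tn / (tn + fp)            # true-negative rate
    precision   = tp / (tp + fp)
    f1          = 2 * precision * sensitivity / (precision + sensitivity)
    return {"accuracy": accuracy, "sensitivity": sensitivity, "specificity": specificity,
            "precision": precision, "recall": sensitivity, "f1": f1}
```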

Table 13 Fault detection metrics for solar panels before the Penca-GAN.
Fig. 8

Fault detection performance metrics for solar panels before the Penca-GAN.

The evaluation of the fault detection performance for solar panels using various models is illustrated in Tables 14 and 15, showcasing the effectiveness of traditional GANs and GANs with identity blocks, respectively. The ablation test results demonstrate how different architectural choices impact model performance. In Table 14, the performance metrics reveal that Vision Transformers (ViT) achieved the highest accuracy at 86.46%, along with robust sensitivity (86.01%) and specificity (85.34%). YOLOv6 also performed well with an accuracy of 86.01%, indicating its reliability in fault detection. Other models, including VGG-16 and MobileNetV3, demonstrated slightly lower accuracies of 82.19% and 83.54%, respectively, highlighting some limitations in their detection capabilities. Comprehensive performance metrics, including precision, recall, and F1-score, further emphasize the competitive nature of these models, as shown in Fig. 9.

In contrast, Table 15 presents the fault detection metrics after employing GANs with identity blocks, where ViT improved its accuracy to 87.65%, reflecting enhanced performance through architectural modifications. The ablation test results indicate that the integration of the identity blocks significantly contributed to this improvement. YOLOv6 maintained a high performance with an accuracy of 87.12%, while VGG-16 showed a modest improvement with an accuracy of 84.54%. This comparison highlights the benefits of integrating identity blocks in GAN architectures to boost the performance of fault detection systems. The overall improvements in sensitivity and specificity across models, particularly in ViT and CNN-Bi-GRU, are visually represented in Fig. 10, illustrating the advancements made in detecting faults in solar panels through these innovative approaches.

Table 14 Fault detection performance metrics for solar panels after the traditional GAN.
Fig. 9

Fault detection performance metrics for solar panels after using the traditional GAN.

Table 15 Fault detection performance metrics for solar panels after GAN with identity block.
Fig. 10

Fault detection performance metrics for solar panels after using GAN with identity block.

Table 16 illustrates the performance metrics for fault detection in solar panels after the implementation of the Penca-GAN model. The results showcase significant improvements across all evaluated models compared to their pre-Penca-GAN performance. Vision Transformers (ViT) achieved the highest accuracy at 92.32%, demonstrating a remarkable improvement in the fault detection capabilities. The sensitivity and specificity metrics also reflect this improvement, with ViT recording a sensitivity of 91.59% and a specificity of 91.89%. YOLOv6, while slightly lower in overall accuracy at 90.04%, exhibited a strong sensitivity of 91.76%, affirming its reliability in identifying faults. VGG-16 and MobileNetV3, with accuracies of 87.56% and 88.23%, respectively, also showed considerable gains, highlighting the effectiveness of Penca-GAN across different architectures. The integration of the Penca-GAN architecture significantly improved the fault detection process, as evidenced by the overall increase in accuracy and other performance metrics compared to the results obtained before its implementation. This enhancement can be attributed to Penca-GAN’s effective augmentation strategies, which enriched the training dataset, allowing the models to learn more robust features and improve their generalization capabilities. Comparative analysis underscores the advantages of using Penca-GAN in the augmentation process, resulting in more accurate and reliable fault detection in solar panels. Figure 11 visually illustrates these enhanced performance metrics, clearly depicting the post-implementation advancements.

Table 16 Performance metrics of fault detection in solar panels after using the Penca-GAN model.
Fig. 11

Fault detection metrics in solar panels using Penca-GAN.

The evaluation of the fault detection metrics in the solar panels using the Penca-GAN is summarized in Table 17. The descriptive statistics reveal that the mean accuracy achieved was 88.07%, with sensitivity and specificity also showing strong performance at 88.18% and 88.01%, respectively. The precision, recall, and F1-score were similarly high, indicating a robust model performance across various metrics. The standard deviations are relatively low, suggesting consistent performance, with the minimum accuracy recorded at 85.43% and a maximum of 92.32%, highlighting the model’s reliability. The range of values across these metrics is narrow, reflecting the stability of the Penca-GAN in detecting faults in solar panels.

Table 18 presents the results of the Bonferroni post hoc tests, which provide insights into the statistical significance of the differences between the various fault detection metrics. The results indicate minimal mean differences across metrics, with none reaching statistical significance (p = 1 for all comparisons), suggesting that the performance metrics are closely aligned. For instance, the mean difference between sensitivity and specificity was 0.17, while that between precision and recall was 0. The confidence intervals further emphasize the lack of significant variability, which supports the descriptive statistics in Table 17 and illustrates the overall effectiveness and consistency of the Penca-GAN in fault detection for solar panels.

Table 17 Descriptive statistics of fault detection metrics in solar panels using Penca-GAN.
Table 18 Bonferroni post-hoc-tests of fault detection metrics in solar panels using Penca-GAN.

Figures 12 and 13 show the impact of incorporating Penca-GAN into the fault detection process for solar panels using the Vision Transformer (ViT) model. In Fig. 12, the results obtained before applying the Penca-GAN show a limited ability to accurately identify faults, possibly due to the inherent complexities and variations in the solar panel data; the detections may exhibit higher rates of false negatives and false positives, indicating room for improvement. In contrast, Fig. 13 showcases the results after implementing the Penca-GAN, highlighting a significant enhancement in fault detection accuracy. The model demonstrates improved sensitivity and specificity, leading to more reliable identification of faults. This comparison underscores the effectiveness of Penca-GAN in refining the performance of ViT in solar panel monitoring, ultimately contributing to better maintenance and operational efficiency in solar energy systems.

Fig. 12

Fault detection results in solar panels with ViT before applying Penca-GAN.

Fig. 13

Fault detection results in solar panels with ViT after applying Penca-GAN.

Wind turbine fault detection

This part of the case study compares fault detection in wind turbines before and after using the Penca-GAN.

Table 19 presents the performance metrics for fault detection in wind turbines before the application of the Penca-GAN model. The metrics, including accuracy, sensitivity, specificity, precision, recall, and F1-score, reveal varying effectiveness among the models. YOLOv6 stands out with the highest accuracy of 86.06%, along with a commendable sensitivity of 86.53%, indicating its strong capability in detecting faults. Vision Transformers (ViT) followed closely with an accuracy of 84.45% and a sensitivity of 85.06%. Other models, such as SPF-Net and CNN-Bi-GRU, also demonstrated solid performance, with accuracies of 84.69% and 84.05%, respectively. The overall performance metrics suggest that while several models were effective in fault detection, there is significant room for improvement, as indicated by the lower scores for VGG-16 and CNN, which recorded accuracies of 80.04% and 80.12%. This table sets a benchmark, highlighting the necessity for advancements like Penca-GAN to enhance fault detection capabilities in wind turbine systems. Figure 14 visually illustrates these models' performance metrics, highlighting the variations in fault detection capability before integrating the Penca-GAN model for enhancement. This comparison underscores the importance of model selection based on specific metric priorities when addressing wind turbine fault detection.

Table 19 Performance metrics of fault detection in wind turbines before using the Penca-GAN model.
Fig. 14

Fault detection metrics for damaged wind turbines before Penca-GAN.

The performance metrics of fault detection in wind turbines using traditional GANs and GANs with identity blocks are presented in Tables 20 and 21, respectively. The ablation test results in Table 20 indicate that the Vision Transformers (ViT) model achieved an accuracy of 86.01%, along with sensitivity and specificity values of 86.56% and 86.75%. YOLOv6 performed slightly better, with an accuracy of 87.01%, demonstrating its effectiveness in detecting faults. Other models, such as VGG-16 and CNN-LSTM, exhibited lower accuracy rates of 82.86% and 83.87%, respectively, indicating challenges in their fault detection capabilities. The F1-scores across the models suggest consistent performance, as depicted in Fig. 15, which visually represents the metrics for each model after using the traditional GAN.

In contrast, Table 21 showcases the performance metrics after employing GANs with identity blocks. Here, ViT improved its accuracy to 88.65%, reflecting a significant improvement in detection capability. YOLOv6 also demonstrated high performance, with an accuracy of 88.12%. The results from the ablation test indicate that incorporating identity blocks led to improved sensitivity and specificity across the board, particularly for ViT and CNN-Bi-GRU. This is visually supported by Fig. 16, which illustrates the enhanced fault detection metrics for damaged wind turbines after using GANs with identity blocks. Overall, the findings emphasize the advantages of architectural modifications in improving fault detection performance for both solar panels and wind turbines.

Table 20 Performance metrics of fault detection in wind turbines after using the traditional GAN.
Fig. 15

Fault detection metrics for damaged wind turbines after using the traditional GAN.

Table 21 Performance metrics of fault detection in wind turbines after using GAN with identity block.
Fig. 16

Fault detection metrics for damaged wind turbines after using GAN with identity block.

Table 22 shows the performance metrics for fault detection in wind turbines following the implementation of the Penca-GAN model. The results reflect notable improvements across all models compared to their pre-Penca-GAN performance. Vision Transformers (ViT) achieved the highest accuracy at 90.43%, along with strong sensitivity and specificity scores of 91.43% and 92.04%, respectively, indicating a robust capability in accurately identifying and classifying faults. YOLOv6 also performed admirably, with an accuracy of 90.21% and a sensitivity of 91.54%, underscoring its reliability in fault detection. VGG-16 and MobileNetV3 demonstrated solid performances, with accuracies of 88.87% and 88.45%, respectively, showcasing the effectiveness of Penca-GAN in enhancing model capabilities. Figure 17 complements this analysis by visually representing the performance metrics for fault detection in damaged wind turbines, showing a similar trend in model effectiveness after the Penca-GAN application. The chart underscores the potential of Penca-GAN in enhancing detection capabilities across different renewable energy applications, reinforcing the importance of sophisticated data augmentation techniques in achieving high performance.

Table 22 Performance metrics of fault detection in wind turbines after using the Penca-GAN model.
Fig. 17

Fault detection metrics for damaged wind turbines using Penca-GAN.

The application of Penca-GAN has significantly enhanced the fault detection process in wind turbines, improving the accuracy and reliability of identifying potential issues. By generating high-quality synthetic data, Penca-GAN effectively addresses the limitations of scarce labeled datasets, allowing models to train on a more diverse set of scenarios. This augmentation increases the volume of training data and enriches the feature representation, enabling detection algorithms to recognize subtle anomalies that might otherwise go unnoticed. As a result, the models exhibit heightened sensitivity and specificity, translating into a more robust detection capability for wind turbine faults. The integration of Penca-GAN ultimately facilitates more timely and accurate maintenance interventions, thereby enhancing the operational efficiency and reliability of wind energy systems.

The evaluation of the fault detection metrics for damaged wind turbines using the Penca-GAN is summarized in Table 23, which presents the descriptive statistics for various performance indicators. The mean accuracy was 88.74%, with sensitivity and specificity slightly higher at 88.90% and 89.30%, respectively. The precision and recall metrics were also commendable, indicating robust detection capabilities. However, the F1-score averaged 81.15%, reflecting some inconsistency in model performance across metrics. The standard deviations are relatively low for accuracy, sensitivity, and specificity, suggesting stable performance, while the F1-score exhibits a higher standard deviation of 25.82, indicating variability in this metric. The minimum accuracy recorded was 86.96%, with a maximum of 90.43%, showcasing the model's effectiveness across a narrow range, as detailed in the range statistics.

Table 24 provides the results of the Bonferroni post hoc tests, which analyze the statistical significance of the differences among the fault detection metrics. The mean differences show minimal variation across metrics, with none reaching statistical significance (p = 1 for all comparisons), suggesting that the performance metrics are closely aligned. For instance, the difference between the accuracy and sensitivity was −0.16, while the difference between the sensitivity and specificity was −0.40. The confidence intervals reinforce these findings, indicating that the variations in performance metrics are not statistically significant, thereby supporting the overall consistency of the Penca-GAN in detecting faults in wind turbines. The results from both tables highlight the effectiveness of Penca-GAN in achieving high performance while also pointing to areas where improvements could be made, particularly in achieving a more balanced F1-score.

Table 23 Descriptive statistics of fault detection metrics for damaged wind turbines using Penca-GAN.
Table 24 Bonferroni post-hoc-tests of fault detection metrics for damaged wind turbines using Penca-GAN.

Theoretical and computational analysis

The Penca-GAN methodology introduces a dual-GAN architecture that enhances traditional GANs through two main innovations: an identity block and a pancreas-inspired metaheuristic loss function. The identity block stabilizes training by preserving the critical features of the input images, facilitating smoother gradient flow and enabling the generator to learn more effectively from the data. This design helps mitigate common issues associated with GANs, such as mode collapse, by ensuring that the generator produces a more diverse range of outputs. The pancreas-inspired loss function dynamically adjusts penalties based on pixel integrity, promoting coherence and diversity in the generated images. This dual focus on maintaining pixel integrity and enhancing output diversity positions Penca-GAN as a robust solution for applications requiring high-quality synthetic imagery.
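
To make the identity block concrete, the sketch below shows a residual-style block that adds the input back to its transformed features, which is the mechanism described for preserving key characteristics and smoothing gradient flow. The exact layer sizes, normalization choices, and placement inside the generator are assumptions, since this section does not fix them.

```python
import torch
import torch.nn as nn

class IdentityBlock(nn.Module):
    """Minimal residual-style identity block: the input is added back to the transformed
    features, preserving key characteristics and easing gradient flow.
    Layer configuration is assumed, not taken from the paper."""
    def __init__(self, channels: int):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels, kernel_size=3, padding=1),
            nn.BatchNorm2d(channels),
        )
        self.act = nn.ReLU(inplace=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.act(self.body(x) + x)   # the skip connection is the identity path
```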

The generator (G) in the Penca-GAN architecture consists of multiple convolutional layers, each performing operations proportional to the input size. If the generator comprises L layers, each with a filter size of F and input dimensions of N, the time complexity for a single forward pass through the generator is approximately O(L × F × N²), where N² accounts for the two-dimensional input. The discriminator (D) similarly processes inputs through multiple layers, resulting in a comparable time complexity of O(L′ × F′ × N²), where L′ and F′ denote the number of layers and filter sizes in the discriminator. During training, both the generator and the discriminator are updated iteratively, leading to an overall complexity of O(T × (L × F × N² + L′ × F′ × N²)), where T represents the number of training epochs.
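
An order-of-magnitude estimate following O(T × (L × F × N² + L′ × F′ × N²)) can be expressed directly. The values in the example (8-layer networks, 64 filters, 128 × 128 inputs, 200 epochs) are illustrative assumptions, not the study's actual configuration.

```python
def training_cost_estimate(epochs: int, layers_g: int, filters_g: int,
                           layers_d: int, filters_d: int, n: int) -> int:
    """Abstract operation count per O(T x (L*F*N^2 + L'*F'*N^2)); constants are ignored."""
    per_pass = layers_g * filters_g * n**2 + layers_d * filters_d * n**2
    return epochs * per_pass

# Illustrative values only.
print(f"{training_cost_estimate(200, 8, 64, 8, 64, 128):.2e} abstract operations")
```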

The space complexity of the Penca-GAN is determined by the number of parameters in both the generator and the discriminator. Each convolutional layer requires storage for its weights and biases. For L layers in the generator and L′ layers in the discriminator, the total space complexity can be approximated as O(PG + PD), where PG and PD are the number of parameters in the generator and discriminator, respectively. This complexity can be influenced by the inclusion of batch normalization layers and the identity block, which may introduce additional parameters but enhance the overall learning capability. Thus, while the architecture introduces increased memory requirements, it also facilitates improved performance and training efficiency.
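
The parameter counts PG and PD that drive the space complexity can be tallied layer by layer; the sketch below counts weights and biases for convolutional layers, with a purely illustrative three-layer stack standing in for the actual generator.

```python
def conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights plus biases of one k x k convolutional layer."""
    return c_out * (c_in * k * k + 1)

def model_params(layer_specs) -> int:
    """Total parameter count for a stack of conv layers given (c_in, c_out, k) tuples."""
    return sum(conv_params(*spec) for spec in layer_specs)

# Illustrative generator stack (assumed, not the actual Penca-GAN architecture).
example_generator = [(128, 64, 3), (64, 64, 3), (64, 3, 3)]
print(model_params(example_generator))   # contributes to PG in O(PG + PD)
```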

In terms of complexity, the time complexity for computing the Penca loss function is approximately O(K×N), where K is the number of generated images and N is the total number of pixels, as both the Pixel Integrity Loss and the Diversity Loss can be computed in parallel. The space complexity is similarly O(K×N), accounting for the storage of the generated images and the intermediate calculations. This manageable computational overhead, combined with the loss function’s theoretical grounding in biological principles, enhances the effectiveness of GAN training, making the Penca metaheuristic loss function a valuable innovation for generating high-quality synthetic images in applications like renewable energy. The balance between complexity and performance underscores its potential to improve the robustness and reliability of generative models.
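
The exact functional form of the pancreas-inspired loss is defined earlier in the paper; the sketch below is only a structural stand-in that mirrors its two components and its O(K × N) cost, pairing a pixel-integrity term (per-pixel L1 distance to a reference batch) with a diversity term (per-feature spread across the batch). The weighting coefficients and the specific distance choices are assumptions.

```python
import torch

def penca_style_loss(generated: torch.Tensor, reference: torch.Tensor,
                     alpha: float = 1.0, beta: float = 0.1) -> torch.Tensor:
    """Structural sketch of a pixel-integrity plus diversity objective.
    `generated` and `reference` are (K, C, H, W) batches; both terms cost O(K x N)."""
    pixel_integrity = (generated - reference).abs().mean()      # coherence with real content
    flat = generated.flatten(start_dim=1)                       # (K, N)
    diversity = -flat.std(dim=0).mean()                         # larger spread lowers the loss
    return alpha * pixel_integrity + beta * diversity
```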

Table 25 presents a comprehensive comparison of computational metrics across thirteen state-of-the-art GAN architectures applied to renewable energy image generation tasks. The table quantifies critical performance indicators including training time (measured in hours), memory usage (in GB), model complexity (parameters in millions), computational load (FLOPs in billions), inference speed (milliseconds per image), qualitative convergence characteristics, and output quality (measured by FID score on renewable energy datasets). This detailed breakdown reveals that while Penca-GAN requires moderate computational resources (72 h of training time, 16 GB memory, and 45 million parameters), it achieves competitive FID scores (18.2) compared to more resource-intensive models. The data demonstrates clear trade-offs between computational efficiency and generative performance across the model spectrum, with lightweight models like CGAN showing rapid training (48 h) but poor-quality outputs (FID 32.6), while advanced models like StyleGAN2 achieve superior quality (FID 16.8) at significant computational cost (125 h, 28 GB memory).

Table 26 builds upon Table 25 by normalizing all metrics relative to Penca-GAN as a baseline (1.00), providing an intuitive comparative analysis of efficiency ratios. This relative comparison highlights that while StyleGAN and StyleGAN2 marginally outperform Penca-GAN in FID scores (by 3.8% and 7.7% respectively), they require substantially greater computational resources—53% and 74% longer training times, 63% and 75% more memory, and 40% and 51% more parameters. Conversely, faster models like CGAN and WGAN, despite requiring 33% and 17% less training time than Penca-GAN, produce dramatically inferior results with FID scores 79.1% and 58.8% worse than the baseline. This relative efficiency analysis conclusively demonstrates Penca-GAN’s position as an optimal balance between computational efficiency and generation quality for renewable energy imagery, making it particularly valuable for research environments with limited computational resources.
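
The normalization used in Table 26 amounts to dividing each model's metrics by the Penca-GAN values; a minimal sketch is given below using figures quoted in the text (the StyleGAN2 parameter count is derived from the reported +51% and is therefore approximate).

```python
def relative_to_baseline(models: dict, baseline: str = "Penca-GAN") -> dict:
    """Express each model's metrics as ratios of the baseline (baseline = 1.00)."""
    base = models[baseline]
    return {name: {metric: round(value / base[metric], 2) for metric, value in metrics.items()}
            for name, metrics in models.items()}

models = {
    "Penca-GAN": {"train_h": 72,  "memory_gb": 16, "params_m": 45.0, "fid": 18.2},
    "StyleGAN2": {"train_h": 125, "memory_gb": 28, "params_m": 68.0, "fid": 16.8},  # params approximate
}
print(relative_to_baseline(models))
```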

Table 25 Computational efficiency comparison of GAN models.
Table 26 Relative efficiency metrics (Penca-GAN as baseline).

Conclusion and future work

This paper introduced Penca-GAN, a groundbreaking architecture that enhances the performance of GANs through two pivotal innovations: a pancreas-inspired metaheuristic loss function and the incorporation of an identity block. Our approach effectively addresses persistent challenges in renewable energy applications, such as mode collapse, which restricts the diversity of generated samples, and pixel integrity, which is crucial for maintaining high-quality synthetic images. Experimental evaluations on three distinct datasets, namely solar panel images, sky imagery, and wind turbine images, demonstrated that Penca-GAN significantly outperforms traditional GAN architectures. The findings revealed marked improvements in image diversity and fidelity, enhancing accuracy in detection tasks vital to renewable energy systems. The practical implications of Penca-GAN for renewable energy infrastructure management are considerable. By generating high-quality synthetic images that accurately reflect various operational scenarios, the Penca-GAN can enhance fault detection systems, optimize maintenance schedules, and improve decision-making processes in energy production and consumption. This positions Penca-GAN as a promising tool for advancing data augmentation techniques in applications where high-quality imagery is essential, thereby contributing to more efficient and reliable renewable energy infrastructure management.

However, several limitations in this study need to be addressed in future work. The limitations can be grouped into theoretical and practical categories. Theoretical limitations include the following: (1) the pancreas-inspired (PAN) loss function has been applied to only two classic GAN architectures; further exploration of this loss function across other GAN architectures is warranted, and (2) while the training procedures are discussed, the convergence of the algorithms has not been analyzed; thus, future research should consider convergence issues associated with new algorithms. Practical limitations include the following: (1) the need to study the influence of hyperparameters on the performance of the PAN loss, as optimal settings may vary across different applications, and (2) the PAN loss has only been evaluated on benchmark datasets; its performance should be examined on in-house datasets from various domains to better assess its generalizability. Additionally, the model’s ability to generalize to non-image data and its scalability to larger datasets remain important areas for further investigation.

Future research on Penca-GAN will focus on enhancing its capabilities through several avenues, including exploring additional biological mechanisms to improve adaptability and efficiency, potentially leading to hybrid models that boost overall robustness. The goal is to expand the applications of Penca-GAN to medical imaging and autonomous vehicle systems, addressing similar data scarcity challenges. The enhancements will improve the generator’s ability to produce diverse outputs while maintaining pixel integrity and enhancing interpretability to allow user control over the generated features.