

Most CycleGAN domain transfer architectures require a large amount of data belonging to domains on which the domain transfer task is to be applied. Nevertheless, in many real-world applications one of the domains is reduced, i.e., scarce. This means that it has much less training data available in comparison to the other domain, which is fully observable. In order to tackle the problem of using CycleGAN framework in such unfavorable application scenarios, we propose and invoke a novel Bootstrapped SSL CycleGAN architecture (BTS-SSL), where the mentioned problem is overcome using two strategies. Firstly, by using a relatively small percentage of available labelled training data from the reduced or scarce domain and a Semi-Supervised Learning (SSL) approach, we prevent overfitting of the discriminator belonging to the reduced domain, which would otherwise occur during initial training iterations due to the small amount of available training data in the scarce domain. Secondly, after initial learning guided by the described SSL strategy, additional bootstrapping (BTS) of the reduced data domain is performed by inserting artifically generated training examples into the training poll of the data discriminator belonging to the scarce domain. Bootstrapped samples are generated by the already trained neural network that performs transferring from the fully observable to the scarce domain. The described procedure is periodically repeated during the training process several times and results in significantly improved performance of the final model in comparison to the original unsupervised CycleGAN approach. The same also holds in comparison to the solutions that are exclusively based either on the described SSL, or on the bootstrapping strategy, i.e., when these are applied separately. Moreover, in the considered scarce scenarios it also shows competitive results in comparison to the fully supervised solution based on the pix2pix method. In that sense, it is directly applicable to many domain transfer tasks that are relying on the CycleGAN architecture. © 2022 by the authors. Licensee MDPI, Basel, Switzerland.
| Funding sponsor | Funding number | Acronym |
|---|---|---|
| 451-03-68/2020-14/200156,200156 | ||
| Science Fund of the Republic of Serbia | 6524560 | |
| Horizon 2020 Framework Programme See opportunities by H2020 | 872614 | H2020 |
Acknowledgments: This research was supported by the \u201CScience Fund of the Republic of Serbia\u201D, through the project grant agreement No. 6524560: \u201CSpeaker/Style Adaptation for Digital Voice Assistants Based on Image Processing Methods (AI-S-ADAPT), and by the \u201CSerbian Ministry of Education, Science and Technological Development\u201D through the research project No. 451-03-68/2020-14/200156: \u201CInnovative Scientific and Artistic Research from the Faculty of Technical Sciences Activity Domain\u201D. The authors would also like to acknowledge support by the EU\u2019s H2020 research and innovation programme under the project grant agreement No. 872614: \u201CSelfsustained Cross Border Customized Cyberphysical System Experiments for Capacity Building Among European Stakeholders\u201D (Smart4All).
Funding: This research was funded by the \u201CScience Fund of the Republic of Serbia\u201C, through the project grant agreement No. 6524560: \u201CSpeaker/Style Adaptation for Digital Voice Assistants Based on Image Processing Methods (AI-S-ADAPT); and by the \u201CSerbian Ministry of Education, Science and
Technological Development\u201C through the research project No. 451-03-68/2020-14/200156: \u201CInnovative Scientific and Artistic Research from the Faculty of Technical Sciences Activity Domain\u201D.
Krstanović, L.; Faculty of Technical Sciences, University of Novi Sad, Trg D. Obradovića 6, Novi Sad, Serbia;
© Copyright 2022 Elsevier B.V., All rights reserved.