Skip to main content

ORIGINAL RESEARCH article

Front. Neurorobot.
Volume 18 - 2024 | doi: 10.3389/fnbot.2024.1452019
This article is part of the Research Topic Advanced Technology for Human Movement Rehabilitation and Enhancement View all 12 articles

Swimtrans Net: A multimodal robotic system for swimming action recognition driven via Swin-Transformer

Provisionally accepted
Chen He Chen He 1,2Yue Xiaoyu Yue Xiaoyu 1,3*
  • 1 College of Physical Education, Sangmyung University (Seoul), Sangmyung, China
  • 2 School of Physical Education, Hubei University of Science and Technology (China), Hubei, China
  • 3 School of Physical Education, Hubei University of Science and Technology, Xianning, China

The final, formatted version of the article will be published soon.

    Currently, using machine learning methods for precise analysis and improvement of swimming techniques holds significant research value and application prospects. The existing machine learning methods have improved the accuracy of action recognition to some extent. However, they still face several challenges such as insufficient data feature extraction, limited model generalization ability, and poor real-time performance. To address these issues, this paper proposes an innovative approach called Swimtrans Net: A multimodal robotic system for swimming action recognition driven via Swin-Transformer. By leveraging the powerful visual data feature extraction capabilities of Swin-Transformer, Swimtrans Net effectively extracts swimming image information. Additionally, to meet the requirements of multimodal tasks, we integrate the CLIP model into the system. Swin-Transformer serves as the image encoder for CLIP, and through finetuning the CLIP model, it becomes capable of understanding and interpreting swimming action data, learning relevant features and patterns associated with swimming. Finally, we introduce transfer learning for pre-training to reduce training time and lower computational resources, thereby providing real-time feedback to swimmers. Experimental results show that Swimtrans Net has achieved a 2.94% improvement over the current state-of-the-art methods in swimming motion analysis and prediction, making significant progress. This study introduces an innovative machine learning method that can help coaches and swimmers better understand and improve swimming techniques, ultimately improving swimming performance.

    Keywords: Swin-Transformer, Clip, multimodal robotic, swimming action recognition, Transfer Learning

    Received: 20 Jun 2024; Accepted: 09 Aug 2024.

    Copyright: © 2024 He and Xiaoyu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Yue Xiaoyu, School of Physical Education, Hubei University of Science and Technology, Xianning, China

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.