YOLOv8-TF: Transformer-Enhanced YOLOv8 for Underwater Fish Species Recognition with Class Imbalance Handling

Author:

Shah Chiranjibi1ORCID,Nabi M M2ORCID,Alaba Simegnew Yihunie3ORCID,Ebu Iffat Ara3,Prior Jack1ORCID,Campbell Matthew D.4,Caillouet Ryan5,Grossi Matthew D.5ORCID,Rowell Timothy5,Wallace Farron5,Ball John E.3ORCID,Moorhead Robert1ORCID

Affiliation:

1. Northern Gulf Institute, Mississippi State University, Starkville, MS 39759, USA

2. The School of Engineering and Applied Sciences, Western Kentucky University, Bowling Green, KY 42101, USA

3. Department of Electrical and Computer Engineering, James Worth Bagley College of Engineering, Mississippi State University, Starkville, MS 39762, USA

4. National Marine Fisheries Services, Southeast Fisheries Science Center, 3209 Frederic Street, Pascagoula, MS 39567, USA

5. NOAA Fisheries, 4700 Avenue U, Galveston, TX 77551, USA

Abstract

In video-based fish surveys, species recognition plays a vital role in stock assessments, ecosystem analysis, production management, and protection of endangered species. However, implementing fish species detection algorithms in underwater environments presents significant challenges due to factors such as varying lighting conditions, water turbidity, and the diverse appearances of fish species. In this work, a transformer-enhanced YOLOv8 (YOLOv8-TF) is proposed for underwater fish species recognition. The YOLOv8-TF enhances the performance of YOLOv8 by adjusting depth scales, incorporating a transformer block into the backbone and neck, and introducing a class-aware loss function to address class imbalance in the dataset. The class-aware loss considers the count of instances within each species and assigns a higher weight to species with fewer instances. This approach enables fish species recognition through object detection, encompassing the classification of each fish species and localization to estimate their position and size within an image. Experiments were conducted using the 2021 Southeast Area Monitoring and Assessment Program (SEAMAPD21) dataset, a detailed and extensive reef fish dataset from the Gulf of Mexico. The experimental results on SEAMAPD21 demonstrate that the YOLOv8-TF model, with a mean Average Precision (mAP)0.5 of 87.9% and mAP0.5–0.95 of 61.2%, achieves better detection results for underwater fish species recognition compared to state-of-the-art YOLO models. Additionally, experimental results on the publicly available datasets, such as Pascal VOC and MS COCO datasets demonstrate that the model outperforms existing approaches.

Funder

Northern Gulf Institute at Mississippi State University from NOAA’s Office of Oceanic and Atmospheric Research, U.S. Department of Commerce.

Publisher

MDPI AG

Reference72 articles.

1. Development of an intelligent feeding controller for indoor intensive culturing of eel;Changa;Aquac. Eng.,2004

2. Artificial neural networks for fish-species identification;Cabreira;ICES J. Mar. Sci.,2009

3. Semi-supervised learning for fish species recognition;Alaba;Proceedings of the Ocean Sensing and Monitoring XV,2023

4. Alaba, S.Y., Nabi, M., Shah, C., Prior, J., Campbell, M.D., Wallace, F., Ball, J.E., and Moorhead, R. (2022). Class-aware fish species recognition using deep learning for an imbalanced dataset. Sensors, 22.

5. An enhanced YOLOv5 model for fish species recognition from underwater environments;Hou;Proceedings of the Ocean Sensing and Monitoring XV,2023

Cited by 1 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.7亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2025 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3