AN AUTONOMOUS PHYTOPLANKTON DETECTION AND ECOLOGICAL RISK ASSESSMENT USING ORIENTED OBJECT DETECTION AND HYBRID DEEP LEARNING
Aquatic ecosystems depend greatly on microscopic phytoplankton species as the primary producers and bio-indicators of water quality. Accurate detection and identification of phytoplankton species is a necessary step for monitoring their ecological risk and the following in-depth ecological and environmental studies and analyses, due to their significance and potential indication of environmental stress, eutrophication and habitat change, and HABs incidents. Yet, microscopic image-based phytoplankton species identification is labor-intensive and time-consuming. It requires expert taxonomic skills which make large-scale biodiversity screening an unthinkable task using traditional approaches. This paper proposes a novel automated system that integrates an Oriented YOLOv8-OBB object detection engine with a hybrid ConvNeXt–CBAM–Vision Transformer (ViT) classifier to tackle some fundamental challenges for microscopic phytoplankton species identification, including rotational-variation and small-object detection, background noise from other particles floating near the target organisms, and subtle morphological difference among species. For detection, an anchor-free oriented bounding box regression module was adopted to preserve the 360° rotational-invariance of the target organisms, minimizing errors of background inclusion of organisms rotating diagonally. After extracting the contour-integrating organism patches, a triple-hybrid deep learning classifier was deployed, by cascading a feature extractor ConvNeXt, an attention model CBAM, and a global relational aggregator ViT into the pipeline. The trained detection and classification models are validated over 23,352 microscopic images with various magnification factors, obtaining an accuracy of 94.2%, detection mean Average Precision (mAP) of 0.89, and test latency of 34.7 ms/image. Finally, a rule engine-based Ecological Risk Assessment (ERA) is devised to generate timely alarm by converting identified taxa to a Saprobic Index.
S.D.Jagadiish, (2026). An Autonomous Phytoplankton Detection and Ecological Risk Assessment using Oriented Object Detection and Hybrid Deep Learning. International Journal of Science, Strategic Management and Technology, 02(03). https://doi.org/10.55041/ijsmt.v2i3.163
S.D.Jagadiish, . "An Autonomous Phytoplankton Detection and Ecological Risk Assessment using Oriented Object Detection and Hybrid Deep Learning." International Journal of Science, Strategic Management and Technology, vol. 02, no. 03, 2026, pp. . doi:https://doi.org/10.55041/ijsmt.v2i3.163.
S.D.Jagadiish, . "An Autonomous Phytoplankton Detection and Ecological Risk Assessment using Oriented Object Detection and Hybrid Deep Learning." International Journal of Science, Strategic Management and Technology 02, no. 03 (2026). https://doi.org/https://doi.org/10.55041/ijsmt.v2i3.163.
[2] Ultralytics, “YOLOv8 Documentation,” 2023. Available: https://docs.ultralytics.com
[3] S. Woo, J. Park, J.-Y. Lee, and I. S. Kweon, “CBAM: Convolutional Block Attention Module,” in Proceedings of the European Conference on Computer Vision (ECCV), 2018. DOI: https://doi.org/10.1007/978-3-030-01234-2_1
[4] Z. Liu et al., “ConvNeXt: A ConvNet for the 2020s,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022. Available: https://arxiv.org/abs/2201.03545
[5] A. Dosovitskiy et al., “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,” in International Conference on Learning Representations (ICLR), 2021. Available: https://arxiv.org/abs/2010.11929
[6] D. P. Kingma and J. Ba, “Adam: A Method for Stochastic Optimization,” in International Conference on Learning Representations (ICLR), 2015. Available: https://arxiv.org/abs/1412.6980
[7] K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” in Proceedings of CVPR, 2016. Available: https://arxiv.org/abs/1512.03385
[8] T.-Y. Lin et al., “Focal Loss for Dense Object Detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 42, no. 2, pp. 318–327, 2020. DOI: https://doi.org/10.1109/TPAMI.2018.2858826
[9] Q. Wang et al., “ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks,” in CVPR, 2020. Available: https://arxiv.org/abs/1910.03151
[10] A. Bochkovskiy, C.-Y. Wang, and H.-Y. M. Liao, “YOLOv4: Optimal Speed and Accuracy of Object Detection,” arXiv preprint arXiv:2004.10934, 2020. Available: https://arxiv.org/abs/2004.10934