Neidio i’r brif dudalen lywio Neidio i chwilio Neidio i’r prif gynnwys

RepVGG-GELAN: Enhanced GELAN with VGG-STYLE ConvNets for Brain Tumour Detection

Allbwn ymchwil: Pennod mewn Llyfr/Adroddiad/Trafodion CynhadleddCyfraniad mewn cynhadleddadolygiad gan gymheiriaid

1 Dyfyniad (Scopus)

Crynodeb

Methods for object detection, especially those utilizing YOLO (You Only Look Once), are known for their impressive ability to balance precision and speed. However, their potential in detecting brain tumours has received limited attention. RepVGG-GELAN, proposed in this study is an advanced YOLO framework enhanced by RepVGG (Reparameterized Convolutional Neural Network), a specialized convolutional method designed for detection particularly emphasizing brain tumours in medical imaging. With RepVGG architecture the model enhances both speed and accuracy of brain tumour detection. The integration of RepVGG within the YOLO framework seeks to optimize both computational efficiency and detection effectiveness. This research incorporates a Generalized Efficient Layer Aggregation Network (GELAN) architecture based on spatial pyramid pooling enhancing the capabilities of RepVGG. Tests on a brain tumour dataset reveal that RepVGG-GELAN exceeds the performance of the RCS-YOLO (Reparameterized Convolution ShuffleNet YOLO) in terms of both precision and processing speed. Notably, RepVGG-GELAN delivers a 4.91% enhancement in precision and a 2.54% improvement in AP50 (Average Precision at IOU=0.5) over the most recent method with a computational efficiency of 240.7 GFLOPs (Giga Floating Point Operations per Second). The proposed RepVGG-GELAN gives positive results demonstrating itself as a state-of-the-art solution for accurate and efficient detection of brain tumour images. The implementation code is publicly available at https://github.com/ThensiB/RepVGG-GELAN.

Iaith wreiddiolSaesneg
TeitlComputer Vision and Image Processing - 9th International Conference, CVIP 2024, Revised Selected Papers
GolygyddionJagadeesh Kakarla, R. Balasubramanian, Subrahmanyam Murala, Santosh Kumar Vipparthi, Deep Gupta
CyhoeddwrSpringer Science and Business Media Deutschland GmbH
Tudalennau417-430
Nifer y tudalennau14
ISBN (Argraffiad)9783031936876
Dynodwyr Gwrthrych Digidol (DOIs)
StatwsCyhoeddwyd - 20 Gorff 2025
Digwyddiad9th International Conference on Computer Vision and Image Processing, CVIP 2024 - Chennai, India
Hyd: 19 Rhag 202421 Rhag 2024

Cyfres gyhoeddiadau

EnwCommunications in Computer and Information Science
Cyfrol2473 CCIS
ISSN (Argraffiad)1865-0929
ISSN (Electronig)1865-0937

Cynhadledd

Cynhadledd9th International Conference on Computer Vision and Image Processing, CVIP 2024
Gwlad/TiriogaethIndia
DinasChennai
Cyfnod19/12/2421/12/24

Dyfynnu hyn