Phát hiện polyp bán giám sát từ đầu đến cuối

Nguyen Hong Son

Phát hiện polyp bán giám sát từ đầu đến cuối - Luận án Thạc sĩ Khoa học Dữ liệu

Luận án Thạc sĩ Khoa học Dữ liệu về phát hiện polyp bán giám sát. Phương pháp Dense Target Producer cải thiện độ chính xác phát hiện ung thư đại trực tràng.

Trường ĐH

Đại học Bách khoa Hà Nội

Chuyên ngành

Data Science

Tác giả

Luan An

Thể loại

Master Thesis

Năm xuất bản

2023

Số trang

56

Thời gian đọc

9 phút

Lượt xem

1

Lượt tải

0

Phí lưu trữ

40 Point

I. Phát Hiện Polyp Bán Giám Sát Tổng Quan

Phát hiện polyp đại tràng đóng vai trò quan trọng trong chẩn đoán sớm ung thư đại trực tràng. Bệnh này đứng thứ ba về tỷ lệ mắc và thứ hai về tử vong toàn cầu. Tại Việt Nam, ung thư đại trực tràng nằm trong top 10 bệnh ung thư phổ biến nhất. Tỷ lệ sống sót sau 5 năm có thể đạt 90% nếu phát hiện sớm. Tuy nhiên, chỉ 4/10 trường hợp được chẩn đoán ở giai đoạn đầu. Nghiên cứu này trình bày phương pháp phát hiện polyp từ đầu đến cuối sử dụng học bán giám sát. Phương pháp giúp giảm gánh nặng gán nhãn dữ liệu y tế. Deep learning y khoa đã chứng minh hiệu quả trong nội soi tiêu hóa. Mạng nơ-ron tích chập có khả năng phân tích hình ảnh chính xác. Học bán giám sát kết hợp dữ liệu có nhãn và không nhãn. Cách tiếp cận này phù hợp với y tế, nơi dữ liệu có nhãn khan hiếm và tốn kém.

1.1. Tầm Quan Trọng Phát Hiện Polyp Sớm

Polyp đại tràng là tiền thân của ung thư đại trực tràng. Phát hiện và loại bỏ polyp kịp thời ngăn ngừa ung thư phát triển. Nội soi tiêu hóa là phương pháp chính để phát hiện polyp. Tuy nhiên, tỷ lệ bỏ sót polyp trong nội soi vẫn còn cao. Nguyên nhân do polyp nhỏ, màu sắc tương tự niêm mạc, và sự mệt mỏi của bác sĩ. Hệ thống hỗ trợ tự động giúp giảm tỷ lệ bỏ sót. Deep learning y khoa mang lại độ chính xác cao trong phân đoạn hình ảnh y tế.

1.2. Thách Thức Trong Gán Nhãn Dữ Liệu Y Tế

Gán nhãn dữ liệu y tế đòi hỏi chuyên môn cao. Chỉ bác sĩ có kinh nghiệm mới thực hiện được. Quá trình này tốn thời gian và chi phí lớn. Mỗi hình ảnh nội soi cần được đánh dấu vị trí polyp chính xác. Việc thu thập hàng nghìn hình ảnh có nhãn gần như bất khả thi. Học bán giám sát giải quyết vấn đề này. Phương pháp chỉ cần một lượng nhỏ dữ liệu có nhãn. Phần còn lại sử dụng dữ liệu không nhãn để cải thiện mô hình.

1.3. Ưu Điểm Học Bán Giám Sát

Học bán giám sát kết hợp ưu điểm của học có giám sát và không giám sát. Phương pháp tận dụng lượng lớn dữ liệu không nhãn sẵn có. Chi phí thu thập dữ liệu giảm đáng kể. Hiệu suất mô hình cải thiện so với chỉ dùng dữ liệu có nhãn. Pseudo labeling là kỹ thuật phổ biến trong học bán giám sát. Mô hình giáo viên tạo nhãn tự động cho dữ liệu chưa gán nhãn. Mô hình học sinh học từ cả nhãn thực và nhãn giả. Transfer learning giúp tận dụng kiến thức từ mô hình đã huấn luyện.

II. Kiến Trúc CenterNet Cho Phát Hiện Polyp

CenterNet++ là kiến trúc phát hiện đối tượng tiên tiến không dùng anchor. Phương pháp biểu diễn đối tượng dưới dạng điểm trung tâm trên heatmap. Cách tiếp cận này đơn giản hóa quá trình phát hiện. Không cần Non Maximum Suppression (NMS) phức tạp. Mạng nơ-ron tích chập trích xuất đặc trưng từ hình ảnh nội soi. Feature Pyramid Network (FPN) kết hợp thông tin đa tỷ lệ. Kiến trúc backbone sử dụng ResNet hoặc DLA. ASF module cải thiện khả năng tổng hợp đặc trưng. Augmentation dữ liệu như Mosaic tăng tính đa dạng. Mô hình đạt tốc độ xử lý thời gian thực. Độ chính xác cao trong phát hiện polyp đại tràng nhỏ.

2.1. Nguyên Lý Hoạt Động CenterNet

CenterNet phát hiện đối tượng qua điểm trung tâm thay vì bounding box. Mô hình dự đoán heatmap xác suất cho mỗi vị trí. Giá trị cao trên heatmap chỉ ra vị trí có khả năng chứa polyp. Kích thước và offset được dự đoán cho mỗi điểm trung tâm. Phương pháp loại bỏ nhu cầu thiết kế anchor boxes phức tạp. Quá trình hậu xử lý đơn giản hơn nhiều so với phương pháp anchor-based. Tốc độ suy luận nhanh phù hợp với ứng dụng thời gian thực trong nội soi tiêu hóa.

2.2. Backbone Networks Và Trích Xuất Đặc Trưng

Backbone network là thành phần quan trọng trong mạng nơ-ron tích chập. ResNet18 sử dụng residual blocks để huấn luyện mạng sâu. Deep Layer Aggregation (DLA) tổng hợp thông tin từ nhiều tầng. Hourglass network áp dụng trong bài toán phân đoạn hình ảnh y tế. Các kiến trúc này trích xuất đặc trưng đa cấp độ. Transfer learning từ ImageNet tăng tốc quá trình huấn luyện. Fine-tuning trên dữ liệu polyp cải thiện độ chính xác chuyên biệt.

2.3. Module ASF Và Tổng Hợp Đặc Trưng

Adaptive Spatial Fusion (ASF) module kết hợp đặc trưng thông minh. Module này tổng hợp thông tin từ các tầng khác nhau của mạng. Cơ chế attention giúp tập trung vào vùng quan trọng. Feature Pyramid Network tạo kim tự tháp đặc trưng đa tỷ lệ. Phương pháp phát hiện polyp ở nhiều kích thước khác nhau. Polyp nhỏ và lớn đều được nhận diện chính xác. ASF cải thiện đáng kể hiệu suất trên tập PolypsSet dataset.

III. Dense Target Producer Cải Tiến Pseudo Labeling

Dense Target Producer (DTP) là đóng góp chính của nghiên cứu. Phương pháp khắc phục nhược điểm của pseudo-box labels truyền thống. Pseudo labeling thông thường chỉ gán nhãn cho một số pixel. Các pixel bên trong bounding box đều được coi là foreground. Cách tiếp cận này gây nhiễu trong học bán giám sát. DTP tạo heatmap dày đặc thay vì boxes rời rạc. Mỗi pixel có giá trị xác suất liên tục. Phương pháp phù hợp với kiến trúc heatmap-based như CenterNet. Không cần Non Maximum Suppression trong quá trình tạo nhãn giả. Mô hình giáo viên tạo DTP cho dữ liệu không nhãn. Mô hình học sinh học từ cả supervised loss và unsupervised loss. Threshold Epoch Adaptive (TEA) điều chỉnh ngưỡng tự động theo quá trình huấn luyện.

3.1. Hạn Chế Của Pseudo Box Truyền Thống

Pseudo-box labels có nhiều vấn đề trong phát hiện polyp đại tràng. Phương pháp anchor-based gán tất cả pixel trong box là foreground. Điều này không chính xác vì polyp thường có hình dạng bất quy tắc. Nhiều pixel nền bị gán nhãn sai thành foreground. Gán nhãn yếu này gây nhiễu cho quá trình học của mô hình. Heatmap-based methods chỉ tập trung vào điểm trung tâm. Tuy nhiên, pseudo-boxes vẫn tạo nhiều positive samples không chính xác. DTP giải quyết vấn đề này bằng cách tạo heatmap liên tục.

3.2. Cơ Chế Hoạt Động Dense Target Producer

DTP tạo heatmap dày đặc trực tiếp từ mô hình giáo viên. Mỗi pixel trên heatmap có giá trị từ 0 đến 1. Giá trị cao chỉ ra xác suất cao có polyp tại vị trí đó. Không cần chuyển đổi từ bounding boxes sang heatmap. Quá trình này loại bỏ bước hậu xử lý phức tạp. Mô hình học sinh sử dụng DTP để tính unsupervised loss. Loss function kết hợp supervised loss từ dữ liệu có nhãn. Gradient từ cả hai nguồn cập nhật trọng số mô hình. Phương pháp tận dụng tối đa thông tin từ dữ liệu không nhãn.

3.3. Threshold Epoch Adaptive TEA

TEA điều chỉnh ngưỡng lọc pseudo labels theo epoch. Ngưỡng cố định không phù hợp với mọi giai đoạn huấn luyện. Đầu quá trình, mô hình yếu nên cần ngưỡng cao. Chỉ pseudo labels có độ tin cậy cao được chấp nhận. Cuối quá trình, mô hình mạnh hơn nên hạ ngưỡng. Nhiều pseudo labels hơn được sử dụng để học. TEA tự động điều chỉnh dựa trên hiệu suất validation. Phương pháp cải thiện đáng kể kết quả trên PolypsSet dataset với 1% dữ liệu có nhãn.

IV. Tập Dữ Liệu PolypsSet Và Đánh Giá Mô Hình

PolypsSet dataset được sử dụng để đánh giá phương pháp phát hiện polyp. Tập dữ liệu chứa hình ảnh nội soi tiêu hóa với polyp được gán nhãn. Nghiên cứu thực hiện thí nghiệm trên hai cấu hình: single-class và two-class. Single-class chỉ phân loại có polyp hay không. Two-class phân loại thêm loại polyp cụ thể. Đánh giá sử dụng mean Average Precision (mAP) làm metric chính. Intersection Over Union (IOU) đo độ chồng lấp giữa predicted và ground truth boxes. Precision-Recall curve biểu diễn mối quan hệ giữa độ chính xác và độ phủ. Thí nghiệm với 1%, 5%, 10% dữ liệu có nhãn mô phỏng tình huống thực tế. Mosaic augmentation tăng tính đa dạng cho dữ liệu huấn luyện. Kết quả cho thấy CenterNet++ với DTP vượt trội các phương pháp baseline.

4.1. Cấu Trúc Tập Dữ Liệu PolypsSet

PolypsSet dataset bao gồm hình ảnh nội soi đại tràng chất lượng cao. Mỗi hình ảnh được chuyên gia nội soi gán nhãn vị trí polyp. Tập dữ liệu chia thành training, validation và test sets. Cấu hình single-class đơn giản hóa bài toán phát hiện. Cấu hình two-class phức tạp hơn với phân loại chi tiết. Dữ liệu phản ánh đa dạng về kích thước, hình dạng và vị trí polyp. Một số polyp nhỏ, khó phát hiện thách thức mô hình. Dataset phù hợp để đánh giá phương pháp học bán giám sát.

4.2. Metrics Đánh Giá Hiệu Suất

Mean Average Precision (mAP) là metric tiêu chuẩn trong object detection. mAP tính trung bình AP qua các class và ngưỡng IOU. Intersection Over Union đo độ chồng lấp giữa hai bounding boxes. IOU lớn hơn 0.5 thường được coi là detection đúng. Precision đo tỷ lệ detections đúng trong tổng số detections. Recall đo tỷ lệ polyp được phát hiện trong tổng số polyp thực tế. Precision-Recall curve trực quan hóa trade-off giữa hai metrics này. Area under curve (AUC) của PR curve chính là Average Precision.

4.3. Augmentation Dữ Liệu Trong Huấn Luyện

Augmentation dữ liệu tăng tính đa dạng và giảm overfitting. Mosaic augmentation kết hợp bốn hình ảnh thành một. Kỹ thuật này tạo ngữ cảnh phong phú cho mô hình học. Các phép biến đổi khác bao gồm flip, rotate, scale. Color jittering thay đổi độ sáng, độ tương phản, màu sắc. Augmentation đặc biệt quan trọng khi dữ liệu có nhãn hạn chế. Phương pháp giúp mô hình tổng quát hóa tốt hơn trên dữ liệu mới. Kết hợp với học bán giám sát, augmentation nâng cao hiệu suất đáng kể.

V. Kết Quả Thí Nghiệm Phát Hiện Polyp Bán Giám Sát

Kết quả thí nghiệm chứng minh hiệu quả của phương pháp đề xuất. CenterNet++ với DTP vượt trội so với các baseline trong semi-supervised learning. Trên PolypsSet single-class với 10% dữ liệu có nhãn, mô hình đạt mAP cao. Cải thiện đáng kể so với chỉ sử dụng supervised learning. Thí nghiệm với 1% dữ liệu có nhãn cho thấy ưu thế rõ ràng của DTP. TEA giúp điều chỉnh threshold tối ưu theo epoch. ASF module và Mosaic augmentation đóng góp vào hiệu suất tổng thể. So sánh với các phương pháp SSOD khác như FCOS và YOLO. CenterNet++ với DTP đạt balance tốt giữa accuracy và speed. Visualizations cho thấy mô hình phát hiện polyp chính xác cả kích thước nhỏ. Phương pháp phù hợp triển khai trong hệ thống hỗ trợ nội soi thời gian thực.

5.1. So Sánh Fully Supervised Learning

Thí nghiệm fully supervised sử dụng 100% dữ liệu có nhãn. CenterNet++ đạt hiệu suất cạnh tranh với các detector hiện đại. Trên single-class PolypsSet, mô hình đạt mAP cao. Two-class dataset phức tạp hơn nhưng kết quả vẫn ấn tượng. So sánh với YOLO, FCOS, và các kiến trúc anchor-based khác. CenterNet++ đơn giản hơn nhờ không cần NMS. Tốc độ suy luận đạt real-time phù hợp nội soi tiêu hóa. Kết quả này thiết lập baseline mạnh cho thí nghiệm semi-supervised.

5.2. Hiệu Quả Semi Supervised Với DTP

Thí nghiệm semi-supervised với 1%, 5%, 10% dữ liệu có nhãn. DTP cải thiện đáng kể so với supervised baseline. Với 10% labeled data, mô hình đạt gần hiệu suất fully supervised. Khoảng cách performance gap thu hẹp nhờ tận dụng unlabeled data. DTP vượt trội pseudo-box methods truyền thống. Không cần NMS trong tạo pseudo labels giảm nhiễu. Unsupervised loss từ DTP cung cấp gradient chất lượng. Mô hình học được representation tốt hơn từ dữ liệu không nhãn.

5.3. Ablation Studies Và Phân Tích

Ablation studies đánh giá đóng góp của từng component. ASF module cải thiện mAP đáng kể trên cả hai datasets. Mosaic augmentation tăng tính robust của mô hình. Kết hợp ASF và Mosaic mang lại hiệu quả tốt nhất. TEA vượt trội fixed threshold trong mọi tỷ lệ labeled data. Backbone DLA tốt hơn ResNet18 cho bài toán phát hiện polyp. Visualizations cho thấy mô hình tập trung đúng vùng polyp. Heatmap predictions rõ ràng và tập trung tại điểm trung tâm. Phương pháp ổn định qua nhiều lần chạy với seeds khác nhau.

VI. Ứng Dụng Thực Tiễn Trong Nội Soi Tiêu Hóa

Phương pháp phát hiện polyp bán giám sát có tiềm năng ứng dụng cao. Hệ thống hỗ trợ bác sĩ nội soi phát hiện polyp thời gian thực. Giảm tỷ lệ bỏ sót polyp trong quá trình thăm khám. Cải thiện chất lượng chẩn đoán và phòng ngừa ung thư đại trực tràng. Chi phí triển khai thấp nhờ giảm nhu cầu gán nhãn. Chỉ cần một lượng nhỏ dữ liệu có nhãn từ chuyên gia. Phần lớn dữ liệu nội soi có sẵn được tận dụng. Transfer learning từ models pretrained giúp khởi đầu tốt. Fine-tuning trên dữ liệu bệnh viện cụ thể tăng độ chính xác. Tốc độ xử lý real-time không làm gián đoạn quy trình nội soi. Giao diện hiển thị cảnh báo trực quan khi phát hiện polyp. Hệ thống ghi lại vị trí polyp để bác sĩ xem xét kỹ hơn.

6.1. Tích Hợp Vào Hệ Thống Nội Soi

Tích hợp mô hình vào thiết bị nội soi hiện đại. Video stream từ camera được xử lý frame-by-frame. Mỗi frame qua mô hình CenterNet++ để phát hiện polyp. Heatmap predictions được overlay lên hình ảnh gốc. Bác sĩ nhìn thấy cảnh báo trực quan khi có polyp. Hệ thống không thay thế mà hỗ trợ quyết định của bác sĩ. Giảm mệt mỏi và tăng độ tập trung trong ca nội soi dài. Đặc biệt hữu ích với bác sĩ ít kinh nghiệm hoặc đang đào tạo.

6.2. Triển Khai Với Dữ Liệu Hạn Chế

Bệnh viện thường có nhiều video nội soi chưa được gán nhãn. Phương pháp học bán giám sát tận dụng nguồn tài nguyên này. Chỉ cần chuyên gia gán nhãn một phần nhỏ dữ liệu. Phần còn lại tự động học qua pseudo labeling với DTP. Giảm đáng kể thời gian và chi phí chuẩn bị dữ liệu. Mô hình có thể cập nhật liên tục khi có thêm dữ liệu. Active learning chọn samples khó để chuyên gia gán nhãn. Chu trình cải thiện liên tục nâng cao hiệu suất theo thời gian.

6.3. Hướng Phát Triển Và Cải Tiến

Nghiên cứu tương lai có thể mở rộng sang phân loại chi tiết polyp. Phân biệt polyp lành tính và ác tính từ hình ảnh nội soi. Tích hợp thêm thông tin từ nhiều nguồn như lâm sàng, tiền sử. Cải thiện kiến trúc backbone với models mới như Vision Transformers. Áp dụng techniques như self-supervised learning và contrastive learning. Mở rộng sang các bệnh lý khác trong đường tiêu hóa. Phát triển models đa nhiệm phát hiện nhiều loại bất thường cùng lúc. Tối ưu hóa để chạy trên edge devices cho độ trễ thấp nhất.

24/03/2026

Xem trước tài liệu

Tải đầy đủ để xem toàn bộ nội dung

Semi supervised end to end polyp detection

Tải xuống file đầy đủ để xem toàn bộ nội dung

Tải đầy đủ (56 trang)

Trích đoạn nội dung luận án

Tải xuống để đọc toàn bộ

Hanoi University of Science and Technology School of Information and Communication Technology Master Thesis in Data Science Semi-Supervised end-to-end polyp detection NGUYEN HONG SON son.vn Supervisor: Dr. Dinh Viet Sang Hanoi 10-2023 Author’s Declaration I hereby declare that I am the sole author of this thesis. The results in this work are not complete copies of any other works. STUDENT Nguyen Hong Son Contents Contents List of Figures List of Tables 1 Introduction 1 1.4 Outline of the thesis .3 Semi-supervised Learning .1 Object Detection Problem .2 Semi supervised Learning .3 Semi-supervised Object Detection (SSOD) .2 Backbone used in CenterNet baseline .3 Feature Pyramid Network (FPN) .2 Improving Baseline model .3 Dense Target Producer (DTP) .1 Pseudo-Labeling Framework .2 Disadvantages of Pseudo-box Labels .1 CenterNet++ improvement results.

41 5 Conclusion 44 Bibliography 45 List of Figures 1.1 Proportion of CRC compared to other cancer diseases1 .1 Illustration about supervised, unsupervised and semi-supervised learn- ing.2 Two output of Object detection model: boxes and categories 3 .3 Anchor boxes in the object detection task.1 Architecture of Residual Block .2 Architecture of ResNet18 5 .3 Architecture of DLA 6 .4 An example of Hourglass network architecture used for segmentation task.6 Overview architecture of CenterNet++.7 Four module types used in the backbone of CenterNet++ .8 Architecture of ASF module .9 The overview of our proposed pipeline for unlabeled data compared with existing pseudo-box based pipeline. For each iteration, Dense Target Producer (DTP) is generated by the teacher model on unla- beled images. DTPs then use for the student model to calculate the unsupervised loss. The total loss is the sum of supervised loss and unsupervised loss.

Note that DTP does not need any postprocessing steps.10 How object detection methods get positive samples: (b) Anchor-based and anchor-free with NMS assign all pixels inside ground truth box (c) Heatmap-based focus only on the center pixel of the ground truth box. boxes with heatmap-based method .11 Comparisons between (b) foreground pixels assigned by ground truth boxes and (c) foreground pixels assigned by pseudo-boxes .12 Example of describing heatmap values at the beginning and end of the training process.1 Some examples of images in the PolypsSet dataset .2 Examples of IOU between two boxes .3 Example of Precision-Recall curve. The mean Average Precision (mAP) is the area under the curve.4 Examples of Mosaic augmentations .5 Visualizations for PolypsSet 10% two-class. From left to right: With- out DTP (supervised), with DTP and label.

42 List of Tables 4.1 Comparison between CenterNet++ and other object detectors for fully supervised learning on the single-class PolypsSet dataset.2 Comparison between CenterNet++ and other object detectors for fully supervised learning on the two-class PolypsSet dataset.3 Experimental results for semi-supervised learning setting on the Polyps- Set dataset.4 Comparisons on the single-class PolypsSet dataset with semi-supervised learning setting using 10% of training data as labeled .5 Experiments on diﬀerent backbones of CenterNet++ on the two-class PolypsSet dataset.6 Eﬀectiveness of ASF module and Mosaic augmentations on the single- class PolypsSet dataset.7 Eﬀectiveness of using TEA compared to ﬁxed thresholds. Experi- ments were conducted on the single-class PolypsSet dataset with 1% data used as labeled. 43 List of abbreviations AI Artiﬁcial Intelligence ANN Artiﬁcial Neural Network CNN Convolutional Neural Network CRC Colorectal Cancer DL Deep Learning DTP Dense Target Producer FC Fully Connected FPN Feature Pyramid Network FPS Frame per second FCOS Fully Convolutional One-Stage Object Detection IOU Intersection Over Union NMS Non Maximum Suppression SSD Single Shot Detection SSOD Semi-supervised Object detection TEA Threshold Epoch Adaptive YOLO You Only Look One Chapter 1 Introduction 1.1 Overview Colorectal Cancer (CRC) is one of the most common types of cancer today. When considering cancer diseases speciﬁcally, it ranks third in terms of incidence and even second in terms of mortality.

It is estimated that in 2023, there will be approximately 150,000 new cases and 52,000 deaths in the United States [1]. In Vietnam, CRC consistently ranks among the top 10 most common cancer diseases, with the fourth- highest diagnosis rate for males and the third-highest for females. The incidence rate in the population is about 10.1 provides a clearer view of the number and proportion of CRC cases compared to other cancer diseases. The danger of CRC lies in the fact that when symptoms begin to appear and the patient decides to seek medical attention, the disease has already progressed to later stages.

CRC is divided into 5 stages, which increase as the disease worsens. If the disease is detected in the early stages, the 5-year survival rate can be as high as 90%. However, in reality, only about 4 out of 10 cases are detected in the early stages. Therefore, to reduce the incidence and mortality rates, eﬀorts are often made to detect and remove early signs of precancerous abnormalities.

Among these abnormalities, polyps inside the digestive tract, mainly in the colon (referred to as polyps), are considered the most signiﬁcant cause. Polyps are abnormal growth of tissue that protrudes from the lining of the digestive tract. Polyps are typically divided into two types: neoplastic and non-neoplastic. Non-neoplastic polyps are benign and can be further divided into hyperplastic, in- ﬂammatory, and hamartomatous types.

Neoplastic polyps are malignant and have the potential to develop into cancer. They can be further divided into adenomatous and serrated types. To detect polyps, doctors use a method called gastrointesti- nal endoscopy. This involves using an endoscope - a tube with a light and camera 1 Source: https://gco.fr/today/data/factsheets/populations/704-viet-nam-fact- sheets.1: Proportion of CRC compared to other cancer diseases1 attached that projects images onto a color TV screen - to examine the inside of the digestive tract.

The eﬀectiveness of endoscopy depends on the skill of the doc- tor performing the procedure. According to statistics [2], about 25% of polyps are missed during endoscopy, which poses a signiﬁcant risk to the patient. In order to reduce this high-risk rate, attention is paid to two things: • Improve the quality of machines and tools used for endoscopy. • Using Artiﬁcial Intelligence (AI) or speciﬁcally Deep Learning (DL) models as a diagnostic aid.

This second direction has shown a relatively signiﬁcant eﬀect, reducing the miss rate by up to 50% [3]. The problem used to support colonoscopy is called Polyps detection. Its goal is to locate all polyps in a frame or a video. DL models of Polyps detection task need to use a large amount of labeled data during training in order to perform well.

This has faced a number of challenges such as: • Lack of availability of public datasets. • Labeling this data requires someone knowledgeable about polyps, usually a doctor. • It is sometimes diﬃcult for labelers to agree on the size and type of polyps in a data sample. In addition, the characteristics of polyps such as small size, diverse shapes, and colors also pose challenges in the labeling process.

Therefore, the amount of labeled data used for polyp detection is quite limited. Conversely, the number of endoscopy videos is very large and continuously increasing, which is considered a huge amount 2 of unlabeled data. This is where semi-supervised object detection (SSOD) methods come into play. The models applied in SSOD are usually a type of well-known object detection models [4, 5, 6].

These models all have at least one of two components, Anchor and Non-maximum suppression (NMS). Although these components help the model achieve good performance, they signiﬁcantly reduce inference speed. A new type of model that does not use Anchor or NMS, called heatmap-based end- to-end [7, 8], has been proposed to address these issues. In this study, we applied semi-supervised learning to a heatmap-based end-to-end model to take advantage of both.2 Objectives The objective of this study is to eﬀectively apply SSOD to a heatmap-based end-to- end model in order to achieve a highly accurate and fast model.

Considering speciﬁc datasets, the model demonstrates superior results compared to existing methods. We hope that our model can be applied to many endoscopes in hospitals.3 Main contributions The main contributions of this study: • We propose a novel heatmap-based end-to-end model called CenterNet++ to improve the accuracy and inference speed over the original CenterNet model. • We propose a novel SSOD method called Dense Target Producer (DTP), which can perform end-to-end without the need for any non-diﬀerentiable postpro- cessing steps, and can be applied to heatmap-based end-to-end object detec- tors. To my knowledge, this is the ﬁrst attempt to apply SSOD to a heatmap- based end-to-end model.

• We propose a dynamic thresholding procedure called Threshold Epoch Adap- tor (TEA) to adaptively ﬁlter unreliable pseudo labels based on the learning status of the models. • Conduct experiments on a large benchmark dataset, namely PolypsSet. The results show that our DTP improves the AP performance compared to the supervised baseline model and outperforms other SSOD methods.4 Outline of the thesis The rest of this thesis is organized as follows: 3 Chapter 2 presents an overview and related work of this study’s ﬁeld. Chapter 3 describe in detail the method which apply SSOD to heatmap-based end- to-end model.

Chapter 4 presents the content of the experiments, the results obtained, and some ablation studies results. Chapter 5 concludes the thesis. 4 Chapter 2 Theoratical basis 2.1 Learning Type Based on the type of data input into the machine learning model, it can be cat- egorized into several learning types, namely supervised, unsupervised, and semi- supervised learning. Additionally, there exist other learning types such as self- supervised, weakly-supervised, and reinforcement learning, which, however, are not mentioned within the scope of this research.1: Illustration about supervised, unsupervised and semi-supervised learn- ing.1 Supervised Learning Supervised learning utilizes training data consisting of input-output pairs, where the models learn from this labeled data to make predictions for future data.

During 1 Source: https://blog.com/what-is-semi-supervised-learning/ 5 the learning process, the model predicts the output for the training data and then calculates the adjustment amount based on the diﬀerence between the predicted out- put and the actual label. This adjustment helps improve the model’s performance. There are two main tasks in supervised learning: • Classiﬁcation: The objective of the problem is to ﬁnd a model that correctly assigns data samples to their respective classes. The label in this problem is an identiﬁer (usually a numerical value) for the class to which the data belongs, and the model aims to accurately predict this label for unseen data samples.

Classiﬁcation has diﬀerent use cases, such as spam ﬁltering, customer behavior prediction, and document classiﬁcation. • Regression: The objective of the problem is to develop a model that synthe- sizes the features of the input parameters to generate continuous real-valued output results. It aids in the forecasting of continuous variables, such as Mar- ket Trends and Home Prices.2 Unsupervised Learning Unsupervised learning uses machine learning algorithms to analyze and cluster unla- beled datasets. These algorithms discover hidden patterns or data groupings without the need for human intervention.

Unsupervised learning models are utilized for three main tasks—clustering, association, and dimensionality reduction. • Clustering is a technique that groups unlabeled data based on their similari- ties or diﬀerences. Clustering algorithms are used to process raw, unclassiﬁed data objects into groups represented by structures or patterns in the informa- tion. Clustering algorithms can be categorized into a few types, speciﬁcally exclusive, overlapping, hierarchical, and probabilistic.

• Association Rules is a rule-based method for ﬁnding relationships between variables in a given dataset. These methods are frequently used for market bas- ket analysis, allowing companies to better understand relationships between diﬀerent products. Examples of this can be seen in Amazon’s “Customers Who Bought This Item Also Bought” or Spotify’s ”Discover Weekly” playlist. While more data generally yields more accurate results, it can also impact the performance of machine learning algorithms (e.

overﬁtting) and it can also make it diﬃcult to visualize datasets. Dimensional- ity reduction is a technique used when the number of features, or dimensions, in a given dataset, is too high.

Nội dung được bảo vệ bản quyền — Tải xuống đầy đủ

Câu hỏi thường gặp

Luận án "Phát hiện polyp bán giám sát từ đầu đến cuối" nghiên cứu về vấn đề gì?

Luận án Thạc sĩ Khoa học Dữ liệu về phát hiện polyp bán giám sát. Phương pháp Dense Target Producer cải thiện độ chính xác phát hiện ung thư đại trực tràng.

Luận án "Phát hiện polyp bán giám sát từ đầu đến cuối" được bảo vệ tại trường nào?

Luận án này được bảo vệ tại Hanoi University of Science and Technology. Năm bảo vệ: 2023.

Luận án "Phát hiện polyp bán giám sát từ đầu đến cuối" thuộc chuyên ngành gì?

Luận án "Phát hiện polyp bán giám sát từ đầu đến cuối" thuộc chuyên ngành Data Science. Danh mục: Trí Tuệ Nhân Tạo.

Luận án "Phát hiện polyp bán giám sát từ đầu đến cuối" có bao nhiêu trang?

Luận án "Phát hiện polyp bán giám sát từ đầu đến cuối" có 56 trang. Bạn có thể xem trước một phần tài liệu ngay trên trang web trước khi tải về.

Cách tải luận án "Phát hiện polyp bán giám sát từ đầu đến cuối" về máy như thế nào?

Để tải luận án về máy, bạn nhấn nút "Tải xuống ngay" trên trang này, sau đó hoàn tất thanh toán phí lưu trữ. File sẽ được tải xuống ngay sau khi thanh toán thành công. Hỗ trợ qua Zalo: 0559 297 239.

Luận án liên quan

Chia sẻ tài liệu: Facebook Twitter

Mục lục chi tiết

Tóm tắt nội dung