Evaluation of breast cancer molecular subtypes using artificial intelligence in micro-FTIR hyperspectral images

dc.contributor.advisorDenise Maria Zezellpt_BR
dc.contributor.authorDEL VALLE, MATHEUSpt_BR
dc.contributor.coadvisorEmerson Soares Bernardespt_BR
dc.coverageNacionalpt_BR
dc.date.accessioned2023-07-28T17:42:15Z
dc.date.available2023-07-28T17:42:15Z
dc.date.issued2023pt_BR
dc.description.abstractBreast cancer is the most incident cancer worldwide. The evaluation of molecular subtypes and their biomarkers plays an essential role in prognosis. The biomarkers used are Estrogen Receptor (ER), Progesterone Receptor (PR), Human Epidermal growth factor Receptor-type 2 (HER2), and Ki67. Based on these, subtypes are classified as Luminal A (LA), Luminal B (LB), HER2 subtype, and Triple-Negative Breast Cancer (TNBC). The gold standard for this analysis is histology and immunohistochemistry, semi-quantitative techniques that present inter-laboratory and inter-observer variations. The Fourier Transform Infrared micro-spectroscopy (micro-FTIR), which provides hyperspectral images with biochemical information of biological tissues, is applied together with artificial intelligence (AI) for cancer evaluation. In this thesis, twenty samples of two breast cancer cell lines, BT-474 and SK-BR-3, were used to define the optimal number of co-added scans for machine learning (ML) techniques. Linear Discriminant Analysis (LDA), Partial Least Squares Discriminant Analysis (PLS-DA), K-Nearest Neighbors (KNN), Support Vector Machines (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGB) models were used. Sixty hyperspectral images of 320x320 pixels were collected from thirty patients of a human breast biopsies microarray, each containing a breast cancer (CA) and an adjacent tissue (AT) core. Automated methods based on K-Means clustering were developed for data organization and pre-processing to one-dimensional (1D) and two-dimensional (2D) data. The dataset was used to train two new deep learning models for breast cancer subtype evaluation: CaReNet-V1, a 1D Convolutional Neural Network (CNN); and CaReNet-V2, a 2D CNN. All ML models achieved similar performances with the b256_064 (256 background scans and 64 sample scans), b256_128, and b128_128 groups, where the best accuracy of 0.995 was presented by the XGB model. The b256_064 was established as the ideal among the three due to the shortest acquisition time. The K-Means-based method enabled fully automated preprocessing and organization, improving data quality and optimizing CNN training. CaReNet-V1 effectively classified CA and AT (individual spectra test accuracy of 0.89), as well as HER2 and TNBC subtypes (0.83 and 0.86), with greater difficulty for LA and LB (0.74 and 0.68). The model enabled the evaluation of the most contributing wavenumbers to the predictions, providing a direct relationship with the biochemical content of the samples. CaReNet-V2 demonstrated better performance than 1D, with test accuracies above 0.84, and enabled the prediction of ER, PR, and HER2 levels, where borderline values showed lower performance (minimum accuracy of 0.54). The Ki67 percentage regression demonstrated an absolute mean error of 3.6%. On the other hand, its impact evaluation by wavenumber was inferior to 1D. Thus, this study indicates image-based AI techniques using micro-FTIR as potential providers of additional information to pathological reports, also serving as patient biopsy screening techniques.pt_BR
dc.description.notasgeraisTese (Doutorado em Tecnologia Nuclear)pt_BR
dc.description.notasteseIPEN/Tpt_BR
dc.description.sponsorshipFundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP)pt_BR
dc.description.sponsorshipCoordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)pt_BR
dc.description.sponsorshipConselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)pt_BR
dc.description.sponsorshipIDFAPESP: 05/51689-2; 17/50332-0pt_BR
dc.description.sponsorshipIDCAPES: 001; PROCAD 88881.068505/2014-01pt_BR
dc.description.sponsorshipIDCNPq: INCT-465763/2014-6; PQ-309902/2017-7; 142229/2019-9pt_BR
dc.description.teseinstituicaoInstituto de Pesquisas Energéticas e Nucleares - IPEN-CNEN/SPpt_BR
dc.format.extent126pt_BR
dc.identifier.citationDEL VALLE, MATHEUS. <b>Evaluation of breast cancer molecular subtypes using artificial intelligence in micro-FTIR hyperspectral images</b>. Orientador: Denise Maria Zezell. Coorientador: Emerson Soares Bernardes. 2023. 126 f. Tese (Doutorado em Tecnologia Nuclear) - Instituto de Pesquisas Energéticas e Nucleares - IPEN-CNEN/SP, São Paulo. DOI: <a href="https://dx.doi.org/10.11606/T.85.2023.tde-10072023-162427">10.11606/T.85.2023.tde-10072023-162427</a>. Disponível em: http://repositorio.ipen.br/handle/123456789/34178.
dc.identifier.doi10.11606/T.85.2023.tde-10072023-162427pt_BR
dc.identifier.urihttp://repositorio.ipen.br/handle/123456789/34178
dc.localSão Paulopt_BR
dc.rightsopenAccesspt_BR
dc.subjectmammary glands
dc.subjectneoplasms
dc.subjectpathological changes
dc.subjectmutagen screening
dc.subjectspectral reflectance
dc.subjectimage processing
dc.subjectdecision making
dc.subjectlearning
dc.subjectman-machine systems
dc.subjectartificial intelligence
dc.subjectneural networks
dc.subjectfourier transform spectrometers
dc.subjectmicrostructure
dc.subjectimage processing
dc.titleEvaluation of breast cancer molecular subtypes using artificial intelligence in micro-FTIR hyperspectral imagespt_BR
dc.title.alternativeAvaliação de subtipos moleculares de câncer de mama utilizando inteligência artificial em imagens hiperespectrais por micro-FTIpt_BR
dc.typeTesept_BR
dspace.entity.typePublication
ipen.autorMATHEUS DEL VALLE
ipen.codigoautor15209
ipen.contributor.ipenauthorMATHEUS DEL VALLE
ipen.date.recebimento23-07
ipen.identifier.ipendoc29802pt_BR
ipen.meioeletronicohttps://www.teses.usp.br/teses/disponiveis/85/85134/tde-10072023-162427/pt-br.phppt_BR
ipen.type.genreTese
relation.isAuthorOfPublicationfdd01116-8cc4-406a-aafb-606941dc28dc
relation.isAuthorOfPublication.latestForDiscoveryfdd01116-8cc4-406a-aafb-606941dc28dc
sigepi.autor.atividadeDEL VALLE, MATHEUS:15209:920:Spt_BR

Licença do Pacote

Agora exibindo 1 - 1 de 1
Carregando...
Imagem de Miniatura
Nome:
license.txt
Tamanho:
1.71 KB
Formato:
Item-specific license agreed upon to submission
Descrição:

Coleções