The effect of data standardization in cluster analysis
| dc.contributor.author | NOGUEIRA, A.L. | pt_BR |
| dc.contributor.author | MUNITA, C.S. | pt_BR |
| dc.coverage | Nacional | pt_BR |
| dc.date.accessioned | 2021-08-02T18:22:57Z | |
| dc.date.available | 2021-08-02T18:22:57Z | |
| dc.date.issued | 2021 | pt_BR |
| dc.description.abstract | The application of multivariate techniques to experimental results requires a responsibility on behalf of the researcher to understand, evaluate and interpret their results, especially the ones that are more complex. The objective of this article is to evaluate the impact of three standardization techniques on the formation of clusters by means of the Kohonen neural network were studied. The standardization techniques studied were logarithm (log), generalized-log and improved minimum-maximum. The studies were performed using two different databases consisting of 298, named B1, and 146 samples, named B2. The B1 dataset is formed by samples that form two cluster very close. However, the B2 dataset form three diferent and separated cluster. The mass fractions of As, Ce, Cr, Cs, Eu, Fe, Hf, K, La, Lu, Na, Nd, Sc, Sm, Tb, Th, U, and Yb of each sample were determined by instrumental neutron activation analysis, INAA. Three validation indices : Jaccard, Fowlkes-Mallows and Rand were performed on the dataset. The results suggest that when the cluster are close, the improved minimum-maximum satandardization is better than the logarithm and generalized-log. However, when the cluster are separated, the logarithm and generalized-log are better than the improved minimum-maximum technique. | pt_BR |
| dc.format.extent | 1-15 | pt_BR |
| dc.identifier.citation | NOGUEIRA, A.L.; MUNITA, C.S. The effect of data standardization in cluster analysis. <b>Brazilian Journal of Radiation Sciences</b>, v. 9, n. 1A, p. 1-15, 2021. DOI: <a href="https://dx.doi.org/10.15392/bjrs.v9i1A.1324">10.15392/bjrs.v9i1A.1324</a>. Disponível em: http://repositorio.ipen.br/handle/123456789/32092. | |
| dc.identifier.doi | 10.15392/bjrs.v9i1A.1324 | pt_BR |
| dc.identifier.fasciculo | 1A | pt_BR |
| dc.identifier.issn | 2319-0612 | pt_BR |
| dc.identifier.orcid | 0000-0003-0546-1044 | pt_BR |
| dc.identifier.percentilfi | Sem Percentil | pt_BR |
| dc.identifier.uri | http://repositorio.ipen.br/handle/123456789/32092 | |
| dc.identifier.vol | 9 | pt_BR |
| dc.relation.ispartof | Brazilian Journal of Radiation Sciences | pt_BR |
| dc.rights | openAccess | pt_BR |
| dc.source | Meeting on Nuclear Applications (ENAN), 14th, October 21-25, 2019, Santos, SP | pt_BR |
| dc.subject | algorithms | |
| dc.subject | cluster analysis | |
| dc.subject | computer codes | |
| dc.subject | datasets | |
| dc.subject | neural networks | |
| dc.subject | neutron activation analysis | |
| dc.subject | standardization | |
| dc.subject | statistical models | |
| dc.title | The effect of data standardization in cluster analysis | pt_BR |
| dc.type | Artigo de periódico | pt_BR |
| dspace.entity.type | Publication | |
| ipen.autor | CASIMIRO JAYME ALFREDO SEPULVEDA MUNITA | |
| ipen.autor | ANDRE LUIZ NOGUEIRA | |
| ipen.codigoautor | 1325 | |
| ipen.codigoautor | 14760 | |
| ipen.contributor.ipenauthor | CASIMIRO JAYME ALFREDO SEPULVEDA MUNITA | |
| ipen.contributor.ipenauthor | ANDRE LUIZ NOGUEIRA | |
| ipen.date.recebimento | 21-08 | |
| ipen.identifier.fi | Sem F.I. | pt_BR |
| ipen.identifier.ipendoc | 27863 | pt_BR |
| ipen.type.genre | Artigo | |
| relation.isAuthorOfPublication | 9f2b42a5-30f7-4805-96a1-1cde2b1a405b | |
| relation.isAuthorOfPublication | 83449424-ae5d-459e-9a14-f8253e5ed654 | |
| relation.isAuthorOfPublication.latestForDiscovery | 83449424-ae5d-459e-9a14-f8253e5ed654 | |
| sigepi.autor.atividade | MUNITA, C.S.:1325:320:N | pt_BR |
| sigepi.autor.atividade | NOGUEIRA, A.L.:14760:320:S | pt_BR |