We gratefully acknowledge support from
the Simons Foundation and member institutions.
Full-text links:

Download:

Current browse context:

stat.ME

Change to browse by:

References & Citations

Bookmark

(what is this?)
CiteULike logo BibSonomy logo Mendeley logo del.icio.us logo Digg logo Reddit logo

Statistics > Methodology

Title: Correspondence analysis: handling cell-wise outliers via the reconstitution algorithm

Abstract: Correspondence analysis (CA) is a popular technique to visualize the relationship between two categorical variables. CA uses the data from a two-way contingency table and is affected by the presence of outliers. The supplementary points method is a popular method to handle outliers. Its disadvantage is that the information from entire rows or columns is removed. However, outliers can be caused by cells only. In this paper, a reconstitution algorithm is introduced to cope with such cells. This algorithm can reduce the contribution of cells in CA instead of deleting entire rows or columns. Thus the remaining information in the row and column involved can be used in the analysis. The reconstitution algorithm is compared with two alternative methods for handling outliers, the supplementary points method and MacroPCA. It is shown that the proposed strategy works well.
Subjects: Methodology (stat.ME); Applications (stat.AP)
Cite as: arXiv:2404.17380 [stat.ME]
  (or arXiv:2404.17380v1 [stat.ME] for this version)

Submission history

From: Qianqian Qi [view email]
[v1] Fri, 26 Apr 2024 12:55:18 GMT (134kb,D)

Link back to: arXiv, form interface, contact.