TY - JOUR
T1 - Visualizing Class Specific Heterogeneous Tendencies in Categorical Data
AU - Takagishi, Mariko
AU - Velden, Michel van de
N1 - Funding Information:
This work was supported by the Japan Society for the Promotion of Science KAKENHI grants 20K19755. Acknowledgments
Publisher Copyright:
© 2022 The Author(s). Published with license by Taylor & Francis Group, LLC.
PY - 2022
Y1 - 2022
N2 - In multiple correspondence analysis, both individuals (observations) and categories can be represented in a biplot that jointly depicts the relationships across categories and individuals, as well as the associations between them. Additional information about the individuals can enhance interpretation capacities, such as by including class information for which the interdependencies are not of immediate concern, but that facilitate the interpretation of the plot with respect to relationships between individuals and categories. This article proposes a new method which we call multiple-class cluster correspondence analysis that identifies clusters specific to classes. The proposed method can construct a biplot that depicts heterogeneous tendencies of individual members, as well as their relationships with the original categorical variables. A simulation study to investigate the performance of the proposed method and an application to data regarding road accidents in the United Kingdom confirms the viability of this approach. Supplementary materials for this article are available online.
AB - In multiple correspondence analysis, both individuals (observations) and categories can be represented in a biplot that jointly depicts the relationships across categories and individuals, as well as the associations between them. Additional information about the individuals can enhance interpretation capacities, such as by including class information for which the interdependencies are not of immediate concern, but that facilitate the interpretation of the plot with respect to relationships between individuals and categories. This article proposes a new method which we call multiple-class cluster correspondence analysis that identifies clusters specific to classes. The proposed method can construct a biplot that depicts heterogeneous tendencies of individual members, as well as their relationships with the original categorical variables. A simulation study to investigate the performance of the proposed method and an application to data regarding road accidents in the United Kingdom confirms the viability of this approach. Supplementary materials for this article are available online.
KW - Clustering
KW - Contingency table
KW - External information
KW - Multiple correspondence analysis
KW - Visualization
UR - http://www.scopus.com/inward/record.url?scp=85127366065&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85127366065&partnerID=8YFLogxK
U2 - 10.1080/10618600.2022.2035737
DO - 10.1080/10618600.2022.2035737
M3 - Article
AN - SCOPUS:85127366065
SN - 1061-8600
JO - Journal of Computational and Graphical Statistics
JF - Journal of Computational and Graphical Statistics
ER -