Keywords and phrases: bivariate copula, Kendall’s tau, dimensionality reduction, correlation, feature extraction, PCA.
Received: November 23, 2022; Accepted: February 7, 2023; Published: February 14, 2023
How to cite this article: Karima Femmam and Smain Femmam, Improving the dimensionality reduction of PCA using bivariate copulas, Advances and Applications in Statistics 86(1) (2023), 47-64. http://dx.doi.org/10.17654/0972361723015
This Open Access Article is Licensed under Creative Commons Attribution 4.0 International License
References:
[1] A. Abid, M. J. Zhang, V. K. Bagaria and J. Zou, Exploring patterns enriched in a dataset with contrastive principal component analysis, Nature Communications 9(1) (2018), 1-7. [2] A. Liaw and M. Wiener, Classification and regression by random Forest, JR News 2(3) (2002), 18-22. [3] A. Y. Keith and A. G. Paul, The Guttman-Kaiser criterion as a predictor of the number of common factors, Journal of the Royal Statistical Society, Series D (The Statistician) 31(3) (1982), 221-229. [4] B. Egger, D. Kaufmann, S. Schönborn, V. Roth and T. Vetter, Copula eigenfaces, Proc. International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications, (GRAPP), 2016, pp. 50-58. [5] C. Genest, B. Rémillard and D. Beaudoin, Goodness-of-fit tests for copulas: a review and a power study, Insurance Math. Econom. 44(2) (2009), 199-213. [6] D. Dua and C. Graff, UCI, Machine Learning Repository, 2017. http://archive.ics.uci.edu/ml. [7] F. Badakhshan Farahabadi, K. Vajaragh and R. Farnoosh, Dimension reduction big data using recognition of data features based on copula function and principal component analysis, Advances in Mathematical Physics 2021 (2021), 1-8. [8] H. Hotelling, Analysis of a complex of statistical variables into principal components, Journal of Educational Psychology 24(6) (1933), 417-441. [9] I. T. Jolliffe, Rotation of principal components: choice of normalization constraints, J. Appl. Stat. 22(1) (1995), 29-35. [10] K. Alboukadel and M. Fabian, factoextra: Extract and Visualize the Results of Multivariate Data Analyses, R package version 1.0.7., 2020. https://CRAN.R-project.org/package=factoextra. [11] K. Femmam and S. Femmam, Fast and efficient feature selection method using bivariate copulas, Journal of Advances in Information Technology 13(3) (2022), 301-305. [12] K. Max, caret: Classification and Regression Training, R Package Version 6.0-92, 2022. https://CRAN.R-project.org/package=caret. [13] K. Pearson, Principal components analysis, The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 6(2) (1901), 559-572. [14] M. Collins, S. Dasgupta and R. E. Schapire, A generalization of principal components analysis to the exponential family, Advances in Neural Information Processing Systems 14 (2001), 617-624. [15] M. E. Tipping and C. M. Bishop, Probabilistic principal component analysis, J. R. Stat. Soc. Ser. B Stat. Methodol. 61(3) (1999), 611-622. [16] M. Hofert, I. Kojadinovic, M. Maechler and J. Yan, Copula: Multivariate Dependence with Copulas, R package version 1.0-0, 2020. https://CRAN.R-project.org/package=copula. [17] N. B. Erichson, P. Zheng, K. Manohar, S. L. Brunton, J. N. Kutz and A. Y. Aravkin, Sparse principal component analysis via variable projection, SIAM J. Appl. Math. 80(2) (2020), 977-1002. [18] R. B. Nelsen, An Introduction to Copulas, Springer Science & Business Media, 2007. [19] R. Vidal, Y. Ma and S. Shankar, Generalized principal component analysis (GPCA), IEEE Transactions on Pattern Analysis and Machine Intelligence 27(12) (2005), 1945-1959. [20] Y. Zhao and M. Udell, Missing value imputation for mixed data via Gaussian copula, Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2020, pp. 636-646.
|