Preview

Modeling and Analysis of Information Systems

Advanced search

Application of tensors in multivariate component analysis of categorized features

https://doi.org/10.18255/1818-1015-2025-4-330-359

Abstract

When modeling social processes and phenomena, it is often necessary to process data related to categorized features, identify cause-and-effect relationships between such data, and determine the most significant indicators. A study of existing approaches to analyzing dependencies between categorized variables revealed several problems when applying these methods to multidimensional categorized data (tensors). Therefore, this article proposes an approach to studying dependencies between such variables using multidimensional component analysis. This approach involves applying tensor unfolding matrices obtained for each of its axes (categorized features). The method allows for the construction of integral characteristics (components) based on the elements of the original tensor, the formation of component loading matrices, and the calculation of the tensor core, which has fewer gradations of categorized features (lower number of dimensions in the tensor axes) than the original tensor. The article proposes a method for ranking the gradations of categorized variables by the degree of cumulative influence of component loadings, based on the calculation of vector norms. The described approach to studying dependencies between multidimensional categorized variables is demonstrated using a three-dimensional tensor with the shape (4;10;10) and categorized features: nosology group, field of activity, and group of professionally significant qualities. The algorithm for analyzing multidimensional categorized data using multidimensional component analysis, discussed in this article, is intended to be incorporated as an analytical tool into the regional information and analytical portal "PERSPEKTIVA-PRO." This tool can be used to develop a digital support trajectory for people with disabilities and special needs, taking into account their personal and variable characteristics.

About the Author

Alexander A. Banin
Cherepovets State University
Russian Federation


References

1. O. A. Denisova, Tsifrovye resursy professionalizatsii molodykh invalidov. Cherepovets State University, 2021.

2. A. A. Banin and O. Y. Lyaginova, “Tensor data model for the development of trajectories for accompanying physically challenged people and people with disabilities due to their variable characteristics,” Cherepovets State University Bulletin, no. 1, pp. 7–25, 2024, doi: 10.23859/1994-0637-2024-1-118-17.

3. A. Demianova and A. Lukiyanova, “The impact of disability status on labor supply in Russia,” Applied Econometrics, vol. 44, pp. 50–74, 2016.

4. D. Fantazzini, M. Shakleina, and N. Yuras, “Big Data for computing social well-being indices of the Russian population,” Applied Econometrics, vol. 50, pp. 43–66, 2018.

5. E. Kossova and M. Kosorukova, “Estimation of the treatment effect of higher education on health: Comparison of the multivariate recursive probit model and matching,” Applied Econometrics, vol. 69, pp. 65–90, 2023.

6. A. Dubnovitskaya and K. Furmanov, “Job satisfaction in Russia: Wages, working conditions and promotion opportunities,” Applied Econometrics, vol. 72, pp. 121–139, 2023.

7. S. A. Ayvazyan and V. S. Mkhitaryan, Prikladnaya statistika i osnovy ekonometriki. Uchebnik dlya vuzov. YuNITI, 1998.

8. P. A. Popova and A. N. Rotmistrov, “Regression with categorical predictors: criticizing dummy-variable usage and log-linear analysis as an alternative approach,” Sociological Journal, vol. 22, no. 3, pp. 8–31, 2016, doi: 10.19181/socjour.2016.22.3.4583.

9. V. M. Bure, O. A. Glivinskaya, and A. V. Sotnikov, “Log-linear database analysis by cardiac infarction data among young and middle age patients,” Vestnik of Saint Petersburg University, vol. 10, no. 1, pp. 35–41, 2010.

10. O. N. Dolinina and R. N. Karimov, “Methods of presentation and joint processing of multidimensional data for objects of numeric and non-numeric origin,” Vestnik of Saratov State Technical University, no. 1, pp. 100–109, 2006.

11. E. E. Fomina, “The possibility of log-linear analysis for survey results processing,” PNRPU Sociology and Economics Bulletin, no. 3, pp. 197–211, 2018, doi: 10.15593/2224-9354/2018.3.16 УДК 303.621.3-047.44.

12. E. E. Fomina, “Methods of analysis of categorical variables in sociological research,” Proceedings of Higher Educational Institutions. Sociology. Economy. Politics, vol. 12, pp. 39–51, 2019.

13. A. Cichocki, N. Lee, I. Oseledets, A.-H. Phan, Q. Zhao, and D. P. Mandic, “Tensor Networks for Dimensionality Reduction and Large-scale Optimization: Part 1 Low-Rank Tensor Decompositions,” Foundations and Trends® in Machine Learning, vol. 9, no. 4--5, pp. 249–429, 2016, doi: 10.1561/2200000059.

14. Y. R. Magnus and K. Neydekker, Matrix differential calculation with applications in statistics and econometrics, 3Rd edition. Wiley, 2019.

15. G. H. Golub and V. Loan, Matrix Computations, 4Th edition. JHU press, 2013.

16. M. Linting, J. J. Meulman, P. J. F. Groenen, and A. J. van der Koojj, “Nonlinear principal components analysis: Introduction and application,” Psychological Methods, vol. 12, no. 3, pp. 336–358, 2007, doi: 10.1037/1082-989X.12.3.336.


Review

For citations:


Banin A.A. Application of tensors in multivariate component analysis of categorized features. Modeling and Analysis of Information Systems. 2025;32(4):330-359. (In Russ.) https://doi.org/10.18255/1818-1015-2025-4-330-359

Views: 95


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1818-1015 (Print)
ISSN 2313-5417 (Online)