Manifold learning-based unsupervised feature selection for structural health monitoring
Tingna Wang, Limin Sun
Abstract. Feature selection plays an important role in enhancing the performance of a data-driven structural health monitoring (SHM) system by selecting more informative and lower-dimensional features from the original ones. In this paper, an unsupervised feature selection method based on manifold learning (MLUFS) is proposed to identify feature sets that are beneficial to preserve the structure of the data. This method leverages nonlinear dimensionality reduction methods and canonical correlation analysis to address the challenges associated with acquiring costly labels in the SHM field. It is applicable to high-dimensional data that lie on a low-dimensional manifold within the ambient space. A case study on the handwritten digit datasets is presented to compare the classification performances corresponding to the features selected by the MLUFS method and a supervised feature selection method. The results show that the proposed method can provide feature sets with higher generalization ability compared to the selection method using labels. Another simulated acceleration dataset is used to demonstrate the effectiveness of the proposed method in anomaly detection for an SHM system.
Keywords
Multivariate Filter Method, Unsupervised Feature Selection, Manifold Learning, Structural Health Monitoring, Anomaly Detection
Published online 3/25/2025, 8 pages
Copyright © 2025 by the author(s)
Published under license by Materials Research Forum LLC., Millersville PA, USA
Citation: Tingna Wang, Limin Sun, Manifold learning-based unsupervised feature selection for structural health monitoring, Materials Research Proceedings, Vol. 50, pp 82-89, 2025
DOI: https://doi.org/10.21741/9781644903513-9
The article was published as article 9 of the book Structural Health Monitoring
Content from this work may be used under the terms of the Creative Commons Attribution 3.0 license. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.
References
[1] Alves V, Cury A. An automated vibration-based structural damage localization strategy using filter-type feature selection. Mechanical Systems and Signal Processing. 2023;190:110145. https://doi.org/10.1016/j.ymssp.2023.110145
[2] Bee S, Poole J, Worden K, Dervilis N, Bull L. Multitask feature selection within structural datasets. Data-Centric Engineering. 2024;5:e4. https://doi.org/10.1017/dce.2024.1
[3] Buckley T, Ghosh B, Pakrashi V. A Feature Extraction & Selection Benchmark for Structural Health Monitoring. Structural Health Monitoring. 2023;22(3):2082-127. https://doi.org/10.1177/14759217221111141
[4] Bull LA, Rogers TJ, Wickramarachchi C, Cross EJ, Worden K, Dervilis N. Probabilistic active learning: an online framework for structural health monitoring. Mechanical Systems and Signal Processing. 2019;134:106294. https://doi.org/10.1016/j.ymssp.2019.106294
[5] Gharehbaghi VR, Farsangi EN, Yang TY, Hajirasouliha I. Deterioration and damage identification in building structures using a novel feature selection method. Structures. 2021;29:458-70. https://doi.org/10.1016/j.istruc.2020.11.040
[6] Solorio-Fernández S, Carrasco-Ochoa JA, Martínez-Trinidad JF. A review of unsupervised feature selection methods. Artificial Intelligence Review. 2020;53(2):907-48. https://doi.org/10.1007/s10462-019-09682-y
[7] Guyon, I.; Elisseeff, A.: An Introduction to Variable and Feature Selection. , 26 (2003).
[8] Mitra P, Murthy CA, Pal SK. Unsupervised feature selection using feature similarity. IEEE Transactions on Pattern Analysis and Machine Intelligence. 2002;24(3):301-12. https://doi.org/10.1109/34.990133
[9] Ferreira AJ, Figueiredo MAT. An unsupervised approach to feature discretization and selection. Pattern Recognition. 2012;45(9):3048-60. https://doi.org/10.1016/j.patcog.2011.12.008
[10] Wang S, Pedrycz W, Zhu Q, Zhu W. Unsupervised feature selection via maximum projection and minimum redundancy. Knowledge-Based Systems. 2015;75:19-29. https://doi.org/10.1016/j.knosys.2014.11.008
[11] Cai D, Zhang C, He X. Unsupervised feature selection for multi-cluster data. In Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. 2010;333-42. https://doi.org/10.1145/1835804.1835848
[12] Zhao Z, Wang L, Liu H, Ye J. On Similarity Preserving Feature Selection. IEEE Transactions on Knowledge and Data Engineering. 2013;25(3):619-32. https://doi.org/10.1109/TKDE.2011.222
[13] Hou C, Nie F, Li X, Yi D, Wu Y. Joint Embedding Learning and Sparse Regression: A Framework for Unsupervised Feature Selection. IEEE Transactions on Cybernetics. 2014;44(6):793-804. https://doi.org/10.1109/TCYB.2013.2272642
[14] Nie F, Zhu W, Li X. Unsupervised Feature Selection with Structured Graph Optimization. In Proceedings of the AAAI Conference on Artificial Intelligence. 2016;30(1). https://doi.org/10.1609/aaai.v30i1.10168
[15] Lin T, Zha H. Riemannian Manifold Learning. IEEE transactions on pattern analysis and machine intelligence. 2008;30(5):796-809. https://doi.org/10.1109/TPAMI.2007.70735
[16] Zhang S, Wang T, Worden K, Sun L, Cross EJ. Canonical-correlation-based fast feature selection for structural health monitoring. Mechanical Systems and Signal Processing. 2025;223:111895. https://doi.org/10.1016/j.ymssp.2024.111895
[17] Torgerson WS. Multidimensional scaling: I. Theory and method. Psychometrika. 1952;17(4):401-19. https://doi.org/10.1007/BF02288916
[18] Tenenbaum JB. A Global Geometric Framework for Nonlinear Dimensionality Reduction. Science. 2000;290(5500):2319-23. https://doi.org/10.1126/science.290.5500.2319
[19] Roweis ST. Nonlinear Dimensionality Reduction by Locally Linear Embedding. Science. 2000;290(5500):2323-6. https://doi.org/10.1126/science.290.5500.2323
[20] Belkin M, Niyogi P. Laplacian Eigenmaps for Dimensionality Reduction and Data Representation. Neural Computation. 2003;15(6):1373-96. https://doi.org/10.1162/089976603321780317
[21] Van der Maaten L, Hinton G. Visualizing Data using t-SNE. Journal of Machine Learning Research. 2008;9(86):2579-605.
[22] McInnes L, Healy J, Melville J. UMAP: Uniform Manifold Approximation and Projection for Dimension Reduction. arXiv; 2020. Available from: http://arxiv.org/abs/1802.03426
[23] Meilă M, Zhang H. Manifold learning: what, how, and why. arXiv; 2023. Available from: http://arxiv.org/abs/2311.03757
[24] Alimoğlu F, Alpaydin E. Combining multiple representations for pen-based handwritten digit recognition. Turkish Journal of Electrical Engineering and Computer Sciences. 2001;9(1):1-2.
[25] LeCun Y, Bottou L, Bengio Y, Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE. 1998;86(11):2278-324. https://doi.org/10.1109/5.726791
[26] Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V, Vanderplas J. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research. 2011;12:2825-30.
[27] Bao Y, Tang Z, Li H, Zhang Y. Computer vision and deep learning-based data anomaly detection method for structural health monitoring. Structural Health Monitoring. 2019;18(2):401-21. https://doi.org/10.1177/1475921718757405
[28] Ni F, Zhang J, Noori MN. Deep learning for data anomaly detection and data compression of a long‐span suspension bridge. Computer‐Aided Civil and Infrastructure Engineering. 2020;35(7):685-700. https://doi.org/10.1111/mice.12528