Searching for just a few words should be enough to get started. If you need to make more complex queries, use the tips below to guide you.
Article type: Research Article
Authors: Wang, Baoquana; b; c | Jiang, Tonghaia; c; * | Zhou, Xia; c | Ma, Boa; b; c | Zhao, Fana; c | Wang, Yia; c
Affiliations: [a] The Xinjiang Technical Institute of Physics and Chemistry, CAS, Urumqi, Xinjiang 830011, China | [b] University of Chinese Academy of Sciences, Beijing 100049, China | [c] Xinjiang Laboratory of Minority Speech and Language Information Processing, Urumqi, Xinjiang 830011, China
Correspondence: [*] Corresponding author: Tonghai Jiang, The Xinjiang Technical Institute of Physics and Chemistry, CAS, Urumqi, Xinjiang 830011, China. Tel.: +86 991 3837795; E-mail: [email protected].
Abstract: For abnormal detection of time series data, the supervised anomaly detection methods require labeled data. While the range of outlier factors used by the existing semi-supervised methods varies with data, model and time, the threshold for determining abnormality is difficult to obtain, in addition, the computational cost of the way to calculate outlier factors from other data points in the data set is also very large. These make such methods difficult to practically apply. This paper proposes a framework named LSTM-VE which uses clustering combined with visualization method to roughly label normal data, and then uses the normal data to train long short-term memory (LSTM) neural network for semi-supervised anomaly detection. The variance error (VE) of the normal data category classification probability sequence is used as outlier factor. The framework enables anomaly detection based on deep learning to be practically applied and using VE avoids the shortcomings of existing outlier factors and gains a better performance. In addition, the framework is easy to expand because the LSTM neural network can be replaced with other classification models. Experiments on the labeled and real unlabeled data sets prove that the framework is better than replicator neural networks with reconstruction error (RNN-RS) and has good scalability as well as practicability.
Keywords: Time series data, outlier detection, long short-term memory, clustering
DOI: 10.3233/JCM-204699
Journal: Journal of Computational Methods in Sciences and Engineering, vol. 21, no. 4, pp. 875-890, 2021
IOS Press, Inc.
6751 Tepper Drive
Clifton, VA 20124
USA
Tel: +1 703 830 6300
Fax: +1 703 830 2300
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
IOS Press
Nieuwe Hemweg 6B
1013 BG Amsterdam
The Netherlands
Tel: +31 20 688 3355
Fax: +31 20 687 0091
[email protected]
For editorial issues, permissions, book requests, submissions and proceedings, contact the Amsterdam office [email protected]
Inspirees International (China Office)
Ciyunsi Beili 207(CapitaLand), Bld 1, 7-901
100025, Beijing
China
Free service line: 400 661 8717
Fax: +86 10 8446 7947
[email protected]
For editorial issues, like the status of your submitted paper or proposals, write to [email protected]
如果您在出版方面需要帮助或有任何建, 件至: [email protected]