Expert Systems With Applications 200 (2022) 1169413
information as a set of time series with high noise and high volatility, the technical indicators calculated from the original data
may also be mixed with noise, affecting the model performance. Thus, in this study, we introduce the wavelet analysis and test a variety of wavelet basis functions to process stock price data. Then we use the denoised stock price information to calculate technical indicators. In this way, technical indicators become more effective. Besides the improved indicators as the features, another critical step in data preprocessing is to find proper feature combinations. Although improved indicators have
been established as features, different combinations of features still greatly influence the prediction results. There maybe redundant or irrelevant features in the feature set. Removing those less relevant features can reduce the amount of calculation, thus improving model performance. An effective way to solve this problem is feature selection, which has been proven to bean effective way to enhance model performance in many studies (
Chandrashekar and Sahin,
2014
).
In the study of Kou et al. (2021)
, they proposed a two-stage multi- objective feature selection method that optimizes the number of features and model classification performance. In another study, a modified differential evolution (DE) algorithm was used to perform feature selection for cardiovascular disease. The results showed that the accuracy of the proposed hybrid model is 83%, which is higher than that of some other existing models (
Vivekanandan et al., 2017
). Among the feature selection methods, feature importance as a quantitative indicator seems to be a promising one. It is computed for evaluating the extent of features influence on the model performance, thus helping to choose essential ones. In the study of
Verma et al. (2020)
, a feature importance method was utilized to select the 15 most salient features in prediction. However, these feature selection methods are only applicable to a given feature set. This article attempts to create feature sets of different time dimensions according to
the size-varied time windows, and then selects the best feature subset of each feature set. Therefore, the adaptive feature selection schema is needed.
Share with your friends: