Effective Data Analysis Framework for Financial Variable Selection and Missing Data Discovery
atmire.migration.oldid | 5505 | |
dc.contributor.advisor | Alhajj, Reda | |
dc.contributor.author | Aghakhani, Sara | |
dc.contributor.committeemember | Rokne, Jon | |
dc.contributor.committeemember | Chang, Philip | |
dc.contributor.committeemember | Khoshgoftaar, Taghi | |
dc.contributor.committeemember | Moshirpour, Mohammad | |
dc.date.accessioned | 2017-05-01T17:47:58Z | |
dc.date.available | 2017-05-01T17:47:58Z | |
dc.date.issued | 2017 | |
dc.date.submitted | 2017 | en |
dc.description.abstract | Quantitative evaluation of financial variables plays a foundational role in financial price modeling, economic prediction, risk evaluation, portfolio management, etc. However, the problem suffers from high dimensionality. Thus, financial variables should be selected in a way to reduce the dimensionality of the financial model and make the model more efficient. In addition, it is quite common for financial datasets to contain missing data due to a variety of limitations. Consequently, in practical situations, it is difficult to choose the best subset of financial variables due to the existence of missing values. The two problems are interrelated. Therefore, the central idea in this research is to develop and examine new techniques for financial variable selection based on estimating the missing values, while accounting for all the longitudinal and latitudinal information. This research proposes a novel methodology to minimize the problem associated with missing data and find the best subset of financial variables that could be used for effective analysis. There are two major steps; the first step concentrates on estimating missing data using Bayesian updating and Kriging algorithms. The second step is to find the best subset of financial variables. In this step a novel feature subset selection is proposed (LmRMR) which ranks the financial variables and the best subset of variables is chosen by employing statistical techniques through Super Secondary Target Correlation (SSTC) measurement. Some tests have been done to demonstrate the applicability and effectiveness of the ideas presented in this research. In particular, the potential application of the proposed methods in stock market trading model and stock price forecasting are studied. The experimental studies are conducted on Dow Jones Industrial Average financial variables. | en_US |
dc.identifier.citation | Aghakhani, S. (2017). Effective Data Analysis Framework for Financial Variable Selection and Missing Data Discovery (Doctoral thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca. doi:10.11575/PRISM/25787 | en_US |
dc.identifier.doi | http://dx.doi.org/10.11575/PRISM/25787 | |
dc.identifier.uri | http://hdl.handle.net/11023/3790 | |
dc.language.iso | eng | |
dc.publisher.faculty | Graduate Studies | |
dc.publisher.institution | University of Calgary | en |
dc.publisher.place | Calgary | en |
dc.rights | University of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission. | |
dc.subject | Economics--Finance | |
dc.subject | Computer Science | |
dc.subject.other | Missing Data Analysis | |
dc.subject.other | Financial Variable Selection | |
dc.subject.other | Bayesian Updating | |
dc.subject.other | Super Secondary Data | |
dc.subject.other | Kriging | |
dc.subject.other | Stock Price Prediction | |
dc.subject.other | APT | |
dc.subject.other | Stock Market Trading Model | |
dc.subject.other | mRMR | |
dc.subject.other | Likelihood | |
dc.subject.other | Dow Jones Industrial Average | |
dc.subject.other | FIM | |
dc.subject.other | NMAR | |
dc.subject.other | Likelihood Minimum Redundancy Maximum Relevance | |
dc.subject.other | LmRMR | |
dc.subject.other | Finincial Factors | |
dc.title | Effective Data Analysis Framework for Financial Variable Selection and Missing Data Discovery | |
dc.type | doctoral thesis | |
thesis.degree.discipline | Computer Science | |
thesis.degree.grantor | University of Calgary | |
thesis.degree.name | Doctor of Philosophy (PhD) | |
ucalgary.item.requestcopy | true |