An Approach to Server Log Analysis for Abnormal Behaviour Detection

dc.contributor.advisorFar, Behrouz
dc.contributor.authorSuman, Reeta
dc.contributor.committeememberFar, Behrouz
dc.contributor.committeememberMohammed, Emad Amin
dc.contributor.committeememberMahmood, Moussavi
dc.date2021-06
dc.date.accessioned2021-03-23T22:31:29Z
dc.date.available2021-03-23T22:31:29Z
dc.date.issued2021-03-19
dc.description.abstractAs the server logs increase in size, it becomes difficult for human experts to manually examine error log messages, analyze the anomalies, and because of the high volume of log data. If the error message is rare or of low frequency, the system does not categorize it as important and get ignored that may leads to fatal errors. Server log analytics has proven to be optimum for active strategies and excellent performances of the system like the preventive maintenance or complete shut-down. Improvements in analytical strategies are necessary for data analysts in handling the large system. For this analytical process to yield good results, the input data need to be of good quality; therefore, research focuses on cleaning and pre-processing techniques. This research proposes the consecutive logical steps to enhance the analysis of log messages. First, we purpose extracting sequences and patterns from the logs by optimizing window sizes without losing valuable information and combining them with forecasting techniques for predictive analytics. Second, we improve topic modelling for low frequency messages through text analysis and language modelling. The resulting proof of concept is not just visualizing the log data; instead, it provides insight into the logs through topics from the error messages. The experiments illustrate the effectiveness of the proposed steps and the approach for error log analysis.en_US
dc.identifier.citationSuman, R. (2021). An Approach to Server Log Analysis for Abnormal Behaviour Detection (Master's thesis, University of Calgary, Calgary, Canada). Retrieved from https://prism.ucalgary.ca.en_US
dc.identifier.doihttp://dx.doi.org/10.11575/PRISM/38686
dc.identifier.urihttp://hdl.handle.net/1880/113169
dc.language.isoengen_US
dc.publisher.facultySchulich School of Engineeringen_US
dc.publisher.institutionUniversity of Calgaryen
dc.rightsUniversity of Calgary graduate students retain copyright ownership and moral rights for their thesis. You may use this material in any way that is permitted by the Copyright Act or through licensing that has been assigned to the document. For uses that are not allowable under copyright legislation or licensing, you are required to seek permission.en_US
dc.subjectLDA - Latent Dirichlet Allocationen_US
dc.subjectLSTM - Long Short-Term Memoryen_US
dc.subjectRMSE - Root Mean Square Erroren_US
dc.subjectARIMA - Auto-Regressive Integrated Moving Averageen_US
dc.subjectEAI - Enterprise Application Integrationen_US
dc.subjectPCA - Principal Component Analysisen_US
dc.subjectEMS - Enterprise Management Systemen_US
dc.subjectTF-IDF - Term frequency-inverse document frequencyen_US
dc.subject.classificationEngineeringen_US
dc.titleAn Approach to Server Log Analysis for Abnormal Behaviour Detectionen_US
dc.typemaster thesisen_US
thesis.degree.disciplineEngineering – Electrical & Computeren_US
thesis.degree.grantorUniversity of Calgaryen_US
thesis.degree.nameMaster of Science (MSc)en_US
ucalgary.item.requestcopytrueen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
ucalgary_2021_suman_reeta.pdf
Size:
7.58 MB
Format:
Adobe Portable Document Format
Description:
Master's Thesis
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.62 KB
Format:
Item-specific license agreed upon to submission
Description: