CV


Hamid Saadatfar

Hamid Saadatfar

Associate Professor

Faculty: Electrical and Computer Engineering

Department: Computer

Degree: Ph.D

CV
Hamid Saadatfar

Associate Professor Hamid Saadatfar

Faculty: Electrical and Computer Engineering - Department: Computer Degree: Ph.D |

Dr. Hamid Saadatfar is currently an assistant professor of Computer Engineering Department at University of Birjand. He has received his B.Sc., M.Sc., and Ph.D. degrees from Ferdowsi university of Mashhad in 2007, 2009 and 2014, respectively. His research interests include:

  • Parallel and Distributed Processing (Cluster, Grid and Cloud Computing),
  • Data Mining and Machine Learning,
  • Big Data Analysis (Data Mining Methods for Big Data)
  • and Power-aware Computing.

نمایش بیشتر

Summarization Algorithm for Data Stream to Speed up Outlier Data Detection

AuthorsHamid Saadatfar
JournalJournal Of Computing And Security
Page number35-46
Serial number10
Volume number1
Paper TypeFull Paper
Published At2023
Journal TypeElectronic
Journal CountryIran, Islamic Republic Of
Journal Indexisc

Abstract

Outlier detection in data streams is an essential issue in data processing. Today, due to the massive growth of streaming data generated by the spread of the Internet of Things, outlier detection has become a significant challenge. Much progress has been made in outlier detection based on local outlier detection algorithms, such as density-based local outlier factor algorithms, suitable for static data. The incremental version of these algorithms is used to detect the local outliers in streaming data. However, outlier detection in streaming data faces the challenges of limited memory capacity, high execution time, inaccessibility of all data at one time, and changes in data distribution (increasing and decreasing input rates, uncertainty, etc.). In this paper, we propose a density-based summarization algorithm, which summarizes data, every time the buffer is filled. The proposed algorithm maintains the desired shape of the clusters, with a low computational cost. To this end, larger clusters are selected and the data of their dense areas are reduced so that the shape of the old clusters is not lost. The proposed summarization algorithm reduces execution time and increases precision, recall, and F1 score compared with the evaluated algorithms.

Paper URL