Intrusion Detection System using Apache Spark on Hadoop Big Data Environment

Faculty Mentor

Dr. Hayden Wimmer

Location

Poster 228

Session Format

Poster Presentation

Academic Unit

Department of Information Technology

Keywords

Allen E. Paulson College of Engineering and Computing Student Research Symposium, Intrusion Detection System, IDS, Convolution Neural Network, CNN, Long-Short Term Memory, LSTM, Synthetic Minority Oversampling Technique, SMOTE, Resilient Data Distributed Datasets, RDD

Creative Commons License

Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.

Presentation Type and Release Option

Presentation (File Not Available for Download)

Start Date

2022 12:00 AM

January 2022

This document is currently not available here.

Share

COinS
 
Jan 1st, 12:00 AM

Intrusion Detection System using Apache Spark on Hadoop Big Data Environment

Poster 228

Intrusion Detection System (IDS) is a system that uses intelligent techniques to analyze data quickly and capture the attacks at an early stage. Based on detection methods IDS can be Signature-based, this is an IDS that detects known attacks ,the major limitation is that this detection system cannot detect unknown attacks or Anomaly-based that detects attacks based on any abnormality and is able to detect unknown attacks hence overcome the drawbacks of signature-based methods. Spark is distributed processing system used for big data that utilizes in-memory caching and optimized query execution for fast queries against data of any size. Spark was designed to be fast for iterative algorithms, support for in-memory storage and efficient fault recovery. Hadoop is a big data environment, It has a distributed file system HDFS used to store data of various formats across a cluster . The purpose of the study is to build an Anomaly-based IDS using Apache Spark on Hadoop, the framework will overcome the challenges posed by traditional techniques of computational power and storage.