Skip to main navigation Skip to search Skip to main content

Semantic-Integrated Online Audit Log Reduction for Efficient Forensic Analysis

  • Wenhao Liao
  • , Jia Sun
  • , Haiyan Wang
  • , Zhaoquan Gu*
  • , Jianye Yang
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Audit logs are crucial for revealing and tracking sophisticated cyber threats due to their abundant system-level information. However, the immense scale of logs burdens storage resources and limits their lifecycle to days, which is insufficient for tracking multi-step attacks over months or years. Although log reduction techniques that cater to cold storage can mitigate this issue, many of these are restricted to offline batch processing in data centers. This incurs significant storage and transmission costs at endpoints. Moreover, many log reduction techniques fail to yield a suitable pattern for forensic analysis, which aims to identify signs of malicious activities by scrutinizing past events. In this paper, we present Sopr, an online audit log reduction technique designed to preserve traceability. Sopr enables real-time execution of the entire process, allowing reduction to be performed on raw log data streams. Specifically, our approach can effectively reduce events that lack causal dependence and involve repeated dependency relationships. To achieve this objective, we design a dual-cache architecture that simultaneously models semantically similar files and utilizes a versioned graph to preserve causality between log events. The synergy of these two components enhances the effectiveness of Sopr in log reduction. Our experiments on the DARPA TC datasets show that Sopr can achieve comparable event reduction factor in an online fashion to state-of-the-art offline approaches. Moreover, the runtime overhead and forensic analysis validity meet the deployment requirements for real-world environments.

Original languageEnglish
Title of host publicationAdvanced Data Mining and Applications - 20th International Conference, ADMA 2024, Proceedings
EditorsQuan Z. Sheng, Xuyun Zhang, Jia Wu, Congbo Ma, Gill Dobbie, Jing Jiang, Wei Emma Zhang, Yannis Manolopoulos, Wathiq Mansoor
PublisherSpringer Science and Business Media Deutschland GmbH
Pages318-333
Number of pages16
ISBN (Print)9789819608492
DOIs
StatePublished - 2025
Externally publishedYes
Event20th International Conference on Advanced Data Mining Applications, ADMA 2024 - Sydney, Australia
Duration: 3 Dec 20245 Dec 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15392 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference20th International Conference on Advanced Data Mining Applications, ADMA 2024
Country/TerritoryAustralia
CitySydney
Period3/12/245/12/24

Keywords

  • Forensic Analysis
  • Log Reduction
  • Provenance Graph

Fingerprint

Dive into the research topics of 'Semantic-Integrated Online Audit Log Reduction for Efficient Forensic Analysis'. Together they form a unique fingerprint.

Cite this