AIFORE: Smart Fuzzing Based on Automatic Input Format Reverse Engineering

Shi, Ji; Wang, Zhun; Feng, Zhiyao; Lan, Yang; Qin, Shisong; You, Wei; Zou, Wei; Payer, Mathias; Zhang, Chao; USENIX Association

Shi, Ji; Wang, Zhun; Feng, Zhiyao; Lan, Yang; Qin, Shisong; You, Wei; Zou, Wei; Payer, Mathias; Zhang, Chao; USENIX Association

2023

Formats

Format
BibTeX
MARCXML
TextMARC
MARC
DataCite
DublinCore
EndNote
NLM
RefWorks
RIS

Abstract

Knowledge of a program's input format is essential for effective input generation in fuzzing. Automated input format reverse engineering represents an attractive but challenging approach to learning the format. In this paper, we address several challenges of automated input format reverse engineering, and present a smart fuzzing solution AIFORE which makes full use of the reversed format and benefits from it. The structures and semantics of input fields are determined by the basic blocks (BBs) that process them rather than the input specification. Therefore, we first utilize byte-level taint analysis to recognize the input bytes processed by each BB, then identify indivisible input fields that are always processed together with a minimum cluster algorithm, and learn their types with a neural network model that characterizes the behavior of BBs. Lastly, we design a new power scheduling algorithm based on the inferred format knowledge to guide smart fuzzing. We implement a prototype of AIFORE and evaluate both the accuracy of format inference and the performance of fuzzing against state-of-the-art (SOTA) format reversing solutions and fuzzers. AIFORE significantly outperforms SOTA baselines on the accuracy of field boundary and type recognition. With AIFORE, we uncovered 20 bugs in 15 programs that were missed by other fuzzers.

Details

Title AIFORE: Smart Fuzzing Based on Automatic Input Format Reverse Engineering

Author(s) Shi, Ji ; Wang, Zhun ; Feng, Zhiyao ; Lan, Yang ; Qin, Shisong ; You, Wei ; Zou, Wei ; Payer, Mathias ; Zhang, Chao ; USENIX Association

Published in Proceedings Of The 32Nd Usenix Security Symposium

Pages 4967-4984

Conference 32nd USENIX Security Symposium, AUG 09-11, 2023, Anaheim, CA

Date 2023-01-01

Publisher Usenix Assoc, Berkeley

ISBN 978-1-939133-37-3

Other identifier(s) View record in Web of Science

Laboratories HEXHIVE

Record Appears in Scientific production and competences > I&C - School of Computer and Communication Sciences > IINFCOM > HEXHIVE - HexHive
Peer-reviewed publications
Conference Papers
Work produced at EPFL
Published

Grant National Key Research and Development Program of China: 2021YFB2701000
National Natural Science Foundation of China: 61972224
Beijing National Research Center for Information Science and Technology (BNRist): BNR2022RC01006

Record creation date 2024-02-20