A Bug’s Life: From Its Detection through Patching to Verification

Jooyong Yi

LOFT (Lab of Software), UNIST

Contents

  • Part 1: Bug Hunting
  • Part 2: Patch Hunting
  • Part 3: Patch Verification

Part 1: Bug Hunting

White, Grey, and Black-box Fuzzing

Grey-box Fuzzing is Not Enough

Our New Approach

Key Intuition

  • Fuzzing generates numerous inputs.
  • We can infer approximate path conditions from these inputs.
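To illustrate the intuition, observed fuzzing inputs can be treated as labeled samples, and a path condition can be approximated by keeping only the candidate predicates that agree with every sample. The following is a minimal C++ sketch of that idea; the names (`Observation`, `consistentPredicates`) and the predicate templates are mine, not taken from the actual tool:

```cpp
#include <cassert>
#include <functional>
#include <string>
#include <utility>
#include <vector>

// Hypothetical record of one fuzzing input: the tensor ranks it used,
// and whether it actually took the branch of interest.
struct Observation {
    int i_rank;   // input.dim()
    int w_rank;   // weight.dim()
    bool taken;   // did this input pass the check?
};

using Predicate = std::function<bool(const Observation&)>;

// Keep only the candidate predicates consistent with every observation:
// a kept predicate holds on all branch-taking inputs and fails on the rest.
std::vector<std::string> consistentPredicates(
        const std::vector<std::pair<std::string, Predicate>>& candidates,
        const std::vector<Observation>& obs) {
    std::vector<std::string> kept;
    for (const auto& [name, pred] : candidates) {
        bool ok = true;
        for (const auto& o : obs)
            if (pred(o) != o.taken) { ok = false; break; }
        if (ok) kept.push_back(name);
    }
    return kept;
}
```

The more inputs fuzzing generates, the more candidate predicates get filtered out, which is why an approximate PC can be inferred from the fuzzing corpus alone.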

Is an Approximate PC Useful?

void conv2d(Tensor input, Tensor weight, int* padding, int* dilation) {
  TORCH_CHECK(input.dim() == 4);
  TORCH_CHECK(input.dim() == weight.dim());
  bool kh_correct = input.size(2) + 2*padding[0] >= dilation[0] * (weight.size(2) - 1) + 1;
  bool kw_correct = input.size(3) + 2*padding[1] >= dilation[1] * (weight.size(3) - 1) + 1;
  if (kh_correct && kw_correct) {
→   compute_conv2d(input, weight, padding, dilation);
    ...
| Exact PC | Approximate PC |
| --- | --- |
| i_rank == 4 ∧ i_rank == w_rank | i_rank == 4 ∧ w_rank == 4 |
| ∧ i2 + 2*p0 >= d0 * (w2 - 1) + 1 | ∧ True |
| ∧ i3 + 2*p1 >= d1 * (w3 - 1) + 1 | ∧ i3 >= w3 |

Path Exploration Using Approximate PCs

Preview of the Results

(Coverage plots comparing against white-, grey-, and black-box baselines)

How to Infer Approximate PCs?

void conv2d(Tensor input, Tensor weight, int* padding, int* dilation) {
  TORCH_CHECK(input.dim() == 4); // b1
    ...

What If the Inferred PC Is Incorrect?

void conv2d(Tensor input, Tensor weight, int* padding, int* dilation) {
  TORCH_CHECK(input.dim() == 4); // b1
    ...

Counter-Example-Guided Condition Refinement

void conv2d(Tensor input, Tensor weight, int* padding, int* dilation) {
  TORCH_CHECK(input.dim() == 4); // b1
    ...

Counter-Example-Guided Condition Refinement

  • As the exploration proceeds, the path conditions become more precise.
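A counter-example-guided loop of this kind can be sketched as follows: keep the current hypothesis until an executed input contradicts it, then fall back to the next candidate condition that explains all observations so far. This is a CEGIS-style toy in C++, with a hypothetical hypothesis list; it is not the tool's actual synthesis procedure:

```cpp
#include <cstddef>
#include <functional>
#include <tuple>
#include <vector>

// A candidate condition over (i_rank, w_rank), predicting the branch outcome.
using Cond = std::function<bool(int, int)>;

struct Refiner {
    std::vector<Cond> candidates;                    // hypothesis space, coarse to precise
    std::vector<std::tuple<int, int, bool>> obs;     // (i_rank, w_rank, actual outcome)
    std::size_t cur = 0;                             // index of the current hypothesis

    // Record one executed input; skip past hypotheses it refutes.
    void observe(int i, int w, bool actual) {
        obs.emplace_back(i, w, actual);
        while (cur < candidates.size()) {
            bool ok = true;
            for (auto& [oi, ow, oa] : obs)
                if (candidates[cur](oi, ow) != oa) { ok = false; break; }
            if (ok) break;
            ++cur;  // counterexample found: refine to the next hypothesis
        }
    }

    bool predict(int i, int w) const {
        return cur < candidates.size() && candidates[cur](i, w);
    }
};
```

Each counterexample can only move the index forward, which matches the slide's point: as exploration proceeds, the inferred condition becomes monotonically more precise.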

Tensorflow Results

Bug Finding Results

| DL Library | Total | Confirmed | Fixed |
| --- | --- | --- | --- |
| PyTorch | 43 | 41 | 23 |
| TensorFlow | 18 | 18 | 9 |
| Total | 61 | 59 | 32 |

Why DL Libraries?

  • Our approach is general.
  • Input space of DL libraries is manageable.

Part 2: Patch Hunting

Automated Program Repair from Fuzzing Perspective

  • ISSTA 2023
    • YoungJae Kim, Seungheon Han, Askar Yeltayuly Khamit, Jooyong Yi

Fuzzing

process of searching for interesting inputs

  • The location of a bug-revealing input is unknown a priori.
  • As more interesting inputs are found, some of them may reveal a new bug.

Fuzzing

process of following footprints of interesting inputs in pursuit of a bug-revealing input

APR

process of searching for plausible patches

  • The location of a correct patch is unknown a priori.
  • As more plausible patches are found, some of them may be correct.

APR

process of following footprints of plausible patches in pursuit of a correct patch

Standard Patch Scheduling Algorithm

Evaluation Results of Our Approach

Patch Space

Multi-Armed Bandit Problem

  • At each layer, we need to choose one "arm" to pull.
  • A reward is given if an "interesting" patch is found.
    • A patch is considered interesting when the program patched with it passes one of the tests that previously failed.
  • Our goal is to maximize the total reward over time.

Bernoulli Bandit Problem

Thompson Sampling Algorithm

  1. Sampling: for each arm $k$, sample $\theta_k$ from $Beta(\alpha_k, \beta_k)$
  2. Selection: select the arm with the highest sampled $\theta_k$
  3. Update: update $(\alpha_k, \beta_k)$ of the selected arm
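The three steps can be sketched as a generic Bernoulli Thompson sampler in C++. This is my own minimal sketch, not the tool's implementation; a $Beta(a, b)$ draw is obtained from two Gamma draws, since the standard library has no Beta distribution:

```cpp
#include <cstddef>
#include <random>
#include <vector>

// Thompson sampling over Bernoulli arms with Beta(alpha_k, beta_k) posteriors.
struct ThompsonSampler {
    std::vector<double> alpha, beta;
    std::mt19937 rng{12345};

    explicit ThompsonSampler(std::size_t arms)
        : alpha(arms, 1.0), beta(arms, 1.0) {}  // uniform prior Beta(1, 1)

    // Beta(a, b) = X / (X + Y) with X ~ Gamma(a, 1), Y ~ Gamma(b, 1).
    double sampleBeta(double a, double b) {
        std::gamma_distribution<double> ga(a, 1.0), gb(b, 1.0);
        double x = ga(rng), y = gb(rng);
        return x / (x + y);
    }

    // Steps 1-2: sample theta_k for each arm, select the argmax.
    std::size_t select() {
        std::size_t best = 0;
        double bestTheta = -1.0;
        for (std::size_t k = 0; k < alpha.size(); ++k) {
            double t = sampleBeta(alpha[k], beta[k]);
            if (t > bestTheta) { bestTheta = t; best = k; }
        }
        return best;
    }

    // Step 3: reward 1 if the pulled arm yielded an interesting patch, else 0.
    void update(std::size_t k, bool reward) {
        (reward ? alpha : beta)[k] += 1.0;
    }
};
```

Because selection is by sampling rather than by posterior mean, a currently worse-looking arm still gets pulled occasionally, which is the exploration behavior the next slides rely on.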

Updating

Does Our Approach Fix More Bugs Correctly?

  1. Catch plausible patches
  2. Rank them using a patch ranking technique

Results on Recalling Correct Patches

(Charts: recall at Top 1 and Top 5)

Reflection

  • Update the distribution when an interesting patch is found: a black-box approach
  • Can we invent a grey-box approach that performs better than the black-box approach?

Enhancing the Efficiency of Automated Program Repair via Greybox Analysis

  • ASE 2024
    • YoungJae Kim, Yechan Park, Seungheon Han, Jooyong Yi

Two Key Questions

  1. What to observe?
  2. How to guide the search based on the observation?

What to Observe?

  • Critical branch: a branch whose hit count changes before and after an interesting patch is applied
    • Positive critical branch: a critical branch whose hit count increases
    • Negative critical branch: a critical branch whose hit count decreases
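Under this definition, critical branches can be computed by diffing per-branch hit counts collected before and after applying a patch. A minimal C++ sketch (the `classify` helper and branch-id strings are mine, for illustration):

```cpp
#include <map>
#include <string>
#include <vector>

// Per-branch hit counts collected from one test execution.
using HitCounts = std::map<std::string, long>;

struct CriticalBranches {
    std::vector<std::string> positive;  // hit count increased after the patch
    std::vector<std::string> negative;  // hit count decreased after the patch
};

// A branch is critical iff its hit count changes across the patch.
CriticalBranches classify(const HitCounts& before, const HitCounts& after) {
    CriticalBranches cb;
    for (const auto& [branch, n] : after) {
        long old = before.count(branch) ? before.at(branch) : 0;
        if (n > old) cb.positive.push_back(branch);
        else if (n < old) cb.negative.push_back(branch);
    }
    // Branches no longer hit at all are also negative critical branches.
    for (const auto& [branch, n] : before)
        if (!after.count(branch) && n > 0) cb.negative.push_back(branch);
    return cb;
}
```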

How to Guide the Search?

  • We again traverse the patch-space tree using the multi-armed bandit model.
  • We choose an edge that is more likely to lead to a patch candidate that behaves similarly to the interesting patches found earlier during the repair process.

Count-based Similarity of Patch Behavior

  • Suppose that an interesting patch $p$ is found earlier during the repair process.
  • Assume $p$ involves a positive critical branch $b$.
  • A patch $p'$ is considered similar to the interesting patch $p$ if
    • the hit count of $b$ increases after applying $p'$

Our Greybox Guidance Policy

  • We choose an edge that is more likely to lead to a patch candidate that shows count-based similarity to the interesting patches found earlier during the repair process.

Blackbox vs Greybox

Evaluation (Defects4J v1.2; 10 repetitions)

Results on Recalling Correct Patches

(Charts: recall at Top 1 and Top 5)

Part 3: Patch Verification

Cast a Wide Net

APR

process of searching for plausible patches
+
select a correct patch

SymRadar: PoC-Centered Bounded Verification for Vulnerability Repair

  • ICSE 2026
    • Seungheon Han, YoungJae Kim, Yeseung Lee, Jooyong Yi

Automated Vulnerability Repair

Our Approach

  1. Bounded verification via symbolic execution
  2. Function-level verification to avoid the reachability problem
  3. PoC-centered verification

Under-Constrained Symbolic Execution

struct node {
  int data;
  struct node* next;
};

int listSum(struct node* n) {
  int sum = 0;
  while (n) {
    sum += n->data;
    n = n->next;
  }
  return sum;
}
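To make the "bounded" part concrete, here is a toy bounded-verification harness for `listSum` (my sketch, not the paper's machinery): it enumerates every list of length up to a bound, with data drawn from a small domain, and checks the function's result against a reference sum on each of them.

```cpp
#include <vector>

struct node {
    int data;
    struct node* next;
};

int listSum(struct node* n) {
    int sum = 0;
    while (n) { sum += n->data; n = n->next; }
    return sum;
}

// Exhaustively check listSum on all lists of length <= maxLen whose
// data values come from `domain`; returns false on the first mismatch.
bool boundedCheck(int maxLen, const std::vector<int>& domain) {
    for (int len = 0; len <= maxLen; ++len) {
        std::vector<int> idx(len, 0);  // odometer over domain^len
        while (true) {
            std::vector<node> nodes(len);
            int expected = 0;
            for (int i = 0; i < len; ++i) {
                nodes[i].data = domain[idx[i]];
                nodes[i].next = (i + 1 < len) ? &nodes[i + 1] : nullptr;
                expected += nodes[i].data;
            }
            if (listSum(len ? &nodes[0] : nullptr) != expected) return false;
            // advance the odometer; stop when every position has wrapped
            int i = 0;
            for (; i < len; ++i) {
                if (++idx[i] < (int)domain.size()) break;
                idx[i] = 0;
            }
            if (i == len) break;
        }
    }
    return true;
}
```

Symbolic execution plays the same role as this enumeration but covers all integer data values at once; the bound on list length is what makes the verification "bounded" in both cases.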

Under-Constrained Symbolic Execution

Limitation of UC-SE

int listSum(struct node* n) {
  int sum = 0;
  int i = 1; // added
  while (n) {
    sum += n->data;
    n = n->next;
    i *= 2; // added
  }
  g = arr[i]; // added
  return sum;
}

PoC-Centered Bounded Patch Verification

UC-SE vs. SymRadar

Patch Verification

Patch Classification Rubric

Key Requirements for Patch Verification

  1. Detecting as many incorrect plausible patches as possible (high specificity)
  2. Preserving as many correct patches as possible (high recall)
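For reference, the balanced accuracy reported in the tables that follow is, by the standard definition, the arithmetic mean of the two requirements:

$$\text{Balanced Accuracy} = \frac{\text{Recall} + \text{Specificity}}{2}$$

e.g., for SymRadar, $(100\% + 78\%)/2 = 89\%$.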

Evaluation (3,3037 patches generated from CPR)

| Tool | Recall | Specificity | Balanced Accuracy |
| --- | --- | --- | --- |
| SymRadar (Ours) | 100% | 78% | 89% |
| CPR | 96% | 8% | 52% |
| UC-KLEE | 88% | 57% | 73% |
| Spider | 77% | 59% | 67% |
| VulnFix | 62% | 66% | 64% |

Evaluation (90 patches generated from San2Patch)

| Tool | Recall | Specificity | Balanced Accuracy |
| --- | --- | --- | --- |
| SymRadar (Ours) | 100% | 74% | 87% |
| UC-KLEE | 100% | 52% | 76% |
| Spider | 48% | 83% | 65% |
| VulnFix | 63% | 28% | 45% |

![width:800px](./img/debugging-ladybug.jpg)

---

![height:700px](./img/bugs_life_lady_bug.webp)

---

Using black-box fuzzing is like firing a machine gun while blindfolded.

White-box fuzzing is like using a sniper rifle. Each shot is slow, but it hits a new path every time.

Let me first show you a snippet of the results. We applied our light-grey-box fuzzer, named PathFinder, to a well-known deep learning library, PyTorch. These plots show how branch coverage increases over time. Clearly, our approach overwhelmingly outperforms the existing SOTA tools that use various approaches.

Moreover, as the exploration proceeds, the path conditions can become more precise since more data points for synthesis are available.

# Bug Hunting and Patch Hunting

![width:1000px](./img/hunting.png)

---

Our situation can be modeled specifically as a Bernoulli bandit problem. We need to estimate the probability of success of each arm, and each arm can have a different probability of success.

The Bernoulli bandit problem can be solved by the Thompson sampling algorithm, which works in the following three steps. First, for each arm $k$, we sample $\theta_k$ from its distribution. Let's say we are about to choose between method 1 and method 2. Assume that the left arm is associated with this Beta distribution and the right arm with that Beta distribution. It is likely that a higher value is sampled from the right arm, in which case we choose the right arm. However, note that Thompson sampling still allows the left arm to be chosen, with a smaller probability.

- Distribution of $\theta_k$: Beta distribution $(\alpha_k, \beta_k)$

| $Beta(\alpha=2,\beta=2)$ | $Beta(\alpha=3,\beta=2)$ | $Beta(\alpha=5,\beta=2)$ |
| --- | --- | --- |
| ![width:290px](./img/beta-2-2.png) | ![width:290px](./img/beta-3-2.png) | ![width:290px](./img/beta-5-2.png) |

What if we find an interesting patch? Then, we update the distributions of the corresponding edges. For example, if this was the distribution of this edge before the update, the one on the right-hand side shows the distribution after the update. Notice that the distribution after the update is more left-skewed, indicating that selecting this edge looks more promising than before.

Then the natural question that arises is: Can we invent a grey-box approach that performs better than the black-box approach?

---

# Blackbox Guidance Policy

- While traversing the patch-space tree, this policy gives higher priority to edges that are more likely to lead to the discovery of interesting patches.
- Note that the only runtime information used in this policy is whether a test passes or fails after applying a patch.

Each edge is associated with critical branches. These critical branches are obtained after executing interesting patches observed in the corresponding subtree. Unlike in the black-box approach, we assign a beta distribution to each critical branch.

# PoC-Centered Bounded Patch Verification

1. Concrete Snapshot Extraction
2. Abstract Snapshot Construction
3. Patch Verification

---

# Abstract Snapshot Construction

![width:1000px](./img/abstraction-example.png)

---