Implementing grammar enumerator by SWASTIC-7 · Pull Request #3750 · AFLplusplus/LibAFL

SWASTIC-7 · 2026-03-09T09:48:37Z

Description

Added Enumeration method for gramatron as proposed in paper https://arxiv.org/pdf/2305.00522 based on issue #2309

Future work: Need to implement mutations based on enumeration methods

Checklist

I have run ./scripts/precommit.sh and addressed all comments

SWASTIC-7 · 2026-03-09T09:49:42Z

@addisoncrump have a look at this please

Copilot

Pull request overview

This PR adds deterministic grammar enumeration support for LibAFL’s Gramatron generator, based on the IntegerizedStack approach described in the referenced paper, enabling reproducible generation of the n-th derived input.

Changes:

Exposes a new generators::enumerator module implementing enumerate_automaton.
Adds GramatronGenerator::enumerate_nth to deterministically construct a GramatronInput from an index n.
Adds unit tests for enumerate_automaton using a small sample automaton.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

File	Description
`crates/libafl/src/generators/mod.rs`	Exports the new `enumerator` module.
`crates/libafl/src/generators/gramatron.rs`	Adds `enumerate_nth` API on `GramatronGenerator`.
`crates/libafl/src/generators/enumerator.rs`	Implements integerized-stack enumeration logic and adds tests.

Copilot · 2026-03-09T09:53:24Z

crates/libafl/src/generators/enumerator.rs

+    /// Assumes value codes exactly n integers. Zero afterwards.
+    pub fn split(&mut self, n: usize) -> Vec<u64> {
+        let mut out = Vec::with_capacity(n);
+        for _ in 0..(n - 1) {
+            out.push(self.pop());
+        }
+        out.push(self.value);
+        self.value = 0;
+        out


IntegerizedStack::split will underflow/panic for n == 0 (it does n - 1) and also behaves oddly for n == 0/n == 1. Since this is a public method, please handle these edge cases explicitly (e.g., return an empty vec for n == 0, and for n == 1 just return the current value) instead of relying on n - 1.

Copilot · 2026-03-09T09:53:24Z

crates/libafl/src/generators/enumerator.rs

+/// Rosenberg-Strong pairing decode
+fn decode(z: u64) -> (u64, u64) {
+    let m = (z as f64).sqrt().floor() as u64;
+    let msq = m * m;
+    if z - msq < m {
+        (z - msq, m)
+    } else {
+        (m, msq + 2 * m - z)
+    }
+}


decode uses (z as f64).sqrt() which can produce incorrect results for large u64 values due to f64 precision loss, and it pulls in floating-point math for a no-std crate. Consider using the existing libafl_bolts::math::integer_sqrt (or another integer sqrt) to compute m without floats, and avoid potential underflow in z - msq if m is off by 1.

Copilot · 2026-03-09T09:53:25Z

crates/libafl/src/generators/enumerator.rs

+/// Modular pairing decode
+/// Returns (z mod k, (z - (z mod k)) / k)
+fn mod_decode(z: u64, k: u64) -> (u64, u64) {
+    let a = z % k;
+    let b = (z - a) / k;
+    (b, a)
+}


The doc comment for mod_decode says it returns (z mod k, (z - (z mod k)) / k), but the implementation returns (quotient, remainder) ((b, a)). Please fix the comment (or swap the tuple) so callers don’t misinterpret the return order.

Copilot · 2026-03-09T09:53:25Z

crates/libafl/src/generators/enumerator.rs

+    // if nonterminals then we need to choose one and recurse
+    let mut stack = IntegerizedStack::new(n - num_terminal);
+    let num_nonterminal = nonterminal_indices.len() as u64;
+    let rule_choice = stack.modpop(num_nonterminal) as usize;
+    let trigger_idx = nonterminal_indices[rule_choice];
+    let trigger = &triggers[trigger_idx];
+    let dest = trigger.dest;
+
+    let mut result = vec![Terminal::new(state, trigger_idx, trigger.term.clone())];
+
+    let child_terminals = enumerate_automaton(dest, stack.value, automaton);
+    result.extend(child_terminals);


enumerate_automaton can panic or fail to make progress:

If nonterminal_indices is empty and n >= num_terminal, num_nonterminal == 0 and modpop(0) will panic (mod/div by zero).

If num_nonterminal == 1, modpop(1) leaves stack.value unchanged, so for grammars like “1 terminal + 1 recursive rule” this recurses forever for all n >= num_terminal.
Please handle num_nonterminal == 0 and == 1 explicitly (likely by using a different decoding step that consumes n, per the IntegerizedStack/paper), or change the API to return a Result when the n-th derivation is undefined.

Copilot · 2026-03-09T09:53:25Z

crates/libafl/src/generators/gramatron.rs

+    /// Enumerate the n-th input deterministically using the IntegerizedStack algorithm.
+    /// This produces a unique [`GramatronInput`] for each value of `n`.
+    pub fn enumerate_nth(&self, n: u64) -> GramatronInput {
+        let terminals = crate::generators::enumerator::enumerate_automaton(
+            self.automaton.init_state,
+            n,
+            self.automaton,
+        );
+        GramatronInput::new(terminals)
+    }


enumerate_nth is documented as producing a unique input for each n, but it currently delegates to enumerate_automaton, which can panic or loop indefinitely for some automata / n values (e.g., when a state has no nonterminals, or has exactly one nonterminal). Consider returning Result<GramatronInput, Error> (new API, so non-breaking) and documenting/handling the “no n-th derivation” case explicitly instead of implicitly panicking/hanging.

Implementing grammer enumerator

6d3e794

Copilot AI review requested due to automatic review settings March 9, 2026 09:48

SWASTIC-7 marked this pull request as draft March 9, 2026 09:48

Copilot started reviewing on behalf of SWASTIC-7 March 9, 2026 09:49 View session

Copilot AI reviewed Mar 9, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Implementing grammar enumerator#3750

Implementing grammar enumerator#3750
SWASTIC-7 wants to merge 1 commit intoAFLplusplus:mainfrom
SWASTIC-7:enumerator

SWASTIC-7 commented Mar 9, 2026

Uh oh!

SWASTIC-7 commented Mar 9, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Mar 9, 2026

Uh oh!

Copilot AI Mar 9, 2026

Uh oh!

Copilot AI Mar 9, 2026

Uh oh!

Copilot AI Mar 9, 2026

Uh oh!

Copilot AI Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

SWASTIC-7 commented Mar 9, 2026

Description

Checklist

Uh oh!

SWASTIC-7 commented Mar 9, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 9, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants