Alloy specification of 3SF tuned towards AccountableSafety

This directory contains an Alloy specification of the 3SF protocol, which abstracts away the features that are not important for checking AccountableSafety.

We have written this specification in an attempt to check AccountableSafety for small yet reasonable configurations.

Even though Alloy specifications are not the easiest ones to write, Alloy allows us to precisely control the size of the input parameters and run optimized satisfiability solvers.

Experimental Setup

Since the SAT solver SAT4j shipped with Alloy is not very performant, we run the experiments with an award-winning SAT solver called Kissat by Armin Biere et al. To reproduce the experiments, you have to build Kissat from source by following its installation instructions.

To generate a CNF file, which a SAT solver is able to consume:

1. Open an experiment file such as ffg-exp1 in the Alloy IDE.
2. Check Options/Solver: Output CNF to file/Output CNF to file.
3. Generate the file via Execute/Run noAccountableSafety....
4. Copy the generated file to a convenient location. We accompany every .als file with the generated .cnf file.
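
Before starting a long solver run, it can help to check how large the generated problem is. The following is a minimal sketch, assuming the generated file follows the standard DIMACS CNF layout (optional comment lines starting with `c`, followed by a `p cnf <variables> <clauses>` header). The function name `read_dimacs_header` is hypothetical and not part of this repository.

```python
from typing import Iterable, Tuple


def read_dimacs_header(lines: Iterable[str]) -> Tuple[int, int]:
    """Return (num_variables, num_clauses) from a DIMACS CNF header.

    Assumption: the header has the form 'p cnf <variables> <clauses>'
    and is preceded only by blank lines or 'c' comment lines.
    """
    for line in lines:
        stripped = line.strip()
        # Skip blank lines and comment lines.
        if not stripped or stripped.startswith("c"):
            continue
        parts = stripped.split()
        if len(parts) == 4 and parts[0] == "p" and parts[1] == "cnf":
            return int(parts[2]), int(parts[3])
        # The first non-comment line must be the header.
        raise ValueError(f"expected a 'p cnf' header, got: {stripped!r}")
    raise ValueError("no 'p cnf' header found")
```

For example, `read_dimacs_header(open("ffg-exp1.cnf"))` would report the number of variables and clauses of the first experiment, provided the file is in the assumed layout.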

Finally, run the SAT solver against the file:

$ kissat --unsat ffg-exp1.cnf

The expected output is:

s UNSATISFIABLE

This means that the property noAccountableSafety does not have a model. Hence, the solver found no counterexample to AccountableSafety for inputs of the specified size.
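
When running many experiments, the check of the solver output can be scripted. The following is a minimal sketch, assuming the solver prints a status line of the form `s SATISFIABLE` or `s UNSATISFIABLE`, as in the expected output above. The helper name `solver_verdict` and the wording of the returned strings are hypothetical.

```python
def solver_verdict(output: str) -> str:
    """Map the solver's status line to a verdict about noAccountableSafety.

    Assumption: the status is reported on a line starting with 's '.
    """
    for line in output.splitlines():
        if line.startswith("s "):
            status = line[2:].strip()
            if status == "UNSATISFIABLE":
                # noAccountableSafety has no model within the given bounds.
                return "no counterexample within the bounds"
            if status == "SATISFIABLE":
                # A model exists, i.e. AccountableSafety is violated.
                return "counterexample found"
            return "unknown"
    # No status line at all, e.g. the run was interrupted.
    return "unknown"
```

The text passed to `solver_verdict` could be captured, for instance, with `subprocess.run(["kissat", "--unsat", "ffg-exp1.cnf"], capture_output=True, text=True).stdout`. A run that times out prints no status line and is reported as `unknown`.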

Experimental Results

Similar to our experiments with the direct SMT encoding for CVC5, we conduct experiments for the same kinds of inputs:

Input	#blocks	#checkpoints	#signatures	#ffg_votes	#votes	runtime	memory
ffg-exp1	3	5	4	5	12	4 sec	35 MB
ffg-exp2	4	5	4	5	12	10 sec	40 MB
ffg-exp3	5	5	4	5	12	15 sec	45 MB
ffg-exp4	3	6	4	6	15	57 sec	52 MB
ffg-exp5	4	6	4	6	15	167 sec	55 MB
ffg-exp6	5	6	4	6	15	245 sec	57 MB
ffg-exp7	6	6	4	6	15	360 sec	82 MB
ffg-exp8	5	7	4	6	24	1h 27m	156 MB
ffg-exp9	5	10	4	8	24	>8 days (timeout)	198 MB
ffg-exp9a	5	10	4	8	32	>8 days (timeout)	220 MB

In addition to the above experiments, we ran a few experiments that have inputs comparable in size to those produced by our TLA+ specification:

Input	#blocks	#checkpoints	#signatures	#ffg_votes	#votes	runtime	memory
ffg-exp10	3	15	4	5	12	31 sec	56 MB
ffg-exp11	4	20	4	5	12	152 sec	94 MB
ffg-exp12	5	25	4	5	12	234 sec	117 MB
ffg-exp13	7	15	4	10	40	>16 days (timeout)	300 MB

As we can see, the running times increase dramatically when we increase the maximum number of FFG votes and votes. While we do not have a precise explanation, the intuitive one is as follows. To be justified, a checkpoint needs at least 3 votes (assuming that we have 4 validators). Further, 3 more votes are required to finalize a justified block. Hence, the budget of 40 votes gives us approximately 9 justified checkpoints. Since justified checkpoints may refer to one another (via FFG votes), 9 checkpoints may build longer justification chains.

The experiments ffg-exp9, ffg-exp9a, and ffg-exp13 push the boundary beyond the minimal interesting small instances. In all of them, the SAT solver did not terminate. Interestingly, in the case of ffg-exp9 and ffg-exp9a, Kissat spent multiple days on the remaining 3%.
