AArch64: apply SLOTHY to poly_decompose_32 and poly_decompose_88 #1180
mkannwischer wants to merge 1 commit into
Switch the dev/aarch64_opt Makefile rules from a plain copy to a slothy-cli invocation and check in the resulting schedules. Measured on Neoverse-N1: poly_decompose_88 323→298 cyc (~7.7%) and poly_decompose_32 314→298 cyc (~5.1%). The HOL Light proofs are unchanged apart from regenerated embedded bytecode.

- Resolves #488

Signed-off-by: Matthias J. Kannwischer <matthias@zerorisc.com>
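For readers who want to check the quoted percentages, here is a minimal sketch that recomputes them from the cycle counts in the description. It assumes the percentages are the relative reduction against the old cycle count; the function and variable names are illustrative and do not come from the repository.

```python
# Cycle counts quoted in the PR description (Neoverse-N1): (before, after).
MEASUREMENTS = {
    "poly_decompose_88": (323, 298),
    "poly_decompose_32": (314, 298),
}


def cycle_reduction_percent(before: int, after: int) -> float:
    """Relative reduction in cycles, as a percentage of the old count."""
    return 100.0 * (before - after) / before


for name, (before, after) in MEASUREMENTS.items():
    reduction = cycle_reduction_percent(before, after)
    print(f"{name}: {before} -> {after} cycles ({reduction:.1f}% fewer)")
```

Under that assumption the sketch gives 7.7% and 5.1%, matching the figures above.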
Mac Mini (M1, 2020) benchmarks (opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 46537 cycles | 46504 cycles | 1.00 |
| ML-DSA-44 sign | 131043 cycles | 131079 cycles | 1.00 |
| ML-DSA-44 verify | 47342 cycles | 47310 cycles | 1.00 |
| ML-DSA-65 keypair | 81690 cycles | 81680 cycles | 1.00 |
| ML-DSA-65 sign | 215294 cycles | 215311 cycles | 1.00 |
| ML-DSA-65 verify | 79299 cycles | 79299 cycles | 1.00 |
| ML-DSA-87 keypair | 132403 cycles | 132405 cycles | 1.00 |
| ML-DSA-87 sign | 277435 cycles | 277328 cycles | 1.00 |
| ML-DSA-87 verify | 134053 cycles | 134051 cycles | 1.00 |

This comment was automatically generated by workflow using github-action-benchmark.

Mac Mini (M1, 2020) benchmarks (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 112834 cycles | 112758 cycles | 1.00 |
| ML-DSA-44 sign | 401056 cycles | 400845 cycles | 1.00 |
| ML-DSA-44 verify | 119495 cycles | 119413 cycles | 1.00 |
| ML-DSA-65 keypair | 193001 cycles | 192933 cycles | 1.00 |
| ML-DSA-65 sign | 650124 cycles | 649924 cycles | 1.00 |
| ML-DSA-65 verify | 192920 cycles | 192850 cycles | 1.00 |
| ML-DSA-87 keypair | 318883 cycles | 318753 cycles | 1.00 |
| ML-DSA-87 sign | 828737 cycles | 828716 cycles | 1.00 |
| ML-DSA-87 verify | 326732 cycles | 326677 cycles | 1.00 |

SpacemiT K1 8 (Banana Pi F3) benchmarks (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 759745 cycles | 759567 cycles | 1.00 |
| ML-DSA-44 sign | 3138419 cycles | 3137937 cycles | 1.00 |
| ML-DSA-44 verify | 858706 cycles | 858870 cycles | 1.00 |
| ML-DSA-65 keypair | 1286602 cycles | 1285661 cycles | 1.00 |
| ML-DSA-65 sign | 5077186 cycles | 5075651 cycles | 1.00 |
| ML-DSA-65 verify | 1364475 cycles | 1364449 cycles | 1.00 |
| ML-DSA-87 keypair | 2111705 cycles | 2111749 cycles | 1.00 |
| ML-DSA-87 sign | 6349699 cycles | 6355811 cycles | 1.00 |
| ML-DSA-87 verify | 2227295 cycles | 2228276 cycles | 1.00 |

oqs-bot left a comment

Intel Xeon 4th gen (c7i)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 43385 cycles | 43415 cycles | 1.00 |
| ML-DSA-44 sign | 130751 cycles | 130832 cycles | 1.00 |
| ML-DSA-44 verify | 45118 cycles | 45316 cycles | 1.00 |
| ML-DSA-65 keypair | 75780 cycles | 75430 cycles | 1.00 |
| ML-DSA-65 sign | 215460 cycles | 215011 cycles | 1.00 |
| ML-DSA-65 verify | 74415 cycles | 74350 cycles | 1.00 |
| ML-DSA-87 keypair | 123441 cycles | 123299 cycles | 1.00 |
| ML-DSA-87 sign | 272129 cycles | 271370 cycles | 1.00 |
| ML-DSA-87 verify | 120746 cycles | 120802 cycles | 1.00 |

oqs-bot left a comment

Intel Xeon 4th gen (c7i) (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 91420 cycles | 91498 cycles | 1.00 |
| ML-DSA-44 sign | 351755 cycles | 352373 cycles | 1.00 |
| ML-DSA-44 verify | 99750 cycles | 99791 cycles | 1.00 |
| ML-DSA-65 keypair | 154006 cycles | 153828 cycles | 1.00 |
| ML-DSA-65 sign | 571622 cycles | 571106 cycles | 1.00 |
| ML-DSA-65 verify | 159790 cycles | 159761 cycles | 1.00 |
| ML-DSA-87 keypair | 255069 cycles | 255664 cycles | 1.00 |
| ML-DSA-87 sign | 723883 cycles | 726163 cycles | 1.00 |
| ML-DSA-87 verify | 264098 cycles | 263879 cycles | 1.00 |

oqs-bot left a comment

AMD EPYC 3rd gen (c6a)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 55251 cycles | 55320 cycles | 1.00 |
| ML-DSA-44 sign | 159039 cycles | 159183 cycles | 1.00 |
| ML-DSA-44 verify | 57463 cycles | 57873 cycles | 0.99 |
| ML-DSA-65 keypair | 96210 cycles | 95830 cycles | 1.00 |
| ML-DSA-65 sign | 263757 cycles | 263504 cycles | 1.00 |
| ML-DSA-65 verify | 96156 cycles | 96058 cycles | 1.00 |
| ML-DSA-87 keypair | 154748 cycles | 154555 cycles | 1.00 |
| ML-DSA-87 sign | 322661 cycles | 322765 cycles | 1.00 |
| ML-DSA-87 verify | 150975 cycles | 150846 cycles | 1.00 |

oqs-bot left a comment

AMD EPYC 3rd gen (c6a) (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 133164 cycles | 133213 cycles | 1.00 |
| ML-DSA-44 sign | 518907 cycles | 518708 cycles | 1.00 |
| ML-DSA-44 verify | 146469 cycles | 146419 cycles | 1.00 |
| ML-DSA-65 keypair | 223893 cycles | 225658 cycles | 0.99 |
| ML-DSA-65 sign | 842603 cycles | 848883 cycles | 0.99 |
| ML-DSA-65 verify | 233779 cycles | 236008 cycles | 0.99 |
| ML-DSA-87 keypair | 368482 cycles | 366727 cycles | 1.00 |
| ML-DSA-87 sign | 1066561 cycles | 1060028 cycles | 1.01 |
| ML-DSA-87 verify | 382702 cycles | 380316 cycles | 1.01 |

oqs-bot left a comment

AMD EPYC 4th gen (c7a)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 46302 cycles | 46860 cycles | 0.99 |
| ML-DSA-44 sign | 138499 cycles | 139486 cycles | 0.99 |
| ML-DSA-44 verify | 49383 cycles | 49318 cycles | 1.00 |
| ML-DSA-65 keypair | 82453 cycles | 81920 cycles | 1.01 |
| ML-DSA-65 sign | 226079 cycles | 227729 cycles | 0.99 |
| ML-DSA-65 verify | 82028 cycles | 81877 cycles | 1.00 |
| ML-DSA-87 keypair | 129336 cycles | 131127 cycles | 0.99 |
| ML-DSA-87 sign | 277767 cycles | 281288 cycles | 0.99 |
| ML-DSA-87 verify | 127997 cycles | 129015 cycles | 0.99 |

oqs-bot left a comment

Graviton4

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 67216 cycles | 67251 cycles | 1.00 |
| ML-DSA-44 sign | 198307 cycles | 198318 cycles | 1.00 |
| ML-DSA-44 verify | 70252 cycles | 70240 cycles | 1.00 |
| ML-DSA-65 keypair | 119288 cycles | 119362 cycles | 1.00 |
| ML-DSA-65 sign | 325990 cycles | 325982 cycles | 1.00 |
| ML-DSA-65 verify | 116826 cycles | 116934 cycles | 1.00 |
| ML-DSA-87 keypair | 196527 cycles | 196595 cycles | 1.00 |
| ML-DSA-87 sign | 421291 cycles | 421892 cycles | 1.00 |
| ML-DSA-87 verify | 193331 cycles | 193379 cycles | 1.00 |

oqs-bot left a comment

Graviton2

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 112324 cycles | 112529 cycles | 1.00 |
| ML-DSA-44 sign | 353714 cycles | 354040 cycles | 1.00 |
| ML-DSA-44 verify | 117482 cycles | 117389 cycles | 1.00 |
| ML-DSA-65 keypair | 194624 cycles | 194738 cycles | 1.00 |
| ML-DSA-65 sign | 583294 cycles | 584565 cycles | 1.00 |
| ML-DSA-65 verify | 193425 cycles | 193297 cycles | 1.00 |
| ML-DSA-87 keypair | 320920 cycles | 321000 cycles | 1.00 |
| ML-DSA-87 sign | 747036 cycles | 746870 cycles | 1.00 |
| ML-DSA-87 verify | 318279 cycles | 318737 cycles | 1.00 |

oqs-bot left a comment

AMD EPYC 4th gen (c7a) (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 117989 cycles | 118568 cycles | 1.00 |
| ML-DSA-44 sign | 457446 cycles | 458777 cycles | 1.00 |
| ML-DSA-44 verify | 130118 cycles | 131112 cycles | 0.99 |
| ML-DSA-65 keypair | 203668 cycles | 200686 cycles | 1.01 |
| ML-DSA-65 sign | 747194 cycles | 743736 cycles | 1.00 |
| ML-DSA-65 verify | 212113 cycles | 209264 cycles | 1.01 |
| ML-DSA-87 keypair | 329210 cycles | 330171 cycles | 1.00 |
| ML-DSA-87 sign | 936999 cycles | 935697 cycles | 1.00 |
| ML-DSA-87 verify | 342522 cycles | 343489 cycles | 1.00 |

oqs-bot left a comment

Graviton4 (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 127588 cycles | 127647 cycles | 1.00 |
| ML-DSA-44 sign | 441200 cycles | 441052 cycles | 1.00 |
| ML-DSA-44 verify | 136388 cycles | 136340 cycles | 1.00 |
| ML-DSA-65 keypair | 220545 cycles | 220707 cycles | 1.00 |
| ML-DSA-65 sign | 714429 cycles | 713810 cycles | 1.00 |
| ML-DSA-65 verify | 220997 cycles | 220735 cycles | 1.00 |
| ML-DSA-87 keypair | 364582 cycles | 365112 cycles | 1.00 |
| ML-DSA-87 sign | 915573 cycles | 921310 cycles | 0.99 |
| ML-DSA-87 verify | 370884 cycles | 370798 cycles | 1.00 |

oqs-bot left a comment

Intel Xeon 3rd gen (c6i)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 62013 cycles | 61685 cycles | 1.01 |
| ML-DSA-44 sign | 188813 cycles | 188995 cycles | 1.00 |
| ML-DSA-44 verify | 66307 cycles | 66273 cycles | 1.00 |
| ML-DSA-65 keypair | 112010 cycles | 111473 cycles | 1.00 |
| ML-DSA-65 sign | 316174 cycles | 312617 cycles | 1.01 |
| ML-DSA-65 verify | 110578 cycles | 110558 cycles | 1.00 |
| ML-DSA-87 keypair | 171339 cycles | 171460 cycles | 1.00 |
| ML-DSA-87 sign | 378831 cycles | 377513 cycles | 1.00 |
| ML-DSA-87 verify | 170072 cycles | 169102 cycles | 1.01 |

oqs-bot left a comment

Graviton2 (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 212222 cycles | 211802 cycles | 1.00 |
| ML-DSA-44 sign | 761067 cycles | 759783 cycles | 1.00 |
| ML-DSA-44 verify | 229929 cycles | 229307 cycles | 1.00 |
| ML-DSA-65 keypair | 378728 cycles | 377288 cycles | 1.00 |
| ML-DSA-65 sign | 1248002 cycles | 1247192 cycles | 1.00 |
| ML-DSA-65 verify | 373359 cycles | 371605 cycles | 1.00 |
| ML-DSA-87 keypair | 602631 cycles | 601230 cycles | 1.00 |
| ML-DSA-87 sign | 1585048 cycles | 1584827 cycles | 1.00 |
| ML-DSA-87 verify | 618528 cycles | 616717 cycles | 1.00 |

oqs-bot left a comment

Graviton3

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 71305 cycles | 71561 cycles | 1.00 |
| ML-DSA-44 sign | 209071 cycles | 209019 cycles | 1.00 |
| ML-DSA-44 verify | 74710 cycles | 74745 cycles | 1.00 |
| ML-DSA-65 keypair | 125924 cycles | 125918 cycles | 1.00 |
| ML-DSA-65 sign | 345759 cycles | 345393 cycles | 1.00 |
| ML-DSA-65 verify | 124066 cycles | 124184 cycles | 1.00 |
| ML-DSA-87 keypair | 206974 cycles | 206649 cycles | 1.00 |
| ML-DSA-87 sign | 444175 cycles | 439798 cycles | 1.01 |
| ML-DSA-87 verify | 204042 cycles | 204451 cycles | 1.00 |

oqs-bot left a comment

Intel Xeon 3rd gen (c6i) (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 154456 cycles | 154753 cycles | 1.00 |
| ML-DSA-44 sign | 587935 cycles | 591158 cycles | 0.99 |
| ML-DSA-44 verify | 169516 cycles | 169947 cycles | 1.00 |
| ML-DSA-65 keypair | 262430 cycles | 261726 cycles | 1.00 |
| ML-DSA-65 sign | 964711 cycles | 963022 cycles | 1.00 |
| ML-DSA-65 verify | 272608 cycles | 271733 cycles | 1.00 |
| ML-DSA-87 keypair | 431590 cycles | 431775 cycles | 1.00 |
| ML-DSA-87 sign | 1212135 cycles | 1213443 cycles | 1.00 |
| ML-DSA-87 verify | 447757 cycles | 447885 cycles | 1.00 |

Arm Cortex-A76 (Raspberry Pi 5) benchmarks (opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 112223 cycles | 112128 cycles | 1.00 |
| ML-DSA-44 sign | 353019 cycles | 353788 cycles | 1.00 |
| ML-DSA-44 verify | 117145 cycles | 117189 cycles | 1.00 |
| ML-DSA-65 keypair | 194591 cycles | 194358 cycles | 1.00 |
| ML-DSA-65 sign | 582967 cycles | 583733 cycles | 1.00 |
| ML-DSA-65 verify | 192794 cycles | 193111 cycles | 1.00 |
| ML-DSA-87 keypair | 320571 cycles | 320083 cycles | 1.00 |
| ML-DSA-87 sign | 745952 cycles | 747201 cycles | 1.00 |
| ML-DSA-87 verify | 318435 cycles | 317895 cycles | 1.00 |

oqs-bot left a comment

Graviton3 (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 137918 cycles | 138038 cycles | 1.00 |
| ML-DSA-44 sign | 485921 cycles | 486082 cycles | 1.00 |
| ML-DSA-44 verify | 149024 cycles | 149093 cycles | 1.00 |
| ML-DSA-65 keypair | 241712 cycles | 241929 cycles | 1.00 |
| ML-DSA-65 sign | 791958 cycles | 791534 cycles | 1.00 |
| ML-DSA-65 verify | 242168 cycles | 241280 cycles | 1.00 |
| ML-DSA-87 keypair | 395759 cycles | 396309 cycles | 1.00 |
| ML-DSA-87 sign | 1013511 cycles | 1019346 cycles | 0.99 |
| ML-DSA-87 verify | 403617 cycles | 403741 cycles | 1.00 |

Arm Cortex-A55 (Snapdragon 888) benchmarks (opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 267129 cycles | 266639 cycles | 1.00 |
| ML-DSA-44 sign | 804495 cycles | 808808 cycles | 0.99 |
| ML-DSA-44 verify | 270621 cycles | 270153 cycles | 1.00 |
| ML-DSA-65 keypair | 461167 cycles | 461300 cycles | 1.00 |
| ML-DSA-65 sign | 1310863 cycles | 1323365 cycles | 0.99 |
| ML-DSA-65 verify | 448034 cycles | 447313 cycles | 1.00 |
| ML-DSA-87 keypair | 788871 cycles | 789934 cycles | 1.00 |
| ML-DSA-87 sign | 1782995 cycles | 1827662 cycles | 0.98 |
| ML-DSA-87 verify | 768306 cycles | 769879 cycles | 1.00 |

Arm Cortex-A76 (Raspberry Pi 5) benchmarks (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 211528 cycles | 211527 cycles | 1.00 |
| ML-DSA-44 sign | 758665 cycles | 759858 cycles | 1.00 |
| ML-DSA-44 verify | 228972 cycles | 229351 cycles | 1.00 |
| ML-DSA-65 keypair | 377109 cycles | 378548 cycles | 1.00 |
| ML-DSA-65 sign | 1247274 cycles | 1247648 cycles | 1.00 |
| ML-DSA-65 verify | 371224 cycles | 372375 cycles | 1.00 |
| ML-DSA-87 keypair | 600381 cycles | 601887 cycles | 1.00 |
| ML-DSA-87 sign | 1582615 cycles | 1582472 cycles | 1.00 |
| ML-DSA-87 verify | 616036 cycles | 617767 cycles | 1.00 |

Arm Cortex-A55 (Snapdragon 888) benchmarks (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 463003 cycles | 462947 cycles | 1.00 |
| ML-DSA-44 sign | 2134532 cycles | 2132913 cycles | 1.00 |
| ML-DSA-44 verify | 555002 cycles | 555065 cycles | 1.00 |
| ML-DSA-65 keypair | 782019 cycles | 781178 cycles | 1.00 |
| ML-DSA-65 sign | 3483165 cycles | 3495803 cycles | 1.00 |
| ML-DSA-65 verify | 865196 cycles | 863872 cycles | 1.00 |
| ML-DSA-87 keypair | 1261848 cycles | 1265586 cycles | 1.00 |
| ML-DSA-87 sign | 4308043 cycles | 4308959 cycles | 1.00 |
| ML-DSA-87 verify | 1385965 cycles | 1390453 cycles | 1.00 |

Arm Cortex-A72 (Raspberry Pi 4) benchmarks (opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 233714 cycles | 214312 cycles | 1.09 |
| ML-DSA-44 sign | 634614 cycles | 605496 cycles | 1.05 |
| ML-DSA-44 verify | 228174 cycles | 222932 cycles | 1.02 |
| ML-DSA-65 keypair | 400744 cycles | 390142 cycles | 1.03 |
| ML-DSA-65 sign | 1028807 cycles | 1004679 cycles | 1.02 |
| ML-DSA-65 verify | 375697 cycles | 374059 cycles | 1.00 |
| ML-DSA-87 keypair | 669839 cycles | 652450 cycles | 1.03 |
| ML-DSA-87 sign | 1386322 cycles | 1336385 cycles | 1.04 |
| ML-DSA-87 verify | 650101 cycles | 633167 cycles | 1.03 |

⚠️ Performance Alert ⚠️

Possible performance regression was detected for benchmark 'Arm Cortex-A72 (Raspberry Pi 4) benchmarks (opt)'.
The benchmark result of this commit is worse than the previous benchmark result, exceeding the threshold of 1.03.

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 233714 cycles | 214312 cycles | 1.09 |
| ML-DSA-44 sign | 634614 cycles | 605496 cycles | 1.05 |
| ML-DSA-87 sign | 1386322 cycles | 1336385 cycles | 1.04 |

This comment was automatically generated by workflow using github-action-benchmark.
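The alert lists three rows although the Cortex-A72 (opt) table shows six ratios at or above 1.03. A minimal sketch of the check, assuming the alert compares the unrounded current/previous ratio strictly against the 1.03 threshold (the exact comparison used by github-action-benchmark is not shown on this page, and all names below are illustrative):

```python
ALERT_THRESHOLD = 1.03  # threshold quoted in the alert text

# (benchmark, current cycles, previous cycles) from the Cortex-A72 (opt) table.
RESULTS = [
    ("ML-DSA-44 keypair", 233714, 214312),
    ("ML-DSA-44 sign", 634614, 605496),
    ("ML-DSA-44 verify", 228174, 222932),
    ("ML-DSA-65 keypair", 400744, 390142),
    ("ML-DSA-65 sign", 1028807, 1004679),
    ("ML-DSA-65 verify", 375697, 374059),
    ("ML-DSA-87 keypair", 669839, 652450),
    ("ML-DSA-87 sign", 1386322, 1336385),
    ("ML-DSA-87 verify", 650101, 633167),
]


def regressions(results, threshold=ALERT_THRESHOLD):
    """Return (name, ratio) for rows whose current/previous ratio exceeds the threshold."""
    flagged = []
    for name, current, previous in results:
        ratio = current / previous
        if ratio > threshold:
            flagged.append((name, ratio))
    return flagged


for name, ratio in regressions(RESULTS):
    print(f"{name}: {ratio:.2f}")
```

Under that assumption the sketch flags the same three rows as the alert. The rows displayed as 1.03 (ML-DSA-65 keypair, ML-DSA-87 keypair, ML-DSA-87 verify) have unrounded ratios of about 1.027, which would explain why they are not listed.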
CBMC Results (ML-DSA-65, REDUCE-RAM): Full Results (204 proofs)
CBMC Results (ML-DSA-87, REDUCE-RAM): Full Results (204 proofs)
Arm Cortex-A72 (Raspberry Pi 4) benchmarks (no-opt)

| Benchmark suite | Current: 24a79f0 | Previous: 9c5eceb | Ratio |
|---|---|---|---|
| ML-DSA-44 keypair | 299345 cycles | 312767 cycles | 0.96 |
| ML-DSA-44 sign | 1146844 cycles | 1207539 cycles | 0.95 |
| ML-DSA-44 verify | 340467 cycles | 342752 cycles | 0.99 |
| ML-DSA-65 keypair | 551047 cycles | 567893 cycles | 0.97 |
| ML-DSA-65 sign | 1875176 cycles | 1961247 cycles | 0.96 |
| ML-DSA-65 verify | 537953 cycles | 560470 cycles | 0.96 |
| ML-DSA-87 keypair | 851210 cycles | 849145 cycles | 1.00 |
| ML-DSA-87 sign | 2389528 cycles | 2376735 cycles | 1.01 |
| ML-DSA-87 verify | 881000 cycles | 890655 cycles | 0.99 |

CBMC Results (ML-DSA-44, REDUCE-RAM): Full Results (204 proofs)
CBMC Results (ML-DSA-65): Full Results (204 proofs)
CBMC Results (ML-DSA-87): Full Results (204 proofs)
CBMC Results (ML-DSA-44): Full Results (204 proofs)
poly_decompose #488