Releases · MyrtleSoftware/vollo-sdk · GitHub

09 Mar 09:34

acairncross

Vollo SDK 26.2.0 Latest


Support for FP8 (E4M3) weights on Versal devices using vollo_torch.Fp8Weights context manager
Support for torch.exp, torch.exp2 at FP32 precision using vollo_torch.Fp32Activations context manager
Support for matmul operations where both inputs are dynamic (non-constant) tensors, when using allow_dynamic_weights
Optimize accumulations on Versal devices, improving performance of layers such as LayerNorm and RMSNorm, and of Linear layers with few output features
Add support for multiple state tensors in vollo_torch.nn.Scan
Add allow_unserializable flag to vollo_compiler.NNIR.to_program for testing programs which can't be serialized
Fix multi-model programs that use dynamic weights
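
For readers unfamiliar with the FP8 (E4M3) weight format mentioned above, here is a minimal sketch of how one E4M3 byte maps to a real number. It assumes the common OCP "E4M3FN" layout (1 sign bit, 4 exponent bits with bias 7, 3 mantissa bits, no infinities, a single NaN pattern per sign); the release notes do not state which variant the SDK uses. The helper names `decode_e4m3` and `quantize_e4m3` are hypothetical and are not part of the Vollo SDK.

```python
def decode_e4m3(byte: int) -> float:
    """Decode one FP8 E4M3 byte to a Python float (OCP E4M3FN layout assumed)."""
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a value in 0..255")
    sign = -1.0 if byte & 0x80 else 1.0
    exponent = (byte >> 3) & 0xF  # 4 exponent bits, bias 7
    mantissa = byte & 0x7         # 3 mantissa bits
    if exponent == 0xF and mantissa == 0x7:
        return float("nan")       # only NaN encoding; this layout has no infinities
    if exponent == 0:
        # Subnormal: no implicit leading 1, fixed exponent of 1 - bias = -6
        return sign * (mantissa / 8.0) * 2.0 ** -6
    return sign * (1.0 + mantissa / 8.0) * 2.0 ** (exponent - 7)


def quantize_e4m3(x: float) -> float:
    """Return the finite E4M3 value closest to x (brute force over all 256 codes).

    Ties are resolved by code order, not by round-to-nearest-even.
    """
    finite = [v for v in map(decode_e4m3, range(256)) if v == v]  # drop NaNs
    return min(finite, key=lambda v: abs(v - x))


if __name__ == "__main__":
    print(decode_e4m3(0x7E))   # largest finite magnitude in this layout
    print(quantize_e4m3(0.3))  # nearest representable value to 0.3
```

With only three mantissa bits, each power-of-two interval holds eight representable values, which is why a weight such as 0.3 lands on a visibly different neighbour after quantization.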


06 Feb 15:50

alex-pay

Vollo SDK 26.1.2

Optimize handling of biases in Linear layers when using allow_dynamic_weights
Speed up model compilation
Add random_seeds argument to vollo_compiler.NNIR.to_program


05 Feb 15:02

acairncross

Vollo SDK 26.1.1

Fix V80LL initialization bitstream so that the V80LL memory can be flashed over JTAG
Optimize handling of biases in Linear layers when using allow_dynamic_weights
Add support for multiple outputs to vollo_torch.nn.Scan
Speed up loading .vollo programs


30 Jan 12:44

acairncross

Vollo SDK 26.1.0

Fix DMA bug introduced on V80 in Vollo SDK 26.0.0
Add Alveo V80LL bitstream and vollo_compiler.Config.v80ll_c6b32 hardware config
Add support for Linear layers where the contracted dimension is not the data dimension via the allow_dynamic_weights flag for vollo_compiler.NNIR.to_program
Add support for multiple inputs to vollo_torch.nn.Scan
Add support for indexing with negative indices in: torch.stack, torch.sum, torch.permute, torch.squeeze, torch.unsqueeze
Add support for torch.nn.functional.linear
Add optional bias argument to vollo_torch.nn.PaddedConv1d
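
As background for the negative-index item above, the sketch below shows the usual PyTorch convention for resolving a negative dimension argument: a value `dim < 0` refers to `dim + rank`. It is written in plain Python so it runs without PyTorch; `normalize_dim` and `normalize_dims` are hypothetical helper names for illustration and do not describe how the Vollo compiler implements this.

```python
from typing import Sequence, Tuple


def normalize_dim(dim: int, rank: int) -> int:
    """Map a possibly negative dimension index onto the range 0..rank-1.

    For torch.sum, torch.squeeze and torch.permute, `rank` is the rank of
    the input tensor. For torch.stack and torch.unsqueeze the result has one
    more dimension than the input, so `rank` is the input rank plus one.
    """
    if not -rank <= dim < rank:
        raise IndexError(f"dim {dim} out of range for rank {rank}")
    return dim + rank if dim < 0 else dim


def normalize_dims(dims: Sequence[int], rank: int) -> Tuple[int, ...]:
    """Normalize every entry of a permutation-style dimension list."""
    return tuple(normalize_dim(d, rank) for d in dims)


if __name__ == "__main__":
    # Summing over the last axis of a rank-3 tensor
    print(normalize_dim(-1, 3))
    # Unsqueezing at the end of a rank-3 tensor (result rank is 4)
    print(normalize_dim(-1, 3 + 1))
    # A permutation written with a negative index
    print(normalize_dims((-1, 0, 1), 3))
```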


28 Jan 11:25

alex-pay

Vollo SDK 26.0.2

Update example/partial_update.c to allow multiple inputs and mixed precision
Fix bug in FP32/multi-input partial updates
Speed up model compilation


22 Jan 14:18

acairncross

Vollo SDK 26.0.1

Make vollo-tool license use the system's CA certificates
Fix bug in FP32 partial updates


20 Jan 09:06

acairncross

Vollo SDK 26.0.0

V80 DMA optimizations
Support for a subset of operations at FP32
- vollo-torch
  - Add vollo_torch.Fp32Activations context manager
  - Add inputs_precisions and output_precisions arguments to vollo_torch.fx.nnir.to_nnir
- vollo-compiler
  - Add model_input_number_format and model_output_number_format methods to vollo_compiler.Program
  - Add vollo_compiler.NumberFormat enum
- vollo-rt C/C++ API
  - Add vollo_rt_add_job, vollo_rt_add_job_partial_update, vollo_rt_model_input_format, vollo_rt_model_output_format, vollo_rt_get_raw_buffer_bytes functions and number_format enum
- vollo-rt Python bindings
  - Add add_job, add_job_f32, model_output_format methods to vollo_rt.VolloRTContext
Memory usage and compilation time improvements in the compiler
Add quick_compile flag to vollo_compiler.NNIR.to_program for faster compilation
Add max_sparse_entries option to vollo_compiler.NNIR.to_program to configure the number of nonzero entries allowed in weights for non-standard memory format MatMuls
Add token-info subcommand to vollo-tool license to show information about a purchase token
Add info message to vollo-tool license redeem-device when the device has already been redeemed on a token that has expired or is about to expire


17 Nov 15:20

acairncross

Vollo SDK 25.1.2

Reduce memory usage of the compiler during compilation of large models
Improve ami-tool's detection of bitstream UUIDs
Improve AMI driver's compatibility with Linux kernel versions


10 Nov 17:35

acairncross

Vollo SDK 25.1.1

Fix bug where vollo-tool fpga-config did not enumerate the V80 management physical function
Fix bug in load-kernel-driver.sh where the Vollo driver was loaded for the V80 management physical function instead of the AMI driver


03 Oct 15:53

basile-henry

Vollo SDK 25.1.0

Early access support for Mamba models
Add support for SiLU
Add support for Softplus
Add support for Exp, Exp2
Add support for Sigmoid
Add support for Softmax
Speed up model compilation
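
For reference, the activation functions listed above have the following standard mathematical definitions. This is a plain-Python sketch of those definitions (using numerically stable forms), intended only to pin down what each name computes; it says nothing about how the Vollo compiler or hardware evaluates them, and the function names are chosen here for illustration.

```python
import math
from typing import List, Sequence


def sigmoid(x: float) -> float:
    """1 / (1 + exp(-x)), branching on the sign of x to avoid overflow."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def silu(x: float) -> float:
    """SiLU (also called swish): x * sigmoid(x)."""
    return x * sigmoid(x)


def softplus(x: float) -> float:
    """log(1 + exp(x)), rewritten as max(x, 0) + log1p(exp(-|x|)) for stability."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def exp2(x: float) -> float:
    """Base-2 exponential, 2 ** x."""
    return 2.0 ** x


def softmax(xs: Sequence[float]) -> List[float]:
    """exp(x_i) / sum_j exp(x_j), shifted by max(xs) so the exponentials cannot overflow."""
    m = max(xs)
    exps = [math.exp(v - m) for v in xs]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    for name, fn in [("sigmoid", sigmoid), ("silu", silu), ("softplus", softplus), ("exp2", exp2)]:
        print(name, fn(1.0))
    print("softmax", softmax([1.0, 2.0, 3.0]))
```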
