Skip to content

Releases: MyrtleSoftware/vollo-sdk

Vollo SDK 26.2.0

09 Mar 09:34
4320994

Choose a tag to compare

  • Support for FP8 (E4M3) weights on Versal devices using vollo_torch.Fp8Weights context manager
  • Support for torch.exp, torch.exp2 at FP32 precision using vollo_torch.Fp32Activations context manager
  • Support for matmul operations where both inputs are dynamic (non-constant) tensors, when using allow_dynamic_weights
  • Optimize accumulations on Versal devices, improving performance of layers such as LayerNorm and RMSNorm, and Linear layers with small output features
  • Add support for multiple state tensors in vollo_torch.nn.Scan
  • Add allow_unserializable flag to vollo_compiler.NNIR.to_program for testing programs which can't be serialized
  • Fix multi-model programs that use dynamic weights

Vollo SDK 26.1.2

06 Feb 15:50
124f527

Choose a tag to compare

  • Optimize handling of biases in Linear layers when using allow_dynamic_weights
  • Speed up model compilation
  • Add random_seeds argument to vollo_compiler.NNIR.to_program

Vollo SDK 26.1.1

05 Feb 15:02
7c5a606

Choose a tag to compare

  • Fix V80LL initialization bitstream so that the V80LL memory can be flashed over JTAG
  • Optimize handling of biases in Linear layers when using allow_dynamic_weights
  • Add support for multiple outputs to vollo_torch.nn.Scan
  • Speed up loading .vollo programs

Vollo SDK 26.1.0

30 Jan 12:44
8a8f971

Choose a tag to compare

  • Fix DMA bug introduced on V80 in Vollo SDK 26.0.0
  • Add Alveo V80LL bitstream and vollo_compiler.Config.v80ll_c6b32 hardware config
  • Add support for Linear layers where the contracted dimension is not the data dimension via the allow_dynamic_weights flag for vollo_compiler.NNIR.to_program
  • Add support for multiple inputs to vollo_torch.nn.Scan
  • Add support for indexing with negative indices in: torch.stack, torch.sum, torch.permute, torch.squeeze, torch.unsqueeze
  • Add support for torch.nn.functional.linear
  • Add optional bias argument to vollo_torch.nn.PaddedConv1d

Vollo SDK 26.0.2

28 Jan 11:25
8a8f971

Choose a tag to compare

  • Update example/partial_update.c to allow multiple inputs and mixed precision
  • Fix bug in FP32/multi-input partial updates
  • Speed up model compilation

Vollo SDK 26.0.1

22 Jan 14:18
400c4eb

Choose a tag to compare

  • Make vollo-tool license use the system's CA certificates
  • Fix bug in FP32 partial updates

Vollo SDK 26.0.0

20 Jan 09:06
8bc8038

Choose a tag to compare

  • V80 DMA optimizations
  • Support for a subset of operations at FP32
    • vollo-torch
      • Add vollo_torch.Fp32Activations context manager
      • Add inputs_precisions and output_precisions arguments to vollo_torch.fx.nnir.to_nnir
    • vollo-compiler:
      • Add model_input_number_format and model_output_number_format methods to vollo_compiler.Program
      • Add vollo_compiler.NumberFormat enum
    • vollo-rt C/C++ API
      • Add vollo_rt_add_job, vollo_rt_add_job_partial_update, vollo_rt_model_input_format, vollo_rt_model_output_format, vollo_rt_get_raw_buffer_bytes functions and number_format enum
    • vollo-rt Python bindings
      • Add add_job, add_job_f32, model_output_format methods to vollo_rt.VolloRTContext
  • Memory usage and compilation time improvements in the compiler
  • Add quick_compile flag to vollo_compiler.NNIR.to_program for faster compilation
  • Add max_sparse_entries option to vollo_compiler.NNIR.to_program to configure the number of nonzero entries allowed in weights for non-standard memory format MatMuls
  • Add token-info subcommand to vollo-tool license to show information about a purchase token
  • Add info message to vollo-tool license redeem-device if the device being redeemed for has been redeemed on an expired or nearly expiring token

Vollo SDK 25.1.2

17 Nov 15:20
cdd807a

Choose a tag to compare

  • Reduced memory usage of the compiler during compilation of large models
  • Improve ami-tool's detection of bitstream UUIDs
  • Improve AMI driver's compatibility with Linux kernel versions

Vollo SDK 25.1.1

10 Nov 17:35
d421814

Choose a tag to compare

  • Fix bug in vollo-tool where vollo-tool fpga-config did not enumerate the V80 management physical function
  • Fix bug in load-kernel-driver.sh where the Vollo driver was loaded for the V80 management physical function instead of the AMI driver

Vollo SDK 25.1.0

03 Oct 15:53
035ef63

Choose a tag to compare

  • Early access support for Mamba models
  • Add support for SiLU
  • Add support for Softplus
  • Add support for Exp, Exp2
  • Add support for Sigmoid
  • Add support for Softmax
  • Speed up model compilation