Skip to content

Justagwas/A2M

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

31 Commits
 
 
 
 
 
 
 
 
 
 
 
 

A2M - Audio to MIDI

Code: GitHub Website Mirror: SourceForge

A2M Logo

Download (Windows)

Desktop audio-to-MIDI conversion tailored for piano recordings

Runs entirely offline on Windows, CPU-first by design, with optional CUDA or DirectML acceleration

Version License Last Commit Open Issues Stars Installs (7d)

Overview

A2M (Audio to MIDI) is a Windows desktop app that converts local, piano-focused audio into .mid files.

The app uses ONNX Runtime, supports CPU by default, and can switch to optional GPU acceleration through runtime packs (CUDA or DirectML). It includes two transcription engines in settings: Legacy v1.0.0 and Modern v2.0.0 (Legacy default).

Basic usage

  1. Download and install from the latest release.
  2. If prompted, allow the model download.
  3. Click Choose audio and select a local file.
  4. Select CPU or GPU mode in settings.
  5. Click Convert to MIDI.
  6. Open the output folder from the footer (Open downloads folder) or from your configured save path.

Features

  • Local audio-to-MIDI transcription workflow.
  • ONNX Runtime inference pipeline.
  • CPU-first behavior with optional GPU runtime packs.
  • GPU provider preference options: Auto, CUDA, DirectML.
  • In-app runtime-pack installation for GPU dependencies.
  • Legacy and Modern transcription engines.
  • Modern tuning options for adaptive thresholds, input normalization/denoise, overlap stitching, and auto calibration.
  • Output location controls, UI scale controls, and update checks.
  • Stop/cancel handling for model download, runtime-pack download, cuDNN install, and conversion.

Feature sections

Runtime and Acceleration

  • A2M runs on CPU by default.
  • Optional GPU runtime packs are installed per-user under:
    • %LOCALAPPDATA%\A2M\runtime_packs\cuda
    • %LOCALAPPDATA%\A2M\runtime_packs\dml
  • Runtime-pack endpoints are defined in A2M/a2m/core/config.py.

Transcription Engines

  • Legacy v1.0.0 is the default engine.
  • Modern v2.0.0 exposes additional behavior controls and diagnostics.
  • Modern controls are shown contextually in settings based on selected engine/calibration mode.

Preview

For Developers

Requirements

Running From Source

cd A2M
py -m pip install -r requirements.txt
py A2M.py

Configuration Files

Security and OS Warnings

Contributing

Contributions are welcome.

License

Licensed under the GNU General Public License v3.0 (GPL-3.0).

See LICENSE.

Contact

About

Desktop audio-to-MIDI transcription for piano recordings with optional CUDA/DirectML GPU acceleration

Topics

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Sponsor this project

 

Contributors

Languages