Skip to content

Popular repositories Loading

  1. Step-Audio Step-Audio Public

    Python 4.6k 373

  2. Step-Video-T2V Step-Video-T2V Public

    Python 3.2k 335

  3. Step1X-Edit Step1X-Edit Public

    A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gemini 2 Flash.

    Python 2.1k 89

  4. gelab-zero gelab-zero Public

    STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.

    Python 1.9k 164

  5. Step-Audio2 Step-Audio2 Public

    Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

    Python 1.3k 95

  6. Step-Audio-EditX Step-Audio-EditX Public

    A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics, and features robust zero-shot text-to-speech

    Python 840 55

Repositories

Showing 10 of 25 repositories

Most used topics

Loading…