video-maker is a Codex skill for end-to-end Bilibili explainer-video production on Windows.
- `SKILL.md`: entrypoint and operating rules
- `scripts/bootstrap_project.py`: full new-project scaffold plus runtime-helper generation
- `scripts/bootstrap_video_project.py`: low-level content scaffold used by the full bootstrap wrapper
- `scripts/upgrade_project.py`: upgrade existing projects and rewrite runtime helpers
- `references/agent-orchestration.md`: simplified content-strategist -> script-writer -> narration-polisher content pipeline
- `references/bilibili-tech-explainer-workflow.md`: beat-driven explainer workflow
- `references/narration-polisher.md`: responsibilities and recommended prompts for the narration-polisher subagent
- remotion-best-practices skill: default engine for video composition, animation, subtitles, audio mounting, timeline, and rendering
- web-design-engineer skill: optional visual-design support for layout and motion taste; production output stays in Remotion
- `references/quiet-glass-lab-v3.md`: iOS 18-inspired frosted-glass brand prompt pack
- `references/quiet-glass-lab/base.css`: neutral render foundation only, not a visual theme or module template
- `references/chinese-voice-rules.md`: narration consistency rules
- `references/video-acceptance-rubric.md`: final QA gates, including meaning-gain checks
- `publish/cover_prompt.md` in generated projects: imagegen-ready Bilibili cover prompt template
- `content/visual_qa_report.json` in generated projects: Remotion frame-sample and cover visual-repair report
- `references/imagegen-2-visual-playbook.md`: GPT Image 2 / imagegen production rules for direct-rendered visuals, with visual QA and regeneration
- `scripts/doctor.py`: environment checker
Default flow:
- coordinator creates the project, runs simple commands, assigns subagent task packets, and makes final go / no-go decisions.
- content-strategist writes the problem, audience, opening, meaning, outline, depth, detail, and evidence contracts.
- script-writer writes `content/script_draft.json`.
- narration-polisher polishes the finished draft into natural Bilibili narration in `content/narration_polish.json`.
- coordinator compiles shot intents and render-ready segments from the approved narration.
- visual-architect designs Remotion scenes and uses imagegen for visual assets, exploded diagrams, animation assets, and the cover base.
- visual-qa-fixer inspects Remotion frame samples and `publish/cover.png`, then fixes Remotion code, assets, or cover text, or regenerates imagegen outputs until visual blockers are gone.
- production-engineer owns Qwen master-track audio, Remotion props, timeline, render, and export.
- acceptance-reviewer checks many real screenshots, reads key Remotion code and content contracts, and listens to required audio samples.
- Publish handoff runs only after video, cover, metadata, visual QA, and acceptance all pass.
Stable defaults:
- new projects do not use HTML slide rendering; Remotion scene code and props appear after content and shot intents are locked
- the main agent is a coordinator, not a maker; substantial content, visuals, voice, assembly, and review are delegated
- video composition, animation, subtitles, audio mounting, timeline, and rendering default to Remotion; the visual system is a brand prompt pack, not fixed scene templates
- visual assets and Bilibili covers default to visual-architect + imagegen, with final rendered-pixel repair owned by visual-qa-fixer
- visual production records the benchmark and key visual inside `visual_asset_plan`, so inspiration, cover promise, reusable key visual, imagegen assets, and QA stay connected without extra contract files
- Chinese narration prefers local Qwen first
- narration uses a single master-track path with a hard timeout and a `voice_jobs/qwen_master_status.json` status manifest
- full-audio approval requires `voice_profile.full_audio_review_status = passed`; opening or midpoint spot checks are only supporting notes
- the Chinese pace target is calibrated to the 2026-04-25 CUDA video: about 260 CJK chars/min, with duration mismatch treated as non-blocking when content and QA pass
- content planning now starts from `problem_contract`, then writes evidence-backed contracts before any render-ready segment exists
- finished content drafts are always sent through narration-polisher to remove AI machine-tone and lower ordinary viewers' comprehension cost
- Bilibili cover generation happens after title lock and local QA, not during early content phases
- `prepare_publish_job.py` now emits `cover_path` and refuses publish handoff when the cover file is missing
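The pace target above (about 260 CJK chars/min) converts directly into a duration estimate for a narration script. This is an illustrative sketch, not a helper shipped with the skill; the function name and the CJK range used are assumptions:

```python
import re

# Basic CJK Unified Ideographs plus Extension A
CJK_RE = re.compile(r"[\u4e00-\u9fff\u3400-\u4dbf]")
PACE_CJK_PER_MIN = 260  # calibrated to the 2026-04-25 CUDA video

def estimate_duration_seconds(narration: str) -> float:
    """Estimate spoken duration from the CJK character count."""
    cjk_chars = len(CJK_RE.findall(narration))
    return cjk_chars * 60.0 / PACE_CJK_PER_MIN
```

Under this estimate a 3900-character Chinese script runs about 15 minutes; per the default above, a mismatch against the planned duration is advisory, not a blocker.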
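The audio and cover gates listed above can be read as one pre-publish check. The sketch below is hypothetical: the file paths follow the defaults named here, but the `check_publish_gates` helper and the `voice_profile.json` location are assumptions, not part of the skill's actual scripts:

```python
import json
from pathlib import Path

def check_publish_gates(project: Path) -> list[str]:
    """Return a list of blockers; an empty list means publish handoff may proceed."""
    blockers = []

    # Master-track status manifest written by the Qwen audio step
    status_file = project / "voice_jobs" / "qwen_master_status.json"
    if not status_file.is_file():
        blockers.append("missing voice_jobs/qwen_master_status.json")

    # Full-audio approval flag; spot checks alone are not sufficient
    profile_file = project / "voice_profile.json"  # hypothetical location
    profile = json.loads(profile_file.read_text(encoding="utf-8")) if profile_file.is_file() else {}
    if profile.get("full_audio_review_status") != "passed":
        blockers.append("full_audio_review_status is not 'passed'")

    # Cover file must exist before publish handoff
    cover = project / "publish" / "cover.png"
    if not cover.is_file():
        blockers.append("missing publish/cover.png")

    return blockers
```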
For real Bilibili publishing, pair this skill with desktop-control-for-windows.
See SKILL.md for usage, quiet-glass-lab-v3.md for brand visual rules, and chinese-voice-rules.md for voice rules. Remotion execution follows the companion remotion-best-practices skill.
MIT