a.xyz

xFlow

Self-hosted orchestration for channel content. Local inference, procedural assembly, human review at every gate.

xFlow

A self-hosted system that takes a topic or a live news source and returns a finished After Effects project for a human to review and publish - narration, stills, video, music, timeline. Generation runs locally: ComfyUI for image, video, TTS, and recognition; local models for prompting and metadata. Nothing leaves the machine unless it has to.

Two spreadsheets run it. One is a per-channel DSL - voices, visual style, LoRAs, music, timeline parameters - so a new channel is a new row, not new code. The other is a state machine tracking where every video sits across stages, so scripting, production, and upload batch independently with a human QA gate between each. The timeline is assembled procedurally in Houdini, not cut by hand: movement, transitions, and overlays fall out of channel parameters as an After Effects script.

The diagram shows the system at two levels. Overview is the four-stage backbone; Detailed expands each block with its steps and stack.

Channels, config DSL
One spreadsheet drives every block: narration, voices, visual style, LoRAs, music, Houdini settings.
01 Script Writing
News or non-news branch routes through an agent chain. Output is script JSON and a new row in the tracking sheet.
n8n LLM newsapi.org
Awaiting review
02 Production
Audio, stills, recognition, video, cull and fix, timeline, AE script.
n8n ComfyUI LLM Whisper FFMPEG Houdini
Ae Timeline Ready.
03 Manual Validation
Run generated JS in After Effects, review composition, sign off, final render.
After Effects human
Ready for upload
04 Uploadseparate workflow
Publish all approved videos
n8n YouTube API

Worked example

One episode end to end. Breakfast World on a micro-world channel where tiny civilizations live inside everyday food.

01 Script

A topic seed enters the agent chain (Planner, Writer, Editor) and comes out a script.

An educational survey of the inhabited world that is served, each morning, upon a single breakfast tray - a geographical tour of its regions, peoples, and customs, from the Pancake Plateaus to the Bacon Ridges, recorded in the manner of a 1950s field study by observers who know the world will be gone by nine o'clock.

02 Audio

TTS voiceover mixed with a shuffled music bed into one master track that drives the timeline.

master mix, voiceover plus music bed (excerpt)
03 Covers

Five thumbnail options generated from the script; one chosen.

04 Stills

A still is generated per block, then described; the description drives its video prompt. Click any frame to play its clip.

05 Result

Composition assembled in After Effects, reviewed, rendered, and published as Breakfast World.

Final video: Breakfast World

Contact

Get in touch