Audiovisual

Lesson 01: Audio-Reactive Visuals

Learn the basic principles of tying sound to graphics. Start mapping the acoustic behavior of a virtual modular into a visual system.


What You Will Learn

By the end of this lesson, you should understand:

  • how sound is converted into numbers to control graphics
  • why amplitude is a good start, but a bad finish for audio-reactive visuals
  • how to isolate frequency bands for selective reaction
  • what data smoothing (lag/slew) is and when to apply it
  • how to make mapping meaningful, rather than chaotically flickering

Main Idea

An audiovisual system becomes impressive only when the visual layer responds to actual musical structure, and not just to the overall volume level.

Instead of forcing graphics to jump to a finished track (like classic equalizers), we can pull data right from the heart of the modular patch - before the voices get mixed together in the master.

Why It Matters

Reactive visuals should not be random decoration. They should reveal hidden musical behavior to the viewer.

If the chaos on the screen only loosely corresponds to what the audience hears, dissonance arises. The connection between the visual and the auditory must feel like a single “living organism”.

Useful Inputs for Mapping

To create high-quality audio-reactive visuals, you need to analyze the sound. Useful inputs for mapping:

  • Amplitude (Level/Volume): overall signal volume. Good for scale or brightness.
  • Onset intensity (Transients): detects sharp spikes in volume (drums, percussion). Great for particle generation or sharp strobes.
  • Frequency bands (FFT): splits the sound. Lows can distort geometry, while highs can change color or noise density.
  • Rhythmic density (Energy over Time): energy accumulated over a short window. Useful for slow, evolving parameters.
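The first two analysis inputs can be sketched as plain functions. Below is a minimal TypeScript sketch, assuming the input is a normalized FFT magnitude frame (one value in 0..1 per bin, e.g. derived from a Web Audio AnalyserNode); the function names are illustrative.

```typescript
/** Overall amplitude as RMS over all bins (good for scale or brightness). */
function rmsLevel(mags: Float32Array): number {
  let sum = 0;
  for (let i = 0; i < mags.length; i++) sum += mags[i] * mags[i];
  return Math.sqrt(sum / mags.length);
}

/** Average energy inside one frequency band [loHz, hiHz). */
function bandEnergy(
  mags: Float32Array,
  sampleRate: number,
  loHz: number,
  hiHz: number
): number {
  const binHz = sampleRate / 2 / mags.length; // width of one FFT bin
  const lo = Math.max(0, Math.floor(loHz / binHz));
  const hi = Math.min(mags.length, Math.ceil(hiHz / binHz));
  let sum = 0;
  for (let i = lo; i < hi; i++) sum += mags[i];
  return hi > lo ? sum / (hi - lo) : 0;
}
```

With this split, something like `bandEnergy(mags, 48000, 20, 150)` could drive geometry distortion while `bandEnergy(mags, 48000, 4000, 12000)` drives color or noise density.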

Proper Mapping Examples

Mapping is the art of choosing what affects what. A good mapping has an internal logic:

```mermaid
graph LR
  subgraph MODULAR[Audio / CV Sources]
    SUB[Bass Energy / Sub]
    TRANS[Transient Activity]
    DRONE[Drone Intensity]
    LFO[Control LFO]
  end

  subgraph VISUALS[Visual Engine]
    SCALE[3D Scale / Mass]
    PART[Particle Burst / Bloom]
    CAM[Camera Motion / Turbulence]
    ROT[Rotation / Hue Shift]
  end

  SUB -.->|Amplitude envelope| SCALE
  TRANS -.->|Pulse / Trigger| PART
  DRONE -.->|Slew / Averaged level| CAM
  LFO -.->|Direct CV| ROT

  classDef signal fill:#1A202C,stroke:#2D3748,stroke-width:2px,color:#E2E8F0;
  classDef visual fill:#2C7A7B,stroke:#319795,stroke-width:2px,color:#E6FFFA,stroke-dasharray: 4 4;
  classDef env fill:none,stroke:#4A5568,stroke-width:1px,stroke-dasharray: 2 2;

  class SUB,TRANS,DRONE,LFO signal;
  class SCALE,PART,CAM,ROT visual;
  class MODULAR,VISUALS env;
```
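The diagram above can be read as one function per rendered frame: each source drives exactly one visual parameter. A minimal TypeScript sketch, where all names and ranges are illustrative (inputs assumed normalized to 0..1, the LFO to -1..1):

```typescript
interface VisualState {
  scale: number;    // 3D scale / mass, driven by bass energy
  burst: boolean;   // particle burst, driven by a transient trigger
  camShake: number; // camera turbulence, driven by the slewed drone level
  hueDeg: number;   // hue shift, driven by the control LFO
}

function mapFrame(
  bass: number,       // 0..1, sub/bass energy
  transient: boolean, // one trigger per detected hit
  drone: number,      // 0..1, already slewed/averaged
  lfo: number         // -1..1, direct CV
): VisualState {
  return {
    scale: 1 + 2 * bass,             // 1..3: mass swells with the sub
    burst: transient,                // one burst per hit
    camShake: 0.5 * drone,           // gentle, averaged motion
    hueDeg: (lfo * 0.5 + 0.5) * 360, // direct CV spans the full hue circle
  };
}
```

The point is the discipline: one source, one destination, with an explicit range for each, instead of everything reacting to everything.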

The Default Problem: “Jittery Graphics” and Smoothing

One of the most common mistakes is routing a raw envelope or audio signal directly to the scale of a visual layer. Sound changes incredibly fast, and graphics that react at audio rate look jittery and cause visual fatigue.

Use smoothing nodes (Smoothing, Lag, or a modular Slew Limiter) before sending the signal to the renderer. This keeps the graphics reacting quickly (the attack remains instant) while they return to their original shape more organically and smoothly.
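The idea can be sketched as a one-pole lag with an instant attack and a slow release. A hypothetical TypeScript sketch; the default coefficient is illustrative:

```typescript
class AttackReleaseLag {
  private y = 0;
  // release in 0..1 per frame; closer to 1 means a slower return to rest
  constructor(private release = 0.9) {}
  process(x: number): number {
    if (x > this.y) {
      this.y = x; // attack stays instant
    } else {
      this.y = x + (this.y - x) * this.release; // smooth, organic fall
    }
    return this.y;
  }
}
```

Call `process` once per rendered frame with the raw envelope value and route the output, not the raw signal, to scale or brightness.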

Typical Beginner Mistakes

Mistake 1: Linear reaction to the master track

Reacting to the overall mix often turns into “mush”, where it’s impossible to isolate a specific instrument. It’s much more effective to route individual stems (Kick only, Bass only) to the visualizer.

Mistake 2: Visual Overload

If the entire screen is overflowing with effects jumping on every beat, it quickly gets tiring. Leave “negative space”, let the graphics breathe.

Mistake 3: Forgetting about Threshold

If you map sound without threshold values (Threshold/Noise-Gate), quiet background noise will cause micro-tremors in the object, making it “nervous”. Set up a Gate so the effect triggers only when the element is truly playing.
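A gate with two thresholds (hysteresis) is one way to set this up: the effect opens only above a trigger level and stays open until the signal falls clearly below it, so levels hovering around a single threshold cannot cause flicker. A minimal sketch; the threshold values are illustrative:

```typescript
class NoiseGate {
  private open = false;
  constructor(private openAt = 0.2, private closeAt = 0.1) {}
  process(level: number): number {
    if (!this.open && level >= this.openAt) this.open = true;
    else if (this.open && level < this.closeAt) this.open = false;
    return this.open ? level : 0; // a closed gate passes nothing
  }
}
```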

Practice

  1. Take your favorite generative patch from the previous lessons.
  2. Choose one musical feature in the patch: for example, an LFO controlling a filter.
  3. Send this signal into a visual environment (e.g. use the built-in oscilloscope in VCV Rack, or route it to browser-based WebGL, Resolume, etc.).
  4. Adjust the attenuator (Scale): find the modulation depth at which the movement emphasizes the rhythm of the sound. Describe this mapping in writing.

Bonus Exercise

Simulate the “Smoothing” (Slew/Lag) effect using internal modules: run a rhythmic short trigger through a Slew Limiter module before feeding it to control brightness or a video scrambler. Observe the difference between a hard trigger and a “fluid” smoothed signal.
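In software, the same behavior can be sketched as a rate limiter: the output may change by at most a fixed amount per frame, so a hard 0-to-1 trigger becomes a ramp. The rates below are illustrative:

```typescript
class SlewLimiter {
  private y = 0;
  constructor(
    private risePerFrame = 0.25, // fast but finite rise
    private fallPerFrame = 0.05  // slow, "fluid" decay
  ) {}
  process(x: number): number {
    const delta = x - this.y;
    if (delta > this.risePerFrame) this.y += this.risePerFrame;
    else if (delta < -this.fallPerFrame) this.y -= this.fallPerFrame;
    else this.y = x; // close enough: snap to the target
    return this.y;
  }
}
```

With these defaults a hard trigger reaches full brightness over 4 frames and fades out over 20, instead of switching instantly in both directions.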

Next Connection

Once the principles of “parameter-to-parameter” mapping are clear, an engineering question arises: how do you transmit these values from a software modular without latency? We will break this down in the lesson on OSC and MIDI integration.


Resources

Patch references

Use the linked patch entries below as concrete repository anchors for this lesson track.