ryanfrancesconi/spfk-audio-base

Shared audio types, AVFoundation extensions, and processing utilities for the SPFK package ecosystem. Provides the foundational layer used by [SPFKTempo](https://github.com/ryanfrancesconi/spfk-tempo), [SPFKLoudness](https://github.com/ryanfrancesconi/spfk-loudness), [SPFKMusical

Core Types

Bpm

A tempo value in beats per minute with octave-equivalent matching.

let tempo = Bpm(120)!

tempo.quarterNoteDuration  // 0.5 seconds
tempo.multiples            // [15, 30, 60, 120, 240, 480, 960]
tempo.isMultiple(of: 60)   // true (120 is 2x of 60)
tempo.isMultiple(of: Bpm(61)!, tolerance: 2.0)  // true (within ±2 BPM)

LoudnessDescription

EBU R128 loudness metrics for audio files.

var loudness = LoudnessDescription()
loudness.loudnessIntegrated = -24.13    // LUFS
loudness.loudnessRange = 1.43           // LU
loudness.maxTruePeakLevel = -0.07       // dBTP
loudness.maxMomentaryLoudness = -19.51  // LUFS
loudness.maxShortTermLoudness = -22.99  // LUFS

loudness.isValid       // true
loudness.stringValue   // "I -24.1 LUFS, TP -0.1 dB, LRA 1.4 LU, M -19.5 LU, S -23.0 LU"

// Average across files
let average = [loudness1, loudness2, loudness3].average

AudioFileType

Enum representing audio formats with Core Audio, UTType, and MIME type mappings.

let type = AudioFileType(pathExtension: "m4a")
type?.fileTypeName       // "Apple MPEG-4 Audio"
type?.avFileType         // .m4a
type?.utType             // .mpeg4Audio
type?.mimeType           // "audio/mp4"
type?.isAudio            // true
type?.isPCM              // false
type?.supportsMetadata   // true

CountableResult

Generic consensus voting for iterative analysis with early exit.

var results = CountableResult<Int>(matchesRequired: 3)

results.append(120)  // false
results.append(121)  // false
results.append(120)  // false
results.append(120)  // true — 120 reached 3 matches

results.suggestedValue  // 120
results.choose()        // 120 (most frequent)

NoteName and MusicalTonality

Chromatic note names and tonality for musical key detection.

let note = NoteName(string: "Db")  // .cSharp
note?.description                  // "C#"
note?.enharmonic                   // "Db"

let tonality = MusicalTonality(string: "minor")  // .minor

Audio File Scanning

AudioFileScanner streams an audio file in fixed-size chunks with progress reporting. Used by analysis engines (BPM, loudness, musical key) to process audio incrementally.

let scanner = AudioFileScanner(
    bufferDuration: 0.5,
    sendPeriodicProgressEvery: 4,
    minimumDuration: 15   // loop short files in-memory
) { event in
    switch event {
    case .progress(let url, let value):
        print("\(url.lastPathComponent): \(value)")
    case .data(let format, let length, let samples):
        // process PCM samples
        break
    case .periodicProgress(let url, let value):
        // run intermediate analysis
        break
    case .complete(let url):
        print("Done: \(url.lastPathComponent)")
    }
}

try await scanner.process(url: audioFileURL)

When minimumDuration is set and the file is shorter than half that value, the scanner loops by seeking back to frame 0, providing enough material for algorithms that require a minimum input length.

Waveform Visualization

Parse audio files into drawing-ready waveform data at multiple resolution levels.

let parser = WaveformDataParser(resolution: .medium)
let waveform = try await parser.parse(url: audioFileURL)

waveform.channelCount     // 2
waveform.audioDuration     // 180.5
waveform.samplesPerPoint   // 64 (.medium)

// Extract a time range
let segment = try waveform.subdata(in: 10.0 ..< 20.0)

Resolution levels control the tradeoff between detail and data size:

| Resolution | Samples per Point | Best For | |------------|------------------|----------| | .lossless | 1 | Full-resolution editing | | .veryHigh | 16 | Detailed zoomed views | | .high | 32 | Standard waveform display | | .medium | 64 | Overview display | | .low | 128 | Thumbnail views |

AVFoundation Extensions

AVAudioPCMBuffer

let buffer = try AVAudioPCMBuffer(url: audioFileURL)

buffer.duration        // seconds
buffer.rmsValue        // RMS across all channels
buffer.isSilent        // true if all samples are zero

// Processing
let normalized = try buffer.normalize()
let reversed = try buffer.reverse()
let faded = try buffer.fade(inTime: 0.1, outTime: 0.5)
let converted = try buffer.convert(to: targetFormat)
let peak = try buffer.peak()

// Editing
let segment = try buffer.extract(from: 1.0, to: 5.0)
let looped = try buffer.loop(numberOfDuplicates: 3)
try buffer.append(otherBuffer)
try buffer.write(to: outputURL)

AVAudioFile

let file = try AVAudioFile(forReading: url)

file.duration              // seconds
file.dataRate              // kbps (estimated)
try await file.estimatedDataRate()  // kbps (accurate)

let buffer = try file.toAVAudioPCMBuffer()
let partial = try file.toAVAudioPCMBuffer(maxDuration: 10)
let channelData = try file.toFloatChannelData()

AVAudioFormat

format.readableDescription  // "44100 Hz, 16-bit, Stereo"
format.bitsPerChannel       // 16
format.bitRate              // 1411200.0

AVAudioFormat.createPCMFormat(bitsPerChannel: 24, channels: 2, sampleRate: 96000)

AVAudioEngine

engine.outputFormat
engine.maxFramesPerSlice

engine.safeAttach(nodes: [mixer, effect])
engine.safeDetach(nodes: [mixer, effect])
engine.connectAndAttach(source, to: destination, format: format)

AVAudioNode

node.resolvedName              // human-readable name
node.isOutputNodeConnected     // true if has output
node.ioConnectionDescription   // ASCII-art connection visualization

try node.disconnectOutput()
try node.disconnectInput()
try await node.disconnect(input: specificInput)

System Audio Configuration

AudioDefaults manages the system audio format as a thread-safe actor:

await AudioDefaults.shared.update(systemFormat: deviceFormat)
await AudioDefaults.shared.sampleRate  // 48000.0
await AudioDefaults.shared.isSupported(sampleRate: 22050)  // depends on enforcement

Time Formatting

RealTimeDomain provides time display string formatting:

RealTimeDomain.string(seconds: 3661.5)
// "1:01:01.500"

RealTimeDomain.string(seconds: 125.0, showHours: false, showMilliseconds: false)
// "2:05"

RealTimeDomain.seconds(string: "1:30.250")
// 90.25

Architecture

SPFKAudioBase
  ├── Definitions/
  │   ├── AudioFileType          — Audio format enum with Core Audio/UTType/MIME mappings
  │   ├── Bpm                    — Tempo value with octave-equivalent matching
  │   ├── LoudnessDescription    — EBU R128 loudness metrics (LUFS, LU, dBTP)
  │   ├── NoteName               — Chromatic note names with enharmonic support
  │   ├── MusicalTonality        — Major / Minor / Unknown
  │   ├── CountableResult        — Generic consensus voting for iterative analysis
  │   ├── URLProgressEvent       — Progress/completion events for async processing
  │   ├── WaveformData           — Drawing-ready waveform sample data
  │   ├── WaveformDisplay        — Waveform display modes and quality levels
  │   ├── WaveformDrawingResolution — Resolution presets for waveform rendering
  │   ├── BufferPeak             — Peak amplitude with sample position
  │   ├── RealTimeDomain         — Time display formatting
  │   └── AudioDefaults          — Thread-safe system audio configuration
  │
  ├── Extensions/
  │   ├── AVAudioPCMBuffer+      — Duration, RMS, peak, normalize, fade, convert, loop
  │   ├── AVAudioFile+           — Duration, bitrate, buffer conversion
  │   ├── AVAudioFormat+         — Readable descriptions, bit depth, PCM format creation
  │   ├── AVAudioEngine+         — Safe attach/detach, connection helpers
  │   ├── AVAudioNode+           — Connection introspection and management
  │   └── AVAudioMixerNode+      — Input bus management
  │
  └── Utilities/
      ├── AudioFileScanner       — Streaming file scanner with in-memory looping
      ├── WaveformDataParser     — Multi-resolution waveform data extraction
      └── AudioTools             — Audio file utilities (looped audio creation)

Dependencies

| Package | Purpose | |---------|---------| | spfk-base | Core utilities, logging, type extensions | | spfk-testing | Test audio resources (test target only) |

Requirements

Platforms: macOS 13+, iOS 16+
Swift: 6.2+

About

Spongefork (SPFK) is the personal software projects of Ryan Francesconi. Dedicated to creative sound manipulation, his first application, Spongefork, was released in 1999 for macOS 8. From 2016 to 2025 he was the lead macOS developer at Audio Design Desk.

Package Metadata

Repository: ryanfrancesconi/spfk-audio-base

Default branch: main

README: README.md