n8nflow.net logo

Generate High-Quality Audio with Voxtral Small 24B 2507

by Yaron Beenβ€’Updated: Last update 23 days agoβ€’Source: n8n.io
Loading workflow viewer...

Getting Started

Generate High-Quality Audio with Voxtral Small 24B 2507

This workflow integrates the notdaniel/voxtral-small-24b-2507 model from Replicate to generate audio content from provided inputs. It handles API authentication, creates predictions, polls until completion, and outputs the final generated audio file.


⚑ Section 1: Trigger & Authentication

  • πŸ”˜ On clicking 'execute' β†’ Manually starts the workflow.
  • πŸ”‘ Set API Key β†’ Stores your Replicate API key to authenticate requests.

Benefit: Securely connects your workflow to Replicate’s API and ensures only authorized requests are made.


πŸŽ›οΈ Section 2: Create Prediction

  • 🌐 Create Prediction β†’ Sends a request to Replicate’s API with parameters like:

    • audio : Input audio file (e.g., a reference sample).
    • max_new_tokens : Maximum number of tokens to generate (controls audio length/complexity).

Benefit: Starts the audio generation process with configurable input and settings.


⏳ Section 3: Polling & Status Tracking

  • πŸ†” Extract Prediction ID β†’ Captures the unique prediction ID and endpoint for polling.

  • ⏱️ Wait β†’ Pauses for 2 seconds before re-checking.

  • πŸ“‘ Check Prediction Status β†’ Polls Replicate’s API to see if the audio generation is done.

  • βœ… Check If Complete β†’

    • If finished: moves forward to process results.
    • If not: loops back to wait and check again.

Benefit: Efficiently manages asynchronous audio generation, ensuring the workflow only proceeds when results are ready.


🎧 Section 4: Process Result

  • πŸ“ Process Result β†’ Extracts and structures final output data:

    • status (success or failure)
    • output (raw response)
    • metrics (generation statistics)
    • timestamps (created and completed times)
    • audio_url (final generated audio link)

Benefit: Provides a clean, structured output that can be used in follow-up automations (e.g., sending audio to users, storing in a database, or sharing via email).


πŸ“Š Workflow Overview

SectionPurposeKey NodesBenefit
⚑ Trigger & AuthenticationStart workflow & authenticateManual Trigger, Set API KeySecure execution
πŸŽ›οΈ Create PredictionSubmit audio generation requestCreate PredictionStart model processing
⏳ Polling & Status TrackingMonitor prediction progressExtract Prediction ID, Wait, Check Prediction Status, Check If CompleteEnsures reliable completion
🎧 Process ResultFormat and deliver outputProcess ResultClean audio result ready for use

βœ… Final Benefits

  • πŸ”’ Secure authentication with Replicate
  • πŸŽ›οΈ Flexible audio generation using voxtral-small-24b-2507
  • ⏳ Reliable polling until results are ready
  • 🎧 Clean and structured audio output