What are the ACX audio specifications?

ACX requires 192 kbps CBR MP3, 44.1 kHz sample rate, RMS level between -23 dB and -18 dB, peak level ≤ -3 dB, and noise floor ≤ -60 dB. Every Notevibes export meets these specs automatically — no post-processing or mastering needed.

Does it handle chapter splitting?

Yes. Notevibes auto-detects chapters from EPUB TOC, PDF headings, DOCX heading styles, or custom markers in plain text. Each chapter is exported as a separate audio file with proper naming (Chapter_01.mp3, Chapter_02.mp3, etc.).

What is volume normalization?

Volume normalization ensures consistent loudness across your entire audiobook. Notevibes targets -20 dB RMS (within ACX's -23 to -18 dB range), applies peak limiting at -3 dB to prevent clipping, and keeps the noise floor below -60 dB.

What narration presets are available?

Notevibes offers 8 narration presets: Narrator (classic, measured), Fiction (expressive storytelling), Non-Fiction (authoritative, educational), Memoir (personal, intimate), Children's Book (warm, animated), Thriller/Mystery (suspenseful, gripping), Romance (emotional, intimate), and Custom (full control). Each has optimized speaking rates and recommended tone tags.

Can I publish AI-narrated audiobooks on Audible?

Yes. All voiceovers generated with Notevibes include full commercial usage rights with your subscription. Publish on Audible (via ACX), Google Play Books, Apple Books, Kobo, Findaway Voices, and any audiobook platform.

Can I convert my EPUB or Kindle book to an audiobook?

Yes. Upload your EPUB or Kindle (.mobi/.azw3) file directly. Notevibes detects chapters, extracts text, and lets you generate narration for each chapter with consistent voice and style. The complete audiobook is exported as ACX-compliant audio files.

What is Storybook Mode?

Storybook Mode transforms your audiobook into a visual reading experience. Listeners see beautifully typeset pages with drop caps, genre-specific themes (fantasy, romance, thriller, etc.), AI-generated character portraits, scene illustrations, and smooth page-turn animations — all synced to the narration audio.

How does Read Along work?

Read Along provides word-level highlight sync that follows the narration in real time. Each word lights up as it is spoken, making it easy to follow along. It works in both standard and Storybook Mode, supports auto-advance between chapters, and is especially useful for language learners and accessibility.

What analytics do I get for my audiobook?

Notevibes provides listener analytics including total plays, unique listeners, completion rate, geographic insights (listener locations), and device & behavior data. Track how your audience engages with each chapter and use the data to improve future audiobooks.

Can I sell AI audiobooks on Audible and ACX?

Yes. Notevibes exports ACX-compliant audio (192 kbps CBR, 44.1 kHz, proper RMS and peak levels). All paid plans include full commercial rights with no royalty splits to Notevibes. Upload directly to ACX for Audible distribution, or publish on Google Play Books, Apple Books, Spotify, Kobo, and Findaway Voices.

Do I own the rights to my AI-generated audiobook?

Yes. With any paid Notevibes plan, you own full commercial rights to all generated audio. There are no royalty splits, no per-sale fees, and no platform lock-in. You can sell, distribute, and monetize your audiobook on any platform worldwide.

NoteVibes

Start for free

AI Audiobook Generator

Your book.
Your cast.
Real performance.

Upload an EPUB, Kindle, or PDF. AI detects every character, assigns voices, and narrates with scene direction and inline emotion tags — whisper, urgent, breathless — right down to the line.

FictionThrillerRomanceChildren'sTrue CrimeFantasyMysteryMemoirSelf-Help

Create audiobook Hear the genres

Free. No credit card required.

Literary Fiction · Single NarratorVoice · Aoede

[warm] The rain hammered against the window. [short pause] She knew, [reflective] even before opening the letter, that everything was about to change.

Generated in 30s

Nine narration presets

Every genre. A voice that fits.

Each card shows the preset, the voice, scene direction, and a real sample script you can copy into the editor. Press play to hear the narration.

Open novel on a wooden desk under a warm reading lamp

Literary Fiction

Aoede

Persona

Award-winning narrator. Warm, intimate, sentences allowed to breathe.

Direction

Unhurried and reflective. Each clause lands. Dialogue softens; description carries weight.

warmshort pausereflectiveintimate

Sample script

[warm] The rain hammered against the window. [short pause] She knew, [reflective] even before opening the letter, that everything was about to change.

Shadowed hallway with a single open door at the end

Page-Turner Thriller

Fenrir

Persona

Investigative narrator. Tight, controlled, building dread line by line.

Direction

Cold and forward-leaning. Pauses cut, never settle. Final beat lands flat — not punched.

coldurgentshort pausebreathless

Sample script

[cold] He'd been here before. [urgent] The door was open. [short pause] [breathless] It shouldn't have been.

Two silhouettes near a rain-streaked window, soft lamplight

Romance Novel

Despina

Persona

Intimate narrator with a confidant's voice. Every line is told to one person.

Direction

Warm and slow. Sentences breathe. Whispers used sparingly so they actually mean something.

warmwhisperstremblingshort pause

Sample script

[warm] He said her name like it was a confession. [whispers] Eleanor. [short pause] [trembling] And for the first time in seventeen years, she let herself answer.

Brave little fox stepping into a forest of glowing mushrooms

Children's Adventure

Puck

Persona

Storyteller voice. Animated, mischievous, lots of life — never shouty.

Direction

Bright and playful. Character voices stay distinct. Read for the kid on the floor at bedtime.

cheerfulmischievouslyexcitedwarm

Sample script

[cheerful] The little fox tucked the map under his paw. [mischievously] He was not lost. [excited] He was on an adventure. [warm] There is, of course, a very large difference.

Redacted case file under a desk lamp with a cassette recorder

True Crime

Alnilam

Persona

Cold authoritative narrator. The voice of the investigation, not the victim.

Direction

Measured and grim. Names land. Numbers land. Emotion is implied through restraint.

coldseriousshort pausegrim

Sample script

[cold] On the night of October eleventh, [short pause] only one neighbor heard anything. [serious] She did not call the police. [grim] She would, [short pause] for the rest of her life, regret that.

Lone knight watching torchlight crest a distant ridge

Epic Fantasy

Rasalgethi

Persona

Bardic narrator. Old, weathered, the keeper of a story too large for one telling.

Direction

Stately cadence. Names get weight. World-building lands like memory, not exposition.

seriouswarmshort pausedetermination

Sample script

[serious] Before the Three Kings, before the long winter that bore the name of the Last Queen, [short pause] [warm] there was a road. [determination] And the road did not forget.

Detective's desk with case photos and a single empty teacup

Classic Mystery

Pulcherrima

Persona

Wry observer. Agatha-style narrator who already knows who did it.

Direction

Precise and amused. Small pauses for the listener to catch the clue you just dropped.

curiousmischievouslyshort pauseserious

Sample script

[curious] There were, in the drawing room, seven people. [mischievously] Six of them were lying. [short pause] [serious] The seventh would, in approximately forty minutes, be dead.

Two armchairs by a fireplace with a glass of wine and a notebook

Cozy Memoir

Vindemiatrix

Persona

First-person narrator telling a long story to one friend by the fire.

Direction

Conversational, slightly self-deprecating. Honest without performing the honesty.

warmreflectiveshort pauseintimate

Sample script

[warm] My mother used to say there were two kinds of people. [reflective] [short pause] The kind who left, and the kind who waited. [intimate] She didn't tell me which one I was.

Open notebook with a coffee cup and morning sunlight on a desk

Self-Help Authority

Sulafat

Persona

Confident coach. Direct, motivating, no filler. Trustworthy at scale.

Direction

Clear and measured. Builds belief through specificity, not volume. Lands the call to action softly.

confidencedeterminationwarmshort pause

Sample script

[confidence] The hardest part of any habit is the first ninety seconds. [determination] Not the first day. [short pause] [warm] The first ninety seconds. That's the only door you have to walk through.

How it works

From manuscript to finished audiobook.

Four steps. One afternoon. No studio, no scheduling, no second take. Upload your book and AI handles the cast, the direction, and the delivery.

Upload your book

Drop an EPUB, Kindle, PDF, DOCX, or plain text file. AI extracts the text and detects chapters from your TOC, headings, or markers — your book's structure is preserved automatically.

EPUBMOBIAZW3PDFDOCXTXT

Cast your characters

AI scans the manuscript, identifies every speaking character, and assigns a distinct voice from 550+. Save them to your character library — the same Elena follows you from Book 1 to Book 7.

+ 550 voices

Direct each scene

Per-paragraph scene direction shapes the atmosphere. Inline emotion tags like [whisper] and [urgent] drop at the exact line where delivery shifts — not glued to every paragraph.

Cold, controlled — building dread

whisperWait. breathlessThis can't be right.

Generate & publish

Chapters render in parallel — a 10-hour novel finishes in under an hour. Export per-chapter MP3s or a merged file at ACX-compliant 192 kbps. Publish on Audible, Apple Books, Spotify, anywhere.

ACX-READY

Storybook Mode

Audiobook by ear. Storybook by eye.

Listeners get more than narration — they get a visual reading experience with character portraits, genre-aware scene illustrations, and word-level highlight sync that follows the voice.

Reader looking at a character portrait inside an open book

Character Portraits

Every character has a face.

AI generates a portrait for every detected character and pins it to the page. Elena looks the same in chapter 3 as she does in chapter 27 — no muddled mental images, no skimming back to remember who's who.

Preserved across the whole book.

Reader paging through a book with an inline scene illustration

Scene Illustrations

Genre-aware key scenes.

Inline illustrations at chapter beats — fantasy, romance, thriller, and memoir each have their own visual language. Readers see the world, not just hear it.

Auto-placed at story turns.

Reader following along as words light up under the narration

Read Along Sync

Words light up live.

Word-level highlight follows the narration in real time, like a karaoke for prose. Built for language learners, dyslexic readers, and anyone who wants to follow the text while they listen.

Works in every language.

The economics

Studio quality. A fraction of the cost.

Traditional audiobook production costs $5,000 to $15,000 per finished hour. AI narration delivers the same quality in one afternoon.

Traditional studio

$5K – $15K per hour

$5,000 – $15,000+ per finished hour
2–6 weeks turnaround
Coordinate actors, engineers, studio time
Re-records charged per session
One language per production

Notevibes AIRecommended

From $19/month

Plans starting at $19/month — unlimited books
One afternoon from upload to finished audiobook
Upload, cast voices, click generate
Unlimited revisions, regenerate per line
72 languages from the same manuscript

Your manuscript. One afternoon. One finished audiobook.

No studio. No voice actors. No scheduling. Cast your characters, direct your scenes, drop inline emotion tags where the delivery actually shifts. Building voice AI since 2018.

Start creating for freeFree. No credit card required.

Open book transforming into sound waves on a warm desk

Frequently asked questions

Got questions? We've got answers.

How do I create an audiobook with AI narration?

Upload your book (EPUB, Kindle, PDF, DOCX, or TXT), pick a narration preset (Fiction, Memoir, Thriller, Children's, etc.), and click generate. AI detects chapters from your TOC, identifies every speaking character, and assigns a distinct voice to each one. Export per-chapter MP3s and publish on Audible, Apple Books, Spotify, or Google Play.

What book formats can I upload?

EPUB (.epub) with TOC chapter detection, Kindle (.mobi/.azw3) with chapters preserved, PDF with layout-aware text extraction, Word (.docx) with heading-to-chapter conversion, and plain text (.txt) with custom chapter markers.

How does AI character detection work?

Notevibes scans the full text to find every speaking character, extracts names, roles (protagonist, antagonist, narrator, mentor, etc.), personality traits, and chapter appearances. Each character gets a unique voice. Dialogue switches voices automatically; narration stays consistent across the whole book.

Can I publish AI audiobooks on Audible and ACX?

Yes. Notevibes exports ACX-compliant audio (192 kbps CBR, 44.1 kHz, proper RMS and peak levels). Every paid plan includes full commercial rights — no royalty splits, no per-sale fees. Publish on Audible (via ACX), Apple Books, Google Play, Spotify, Kobo, and Findaway Voices.

What's the difference between scene direction and emotion tags?

Three orthogonal layers. Persona is who the voice IS. Scene direction is how the whole moment FEELS, applied per paragraph ("hushed, confessional — rain at the window"). Emotion tags like [whisper], [urgent], [breathless] are inline at the exact line where delivery shifts. Persona + direction + tags = a performance, not a readout.

How long does a full-length audiobook take?

One afternoon. Chapters generate in parallel — a 10-hour novel typically finishes in under an hour of wall-clock time. There are no per-chapter or per-book limits on paid plans.

How many voices and languages are available?

550+ natural-sounding AI voices across 72 languages. Same character cast, same emotion controls work in every language — publish multilingual editions of the same manuscript without recasting.

What audio formats can I export?

MP3, WAV, OGG, and ULAW. MP3 at 192 kbps CBR is the standard accepted by Audible, Apple Books, Google Play Books, Kobo, Findaway Voices, and every major distributor.

How much does AI audiobook narration cost?

Plans start at $19/month with no per-book limits. Compared to traditional studio production at $5,000–$15,000+ per finished hour, AI narration takes one afternoon at a fraction of the cost.

The full picture

Why authors ship AI-narrated audiobooks

Traditional audiobook production costs $5,000 to $15,000 per finished hour and takes weeks: booking actors, studio time, engineers, post-production, retakes. For most indie authors and mid-list publishers, that math has always meant the audiobook never happens — or happens years late, after the marketing window has already closed.

Notevibes collapses that workflow into text. Upload your EPUB, Kindle, PDF, or DOCX; AI parses chapters from the TOC, detects every speaking character, assigns distinct voices, and narrates with scene direction and inline emotion tags. A 10-hour novel finishes in under an hour of wall-clock time, with full commercial rights and ACX-compliant exports — ready to publish on Audible, Apple Books, Spotify, Google Play, or anywhere else.

Cast a voice for every character — once, forever

Your character library lives at the account level. For each character you save a voice, a portrait, a persona, an emotion-tag vocabulary, and reference details — and new books link their detected speakers to those library characters with one click. Elena in Book 1 sounds and looks identical to Elena in Book 7. Series consistency stops being a production problem.

Three layers of direction — persona, scene, line

Persona is who the voice IS at the character level. Scene direction is how the moment FEELS, applied per paragraph ("hushed, confessional — rain at the window"). Inline emotion tags like [whisper], [urgent], [breathless] sit at the exact word where delivery shifts. Stack all three and the AI delivers a performance, not a readout — and you can invent your own creative tags ([like an orc], [submerged], [through gritted teeth]) per character.

Related: Free text to speech · Read aloud

From manuscript to finished file in formats that ship

Upload EPUB, Kindle (.mobi/.azw3), PDF, DOCX, or plain text. Chapters are detected from the TOC, headings, or custom markers. Export per-chapter MP3s or a merged file at 192 kbps CBR — the ACX-compliant standard accepted by Audible, Apple Books, Google Play Books, Kobo, Findaway Voices, and every major distributor. WAV, OGG, and ULAW are available for editing or telephony.

Beyond audiobooks — the same workflow scales

Once you're shipping multi-voice narration from text, the same workflow extends to every other audio surface you publish. Pair two voices for a podcast. Pull a short clip for TikTok or Reels promo. Generate a YouTube voiceover for the book trailer. The cast you built for the novel travels everywhere — same voices, same emotion controls, same one-editor workflow.

Start your free AI audiobook →