vidtranscript the round

Investor one-pager/ pre-seed → seed/ the model proposes · the code disposes

vidtranscript

The code-enforced-correctness civic-video pipeline for local newsrooms.

Built for, and running live in, a real hyperlocal Brooklyn newsroom — Bushwick Daily. verified · reference customer

Raising $[X] pre-seed / seed.  The round funds the layer around a proven safety core.
A finished 9:16 captioned civic clip: the karaoke caption “This is a sanctuary” with the active word highlighted yellow, a JULIE WON lower-third, a TOPIC: IMMIGRATION banner, a RACE FOR CONGRESS bar and a BUSHWICK DAILY watermark — rendered with ffmpeg at zero render cost. spoken word · grounded
I What we highlight — the three code-enforced moats

Not a wrapper: the code, not the model, decides what is true, fair, and shippable.

A wrapper passes a prompt to a model and ships the model's output. vidtranscript inverts that exactly where it matters — the model proposes editorial judgment; deterministic code decides what is true, what is fair, and what may ship. Three gates carry the load.

Moat 1 · correctness

A published quote physically cannot contain an unspoken word

Exact normalized-token subsequence match against the Deepgram word array; quotes stored as word indices, display text regenerated from the array; unmatched quotes land in rejected_quotes[] with a reason. The renderer never sees the model's string.

6-mo copycat? · No — needs the indices-not-strings data model + reject pipeline

Moat 2 · fairness / liability

An imbalanced debate edit refuses to render unattended

Airtime recomputed on the final trimmed cut, moderators excluded; render returns HTTP 409 until a justification is recorded. Server-enforced — it holds even if the front end is bypassed. The debate profile is forced to full diarization.

6-mo copycat? · No — a render-blocking instrumented gate encoding a liability model

Moat 3 · operational capture

Daily civic capture from a hostile platform, without bans or over-billing

Global on-disk circuit breaker (6h cooldown), failure-classified client rotation, 45s pacing, transcript-first tiering, cookie rotation — earned the expensive way. Lead with YouTube (proven); HLS partial.

6-mo copycat? · Partial — relearnable only by paying the same IP-ban / over-bill tuition

Why this is the page's spine

Every section below answers the moat from a different angle: economics proves it is also cheap to run ($0 render); competition proves an incumbent would have to invert its product to copy it; traction proves it runs live; the ask proves the gaps are all around the moat, never in it.
02Problem · the durable macro wound

Local news is dying — and civic accountability with it.

The board meetings, debates, and press conferences still happen — now with no one in the room. The raw material is abundant and public; the labor to turn it into coverage has collapsed. This is accountability loss, not nostalgia.

On-page copy — verified, investor-only

  • Medill 2025  213 news-desert counties (up from 206), 1,524 single-source counties, and ~50M Americans with limited or no local news.
  • Medill 2025  136 newspaper closures in the past year — more than two a week — concentrated in small independents: the exact buyer profile.
  • So-what: the civic proceedings are still public and abundant; what disappeared is the human capacity to cover them.

The collapse, in three verified figures

0
news-desert counties — 2025, up from 206
Medill 2025
0
single-source counties — one outlet from a desert
Medill 2025
0
Americans with limited / no local news
Medill 2025

All three are VERIFIED macro figures (Medill State of Local News 2025), cited here as the structural wound — not as adoption.

03Why now · format shift + the enabling-tech threshold

Civic news moved to short video — newsrooms can't keep up.

The audience that left newspapers now gets civic news from short video and "news influencers." The audience is on the platform; the trustworthy supply is not. And the enabling tech only crossed the grounded-extraction threshold roughly 18 months ago.

On-page copy

  • Pew  1 in 5 U.S. adults — and 38% of those under 30 — regularly get news from social-media influencers.
  • Pew  Local-news attention fell from 37% (2016) to 21% (2025). Demand shifted format faster than newsrooms could follow.
  • Enabling-tech why-now: frontier diarization (Deepgram Nova-3) plus structured-output LLMs (Claude Sonnet-class) only became good enough for grounded, attributed extraction in roughly the last 18 months. Not buildable in 2022.

Local-news attention · 2016 → 2025Pew · verified

37% · 2016 21% · 2025 ▼ Nova-3 + Sonnet threshold (~18 mo ago)

Red declining line = local-news attention 37%→21% (Pew, VERIFIED). Green dashed marker = the ~18-month-ago point where ASR + LLM crossed usable-for-grounding. The format demand and the enabling tech arrived at the same moment.

04Product · the wedge  shared product core — customer-deck wording

Paste a civic URL → a trustworthy social clip.

One operator — no engineer, no command line — turns a pasted civic-video URL or uploaded footage into three reviewable outputs. Minutes, not hours — without the fabrication risk that keeps newsrooms off AI.

The four-move operator flow

Move 1Paste / upload
Move 2Pick a profile
Move 3Review & approve
Move 4Render & download

The three outputs, named

  • (1) A speaker-attributed transcript — word-level timestamps, diarized, human-confirmed names.
  • (2) Extracted stories with provably-spoken grounded quotes plus an article draft.
  • (3) Branded captioned video in 9:16 / 16:9 / 1:1.  verified · 3 ratios
stories_frontdoor_desktop.pngone paste · zero CLI
The vidtranscript one-click front door: a single ingest panel where an operator pastes a civic-video URL or chooses a file, picks a profile, and runs the pipeline — no command line.
stories_ingest_panel_element.pngthe whole input
The ingest control: a URL or Choose File field, a profile dropdown, a key-terms boost field, and a Run pipeline button — the entire operator input surface.
One front door for the whole input: URL or file → profile → key-terms boost → run. No engineer, no terminal.
05Product · the live pipeline + three deterministic gates  shared product core

One operator, end to end — shown working.

Four stages, three deterministic gates drawn as distinct non-LLM nodes. The model proposes editorial judgment; deterministic code decides what is true, what is fair, and what may ship. This diagram is the anti-wrapper argument — the gate-as-node motif that recurs across the page.

The pipeline — the gate is a non-LLM code node

Stage 1Ingest & capture
Stage 2Transcribe · Nova-3
Gate · code3 safety gates
Stage 4$0 ffmpeg render

The three gates — mechanics verbatim with the customer deck

  • Gate 1 — quote grounding: exact normalized-token subsequence vs the Deepgram word array; quotes stored as word indices, display text regenerated from the array; non-matching quotes rejected. verified
  • Gate 2 — playback-gated approval: Approve unlocks only after play-through; the render API returns HTTP 409 on any unapproved cut — server-enforced. verified
  • Gate 3 — render-blocking debate fairness: airtime recomputed on the final trimmed cut, moderators excluded; render returns 409 until the operator records a justification. verified · server-enforced

Crosstalk caveat — diligence-grade candor

Airtime is recomputed on the final cut with moderators excluded; one honest limit — where speakers talk over each other, overlapping words are single-attributed, so airtime can undercount crosstalk.
story_detail_desktop.pngthe editorial workbench
The editorial workbench: grounded quotes listed with timestamps, a render panel, and an article-draft affordance — the operator's review surface.
story_quote_row_element.pngplay-then-approve gate
An isolated quote row showing the playback-gated approval: a Play 17:10 control that must run before the green Approved state can be reached.

The honest boundary — never claim perfect ASR

Grounding proves quote-matches-transcript, not transcript-matches-audio. The human ear at Gate 2 (play-through before Approve) is the mis-transcription safeguard. We do not claim perfect ASR; the human-in-the-loop is the design, and it is a selling point.
06Moat · the spine — the section that wins the room

Three code-enforced moats: why this isn't a wrapper.

The defensibility is the deterministic code around the model — plus operational know-how a prompt can't supply. The model is never trusted with the load-bearing claim.

The moat matrix — verbatim with the pitch-outline

MoatWrapper failure mode feared6-mo copycat?
1 · Correctnessquote groundingAn LLM fabricates or paraphrases a quote; the newsroom publishes words nobody said; trust is gone.No
2 · Fairnessrender-blockingAn AI edit skews a candidate's airtime in election season → legal + reputational exposure.No
3 · CaptureoperationalA naive scraper gets IP-banned or over-bills within days of daily pulls from an anti-bot platform.Partial

"None of these three fall out of an afternoon of prompting. They are deterministic code encoding editorial and operational knowledge — and the secondary asset is a compounding corpus of human-approved, grounded civic clips a copycat can't assemble."

The moat, shown working — investor-only states

gate 1 · quote groundingrejected_quotes[]
model proposed: "we will absolutely fund the shelter"
match vs words[14302…14338]: UNSPOKEN WORD "absolutely" → REJECTED
reason: token not a subsequence of the array · renderer never receives the string
gate 2/3 · render APIserver-enforced
POST /api/render  cut=unapproved
HTTP 409 · CONFLICT
→ render refused until play-through approval (and, for debate, a recorded fairness justification)
label_speakers_desktop.pngLLM suggests · human confirms
The speaker-labeling screen: the LLM suggests speaker names, but a human confirms each one — names are never auto-applied.

Honesty box — stating the edges strengthens credibility

Grounding proves text-matches-transcript, not transcript-matches-audio. Fairness can undercount crosstalk. The capture moat is a lead, not a patent. HLS is partial; YouTube is proven.
07Economics · structural margin

Structural margin: free render, metered intelligence only.

The cost killer of AI-video startups — paid generation / render APIs — is absent by design. Render is ffmpeg-only. Only ASR + LLM are metered, and both are held to cents per asset.

On-page copy

  • architecture  Render = $0.00 per asset — ffmpeg / ffprobe only, no paid render API. An architectural fact, not a promotional rate.
  • in-code estimate · not billed  Hybrid transcribes only newsworthy chapters: ≈$0.20–0.60 / meeting vs ≈$1.85 for full 3-hour diarization.
  • architecture  Content-hash idempotency never re-bills; a per-vendor SQLite ledger makes spend a SQL query (cost_summary()).
  • Pricing is the hypothesis this round tests — proposed per-seat SaaS, stated as a hypothesis, not a claim.

What we do NOT claim

No invented MRR / ARR. No Deepgram $/min ($0.0052) stated as billed truth (appendix only, pending a billed-test confirmation). No "fully autonomous" claim — the human-in-the-loop is a selling point.

Per-asset cost · one shared horizontal scalebars 1–2 illustrative

Full 3h diarizeillustrative
≈$1.85
Hybrid (newsworthy)illustrative
≈$0.20–0.60
Render (ffmpeg)verified · architecture
$0.00

Bars 1–2 = ILLUSTRATIVE (in-code estimate, not invoiced). Bar 3 = VERIFIED ($0.00, an architectural seal). All three share one horizontal scale, so the hybrid saving and the free render read instantly.

08GTM & expansion path

Land in newsrooms, expand by format, cross into adjacent verticals.

Land where pain is highest and incumbency lowest. Expand coverage by configuration — a new civic format is a JSON file. Then carry the same correctness moat into better-funded "provably-spoken quote" buyers.

Land

Indie / hyperlocal newsrooms

The 136-closures/year cohort — highest pain, lowest incumbency. Live reference customer: Bushwick Daily.

Expand — by configuration

A new civic format is a JSON file

verified · 4 profiles  community_board, press_conference, fireside_chat, debate ship today.

Cross — same moat

Civic / gov comms · PR · legal

Depositions, hearings, press shops — all buyers of a provably-spoken quote. A sequenced path, not a claim of being there.

The retention hook

Each footage archive becomes a searchable, attributed quote database — every clip the operator approves compounds into a corpus a copycat can't assemble. Switching cost grows with use.
search_housing_desktop.pngarchive → quote database
A search over the footage archive: many attributed, timestamped quote hits returned for a topic query — the archive functioning as a searchable quote database.
Your footage archive becomes a searchable quote database — attributed and timestamped.
09Traction · proof of reality, told honestly  shared product core

It runs live in a real newsroom today.

The full core path is implemented and runs live. The edges are named out loud — and they are exactly what the round funds. We do not present artifact counts as growth.

On-page copy

  • verified  The full core path — YouTube civic capture → Deepgram → grounded extraction → all three gates → $0 render — is implemented and runs live, built for and used by Bushwick Daily.
  • verified  Real artifacts in the live DB: transcripts, stories, finished renders, a populated cost ledger — cited as real use at single-operator scale, explicitly NOT adoption or growth.

Honest-edges matrix — authored once, identical in both decks, all five edges

roadmapNo fairness-ack UI form yet — server-enforced via HTTP 409; the typed-justification UI is roadmap.
emergingAuto-recap / debate-reel is implemented but CLI-only, not over HTTP — never demoed as a web feature.
scopingSingle-operator — no auth, localhost-CORS-locked, in-memory non-durable jobs.
opsNo scheduled capture-health probe — the check is implemented and cheap, but no scheduler runs it; "daily" is documented intent, not an automated job.
partialHLS capture is partial — lead with YouTube civic capture, which is proven.
home_desktop.pnga working tool, not a toy
The dense transcripts library: 44 transcripts with speaker chips and N-unlabeled work-queue badges — the density of a working newsroom tool.
The transcripts library — speaker chips and "N unlabeled" work-queue badges. Density reads as a working newsroom tool, not a demo.

Read this as reality, not traction

DB row counts are real use at single-operator scale — never dressed as user or usage growth. Bushwick Daily is a reference customer (proof), never an adoption metric.
10Market · bottom-up, credible

A real wedge inside a real tailwind.

SOM is built bottom-up from newsroom seats — not a top-down "1% of a $40B market." The AI-video tooling curve is the tailwind the wedge rides, cited as momentum, not as TAM math.

On-page copy

  • TAM — global AI-video-editing tooling spend. analyst ranges vary  ~$1.6B (2025) → ~$9.3B (2030), ~42% CAGR. Momentum, never the SOM denominator.
  • SAM — newsrooms plus civic / PR / legal teams producing attributed video from spoken-word source that cannot tolerate fabrication. bottom-up from seats
  • SOM — beachhead: independent / hyperlocal U.S. newsrooms. Anchor on real counts (~136 close/yr; ~1,524 single-source counties) × realistic seats (1–3) × a stated per-seat ACV hypothesis = a 3-year obtainable figure. Adjacent verticals = upside, not base case.

TAM / SAM / SOM — subordinating TAM to a bottom-up SOM

TAMAI-video tooling spend~$1.6B (2025) → ~$9.3B (2030) · ~42% CAGRILLUS
SAMattributed-video teamsnewsrooms + civic / PR / legal · seats × ACVBOTTOM-UP
SOMindie / hyperlocal U.S. newsroomsN × $/seat · the wedge we actually defendDEFINE IN DECK

Each tier narrows. TAM = ILLUSTRATIVE. The SOM bar carries the vermilion left-edge as the live wedge — computed bottom-up from seats, deliberately subordinating TAM (the opposite of the usual top-down emphasis).

11Competition · the honest axis

Generic AI clippers can't ship a provable quote.

Competitors optimize for virality. We are the only one that makes the quote provably real and the edit provably fair — the two things a newsroom is legally and reputationally unable to compromise.

On-page copy

  • vidtranscript alone sits top-right: high correctness / fairness enforcement and high civic-video fit.
  • The real substitute, named honestly: a junior producer with an NLE. We are cheaper, faster, and safer than that labor — not a category nobody has.
  • A big clip tool would have to invert its model (virality → provable correctness + render-blocking fairness) to compete — a different product for an audience it doesn't serve.

X = correctness / fairness enforcement · Y = civic-video fit

editorial correctness / fairness enforcement → civic-video fit → Opus Clip / Descriptvirality-optimized clippers Otter-classraw ASR · no editorial gate Agency / manual NLEcorrect but slow & costly vidtranscriptprovable quote · render-blocking fairness

The empty top-right quadrant IS the argument. Competitors are paper-toned; vidtranscript is the elevated civic-green dot, visually isolated. The real substitute is labor, not a feature-peer.

12Team · fit to a correctness product

Built by Alec Meeker — an operator-engineer inside the user's newsroom.

The moats exist because the builder lived the failure modes. A correctness-obsessed pipeline shipped solo, embedded with the actual buyer — earned editorial-domain insight, not assumed it.

Alec Meeker
founder · operator-engineer
alec.meeker@gmail.com
  • Founder narrative: Alec Meeker shipped a working, correctness-obsessed pipeline solo, embedded with Bushwick Daily — the real buyer. Edits were added after measured failures (the audit cites this). Founder-market fit is specific, not a logo wall.
  • The fit claim: a correctness product needs a founder who treats "the quote is real" as an engineering invariant, not a marketing line. The architecture-of-refusal is the evidence.
  • Honest hiring intent: currently a single operator. First hires — productization engineering + design-partner GTM — are part of the ask, not assumed in place.
13The ask · gaps become the plan · the one CTA

We built and proved the hard part. This round funds the layer around it.

Use-of-funds maps 1:1 to the audit's honest gap list. None of these gaps touch the safety core; they are the periphery around it. This is the deck's only call to action.

Use-of-funds — maps 1:1 to the audit gap list

Audit gap — verified, honest edgebucket
No auth / multi-operator; localhost-CORS-locked; in-memory non-durable jobsproductization
Fairness-ack server-enforced (409) but no UI formUX gap
Auto-recap / debate-reel CLI-only, not over HTTPfeature reach
No scheduler for the capture-health probe ("daily" is intent)ops
Cost rates are in-code estimates, not billed-confirmedvalidation
Single operator; no GTM motion yetcommercial

One CTA — the round & ask

The audit told us exactly what's left; the ask pays for exactly that.

The round funds the productization layer — multi-tenant, auth, and the UI for the gates we already enforce server-side — plus the first design-partner newsrooms.

Raising: $[X] pre-seed for [N] months  ·  Milestone bought: [N] paying design-partner newsrooms + multi-tenant hosted GA  ·  Numbers are the founder's to set; the structure is fixed here.

Lead this round → Request a demo →

Alec Meeker, founder · alec.meeker@gmail.com

Paste a civic URL. Get a trustworthy clip. The quote is provably real — and the code, not the model, guarantees it.

the verbatim tagline clause · closes the page