🧪 Runway Test · 4 Lengths × CEO × Map Angle

1 cohort · 1 persona · 1 frame (extend runway via programs you don't know) · 4 length variants · NO calendar link · 1,400 sends · 14-day decision · ← brain hub

§0 Hypothesis

One frame, four lengths. Find length winner first.

Frame: "We can help extend your runway by claiming hyperscaler programs you don't know about." Direct runway benefit. Programs you didn't know existed. No engineering time required.

Variable: body length only. Subject + frame + CTA held constant.

Persona: CEO / Founder only. (Engineer test deferred to wave 2 to avoid persona × length noise.)

Cohort: funded Series A or B SaaS · 20-50 emp · EU/UK · last 18mo round.

CTA: "Worth comparing notes?" — no calendar link anywhere.

Goal: identify length sweet spot for this audience. Pick top 2, run round 2 confirmatory.

§1 List Spec

PersonaCEO / Founder / Co-Founder ONLY
Employee bucket20-50 emp (overflow to 50-100 if list short)
StageFunded Series A or B (last 18mo)
GeoUK · DE · NL · Sweden · Denmark · Finland · Norway · Poland · France
IndustrySaaS · cloud-native
Source lista_b_valid_contacts_ultra_strict_clean.csv (filtered)
Total sends1,400 (4 × 350)
Pace~280/day over 5 days
Decision window14 days from last send
SuppressionB7 competitor list + WideOps staff (17 contacts)

A Variant A · Ultra-Short ~20 WORDS

B Variant B · Short ~45 WORDS

C Variant C · Medium ~85 WORDS

D Variant D · Long ~170 WORDS

§2 Volume Math · Why 350 per Variant

VariableValueSource
Baseline positive-reply rate~1.0%B5 funnel data + Unify 25M study
Effect size to detect50% lift between variantsRequired for ship-vs-kill decision
Statistical power target80% (directional)Standard for early-stage A/B
Minimum sends per variant~350Two-proportion z-test at α=0.10 (one-sided)
Total sends1,4004 variants × 350
Expected positive replies (1% baseline)~14Across all variants
Expected meetings booked~427.8% reply-to-meeting (B5)
Expected qualified ($20K+/mo)~373% qualification rate (B5)

Trade-off honest read

350/variant gives directional read, not statistical certainty. If two variants come in at 1.1% and 0.9% positive-reply rate, that's NOT a real difference. Need 600+/variant for confident kill. Treat round 1 as a pre-screen: keep top 2 lengths, run round 2 at 500/variant for confirm.

§3 Decision Rule

DayAction
Day 0-5Send 1,400 across 4 variants (350 each), ~280/day
Day 3-4Auto-send threaded bump on no-reply
Day 7First metrics check: open rate sanity, hard bounces, spam complaints
Day 14Final read: positive-reply rate by variant + meetings booked by variant
Day 14Pick top 2 variants by positive-reply rate (ignore opens, ignore total replies)
Day 15Round 2: top 2 variants × 500 sends each on next list slice
Day 30Ship winner. Promote to default Wave 2 template

Kill criteria (any variant)

• <1 positive reply per 350 sends in 14 days → kill that variant

• Open rate <30% → deliverability problem, not length problem · investigate sender pool

• Spam complaint rate >0.1% → kill list segment, not variant

• Negative-reply rate >15% → frame is wrong, kill ALL variants and rethink offer

What we are NOT testing in round 1

• Persona (CEO only — CTO/VPE deferred to wave 2)

• Subject line (extend runway held constant — connector + emoji subjects deferred)

• Hook type (signal-driven held constant — pain-driven and stat-driven deferred)

• Send time (all sends in same morning UTC window)

One axis at a time. Length first.