📊 DealForge AI
Agentic Deal Intelligence · April 2026
Agentic Deal Intelligence · Comprehensive Knowledge Base · 2026

The Complete DealForge AI Knowledge Base

Everything powering DealForge's agentic deal scraping, scoring, and notification platform — from enrichment algorithms and SBA calculations to industry benchmarks, pricing, and go-to-market strategy.

$15–$100/moSubscription
0–100DealForge Score
80+Industries Tracked
9,500+Deal Benchmarks
<1 minAlert Speed
85–90%Gross Margin
📊

DealForge AI is an agentic deal intelligence platform that automatically scrapes business-for-sale and CRE listings, enriches incomplete data using Census and industry benchmarks, runs full SBA financing calculations, scores every deal 0–100, and notifies matched investors via SMS — all in under 60 seconds.

The core problem: most business-for-sale listings are incomplete. An estimated 60–70% are missing cash flow, SDE, or other key financial data. Every competitor either skips these listings or forces the user to do manual research. DealForge fills the gaps automatically using a layered intelligence approach, turning "incomplete listing" into "estimated deal with confidence range."

The Differentiator

DealForge's 4-tier enrichment engine is what separates it from BizBuySell saved searches, Kumo, and every other listing aggregator. It creates value on the 60–70% of listings that competitors ignore entirely.

How It Works

This is a Karpathy-pattern knowledge base: raw data is collected, then compiled by an LLM into structured articles. The system uses a multi-agent architecture where specialized agents handle discovery, parsing, enrichment, scoring, matching, and notification. The wiki is the source of truth for knowledge; the database is for fast queries.

Market Opportunity

  • SBA 7(a) lending hit a record $45B across ~85,000 loans in FY2025, with acquisition-specific volume up ~35% YTD
  • 80%+ of sub-$5M buyers use SBA financing — if a deal doesn't work under SBA terms, it doesn't work for most buyers
  • 9,500+ completed transactions in the BizBuySell dataset form the foundation of industry benchmarks
  • Median sale price rising from $337,750 overall to $375,000 in Q4 2025
  • Search fund / ETA community growing rapidly — more first-time acquirers entering the market than ever
Competitive Position

DealForge is 3–10x cheaper than the closest competitor for comparable functionality. BizBuySell saved searches are free but offer no analysis. Kumo charges $149–$200/mo. DealForge Tier 1 delivers full analysis, enrichment, and SMS alerts for $15/mo.

🎯

The DealForge Score answers one question: "Is this deal worth my time?" It combines seven sub-scores into a single number that accounts for both financial fundamentals and data reliability. A deal scored 85 with HIGH confidence is fundamentally different from 85 with LOW confidence — both numbers are always shown together.

Composite Formula

DealForge Score = (
    DSCR_Score        * 0.25 +    # Can it service debt?
    Multiple_Delta    * 0.20 +    # Is it priced fairly vs industry?
    Margin_Delta      * 0.15 +    # Is it profitable vs industry?
    Payback_Score     * 0.10 +    # How fast do you recover equity?
    Cash_After_Debt   * 0.10 +    # Real take-home after loan
    Industry_Risk     * 0.10 +    # Sector default rate / stability
    Data_Confidence   * 0.10      # How much do we trust the numbers?
) * 100

Sub-Score Definitions

DSCR Score (25% Weight) — Most Important

DSCR ValueScoreInterpretation
< 1.000.0Cannot cover debt — deal fails
1.000.1Barely covers — extremely risky
1.250.3SBA minimum threshold
1.500.5Acceptable but thin
2.000.8Good margin of safety
≥ 2.501.0Excellent coverage

Multiple Delta Score (20% Weight)

Compares the asking price (as a multiple of cash flow) against the industry average. A deal priced 20%+ below average scores 1.0 (great deal). At average = 0.7 (fair). 30% above = 0.3 (overpriced). 60%+ above = 0.0 (avoid).

Margin Delta Score (15% Weight)

Is the business more or less profitable than its industry peers? 20%+ above average SDE margin = 1.0. At average = 0.6. 50%+ below = 0.0 (red flag).

Payback Score (10% Weight)

Payback PeriodScoreInterpretation
< 6 months1.0Excellent — fast equity recovery
6–12 months0.8Good
12–18 months0.5Acceptable
18–24 months0.3Marginal
24–36 months0.1Slow
> 36 months0.0Capital tied up too long

Cash After Debt (10%), Industry Risk (10%), Data Confidence (10%)

Cash After Debt: Absolute dollar take-home after debt service. Ranges from 0.0 ($0 or negative) to 1.0 ($300k+/yr). Industry Risk: Based on SBA default rate data — <2% default = 1.0, >6% = 0.1. Data Confidence: How much is based on reported data vs. estimates (see Section 05).

Score Interpretation

Score RangeClassificationInvestor Action
85–100ExceptionalImmediate deep dive — rare find
70–84StrongWorth pursuing — request financials
55–69ModerateReview if flexible on some criteria
40–54WeakLikely pass unless strategic reason
0–39PoorAuto-reject
Score vs. Interest Window

The DealForge Score is universal — it measures absolute deal quality. The Interest Window (Section 07) is personal — it measures fit for a specific investor. A deal can score 90/100 but not match an investor's window. Only deals passing BOTH trigger SMS alerts.

🏦

The SBA 7(a) loan program is the #1 way small businesses change hands in the United States. DealForge assumes SBA financing as the default because it's what 80%+ of sub-$5M buyers use. If a deal doesn't work under SBA terms, it doesn't work for most buyers.

Standard SBA 7(a) Terms (DealForge Defaults)

ParameterDefault ValueNotes
Down payment10%Standard for acquisitions without real estate
Financed portion90%
Loan term10 yearsNon-real-estate-heavy deals
Loan term (with RE)25 yearsWhen significant real estate is included
Interest ratePrime + 2.75%For loans >$250k; variable rate
SBA guaranty75% of loan amountFor loans >$150k
Closing costs~3% of asking priceLegal, appraisal, environmental, lender fees

SBA Guaranty Fee Schedule

Loan AmountFee Rate (on guaranteed portion)
≤ $150,0000%
$150,001 – $700,0002.0%
$700,001 – $1,000,0003.0%
> $1,000,0003.5%

DSCR — The Key Metric

DSCR = Annual Cash Flow (SDE) / Annual Debt Service
DSCRLender ViewDealForge Class
< 1.00Automatic declineFAIL
1.00–1.24Very riskyWEAK
1.25Minimum thresholdMARGINAL
1.25–1.50Approvable but tightACCEPTABLE
1.50–2.00ComfortableGOOD
> 2.00Excellent safetySTRONG

Full Calculation Chain

1. Total Acquisition Cost = Asking_Price + Closing_Costs + SBA_Guaranty_Fee
2. Cash Down = Total_Acquisition_Cost * 0.10
3. Loan Amount = Total_Acquisition_Cost - Cash_Down
4. Monthly Payment = Loan * [r(1+r)^n] / [(1+r)^n - 1]
5. Annual Debt Service = Monthly_Payment * 12
6. DSCR = Cash_Flow / Annual_Debt_Service
7. Cash Flow After Debt = Cash_Flow - Annual_Debt_Service
8. Payback Period = Cash_Down / Cash_Flow_After_Debt

Worked Example: Auto Repair Shop ($2.1M Asking)

Asking Price
$2,100,000
Revenue: $3,120,000
Cash Flow (SDE)
$763,000
CF Margin: 24.5%
Cash Down (10%)
$221,422
Total Acquisition: $2,214,221
Annual Debt Service
$322,224
Monthly: $26,852 @ 10.25%
DSCR
2.37
STRONG
CF After Debt
$440,776/yr
Payback: 6.0 months
DealForge Score: ~82/100

This auto repair shop has a DSCR of 2.37 (STRONG), a CF multiple at industry average (2.75x vs 2.70x), and an above-average CF margin (24.5% vs ~20%). The fast equity payback (6 months) makes this a strong acquisition target under SBA financing.

🧠

Most business listings are incomplete. An estimated 60–70% are missing cash flow, SDE, or other key financial data. DealForge's 4-tier enrichment framework is the primary differentiator — it fills gaps automatically using a layered intelligence approach.

The 4-Tier Classification

TierNameDescriptionFrequency
Tier 1Complete DataAsking price, revenue, SDE, industry, location all present. Validate and score.~25–30%
Tier 2Inferable DataKey fields missing but estimable from available signals. This is where DealForge creates the most value.~50–60%
Tier 3Needs OutreachToo little data to estimate reliably. Flag for broker/seller outreach.~10–15%
Tier 4Suspicious DataNumbers present but don't pass the smell test. Enrich + flag discrepancy.~5–10%

Enrichment Chain Paths

Path A: Employee Count Known (Most Common)

1. Source: LinkedIn, Data Axle, Google Business Profile, listing text
2. Revenue = Employees x Revenue_Per_Employee[NAICS]
3. SDE = Revenue x SDE_Margin%[NAICS]
4. Apply BEA regional price parity adjustment

Path B: Physical Units Known

IndustryUnitRevenue Per Unit
Auto repairPer bay$100k–$200k/yr
Self-storagePer unit$40–$80/unit/yr
Assisted livingPer bed$40k–$80k/yr
Car washesPer bay$150k–$400k/yr
Day carePer slot$8k–$15k/yr
LaundromatsPer machine$10k–$18k/yr
RestaurantsPer seat$15k–$25k/yr
Gas stationsPer gallon$0.15–$0.25/gal

Path C: Square Footage Known

Revenue/SqFt benchmarks: Grocery ($400–$600), QSR ($400–$700), Full service restaurant ($200–$400), Retail ($200–$400), Medical office ($300–$500), Gym/fitness ($30–$50/yr), Self-storage ($8–$15/yr).

Path D: Only Industry + Location Known (Lowest Confidence)

1. Query Census CBP at county level for NAICS code
2. Get average payroll per establishment
3. Revenue = Avg_Payroll / Payroll_to_Revenue_Ratio[NAICS]
4. SDE = Revenue x SDE_Margin%[NAICS]
5. Value = SDE x SDE_Multiple[NAICS]
6. Present as WIDE range with LOW confidence

Worked Example: HVAC Business, Grapevine TX

Input: "HVAC business for sale in Grapevine, TX. 18 employees. Asking $1.4M."

Classify: Tier 2 (revenue/SDE missing, employee count available). Path A selected. HVAC (NAICS 238220): Rev/employee $175k, SDE margin 20%, SDE multiple 2.80x. Estimated Revenue = 18 x $175k = $3.15M. SDE = $3.15M x 0.20 = $630k. Geo-adjusted (RPP 0.97): SDE $611k. Range: $489k–$733k. DSCR ~3.4 (EXCELLENT). DealForge Score: ~78/100. Confidence: MEDIUM.

🔍

Every deal analysis is only as good as its inputs. The Data Confidence score makes uncertainty explicit. Rule: The DealForge Score and Data Confidence are ALWAYS shown together.

Confidence Levels

LevelRangeSourceDisplayScore Weight
HIGH85–100%All key fields from listing/CIM, validated against benchmarks"Reported by seller"1.0
MEDIUM60–84%Estimated from 2+ independent signals"Estimated from industry data"0.7
LOW30–59%Estimated from 1 signal only"Rough estimate — verify"0.3
INSUFFICIENT<30%Almost nothing to work with"Insufficient data — outreach recommended"0.0

Confidence Point System

Data Point AvailablePoints
Cash flow / SDE stated+25
Revenue stated+20
Asking price stated+15
Employee count known+10
Physical units known (bays, beds, etc.)+10
Square footage known+5
Industry/NAICS identified+5
ZIP code identified+5
Lease terms stated+3
Years in business stated+2

Total possible: 100 points. Confidence level = sum of available points / 100.

Validation Adjustments

Confidence can be reduced if data fails validation: impossible margin (-10), multiple far outside industry range (-10), Census contradicts stated employees (-15), conflicting signals (-10 each). Confidence can be increased if multiple estimation paths agree within 15% (+10) or Census data supports stated numbers (+10).

Impact on Investor Experience

Every SMS alert shows confidence. Dashboard deals are sortable by confidence. Marginal matches with LOW confidence are deprioritized in digests. Future: investors can set confidence thresholds in their Interest Window.

💎

DealForge uses three valuation approaches depending on asset type and data availability: Income Approach (SDE multiples), Per-Unit/Physical Attribute Approach, and Market Comparison.

1. Income Approach (SDE Multiple Method)

The dominant method for small businesses under $5M. Business Value = SDE x SDE_Multiple[industry]

Multiple RangeTypical Industries
1.5–2.0xRoutes, nail salons, flower shops, cell phone repair
2.0–2.5xRestaurants, coffee shops, hair salons, dry cleaners, pet grooming
2.5–3.0xAuto repair, HVAC, plumbing, landscaping, cleaning, bars
3.0–3.5xDental, home health, trucking, day care, websites/ecommerce
3.5–4.5xLaundromats, car washes, gas stations, machine shops, hotels
4.5–5.0xStorage facilities, dog daycare, rubber/plastic manufacturing
5.0x+Marinas (6.6x), industrial machinery, medical billing

Overall average across 9,500+ transactions: 2.57x SDE / 0.67x Revenue

2. Per-Unit / Physical Attribute Approach

IndustryUnitValue Per UnitRevenue Per Unit
Auto repairBay$50k–$150k$100k–$200k/yr
Self-storageNet rentable sqft$40–$100+$8–$15/sqft/yr
Assisted livingLicensed bed$30k–$80k+$40k–$80k/yr
Car washesExpress tunnel$1M–$3M+$150k–$400k/yr
Day careLicensed slot$3k–$8k$8k–$15k/yr
DentalAnnual collections60–80%N/A
Gas stationsGallon throughput$0.15–$0.25/galN/A

SDE Margin Benchmarks

IndustrySDE Margin
Insurance agencies40–55%
Accounting/tax35–50%
Software/SaaS30–50%
Dental offices30–40%
Laundromats30–40%
Cleaning businesses25–35%
Auto repair / HVAC / Plumbing15–25%
Coffee shops15–22%
Restaurants10–18%
Retail8–18%
🎯

The Interest Window is an investor's personalized definition of "a deal worth my time." It's a multi-dimensional preference profile where each criterion can be marked strict (must match exactly) or flexible (prefer this, but consider near-misses).

Profile Structure

Each window contains: industries (include/exclude), geography (radius or state), investment size (min/max), min DSCR, max CF multiple, min CF after debt, max payback months, min DealForge score. Each field has a flexibility toggle: strict or flexible.

How Matching Works

Step 1: Check Strict Criteria
All strict criteria must pass exactly (binary yes/no). If any strict criterion fails, the deal is NO MATCH — skip entirely.
Step 2: Score Flexible Criteria
Each flexible criterion gets a 0.0–1.0 proximity score: how close is the deal to the preferred value? Weighted average = flex_score.
Step 3: Classify Match
HOT MATCH (flex ≥ 0.70) → Immediate SMS. MARGINAL (flex ≥ 0.40) → Digest only. NEAR MISS (flex ≥ 0.00) → Count only. NO MATCH → Silent log.

Notification Types

Match TypeDeliveryDescription
HOT MATCHImmediate SMSAll strict pass + flex score ≥ 0.70. Very likely worth pursuing.
MARGINALPeriodic DigestAll strict pass + flex 0.40–0.69. Close but not perfect.
NEAR MISSCount OnlyClose on strict or low flex. "3 near-misses within 15% of criteria"
NO MATCHSilent LogDoesn't meet strict criteria. Analytics only.
No-Match Pulse — Critical for Retention

If the system is scanning but finding nothing, the investor needs to know it's working. Configurable interval (daily/weekly): "847 deals reviewed this week. 0 in your interest window. 3 near-misses. Your agent is actively scanning."

🔄

Deals move through a defined set of states from discovery to resolution. Lifecycle events like price drops and stale listings trigger re-scoring and investor notifications.

State Diagram

NEW  →  ACTIVE  →  PRICE_CHANGED  →  STALE  →  REMOVED
              ↑         │
              └─────────┘ (cycles back to ACTIVE after price change)
StateDescriptionAction
NEWJust discovered by ingestion agentParse, classify, enrich, calculate, score, match
ACTIVEFully analyzed and monitoredRe-check periodically for changes
PRICE_CHANGEDPrice dropped (high-value event)Re-score immediately, re-match all windows, notify
STALEListed >90 days with no changesFlag as negotiation leverage, suggest lower offer
REMOVEDNo longer on source platformLog outcome, notify watchers, capture sale price if sold

Deduplication Strategy

Same business can appear on multiple platforms (BizBuySell + BizQuest + BusinessesForSale). Match on: exact business name + ZIP, industry + asking price + ZIP (fuzzy), or broker name + description similarity (LLM-assisted). On match: merge into single record, keep best data from each source, note "Found on 3 platforms" as a signal of serious listing.

Market Intelligence Over Time

Lifecycle data becomes market intelligence: average days-to-sale by industry, price reduction frequency, which industries have the most stale inventory, and seasonal patterns in listing volume. This feeds back into scoring and enrichment benchmarks.

The end-to-end flow from deal discovery to investor SMS notification. Event-driven architecture triggered by platform email alerts, sitemap changes, and user uploads. Target: under 1 minute from trigger to SMS.

Full Pipeline Flow

TRIGGER (Event-Driven)
Platform email alert arrives (BizBuySell, LoopNet, Crexi), sitemap diff detects new URL, broker submits listing, user pastes URL or uploads CIM, or scheduled sweep finds new listings.
PARSE (LLM Agent)
Extract structured data from unstructured listing: asking price, revenue, SDE, industry/NAICS, location/ZIP, employees, sq ft, units, lease terms, business description, broker contact.
CLASSIFY (Deterministic)
Assign data completeness tier: Tier 1 (complete) straight to calculator, Tier 2 (inferable) to enrichment, Tier 3 (insufficient) queued for outreach, Tier 4 (suspicious) enrich + flag.
ENRICH (Census + Benchmarks)
Fill gaps using lookup tables + Census CBP/ZBP: revenue estimation, SDE estimation, geographic adjustment (BEA RPP), attach data confidence score.
CALCULATE (Deterministic Python Engine)
Full financial analysis: SBA amortization, DSCR, payback, multiples, margin comparison. CRE: cap rate, NOI yield. Sensitivity: 3x3 matrix (rate x CF scenarios).
SCORE + NARRATIVE (Weighted Formula + LLM)
Compute DealForge Score (0–100) from 7 sub-scores. Generate 2–3 sentence AI analysis: strengths, risks, notable factors.
DEDUP & LIFECYCLE
Check against known listings: new → store, price change → re-score, already seen → skip, removed → notify, cross-platform duplicate → merge.
MATCH + NOTIFY
Match against all active investor windows. HOT MATCH → Immediate SMS via Twilio. MARGINAL → Queue for digest. NEAR MISS → Increment count. NO MATCH → Silent log.

Processing Time Targets

StageTargetNotes
Trigger → Parse< 30 secEmail parsing is fast; URL fetch adds latency
Parse → Classify< 2 secSimple rules engine
Classify → Enrich< 5 secLookup tables local; Census cached
Enrich → Calculate< 1 secPure math, no external calls
Calculate → Score< 10 secLLM narrative generation is bottleneck
Score → Match< 1 secIn-memory matching
Match → SMS< 5 secTwilio API call
Total: Trigger → SMS< 1 minuteGoal for v1
📋

The master lookup table powering DealForge's enrichment engine, scoring, and valuation. Source: BizBuySell transaction data (9,500+ deals, Q1 2021 – Q4 2025). Median sale price: $337,750 (rising to $375,000 in Q4 2025).

Service Businesses

IndustrySDE MultipleRev MultipleSDE MarginRev/Employee# Deals
Medical Billing4.41x1.54x~35%$120–180k7
Funeral Homes4.36x1.63x~35%$100–150k10
Laundromats4.12x1.45x30–40%N/A169
Waste Mgmt & Recycling3.20x0.94x~25%$100–150k43
Property Management2.72x0.93x~30%$80–120k59
Landscaping & Yard2.56x0.76x15–25%$80–120k211
Cleaning Businesses2.30x0.78x25–35%$50–80k154
Dry Cleaners2.20x0.77x20–30%$60–100k141

Building & Construction

IndustrySDE MultipleRev MultipleSDE MarginRev/Employee# Deals
Building Materials3.40x0.64x~18%$150–250k29
Concrete3.04x0.72x~22%$150–250k21
Heavy Construction2.98x0.70x~22%$150–300k65
Electrical & Mechanical2.94x0.59x~20%$130–200k57
HVAC2.80x0.62x15–25%$150–200k123
Plumbing2.62x0.72x15–25%$140–200k61

Food & Restaurants

IndustrySDE MultipleRev MultipleSDE MarginRev/Employee# Deals
Bars & Taverns2.86x0.53x~18%$50–80k217
Bakeries2.68x0.54x~20%$50–80k93
Coffee Shops2.28x0.45x15–22%$40–60k253
Restaurants2.26x0.37x10–18%$50–80k1,774
Juice Bars2.09x0.46x~20%$40–60k40

Healthcare & Fitness

IndustrySDE MultipleRev MultipleSDE MarginRev/Employee# Deals
Dental Practices3.28x0.87x30–40%$150–250k22
Assisted Living3.18x1.21x~30%$40–80k/bed27
Home Health Care2.84x0.60x~20%$60–100k92
Medical Practices2.58x0.70x25–35%$100–200k139
Gyms & Fitness2.44x0.64x~25%$40–60k79

Automotive & Marine

IndustrySDE MultipleRev MultipleSDE Margin# Deals
Car Washes4.73x1.81x~35%32
Gas Stations3.70x0.63x~15%113
Equipment Rental3.55x0.90x~25%31
Auto Repair & Service2.70x0.59x15–25%247

Retail, Manufacturing, Technology & Other

IndustrySDE MultipleRev MultipleSDE Margin# Deals
Marinas & Fishing6.60x1.53x~23%9
Rubber & Plastic Mfg5.11x1.19x~22%8
Storage Facilities4.60x1.15x~25%8
Dog Daycare & Boarding4.40x1.15x~25%24
Nursery & Garden4.15x0.84x~20%9
Hotels4.02x1.53x~35%8
Machine Shops3.72x0.93x~24%46
Software & Apps3.41x1.82x30–50%49
Financial Services3.41x1.75x~50%20
Liquor Stores3.41x0.52x~15%189
Day Care3.40x0.81x15–25%78
Websites & Ecommerce3.33x1.04x~30%339
Grocery Stores3.38x0.43x~13%89
Insurance Agencies2.68x1.53x40–55%50
Convenience Stores2.82x0.41x~14%100
Hair Salons & Barber2.18x0.59x~25%181
Routes1.51x0.63x~40%496
🏭

Auto Repair & Service (NAICS 811111)

2.70x SDE Multiple
247 transactions in dataset
SDE Margin: 15–25%
Essential Service
Per-Bay Valuation
$50k–$150k per bay (value)
$100k–$200k per bay (revenue)
Key Secondary Metric

Strengths: Essential service, recession-resistant, skilled labor barriers, recurring maintenance revenue. Risks: Key-man risk if owner is primary tech, labor shortages, gradual EV transition, environmental compliance. What makes a good acquisition: 4+ bays, diverse customer base, established employees who stay, long-term lease, modern diagnostics.

HVAC (NAICS 238220)

2.80x SDE Multiple
123 transactions in dataset
SDE Margin: 15–25%
PE Roll-Up Active
Value Drivers
Recurring service agreements are king
500+ contracts = 3.0–4.0x+ SDE
High Buyer Demand

Key insight: HVAC is one of the most active PE roll-up sectors. Private equity firms (Wrench Group, Service Experts, Apex Service Partners) acquire platform companies then bolt on smaller shops, creating upward pressure on multiples. HVAC deals get snapped up fast — speed of notification matters more here than most industries.

Restaurants (NAICS 722511)

2.26x SDE Multiple
1,774 deals — largest dataset by far
SDE Margin: 10–18% (thin)
High Failure Rate
What Adds Value
Franchise brands: 3.0–4.0x SDE
Liquor license, favorable lease
Volume Category

The single largest volume of small business transactions. Many investors explicitly exclude restaurants from their interest window. With 10–18% SDE margins, many fail the DSCR test. DealForge flags: "Restaurant deals under $200k SDE often struggle to meet SBA DSCR requirements."

Self-Storage (NAICS 531130)

4.60x SDE Multiple
Cap Rate: 5.5–8.0%
SDE Margin: 25–35%
Semi-Passive
Dual Valuation
Business: SDE x 4.60
CRE: NOI / Cap_Rate
Hybrid Asset

Self-storage is a hybrid — both operating business and real estate asset. DealForge runs BOTH calculation methods. Minimal labor (1–2 part-time), recession-resistant, scalable, strong NOI margins (60–70%). Watch for: occupancy rate (85%+ healthy), market saturation from new construction, climate control premium (25–50%).

💰

Tier Structure

Free Tier
$0/mo
1 deal alert per quarter. Lead magnet.
Tier 1 — Pro Lite
$15/mo
Up to 3 HOT SMS alerts/month. 1 interest window.
Tier 2 — Unlimited
$100/mo
Unlimited alerts + windows + advanced tools.

Tier Comparison

FeatureFreeTier 1 ($15)Tier 2 ($100)
HOT MATCH SMS alerts1/quarter3/monthUnlimited
Interest windows11Unlimited
Dashboard accessLimitedFullFull
Periodic digestNoYesYes
Sensitivity analysisNoNoYes
CIM upload + deep analysisNoNoPhase 2
Priority processingNoNoYes

Competitive Positioning

CompetitorPriceWhat They Offer
BizBuySell saved searchFreeBasic email alerts, no analysis
Kumo$149–200/moAI matching, deal flow aggregation
Searcher OS$49–244/moDeal sourcing + CRM
DealStream$49/mo+Deal marketplace + matching
DealForge Tier 1$15/moFull analysis + enrichment + SMS
DealForge Tier 2$100/moEverything + multi-window + deep tools
3–10x Price Advantage

DealForge is 3–10x cheaper than the closest competitor for comparable functionality. The $15/mo entry point drives adoption; the $100/mo tier captures serious acquirers. Blended ARPU: $27/mo with 80/15/5 tier mix.

🏆

Beta Founder Program (First 5–10 Users)

12 months completely free service at chosen tier level. Hand-selected, motivated acquirers who provide weekly feedback and a referenceable review. Purpose: rapid product iteration + first social proof assets. Cost: $1,620–$3,240 total (negligible).

Universal Success Incentive (All Users, All Tiers)

TriggerRewardCondition
User reports a closed deal sourced via DealForge12 months free at current tierMust provide referenceable review
Free-tier user closesUpgraded to Tier 1 for 12 months ($180 value)Written testimonial or LinkedIn post
Tier 1 user closes12 months free ($180 value)Short video or case study permission
Tier 2 user closes12 months free ($1,200 value)Referenceable review

Why This Works

  • Outcome alignment — we only "pay" when users succeed
  • Social proof generation — testimonials from actual closed deals are 10x more powerful than generic reviews
  • Retention boost — successful closers have near-zero churn during free year
  • CAC reduction — high-quality testimonials lower paid acquisition costs 30–50%
  • Virality — "Closed a $1.4M deal and got a year free" is shareable
Financial Impact

10–14% of paid users close 1 deal/year. Each closure costs ~$324 in revenue. Net ARPU reduction: 8–18%. Offset by: retention boost (30–60% LTV increase per affected user), CAC drop to <$200 via organic testimonials, free-tier conversion rate improvement of 15–30%.

📈

Revenue Per User

Blended ARPU
$27.00/mo
$324/yr before incentives
Post-Incentive ARPU
~$23.50/mo
~$282/yr base case
Variable COGS
~$3.20/user
Blended monthly
Gross Margin
85–90%
88% before incentives

Gross Margin

Gross Margin (before incentives)88%
Gross Margin (after incentives)82–86%
At scale with caching and optimization, COGS drops further.

Variable COGS Breakdown (Per User/Month)

ComponentTier 1 (3 alerts)Tier 2 (~12 alerts)Free
Scraping (Apify share)$1.00$1.00$0.30
LLM parsing/enrichment$1.20$5.00$0.15
SMS (Twilio)$0.25$1.00$0.03
Census/enrichment$0.05$0.05$0.05
Total$2.50$7.05$0.53

LTV & Breakeven

ScenarioLTVCACLTV:CACPayback (months)
Pessimistic (6% churn)$387$3501.1x15
Base (4.5% churn)$516$2801.8x10
Optimistic (3% churn)$774$2203.5x5
Breakeven: ~735 Users

Monthly fixed costs: ~$15k–$20k. Contribution margin per user: $23.80. At 20% MoM growth from 200 starting users, breakeven is reached around Month 8 (~860 users). Key levers: churn reduction (every 1% = ~25% LTV increase), Tier 2 upsell, and add-on revenue.

🚀

Phase 0: Beta Launch (Weeks 1–8)

5–10 hand-picked users from direct network. Active acquirers, located in testable markets (Texas/DFW ideal). 1 year free service. Weekly feedback sessions. Goals: validate enrichment accuracy, tune scoring weights, test SMS timing, generate first testimonials.

Phase 1: Early Growth (Months 3–6)

ChannelEst. CACNotes
Reddit Communities$50–$100r/Entrepreneur (4M), r/smallbusiness (1.5M), r/searchfunds
LinkedIn (ETA Community)$100–$200Search fund operators, independent sponsors
Broker Partnerships$150–$250Highest quality leads, best retention
BizHub User Crossover$100–$150"BizHub on autopilot — it comes to YOU"
Paid Ads (backup only)$300–$500Google/LinkedIn. Scale only if organic plateaus.

Phase 2: The Flywheel (Months 6–12)

User Closes a Deal
Submits "Report Closed Deal" form with details.
Provides Referenceable Review
LinkedIn post, written testimonial, or video. Gets 1 year free.
Review Becomes Marketing Asset
"Closed $1.4M auto repair deal — DealForge found it in 48 hours"
Posted Across Channels
Reddit, LinkedIn, website. Drives new signups organically.
New Users Close Deals
Cycle repeats. Flywheel accelerates with each closed deal testimonial.

Growth Targets

MetricMonth 6Month 12
Total users5003,000+
Paying users4002,500+
MRR$10,800$67,500
Monthly churn< 5%< 4%
Blended CAC< $300< $250
30-Day Deal Guarantee

Launch promotion: "If DealForge doesn't surface at least one deal in your interest window within 30 days, get a full refund." Low risk (most markets have sufficient deal flow) but builds confidence for skeptical early adopters.

🔧

Core Components

LayerTechnologyWhy
Backend APIFastAPI (Python)Fast, async, great for ML/data pipelines
DatabasePostgreSQLReliable, JSON support, full-text search
Calculation EnginePure Python moduleNo LLM dependency — deterministic, testable, fast
LLMClaude API (Haiku + Sonnet)Haiku for parsing, Sonnet for narrative
Enrichment DataCensus bulk files (local)No API dependency in hot path
SMSTwilioIndustry standard, reliable
Email IngestionGmail API / IMAPParse platform email alerts
FrontendStreamlit (MVP) → Next.jsSpeed first, polish later
HostingRailway/Render → AWSEasy deploy, then scale
Task QueueCelery + RedisBackground processing for pipeline
Knowledge BaseMarkdown wiki (filesystem)Karpathy pattern — LLM-maintained

LLM Cost Optimization

TaskModelCost/CallVolume
Listing parsingClaude Haiku~$0.01Every listing
Enrichment reasoningClaude Haiku~$0.02Tier 2+ listings
Narrative generationClaude Sonnet~$0.05Every scored deal
CIM analysisClaude Sonnet~$0.50On-demand (Phase 2)

Monthly LLM cost estimate at 1,000 deals/day: ~$2,160/month (parsing $300 + enrichment $360 + narratives $1,500).

Infrastructure Costs (MVP)

Hosting (API + Worker)
$50–$100
Railway or Render
PostgreSQL
$25–$50
Managed database
Redis
$15–$30
Task queue cache
Twilio SMS
$50–$200
Usage-based
Claude API
$200–$500
MVP volume
Apify (Scraping)
$50–$200
Usage-based

Total MVP infrastructure: $410–$1,100/month

Karpathy Pattern

The wiki is the source of truth for knowledge. The database is for fast queries. Code reads from both. Raw data collected in raw/, compiled by LLM into structured .md articles in wiki/. The LLM maintains all content, indexes, and cross-links.

🗄
  • 🏛
    Census CBP / ZBP
    api.census.gov
    County and ZIP Code Business Patterns. Establishment counts, employment, payroll by NAICS. Foundation of the enrichment engine. Free but lagged 2–3 years. Use ZBP for hyperlocal density, CBP for actual payroll/employment at county level. Download bulk files (~100–200MB) for production.
  • 💰
    BizBuySell
    bizbuysell.com
    Largest US business-for-sale marketplace (owned by CoStar Group). Primary listing source. Email alert parsing is the primary ethical ingestion method. Published transaction data (9,500+ deals) powers industry benchmarks. TOS prohibits scraping — email parsing is defensible.
  • 🏠
    ATTOM Data
    attomdata.com
    Property ownership, tax, deed, mortgage data for 155M+ properties. Key for off-market signals: ownership duration, tax delinquency, out-of-state owner detection. API-based, pay per call.
  • 🤖
    Apify Platform
    apify.com
    Scraping-as-a-service with multiple BizBuySell actors. Configurable concurrency, retries, proxy settings. Output as JSON via API. Cost: ~$1–26/mo base + compute. Use carefully per TOS considerations.
  • 📊
    BLS & BEA
    bls.gov / bea.gov
    Employment wages, GDP by industry, regional price parities (RPP). BEA RPP is critical for geographic adjustments — SF/NYC multiply by 1.1–1.3, rural areas by 0.7–0.9. More current than Census (quarterly).
  • 📱
    Twilio
    twilio.com
    SMS delivery for HOT MATCH alerts. Cost: ~$0.0079/segment outbound. Standard compliance handles opt-in/opt-out. Retry logic: 3x with backoff, fall back to email.
  • 🌐
    FRED API (Federal Reserve)
    api.stlouisfed.org
    Daily prime rate (series DPRIME) for SBA interest rate calculations. Free, reliable. Current prime (~April 2026): ~7.50%, giving SBA rate ~10.25%.
NAICS Granularity Recommendation

4-digit NAICS (e.g., 8111 for Auto Repair) is the best balance of data quality and specificity. 2-digit is too broad. 6-digit is too sparse at small geographies. Use 3-digit as a useful backup.

🔔

15+ public-data signals indicating a business or property may be available for acquisition, even if not listed. Ranked by signal strength and scored using an Availability Likelihood framework.

High-Signal (Strong Motivation to Sell)

SignalData SourcePoints
Probate/estate filingCounty probate court records+30
Divorce filingCounty court records+30
Federal/state tax liensPublic lien filings+30
Pre-foreclosure (Lis Pendens)County recorder+30
Business dissolution filingSecretary of State+30
Bankruptcy filingPACER / PacerMonitor+30

Medium-Signal (Likely Approaching Exit)

SignalData SourcePoints
Aging owner (65+)Voter registration + SOS formation date+15
Lapsed business licenseMunicipal records+15
Declining employee countLinkedIn company page (over time)+15
Delinquent property taxesCounty assessor (ATTOM)+15
Expired real estate listingMLS history, LoopNet archives+15
Long ownership + low assessed valueCounty assessor (ATTOM)+15

Soft Signals (Worth Monitoring)

SignalData SourcePoints
Out-of-state property ownerCounty assessor mailing address+5
Stale Google Business ProfileGoogle Maps API+5
Declining Google reviewsGoogle Maps API+5
Website domain expiringWHOIS lookup+5
Reduced job postingsIndeed/LinkedIn+5
UCC filing expirationState UCC records+5

Availability Likelihood Scoring

ScoreClassificationAction
60+ pointsLIKELY AVAILABLEPrioritize outreach
30–59 pointsPOSSIBLEMonitor, outreach if investor interested
<30 pointsUNKNOWNPassive monitoring only

Implementation Priority

Phase 2 MVP (Start Here)
Secretary of State data (OpenCorporates) for formation date as age proxy. County assessor data (ATTOM) for ownership duration, tax delinquency, out-of-state owner. Google Business Profile for stale/unclaimed as disengagement signal.
Phase 3 (Expand)
Court records (probate, divorce, liens). LinkedIn company page monitoring for employee count trends. CMBS maturity tracking for CRE-specific signals.
Off-Market Is Phase 2

Off-market signal detection adds significant value but requires additional data source integrations. Start with on-market deal flow (Sections 04–09), prove the core scoring and alerting engine, then layer in off-market capabilities as a Tier 2 differentiator.