David Jansen van Vuuren

Principal Product ManagerData & ML Platforms

I build and scale machine learning products. 15+ years across e-commerce, adtech, and fintech — search, recommendations, customer data platforms. Currently founder at Irrational Signals.

§ 01Selected impact

Numbers do the work.

Across roles at Takealot Group, Vicinity Media, and now Irrational Signals.
  • +20%
    search customer satisfaction

    Multi-intent ranking framework. Takealot.

  • recommendation impression share

    Flexible context-aware API. Takealot.

  • Petabyte
    customer data platform

    Composable CDP, multi-brand, identity resolution. Takealot Group.

  • 100+/hr
    trading signals

    Irrational Signals. US equities · 3 sectors · 6 batches per market day.

  • 80% → 30%
    GeoIP-to-WiFi conversion target

    Micro Networks model. Vicinity Media (proof-of-concept).

  • 15+
    years

    E-commerce, adtech, fintech. MBA. Cape Town.

§ 02ML Product Framework

A project execution framework for machine learning.

Built at Takealot Group to make the gap between exploration and productionisation explicit. The diamond below is the core visual: ideation expands into multiple candidate paths, then contracts into a single productionised model. Click any stage for detail.

The month markers are illustrative. Actual cadence depends on stack maturity and dataset readiness — modern tooling (synthetic data generation, LLM-assisted data prep, foundation-model baselines) compresses several phases substantially.

EXPANSIONCONTRACTIONEXPLORATIONCOMMITUNSOLVEDCUSTOMERPAIN POINTIdeationRoadmapConsiderationRoadmapCommitmentDeliverable: Proof-of-conceptDeliverable: Customer-validated modelGeneratetraining dataCandidateModel 1CandidateModel nIterativeTrainingOffline POCEvaluationFinalise ModelArchitectureProductioniseStage ModelA/B Test ORInternal TestRetrainingModel MonitoringALLEVIATEDCUSTOMERPAIN POINTMonth 0Month 1Month 2Month 3Month 4Month 5Month 6months are illustrative — actual cadence varies with stack maturity + tooling
Learnings — what changed after running the framework

Running the 6-month framework in practice surfaced two missing phases. The longer 15-month variant adds these as inline additions to the same diamond — not new structure, just acknowledgement that real ML work sometimes needs a dataset detour and a thinking pause.

  1. Learning 01
    Inserted before Generate Training Data

    Dataset Exploration Phase

    Sometimes the bottleneck isn't the model — it's the dataset. Before committing to training, generate multiple candidate datasets in parallel and A/B test which one produces the most useful signal. This adds a ~3-month phase up front (Generate Dataset → Candidate Dataset 1 / n → Serve Dataset as A/B Test) before model exploration starts.

  2. Learning 02
    Inserted after A/B Test, before retraining

    Investigation Buffer

    The A/B test doesn't always yield a clear answer. Reserve dedicated time after a test to investigate the result — understand the customer behaviour underneath the metric — instead of forcing immediate iteration. This stops cascading dependencies when an A/B test result requires deeper analysis (e.g. the Category Intent rollout, which needed investigation before promotion).

Both additions extend the project shape — a dataset arc up front, a thinking pause at the end. In practice they can add real time to a programme, though modern tooling (synthetic data generation, LLM-assisted analysis) compresses both phases substantially.

Adapting to agentic workflows

The diamond is calibrated for classical supervised machine learning — train a model against labelled data, evaluate offline, A/B test, productionise. The shape of the work still holds for agentic workflows: same expansion / contraction logic, same need for explicit checkpoints. Several stages just compress or change form.

Classical ML stage
Agentic equivalent
  • Generate training data
    Prompt design · RAG corpus · synthetic data
  • Candidate Model n
    Candidate prompts · tool sets · orchestration patterns
  • Iterative training
    Prompt iteration · retrieval tuning · tool-spec refinement
  • Offline POC evaluation
    LLM-as-judge · behavioural tests · red-team probes
  • A/B test or internal test
    A/B with completion rate · human eval · latency budgets
  • Productionise + stage
    Guardrails · rate limits · fallback prompts
  • Drift monitoring
    + hallucination rate · tool-call accuracy · latency

The two learnings (the dataset detour and the investigation buffer) carry over directly. The investigation buffer matters more in agentic work, not less — behavioural failures rarely have a clean numeric answer.

Companion artefacts
§ 03Experience

Career

MBA, University of Stellenbosch Business School (2013). BCom Management Sciences, University of Stellenbosch (2007).
  1. 2025 — present

    Founder & Principal Product Manager

    Irrational Signals

    API-first ML-driven trading signals platform. Live subscription product with automated billing. Python SDK + REST API.

    FintechMLAPI products
  2. 2024 — 2025

    Group Product Lead — Customer Data & ML Platforms

    Takealot Group

    End-to-end product strategy for the Customer Data Platform, CRM, fraud, and insights across the multi-brand portfolio. Architected petabyte-scale data platform; implemented data governance frameworks; scalable analytics architecture.

    Customer Data PlatformData governanceMulti-brand
  3. 2022 — 2024

    Group Product Lead — AI/ML (Search & Recommendations)

    Takealot Group

    Machine-learning product strategy for search and recommendations across platforms. Multi-intent search framework (+20% customer satisfaction). Flexible recommendations API (3× impression share).

    SearchRecommendationsML platforms
  4. 2020 — 2022

    Product Manager — AI/ML (Search & Recommendations)

    Takealot.com

    Delivered the machine-learning product roadmap for core search and recommendation performance. Defined and tracked North Star metrics. Productionised models with data scientists and ML engineers.

    SearchRecommendations
  5. 2017 — 2020

    Product Lead — AdTech Platform

    Vicinity Media

    Owned the advertising-tech platform strategy: audience data, campaign systems, attribution. Built the data science function. Out-of-home (OOH) attribution linking offline ad exposure to in-store behaviour. WiFi-clustering and point-of-interest visit models.

    AdtechAttributionAudience
  6. 2008 — 2015

    Consulting Manager

    Cape Value Group

    Data analysis and market modeling across financial services and public sector. Business strategy, forecasting, and operational planning.

    ConsultingModeling
§ 04Currently building

Irrational Signals

Intraday trading signals for US equities, delivered via API. Statistical models scan hundreds of stocks every market hour and surface the ones with a measurable edge — direction, historical win rate, entry / exit targets. Free tier available; Pro and Pro Max subscriptions billed automatically.
Surface
REST API + Python SDK
GET /v1/signals
Coverage
~100–130 signals / hr
Tech · Consumer Cyclical · Comm Services
Schedule
6× per trading day
10:47 AM – 3:47 PM ET
Tiers
Free · Pro · Pro Max
$0 · $99 · $249 / mo
Example signal
MRVLBUY
78%historical win rate
Entry
$87.45
Exit
$88.52
Sector
Tech / Semis
Expected return
+1.23%

Every signal includes direction, win rate, entry / exit targets, and sector. Pro Max adds a horizon-end timestamp and live preflight checks (price drift, intraday range, relative volume).

§ 05Contact

Get in touch