Replacing manual workflows and static CSV processes with a rule-based system using live API data — and learning why configuration is a system, not a form.
The product needed to move beyond campaigns that ops teams stitched together by hand — one CSV, one call list, one trigger at a time. The goal was a system that could run itself.
Every campaign previously ran because a person made a series of judgment calls in sequence. Automating the system meant encoding those decisions into configuration — and making sure they composed correctly.
"Small configuration errors can scale into thousands of incorrect calls."
Unlike a single misclick in a manual workflow, a wrong rule in an automated system doesn't fail once — it fails at every execution interval until someone catches it. The design problem was as much about error prevention as it was about capability.
Moving from a static CSV to a live API feed fundamentally changed the nature of the problem. A CSV is a snapshot. An API is a stream. The system had to be designed around data that could change between any two executions.
Static configuration patterns — where you set parameters once and the system runs — don't hold up when the underlying dataset is fluid. Every configuration decision needed to be evaluated against a moving target.
The first design broke the campaign setup into a linear wizard — step 1 through step N. The logic felt clean: each decision had its own screen, and users moved forward once each was complete.
The step-based wizard produced configurations that looked correct in isolation but broke in practice. The most revealing example was a simple scheduling conflict nobody caught until it shipped.
The date range and day-of-week restrictions were configured in different steps — and nothing surfaced the conflict between them. Users finished the wizard confident their campaign was ready.
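This class of conflict is mechanical once both decisions are visible together: the executable schedule is just the intersection of the date range and the day-of-week restriction, and that intersection can be empty. A minimal sketch (the helper and the specific dates are illustrative, not the product's code):

```python
from datetime import date, timedelta

def executable_days(start: date, end: date, allowed_weekdays: set[int]) -> list[date]:
    """Return the dates in [start, end] that fall on an allowed weekday
    (Monday = 0 ... Sunday = 6)."""
    days = []
    current = start
    while current <= end:
        if current.weekday() in allowed_weekdays:
            days.append(current)
        current += timedelta(days=1)
    return days

# The conflict the wizard never caught: a date range whose days are all
# excluded by the weekday restriction. June 1-2, 2024 is a weekend, so a
# weekdays-only campaign over that range has zero executable days.
window = executable_days(date(2024, 6, 1), date(2024, 6, 2), {0, 1, 2, 3, 4})
```

A check like this only becomes possible when both settings reach the validator at the same time, which is exactly what the step-per-screen wizard prevented.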
"The problem wasn't the UI. It was broken decision continuity — each step assumed it was independent, but the decisions weren't."
The wizard model was the wrong mental model from the start. Breaking configuration into steps implied the decisions were sequential and independent. They weren't — every setting was downstream of another.
Scheduling, concurrency, and retry logic form a single operating contract — each parameter constrains the others.
System behavior isn't a property of any single setting. It emerges from combinations — and those combinations need to be visible together.
Related decisions need to occupy the same visual space, not separate steps. Conflict detection requires co-presence.
"You can't prevent bad configurations if the user can only see one decision at a time."
Linear flow, fragmented inputs, no cross-step visibility. Built fast, broke fast. Exposed the fundamental flaw in treating configuration as a sequence.
Moved to a single-page layout with grouped panels. Better than a wizard, but collapsed sections still hid interdependencies. Users could configure retry logic without seeing the termination conditions it interacted with.
All configuration on one surface with a live system summary panel on the right. First time conflicts could surface in real time. Introduced the idea that the summary was as important as the form — it was the system talking back.
Termination as a concept was too final and hard to reason about. Replaced with pause + resume logic — a lighter mental model that matched how operators actually thought about campaign control. Reduced misconfiguration of exit conditions significantly.
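The lighter mental model can be captured as a tiny state machine in which no transition is final. The states and transition table below are an illustrative sketch, not the shipped implementation:

```python
from enum import Enum, auto

class CampaignState(Enum):
    DRAFT = auto()
    RUNNING = auto()
    PAUSED = auto()

# Under pause/resume there is no terminal state: every exit path from
# RUNNING is reversible, so a wrong "exit condition" costs a pause, not
# a dead campaign.
TRANSITIONS = {
    CampaignState.DRAFT: {CampaignState.RUNNING},
    CampaignState.RUNNING: {CampaignState.PAUSED},
    CampaignState.PAUSED: {CampaignState.RUNNING},
}

def can_transition(src: CampaignState, dst: CampaignState) -> bool:
    return dst in TRANSITIONS.get(src, set())
```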
All interdependent settings visible simultaneously. Conflict warnings surfaced inline. Schedule preview showed real execution windows rather than abstract dates. Input complexity reduced by removing options that added configuration burden without offering meaningful control.
The final design consolidated all settings into a single, structured surface. Related decisions sit in proximity. The system previews its own behavior — execution windows, call distribution, conflict warnings — so operators can validate before launch.
Scope decisions were as consequential as design decisions. We deliberately left three areas out of v1 to maintain clarity and build on a stable foundation.
The system doesn't predict call success rates or suggest configuration adjustments based on historical outcomes. We chose to capture data before building prediction on top of it.
Configuration remains entirely manual. The system executes rules as set — it doesn't adjust timing, concurrency, or retry logic based on performance signals. Operator control was prioritized over automation.
Campaigns are independent units. A patient who qualifies for two active campaigns can be targeted by both — deduplication across campaigns was out of scope and would have required significantly deeper data architecture work.
Designing the system didn't eliminate fragility — it relocated it. The remaining risks are structural, not cosmetic.
A valid configuration can produce unexpected system behavior when multiple rule conditions interact under dynamic data conditions. No amount of upfront validation fully eliminates this.
High concurrency settings on a large API dataset can trigger thousands of calls before an operator notices. Cost controls exist but are not automatically enforced at the configuration stage.
Some configuration errors only manifest on specific execution cycles. A conflict between retry intervals and scheduling windows might only surface on day three of a campaign.
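This kind of delayed failure is easy to reproduce in a few lines. A hypothetical campaign with a 48-hour retry interval and a weekdays-only window behaves correctly for early-week cohorts and only breaks once a Thursday cohort exists:

```python
from datetime import datetime, timedelta

ALLOWED_WEEKDAYS = {0, 1, 2, 3, 4}   # Monday through Friday
RETRY_INTERVAL = timedelta(hours=48)

def retry_lands_in_window(first_attempt: datetime) -> bool:
    """Does the scheduled retry fall on an allowed day?"""
    return (first_attempt + RETRY_INTERVAL).weekday() in ALLOWED_WEEKDAYS

# First attempt Monday June 3, 2024: retry lands Wednesday -> True.
retry_lands_in_window(datetime(2024, 6, 3, 10, 0))
# First attempt Thursday June 6, 2024: retry lands Saturday -> False,
# but only once the campaign has run long enough to attempt a Thursday
# cohort does the conflict become visible.
retry_lands_in_window(datetime(2024, 6, 6, 10, 0))
```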
Rule-based targeting against a live patient feed means the system's effective patient pool changes continuously. Operators don't always have visibility into how eligibility shifts mid-campaign.
Campaigns run continuously without manual intervention per cycle.
Unified surface and reduced inputs cut time-to-launch for new campaigns.
System handles thousands of concurrent calls across multiple active campaigns without per-call manual oversight.
The biggest gap in the current system is that operators can't see what their configuration will actually do before it runs. A dry-run mode that simulates execution against a sample of the current patient pool would catch most scheduling and targeting conflicts that currently only surface post-launch.
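A first version of dry-run could be as simple as replaying the campaign's eligibility rule over a sample of the current pool for each scheduled day, without placing any calls. A sketch, with hypothetical names and signature:

```python
import random

def dry_run(patients, is_eligible, schedule_days, sample_size=100, seed=0):
    """Simulate targeting against a sample of the current pool.

    patients:      current snapshot of the live feed
    is_eligible:   the campaign's rule predicate, (patient, day) -> bool
    schedule_days: the configured execution days
    Returns a per-day count of sampled patients the rules would reach.
    """
    rng = random.Random(seed)
    sample = rng.sample(patients, min(sample_size, len(patients)))
    return {day: sum(1 for p in sample if is_eligible(p, day))
            for day in schedule_days}
```

A day that comes back with a zero count is exactly the date-range/weekday class of conflict surfacing before launch instead of after it.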
As execution data accumulates, the system should be able to surface patterns — optimal call windows per patient segment, effective retry intervals, concurrency settings that reduce drop-off. Not prescriptive automation, but informed recommendations operators can choose to apply.
The current design surfaces real-time patient counts at configuration time, but gives no indication of how that pool is likely to change. Even a basic projection — "estimated eligible patients in 7 days based on recent trends" — would help operators configure more confidently against a moving target.