◑

FraudLens

INVESTIGATION COCKPIT

Operate

◧Command Center ⊞Investigation5

Observe

◈Phoenix · Traces

Improve

◇Self-Improvement

Assure

⊟Governance ⊠Policy & Versions

S. Pillai

Fraud Analyst · L2 · demo

Tracing18,42,930

EvalsLive

Phoenix412/min

ADK AgentDegraded eval

Policyv2.4.1

1,284 livepolicy v2.4.1--:--:-- IST

◈

Arize Phoenix

project: fraudlens-prod Demo · simulated stream

Every agent decision is traced with OpenInference, scored by LLM-as-judge, and queryable by the agent itself.

18,42,930

spans ingested DEMO

Traces today

84,219

Spans / min

412

Eval pass-rate

55.6%

Open experiments

Live trace stream

OpenInferenceDEMO

Trace	Operation	Lat	Verdict	G · P · R
TR-9F3A3F	investigate · UPI	2.1s	allow	939088
TR-9F3A3E	investigate · IMPS	2.6s	stepup	908683
TR-9F3A3D	investigate · UPI	1.9s	allow	949290
TR-9F3A3C	investigate · UPI	3.1s	block	868174
TR-9F3A3B	re-score · UPI	2.0s	allow	929189
TR-9F3A3A	investigate · NEFT	2.9s	hold	888578
TR-9F3A39	investigate · UPI	1.9s	allow	959391
TR-9F3A38	investigate · UPI	2.4s	stepup	918785
TR-9F3A37	investigate · UPI	1.9s	allow	949290

Showing newest 9 of 84,219 todaylatency p50 2.1s · p95 3.1s

Eval monitors

LLM-as-judgeDEMO

Groundedness91

target 90within SLO

Policy-fit86

target 90below SLO

Reason-code quality78

target 85below SLO

Phoenix self-queries

SpanQuery · annotationsDEMO

Simulated example of the agent reading its own failing traces from Phoenix.

14:32:04phoenix.query_spans(eval.reason_code < 0.85)→ 34 spans

14:31:58phoenix.get_evaluations(cluster: reason-code)→ G .91 P .86 R .74

14:31:49phoenix.get_trace(TR-9F3A21)→ 10 spans · 2.84s

14:31:40phoenix.list_experiments(dataset: failing_rc)→ 3 runs

14:31:32phoenix.diff_versions(v2.4.0 → v2.4.1)→ 1 clause Δ

Improvement experiments

baseline vs proposed · backs every version bumpDEMO

Experiment	Source	Cases	Groundedness	Policy-fit	Reason-code	Status
EXP-2210	failing_reasoncodes	34	+10 pts	+11 pts	+18 pts	pending
EXP-2188	scam_cluster_block	120	+2 pts	+5 pts	+1 pts	shipped
EXP-2090	verdict_rationale	88	+4 pts	+1 pts	+3 pts	shipped
EXP-1977	min_score_sweep	64	+1 pts	−2 pts	−3 pts	rolled-back

◈