Skip to content
View in the app

A better way to browse. Learn more.

Benchmark Six Sigma Forum

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.

vikramb

Members
  • Joined

  • Last visited

Everything posted by vikramb

  1. I would challenge Bex. As an AI solution architect, my position is: a non-explainable AI should not be allowed to make or finalize insurance claim approval or rejection decisions on its own. It can still be used, but only as a decision-support or triage system under meaningful human oversight. In this case, “use the AI despite limited explainability” is the wrong architecture choice. The core reason is simple: in insurance, a claim decision is not just an efficiency event. It is a consequential decision that affects a customer’s money, rights, and trust. Current governance frameworks increasingly expect transparency, accountability, human oversight, logging, and the ability to contest adverse outcomes. The OECD AI Principles explicitly call for transparency and explainability appropriate to context, and accountability for proper functioning. The EU AI Act’s high-risk framework requires human oversight, record-keeping, and instructions that enable compliant use, while Colorado’s AI law requires opportunities to correct data and appeal adverse consequential decisions through human review where technically feasible. So the issue is not whether the model is fast. The issue is whether the business can defend a denial. If customer support cannot explain why a claim was rejected, the company has four serious problems. First, trust breaks. A customer will tolerate a slow answer more than an answer that feels arbitrary. In insurance, denials and unsatisfactory settlements are already common reasons for complaints, and claim denials commonly trigger formal appeals. If the company cannot state a clear reason, complaint volume, escalation cost, and reputational harm rise. Second, governance breaks. A black-box denial cannot be effectively audited, challenged, improved, or root-caused. If errors cluster around a subgroup, document type, hospital, repair estimate pattern, or claims adjuster notes style, the firm may not discover it quickly enough. That is exactly why responsible AI guidance emphasizes transparency, explainability, and due diligence rather than pure performance metrics. Third, human oversight becomes fake. If a human reviewer sees only a score or outcome but not a meaningful rationale, that reviewer is not truly supervising the system. They are rubber-stamping it. The EU high-risk framework specifically expects systems to be designed so deployers can implement human oversight effectively Fourth, regulatory exposure increases. In domains involving consequential decisions, regulators are moving toward rights to notice, explanation, correction, appeal, and monitoring for harmful outcomes. Even where no single rule says “every model must be fully interpretable,” the operating expectation is increasingly clear: if an automated decision materially affects a person, the organization must be able to justify and govern it. So Bex’s argument misses a crucial distinction: efficiency is not the same as acceptability. A system can be faster, cheaper, and more consistent, yet still be architecturally unfit for final decision authority. My recommended architect position would be: Do not deploy this AI as a fully autonomous claims approver/rejector. Deploy it only in one of these safer roles: Triage and prioritization Route straightforward claims faster, flag suspicious or complex claims for human review. Decision recommendation with reason codes The AI can suggest approve/reject, but a human can only act if the system also provides understandable factors, evidence references, and confidence. Human-in-the-loop for adverse outcomes Auto-approval may be acceptable for low-risk simple cases, but rejections, partial denials, or fraud flags should require human validation and a customer-readable explanation. Fallback to interpretable models where needed For high-stakes denial decisions, use models or explanation layers that can produce stable, defensible rationale. That gives you most of the efficiency benefit without accepting the full trust and compliance risk. So my final stance is: No — AI that cannot explain its decisions should not be used as the final authority for insurance claim approval or rejection. It may be used as a supporting system, but not as the decision-maker, because in this context explainability is not a luxury feature. It is part of operational accountability.
  2. I challenge Bex's position and strongly support View B: Wait until the organization is ready. Here's my case: The core flaw in View A is conflating "better process" with "better outcome." A 25% delay reduction on paper means nothing if the implementation itself causes a 40% productivity drop for three months, erodes trust in AI tools, and triggers turnover among experienced staff. The change isn't the hard part — the adoption is. Evidence and examples: 1. Nike's ERP disaster (2000) — Nike rushed implementation of a demand-planning system that was technically superior. The result? $100 million in lost sales, a 20% stock price drop, and years of recovery. The technology worked. The organization wasn't ready. 2. The UK's Universal Credit rollout — The system was designed to be more efficient than the legacy benefits system it replaced. Rapid rollout without adequate staff training and process readiness led to payment delays affecting millions, massive public backlash, and a program that took over a decade to stabilize. 3. Research supports this directly. McKinsey's studies on digital transformations consistently find that roughly 70% fail, and the primary drivers of failure are not technical — they're people and culture. Resistance, inadequate training, and lack of management buy-in account for more failures than bad technology ever does. Regarding Bex's Starbucks example: Starbucks succeeded precisely because it invested heavily in change management, store-level training, and phased rollouts — the opposite of "implement immediately despite unpreparedness." That example actually supports View B. My argument as a solution architect comes down to three principles: First, implementation risk is real cost. A solution architect doesn't just evaluate the target state — they evaluate the transition. A confused team making errors, reverting to old processes, or quietly working around the new system creates technical debt and organizational debt that compounds. Second, failed implementations poison future ones. If this AI recommendation is forced through and stumbles, every subsequent AI-driven suggestion will face even greater skepticism. You don't just lose this change — you lose the organization's willingness to trust AI at all. That's a far greater long-term cost than a few weeks of preparation. Third, "ready" doesn't mean "comfortable." View B isn't about waiting until everyone is enthusiastic. It means ensuring managers understand the rationale, teams have been trained on new workflows, there's a rollback plan, and success metrics are defined. This can happen in weeks, not months. The difference between "rushing" and "preparing" is often just 4–6 weeks of structured change management — a trivial delay against the long-term value of sustainable adoption. In short: The 25% improvement is only real if people actually execute the new process correctly and consistently. A solution architect's job is to deliver sustained outcomes, not theoretical gains. Preparing the organization isn't a delay — it's the implementation.
  3. My Position: View A — Prioritize Immediate Resolution I challenge Bex. Here's why. Bex's Toyota Example Actually Proves My Point Bex cites Toyota, but Toyota's system is fundamentally a resolve-first approach. When a worker pulls the Andon cord, the immediate goal is to stop the defect from propagating and restore the line. The root cause investigation happens after the line is flowing again. Toyota never lets cars sit half-built on the floor while engineers spend days studying why a bolt didn't seat properly. They fix, they restore, and then they learn. That's View A with a disciplined follow-up — not View B. Why View A Wins in Practice Healthcare — every emergency room on earth operates on View A. When a patient arrives in cardiac arrest, no doctor says "let's understand why this happened before we intervene." You stabilize the patient first. Diagnosis follows. The entire field of emergency medicine is built on the principle that resolution precedes understanding. Applying View B here would be fatal — literally. Financial services — the 2012 Knight Capital incident. A software deployment error caused the firm to lose $440 million in 45 minutes. The teams that responded focused entirely on stopping the bleeding — killing the rogue trading algorithm. If they had paused to investigate why the deployment failed before acting, the firm wouldn't have survived long enough to learn anything. Knight Capital still went under, but every minute of delay would have made it worse. Cloud infrastructure — every major provider follows View A. When AWS, Google Cloud, or Azure experience outages affecting millions of users, their incident commanders restore service first. The post-incident review comes hours or days later. No SRE team in the world would delay restoration to conduct a root cause analysis while customers are down. The Flaw in View B's Logic Bex argues that "reliance on quick fixes fosters a cycle of recurring issues." This is true — but only if teams stop at the fix. The problem isn't that teams resolve quickly. The problem is organizational discipline. Blaming View A for poor follow-through is like blaming a fire extinguisher for not preventing arson. The recurring-issue cycle breaks not by slowing down resolution, but by mandating that every resolution triggers a root cause investigation afterward. AI makes this even easier — it can auto-generate incident analyses, flag repeat patterns, and create investigation tickets while the engineer is still closing the incident. Where View B Fails Dangerously Consider a scenario: your e-commerce platform goes down on Black Friday. Revenue loss is $100,000 per minute. Bex's position would suggest the team should investigate the root cause to ensure it doesn't happen again. Meanwhile, your customers are going to competitors, your brand reputation is eroding, and your CEO is watching revenue evaporate in real time. No stakeholder — no customer, no board member, no investor — would accept "we're learning" as a response during a live outage. My Conclusion View A is the correct architectural stance because you cannot learn from a system that no longer exists. If the business fails, if the patient dies, if the customer leaves — there is nothing left to optimize. Resolution preserves the opportunity to learn. Learning without resolution is an academic exercise performed on a corpse. The right model is: resolve immediately, learn inevitably. AI should accelerate the fix first and auto-trigger the investigation second. But when forced to choose — and the prompt demands a choice — resolution comes first. Every time.
  4. My Position: Keep the Feature, Fix Selectively (View B) Rolling back a feature that works well for 90%+ of users because 10% are hitting friction is like closing an entire highway because one lane has a pothole. You don't shut down the road. You cone off the lane and fix it. The real risk isn't keeping the feature live — it's doing nothing for the affected users. What actually breaks trust isn't a bumpy experience for some users. It's when people can tell the company knows and isn't doing anything about it. If you keep the feature running, isolate the affected cohort with feature flags, and ship a targeted fix on a tight timeline, you're demonstrating more care than a blanket rollback ever could. A rollback doesn't fix anything structurally. The reason 8–10% of users had problems — older devices, specific usage patterns — will still be there when you relaunch. You've just delayed the same conversation. Meanwhile, you've taken away something that was genuinely working for the majority. This isn't theoretical — it's how the best teams actually operate. Consider what happened with Instagram's shift to a video-heavy feed algorithm around 2022. A significant chunk of users — particularly creators focused on photography — experienced a degraded experience and pushed back loudly. Instagram didn't fully revert the algorithm. Instead, they adjusted ranking signals, reintroduced more photo visibility for affected segments, and iterated. They kept the broader direction that was driving engagement for most users while making targeted corrections for the groups that were struggling. That's selective fixing in action at massive scale. Where I'll concede: if the 10% are experiencing something severe — lost data, failed transactions, security issues — then yes, pull the feature immediately. Severity trumps percentages every time. But if we're talking about friction, slower load times, or UI glitches? Fix forward, don't retreat. The practical move: flag the affected segment today, toggle the feature off just for them using feature flags, communicate transparently, and set a bounded deadline for a fix. If you can't fix it within that window, revisit the rollback. Not as a first instinct — as a last resort. Easy decisions protect the short term. The harder, better choice is building a system that adapts to different user contexts without holding everyone else back.
  5. In my opinion, View B — Continue unless failure is certain makes more sense. Should You Stop the Line Every Time AI Flags a Defect? No. Stopping immediately every time the AI raises an alert sounds responsible. In practice, it creates its own problems. With a 12% false positive rate and 3–4 flags per shift, you're generating 2–3 unnecessary stoppages every week. That's real time lost chasing problems that don't exist. Worse, when workers experience repeated false alarms, they stop taking alerts seriously. This is observed at Three Mile Island in 1979 — operators began ignoring warning signals because there were so many nuisance alarms. The safety system worked against them. An AI that cries wolf too often stops being useful — even when it's right. What Works Better: Stop Smarter Semiconductor manufacturers like TSMC face some of the strictest quality standards in the world — and they don't stop the line at every flag. They use a simple tiered response: Low confidence flag → Keep running, increase monitoring High confidence or serious risk → Investigate at the next natural break High confidence + high severity + confirmed by a second signal → Stop immediately When hard stops are reserved for genuine high-risk moments, they carry real weight. Operators trust them. The system retains its credibility. To summarize, blanket stop rules drain throughput, frustrate operators, and ironically leave you less protected by eroding trust in the AI itself.

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.