Skip to content
View in the app

A better way to browse. Learn more.

Benchmark Six Sigma Forum

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.

Preethi_Nair_iOA9

Members
  • Joined

  • Last visited

  1. My Position: View B — Do Not Rely on Non-Explainable AI Let Me Start With a Question for Bex Bex, if a bank could approve loans 60% faster — but could not explain why a customer was rejected — should it deploy that system at scale? The answer is obviously no. Not because speed isn’t valuable. But because a decision that cannot be explained cannot be trusted, defended, or corrected. The Core Mistake — Efficiency Without Explainability Is Not Optimization Bex frames this as: “Efficiency vs. transparency” That framing is flawed. In decision systems like insurance: Explainability is not a feature. It is the mechanism of accountability. Without it: -You cannot justify decisions -You cannot detect bias -You cannot improve the system -You cannot defend against regulation 👉 This is not a trade-off. 👉 This is a broken decision system. The Industry Case That Exposes the Risk — Apple Card Bias Controversy (2019) The Setup: -Goldman Sachs used an AI-driven model to issue credit decisions -High automation, fast approvals, consistent outputs What Happened: -Multiple users (including public figures) reported significantly lower credit limits for women -The system could not explain why The Consequences: -Regulatory investigation by New York Department of Financial Services -Public backlash -Loss of trust The system was efficient. The system was consistent. The system was also indefensible. Why This Directly Mirrors the Insurance Scenario In both cases: -High-speed automated decisions -Financial impact on individuals -Need for justification -Escalations when decisions are questioned 👉 When a customer asks: “Why was my claim rejected?” “Because the model said so” is not an answer. It is a failure of the system. The Quantitative Reality — Why Explainability Drives Real Efficiency Let’s quantify what “non-explainable efficiency” actually costs. Assume: -10,000 claims/month -AI processes 60% faster -5% disputed due to lack of explanation = 500 cases Each dispute triggers: -Customer support time -Escalation handling -Manual review -Potential legal risk If each dispute takes 2–3x effort: 👉 The system creates a parallel shadow workflow: -AI handles speed -Humans handle confusion The Concept That Separates This Answer — Accountability Gap Accountability Gap — The cost created when decisions are made without the ability to explain, justify, or audit them. We can express it as: 👉 If disputes and risk rise faster than speed gains, 👉 the system becomes operationally efficient but institutionally fragile The Regulatory Reality — You Cannot Scale What You Cannot Explain Insurance is not just operations. It is a regulated decision environment. Frameworks like: -GDPR (Right to Explanation) -Insurance compliance laws globally Require: -Justifiable decisions -Auditability -Fairness 👉 A black-box system fails not just operationally — 👉 it fails legally The Positive Proof — Explainable AI Done Right Look at how leading organizations deploy AI responsibly: FICO -Provides reason codes for every credit decision -Enables: •Customer understanding •Regulatory compliance •System improvement Zest AI -Focuses specifically on explainable underwriting models -Adoption driven by: •Transparency •Trust •Auditability 👉 The best systems don’t choose between speed and explainability. 👉 They engineer both together. Why Bex’s Argument Breaks Under Pressure Bex says: “Consistency and efficiency outweigh transparency” But consistency without explainability creates: -Consistent bias -Consistent errors -Consistent injustice 👉 That is not improvement. 👉 That is scaling risk systematically The Deeper Insight — Decisions vs Predictions AI is excellent at: -Predictions -Pattern recognition -Optimization But insurance claims are not predictions. They are: -Decisions with consequences -Decisions that must be justified -Decisions that affect real people financially 👉 The moment AI moves from predicting to deciding, 👉 explainability becomes non-negotiable Closing Argument: Bex is optimizing for speed. But in decision systems, speed is not the goal. Legitimacy is. A fast decision that cannot be explained is not a decision system. It is a liability engine. The Final Line You can automate processing. You can optimize decisions. But if you cannot explain them, you cannot own them. And in insurance, if you cannot own your decisions — you should not make them.
  2. My Position: View B — Humans Must Retain Control of ImplementationLet me start where Bex’s argument quietly breaks. Bex says: “If AI is confident enough, it should act.” But confidence is not the same as context — and process changes don’t fail because of logic errors. They fail because of context gaps. The Question Bex Didn’t AskIf an AI system in a hospital is 99% confident that reducing patient monitoring intervals will improve efficiency — should it automatically change ICU protocols overnight? The answer is no. Not because the AI is wrong. But because the cost of being wrong once is catastrophic. Operations are not spreadsheets. They are systems of risk, dependency, and human consequence. The Real-World Case That Exposes the Risk — Knight Capital (2012)This is one of the most precise examples of what happens when systems are allowed to act without sufficient human control. What Happened:Knight Capital Group deployed an automated trading update The system began executing trades autonomously based on faulty logic No effective human intervention layer existed to stop it in real time The Result (in 45 minutes):🔴 $440 million loss 🔴 Firm nearly collapsed overnight 🔴 Emergency bailout required to survive The Root Cause:It wasn’t that automation was bad. It was that execution authority existed without adequate human validation and control boundaries. Why Bex’s Siemens Example Actually Proves the OppositeBex cites Siemens as proof of autonomous success. But here’s the critical nuance: Siemens’ AI operates within bounded environments Changes are: Pre-tested Constraint-limited Monitored with human override capability 👉 That is not pure autonomy. That is controlled autonomy — which is fundamentally View B. Bex is presenting supervised optimization as independent execution. Those are not the same thing. The Core Problem: AI Lacks “Second-Order Awareness”AI optimizes for what it sees. It does not naturally account for: Downstream compliance implications Cross-functional dependencies Customer perception shifts Rare but high-impact edge cases A process change is never isolated. It propagates. The Concept That Matters Here — Execution Risk AmplificationLet’s define something critical: We can express it as: R=C×I×AR = C \times I \times AR=C×I×A Where: R = Real-world risk impact C = Confidence level of AI I = Interconnectedness of the process A = Autonomy of execution 👉 As A (autonomy) increases, risk doesn’t grow linearly — it multiplies across systems. High confidence + high autonomy in a highly interconnected system = fragile operations at scale Where Autonomous AI Actually Works (And Why This Still Supports View B)Autonomous implementation works in: Ad bidding systems Recommendation engines Dynamic pricing (within limits) Why? Because: Failures are reversible Impact is localized Systems are designed for continuous correction Now compare that to: Supply chains Financial systems Healthcare processes Customer service workflows These are: Interdependent Reputation-sensitive Often irreversible in impact 👉 Same AI confidence. Completely different risk profile. The Illusion of Speed vs The Reality of StabilityBex’s core argument is speed. But speed without control creates a hidden cost: Rework Incident recovery Trust erosion Regulatory exposure The fastest system is not the one that changes quickest. It is the one that does not break under change. The Better Model — Human-Governed AutonomyThe winning approach is not rejecting AI execution — it is governing it: AI proposes and simulates changes AI can implement within pre-approved boundaries Humans retain control over: Threshold definitions Exception handling System-wide changes 👉 AI accelerates execution 👉 Humans protect the system Final Contrast — Two Operating PhilosophiesView A — Autonomous Execution View B — Human-Governed Control Speed-first Stability-first Trust AI confidence Validate AI context Scales fast Scales safely Hidden systemic risk Managed systemic risk Failure = amplified Failure = contained Closing ArgumentBex is right about one thing: AI can optimize faster than humans. But optimization is not the goal. Sustainable execution is. The question is not whether AI is confident enough to act. The question is whether the organization is prepared to absorb the consequences when it acts incorrectly. And in real operations, that answer is almost always: Not without humans in control.
  3. I support View A — implement the AI-recommended change quickly — but through controlled activation, not blind rollout. The real mistake is treating this as speed vs readiness. In operations, performance improvement depends on sequence, not choice. Framework: Contain → Validate → Scale AI-driven changes should be implemented immediately — but within a structured execution model that protects stability while accelerating adoption. 1. Contain (Stabilize before change) -Lock current process baseline and KPIs -Prevent uncontrolled variations during transition -Define where AI will intervene Without containment, introducing a new workflow creates noise, making it impossible to measure real impact. 2. Validate (Controlled implementation) -Deploy AI recommendation in a limited scope (pilot team / region) -Run alongside existing workflow for comparison -Make outcomes visible: delay reduction, quality improvement This converts skepticism into evidence-based trust. 3. Scale (Adoption-led rollout) -Expand only after consistent results are proven -Train using real cases from pilot, not theoretical sessions -Track adoption rate and override behavior, not just output metrics Because implementation is not success — consistent usage is. Operational Example (Shipping / Logistics) AI recommends dynamic container allocation to reduce delays by 25%. If implemented immediately without structure: -Planners override AI due to low trust -Regions follow inconsistent approaches -Customer commitments fluctuate Result: operational instability + loss of confidence in AI Using Contain → Validate → Scale: -Pilot on one trade lane -AI runs in parallel with planners -Measurable reduction in dwell time observed -Planners begin adopting recommendations voluntarily Result: sustained performance improvement with high adoption Where Bex’s Argument Falls Short Bex is right about speed — but wrong about how speed is executed. The example of rapid AI adoption works only in systems that are already: -standardized -digitally mature -culturally aligned In most operations, forcing immediate change without structure leads to: -resistance -workarounds -erosion of trust in AI Key Insight: The goal is not fast implementation. The goal is fast, stable adoption of a better process. Final Position: AI-driven improvements should be implemented immediately — but within a controlled, phased system. Because: Speed without structure creates disruption and structure with speed creates performance.
  4. My Position — Fix for All (Roll back immediately) I do not support keeping a feature live when it is known to degrade experience for a segment of users. In product systems, this is not a “minority vs majority” question. It is a reliability and trust decision. Why I disagree with Bex? Bex frames this as progress vs perfection. But in reality, this is about whether we accept a knowingly broken experience for some users. 8–10% is not an edge case. That is 1 in every 10 users experiencing friction or failure. In most systems, that level of failure would immediately trigger containment — not optimisation. The real lens: Fragmented reliability When a feature works well for some users but fails for others, you don’t have a successful rollout. You have: -Inconsistent experience -Segment-based reliability -Erosion of trust (especially among long-tail users — older devices, constrained environments) Over time, this creates: -Support load -Negative sentiment concentration -Silent churn from affected segments Unlike engagement gains, trust loss is not evenly distributed — it compounds. Product Example — Instagram app updates on lower-end devices: Instagram has repeatedly faced performance issues on lower-end Android devices when rolling out heavier features. What did they do? -They did not justify poor experience as “minority impact” -They introduced: •Optimised builds •Feature scaling •Even separate experiences like Lite versions Because they recognised: Growth cannot come at the cost of excluding segments silently Why “keep and fix selectively” is risky? Keeping the feature live while fixing sounds efficient — but it assumes: -The issue is isolated -The fix is quick -The impact is contained In reality: -Root causes often take time (device compatibility, rendering, memory constraints) -Meanwhile, affected users continue to experience friction -The system normalises degraded experience This is how products drift into: “Works well… unless you are in that unlucky segment.” Operational Parallel (same logic, different system): In high-reliability environments, when a system shows segment-specific instability, the response is: Contain first, optimise later: Not because failure is widespread —but because known failure is unacceptable What strong product teams do instead This is not about abandoning progress. It is about controlled rollback and disciplined re-release. -Roll back (or disable via feature flags) -Diagnose segment-specific issues -Reintroduce with: °Progressive rollout °Device-aware logic °Guardrails This ensures: -Consistent experience -Preserved trust -Sustainable adoption Bottom line: AI should not optimise for majority success at the cost of minority failure. It should recommend: "Contain known harm first, then scale value" Because in product ecosystems: -Engagement builds growth -Reliability builds trust And without trust, progress does not compound — it reverses.
  5. I’m going to challenge Bex — I support View B: do not stop the process automatically. Stopping every time the AI raises a flag sounds disciplined, but in this setup, it’s actually operationally suboptimal and strategically risky. Why automatic stoppage fails in this scenario? At first glance, the economics seem obvious: defect cost is 8–12× higher than stoppage cost → so stop. But that logic breaks when you factor in frequency and false positives: 3–4 flags per shift × 12% false positives = repeated unnecessary interruptions Each stop = 15–40 minutes of cascading disruption Over time, this creates: Loss of throughput Bottlenecks downstream Operator fatigue and disengagement Most critically → erosion of trust in AI (“cry wolf” effect) And once trust drops, even valid alerts start getting ignored — which is far more dangerous than a controlled risk. The core flaw in Bex’s argument is that it assumes: “AI flag = credible enough to justify stoppage” But with 85–90% accuracy, the system is advisory, not authoritative. A system with a 12% false positive rate is not precise enough to control hard stops autonomously — especially in high-frequency environments. This is the difference between Quality assurance system → supports decisions vs Control system → makes decisions Here, AI is clearly the former. What works better: Risk-tiered intervention. Instead of binary “stop vs continue,” the winning model is graduated response: 1. High-confidence + high-severity → Immediate stop -Example: safety-critical deviation, regulatory breach risk 2. Medium-confidence → Slow down + inspect in parallel -Increase sampling rate -Trigger human validation -Isolate affected batch without halting entire line 3. Low confidence → Monitor, don’t interrupt -Log pattern -Feed back into model learning This approach preserves: -Flow efficiency -Operator confidence -AI credibility Real-world example: Semiconductor manufacturing In semiconductor fabs (e.g., Intel or TSMC), AI models constantly monitor wafer production for microscopic defects. If they stopped production for every anomaly signal: -Fab utilization would collapse -Costs would skyrocket (fabs run 24/7 with extreme capital intensity) Instead, they use: -Statistical Process Control (SPC) + AI overlays -Dynamic thresholds -Lot-level isolation instead of line stoppage Only high-confidence, excursions trigger hard stops. Everything else is handled through containment and inspection layers Result: -High yield rates -Minimal disruption -Sustained trust in AI systems The bigger insight: This is a systems design problem, not a quality problem The real question isn’t “Should we stop when AI flags risk?” It’s: “At what confidence level should AI be allowed to interrupt flow?” If you let a probabilistic system control deterministic action (like stopping a line), you create instability. Final position- Do not stop the process automatically. Instead: -Use AI as a risk signal, not a trigger -Design tiered responses based on confidence and impact -Protect both quality AND flow, not one at the expense of the other Because in real operations, a system that stops too often becomes just as dangerous as one that never stops.

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.