Skip to content
View in the app

A better way to browse. Learn more.

Benchmark Six Sigma Forum

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.

Jamiu_Lasisi_LQ84

Members
  • Joined

  • Last visited

  1. My Position: Challenge Bex—Reduce Inefficient Collaboration, But Redesign What Collaboration Is For I support View A, and I challenge Bex directly. Not because collaboration is unimportant—it is essential. But because Bex has made a category error: defending the activity of collaboration rather than its purpose. The question is not whether teams should collaborate. It is whether organisations should continue spending hours in problem-solving workshops when AI has already identified the root cause and optimal solution. Bex's answer defends the meeting. The right answer is to redesign what happens in it. The Wrong Question Bex Is Answering Bex frames this as AI speed versus human collaboration—as though they are competing for the same organisational resource. They are not. The real question is: what should human cognitive capacity be spent on once AI has handled pattern recognition and solution generation? Collaborative problem-solving workshops exist for two separable purposes: Finding the solution—diagnosing root causes, generating options, selecting the best path Building capability, alignment, and ownership—ensuring teams understand the problem, commit to the solution, and develop the skills to solve similar problems independently AI has now made Purpose 1 faster and better in many operational contexts. Bex's response is to protect the time spent on Purpose 1 anyway. The correct response is to redirect that time entirely to Purpose 2—which AI cannot provide and which delivers the long-term organisational capability Bex is rightly concerned about it. Four Conditions Where View B Legitimately Applies Preserving traditional collaborative problem-solving has legitimate force when all four conditions are met simultaneously: The problem domain is novel—AI has no historical pattern to match against; human creative reasoning is required The solution requires political alignment—implementation depends on cross-functional commitment that cannot be mandated The team's capability gap is in problem diagnosis—the collaborative exercise builds skills that don't yet exist AI confidence is low—the predictive signal is weak enough that human deliberation adds genuine solution quality In the scenario presented—recurring process problems where AI consistently outperforms workshop outputs—none of these conditions are met. The domain is not novel. The solutions are execution-dependent, not politically contentious. And AI confidence is demonstrably high. Bex is defending collaboration for problems where it adds the least value. Example 1: Google's Project Aristotle — What Collaboration Actually Builds Google's five-year study of team effectiveness—Project Aristotle—identified that the highest-performing teams were not defined by how long they spent in problem-solving sessions. They were defined by psychological safety, clarity of purpose, and structured contribution. Teams that spent the most time in unstructured deliberation were not the best performers—teams that had clear problems, defined roles, and trusted each other to execute were. The direct parallel: When AI handles root cause identification and solution generation for recurring operational problems, it does not eliminate the conditions. Project Aristotle identified as creating high performance. It eliminates the portion of meetings spent on pattern-matching and option generation—the cognitive work that AI now does better. What remains—alignment, commitment, learning, and capability development—is precisely what Project Aristotle showed actually matters. Bex cites Toyota as evidence for collaboration. Google's research reveals that Toyota's real advantage is not the meeting—it is the psychological safety and learning culture that meetings, when properly structured, can build. AI does not threaten that culture. Inefficient meetings do. Example 2: Toyota Production System — What Bex Got Right and Wrong Bex cites TPS as evidence that collaboration preserves capability. This is partially correct and substantially misapplied. Toyota's TPS does not protect long problem-solving workshops. It is built on structured, rapid problem-solving using the A3 methodology—a disciplined, one-page format that forces root cause clarity before solution discussion. The entire philosophy is: get to the root cause fast, align briefly, act immediately, learn from the outcome. Toyota's nemawashi (consensus-building) process is not about lengthy deliberation—it is about brief, structured alignment after the diagnosis is already clear. When AI can identify the root cause in minutes with the same accuracy that a well-run A3 process delivers in hours, it is not threatening TPS—it is accelerating the diagnostic step that TPS says should be done quickly. The learning and capability-building in TPS comes from the structured reflection cycle (PDCA—Plan, Do, Check, Act) that follows implementation—not from the root cause identification workshop that precedes it. The correct Toyota parallel: AI handles the plan phase faster. Human collaborative reflection handles the Check and Act phases deeper. TPS endorses this division—it does not endorse spending hours on a diagnosis that AI can do in minutes. Example 3: NASA's Mission Control—Speed and Collaboration in the Right Sequence NASA Mission Control operates under extreme time pressure with lives at stake. When Apollo 13's oxygen tank failed in 1970, the team did not convene a collaborative workshop to diagnose root causes. They used every available data signal—the equivalent of AI-speed analysis—to identify the problem within minutes. Then the collaborative human work began: designing a solution with available materials, building alignment across multiple expert teams, and executing under conditions no historical data could have fully predicted. The wrong sequence (Bex's implied model): Spend time collaboratively diagnosing a problem that instruments already identified → delay solution identification → slower response. The right sequence (View A's model): Use the fastest available diagnostic capability → redirect human collaborative energy to solution design, alignment, and execution → faster, better outcomes. NASA's Gene Kranz did not say, "Let's workshop the root cause together." He said, "What do the instruments show?"—and then, "Work the problem, people." The collaboration was not eliminated. It was redirected to where human judgement was irreplaceable. Example 4: The NHS Diagnostic Bottleneck—When Protecting Collaborative Process Costs Lives NHS England has documented waiting times for diagnostic procedures—cancer screening, radiology, and pathology—that extend to weeks or months because diagnostic workflows involve multiple collaborative review steps, multidisciplinary team meetings, and consensus processes designed for human diagnostic capability limitations. AI diagnostic tools—including those developed by DeepMind (now Google Health) for diabetic eye disease and breast cancer screening—have demonstrated diagnostic accuracy matching or exceeding specialist consensus. In the DeepMind diabetic retinopathy study, AI matched the diagnostic performance of eight ophthalmologists. NHS trusts that have adopted AI-assisted diagnostic triage have reduced waiting times by 30–50% in pilot programmes — not by eliminating clinician review, but by redirecting it. Instead of every clinician reviewing every scan collaboratively, AI pre-screens, flags high-priority cases, and routes human collaborative review to the cases where it adds genuine value: ambiguous diagnoses, complex treatment planning, and patient communication. The direct parallel, Bex's View B, applied to NHS diagnostics, would preserve the collaborative multi-disciplinary review meeting for every scan—including the clear, low-complexity cases where AI is faster and equally accurate. The cost of that preservation is measured in delayed cancer diagnoses and preventable harm. Collaboration preserved in the wrong place is not a virtue. It is a cost paid by the people waiting for the outcome. The Unified Framework Example Bex's View B Prediction What Actually Happened / Should Happen AI Lesson Google Project Aristotle Collaborative sessions build the capability that matters High performance comes from psychological safety and purpose clarity—not session length AI frees time for what actually builds teams Toyota TPS TPS proves collaboration must be preserved TPS mandates rapid diagnosis and structured reflection—not lengthy workshops AI accelerates the diagnostic step TPS says should be fast NASA Apollo 13 Collaborative diagnosis is essential Instruments diagnosed the problem; humans solved and executed collaboratively Use fastest diagnostic capability; redirect collaboration to solution design NHS AI Diagnostics Preserve collaborative review for all cases AI triage + human review of complex cases = 30–50% wait time reduction Collaboration redirected to where it adds value saves lives The Conclusion Bex Didn't Reach Bex is right that collaboration builds capability, alignment, and ownership. Bex is wrong that preserving collaborative problem-solving sessions is the mechanism for achieving those outcomes. The correct response to AI-speed solution identification is not to protect the time humans previously spent on diagnosis. It is to redesign what that time is used for: From: "Let's collaboratively identify the root cause AI already found." To: "AI has identified the root cause—let's use our time to deeply understand why, build the skills to prevent it, align on implementation, and develop the capability to handle the next novel problem AI hasn't seen before." That redesign does not reduce collaboration. It makes collaboration more valuable by focusing it on the work that only humans can do. Bex defended the meeting. The right answer is to redesign it. Use AI where it is faster. Use humans where they are irreplaceable. The measure of a great team is not how long they spent in the room—it is what they built because of it.
  2. My Position: Challenge Bex — Do Not Act on Predictive Attrition Signals as Currently Framed I support View B, and I challenge Bex directly. Not because retaining talent is unimportant — it is critical. But because Bex has made a precise and dangerous error: conflating the legitimacy of the goal with the legitimacy of the method. Wanting to retain employees is valid. Using AI-predicted intent as the basis for differential treatment is a governance failure that creates more organisational damage than the attrition it prevents. The Structural Flaw: Bex Is Solving the Right Problem With the Wrong Instrument IBM cited a 25% turnover reduction as real. But the mechanism Bex attributes it to—acting on predictive attrition signals—obscures what IBM actually built: a system that identified systemic organisational conditions driving attrition and addressed those conditions at scale. That is categorically different from identifying specific individuals predicted to leave and treating them differently because of an algorithmic prediction about their future intent. The distinction is not semantic. It is the difference between the following: Legitimate: AI identifies that employees in a specific role, tenure band, or team are showing attrition patterns → organisation improves conditions for that entire group Problematic: AI flags John and Maria as 87% likely to leave → manager is told to treat John and Maria differently from colleagues who appear equally engaged Bex's framing validates the second model. That model creates the exact harms View B identifies—and IBM's actual programme, examined closely, was primarily the first. Four Conditions Where View A Legitimately Applies Proactive intervention on attrition signals has legitimate force when all four conditions are met simultaneously: Interventions are applied to conditions, not individuals—the AI informs policy change, not person-specific differential treatment Employees are aware AI is being used—transparent disclosure exists; employees are not being profiled without knowledge Predictions are used to improve, not surveil—the outcome is a better workplace for everyone, not a watchlist. False positives carry no consequence—an employee incorrectly flagged as high attrition risk is not disadvantaged by that misclassification In the scenario presented—where managers may treat high-risk employees differently and predictions may be wrong—none of these four conditions are reliably met. The intervention model described is individual-level, not condition-level. Disclosure is not mentioned. And false positive consequences are explicitly acknowledged as a risk. Example 1: Amazon's Warehouse Monitoring — What Happens When Predictive Signals Drive Individual Treatment Amazon deployed extensive algorithmic monitoring of warehouse worker behaviour—tracking productivity, movement, rest patterns, and performance signals—using outputs to make individual-level decisions about performance management and termination. The system generated predictions about which workers were underperforming and acted on those predictions proactively. What actually happened: Investigative reporting by The Guardian and The Atlantic documented that Amazon's algorithmic management system was generating termination recommendations that workers had no visibility into, no ability to contest, and no awareness of being made about them. Workers were being managed—and in some cases dismissed—based on algorithmic assessments of predicted future behaviour rather than documented actual performance. The consequences: regulatory scrutiny across the EU and UK, significant reputational damage, unionisation drives at multiple facilities directly attributed to surveillance concerns, and—critically—increased attrition as workers chose to leave organisations where they felt monitored rather than managed. The direct parallel: Amazon had exactly the organisational goal Bex describes—reduce costly turnover and underperformance in a high-volume talent environment. The AI-driven individual prediction model did not achieve it. It accelerated the attrition it was designed to prevent by destroying the trust that retention requires. Example 2: Unilever's Responsible People Analytics — The Correct Model Bex Should Be Citing Unilever implemented one of the most sophisticated people analytics programmes in the world—using data to understand attrition patterns, engagement drivers, and talent risk. Crucially, Unilever's framework was built on a principle that directly contradicts Bex's individual-prediction model: aggregate insight drives policy change; individual data does not drive individual treatment. When Unilever's analytics identified attrition risk concentrations in specific functions, geographies, or career stages, the response was structural: Redesigned career pathways in high-attrition functions Manager capability programmes in teams showing engagement decline Compensation benchmarking in roles showing systematic flight risk Workload distribution reviews in functions showing burnout signals The outcomes: Unilever achieved sustained improvement in retention, was recognised by the Ellen MacArthur Foundation and Responsible Business Alliance for ethical people practices, and maintained employee trust scores that consistently outperformed sector benchmarks—including during major transformation programmes. The direct parallel: Unilever used the same data signals Bex describes — absenteeism, engagement, performance, workload — and achieved better retention outcomes than the individual-prediction model, without the trust destruction, false positive consequences, or differential treatment risks. The AI informed organisational conditions. It did not create a watch list. Example 3: The NHS Staff Retention Crisis — When the Prediction Was Right and the Response Was Wrong NHS England has extensive workforce data showing, years in advance, which specialties, trusts, and career stages carry the highest attrition risk. The predictive signals have been consistently accurate. The institutional response — when it acted on individual-level signals — was to increase monitoring, performance management pressure, and administrative burden on staff already showing engagement decline. What actually happened: NHS staff attrition reached record levels precisely in the specialties and trusts where predictive signals were strongest and individual-level interventions were most aggressively applied. Staff surveys consistently showed that feeling surveilled and mistrusted was a primary driver of departure intent—meaning the intervention was causing the outcome it was designed to prevent. When NHS trusts shifted to the conditions-based model—addressing rota design, pay equity, development access, and management quality at a systemic level—retention stabilised in those trusts even as national figures continued to deteriorate. The direct parallel: The NHS case proves the mechanism Bex misidentifies. The AI prediction was accurate. The attrition risk was real. But acting on individual prediction signals — rather than systemic conditions — converted a retention problem into a surveillance problem, and surveillance accelerated departure. The Wrong Metric Bex Is Using Bex cites IBM's 25% turnover reduction as validation of the individual-prediction approach. But turnover reduction is not the right metric for evaluating whether predictive attrition intervention is ethically and operationally sound. The right metrics are the following: Metric What It Reveals Employee trust scores Does the workforce believe the organisation acts in their interest? False positive rate and consequences What happens to employees incorrectly flagged as high attrition risk? Disclosure and consent Do employees know they are being assessed for attrition probability? Treatment differential Are flagged employees receiving different management from non-flagged colleagues? Retention of non-flagged employees Is the intervention improving conditions broadly, or only for the watch list? IBM's programme, when examined against these metrics rather than turnover rate alone, reveals that the retention improvement came substantially from improved career development and engagement practices deployed broadly—not from individual-level predictive profiling. The Unified Framework Example Individual Prediction Model Conditions-Based Model Outcome Amazon Warehouse Algorithmic individual monitoring and management Not implemented Increased attrition, unionisation, regulatory scrutiny Unilever People Analytics Explicitly avoided Aggregate insight → structural policy change Sustained retention improvement, sector-leading trust scores NHS Staff Retention Individual monitoring increased Conditions redesign was applied The individual model worsened attrition; conditions model stabilised it IBM (Bex's example) Partial—but primarily conditions-based in practice Career development and engagement redesigned at scale 25% turnover reduction — attributable to conditions, not individual profiling The Conclusion Bex Didn't Reach Bex is right that losing experienced employees is costly. Bex is right that AI can identify attrition risk early. Bex is wrong about what to do with that information. The legitimate use of attrition prediction AI is to answer the question: What organisational conditions are we creating that make people want to leave? The answer to that question drives systemic improvement that benefits every employee — flagged and unflagged alike. The illegitimate use is to answer the question: Which specific individuals are likely to leave so we can treat them differently? That model violates employee privacy, creates management bias, produces false positive consequences, and — as Amazon and the NHS demonstrate — accelerates the attrition it is designed to prevent. The AI should be used to fix the workplace. Not to watch the employees. Predict the conditions. Improve the environment. Trust the people.
  3. My Position: Do Not Stop the Project Based on AI Prediction Alone I support View B, and I challenge the framing as fundamentally as I challenge the AI's recommendation. The scenario presents stopping vs. continuing as the decision. It is not. The real decision is whether an AI trained on historical failure patterns has legitimate authority over a politically committed, executive-sponsored transformation initiative. It does not—and organisations that grant it that authority will systematically kill their most important work. The Structural Flaw: AI Is Measuring the Wrong Signal for Transformational Projects Every signal the AI analyzes—milestone delays, budget consumption, stakeholder engagement, and decision bottlenecks—is a convergent project metric designed to measure execution conformance against a predetermined plan. Valid for operational projects. Structurally wrong for transformational ones. Transformational initiatives will, necessarily, produce exactly the signals the AI interprets as failure: Milestone delays — because transformational scope is discovered, not defined upfront Stakeholder disengagement — because transformation threatens existing power structures Budget overruns — because transformation requires investment in learning and iteration Decision bottlenecks — because transformation requires new decision frameworks that don't yet exist The AI is not detecting a failing project. It is detecting a transforming one. It cannot distinguish between the two because its training data contains insufficient genuinely transformational project outcomes to build a reliable model for them. Four Conditions Where View A Legitimately Applies Stopping a project on AI prediction has legitimate force only when all four conditions are met simultaneously: The project is operational, not transformational—defined deliverables, stable requirements, measurable conformance criteria AI training data includes sufficient comparable projects—prediction based on genuinely similar historical outcomes Executive sponsorship reflects informed commitment—sponsors have seen the evidence and still believe for documented strategic reasons Resources can be genuinely redirected—not simply reassigned to equally weak initiatives In the scenario presented—strong executive sponsorship, significant sunk investment, political importance, and transformational initiative—none of these conditions are cleanly met. Example 1: Apple iPhone — Every AI Signal Would Have Recommended Termination Between 2004 and 2007, Apple's iPhone development ran for three years, consumed hundreds of millions of dollars, missed multiple internal milestones, required fundamental technical pivots, including abandoning the original stylus interface entirely, and generated significant internal resistance from iPod and Mac divisions. An AI monitoring this project would have detected every failure signal in your scenario: persistent delays, budget overruns, stakeholder resistance, unresolved decision bottlenecks, and risk patterns consistent with historical failures at similar stages. Apple shipped the iPhone in June 2007. It became the most valuable product launch in consumer electronics history, generating over $1 trillion in cumulative revenue and creating the modern smartphone industry. The diagnosis: Every signal the AI would have measured was real. But they were not evidence of a failing project — they were evidence of a project solving problems that had never been solved before. Strong executive sponsorship was present throughout. That sponsorship was not an irrational sentiment. It was domain expertise recognising something pattern-matching cannot see. Example 2: Amazon AWS — The Failure Signature of a $90 Billion Business When Amazon developed AWS between 2003 and 2006, the initiative showed limited early adoption, required significant engineering diversion from revenue-generating retail operations, generated internal skepticism about whether selling infrastructure was a legitimate business model for a retailer, and produced persistent resource consumption signals inconsistent with its visible output. An AI would have flagged this against every criterion in your scenario. The pattern — high investment, slow visible progress, internal resistance, unclear market validation — matches the failure signature almost perfectly. AWS launched in 2006, and by 2023, it generated $90.8 billion in annual revenue, creating the cloud computing industry and becoming the majority source of Amazon's operating profit. The diagnosis: The AI would have been measuring Amazon's ability to execute against a retail project template. AWS was not a retail project. The signals of difficulty were transformation signals—the friction of a genuinely new business model being built inside an organisation optimised for a different one. Political importance, far from being a reason to stop, indicated that leadership recognised the strategic stakes the AI could not quantify. Example 3: Ford's EV Programme—When the Losses Are the Investment Ford's electrification programme under the Model e division lost $4.7 billion in 2023 and $5.1 billion in 2024. Production volume targets were repeatedly revised downward. Dealer resistance created persistent stakeholder friction. Battery cost timelines slipped. An AI monitoring this initiative would generate maximum failure probability scores on virtually every metric in your scenario. Stopping would be catastrophically wrong. Ford's EV programme is not failing—it is transforming a 120-year-old manufacturer for a regulatory environment in which internal combustion vehicles face progressive market closure. The losses are not failure evidence. They are the investment cost of surviving the next decade. The diagnosis: The AI is measuring the cost of transformation and calling it the probability of failure. These are not the same number. Every automotive manufacturer that abandons this transition faces existential competitive displacement—a risk the AI's historical project data cannot model because it has never happened before. Example 4: NHS NPfIT—When the AI Would Have Been Right Intellectual honesty requires the counter-case. The NHS National Programme for IT, launched in 2003 with a £6.2 billion budget, showed persistent failure signals for years before termination in 2011 at a cost of £10 billion to taxpayers. Strong political sponsorship suppressed early termination that the signals clearly warranted. The critical distinction: NPfIT failed not because it was transformational but because it was wrongly designed from the start—a top-down technology imposition without clinical co-design. The signals were design error signals, not transformation friction. An AI with sufficient domain context could, in principle, have distinguished between them. The lesson: The AI's prediction should not be ignored. It should be interrogated. The right response is structured human investigation asking, "Are these transformation friction signals or design error signals?" That question requires domain expertise the AI cannot provide. The Unified Framework Example AI Failure Signals Should Have Stopped? Outcome Apple iPhone Yes—delays, overruns, resistance No—transformation friction $1T+ revenue, industry created Amazon AWS Yes—slow adoption, resource diversion No category creation $90.8B annual revenue Ford EV Programme Yes—$4.7B+ in losses, missed targets No—survival transformation Existential strategic necessity NHS NPfIT Yes—delays, disengagement, overruns Yes—design error £10B wasted; terminated 2011 The Conclusion the AI Cannot Reach Alone The AI has done its job correctly — it identified a pattern correlating with historical failure. The error is treating that correlation as a decision rather than a question. The right response to a high-probability-of-failure prediction is a structured review asking four questions no AI can answer: Are these transformation friction signals or design error signals? One warrants persistence. The other warrants termination. Does executive sponsorship reflect strategic conviction or sunk-cost protection? One is an asset. The other is a liability. What is the asymmetric cost of being wrong in both directions? Killing a transformational initiative that would have succeeded is a categorically different error from continuing a fatally flawed one. What would need to be true for this project to succeed, and is any of it still achievable? If yes, redesign. If no, stop. The AI predicts failure probability. It cannot answer these questions. Only human judgement—informed by the AI's evidence, not replaced by it—can. Use the AI prediction as the alarm. Use human expertise to investigate what the alarm is telling you. Never let the alarm make the decision.
  4. My Position: Challenge Bex — Follow the AI, But Redesign What It Optimises For I support View A, and I challenge Bex directly. Not because broad opportunity distribution is wrong as a principle, but because Bex has made a precise analytical error: conflating the AI's current optimisation target with the AI's capability. The problem is not that the AI recommends top performers. The problem is that the AI has been given the wrong objective function. Fix the objective, and View A becomes the most powerful capability development tool the organisation has ever deployed. The Wrong Objective Function Problem Bex argues that following AI recommendations concentrates opportunity and stunts development. This is true—but only if the AI is optimising exclusively for current task performance. That is a configuration choice, not an architectural inevitability. An AI optimising for long-term organizational capability—factoring in skill gap closure, bench strength, succession depth, and team resilience alongside delivery quality—will produce dramatically different recommendations. The organisation asked the AI the wrong question, and Bex accepted that framing without challenging it. Critically, Google's 20% time—Bex's own example—proves View A's case, not View B's. It works precisely because the other 80% is assigned to people most capable of delivering it. That is View A with a development layer built on top. Four Conditions Where View B Legitimately Applies View B has legitimate force only when all four conditions are simultaneously met: the AI's optimisation target cannot be redesigned; short-term performance loss from broader distribution is tolerable; top performer burnout is not yet critical; and no systematic capability-building mechanism exists alongside task assignment. In a large operations organisation with AI infrastructure already deployed, none of these conditions are fully met. Example 1: The NFL Quarterback Model — Why Bex's Logic Breaks Under Pressure By Bex's reasoning, NFL teams should distribute starting opportunities broadly to develop backup quarterbacks and maintain squad morale. No serious organisation does this—because the cost of suboptimal performance in critical moments is immediate and measurable. What elite NFL organisations actually do is separate the function: deploy the best performers in critical games and use practice sessions and preseason games as structured developmental contexts for emerging talent. The Pittsburgh Steelers' decades of quarterback continuity is a masterclass in this model—develop deeply in controlled environments and deploy decisively when it matters. The direct parallel: Your organisation's AI is being asked to manage both the starting lineup and the practice schedule with one metric. Split the function. Use AI to assign critical tasks to best performers and to design developmental assignments in lower-stakes contexts. Bex collapses both into a single distribution decision. That is the error. Example 2: McKinsey's Staffing Model — Engineered Stretch, Not Random Distribution McKinsey does not randomly distribute client engagements to build broad capability. It uses a structured staffing model that assigns work based on current capability and deliberate developmental intent—with each consultant tracked against a capability development roadmap. Senior partners handle the most critical client relationships. But every engagement team is deliberately constructed to include consultants operating at the edge of their current capability, supported by experienced seniors. The AI equivalent assigns the lead role to the top performer and engineers the team composition for developmental stretch simultaneously. McKinsey produces more senior business leaders per alumni cohort than almost any organisation in the world—not because it distributed critical work randomly, but because it engineered capability development into performance delivery. The two were never in conflict. The direct parallel: Bex treats performance and development as zero-sum. McKinsey's model treats them as a portfolio optimisation problem—which is exactly what a correctly configured AI should be solving. Example 3: Toyota's Senpai-Kohai System — Concentration Done Right Toyota's production system concentrates critical quality decisions in experienced senior workers (senpai) who have demonstrated mastery. Junior workers (kohai) are assigned progressively more complex tasks under structured mentorship, with escalation protocols ensuring critical decisions flow to the right capability level. Plants that distributed quality decision authority broadly—in the name of empowerment—showed higher defect rates and slower capability development than plants maintaining structured concentration with clear developmental pathways. Toyota's quality consistency across decades came from structured capability pipelines operating alongside concentrated performance delivery — a dual-objective system. The direct parallel: Bex's Google example and Toyota's system both validate the same model: protect core performance and engineer developmental space deliberately. Neither supports random broad distribution against AI recommendations. The Unified Framework Example Bex's View B Prediction What Actually Happened The AI Lesson NFL Quarterback Distribute starts for bench development Elite teams’ separate performance from developmental practice Split the AI's function: assign and develop separately McKinsey Staffing Random distribution builds capability Structured stretch within performance delivery builds better leaders AI should optimise team composition, not just individual assignment Toyota Senpai-Kohai Broad decision authority empowers teams Concentrated decisions with structured pipelines outperform Dual-objective AI: current output and future capability together Google's 20% Time (Bex's example) Proves broad distribution works Proves structured developmental space alongside concentrated performance works Validates View A with a redesigned objective function The Conclusion Bex Didn't Reach Bex is solving the right problem with the wrong instrument. The answer to over-concentration is not to override the AI's recommendations. It is to expand what the AI is asked to optimize. An AI told to maximise today's task success rate will recommend top performers every time. An AI told to maximise organisational capability at a 36-month horizon—weighting delivery quality, skill development velocity, bench strength, and retention risk simultaneously—produces fundamentally more valuable recommendations. The organisation does not have an AI problem. It has an objective function problem. And the solution is not broader distribution against the AI's judgement. It is giving the AI better judgement by asking it the right question. Bex accepted the AI's current configuration as fixed. That is the analytical error. The AI is a tool. The objective function is a choice. Make it the right one.
  5. My Position: Pursue Bold Innovation Despite the AI Warning I support view B, without qualification. The problem is not that AI is occasionally wrong about risk. It is that AI risk models are structurally incapable of evaluating transformational opportunity — not because of a data gap, but because of an architectural one. Training on historical patterns does not just limit what AI can see. It actively biases AI toward recommending the status quo at precisely the moment when the status quo is most dangerous. The Structural Flaw Every AI risk model is trained on outcomes that were recorded. Recorded outcomes are, by definition, things that happened within existing market structures. When an AI evaluates a radical new business model, it is not assessing the future. It is asking: Does this resemble things that worked before?" Transformational innovations do not resemble things that worked before. That is the definition of transformation. The more genuinely transformational an idea is, the more confidently a backwards-looking AI will flag it as dangerous. The AI's conviction is inversely correlated with the idea's potential. Four Conditions Where View A Legitimately Applies View A has legitimate force only when all four conditions are met simultaneously: failure modes are reversible; the decision domain closely resembles historical patterns; the opportunity cost of caution is low; and no identifiable human expertise sees something the AI cannot. In the scenario presented — a radical new business model with strong senior leader conviction — none of these conditions are met. Example 1: Netflix — Every Risk Signal Would Have Said No In 2007, Netflix pivoted from profitable, market-leading DVD-by-mail to streaming. A data-driven risk model would have flagged every warning: incomplete broadband penetration, no proven consumer streaming behavior, no content licensing infrastructure, and Blockbuster dominant with physical advantages that Netflix lacked. Netflix pursued the transformation anyway. By 2023, it had 238 million subscribers and had eliminated Blockbuster. Blockbuster, which had the data, the infrastructure, and the market position — and chose caution — no longer exists. The AI would have recommended staying with DVD-by-mail. It would have been statistically defensible and strategically fatal. Example 2: Apple iPhone — Protecting Nokia's Market Position In 2007, Nokia held 40% of the global mobile market with superior scale, distribution, and consumer data. That data showed customers valued battery life and physical keyboards above all else. Touchscreens tested poorly. An AI risk model evaluating the iPhone would have correctly identified every danger signal and recommended against it. Nokia followed its data. Apple ignored the historical pattern. By 2013, Nokia's mobile division was sold to Microsoft for a fraction of its former value. Apple became the most valuable company in the world. Nokia was measuring what customers said they wanted based on existing products. Apple was measuring where behavior would go once a genuinely new experience existed. AI trained on historical preference data cannot ask the second question. Example 3: Amazon Web Services — $90 Billion Built on a "High Risk" Idea In 2003, Amazon proposed selling cloud computing infrastructure—entirely outside its retail competency, with no proven enterprise market and significant capital requirements. Every risk signal pointed away from it. AWS launched in 2006. By 2023, it generated $90.8 billion in annual revenue and created the modern cloud computing industry. The retail business that the AI would have recommended protecting is now, in margin terms, less significant than the business it would have recommended not building. The absence of historical precedent is not evidence of high risk. For transformational ideas, it is frequently evidence of genuine opportunity. The Unified Argument Example AI Risk Signal What AI Would Have Recommended Actual Outcome Netflix Streaming High Stay in DVD-by-mail 238M subscribers; Blockbuster extinct Apple iPhone High Protect existing mobile market Most valuable company in the world Amazon Web Services High Focus on retail $90.8B revenue; industry created Three industries. Three transformational decisions. In every case, the AI risk signal would have been statistically defensible and strategically fatal to follow. The Conclusion: The AI Cannot Reach Alone AI risk models have legitimate authority over decisions that resemble historical patterns. They have no legitimate authority over decisions that are transformational precisely because they do not resemble historical patterns. Applying a historical risk model to a genuinely novel opportunity is not rigorous analysis. It is a category error. The senior leaders in your scenario are not being emotional. They are applying forward-looking, qualitative domain judgment that is not a weakness in AI—it is an architectural boundary. Use AI to manage the risk. Use AI to pressure-test assumptions and model downside scenarios. But the decision to pursue a transformational opportunity must remain with human leadership. Use the AI to manage the risk. Do not use it to veto the vision.

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.