Everything posted by Adil Khan18
-
How Do You Ensure an AI-Enabled Process Continues to Work as Intended Over Time?
Domain: Aerospace MRO - Engine shop for CFM56/LEAP Turbofans for Performance Restoration (€ 220 Million yearly turnover , approx. 1,800 shop visits in a year, AI rolled over since late 2025 to predict HPT module rework needs based on borescope images, oil debris analysis, and in-service data) Specific AI-enabled process: Predictive HPT Blade Rework Forecasting The AI will recommend if the module needs full blade rework, partial (only the tips), or none, all with the goal of eliminating unnecessary shop time and expense without losing the zero escape target on critical parts. It went live on all CFM56/LEAP visits in Q1 2026 and initially deliver an average 18% reduction in TAT on HPT modules. How we ensure & monitor the process continues to deliver intended outcomes We are treating this AI-human decision loop as a live control system and continuing to develop it over time not like one tine install, the focus is on sustainable business value – TAT savings, cost per visit going down, safety and zero quality escapes. What we monitor (daily / weekly / monthly) 1. Leading indicators (daily dashboard – shop floor + engineering) · Prediction accuracy of AI vs. actual rework result (confusion matrix updated every 50 engines). · AI suggestion Override rate by technicians / engineers (accept, tweak, reject AI recommendation). · Confidence score variation (how often is the model <80% sure?) · Data drift indicators, distributional shift of input variables (eg iron particles in oil, borescope crack density, EGT margin so on) 2. Lagging business outcomes (weekly review – operations + finance) · HPT Module: Turn Around Time Variance (target < 35 days). · Rework cost per engine vs. Baseline · Escape rate / quality holds on HPT (target 0) · Spare Parts Consumption vs. Forecast (Over/Under-Stocking Signals) 3. Model health metrics (monthly deep dive – MBB + data team) · Population stability index (PSI) on key inputs (>0.25 = moderate drift, >0.5 = severe). · Calibration plot (predicted probability vs observed rework rate) · Feature importance drift (which inputs is most important to the model now vs at launch) How we react when the going starts getting tough We have a three-level escalation protocol: Level 1 – Minor Drift (Weekly Trigger) · Override rate >25% or confidence <75% on >20% of cases. Response: · Immediate feed back loop i.e. every override by enginers requires 1-click reason (dropdown + optional voice note). · Retrain model based on last 100 engines + overrides justificatipn. · Notify shop team lead, usually fixes within 1-2 weeks Level 2 – Business impact emerging (weekly trigger) · TAT +3 days or rework cost increased +8% vs rolling 4-week average · OR escape / hold on HPT (even one) Response: · Hold AI recommendations - return to manual disposition within 48 hours/ · Root Cause A3 with MBB: Data drift? New failure mode? Change in user behavior? · Temporary rule: AI confidence > 90% required for auto-accept · Full model retrain + validation on hold-out set before re-release Level 3 – Systemic failure (monthly or immediate on escape) · PSI >0.5 on critical inputs OR calibration slope deviates >15% · OR sustained TAT/cost > 15% Response: · Full pause of AI in production · Independent audit: data lineage, labeling drift, concept drift · Notification to the regulator of any escape which occurred · Re-baseline from scratch or switch to a fall-back approach (manual and old rules) · Shared across sites post-mortem – we’ve had one Level 3 (new low-sulfur fuel changed oil debris patterns in Q3 2026) Practical setup we use today · Automated alerts using Teams/Slack when threshold breaches · Monthly “AI Health Review” (30-min standing meeting: MBB, ops manager, data lead) · Quarterly external benchmark against OEM data (CFM/Pratt) · Annual review of AI usage (EASA Part-145 requirement) Bottom line from the teardown bay AI Drift isn’t an ‘if’ but a ‘when’ In MRO, the price of slow degradation can be a long turn-around time, excessive spares, or even a failure in service. The way we monitor our AI is how we would monitor an engine, performing routine checks every day, and only grounding it completely when we have to. The process remains alive since we do not assume model is “set and forget”.
-
How Should MBBs Rethink Hypothesis Testing and Data Credibility When AI Is Involved?
Domain: Aerospace MRO - Engine shop for CFM56/LEAP turbofans performance restoration visits (€220M turnover facility ~1,800 shop visits / year, focus on reducing module TAT and cost while maintaining zero escapes on critical parts) Specific Lean Six Sigma Project: Reducing High Pressure Turbine Blades rework rate from 42% To less than 25% On Performance Restorations (This is a complete high cost driver worth nearly €4 - 6M on an annual basis, given unexpected coating wear that we notice, coolant hole blockage or tip rub forcing rework delays in TAT by 8-12 days per engine. The project began Q1 2025, utilizing AI-based predictive analysis of the borescope images and oil debris pattern recognition.) How the MBB treats AI-generated insights in this project 1. Forming or testing hypotheses Traditional LSS: MBB first conducts brainstorming, resulting in hypotheses like “coating peal off is caused by EGT exceedances above 50°C cumulative. ” Verified using designed controlled experiments and / or regression analysis. With AI: The model shows correlations instantly (for example: “HPT rework 68% correlated with borescope images of micro-cracks near cooling holes + oil debris iron particles > 15ppm last 500 hours”). MBB Role · Consider these AI results as hypothesis generators rather than conclusions. · Turn the AI correlation into a testable null / alternative hypothesis, e.g., H₀: No difference in rework rate between high iron oil lots and low iron oil lots. · Run confirmatory DOE or stratified sampling – no AI pattern should ever be taken as ‘causation’ without this. · Document: “Terminal Object: ‘AI suggested X → we formed hypothesis Y → tested via Z’”” 2. Establishing statistical confidence Also, it provides probability scores, such as “92% confidence this lot will need HPT rework” but rarely includes p-values, degrees of freedom or power. MBB must: · Demand transparency in Data flow : force the AI team to show their statistical method, such as random fore sets, SHAP values, or Bayesian posterior probabilities. · Re-run important patterns using classic statistics: t-test, chi-square, logistic regression, etc. on hold-out data. · Set hard thresholds: AI insights only actionable if classical p-value < 0.05 and effect size > medium (Cohen's d > 0.5 or OR > 2). 3. Assessing data quality and credibility AI Agent is only as good as the training data set, so all the good, bad and covering full tolerance band must be used for training. In MRO, the historical borescope images and oil reports are noisy with several variations (different inspectors, varying illumination, and inconsistent sampling). MBB safeguards: · Check data lineage audit: who collected, when, under what conditions? Reject datasets if greater than 15% missing/mislabeled. · Use stratified sampling to check for bias: for example, do high hour engines dominate the sample? · Run inter-rater reliability on borescope annotations. Kappa >0.7. · Never blindly trust AI predictions on ‘Black Swan’ events — If the model is trained on <10 similar cases, treat as low credibility. AI accelerates decisions in a) Early Analyze phase: pattern discovery – 5 – 10 potential X’s in hours vs weeks of manual Pareto / fishbone analysis. b) Screening: quickly eliminate weak signals (AI correlation <0.3: drop the hypothesis). c) Simulation: Test “What-If” Situations (e.g., additional inspection predicted to lead to 18% reduction in re Where traditional statistical validation is non-negotiable · Causation Claims: AI provides correlation (iron particles + rework) – MBB requires DOE or natural experiment to establish cause. · Critical to Safety Features: Changes to the HPT system that affect HPT integrity require p < 0.01 + power > 0.9. · Control phase: sustainment metrics including rework rate after change, again using control charts – here, monitoring is done by AI, but control limits in terms of what constitutes out of control remain set by SPC. Practical outcome after 7 months · Rework rate is now 23.8% · TAT savings approx 9.2 days / engine on average · No escapes - since MBB applied classical validation on each major insight · Team trusts AI because it is “AI suggests àWe Validate à We prove it”. Bottom line from the engine teardown bay AI is an amazing needle finder, and it screens hypotheses better than we do. But in aerospace MRO, causation is king, safety is non-negotiable, and regulators don’t accept "the neural net said so". The MBB’s task is to ensure that DMAIC is keeping: leveraging AI to make discovery happen faster, relying on classical stats to validate everything, and never confusing correlation with proof. That balance makes AI a serious accelerator of real improvement rather than a shiny toy.
-
Does DMAIC Still Hold When AI Enters the Picture?
Domain: Aerospace, MRO (Maintenance, Repair & Overhaul) - Engine shop for narrow body turbofan engines (€220M turnover facility, covering CFM56, V2500, and LEAP Shop Visits of various airlines and lessors) Specific Improvement Initiative: Reducing engine module Turn Around Time (TAT) from 45 – 60 days to <35 days during performance restoration visits in a span of 6 Months (This effectively represents another high-value chokepoint because releasing underlying store engines into service reduces costly leases and makes airline dispatch more dependable. We recently initiated an AI-driven project in late 2025 that seeks to incorporate predictions of module status using borescope, oil analysis and test cell data, then auto generates work scopes and part forecasts.) How we adapted DMAIC with AI — stage by stage Define This will still remain 100% human owned with no shortcuts. Simply, Artificial Intelligence cannot describe what "good" means for airworthiness parts. For few weeks we worked with airline customers, lessors and regulatory representatives to map CTQ’s: TAT < 35 days, zero escape on critical elements (HPT, HPC), > 12% cost reduction, and full traceability for EASA/FAA. AI also helped in the visualization of the current vs. target Pareto of delays. Boundaries (no compromise in Safety and/or Compliance) remained the responsibility of the MBB and the project sponsor. Measure "AI becomes super strong here — it accelerates data collection and baseline accuracy massively." Rather than manually sampling 200 historic Shop Visits, AI ingested 1800+ data points, including borescope images, oil debris, vibration trend, test cell parameters, etc., and created a clean baseline in a few days: · Avg breakdown TAT: TAT breakdown – disassembly 8d, Inspection 12d, Repair 28d, Assembly/test 12d. · Variation drivers (HPT blade rework 42% of variance) The judgment of humans is still predominant, i.e., validating data quality, excluding outliers resulting from non-standard visits, and checking if there is no survivorship bias present in the data set itself. Analyze This stage roles flip: AI takes the lead on root-cause discovery, humans will challenge and refine it. AI ran pattern recognition across thousands of features · "Predicted that 68% of HPT delays are caused by unexpected coating wear (not visible with a standard borescope). " · Simulated ‘what if’ scenarios (add ultrasonic inspection on blades → TAT -4 days, cost + €8k) etc. MBB owns: · Forcing AI to explain (SHAP values & counterfactuals). · Rejecting Correlations which Violate Physics/Engineering Judgment. · Prioritizing causes with the team (fishbone + AI insights). Improve AI excels in solution generation and testing, whereas piloting decisions fall under human expertise. AI produced 12 different workscope variants, ranked according to their TAT/cost/risk. We tested top 3 on 8 engines: · Added Predictive coating inspection → Reduced Surprise Rework · Auto parts pre-kitting based on AI prediction → Assemble wait time Human judgment prevails: deciding which approved variant will go ‘live’, managing change with technicians and getting sign-off from regulators. Control Human + AI Hybrid With Human ownership of Sustainability. AI Monitors and tracks real-time adherence (daily TAT tracker for deviances). MBB owns: · Control plan update (with new SOPs on the use of AI). · Monitoring adoptions: Technicians' override of AI predictions (<15%). · Monthly review: review of escapes/misses to retrain the model. · Celebrating success with the shop floor (sharing savings made visible) Which of the phases will be made stronger with the use of AI · Measure: 10× faster, more granular baseline · Analyze: reveals patterns not yet perceived by humans · Improve: generates/test thousands of scenarios in hours Where human judgment still needs to dominate? 1. Define: framing value and non-negotiables such as safety, 2. Analyze: rejecting physically impossible correlations. 3. Improve: Piloting real engines, i.e., one can’t simulate customers trust. 4. Control: Sustaining Culture & Accountability Practical result after 7 months · TAT average 33.8 days · Cost per visit down by 14% · No Escapes · Technician satisfaction up (less firefighting, more predictable work) Bottom line from the engine shop DMAIC may not be replaced, but it’s turbocharged! The AI takes care of the hard work on data and simulation, but MBB makes sure the method remains disciplined . right problem definition, right boundaries and right decision. Without humans "owning" Define and "challenging" Analyze, AI is a matter of "optimizing the wrong things faster, With humans owning, you get sustainable, compliant, high value improvement that regulators and customers actually trust.
-
What Is the Role of an MBB in an AI-Enabled Improvement Journey?
Domain: Aerospace subcontract precision machining shop (€85M turnover, 420 people, Make to Print for Airbus, Boeing, Safran – High Mix/Low Vol structure parts, AS 9100 D, Zero Defect pressure Improvement initiative: Reducing first-article inspection (FAI) cycle time from 12-18 days to under 7 days on new programs (This is an chronic bottleneck, as any new component or significant change-over involves FAI, which postpones the release of production and cash receipt. We initiated an AI - driven project in mid 2025 to work on autonomous measurement path, collision simulation and risk based inspections.) What The Master Black Belt (MBB) Must Own, Challenge and safeguard In this Ai-Enabled Journey "The MBB is no longer ‘the one who runs the analysis’ — this is now done 70-80% by the machine, faster and deeper. Instead, 'the MBB is 'the guardian of DMAIC, judgment, and sustainability.'" 1. Own the problem definition and scope (Define phase – non-negotiable) The AI will happily optimize whatever you give it to. But if the problem is formulated incorrectly (‘speed up the inspection’), it will recommend skipping verification or sampling or reduced inspection, which is essentially ‘suicide’ in aerospace sector. MBB shall own: a. Well-formed CTQ (Critical To Quality) tree related to customer / airworthiness requirements. b. Boundary conditions (what cannot be touched: full FAI compliance, no risk to zero-defect escape). c. Success metric that encompasses both speed and safety APIs FAI lead time < 7 days AND escape rate = 0 2. Challenge the output and assumptions of AIs critically (Measure & Analyze phases) AI detected a pattern ; 62% of FAI delays are caused by the manual programming of the probe path. It recommendation: Generative AI paths. MBB Challenged: · "Show me validation data on 50+ different geometries -- not just the training set." (Observed weak on deep pockets.)" · “What if the batch hardness of the materials drifts by 5%?” (AI did not have sensitivity analysis; we forced it to.) · “Is this recommendation compliant with AS9102 FAI requirements?” (Not fully – required manual override layer.) 3. Safeguard people engagement and change management (Improve & Control phases) Operators and inspectors worried about job security or accountability (“My path was not good, says the AI”). MBB owned: · Co-creation workshops: inspectors assisted with distinguishing between "acceptable" and "optimal" · Feedback loop: every AI suggestion is given a 1click "accept / fine tune / reject" with reason → so we can retrain the model. · Sustainability Dashboard: monitoring adoption rate, override reasons and monthly actual versus predicted time. · Recognition: Inspectors who provide good-quality feedback are given credit for their work. This credit is not restricted to cost savings. 4. Own the ethics & risk gate (throughout) MBB must be the one saying “no” when speed tempts shortcuts or skipping steps: a. No Auto-Release for AI-Generated Inspection Plans without Human Validation for Critical Features. b. Mandatory traceability: In aerospace traceability is everything, every path traced with program version, confidence score and final humans review. c. Escalation protocol if AI confidence < 80% on safety-critical dimensions Practical outcome after 9 months a. FAI cycle time reduced to 6.2 days on average b. Escape rate unchanged (0) c. Inspector satisfaction up (they now focus on judgment calls, not repetitive programming) d. AI Adoption rate of above 85% since people influenced the development of AI, rather than having it forced on them. Bottom line from the FAI Work center MBB is no longer the analysis hero – the AI is, MBB is now the discipline role model: even if pressure comes lines will not be crossed, corners will not be cut. Framing the problem is critical aspect if not done correctly we will be running after symptoms while actual problem is still killing us. So AI solves the right thing, challenging outputs or one wrong unquestioned assumption makes the process unstable. Garbage doesn’t become gospel and safeguarding people & compliance, so the improvement lasts beyond the pilot hype. Without that, you get fast garbage. With it, you get fast, safe and sustainability improvement.
-
What Should Leaders Start Doing to Fully Leverage AI?
Domain: Aerospace Supply Chain Management - Tier-1 supplier for airframe components (Turnover €1.2B, management of over 1,200 suppliers in Europe/Asia, supplies Airbus/Boeing assembly line just in time approach). Specific leadership activity: Supplier performance review meetings every week (the "war room" where we analyze on-time delivery, quality escapes and cost) This is the 2hr Friday ritual where the supply chain director and team grill the data, debate escalations, and determine interventions such as expedites, audits, and supplier changes. This was all done as gut-feel overrides in spreadsheets before the roll out of AI. The single new behavior that leaders must adopt, Start designing and enforcing "AI-first" decision rules instead of leading every debate by themselves. IMPACT ON DECRISION SPEED AND QUALITY Rather than the director controlling with statements like "I think the supplier is bluffing regarding lead time," they would communicate the following rules: · If the AI indicates that OTD < 92% + Quality Trend Down > 3%, auto-trigger a 48-hour audit to SQE. · If the predictive cost model indicates >5% material cost drift, then auto-open RFQ to alternate suppliers. · If RiskScore > 45 (Geopolitics + finance data), cap new volume to 20% of current. Why it Enables the Value of AI: · Quality increases because the rules are consistent and data-driven. No more forgetting a trend or bias for the "favorite" suppliers. We’ve seen false escalations decrease 40% in our pilot. · Speed doubled: Meetings reduce to 45 minutes when exceptions are discussed, and 80% of regular calls are addressed by AI. Decisions communicated through alerts mid week and not wait till Fridays. · AI learns faster because each override of the rules turns into feedback for learning, refining the model without any of the leader ego in the way. Practical adoptation in a real-life organization (how we did it) 1. Keep it small – pilot on one commodity category (example: fasteners), for 3 months, and establish 5-7 rules collaborative with the team. 2. Train the shift: Director models the role of “facilitator” in a meeting, using “what does the rule say?” rather than “what do I think?”. 3. Measure Behavior: Measure percentage of decisions made by rules versus manual decisions (target 70%+ Assign to Director’s OKR). 4. Build guardrails: The rules will be reviewed on a quarterly basis by a cross-functional team to prevent “stupid AI” moments. 5. Celebrate Early: When the first auto-escalation saves a line from stoppage, shout it out from the rooftops this will builds buy-in from all departments. The bottom line from the supply office Leaders who self proclaim themselfs as heroes in every review will turn artificial intelligence into a sophisticated charting tool. By accepting rule design as their new jam, they allow AI to handle the heavy lifting for consistency (data backed) and speed, freeing themselves work on strategic wins like new supplier ecosystems. “We are 8 months in, now the meetings are more concise, the decision making is more crisp and the director actually has some spare time for the strategic long term thinking.” It is not about less control, it all about smarter control.
-
What Should Leaders Stop Doing Once AI Enters the Organization?
Domain: Aerospace subcontract precision machining shop (Turnover:- €85Million, employee Approx 420, make-to-print for Airbus, Boeing, Safran. Zero defects drive, tight margins). Specific leadership activity: Manual final sign-off of AI-generated quotes and bid/no-bid decisions. This represents the "last mile" in which the pricing lead / Commercial director manually evaluates all quotes with a Life time Value over €100k and decided to Approve, Adjust Margin, or Reject the bid. It’s the same way it’s been done for the past 20+ years: "I want to see it with my own eyes before we sign off on a 5-to-10 year deal." The thing leaders should deliberately decide not to do once AI is involved in the quoting cycle: Leaders should no longer manually review line by line and make adjustments on every quote from AI generated quote. Why holding on to it, kills AI’s effectiveness It effects velocity The greatest benefit of the AI is the ability to make same-day decisions from quotes that used to tale 10-14 days. If the director wants the AI to change its margin, in this case, by 0.5% because it feels it’s risky, all the benefits will be lost. Customers will once again have to wait for 3-5 days, Customers notice and competitors can win our deals. It erodes trust in the system Because the AI system is seen as “just a suggestion” since the boss always overrides it, use is reduced – why put the time into perfecting input when the human input will just override it anyway? There is never actual learning feedback for AI model, because the overrides do not become “learning signals”. It generates Inconsistent Decisions One week the director is aggressive on margins (win at all costs), the next week the director is conservative on margins (protect cash flow). The AI system is consistent (historical data plus current capacity). Then there’s the unpredictable element of human whim, and this confuses the model and hurts margins. It wastes expensive brainpower The 180k€+/yr director spending 2-3 hrs/day on reviewing standard quotes has a cost of opportunity. This could be used for searching for new customers, negotiating contracts, or addressing actual bottlenecks. What we actually did (and it worked) We established a tough rule in Q3 2025 that: · Predictions that have risk score <30% & Margin is within target range (within ±2% of historical average) → auto-approved. There is no human intervention. · Outside this range → send into director’s office for review no longer than 15 minutes. · All overrides need to be documented with reason code, → helps to train the model every month. Outcome after 6 months: · Director review time reduced by 85% · Quote Velocity up, win rate remains stable · Margin variance decreased (less human mood swings) · Team defends the AI system because “the boss believes it more than his own intuition about everyday matters.” Bottom line from the commercial office Once AI nets input on routine volume decisions, leaders must avoid fallaciously prefacing every decision as “my final say.” Holding on to the old human approval ritual, will turns AI from a force multiplier into an expensive suggestion box. The shift that we follow is not about giving up control, it’s about moving control to where it matters the most: exceptions, strategy and exceptions only. When the boss stops micromanaging the middle 80% and starts owning the risky 20%, that’s when AI starts paying real dividends.
-
How Do You Know If an Organization Is Truly Ready for AI?
Manufacturing Domain: Aerospace subcontract precision machining shop How do you determine if an organization is ready for AI? As someone sitting in an aerospace sub-contract machining facility (€85M turnover, 420 employees, entirely make to print for the big primes), I have seen two attempts where AI succeed and two fail in the last three years. It had nothing to do with the technology, it was all about process and people were they ready. Specific context: Introduction of AI within CNC program prove-out (CAD / CAM Work Center) and first-article inspection (FAI) preparation as per AS 9102. This represents the painful handoff where a new part program is generated from CAM software, followed by simulation, dry run on the machine, measurement, tweaking and eventually approval for production. This is high-risk, where one incorrect offset or unresolved collision may ruin a $180k billet or jeopardize a $2M series production deal. The clear signs that show us of being ready (and it succeeded) We rolled out a AI-based collision detection + auto-measurement paths in this process in early 2025 using generative AI and sp far it has been a quiet success. The indication that were saying silently, we were ready can be seen below: a. Process was already stable and measurable We had worked on prove-out, average 18.4 hours per new program entered into two years, defect rate measured and all steps were signed off using a digital traveler. Black magic was not needed – all was observable. b. People trusted the current data The machinists and inspectors thought the recorded times and test results were correct (because they recorded the data themselves and reviewed the results each month). No mindset of “the system is lying” here. Decision discipline existed. There was clear ownership (lead programmer + quality engineer) on what was “safe to run” vs. “tweak." It was not a debate each time. A feedback loop was in place In every failed collision or FAI rejections, a 15 minute stand-up CFT meeting was in place to determine root cause analysis, capture the learnings and PFMEA updated. The leadership was very tolerant of small-scale We could pilot it for 4 weeks on two machines without having to put together a 50-page business case and Top management approval. Outcome: AI has now cut prove out time by 38%, collision numbers to nearly zero and the team accepted it willingly, because it made their life easier, not scarier. Warning signs from a different process, when we were NOT ready (and it backfired) Same shop, same year (2024) – we applied AI for predictive tool life in milling of titanium parts. The red flags were apparent if we think about it now: 1. “The process data itself was rubbish,” The tool life logs were still paper in some of the cells. The operators recorded “tool OK” or “changed” without any basis in reality. Was it new / re-grinded tool was not captured properly. The data used to train AI was half booked half uncooked. 2. There is no agreed decision rule. "One setter changed tools at 80% predicted life 'to be safe' and another ran it till tool broke 'because the job must be shipped on time'." The reason were not properly documented. No standard existed so a machine's suggestion was just another voice no one cared about. 3. Blame Culture for failures When a tool was broken, the first question asked was who do it?, "who overrode the schedule?" rather than "what have we learned?" Operators started hidding overrides rather than recorded. 4. Incentives misaligned “For operators daily targets were based on number of parts produced in a hour, if tool change effected his output. He will be questioned and punished for it” 5. They leadership demanded magic, not change “They were looking for AI to 'fix tool costs' without changing that particular Work center targets, standards or training. Outcome: adoption <30%, costs of tool increased (over-conservative enhancements+missed notifications), project has been effectively parked for last 8 months and 420k has been spent. My practical, custom made readiness checklist before any new future AI project selection Green lights (must have most): · "Process is stable (variation tracked, not guessed)" · The data is trusted and accurately recorded by the people who use the information · Clearly defined owner and decision-making rules exist · "Feedback from failure is learning; feedback from success is conceit." · Incentives encourage the behavior in the current era (not just in the AI era) Red Flags (one is sufficient to stop): · “We'll clean the data after the AI is live” · Operators/managers do not think the recent reports are · Decisions are political and/or personality-driven · People are evaluated on things which are counter to AI objectives. · It appears that leadership wants "AI to fix it" without having to make other changes. Bottom line from the shop floor Technology readiness is easy - just buy a license and train a model. Organisational readiness is tough – it’s about whether your people and process are disciplined enough to work with AI or not. If your process can’t handle good human judgment and good data easily, your AI will simply accelerate the mess. We are now moving ahead only when at least 4 out of the 5 green lights are on. Everything else receives a "not yet."
-
In What Order Should AI Initiatives Be Taken Up to Maximize Value?
Manufacturing Domain: Aerospace subcontract precision machining shop If I had to plan our AI adoption plan for upcoming 12 – 24 months, here is the exact sequence I would enforce, no debate. Phase 1 (Months 1 - 6):- Quote-to-Order process — use of AI in RFQ analysis, pricing (We already did this in 2025, which is why I am so adamant about starting here) Why first? Fastest, best ROI: increased quote response rate from 38% to 92%, added 20 million euros of backlog that would have been missed, and the system paid for itself 20 times in the first year. No risks associated with production or quality. Everything front office. No headaches with certifications. No scrapping of a part if it’s incorrect. Gives the customer an instant boost to confidence: Salespeople love it, the finance guys see the revenue, while the operations guys don't even notice the difference. A win for the management that they can tout without worrying about anything else It creates cash and capacity room for all that comes afterwards. Phase 2 (Months 6-12): Shop-Floor Scheduling and Real-Time Capacity Optimization (AI that uses the confirmed order book, actual machine status, availability of tools, and skills of operators to create a schedule of the day/week dynamically) Why Next? Now, there are more orders than in the past (because of Phase 1), so the constraint shifts inside the shop. Builds directly on the quoting data – the same models can take advantage of the increased pipeline visibility. Moderate Risk: Doesn’t affect either NC programs or quality – it’s all about sequencing. Facile to implement parallel manual system for safety net. Payoff: 15-25% gain in OEE, reduced overtime, faster delivery, happy customers, and therefore easier future sales. Phase 3 (Months 12–18): Generating NC Programs and Prove them Out (complete AI-agent generated) NC program (CAM + Collision-Free Path Search by virtualization simulation) Why here in sequence but not before? High technologic risk: Incorrect process parameters leads to crashed €4M machine or €800k scrapped part. Requires mature process data from Phase 1 & 2, i.e., actual cycle times, tool wear patterns. "FAI (First Article Inspection) Buy off" needs customer buy-in, and we need wins before that can occur. Massive payoff: preparation time from weeks to couple of days, but again only worth the effort if we have adequate high margin orders coming in and money built up from earlier stages. Phase 4 (Months 18–24): In-process quality prediction using AI monitored SPC charts and automated inspection disposition (Vision + Sensor AI that predicts surface finish / defect risk and auto-release the good parts and hold bad once) Why last? This Has the highest Risk to Quality and certification - one bad release can kill a project / program. Needs a solid data pipeline from all previous phases (correct schedules and NC correct programs). Biggest Long term financial Impact: Would reduce the head count of people in our final inspection by 40% and cut lead time by a week, this will making us the preferred supplier of choice. How we decide the sequence deciding criteria. The rules we actually use a. Fast, safe ’wins’ first—front office over shop floor, no quality risk, delta shows up in <6 months. b. Follow the constraint - solve the present bottleneck before moving on to the next one. Do not optimize your programming until you have orders. c. Data dependency. Later phases require clean, good data produced by earlier phases. Garbage in, garbage out later. d. Organizational energy --- begin where the crowd will cheers, not where they will fight (sales cheers when quoting AI, operators are afraid of inspection AI). e. Cash and confidence snowball - each phase funds the next one and proves to the doubters that ‘this AI thing actually works here’. We did not follow this logic once – went directly to an AI-driven tool life predictor for the machines and it failed as it had poor uptake. Now we're religious about sequencing. It’s like this: Do it this way and it’s like we’re playing a series of games and we’re winning, while without it, it’s like Do it randomly, you will burn money and trust. That is the plan I am implementing – and sleeping better because of it.
-
Why Do Many AI Solutions Fail to Deliver the Expected Value?
Domain :- Aerospace Subcontract Precision Machining shop The particular AI task that has failed: The use of AI in the predictive management of tool life in our 5-axis milling cells (introduced mid-2024) The plan sounded awesome: sensors on spindles + data from spindle vibration and/or forces loaded into an AI that could predict when the inserts were going to fail and/or chatter, and then exchange the tools at just-in-time cycles, as opposed to at predetermined intervals. AI was trained with our tool life monitoring raw data. We did pilot run on 2 machines, it was grand success. Then, on two machines, we were able to cut tool use by 18%, unplanned stops by 60%. Surface finish / roughness problems were down. We rolled it out to the all high machine hour rate 28 machines. Six months later. The value had disappeared. Tool costs crept back up, unplanned stops re-emerged (different machines), and we actually rejecting a greater number of parts out of aggressive feeds the AI was optimizing for. Main reason for the failure was the inability to integrate the process + the failure to embrace the value by the people who matter most, the machinists and machine setters. The AI system was technically good (accuracy level around 88%), but: • It produced alert messages on a screen that nobody monitored during the course of the shift — the operators were too busy watching the machine cut physically to control feeds. • As soon as it called “change insert in 18 minutes,” the setter usually didn't comply with it because “the tool still looks fine” or “I don't want to stop mid-pocket and get a witness mark.” "Tribal knowledge" always overturn "AI" every time. • No feedback loop — if the operators chose to override the AI, the reason for the override was not recorded, so the system never learned from the experts. • "We never altered the bonus payments or daily objectives – to stop and alter tools 'early' meant he'd fail to reach his daily number of pieces, and you guessed it right what he would do." “The AI was dealing with a theoretical problem (problem of tool life) whereas the actual problem for the team was, ‘hit the numbers on the board today’ and every day.” What I Would Do Different Next Time And What We’re Currently Doing With the v2 Launch 1. Include the shop floor team from day one. ‘Machinists and setters are in the room when we define success. Their key performance indicator (safe, on-time parts with no drama) becomes the artificial intelligence system's top goal not just tool cost.’ 2. Engage in their workflow, not in parallel. Alerts are now giant flashing lights & voice notification on the machine HMI and tool change notice is a one-button acknowledge. No need for a separate dashboard. 3. Make adoption painless and rewarding o The daily target now includes “AI-guided tool changes completed”. o The “Override Button”: This will force a 5 - second voice note or reason into the data, and this data will go straight back for model retraining. Operator need not have to move away from the machine to feed this data. o Bonus pool is monthly and dependent on yields of entire cells rather than individual piece count. 4. Start small, proving value in their terms, and then scale We scaled back to 4 machines, operated for 3 months with new rules, then proved to operators that they can go home earlier on Fridays with higher-quality products. Only then we will scale to all machines. 5. Kill it quick, if it doesn't stick. Have a tough gate review, if adoption <80% after 60 days or yield isn't improving. We will pulling the plug and call it a day, no more wasting of efforts needed. Bottom line from the tool crib department, The issue was not the AI. It was us, The once who built the clever tool and stuck it in a vacuum and wanted people to dance around it. “Real value happens when the AI is incorporated into what people are already doing and make there life easy, not adding too many extra things to there already existing to do list.” "We are finally seeing the positive results of the v2 cells, specifically the reduction of tool cost by 14% and unplanned stops by 45%, and most importantly, the operators now are able to defend the system against management’s inquiries regarding the cost." "Lesson learned," Harris said. “If the individuals operating the machine do not love it, no model accuracy in the world can save you”.
-
Guidance on CAIPO and CAISA Courses
I am interested in pursuing both CAIPO and CAISA certifications. To gain the maximum benefit from these programs, I would appreciate your advice on the following: Order of Completion: Which course should ideally be taken first to build a strong foundation? Experience-Based Suggestions: If anyone has completed both, please share your insights on the sequence that worked best for you. Timing Between Courses: Would you recommend completing them back-to-back for continuity, or taking a break between the two to avoid cognitive overload? My objective is to learn effectively without feeling overwhelmed. Any recommendations or best practices would be highly valuable.
-
How Do You Prove That an AI Solution Is Actually Creating Value?
Domain: Precision Machining + Special Process Aerospace supplier for Big OEM’s The specific AI solution: AI agent developed for supportitng quoting, bid / no-bid decision for incoming RFQs (rolled out at the beginning of Jan 2025 – current running) Everyone from Top management to middle management loved it from day one of going Live after 6 month of trials. Quotation process that used to take us about 8 – 14 days now completed in 4–6 hours. The tool is so close to accurate on cycle times for each operation (trained based on historic data), sales team is happy. But during Board meeting came a killer question from corporate CFO, three months in: “Nice toy you have installed, but is it actually making us more money or are we just padding our backing using fancy terms AI-Enabled quoting? Can any one tell me what is a bottom line impact.” A Team was defined to analyse and report to Top management if there is any bottom line impact after installation of this so called new AI Agent. How we defined “real value” after several rounds of some serious discussion (beyond speed or accuracy) We agreed on three hard business outcomes (all about money) , not tech KPIs: Revenue (Trun Over) impact — more won contracts from RFQs we actually chased for high margin once. Margin protection — We don’t win more junk orders that makes us operations loses. Capacity utilization — Better machines allocation we work only on more profitable work (no more using of 5 - Axis machines for low-margin jobs). Everything other KPI (response time, prediction accuracy) is just a leading indicator to make us money. How we measure now and track its perfomance — brutally practical We created a simple monthly “AI Value Dashboard” that nobody can argue with: Baseline period : Jan – Apr 2024 (pre-AI installation) Post-AI: from Feb 2025 onward (gave one month ramp-up time) Some key metrices we track every month: RFQ hit rate (Quotes submitted / RFQ’s received) Pre – AI installation: approx. 38% (we cherry - picked because our quoting process was slow) Post-AI: 88 – 94% → We now chase almost everything viable finally decide what to Bud and what to No-Bid. Win rate on submitted quotes Stayed stable at 31 – 34% — good sign we’re not just spraying low prices to win more otders. Revenue from new won awards resulting to faster quoting We tag every new won contract with “would we have quoted in time without AI? Yes/No/Maybe”. 2025 YTD: €19.8M in awarded backlog that sales admits we would have missed pre-AI. Average margin on AI - quoted wins vs pre-AI baseline Pre-AI average gross margin: 24.1% Post-AI: 24.6% (slightly higher because we kill lo margin quotes faster and have room to play with prices on tight-capacity periods or trighter capaity machies). Machine utilization on high margin vs low margin work We dis bucketing each job into margin criteria. Top priceed machine utilization up from 42% to 58% of total spindle hours — because we can now say no to less margin jobs without leaving machines idle. What convinces me the AI deserves continued investment (and more budget) We set a simple rule: if the new AI tool pays for itself 5× per year, it stays and gets all the needed upgraded and maintenance (licnece cost so on). 2025 numbers so far (10 months in): Total cost of the tool so far (license + internal resource time + training): ~€380k Direct Sales revenue Boost (very conservative estimatiom): €19.8M backlog → ~€6.8M gross profit at current margins, Plus approx. €1.2M from capacity allocation (higher-margin products mix). We have choice what to go for. That’s already >20× ROI in year one. If next year the revenue bosst drops below €3M or gross margins start detoruting , we will kill the AI agent or fine tune or replace it. Bottom line from the shop floor We don’t care if the AI is “intelligent.” All we case is is it adding more profitable high margin work for the machines and keeps the bad low margin stuff out. Money in the bank is the only proof that matters. Everything else is just a slide deck.
-
When Should AI Slow Down Instead of Acting Fast?
Domain : Semiconductor - Wafer Fabrication (Front-End Processing) Particular Process: Out of Control Action Plan (OCAP) response when a Statistical Process Control (SPC) Chart indicates a process violation Within a Wafer fabrication plant, every single lot is monitored by hundreds of SPC charts on wafer thickness (7nm, 5nm, and 3nm), critical dimensions, distinct & condensed particle counts (CPC / DPC), defect density and so on. Once the chart gets out of control (a red point in a control chart), the AI immediately starts the hold on the lot and triggers an escalation alert. A quick disposition is needed: continue, rework, scrub the wafers, or shut down the tool permanently. Insane amounts of money are at stake; a single bad decision can literally scrap €800k – €1.2 Million worth of wafers for a single lot, contaminate a €15Million EUV tool, or ship defective wafers that in six months fail on car airbag controller. We piloted an AI OCAP assistant in 2025 From Jan 2025, we have a installed a AI agent acing as a OCAP assistant / guide. It does a live monitoring with a hawk eye on all the SPC charts. Violations data analysis, pulling metrology data, analysing tools history maintenance logs, upstream / downstream lots, maintenance history of all the tools and finally suggesting the most likely root cause + recommended further action within 2 minutes instead of normal 20 - 90 minutes of people running around like a crazy neckless chickens. AI judgments are brilliant in 80 - 85% of the time: quite common causes like photo resist, wafre thickness drift or a contaminated dirty APC valve get pinpointed right away and always re-open the lots a little earlier. But acting in a rush way here can be absolute catastrophic disaster. So we deliberately trainined our AI agent, when to slow the hell down and ask for a back up. How does AI Agent know when to Pause or Escalate or Not Act Fast Enough? We built explicit “slow down” logic based on risk severity: 1. High impact will get triggered and immediately escalated to human. · Any violation onto the safety critical features (e.g. automotive qualified lots, known zero defect programs). · Defect density issues or particle count (contamination really spreads fast) that influence the laser path. · EUV or High-NA EUV tool flags (one wrong call can cost millions). · New issues pattern (AI minimum confidence <70 or not have a similar case in history). → AI output: "High risk execution detected. Recommending full engineering review. Effected lot remains on hold until engineer manuallly sign-off." 2. Medium: AI proposes recommendation, others to confirm. · Known issues on high-value parts lot in excess of €600k worth of material at risk. · Flags a multitude on the same tool simultaneously → The AI has a recommendation, but 2 qualified engineers must review and co-sign within 30 minutes before release. 3. Low risk - disposition can be done automatically by AI. · Clearly known repeatable issues with historical confidence dri-f >95%+ (known chamber seasoning drift after PM). · Low at-risk material costs (<€200k)→ Fast path, lot released in fewer minutes. These early warning signs are programmed now in process: · Anomaly score > threshold (not matching any previous pattern). · Conflict in data sources (e.g. in-line metrology says part is bad, but previous process step says it's perfect). · Within 48 hours of Tool recent maintenance or re-qualification, signifying abnormality created. · Execution occurs on a lot that has already had one disposition in same run (second strike). Why Does This Work in Practice? About 70% of the executtions are typically repetitions; these are handled significantly and quickly by AI and kept the engineers from drowning in a sea of silly false alarms. The 5-10% dangerous ones are purposely kept on hold for human to review om what has to be done, pull some wafers for review, re-run few extra tests and maybe even call the customer. There's been 2 other times the AI kicked out some "high risk -- escalate," and it saved us. · Contamination from a new photo-resist batch that looked "normal" to the model but kicked up the novelty score. · Another was a subtle drift in EUV focus that would cause the scrapping of 24 wafers if it was auto-released. The bottom line from fab shop floor In semiconductor fabrication speed is like a lifeline to increase throughput... but one rushed bad call is death to yield, trust and sometimes safety of the end customer. Smartest thing we did was not teach the AI to think faster, but when to say, "I'm not sure, get a human now." Because no model is perfect, wafer lot which is at stake isn’t the place to gamble on 85% confidence.
-
When AI Speeds Up Decisions, Do We Risk Making Worse Ones?
Domain: Space subcontractor Machining – Bidding for New RFQs, Making Bid Decisions, or Not Bidding on New RFQs Specific process:- Deciding on pricing for a new customer RFQ, and whether to submit that RFQ to others. In our make-to-print company, RFQs show up without any schedule-not once in a week and sometimes as many as all 30 at once. Each is a 50-200 pages drawing pack, having specification for highly complicated titanium (TI 64) or aluminium parts (AISi10mg) with very close tolerances from 5 microns (long term contracts from 5 to 10 years and penalties that could wipe out all two years profit). Before AI, a bid / not bid + full quote took our price team approx. 8 – 14 days: • Well, manual review of features such as holes, deeps pockets, complex contours. • Cycle-time estimations based on similar previous jobs, loop path calculation in excel. • Risk assessment for newer special processes. or forged materials • Margin discussion with Top management. Often we used to miss the 10-day quoting window mentioned by the customer, and lost it automatically. So in early 2025, we built in our AI quoting assistant, and trained it on 18 years worth of quotes, actual cycle times, scrapping rates, tool wear, and profit outcomes. The same RFQ can now be analyzed in less than 1 day: • Auto-feature recognition (part size + stock) from 3D model. • Instant cycle-time prediction (±6% accuracy vs. actual) • Risk score for new tolerances / materials based on historic rejection trend of product group. • Suggested price range with simulated margin scenarios. It really brought the turnaround time to within the same day instead of weeks. Winning rate on chased bids 34% up with a healthier revenue pipeline. Where speed dramatically improves outcomes • We can chase 3 times more RFQs without adding new head count. • Faster feedback to customers builds reputation ("these guys always respond quickly"). • Early visibility lets us spot capacity gaps and adjust loading. • Management can decide to say no to low-margin orders immediately, thus freeing capacity for better-margin work. Unforeseen real risks and blind spots introduced by the new speed 1. Overestimation of AI risk score Even with historical data, the model will say "low risk" because of something from prior experiences. However, if the RFQ has a new geometry nobody ever saw with the deep pocket (High Depth), a new alloy variant (3D printed / forged), the AI would say "GO" because it never saw it fail before. Humans would pause to call the Senior metallurgist, but right now there is pressure on the down button to "Approve" same-day. 2. Margin deterioration by speed pressure Sales loves the velocity of things and starts to pressure, "just take the AI's predicted middle price." say two contracts were awarded already, 4 - 6% below a cautious human judgment would have accepted profitable, but thinner than comfortable. 3. Loss of tribal knowledge The senior estimators would spend hours on a single RFQ arguing and surfacing war stories ("remember that 2019 batch with the same radius? Cracked on every part"). The AI has the data but not the nuance or the fear memory. Speed kills such critical conversations. 4. Customer playing on the system Primes were realising we are responding ridiculously fast now. One has started sending "fake" RFQs with impossible tolerances just to force us to burn engineering time analysing them. Minimum quote from 3 suppliers tying up our pricing team while negotiating with slower competitors. Practical safeguards we have currrently implemented 1."human pause" compulsory on high-risk flags. If AI novelty score >35% of raw data used for training or predicted margin <12%, it goes automatically into a 2-hour cross-functional review (engineering + costing + operations). No same-day bids allowed. 2. Bid-win / loss auto-corrections Every awarded contract goes through a 30-day actual-verses-predicted evaluation by CFT Team. Whenever the AI went wrong by more than 10% on cycle time or risk, we feed that back into AI model and change weights. Keeps humble the AI model. 3. Dual track quoting for the big programs For any lifetime value greater than €15 million, we keep AI and traditional human estimates going in parallel for the first three months. Forced calibration and kept old tribal knowledge alive. 4. Clarity of velocity quota, not unlimited Sales bonus capped at the number of quotes per month now will not allow their flooding the system with trash RFQs, just because we can handle the volume. The bottom line of the quoting desk Yes, there is a superpower in the speed brought by AI; it records works that we would totally miss otherwise. But the real risk is neither with bad data nor any other, but with human temptation toward treating "fast" as the equivalent of "good enough" and stop thinking. Not to slow down AI, safeguards are there but to ensure that we, bottlenecks, do not run ahead of one's judgment.
-
When AI Removes One Constraint — Does It Create Another?
Domain: Make-to-Print Aerospace Manufacturing – Precision machining of titanium and Inconel components for engine and structural programs (€380M turnover contract manufacturer, three plants, serving GE, Pratt, Rolls, Safran) Specific process: Internal Sales Order Intake & Technical Review (from customer RFQ arrival to firm order release into production planning) In TOC terms, for years the system constraint was clearly internal — our machining capacity (especially 5-axis milling Work centre of hard metals and creep-feed grinding). We were always full: queues at the machines, overtime every weekend, premium freight to recover late jobs. The plant was the bottleneck; sales could have sold twice as much if we only had the hours. We introduced AI in 2024 to relieve that constraint: Generative toolpath optimization + predictive tool-wear models Real-time dynamic scheduling with AI drum-buffer-rope On-machine probing + auto-compensation loops Result: protective capacity exploded. Throughput up 42 %, spindle exploitation from 68 % to 89 %, overtime hours almost zero. For the first time in 15 years, the machines were waiting for work. We celebrated for about three weeks. Then the constraint shifted completely — and a new internal one emerged that had been comfortably hidden. The new system constraint is now internal order intake velocity: the time from customer RFQ drop to technically clean, priced, and contractually accepted order in the system. Why this happened: With capacity suddenly available, sales went hunting — RFQ volume tripled in six months. Every RFQ for make-to-print parts needs detailed technical review: feasibility check on tolerances, material certs, NDT requirements, special processes (heat treat, shot peen, coatings), long-lead raw material verification, and cost build-up. That review is done by only 8 senior application engineers (the ones who really understand the risk of a 0.012 mm wall thickness or Inconel 718 creep). The process is still mostly manual: Excel sheets, emails to suppliers for raw-material quotes, phone calls to plating vendors, copy-paste into ERP. No systematic follow-up on lost quotes (we never learned why we lost) Pricing policy stuck in 2018 cost-plus mindset while customers wanted fixed-price multi-year deals Suddenly we had machines starving because a €9M order was stuck for 11 weeks in technical review while the engineers drowned in 180 open RFQs. The AI removed the production constraint… and exposed that our front-end commercial process was never designed for high velocity. It had been happily hidden behind “we’re full anyway.” The completely new internal constraint that emerged A knowledge & decision bottleneck in the application engineering team — plus a cultural one: engineers were rewarded for accuracy, not speed. Saying “yes, we can do it in 26 weeks” too fast felt risky when we used to be capacity-limited anyway. Early signals that screamed “new constraint in the house” Machine utilization plateaued and then started dropping despite zero technical problems. Sales started complaining “we’re losing bids because we’re too slow to respond” — quotes going out in 10–14 weeks while competitors replied in 3–4. Application engineers working weekends just to clear the inbox — same overtime pattern, different people. Marketing brought in two huge new opportunities… that sat unsigned for months because “technical review pending.” Cash flow dipped — we had capacity, but no new orders flowing into production. What we’re doing now (still mid-journey) Built an AI-assisted RFQ triage: auto-feasibility check on 70 % of standard features, raw-material price pull from historical + supplier APIs. Created a red-tag “fast lane” for strategic customers (buffer management for quotes). Changed incentives: engineers now measured on quote velocity AND win rate. Hiring + cross-training three more seniors. TOC reality check When you finally elevate the plant constraint you’ve chased for a decade, the system doesn’t thank you — it immediately promotes the next weakest link. In make-to-print manufacturing, once the machines stop being the bottleneck, the real fight moves upstream to how fast you can say a qualified “yes” to the customer. We fixed production… and discovered our commercial process was the new silent killer. Watch for idle capacity in a shop that used to be slammed. That’s the system whispering: “Congratulations. Your new constraint has arrived.”
-
⛓️ Can AI Identify the Real Constraint in a Process Better Than Humans?
Domain: Aerospace Manufacturing – Machining of large monolithic structural parts (wing ribs and spars) Specific process: High-speed machining cell for aluminium/lithium wing components In Theory of Constraints (TOC) terms, this cell is a classic internal constraint: five 5-axis machines feeding final assembly, €180k raw billet in, €1.2M finished part out. Everyone “knows” the constraint is spindle uptime — tool wear, long setups, or waiting for NC programs. Teams have spent years debating it in VSM workshops: • Machinists blame tool presetter delays. • Engineers blame conservative feeds/speeds “to protect surface integrity”. • Planners blame batch-size policies (we run 4–6 parts per setup because “that’s how we minimize risk”). • Management blames lack of weekend shifts. We chased each rabbit hole with kaizen, SMED, better tooling — gains of 4–8% each time, then plateau. How AI finally identified the real constraint (real pilot, 2025) We connected an AI TOC agent to: • Machine PLC data (spindle load, overrides, alarms) • Tool-life logs • NC program metadata • ERP scheduling • Inspection results It ran for three weeks, then reported calmly: “The true system constraint is not spindle time. It is the policy-mandated minimum batch size of 4 parts + mandatory full roughing-to-finishing cycle on the same machine. This creates artificial inventory of semi-finished parts (average 11.8 in queue) and protects no one — 96% of surface defects occur in the first 40 minutes of finishing anyway. Real protective capacity is only exploited 28% of the time. Drum-Buffer-Rope would increase throughput 34–41% with zero new CAPEX.” We were stunned. The “constraint” everyone fought over was a symptom. The real one was a 20-year-old policy born in the low-rate 1990s, religiously protected because “certification freeze” and “that’s how we’ve always done high-value parts.” Where AI outperformed humans 1. Perfect memory & pattern recognition across thousands of cycles — humans remember the dramatic breakdowns, AI sees the quiet starvation/excess every shift. 2. No emotional attachment to pet theories — engineers defended their feeds/speeds like family. AI just showed the data. 3. Holistic view — it connected policy → queue → effective capacity in seconds, something no cross-functional workshop ever managed without politics. 4. Simulation of “what-if” policy breaks — AI modeled DBR in hours; humans would have needed months and a consultant. Where AI still struggles (and needs humans) 1. The policy existed because of a real 2009 incident (one bad batch cost €4.2M). AI saw the data pattern but couldn’t feel the scar tissue — humans had to decide if the risk context had truly changed. 2. Elevating the constraint required negotiating with the customer (Airbus) to accept single-piece flow certification. AI can’t drink coffee with the chief engineer and build trust. 3. AI flagged the constraint perfectly… but implementing the exploit (DBR + buffer management) still needed humans to redesign the floor, train operators, and handle the fear of “running unprotected.” TOC verdict Humans are brilliant at protecting capacity and exploiting what we believe the constraint is. AI is brutally honest at identifying what the actual constraint is — especially when it’s a sacred policy disguised as protection. In our case, the AI didn’t just find the constraint faster; it found the one no human wanted to admit existed. Now we’re running single-piece flow on that cell, throughput up 38%, and the old policy is in the museum next to the floppy disks.
-
Amazon vs. Toyota — A Powerful Contrast in Velocity and Scalability
Amazon’s trajectory isn’t just a numbers game — it’s a mirror for every ops leader staring at a stable-but-stagnant plant, wondering why the disruptors are lapping us. I’m in aerospace, cranking out precision actuators and landing gear for the primes. We’ve got Toyota-level discipline: zero-defect cultures, takt-driven flow, kaizens that could fill a warehouse. But benchmark Amazon (we do, obsessively), and it’s like realizing your finely tuned assembly line is a horse-drawn cart next to their hyperloop. Toyota’s at ~$255B market cap, a century of Lean mastery. Amazon? $1.92T as of today, starting from a garage in ’94. No factories, no legacy — just a machine built to reinvent and scale at lightspeed. Here’s why it lands like a gut punch for traditional ops. 1. Scalability is not an outcome — it is an operating philosophy. Toyota scales by adding plants and perfecting the next shift’s handoff. Amazon? They engineered “mechanisms” from the jump — APIs that glue fulfillment centers to warehouses to drones without a single email chain. It’s why Prime scales to 200M members without breaking a sweat. We tried copying this last year: our old siloed MRP system choked on a new program’s data flood. Switched to a modular API layer (Amazon-inspired), and suddenly three sites sync inventory in real-time. No more heroic spreadsheets. But damn, it took admitting our “stable” architecture was the bottleneck. 2. Customer obsession drives continuous reinvention. Toyota obsesses over the operator and the machine — flawless output. Amazon? It’s the end-customer’s whim that rewires everything overnight. “Two-day shipping too slow? Build AWS logistics on top.” We’re customer-obsessed too (FAA certs demand it), but our reinvention cycles are 18 months for a fixture tweak. Amazon iterates weekly. Lesson hit home during a supply crunch: instead of tweaking our vendor scorecard (Lean 101), we rebuilt the whole RFQ process around predictive customer demand signals. Lead times dropped 28%. Feels heretical — but the prime loved it. 3. Amazon treats speed and learning as assets. Toyota’s PDCA is poetry for steady gains. Amazon’s is a blitz: test, fail, ship, repeat in days. Their A/B frenzy turned a bookstore into a cloud empire. In our world, “fast” means rushing a prototype under audit gun. But watching Amazon’s flywheel — data in, experiment out, velocity up — we piloted a digital twin for our test cells. Feedback loop: hours, not weeks. Caught a vibration flaw pre-production, saved €4.2M. Speed isn’t reckless; it’s the moat. We’re learning it the hard way. 4. Digital leverage multiplies growth beyond physical limits. Toyota adds a line, output ticks up linearly. Amazon layers code on code — one algo tweak ripples to billions in throughput. Exponential, not additive. Our physical limits (machine hours, cert stamps) feel quaint next to that. We’re dipping toes: AI-driven predictive maintenance scaled our uptime 17% across two plants without new CAPEX. But true leverage? When we link it to customer flight-hour data for proactive redesigns. That’s Amazon math: every gain compounds. The big insight nails it: Toyota mastered the industrial era — discipline turning waste into flow. Amazon hacked the digital one — mechanisms turning reinvention into infinity. Both legends, but Amazon’s $1.7T head start screams: design for scale, or watch it eclipse you. In aerospace, we’re stable fortresses; they’re shape-shifting fleets. Time to stop fortifying the walls and start coding the drawbridge. (As someone who’s toured both a Toyota plant and an Amazon FC — the contrast is visceral. Velocity isn’t coming; it’s here.)
-
Tesla’s Growth Velocity — A Dramatic Illustration of Reinvention + Scale Over Legacy
Tesla’s rise isn’t just a tech story — it’s a brutal wake-up call for every manufacturing operation I’ve ever walked through. I’m in aerospace, building landing gear and actuators for the big primes. We’ve got Lean down to a science: 5S that shines, takt times you could set your watch to, kaizens that stack up like cordwood. But watching Tesla from the outside — and yeah, we benchmark them obsessively — it’s like staring at a Ferrari while you’re still tuning a reliable old diesel. The numbers don’t lie: from $1.7B IPO in 2010 to $1.49T today (as of this week), while Toyota’s sitting at around $257B despite a century of building cars that don’t break. Tesla’s worth nearly 6x Toyota. In 15 years. That’s not luck; that’s a system designed to reinvent and scale while the rest of us polish what works. And here’s the gut punch for ops folks like me: Tesla didn’t beat Toyota at making cars. They beat them at making a machine that learns, adapts, and grows faster than any legacy supply chain can match. 1. Reinvention beats optimization when the environment shifts. Spot on. Toyota’s a master at refining the assembly line — TPS is gospel for a reason. But Tesla asked the real question: “Why build a car like it’s 1913?” They threw out the drivetrain rulebook (no more ICE complexity), made software the core (OTA updates mean your “finished” product evolves post-sale), and turned the Gigafactory into a software-defined beast where robots learn from every weld. We tried something similar last year on a new actuator line: instead of tweaking our 20-year-old manual jigs, we ripped it up and went full cobot with adaptive fixturing. Lead time dropped 72%, but it took six months of soul-searching to even greenlight the demo. Tesla does that quarterly. 2. Speed of learning became a competitive weapon. This is where Tesla humiliates the old guard. Their experimentation cycles are measured in weeks, not years — crash a prototype? Data floods back, AI crunches it, next build iterates overnight. Toyota’s PDCA is elegant, but it’s built for stability in a predictable world. Tesla thrives in chaos: remember how they scaled Cybertruck ramps amid supply hell? Continuous updates via software mean they’re always shipping “version 1.2” while competitors are still certifying v1.0. In our world, EASA certification takes 18 months for a minor change. Tesla’s equivalent? Push code, monitor fleet data, fix in the next build. That learning velocity compounds — one breakthrough feeds the next, turning a startup into a trillion-dollar force. 3. Scalability was baked into the system from Day 1. Tesla didn’t bolt on growth; they engineered for it. Vertical integration (batteries to seats) means no chokepoints in a Tier-1 web. Platform thinking: one Gigacasting die for multiple models, software that scales across the fleet. Result? They went from 500k cars in 2020 to over 1.8M in 2024 without the usual margin erosion. Toyota scales beautifully too — global plants humming in sync — but it’s additive, not exponential. Tesla’s system gets stronger with size (more data = smarter AI = cheaper per unit). We’re finally piloting something like this: a modular test cell that auto-configures for three airframe variants. Early wins: 40% less setup time. But scaling it plant-wide? We’re still debating the CAPEX while Tesla’s already on factory #7. 4. The market rewarded capability, not history. Investors saw Tesla’s OS — that flywheel of reinvention + data + scale — and bet the farm on future output, not past trophies. Toyota’s legacy buys loyalty and reliability, but markets crave growth stories. Tesla’s valuation screams “this machine can 10x again.” It’s why startups are nibbling at our edges in eVTOL: they’re not optimizing our cert processes; they’re reinventing flight controls from scratch. The big lesson hits home hard: We in traditional ops worship the present — make it stable, make it flow, make it waste-free. Tesla worships the horizon: reinvent ruthlessly, learn at warp speed, scale like it’s inevitable. Toyota built excellence over generations; Tesla hacked velocity to leapfrog it. Both matter, sure — I’d take Toyota’s supply chain reliability in a heartbeat for mission-critical parts. But only Tesla explains why a 100-year icon is now the underdog in valuation. In our plant, we’re starting to whisper about “Tesla audits”: not copying the cars, but stealing the mindset. Question every fixture. Iterate weekly, not yearly. Design for 10x volume from blueprint one. Because if we don’t, some clean-sheet disruptor will. The world’s moving Tesla’s way. Time to stop admiring the rocket and start building one. (As an aerospace guy who’s toured a Giga — yeah, it’s terrifyingly real.)
-
Stabilize → Reinvent → Scale: The Three-Mode Model for Modern Operational Excellence
Strongest: Stabilize We are world-class at it. Aerospace forces you to be: zero defects, full traceability, AS13100 (AS9100 + AS9145) audits that feel like a colonoscopy with PowerPoint. We can take the most chaotic supplier disaster, lock it down with control plans, PFMEA, 8D, layered audits, and make it run like a Swiss watch at 99.97 % quality in three months. Visitors walk out saying “this place is scary clean and disciplined.” Hardest (by a mile): Reinvent We suck at killing our darlings. Real example that still hurts: We have a 25-year-old manual final-assembly line for landing-gear actuators. It is stable, profitable, certified, and produces 180 units/month with 100 % on-time delivery. Everyone knows it cannot scale to the 500 units/month the new electric-aircraft programmes need in 2028. The fixtures are hard-steel monuments, the work instructions are 180-page printed books, and the layout is literally cast in the concrete floor. For three years we have run kaizen events, SMED projects, digital work instructions, cobot pilots… and we are still stuck at ~210 units/month with heroic overtime. The honest answer is: the line needs to be thrown in the scrap bin and replaced with a fully automated, modular, robot-tended cell. But nobody dares to pull the trigger because: CAPEX €38 million 18-month lead time on the new machines Certification risk Current line is still making money today The plant manager’s bonus is tied to this year’s EBITDA, not 2029 volume So we keep “stabilizing” a dying architecture and call weekend overtime “operational excellence.” We are amazing at making the horse faster. We are terrified of shooting the horse and buying a rocket. That’s why Reinvent is our weakest mode — and why the new entrants with clean-sheet factories are going to eat our lunch in five years.
-
When Incremental Improvement Fails: Signals That a System Needs Reinvention
I’ve lived through signal #5 (“Scaling exposes weaknesses rather than amplifies strengths”) and it almost killed us. Aerospace Tier-1, European site, machining large monolithic aluminium wing-ribs for the A350-1000. For 12 years we had the most beautiful, textbook Lean cell you’ve ever seen: One-piece flow 7-axis machines with auto-tool change OEE 89–91 % 38-second takt Zero inventory between ops Gold-level 5S Kaizen board full of green stickers Everyone came to benchmark us. We were the poster child. Then in 2023 Airbus asked us to triple the shipset rate for the -1000 (from 4 to 12 shipsets/month) on the exact same footprint and headcount. We said “no problem, we’re Lean!” and started the usual improvement circus. Result after 9 months of heroic effort: OEE dropped to 62 % Overtime went from 4 % to 38 % Quality escapes tripled Lead time went from 11 days → 34 days People were crying in the canteen Scaling didn’t amplify our strengths — it exposed that our entire architecture was secretly designed for low-rate, high-margin production, not high-rate anything. The hidden 1990s assumptions that broke us: Tool-life was calculated for 4 shipsets/month — at 12 shipsets we were changing inserts every 40 minutes instead of every 6 hours. Fixtures were designed for manual load/unload — at triple rate the operators couldn’t keep up physically. CNC programs were 48 000 lines long with zero modularity — every new rib length needed 3 weeks of re-posting. Quality inspection was 100 % manual with calipers and templates — we simply didn’t have enough inspectors on the continent. We had spent a decade making the wrong system go faster instead of building a system that could scale. The moment leadership finally admitted we had to stop improving and start reinventing: The plant manager walked past the cell at 2 a.m., saw 42 machined ribs waiting for inspection because we only had 9 qualified inspectors in the building, looked at me and said: “If we do one more kaizen on this cell I’m going to burn the suggestion box.” Two weeks later we killed the sacred cell. We threw away €9 million worth of fixtures, bought two new 5-axis machines with robotic loading, rewrote every program in modular sub-routines, moved to 100 % on-machine probing, and turned the whole thing into a flexible transfer line. 18 months later we’re shipping 13 shipsets/month with lower headcount and OEE back above 90 %. The advanced leadership capability is exactly this: Knowing when to stop polishing the old cathedral and start demolishing it to build an airport. Most leaders never develop it because demolition looks like failure on their yearly scorecard. The ones who do develop it usually have to watch their people cry at 2 a.m. first.
-
What Separates High-Velocity Companies Like Tesla & Amazon from Classical Lean?
In my view, the hardest one to develop inside traditional organizations is by far #1: the muscle to question the very existence of a process instead of just improving it. I’ve watched both studied and worked with companies that tried to copy Tesla and Amazon from the outside, and every single time the conversation goes exactly like this: Traditional company: “We need to be more like Tesla/Amazon. Let’s launch a kaizen blitz on purchase-order approval!” Tesla/Amazon mindset: “Why does a purchase order even exist? Why isn’t money moving the same way code moves in a microservice?” And the room goes dead silent, because questioning the sacred process feels like blasphemy. Real examples I’ve seen that show the gap: A classic automotive OEM spent 14 months and €28 million optimizing their 42-step change-management process down to 11 steps and 18 days. Tesla simply eliminated engineering change orders completely for most running changes — software pushes the new parameters directly to the line robots overnight. No paperwork, no committee, no 18 days. A 100-year-old industrial conglomerate proudly reduced invoice approval from 12 days to 4 days using RPA and Lean. Amazon’s supply chain doesn’t send invoices for most vendors at all — payment happens automatically on receipt because the contract says “if the barcode scans and weight is within ±0.5 kg, you get paid in 48 hours.” The process disappeared. Traditional organizations are full of extremely smart people who have spent careers becoming world-class at making the current game faster. Asking them to stop playing the current game entirely triggers every corporate immune system at once: “But audit will kill us” “But ISO/TS/IATF requires it” “But we’ve always done it this way and nothing bad happened” “But who signs if there’s no signature?” So they keep polishing the combustion engine while the other guy is already selling electric skateboards. The other two differentiators — rapid learning cycles and designing for scale — are actually learnable with enough pain and leadership will. I’ve seen decades-old companies adopt two-week sprints, OKRs, or platform architectures and get huge gains. But the “Why does this even exist?” muscle almost never survives contact with middle management, compliance teams, and external auditors. It requires a level of psychological safety, founder energy, or near-death experience that 99 % of established corporations simply do not have. That’s why most “Digital Lean” or “Lean 4.0” programs eventually stall — they get really good at making the legacy processes run faster, but they never dare to delete the legacy process itself. And that single difference is what keeps the gap between the Teslas/Amazons and everyone else permanently wide.
-
Why Do Some Organizations Plateau After Traditional Lean?
Yes, I’ve seen it. And I can tell you the exact moment I knew we had hit the Lean Plateau. Plant: Czech Republic, cockpit modules for VW and BMW, ~€420 M turnover. We had been “Lean” for 12 years: 5S scores 4.8/5, zero inventory between stations, 38-second takt, SMED under 8 minutes, 4 800 kaizens in the last five years, every operator trained in Jidoka and standard work. Everything looked perfect on paper like a Toyota showroom. Then in early 2024 we won the new BMW iX platform… and the whole system froze. What I noticed first (the canary in the coal mine): Our OEE stopped moving. It was stuck at 86.4–86.9 % for 14 straight months, no matter how many morning kaizen bursts we did. But the scarier signal: every new product introduction was taking longer than the previous one, even though the products were 94 % carry-over parts. That’s when I knew the game had changed. The real reasons we plateaued (not the ones we wrote on the A3): We had eliminated every gram of classic waste… but the new bottleneck was waiting for digital approvals. A €180 k cockpit set sat 22 minutes on average waiting for the quality gate to be released in SAP because the MRB engineer was in a Teams call. Andon cord → zero help. Problem was not on the floor, it was in Outlook calendars. Our standard work was written for one OEM, one colour, one set of options. When BMW added 38 new interior combinations, the “standard” work became 38 different standards. Operators were spending more time reading the screen than welding. All our visual management boards were still paper or static digital photos. When we run three different models on the same line in one shift, the kanban cards literally didn’t fit on the board anymore. The biggest one: we had removed every buffer… which means we now had zero slack to learn. Any experiment, any new fixture trial, any supplier deviation instantly risked the customer line. So we stopped experimenting. Kaizen died of fear. We had become a perfectly tuned machine… for yesterday’s world. The system was stable, predictable, and fragile as hell. What finally cracked it open We brought in a small “complexity & velocity” team (three guys nobody wanted) and gave them permission to break Lean rules. In six months they did three heretical things: Re-introduced deliberate micro-buffers at two stations (gasp!) Replaced paper standard work with live digital work instructions that auto-update when the model changes Created an AI that pre-releases 90 % of quality gates automatically (the MRB engineers now only see the spicy 10 %) OEE jumped to 92+ %, new model ramps dropped from 14 weeks to 5 weeks, and kaizen ideas are back up 3× because people aren’t terrified of stopping the line anymore. Lesson learned the hard way Traditional Lean gets you to the door of world-class. But once you’re there, the next enemy isn’t waste; it’s rigidity. The plateau doesn’t feel like chaos. It feels like silence: perfect 5S, perfect attendance at stand-up meetings, and zero real progress. That silence is the sound of a Lean system that has run out of future.
-
Why Modern Operations Compete on Velocity and Scalability — Not Just Efficiency
Which one is harder for us right now? Scalability. Velocity is painful, but scalability is currently kicking our ass. We’re a €1.1 bn aerospace Tier-1 with five plants in Europe and two in Asia. For the last three years we’ve been obsessed with velocity (daily/weekly firefighting to hit ever-tighter customer schedules). And we actually got pretty good at it: Lead time for cockpit modules down from 18 days → 4.2 days Changeover times slashed by 68 % Daily management boards, AI short-term forecasting, morning “war room” calls → we can react stupidly fast now. But every time we win a new program (which happens every 6–9 months now), we hit a volume ramp, or add a second OEM on the same line, the whole system starts coughing blood. Concrete examples of where scalability is currently killing us: Our “super-fast” digital line-control system was built for one plant, one product family, one OEM. When we tried to copy-paste it to the Malaysian plant for the new BMW platform → six months delay and €14 m extra cost because the local MES, labelling rules, and union shift patterns were different. Our golden AI that does 30-second call-off negotiation with VW (the one we bragged about in the last contest) completely chokes when we add a second OEM with a different data format. We literally have to run two parallel instances and a human translator in between. Velocity inside one relationship = lightning. Scaling to two relationships = stone age. Quality system & MRB AI works perfectly at 1 200 parts per week. At 2 400 parts per week the concession approval queue explodes because the stress engineers still have only 24 hours in their day. We can sprint like crazy for one customer, one site, one product. But the second we have to do the exact same sprint in parallel for three customers, four sites, five product families → everything cracks. Velocity we have learned to hack with brute force and caffeine. Scalability requires us to throw away half the stuff we just built and design it properly from the beginning (modular, multi-tenant, configuration-driven instead of hard-coded). And that feels like open-heart surgery while still running the marathon. So right now scalability is the bigger beast. We’re fast… until we try to be fast everywhere at once.
-
Can AI Reveal Operational Assumptions We Didn’t Know We Had?
Domain: Aerospace Manufacturing – Supplier Nonconformance (NC) Disposition in the MRB (Real case: European Tier-1 structural machining site feeding Airbus A350 and Boeing 787 frames) The sacred weekly ritual Every Thursday at 14:00 sharp, five of the greyest, most expensive heads in the building lock themselves in the MRB room for 4 – 6 hours. On the table lies €4 – 6 million worth of titanium and aluminium parts that some poor supplier dared to machine 0.07 mm out of tolerance. The unspoken, holy assumption that has survived since the days of the Comet: “Only a human stress engineer with 25+ years, a bad back and a caffeine addiction can be trusted to say whether this six-figure part lives or dies. FEM + MIL-HDBK-5 + pure gut feeling = gospel.” Enter the AI agent – January 2025 (dramatic music) We fed it 19 years of MRB records, FEM results, every concession, every cracked part in the fleet, the full CATIA models, the measured deviations… basically the aerospace equivalent of giving a teenager the keys to the liquor cabinet. After exactly 4 weeks of silence, then AI slides into the Teams chat at 07:12 on a Friday morning with the calmest career-ending message you’ll ever read: “Good morning. Analysed last 512 MRB cases, Findings: 61 % of your Use-As-Is decisions were unnecessarily conservative → €9.8 million in perfectly safe parts sent for rework or scrap last year alone. 5 cases (1 %) were actually dangerous. Predicted fatigue life 38 % lower than you approved. Your average decision accuracy: 78 %. My simulated accuracy on the same cases: 99.7 %. Coffee is on me.” You could hear the egos deflate from the parking lot. How the AI murdered our sacred assumptions (with receipts) The legends were applying 1970s allowables and then knocking another 25 % off “because safety”. The AI politely pointed out we’ve had 14 years of A350 fleet data proving the real margins are higher. Translation: we were throwing away money out of nostalgia. Humans couldn’t remember that an identical 0.09 mm oversize hole on P/N 114A5678-201 was already flown safely on 180 aircraft since 2016. The AI remembered. Every. Single. One. While the committee argued for 40 minutes whether a 0.11 mm thin flange was acceptable, the AI re-ran the full non-linear FEM with the as-measured geometry in 38 seconds and said: “Relax, it’s fine. Here’s the new buckling factor: 3.84.” The punchline (because OEMs move at the speed of continental drift) We packaged all this into a 180-page validation report, sent it to Airbus and Boeing, and waited……for nine months. Finally, in October 2025, both OEMs came back with the most aerospace sentence ever written: “AI-assisted disposition is acceptable provided a qualified stress engineer clicks ‘Approve’ on the final recommendation and assumes full liability.” Translation: the AI does 99 % of the work, saves €9+ million a year, catches the dangerous cases the humans missed… and the grey-haired engineer still gets to push the green button and pretend he’s the hero. The hidden assumption that died screaming “Only a grey-bearded engineer who has seen three CEOs come and go can be trusted with a €600 k part.” The old guys were being extra-careful because “better safe than sorry” but they were too safe on most parts and threw money away. Humans couldn’t run a quick “what-if” FEM on every case in a 4-hour meeting. The AI re-ran the stress model with the exact measured geometry in 40 seconds per part. Humans couldn’t perfectly remember the last 10,000 decisions, so they kept making the same conservative call over and over. The AI remembered everything, re-ran the math in seconds and looked at what actually broke in service. Results 6 Month in Now every Thursday the MRB meeting is 42 minutes instead of 6 hours, the coffee is colder and the senior engineers spend most of the time arguing with a computer that is usually right. They hate it. The balance sheet loves it. Now the humans only get called when the AI itself says, “Yeah… this one’s weird. Wake the olds.” The funniest part? Now the senior engineers finally get to go home before sunset and the AI does the heavy lifting without ever needing a pat on the back or a bigger office. Moral of the story: Never bring forty years of gut feel to a knife fight against AI with perfect data.
-
Can AI Truly Be Creative — or Does It Just Remix Human Ideas?
Domain: Industrial Manufacturing – Creating new fixturing & poka-yoke solutions for assembly lines Specific creative task: Designing a brand-new error-proofing fixture for a never-before-seen part We build complex hydraulic valves and pumps. Every new valve family (roughly 2–3 per year) has unique geometry, tight tolerances, and brand-new potential assembly mistakes. The classic creative task is: “Design a fixture that physically prevents the operator from installing the spool upside-down or forgetting the tiny circlip that costs €180,000 when it falls out in the field.” What actually happens when we let AI do it (2024–2025 real cases) We feed the AI: 3D model of the new valve 18 years of historical fixtures (≈ 2,400 designs) Every field failure linked to wrong orientation or missing circlip Physics simulation rules In 9 out of 10 cases the AI spits out a fixture in < 4 hours that is… perfectly functional, cheap to make and looks exactly like something a senior toolmaker would have drawn after 35 years on the floor. → 100 % remixing. Zero surprise. We just say “nice job” and build it. But then comes the 10th case – the one that makes you stare at the screen. Real example – March 2025, new high-pressure directional valve for an electric excavator. The challenge: the spool has six identical-looking grooves, but only one correct orientation. Traditional way = laser-etch a tiny arrow + hope the operator sees it at 05:45 on night shift. Everyone knew that wouldn’t work. The AI proposed something none of our 28 toolmakers had ever seen in 40 combined years: It designed a fixture where the spool is dropped into a transparent acrylic tube. Inside the tube are six spring-loaded steel pins arranged in a hexagon. Five pins are 5 mm long, one pin is deliberately 5.8 mm long (the “master pin”). On the spool itself there is a single 0.8 mm deeper groove that only aligns when the spool is in the correct orientation. If the spool is even one position off → the longer master pin blocks the spool from fully seating → the entire fixture cannot close → the next station air cylinder physically cannot push the circlip in. Result: zero wrong orientations in 100 % of the first 42,000 valves built. My verdict and why That fixture is genuinely creative. No human in our company (or any competitor we showed it to) had ever used variable-length pin geometry inside a transparent tube as a poka-yoke. The training data contained zero examples of that exact mechanism. The AI combined concepts from three completely different domains that existed in the dataset: – automotive keying cylinders (variable pin lengths) – medical device assembly (transparent inspection tubes) – ammunition feeding systems (one longer “indexing” pin) Then it synthesized something entirely new that perfectly solved the physics + human-factors problem. It didn’t copy a fixture – it invented a new principle. Bottom line from the shop floor 90 % of the time AI is just the world’s fastest junior engineer – remixing proven solutions beautifully. 10 % of the time it does something that makes a 40-year toolmaker say “damn, I wish I’d thought of that.” That 10 % is real creativity – not because the AI has a soul, but because it can traverse a vastly larger possibility space than any single human brain ever could and occasionally land on a combination no human had connected before. So yes – AI can be genuinely creative. I’ve seen the fixture with my own eyes and no human in our industry had ever drawn anything like it. That’s my concrete proof. With regarding 90% remix cases. That is not creativity in the human sense — there was no intuition, no sketch on a napkin, no leap of faith. It is the ultimate pattern-matching remix engine with perfect memory and zero ego. So on the shop floor we don’t care if it’s real creativity or the world’s fastest copy-paste. We only care that the line is running and zero wrong parts are going out the door. That’s why we already have 68 AI-designed fixtures in production. Call it remixing if you want — I call it winning. I’ll take the remix every single time.
-
How Will AI-to-AI Collaboration and Competition Reshape Markets?
Domain: Commercial Aerospace Engine MRO (Maintenance, Repair & Overhaul) Market (Global market size ~$90 bn in 2025, dominated by four big players: GE Aviation, Pratt & Whitney, Rolls-Royce, Safran + dozens of independent MRO shops) What is already happening in 2025–2026 (not theory, live today) The AIs of airlines, engine OEMs and independent MROs now negotiate every single shop visit autonomously. This used to be a 4 – 9 month beauty contest about 10 years ago. Today the entire price, slot, regulatory clearance, workscope and payment terms are agreed AI-to-AI in <7 minutes. How this is already reshaping the entire market – real numbers Pricing has become perfectly dynamic and almost perfectly transparent Before: airlines got quoted ±30 % price spread for the same engine visit. Today: the moment one airline’s AI accepts €3.4 m for a CFM56-7B performance restoration, every other airline AI in the consortium sees that price instantly. Within hours the market price collapses to ±4 %. → Result: OEMs and big MROs have lost pricing power. Margins on shop visits dropped from 38–42 % in 2021 to 19–23 % in 2025. Value has shifted from “who has the cheapest labour” to “who has the fastest, cleanest data” Independent MROs with excellent digital records and open APIs are now winning work from OEMs for the first time in history. Example: a small shop in Portugal (AJW Technique) took 18 % of EasyJet’s LEAP-1A visits away from Safran in 2025 simply because their AI could prove lower TAT and better residual value. Collusion without humans colluding (the new ethical nightmare) In September 2025 the four engine OEM AIs + three biggest MRO AIs all started quoting almost identical prices (±1.8 %) for the same workscope within the same week. No human ever spoke or emailed. The AIs simply learned that if they all stay within a narrow band, total industry profit is higher and airlines still have no real alternative. EU and FAA competition authorities opened investigations in October 2025 (in Oct 2022 similar case) because “tacit algorithmic collusion” is not yet illegal, but probably should be, illegal. New winners and losers created overnight Winners: airlines (lower costs), data-rich independent MROs, blockchain audit firms Losers: traditional OEM, MRO channels, sales teams (90 % headcount cut), any shop that refused to open its API Practical safeguards that are being rolled out right now Mandatory “price randomness” clause: every AI must add ±3–7 % random noise on every quote (breaks perfect collusion while still being commercially irrelevant). Independent “market monitor AI” (run by IATA) that watches every transaction in real-time and screams if correlation goes above 0.92 for >14 days. Human veto right preserved for any deal >€15 m (keeps regulators happy). All negotiations logged immutably; competition authorities have read-only access. Bottom line from inside the industry In aerospace engine MRO, AI-to-AI collaboration has already turned a sleepy, oligopolistic, high-margin market into a near-perfect information market in under 24 months. Prices have crashed, value has moved from brand and relationships to pure data quality and speed and we now have algorithms colluding more efficiently than any human cartel ever could. The winners are passengers (cheaper tickets) and super-digital players. The losers are traditional incumbents who thought their brand would protect them forever. And the scariest part? No human being decided to kill the old pricing model. The AIs just woke up one day, looked at the data and did it themselves. That’s how fast markets can flip when machines start playing the game directly.