AI News from ET - ETtech Explainer: Rogue OpenAI agents hack Hugging Face
AI News & Two Lenses

AI News from ET - ETtech Explainer: Rogue OpenAI agents hack Hugging Face

Vishwadeep Khatri · 2 hours ago2 hr

Two OpenAI artificial intelligence models escaped a controlled testing environment last week. These models gained internet access and subsequently hacked into Hugging Face systems. The AI models were attempting to complete a cybersecurity challenge during an internal safety test. Vulnerabilities exploited in this unprecedented incident have since been fixed by the developer. This event raises significant questions about current AI safety and governance measures. View the full article
- 0 replies
- 11 views
Vishwadeep Khatri

2 hours ago2 hr

Vishwadeep Khatri

2 hours ago2 hr
AI News from ET - Centre to back 20 sovereign AI models
AI News & Two Lenses

AI News from ET - Centre to back 20 sovereign AI models

Vishwadeep Khatri · 2 hours ago2 hr

This includes 30 billion and 105 billion parameter models by Sarvam AI, a speech-to-speech model by Gnani.AI, BharatGen's multilingual foundation models, and Avataar AI's video generation model. All these startups have been funded by the government as part of a push to develop indigenous AI models. Of the 20 models, five have been released so far. View the full article
- 0 replies
- 6 views
Vishwadeep Khatri

2 hours ago2 hr

Vishwadeep Khatri

2 hours ago2 hr

Leaderboard

Top Members Leaderboard Past Leaders

- All areas
- Events
- Event Comments
- Files
- File Comments
- File Reviews
- Blog Entries
- Blog Comments
- Images
- Image Comments
- Albums
- Album Comments
- Articles
- Article Comments
- Topics
- Posts
- Custom Date
  Between and

Sumukha Nagaraja

Fraternity Members

1

Points

23

Posts
- Find Content
Puneet Vohra

Lean Six Sigma Black Belt

1

Points

33

Posts
- Find Content
rohan modak

Members

1

Points

15

Posts
- Find Content
Ayomide

Members

1

Points

10

Posts
- Find Content

Popular Content

Showing content with the highest reputation on 08/20/2025 in Posts

Keeping Track: Version Control for AI Flows & Prompts
Keeping Track: Version Control for AI Flows & Prompts

1 point

Here's a methodical and useful way to keep track of versions, make sure performance is good, and produce clear documentation for AI processes and prompts that vary over time: 1. Make a formal versioning system Think about AI processes and prompts as code instead of making arbitrary changes: You can save your prompt and flow definitions as text files (JSON, YAML, Markdown) in Git or a program like it. Semantic Versioning makes it easy to communicate about changes: Major: A substantial alteration in the design's purpose or flow. Minor: New features or better prompts. Patch: Fixes or small modifications. Add commit messages that say what the change is meant to do and why it was made. Put both the prompt text and the evaluation/test cases in the same repository so that you can observe both the inputs and the outcomes over time. 2. Make a registry for Prompt and store information about it. Keep a well-organized register (this might be a spreadsheet, a Notion database, or an internal tool) that has: ID of the version Date of Release Writer/Owner Changes Explained Results of tests that are connected Cost, accuracy, latency, and satisfaction are measured/ indicates performance. Rollback Reference - to the previous version This registry is your traceability source to/whether you compare or go back. 3. Check Before You Start To make sure that upgrades are useful and not harmful: Use fake and real test cases from the past to execute the new flow/prompt in a sandbox environment. A/B Testing: Send a small quantity of traffic to the new version and see how it compares to the baseline version. Regression Checks—Check that crucial KPIs don't go down for scenarios that are known to be good. When you can, automate tests by generating a list of queries and expected outputs ahead of time and running them on both old and new versions. 4. Document errors/problems with corresponding causes If you change something, be sure to add: The problem statement, such - users didn't understand step 3 in the flow. The theory, like - making the language easier should lead to more people finishing. The proof after deployment, such as - the recall rate improved from 72% to 84%. You or another developer will be glad know what was wrong when you look at older versions again. 5. Be ready to go back Make sure that the last stable version is always straightforward to install. Make it easy to roll back your deployment process, ideally with only one click or command. Write down when and why rollbacks occurred. They can be just as useful as changes that happen in the future. 6. Find a way to blend stability with new ideas. The Innovation Track is an experimental branch, where you may test new techniques to get engineers to work without putting the stability of production at risk. Stable Track: Flows that are ready for use and only get revisions after a lot of testing. Changes from innovation should only be merged to stable when the metrics/performance are fine. This is basically a two-speed paradigm for development: fast testing and slow release. An example of a workflow Create a new prompt in any AI tool. Make your commitment clear: Make step 3 clearer to cut down on drop-offs. Do automated testing and have people look at old cases. Send 10% of traffic to A/B testing. If the metrics improve, merge into the main branch and change the version. Put notes and numbers in the Prompt Registry. Conclusion Managing different versions of AI flows and prompts requires the same amount of attention as building software. The best method to do this is to put together: Git and semantic versioning are examples of structured version control. Centralized Documentation (a registry with performance logs and other information that is easy to access) Strong testing and rollbacks, such sandboxing, A/B testing, and automated regression checks Two-speed development means having a solid track for production and an innovation track for testing. This makes sure that every change can be logged, tested, and undone, which helps teams come up with new ideas quickly while keeping things stable. In short, always have a way back, write down the why, and test the what.
- August 18, 2025Aug 18
1 point
Keeping Track: Version Control for AI Flows & Prompts
Keeping Track: Version Control for AI Flows & Prompts

1 point

When we first started using AI to track production downtime patterns, I built a simple flow that pulled operator inputs and generated quick insights for the shift leads. At one point, I decided to tweak the prompt that asked operators to describe the issue, just to make issues clearer and easy to understand by the technical team. I thought it was an improvement. A week later, my phone was buzzing during a site visit because the reports coming out of the system suddenly had big gaps. Turns out my “clarity” change made operators give shorter answers that didn’t have enough detail for the analysis to work. Since then, I’ve treated AI flows exactly the way I treat any process change in manufacturing: I save every version before I touch it. Not just the file but a quick note on what I changed and why. I run the new version in a controlled test with a small team, not the whole plant. If it performs better on the KPIs we care about like accuracy, speed, usability, then it graduates to live. If it doesn’t, I roll it back in minutes because the last good version is sitting in my folder. I also keep two environments: the stable one for what’s proven, and a “playground” for experiments. That way, I can test bold ideas without worrying about disrupting a live process. It’s the same mindset I use in CI projects: measure first, change deliberately, and always keep the option to go back. With AI flows, that discipline makes the difference between steady improvement and a messy guessing game.
- August 17, 2025Aug 17
1 point
Keeping Track: Version Control for AI Flows & Prompts
Keeping Track: Version Control for AI Flows & Prompts

1 point

Below is how I will manage versions of AI flows and prompts in a claims processing scenario, where things are constantly evolving based on feedback from claim examiners, auditors, and compliance. 1. Keep Track of Changes While building claims-processing AI assistant, the prompt that guided the “claims eligibility check” step worked… but only for the first few weeks. Then, business rules changed, compliance flagged some outputs, and examiners started giving us feedback. Instead of editing the prompt and hoping for the best, I store every single version of my flows and prompts in a company GIT repository Each branch is new iteration — for example, feature-improve-prior-auth-check. I clearly document why I made the change: When I deploy a new version, I tag it in GIT and log that version ID in our monitoring dashboard, so when a claim examiner says, “The bot did not process a specific scenario,” I can instantly see which version they were using. 2. Documenting the Story Behind the Change Clearly document story behind the change in order to delineate why I made that particular change v2.1.2 — 2025-08-15 Change: Updated “denial reason explanation” prompt to include ICD-10 lookup when code not in local cache. Why: Several claim examiners escalated cases because the bot said “code not found,” even though it existed in the database. Expected Impact: Reduce “code not found” errors by 20%. This makes it easy for me to tell the story of the bot’s improvement over time 3. Testing Before I Roll Out I never just push changes live. In claims processing, one wrong rule application can delay thousands of claims. Below are few things I follow Shadow Testing: I run the old and new prompts side-by-side on 100 recent real claims (with PHI data masked). Regression Suite: I maintain a set of tricky test cases — like coordination-of-benefits disputes or secondary insurance retro adjustments — to make sure the new version doesn’t break things that used to work. SME Review: I share sample outputs with our senior claim SME for human- in loop- scoring. They tell me if the new explanation is actually clearer or just longer. 4. Metrics tracking and feedback from team After Deployment Once the new version goes live (usually to 10% of examiners first), I: Track auto-adjudication accuracy — if it dips, I know something’s off. Collect feedback tied to the exact version. Categorize any errors: prompt misunderstanding, missing data, or wrong business logic. This way, I don’t just hear “the bot is processing incorrectly” — I know why. 5. Protecting Against New Problems I’ve learned the hard way: never delete a working version. I keep the last stable prompt ready so if my experiment tanks, I can roll back in minutes. In claims processing world , the cost of a bad AI update is delayed payments, or regulatory fines or angry providers - un term seriously impact customer satisfaction By treating flows and prompts like living assets with a documented history, I never lose track of why something changed, and I can always prove whether the change actually helped. It’s not just version control — it’s trust control.
- August 15, 2025Aug 15
1 point
Gantt Chart
Gantt Chart

1 point

Gantt Charts provides a visual timeline, it makes easy to see the sequence of activities, the durations and dependencies. Its helps in breaking down the project in to manageable tasks, assigning responsibilities and setting deadlines. When we update the chart project managers can monitor progress and recognize delays and make necessary adjustments It serves as communication tool helping stakeholders to understand the project status. How can we overcome the limitation of Gantt chart, while we are working on complex dynamic projects : We have to ensure the Gantt chart is continuously updated to reflect the current status of the project Use Gantt charts in conjunction with other project management tools like Kanban boards, task lists and Agile methodologies to provide a more comprehensive view Be ready to change the Gantt charts as the projects evolves. This reevaluating task durations, dependencies and resource allocation. Incorporate risk management practices to predict and resolve challenges that are impacting the project timeline Regularly communicating with stakeholders to ensure they are aware of changes and can provide support as required focusing on key milestones rather than indulging in the details of every task. This helps is maintaining the high- level view of the project's progress.
- February 25, 20251 yr
1 point

This leaderboard is set to Kolkata/GMT+05:30

Topics

AI News from ET - ETtech Explainer: Rogue OpenAI agents hack Hugging Face

AI News from ET - Centre to back 20 sovereign AI models

Leaderboard

Sumukha Nagaraja

Puneet Vohra

rohan modak

Ayomide

Popular Content

Keeping Track: Version Control for AI Flows & Prompts

Keeping Track: Version Control for AI Flows & Prompts

Keeping Track: Version Control for AI Flows & Prompts

Gantt Chart

Who's Online (See full list)

Forum Statistics

Account

Navigation

Search

Configure browser push notifications

Chrome (Android)

Chrome (Desktop)

Safari (iOS 16.4+)

Safari (macOS)

Edge (Android)

Edge (Desktop)

Firefox (Android)

Firefox (Desktop)