Skip to content
View in the app

A better way to browse. Learn more.

Benchmark Six Sigma Forum

A full-screen app on your home screen with push notifications, badges and more.

To install this app on iOS and iPadOS
  1. Tap the Share icon in Safari
  2. Scroll the menu and tap Add to Home Screen.
  3. Tap Add in the top-right corner.
To install this app on Android
  1. Tap the 3-dot menu (⋮) in the top-right corner of the browser.
  2. Tap Add to Home screen or Install app.
  3. Confirm by tapping Install.
Message added by Mayank Gupta,

In Sample Testing is when the model is used to predict the outcome using a data point from the sample data that was used to develop and optimize the model.

 

Out of Sample Testing is when the model is used to predict the outcome using a data point that was not a part of the sample data that was used to develop and optimize the model.

 

An application-oriented question on the topic along with responses can be seen below. The best answer was provided by Anish Mohandas on 14th Apr 2024. 

 

Congratulations to the winner!!

Featured Replies

Q 660Compare In-Sample and Out-of-Sample testing, highlighting their advantages and disadvantages. Provide examples where their use is preferred.

 

Note for website visitors -

Solved by AnishMohandas

  • Solution

Machine learning researchers and data scientists often use in-sample and out-of-sample testing to refer to training and test sets respectively.

 

In-sample data refers to the set of data which is used for training or fitting a model. When analysts try to build statistical or machine learning models they usually make use of historical data which enables the model to be taught about how predictions or classifications can be made. During this process, inputs are given, along with their corresponding outputs to enable it to learn the underlying patterns and relationships that exist in the dataset. This is essentially what is meant by in-sample data; it is the dataset upon which the model learns from.

 

On the other hand, Out of sample data refers to unseen data by a model when undergoing training phase. After a model has been trained, measuring its performance on new unseen information is important for assessing its ability of being generalized. For this reason, out-of-sample testing is employed. By trying out a model using real time cases that have never encountered before, analysts can make an estimation as regards making predictions or classification of unseen instances inside such models as well. This stage helps confirm its applicability within practical contexts (Data modelling and learning steps are illustrated and attached)

 

In-sample (Training data)

Out-of-sample (Testing data)

Advantages:

  •  It facilitates model evaluation based on the same data used for training,
  •  It gives us insight on how well the model fits the training data.
  • Computationally well efficient

Disadvantage:

  • It is prone to overfitting

Advantages:

  • Provides a more accurate estimation of model’s performance in unseen data.
  • Validates the model effectiveness in real world scenario

Disadvantage:

  •  It requires a separate dataset for testing.
  • It can be computationally intensive if multiple iterations or cross validations are performed

Example of in-sample and out-of- sample data in real world scenario

Assumption:80% in-sample and 20% out-of-sample

The examples demonstrate how in-sample and out-of-sample testing are applied across all domains from Finance, Healthcare, cybersecurity etc.

  • Credit decisioning model
  • Training machine learning models on historical data to predict stocks
  •  Developing a spam email classifier
  •  Fraud detection algorithm

  •  Evaluating the performance of medical diagnosis model

data model.jpg

Anish Mohandas' is the only answer that we were able to publish and I am glad that it was the best answer! Well done

 

P.S. there were a couple of more good answers but could not be published as they failed in the AI generated content test. While you can do your research online or using AI tools, however, we want the answers to be originally written.

Create an account or sign in to comment

Account

Navigation

Search

Search

Configure browser push notifications

Chrome (Android)
  1. Tap the lock icon next to the address bar.
  2. Tap Permissions → Notifications.
  3. Adjust your preference.
Chrome (Desktop)
  1. Click the padlock icon in the address bar.
  2. Select Site settings.
  3. Find Notifications and adjust your preference.