Data Leakage

Data Leakage - In the context of AI, data leakage is the introduction into the training data of information that the system is expected to infer and should not legitimately have access to during model training. The resulting model appears to perform well when evaluated on the test set but performs poorly in deployment, when it is applied to new data.

Class Information

Identification

Label (rdfs)
Data Leakage
Preferred Label
None
Alternative Labels
Data Contamination, Model Contamination, Unintended Data Inclusion
Identifier
N/A

Definition and Examples

Definition
In the context of AI, data leakage is the introduction into the training data of information that the system is expected to infer and should not legitimately have access to during model training. The resulting model appears to perform well when evaluated on the test set but performs poorly in deployment, when it is applied to new data.
Examples
  • N/A
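
As an illustrative sketch only (not part of the source ontology), the effect described in the definition can be reproduced with a toy model. Everything here is an assumption made for the illustration: the synthetic rows, the helper names (make_rows, predict_1nn, accuracy, leaked_rows), and the choice of a 1-nearest-neighbor model that simply memorizes its training rows. The labels are coin flips, so there is nothing legitimate to learn, and any high score can only come from leakage.

```python
import random


def make_rows(n, rng):
    """Hypothetical data: one random feature, label is a coin flip (no real signal)."""
    return [((rng.random(),), rng.randint(0, 1)) for _ in range(n)]


def predict_1nn(train, x):
    """Return the label of the nearest training row (a model that memorizes)."""
    nearest = min(train, key=lambda row: abs(row[0][0] - x[0]))
    return nearest[1]


def accuracy(train, rows):
    """Fraction of rows whose label the 1-NN model predicts correctly."""
    hits = sum(predict_1nn(train, x) == y for x, y in rows)
    return hits / len(rows)


def leaked_rows(train, test):
    """Rows that appear in both sets: a simple check for this kind of leakage."""
    return set(train) & set(test)


rng = random.Random(0)
train = make_rows(200, rng)
test = make_rows(100, rng)
fresh = make_rows(100, rng)   # stands in for data seen only in deployment

# Leakage: the test rows (features and labels) end up in the training data.
leaky_train = train + test

clean_test_acc = accuracy(train, test)          # honest evaluation
leaky_test_acc = accuracy(leaky_train, test)    # inflated by leakage
leaky_fresh_acc = accuracy(leaky_train, fresh)  # what deployment would see

print("clean train, test set :", clean_test_acc)
print("leaky train, test set :", leaky_test_acc)
print("leaky train, fresh set:", leaky_fresh_acc)
print("rows shared by leaky train and test:", len(leaked_rows(leaky_train, test)))
```

With the leaked rows present, every test row is its own nearest neighbor, so the test score is perfect even though the labels are random, while the score on the fresh rows stays near chance. The leaked_rows check only catches exact duplicates; other forms of leakage, such as a feature derived from the label, need different checks.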

Translations

N/A

Class Relationships

Sub Class Of
Parent Class Of
  • N/A
Is Defined By
N/A
See Also
N/A

Additional Information

Comment
N/A
Description
N/A
Notes
  • N/A
Deprecated
False

Metadata

History Note
N/A
Editorial Note
N/A
In Scheme
N/A
Source
N/A
Country
N/A
