바로가기 메뉴 본문 바로가기 주메뉴 바로가기
  • 05-1bHave you implemented techniques to detect outliers in training data?
    • One of the significant activities in data pre-processing is to identify and remove outliers. Unlike data omission, outliers already have predefined values but deviate significantly from the normal range of the entire dataset, making it tricky to detect them through simple exploration.

    • In most cases, outliers are identified by applying statistical techniques (e.g. z-score, interquartile range) to the data as a whole to find data points that are far from the entire dataset.