• Data quality must be ensured when data is collected or produced, because the quality of training data is one of the most important factors affecting the performance of an AI model. Open-source datasets may be used where circumstances warrant.
• Users of an open-source dataset may discover errors in it over time. When the dataset is then modified and rebuilt to correct them, its version changes.
• To respond to AI model problems that may arise from datasets, the provenance of the data used in training, the time of deployment, and the version of each open-source dataset must be clearly recorded and managed.
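
The record-keeping described above could be sketched as follows. This is only an illustration under stated assumptions, not a prescribed format: the class names `DatasetRecord` and `ModelRelease`, the helper functions, the dataset name, and the example.org URL are all hypothetical. The idea is to store, for each deployed model, where each dataset came from, which version was used, a checksum of the exact data, and when the model was deployed, so that affected models can be found once a dataset version turns out to contain errors.

```python
import hashlib
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class DatasetRecord:
    """Provenance of one dataset used in training (hypothetical schema)."""
    name: str
    source: str   # where the data came from (URL, vendor, in-house team)
    version: str  # version label published by the dataset maintainers
    sha256: str   # checksum of the exact data that was used


@dataclass(frozen=True)
class ModelRelease:
    """One deployed model and the datasets it was trained on."""
    model_name: str
    deployed_at: datetime
    datasets: tuple[DatasetRecord, ...]


def checksum(data: bytes) -> str:
    """Return the SHA-256 hex digest of the given bytes."""
    return hashlib.sha256(data).hexdigest()


def releases_using(releases, name, version):
    """List the releases trained on a specific version of a dataset."""
    return [
        r for r in releases
        if any(d.name == name and d.version == version for d in r.datasets)
    ]


# Illustrative data only: a tiny in-memory "dataset" and one release.
raw = b"id,label\n1,cat\n2,dog\n"
v1 = DatasetRecord(
    name="example-images",
    source="https://example.org/datasets/example-images",
    version="1.0",
    sha256=checksum(raw),
)
release = ModelRelease(
    model_name="classifier",
    deployed_at=datetime(2024, 1, 15, tzinfo=timezone.utc),
    datasets=(v1,),
)

# If version 1.0 is later found to contain errors, look up affected releases.
affected = releases_using([release], "example-images", "1.0")
```

Keeping a checksum alongside the version label is a design choice in this sketch: it allows detection of cases where a dataset was rebuilt without its version label being changed.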