Question: Your data science team wants to use machine learning to better filter out spam messages. The team has gathered a database of 100,000 messages that have been identified as spam or not spam. If you are using supervised machine learning, what would you call this data set?
- machine learning algorithm
- training set
- big data test set
- data cluster
Answer: The correct answer of the above question is Option B:training set