Capgemini interview question

Why this data set? Why this ML algorithm?