Some useful datasets from Sklearn:

2 min read · Mar 5, 2020

When you have an in-person interview and the interviewers ask you to demonstrate some machine learning skills, you may get nervous and wonder what data to use. The datasets that ship with sklearn are very helpful in this situation. As we know, machine learning can be divided into three parts: supervised learning, unsupervised learning and reinforcement learning.

Image source: https://subscription.packtpub.com/book/big_data_and_business_intelligence/9781789345070/1/ch01lvl1sec12/ml-tasks

1. Supervised learning

1.1 Regression

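Several datasets are commonly used for regression, such as load_boston and load_linnerud. We can also use real-world data from sklearn.datasets, such as fetch_california_housing. Note that load_boston was deprecated in scikit-learn 1.0 and removed in 1.2, so it is only available in older versions.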
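As a quick illustration of how these loaders work, here is a minimal sketch that loads load_linnerud, chosen here only because it is bundled with scikit-learn and needs no download. The variable names are my own.

```python
from sklearn.datasets import load_linnerud

# load_linnerud ships with scikit-learn, so nothing is downloaded.
linnerud = load_linnerud()

# The loader returns a Bunch: features in .data, targets in .target.
X, y = linnerud.data, linnerud.target

print(X.shape, y.shape)        # (20, 3) (20, 3): three features, three targets
print(linnerud.feature_names)  # names of the exercise features
print(linnerud.target_names)   # names of the physiological targets
```

fetch_california_housing returns the same kind of object, but it downloads the data the first time it is called, so it needs an internet connection.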

For example, if we want to apply Linear Regression, Lasso Regression, Ridge Regression and ElasticNet Regression, we can use the Boston housing price dataset. Using datasets from sklearn saves time, because some of them are already cleaned up and ready to use. We can do a train/test split and fit different models on the data directly.
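The workflow described above might look roughly like the sketch below. It makes a few assumptions that are not from the original text: it uses load_diabetes as a stand-in, because load_boston is no longer available in scikit-learn 1.2 and later, and the alpha values, split size and random seed are arbitrary illustrative choices, not tuned settings.

```python
from sklearn.datasets import load_diabetes
from sklearn.linear_model import ElasticNet, Lasso, LinearRegression, Ridge
from sklearn.model_selection import train_test_split

# Assumption: load_diabetes stands in for the Boston data (also a bundled regression set).
X, y = load_diabetes(return_X_y=True)

# Hold out 20% of the rows for evaluation; the seed only makes the split repeatable.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# The four models named in the text; hyperparameters are illustrative, not tuned.
models = {
    "Linear": LinearRegression(),
    "Lasso": Lasso(alpha=0.1),
    "Ridge": Ridge(alpha=1.0),
    "ElasticNet": ElasticNet(alpha=0.1, l1_ratio=0.5),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    # .score() returns R^2 on the held-out data for regressors.
    scores[name] = model.score(X_test, y_test)
    print(f"{name}: R^2 = {scores[name]:.3f}")
```

Because every model exposes the same fit and score methods, swapping in another dataset only means changing the line that loads X and y.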

PS: for the Boston dataset, there is no need to do 'feature selection'.