Python for Data Science: Session Five
Session Content
Teaching Session Content
- Train-test splits and over/under-fitting
- Classification with k-nearest neighbours
- Regression with support vector machines
- Clustering with k-means
- Ensemble methods (bagging) with random forests
- Cross-validation and hyperparameter tuning
Recommended Reading and Resources
- Scikit-learn API Reference
- Scikit-learn User Guide
- Scikit-learn Examples
- Google’s ML Crash Course
- Cassie Kozorov’s Making Friends With Machine Learning course
Session Resources
Teaching Session
Project Session
Economins/Finance
- Questions
- Solutions
- Bankruptcy Dataset
- Productivity Dataset