Olympic Datathon - A Data Story-Telling Competition
Welcome to the Datathon
The Olympic Datathon is designed to be a language-agnostic mini “hackathon” for students with varying levels of experience. The focus of the datathon will be the exploratory data analysis - or as we like to call it data storytelling - of the given Olympics dataset*. We invite everyone who is interested in data science to participate, regardless of their technical background. This one day datathon offers an interesting dataset to explore, guidance along the way, and free pizza!
Disclaimer: This is an EDA and not a ML competition.
*Dataset will be published on October 16th.
Spotted a mistake, dead link, or have suggestions for improvements? Message us using the links below.
Navigation
Introduction
Important First Steps
- Make sure you have registered on the Google form. Otherwise, you will not be invited/accepted to the Microsoft Teams group.
- Join the Datathon Teams Group and turn on notifications to stay up to date. The hybrid session will be hosted there.
- Click ‘Going’ on the datathon’s Facebook event and invite your friends if you want.
All other relevant details should be available on this page. If you have any questions feel free to reach out by using the contact links below. Viel Spass!
Event Timeline
All events on October 16th will be held both in-person as well as online. For in-person attendees, please join us in room OC0.01
. Online attendees can follow the in-person event on the ‘General’ channel on Teams. We strongly encourage anyone who has the chance to participate in the in-person event.
Again, the dataset will be published on October 16th.
Date | Time | Event |
---|---|---|
16th October | 3-4pm | Dataset introduction & Python basics for storytelling |
16th October | 4-5pm | R workshop by Warwick Statistics Society |
16th October | 5-6pm | In person guidance |
16th October | 6pm | Free pizza |
23rd October | 6pm | Submission deadline |
Competition Details
Data Story-Telling Question
Which factors determine whether an Olympic athlete wins a medal or not?
Winning notebook (post submission deadline)
Congratulations to the team consisting of Ammar Salem, Dulnath Jayasinghe, Sofia Pangher and Zain Mobarik!
Sample Notebook Structure
- Summary statistics
- Univariate analysis (e.g. Medal)
- Bivariate analysis (e.g. Height vs Weight)
- Statistical modelling and inference (If you feel brave enough. This is not part of the EDA!)
Make sure to elaborate on your observations!
Resources
- Google colab skeleton code. Save a copy in Drive and follow along in the teaching session.
- Data set
- Python for Data Science teaching session 1 by WDSS
- Tidyverse R basics
Submission
Submissions should be sent to hackathon@wdss.io. We accept any formats!
The competition will run from 6pm on Saturday the 16th October to 6pm on Saturday the 23rd October. No preference will be given to early entrants.
Judging and Winning
We will judge submissions based on the insights they derive. If you can tell a story about the data with basic code, but elaborate observations that’s actually slightly better than a super technically advanced submission with no elaborations at all. We will be looking for a good balance between code, visualizations and elaborations.
The winner/winning team will receive a shoutout on our social media accounts (including LinkedIn if you wish so). We will also link to your solution from this GitHub page.
Help and Information
FAQs
Is this a team or individual competition?
This is up to you to decide. If you would like to be part of a team, please contact Valentin Kodderitzsch on MS teams. We will try to our best to allocate teams of up to 4 people based on each participant’s skill level. We will also be discussing teams during the Datathon introduction workshop which will be held on October 16th 2021 at 3pm in OC0.01
.
I’ve improved my data story-telling submission since last submitting. Can I resubmit?
No problem. Simply email hackathon@wdss.io and explain the situation.
Getting Help
We will provide help throughout the datathon in a number of ways:
- Most help will be provided during the workshop on October 16th
- After the workshop you will find help by messaging the ‘Get Help’ channel on Teams
Contact
This hackathon was written and is primarily managed by Valentin Kodderitzsch. If you need to contact me please reach out on Microsoft Teams or message me on LinkedIn.