3. Get a baseline model working.
4. Design an evaluation method that is as close as possible to the final evaluation. Plot local evaluation metrics against the public ones (correlation) to validate how well your validation strategy works.
5. Try different approaches for preprocessing (encodings, Deep Feature Synthesis, lags, aggregations, imputers, ...). If you're working as a group, split preprocessing and feature generation between files.
6. Plot learning curves ([sklearn](https://scikit-learn.org/stable/modules/learning_curve.html) or [external tools](https://github.com/reiinakano/scikit-plot)) to spot overfitting early.
7. Plot the real and predicted target distributions to see how well your model understands the underlying distribution. Apply any postprocessing that might fix small things.
8. Tune hyper-parameters once you've settled on a specific approach ([hyperopt](https://github.com/hyperopt/hyperopt), [optuna](https://optuna.readthedocs.io/)).
9. Plot and visualize the predictions (histograms, random prediction, ...) to make sure they behave as expected. Explain the predictions with [SHAP](https://github.com/slundberg/shap).
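
The local-versus-public check from step 4 can be sketched in a few lines. This is a minimal illustration rather than a prescribed method: the `pearson` helper and the score lists are hypothetical, and the numbers are made up for the example. It uses only the standard library, so it runs without any ML dependencies.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant scores")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores for five submissions (illustrative values, not real results).
local_cv = [0.812, 0.825, 0.831, 0.840, 0.846]   # local validation metric
public_lb = [0.805, 0.819, 0.822, 0.836, 0.839]  # public leaderboard metric

r = pearson(local_cv, public_lb)
print(f"local vs public correlation: {r:.3f}")
```

A correlation close to 1 suggests that improvements measured locally tend to carry over to the public score. A low or negative value is a hint that the validation split does not resemble the final evaluation and should be revisited before tuning further.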

- [Sktime](https://github.com/alan-turing-institute/sktime) / [Aeon](https://github.com/aeon-toolkit/aeon)
- [Awesome Collection](https://github.com/MaxBenChrist/awesome_time_series_in_python)
- [Video with great ideas](https://www.youtube.com/watch?v=9QtL7m3YS9I)
- [Tutorial Kaggle Notebook](https://www.kaggle.com/code/tumpanjawat/s3e19-course-eda-fe-lightgbm)

## Datathon Platforms

- [Kaggle](https://www.kaggle.com/competitions)
- [MLContest](https://mlcontests.com/)
- [Humyn](https://app.humyn.ai/)
- [DrivenData](https://www.drivendata.org/competitions/)
- [Xeek](https://xeek.ai/challenges)

Open Data.md

- [Victoriano's Data Sources](https://victorianoi.notion.site/Data-Sources-79b28912c6d941af99e6ef102c578fa0)
- [Data is Plural](https://www.data-is-plural.com/)
- [Public APIs](https://github.com/public-api-lists/public-api-lists)
- [Real Time Datasets](https://github.com/bytewax/awesome-public-real-time-datasets)

## Open Source Web Data IDE