- Basic Python and statistics
Pima Indians :- https://www.kaggle.com/uciml/pima-indians-diabetes-database
Cardio Good Fitness :- https://www.kaggle.com/saurav9786/cardiogoodfitness
Automobile :- https://www.kaggle.com/toramky/automobile-dataset
- Advanced Statistics
Game of Thrones :- https://www.kaggle.com/mylesoneill/game-of-thrones
World University Rankings :- https://www.kaggle.com/mylesoneill/world-university-rankings
IMDB Movie Dataset :- https://www.kaggle.com/carolzhangdc/imdb-5000-movie-dataset
- Supervised Learning
a) Regression Problems
How much did it rain :- https://www.kaggle.com/c/how-much-did-it-rain-ii/overview
Inventory Demand :- https://www.kaggle.com/c/grupo-bimbo-inventory-demand
Property Inspection Prediction :- https://www.kaggle.com/c/liberty-mutual-group-property-inspection-prediction
Restaurant Revenue Prediction :- https://www.kaggle.com/c/restaurant-revenue-prediction/data
TMDB Box Office Prediction :- https://www.kaggle.com/c/tmdb-box-office-prediction/overview
b) Classification Problems
Employee Access challenge :- https://www.kaggle.com/c/amazon-employee-access-challenge/overview
Titanic :- https://www.kaggle.com/c/titanic
San Francisco Crime :- https://www.kaggle.com/c/sf-crime
Customer Satisfaction :- https://www.kaggle.com/c/santander-customer-satisfaction
Trip Type Classification :- https://www.kaggle.com/c/walmart-recruiting-trip-type-classification
Categorize Cuisine :- https://www.kaggle.com/c/whats-cooking
- Unsupervised Learning
Vehicle Identification :- https://www.kaggle.com/c/st4035-2019-assignment-1
I hope this helps you get started. You can also reuse the supervised learning projects above to practice ensemble techniques.
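As one illustration of an ensemble technique, here is a minimal sketch of hard (majority) voting in plain Python. The three prediction lists are made-up placeholders, not outputs from any real model or dataset; in practice they would come from classifiers trained on one of the classification projects above, such as Titanic. The function name `majority_vote` is also just a name chosen for this sketch.

```python
from collections import Counter


def majority_vote(predictions):
    """Combine per-model label lists into one label per sample by hard voting.

    `predictions` is a list of lists: one list of predicted labels per model,
    all of the same length (one entry per sample).
    """
    combined = []
    # zip(*predictions) regroups the labels so each tuple holds every
    # model's vote for a single sample.
    for sample_votes in zip(*predictions):
        # most_common(1) returns [(label, count)] for the most frequent label.
        combined.append(Counter(sample_votes).most_common(1)[0][0])
    return combined


# Hypothetical predictions (1 = survived, 0 = did not) from three models
# on five passengers. These values are invented for illustration only.
model_a = [1, 0, 1, 1, 0]
model_b = [1, 1, 0, 1, 0]
model_c = [0, 0, 1, 1, 1]

ensemble = majority_vote([model_a, model_b, model_c])
print(ensemble)  # [1, 0, 1, 1, 0]
```

Using an odd number of models with binary labels means a vote can never tie. Bagging, boosting, and stacking are the other common ensemble approaches worth trying on the same projects.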
Blog posts already written but not yet published (coming soon)
- Wind generation
- Tennis Bet
- Audio deep learning
- Data engineering
- Hierarchical clustering
- DBSCAN
- Scraping AI generated faces
Next blog posts
- TEDx talks
- Theories of mind
- Best Linux apps
- Update the about section
- Git basics
- challs
To do ASAP
- TED talk about memory with a summary
- My Anki memory flashcards (Spark, Python, Scala, Python vs Scala…) https://towardsdatascience.com/python-vs-scala-a-comparison-of-the-basic-commands-fae23b3ede23
- post on 30 wind years
- ML in Scala
- insincere questions part 2
- description of data with stats
- other posts on scraping (on steroids, with Tor, Scrapy, etc…)
- tables of contents for each post (with references)
- https://www.kaggle.com/aksingh2411/dataset-of-malicious-and-benign-webpages/kernels
- https://www.kaggle.com/mkashifn/nbaiot-dataset/kernels
- https://www.kaggle.com/deepsworld/malicous-and-benign-websites
- STAMINA
- https://www.kaggle.com/yashwanthkumbam/apaddos-dataset/kernels
- https://www.kaggle.com/remosin/bot-detection/kernels
- https://www.kaggle.com/omegaji/bots-ua-parsed
- https://www.kaggle.com/hawkcurry/2019-trendmicro-ctf-wildcard-400/kernels
- https://www.kaggle.com/ang3loliveira/malware-analysis-datasets-raw-pe-as-image
- https://www.kaggle.com/ellipticco/elliptic-data-set
- samples from vx-underground and books, STAMINA malware classification
Other links
Other Kaggle Challenges - CV
- https://www.kaggle.com/dansbecker/food-101
- This person does not exist (with celebrity images from Kaggle)
- https://www.kaggle.com/xiaotawkaggle/inhibitors
Other Kaggle Challenges - NLP
- https://www.kaggle.com/c/word2vec-nlp-tutorial
- https://www.kaggle.com/c/quora-question-pairs/overview
- https://www.kaggle.com/c/quora-insincere-questions-classification
- https://www.kaggle.com/snap/amazon-fine-food-reviews
- https://www.kaggle.com/bittlingmayer/amazonreviews
Other Kaggle Challenges - Audio
- https://www.kaggle.com/c/freesound-audio-tagging
Other Kaggle Challenges
- https://www.kaggle.com/c/sf-crime
- https://www.kaggle.com/c/pkdd-15-predict-taxi-service-trajectory-i
- https://www.kaggle.com/c/forest-cover-type-prediction
- https://www.kaggle.com/c/random-acts-of-pizza
- https://www.kaggle.com/c/titanic
- https://www.kaggle.com/c/digit-recognizer
- https://www.kaggle.com/uciml/breast-cancer-wisconsin-data
- https://www.kaggle.com/c/facial-keypoints-detection
- https://www.kaggle.com/kmader/skin-cancer-mnist-ham10000