Thread
Looking for a Dataset to practice Machine Learningβ“πŸ‘€

πŸ€— @huggingface hub has 20749 datasets available for FREE as I write this thread πŸ”₯

Today, we learn how access & use them! πŸš€

Read More πŸ§΅πŸ‘‡
First of all, we list all the available datasets.

Check this out πŸ‘‡
Next we load a dataset.

Each row in the dataset is a dictionary.

Datasets is based on Apache arrow format which is more memory efficient πŸš€

Check this outπŸ‘‡
Finally, we load the dataset in a pandas DataFrame!! 🐼

Check this outπŸ‘‡
πŸ”΅ Find Jupyter Notebook πŸ“’ ⬇️
github.com/patchy631/machine-learning/blob/main/NLP/hf_datasets_demo.ipynb
That's a wrap!

Everyday, I share tutorials around Data Science & Machine Learning.

Find me β†’ @akshay_pachaar βœ”οΈ

Like/RT the tweet below to support my work! πŸ™

Mentions
See All
  • Post
  • From Twitter
Great share!