This INCREDIBLE trick will speed up your data processes.
In this video we discuss the best way to save data to files using Python and pandas. When you work with large datasets, there comes a time when you need to store your data. Most people turn to CSV files because they are easy to share and universally supported, but there are much better options out there! Watch as Rob Mulla, Kaggle Grandmaster, discusses some alternative file formats: pickle, Parquet, and Feather. The benchmarks show that you can save time and disk space, and keep the important metadata about your data in the process!
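To make the comparison concrete, here is a minimal sketch of writing one DataFrame to each of the formats mentioned above. It is not the code from the video's gist: the column names, the data, and the helper names `make_frame` and `save_all` are made up for illustration. Parquet and Feather are written only if `pyarrow` is installed.

```python
import tempfile
from pathlib import Path

import numpy as np
import pandas as pd


def make_frame(n_rows: int = 1_000) -> pd.DataFrame:
    """Build a small mixed-dtype frame (hypothetical example data)."""
    rng = np.random.default_rng(0)
    return pd.DataFrame(
        {
            "id": np.arange(n_rows, dtype="int32"),
            "value": rng.normal(size=n_rows),
            "category": pd.Categorical(rng.choice(["a", "b", "c"], size=n_rows)),
            "when": pd.date_range("2023-01-01", periods=n_rows, freq="D"),
        }
    )


def save_all(df: pd.DataFrame, out_dir: Path) -> dict:
    """Write df in each available format and return file sizes in bytes."""
    writers = {
        "data.csv": lambda path: df.to_csv(path, index=False),
        "data.pkl": df.to_pickle,
    }
    try:
        import pyarrow  # noqa: F401  (used by to_parquet and to_feather)

        writers["data.parquet"] = df.to_parquet
        writers["data.feather"] = df.to_feather
    except ImportError:
        pass  # Assumption: skip the Arrow-based formats when pyarrow is missing.

    sizes = {}
    for name, write in writers.items():
        path = out_dir / name
        write(path)
        sizes[name] = path.stat().st_size
    return sizes


df = make_frame()
with tempfile.TemporaryDirectory() as tmp:
    out_dir = Path(tmp)
    sizes = save_all(df, out_dir)
    restored = pd.read_pickle(out_dir / "data.pkl")
    csv_back = pd.read_csv(out_dir / "data.csv")

for name, size in sizes.items():
    print(f"{name}: {size:,} bytes")
print("pickle kept dtypes:", restored.dtypes.equals(df.dtypes))
print("CSV kept dtypes:", csv_back.dtypes.equals(df.dtypes))
```

File sizes and timings will depend on your data, so treat any numbers from this sketch as a starting point rather than a benchmark.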
Timeline
00:00 Intro
00:49 Creating our Data
02:08 CSVs
04:39 Setting dtypes for CSVs
06:15 Pickle Files
07:16 Parquet ❤️
09:07 Feather
10:31 Other Options
11:02 Benchmarking
12:19 Takeaways
12:43 Outro
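As a rough illustration of the "Setting dtypes for CSVs" chapter, the sketch below shows how a CSV round trip drops dtype information and how passing `dtype` and `parse_dates` to `pd.read_csv` can restore it. The columns and values are hypothetical and are not taken from the video.

```python
import io

import pandas as pd

# Hypothetical example data with non-default dtypes.
df = pd.DataFrame(
    {
        "id": pd.Series([1, 2, 3], dtype="int32"),
        "category": pd.Categorical(["a", "b", "a"]),
        "when": pd.to_datetime(["2023-01-01", "2023-01-02", "2023-01-03"]),
    }
)

# Write the CSV to an in-memory buffer instead of a file on disk.
buffer = io.StringIO()
df.to_csv(buffer, index=False)

# Reading without hints: pandas has to guess every column's dtype.
buffer.seek(0)
naive = pd.read_csv(buffer)

# Reading with explicit dtypes and date parsing.
buffer.seek(0)
typed = pd.read_csv(
    buffer,
    dtype={"id": "int32", "category": "category"},
    parse_dates=["when"],
)

print(naive.dtypes)
print(typed.dtypes)
```

The downside of this approach is that the dtype mapping lives in your code rather than in the file, which is part of the case for formats that store the schema alongside the data.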
Code Gist:
Follow me on twitch for live coding streams:
Other Videos: