Uncluster Your Data Science Using Vaex • Maarten Breddels & Jovan Veljanoski • GOTO 2021
This presentation was recorded at GOTO Copenhagen 2021. #GOTOcon #GOTOcph
Maarten Breddels - Independent developer and consultant, co-founder of
Jovan Veljanoski - Machine learning specialist at Cloud Technology Solutions and co-founder of
ABSTRACT
Would you like to build an snappy dashboard visualising hundreds of millions of data points, or interactively explore hundreds of Gigabytes of data, all of that using a single machine?
Meet Vaex - an out of core DataFrame library in Python that can do all the typical data manipulations, filtering, and aggregations on a billion rows in real time & on a single computer. This approach empowers your team and allows them to focus much more on the business problem, as it removes the large DevOps overhead of configuring and maintaining a cluster.
Vaex fully supports Apache Arrow, which both facilitates the interoperability with other systems and enables storage and manipulation of more complex data structures like lists [...]
TIMECODE
1 view
0
0
3 years ago 00:38:25 1
Uncluster Your Data Science Using Vaex • Maarten Breddels & Jovan Veljanoski • GOTO 2021