Pandas is a well-known library for data scientist yet it does not scale. Koalas provides a pandas interface for big data manipulation in spark environment. In this session, koalas usage will be demonstrated for distributed data frame manipulation.
Speaker: Mr. Wong Ho Wa / Hong Kong - GitHub, Twitter, LinkedIn
Language: Cantonese
Date and Time : October 9, 2021 / 14:00-14:30 (UTC+8)
Speaker Introduction
Wong Ho Wa is currently a data scientist in a luxury fashion industry and conveyor of open data working group, Internet Society Hong Kong.