Toronto Apache Spark Meetup Group

Wednesday May 31, 2017
@ 6pm - 8pm EST
Downtown Toronto TBD

Add to your Calendar

Join SWI on May 31st, 2017 for the Toronto Apache Spark Meetup Group

RDD, DataFrame and Dataset – Comparing API & Performance Benchmarks

Agenda:
6:30PM to 7:00PM – Opening and networking
7:00PM to 8:30PM – RDD, DataFrame and Dataset – Comparing API & Performance Benchmarks by Eyal Edelman
8:30PM to 9:00PM – Networking

Location: Ramada Plaza Toronto Downtown, 300 Jarvis St, Toronto, ON M5B 2C5

Event Description:
Spark is continually advancing its capabilities. Originally the only distributed collection offered by Spark was the Resilient Distributed Dataset (RDD). Since then, DataFrame and Dataset have been introduced. In this presentation we will review the three Spark distributed collections, their API differences, and also compare their performance in Spark versions 1.6 and 2.1.

Target audience: Data Scientist, Data Engineer, Data Analyst and Spark Developers

Level: Intermediate to Advance

Speaker: Eyal Edelman is the Big Data Practice Lead and senior consultant at SWI. He is a Big Data Architect, Spark expert and a Microsoft Certified Solution Expert in Business Intelligence. Eyal has extensive experience in optimizing both SQL and Big Data solutions. He holds a Bachelor’s degree in Computer Science, a Master’s in Business Administration (MBA) and a PMP Project Management certification. With over 25 years of experience in architecting and implementing Data Systems, Eyal’s extensive background allows him to be an effective liaison between Business and Technology and deliver top notch technical solutions that provide real business value.

To register online click here.