Hadoop Platform as a Service (PaaS) is here!

Over the past few years, businesses have been increasingly turning to the cloud to meet their Hadoop and Big Data needs. Cloud offerings provide varying levels of virtualization support to organizations, from simple virtual machine hosting to full-blown cloud solutions. One way to explain the various cloud models is using the analogy of “Pizza as […]

read more

Deliver Business Value Faster with Spark Machine Learning

Apache Spark Machine Learning (Spark ML) was introduced in Spark Version 1.4 and serves as a comprehensive solution for Machine Learning based on the Apache Spark computation engine. Spark ML introduces automation for the Machine Learning process via the Machine Learning Pipeline (ML Pipeline). Spark ML is based on the Apache Spark platform meaning it […]

read more

Spark RDD – Getting to the bottom records…

Apache Spark is the leading computation engine for Big Data and Analytics and has revolutionized the way we handle big data. Its API, the Resilient Distributed Dataset (RDD), is a powerful and robust tool which makes distributed computation tasks easy. There are numerous advantages to Apache Spark, making it a top choice for many organizations. […]

read more

Apache Spark: RDD, DataFrame and Dataset – API comparison and Performance Benchmark

Apache Spark is one of the most popular fast in-memory computation engines in the Big Data Space. Apache Spark includes SQL support and a rich Machine Learning library, which makes it a favourite choice for Analytics processing. Apache Spark computations are performed on distributed object collections. The Resilient Distributed Dataset (RDD) was the first type […]

read more

Emerging Technology Discussed at the TSX Equities Trading Conference

Wednesday May 17, 2017 was the annual TSX Equities Trading Conference. SWI was once again a proud sponsor of the event, and even had some of our employees on stage to help with the Toronto Stock Exchange open that day. Following the hype of the market open was a day packed with interesting panels in […]

read more

Microsoft’s Cortana Intelligence Suite at the TechConnex Big Data Meeting

Microsoft Black Belt Alex Fernandes presented at the most recent TechConnex Big Data Peer Group hosted by SWI. Alex, a Data Solutions Architect, presented an overview of the various components of the Microsoft Azure Cortana Intelligence Suite. The suite is a complete set of well integrated cloud services offered in the Microsoft Azure platform which includes […]

read more