Abstract

Polybase is Microsoft's newest way of integrating SQL Server with external systems such as Hadoop and Azure Blob Storage. In this talk, we will get an overview of the Polybase technology and then implement it to work with an on-premises Hadoop cluster as well as Azure Blob Storage. We will then dive into the mechanics of how Polybase works, using packet captures to understand the cross-network communication. Finally, we will look at ways of tuning common Polybase scenarios and look at a few less-common scenarios.


Slides

The slides are available in HTML 5 format. All modern browsers (including tablets and phones) should be able to navigate the slides successfully.

The slides are licensed under Creative Commons Attribution-ShareAlike.


Demo Code

The demonstration code is available on my GitHub repository. This includes all of the SQL queries as well as relevant data sets.

The source code is licensed under the terms offered by the GPL. The slides are licensed under Creative Commons Attribution-ShareAlike.


Links And Further Information