Working with Spark in Big Data Clusters

  • Benjamin Weissman
  • Enrico van de Laar


So far, we have queried data inside our SQL Server Big Data Cluster using external tables and T-SQL code. There is, however, another way to query data stored in the HDFS filesystem of the Big Data Cluster. As you read in Chapter 2, the Big Data Cluster architecture also includes Spark, which means we can use Spark to query that data as well.

Copyright information

© Benjamin Weissman and Enrico van de Laar 2020

Authors and Affiliations

  • Benjamin Weissman (1)
  • Enrico van de Laar (2)

  1. Nurnberg, Germany
  2. Drachten, The Netherlands
