RepoDatabricks (DBRX)Databricks (DBRX)published Feb 10, 2019seen 5d

databricks/LearningSparkV2

Scala

Open original ↗

Captured source

source ↗
published Feb 10, 2019seen 5dcaptured 10hhttp 200method plain

databricks/LearningSparkV2

Description: This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]

Language: Scala

License: Apache-2.0

Stars: 1397

Forks: 796

Open issues: 3

Created: 2019-02-10T05:17:50Z

Pushed: 2025-01-28T04:30:40Z

Default branch: master

Fork: no

Archived: no

README:

Learning Spark 2nd Edition

Welcome to the GitHub repo for Learning Spark 2nd Edition.

Chapters [2](chapter2/README.md), [3](chapter3/README.md), [6](chapter6/README.md), and [7](chapter7/README.md) contain stand-alone Spark applications. You can build all the JAR files for each chapter by running the Python script: python build_jars.py. Or you can cd to the chapter directory and build jars as specified in each README. Also, include $SPARK_HOME/bin in $PATH so that you don't have to prefix SPARK_HOME/bin/spark-submit for these standalone applications.

For all the other chapters, we have provided notebooks in the [notebooks](notebooks) folder. We have also included notebook equivalents for a few of the stand-alone Spark applications in the aforementioned chapters.

Have Fun, Cheers!