Spark 3? #5

Open nigel.stanger opened this issue on 26 Oct - 0 comments

nigel.stanger commented on 26 Oct

The current Docker image uses:

  • Scala 2.11
  • Spark 2.4.8 + Hadoop 2.7 (spark-2.4.8-bin-hadoop2.7)
  • GraphFrames 0.8.1 (graphframes-0.8.1-spark2.4-s_2.11)
  • Kafka 2.2.2 (kafka_2.11-2.2.2)
  • Java 8 (temurin-8-jdk)
  • Python 3.6 (slim-buster)

The main constraint seems to be the supported Scala version:

Package Scala version
Hadoop 2.7 2.12
Hadoop 3.2 2.13
GraphFrames 0.8.2 2.12
Kafka 3.5.1 2.12
Spark 3.x 2.12, 2.13

GraphFrames 0.8.2 only supports up to Spark 3.2.x.

Spark 3.2 supports Java 8/11 and deprecates Python 3.6. (Spark 3.4 supports Java 17.)

Let’s try:

Labels

Priority
default
Milestone
No milestone
Assignee
nigel.stanger
1 participant