Skip to content

[QST] SparkException ERROR ContextCleaner: Error cleaning broadcast #5328

Answered by tgravescs
unigrav-tech asked this question in General
Discussion options

You must be logged in to vote

It sounds like your environment is not setup properly. it looks like you are not properly using Spark GPU scheduling and executors are trying to use the same GPU. Alternatively if you didn't want to use GPU scheduling you could set Gpus up in process exclusive mode. But since you are using standalone its probably easiest just to configure it to do GPU scheduling.

2.start spark standalone ( 1 master and 3 works on the same machine)

Is there a reason you are using 3 workers on the same machine? You should just use a single worker since you only have 1 GPU.

please see instructions here: https://nvidia.github.io/spark-rapids/docs/get-started/getting-started-on-prem.html#spark-standalone-clu…

Replies: 5 comments

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Answer selected by sameerz
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
3 participants
Converted from issue

This discussion was converted from issue #5299 on April 27, 2022 15:13.