You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I noticed that we are using Typesafe config and that seems to be introducing serialization issues to the job as they are failing with the following exception :
Caused by: java.io.UTFDataFormatException: encoded string too long: 72887 bytes
The issue is hard to replicate and all I can provide at the moment are the stack traces. I will update the issue if I find a way to replicate it
Do you have any recommendation to deal with this issue?
AFAIR, the typesafe configs for LuceneRDD do not need to be serializable. If you use the typesafe config in your application make sure you use it within an object so that it is available to both driver and executors.
I am not using typesafe configs on my own application. The exception is coming from LuceneRDD itself.
I did another build removing all the references to it in LuceneRDD and it is working fine. I obviously miss the capability to add dynamic configurations so that's not a good solution.
I noticed that we are using Typesafe config and that seems to be introducing serialization issues to the job as they are failing with the following exception :
Caused by: java.io.UTFDataFormatException: encoded string too long: 72887 bytes
The issue is hard to replicate and all I can provide at the moment are the stack traces. I will update the issue if I find a way to replicate it
Do you have any recommendation to deal with this issue?
Similar issue : https://stackoverflow.com/questions/41505599/task-not-serializable-in-spark-caused-by-utfdataformatexception-encoded-string
The text was updated successfully, but these errors were encountered: