1

I am working on PST files, I have worked on writing custom record reader for a Mapreduce program for different input formats but this time it is going to be spark.

I am not getting any clue or documentation on implementing record readers in spark. Can some body help on this? Is it possible to implement this functionality in spark?

2
  • Did you try using sparkContext.hadoopRDD? Commented Sep 11, 2018 at 9:34
  • We can use that but I am not sure about the corresponding hadoop file format that has to be extended for PST file format.
    – BARATH
    Commented Sep 11, 2018 at 9:58

0

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.