γνώση
About

Access S3 Objects from Spark

Dec 31, 2016

The main references are

http://stackoverflow.com/questions/30851244/spark-read-file-from-s3-using-sc-textfile-s3n https://www.cloudera.com/documentation/enterprise/5-5-x/topics/spark_s3.html

As of 2016/12, use the s3a protocol for accessing S3 objects. Also append --packages org.apache.hadoop:hadoop-aws:2.7.3 to spark-shell or spark-submit.

γνώση

  • γνώση
  • jli05
  • jli05

Blog in financial markets, trading, and more broadly information processing