Scala code doesn't fetch S3 file

Tags:

amazon-web-services

amazon-s3

scala

I am trying to run an EMR Scalding job, and the Scala code is supposed to fetch the content of a text file located in an S3 bucket. The scala.io.Source library is mangling the S3 path.

I am passing the parameter runidfile to the EMR job:

--runidfile s3://my-bucket/input.txt

The Scala code does the following:

val runid_path = args("runidfile")
val runid = Source.fromFile(runid_path).getLines().mkString

The code somehow doesn't accept the "//" in the S3 path and I get an error:

Caused by: java.io.FileNotFoundException: s3:/my-bucket/input.txt (No such file or directory)
at java.io.FileInputStream.open(Native Method)
at java.io.FileInputStream.<init>(FileInputStream.java:146)
at scala.io.Source$.fromFile(Source.scala:90)
at scala.io.Source$.fromFile(Source.scala:75)
at scala.io.Source$.fromFile(Source.scala:53)
at com.move.scalding.userEvents.RecommenderValidator.<init>(RecommenderValidator.scala:37)

Is there any solution or a workaround to this? I tried using Source.fromURL, but S3 is not a valid protocol so it doesn't accept it.

asked Sep 16 '15 23:09

Rachit Raut

1 Answer

The scala.io.Source library is not meant to access files directly from Amazon S3. You need another library for that.
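
As for why the path shows up as s3:/my-bucket/input.txt in the error: Source.fromFile(name) wraps the string in a java.io.File, which treats it as a local path and, on Unix-like systems such as EMR nodes, collapses repeated slashes. A minimal sketch of that behavior follows; PathDemo and normalized are illustrative names, and the printed result assumes a Unix-like filesystem.

```scala
import java.io.File

object PathDemo {
  // Source.fromFile(name: String) opens new java.io.File(name),
  // so the "URI" is parsed as an ordinary local file path.
  def normalized(path: String): String = new File(path).getPath

  def main(args: Array[String]): Unit = {
    // On Linux this prints: s3:/my-bucket/input.txt
    println(normalized("s3://my-bucket/input.txt"))
  }
}
```

So the "//" is not rejected as such; the whole string is simply never interpreted as a URI.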

You can use the official Amazon S3 Java library (the AWS SDK for Java). Here is some sample code, pieced together from a related question and its answers:

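import scala.io.Source
import com.amazonaws.auth.BasicAWSCredentials
import com.amazonaws.services.s3.AmazonS3Client
import com.amazonaws.services.s3.model.GetObjectRequest

// "myKey" and "mySecretKey" are placeholders for your own credentials.
val credentials = new BasicAWSCredentials("myKey", "mySecretKey")
val s3Client = new AmazonS3Client(credentials)
val s3Object = s3Client.getObject(new GetObjectRequest("my-bucket", "input.txt"))
val myData = Source.fromInputStream(s3Object.getObjectContent())
val runid = myData.getLines().mkString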
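
Since the job receives a full s3:// URI through --runidfile while GetObjectRequest takes the bucket and key separately, the URI has to be split first. One possible way is sketched below with java.net.URI. S3Path and split are hypothetical names, and the sketch assumes the key contains only characters that are legal in a URI path (no spaces, for example).

```scala
import java.net.URI

object S3Path {
  // Hypothetical helper: splits "s3://bucket/key" into (bucket, key).
  def split(s3Uri: String): (String, String) = {
    val uri = new URI(s3Uri)
    require(uri.getScheme == "s3", s"not an s3:// URI: $s3Uri")
    val bucket = uri.getAuthority           // the part right after "s3://"
    val key = uri.getPath.stripPrefix("/")  // object key without the leading slash
    (bucket, key)
  }
}
```

With such a helper, `val (bucket, key) = S3Path.split(args("runidfile"))` could feed `new GetObjectRequest(bucket, key)` in place of the hard-coded bucket and key above.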

answered Sep 18 '22 21:09

Sven Koschnicke
