 

Spark Streaming: StreamingContext doesn't read data files

I'm new to Spark Streaming and I'm trying to get started with it using the Spark shell. Assume I have a directory called "dataTest" placed in the root directory of spark-1.2.0-bin-hadoop2.4.

The simple code that I want to test in the shell (after launching it with .\bin\spark-shell) is:

import org.apache.spark.streaming._
val ssc = new StreamingContext(sc, Seconds(2))           // 2-second batch interval
val data = ssc.textFileStream("dataTest")                // monitor the "dataTest" directory for new files
println("Nb lines is equal to= " + data.count())         // note: DStream.count() returns another DStream, not a number
data.foreachRDD { (rdd, time) => println(rdd.count()) }  // print the record count of each batch
ssc.start()
ssc.awaitTermination()

Then I copy some files into the "dataTest" directory (and I also tried renaming some existing files in that directory).

But unfortunately I don't get what I expect (i.e. no output at all, so it seems like ssc.textFileStream isn't picking up the files), just log lines like:

15/01/15 19:32:46 INFO JobScheduler: Added jobs for time 1421346766000 ms
15/01/15 19:32:46 INFO JobScheduler: Starting job streaming job 1421346766000 ms.0 from job set of time 1421346766000 ms
15/01/15 19:32:46 INFO SparkContext: Starting job: foreachRDD at <console>:20
15/01/15 19:32:46 INFO DAGScheduler: Job 69 finished: foreachRDD at <console>:20, took 0,000021 s
0
15/01/15 19:32:46 INFO JobScheduler: Finished job streaming job 1421346766000 ms.0 from job set of time 1421346766000 ms
15/01/15 19:32:46 INFO MappedRDD: Removing RDD 137 from persistence list
15/01/15 19:32:46 INFO JobScheduler: Total delay: 0,005 s for time 1421346766000 ms (execution: 0,002 s)
15/01/15 19:32:46 INFO BlockManager: Removing RDD 137
15/01/15 19:32:46 INFO UnionRDD: Removing RDD 78 from persistence list
15/01/15 19:32:46 INFO BlockManager: Removing RDD 78
15/01/15 19:32:46 INFO FileInputDStream: Cleared 1 old files that were older than 1421346706000 ms: 1421346704000 ms
15/01/15 19:32:46 INFO ReceivedBlockTracker: Deleting batches ArrayBuffer()
asked Jan 15 '15 by Momog

People also ask

How do I stream data to Spark?

Spark Streaming divides the data stream into batches called DStreams, where each DStream is internally a sequence of RDDs. The RDDs are processed using Spark APIs, and the results are returned in batches. Spark Streaming provides an API in Scala, Java, and Python.
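As a minimal sketch of this model (assuming a Spark shell session where sc is already defined and a local "dataTest" directory), each batch becomes an RDD to which ordinary RDD-style transformations apply:

import org.apache.spark.streaming._
import org.apache.spark.streaming.StreamingContext._  // pair-DStream implicits (needed on Spark 1.2)

val ssc = new StreamingContext(sc, Seconds(2))        // one RDD per 2-second batch
val lines = ssc.textFileStream("dataTest")            // DStream[String] over new files
val words = lines.flatMap(_.split(" "))               // RDD-style transformation, lifted to the stream
val counts = words.map(word => (word, 1)).reduceByKey(_ + _)  // per-batch word count
counts.print()                                        // prints the first elements of each batch's RDD
ssc.start()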

Is Spark Streaming deprecated?

Now that the Direct API of Spark Streaming (we currently have version 2.3.2) is deprecated, and we recently added the Confluent platform (which comes with Kafka 2.2.0) to our project, we plan to migrate these applications.

Which of the below Spark Streaming API is used to stream the data from HDFS directory and create a DStream?

socketTextStream(...), used in the quick example, creates a DStream from text data received over a TCP socket connection. Besides sockets, the core Spark Streaming API provides methods for creating DStreams from files and Akka actors as input sources; for an HDFS directory, textFileStream(...) creates a DStream from new files that appear in it.
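A short sketch of both kinds of sources (the host, port, and HDFS path below are placeholder assumptions):

import org.apache.spark.streaming._

val ssc = new StreamingContext(sc, Seconds(10))
val fromSocket = ssc.socketTextStream("localhost", 9999)             // TCP text source from the quick example
val fromFiles = ssc.textFileStream("hdfs://namenode:8020/data/in")   // DStream over new files in an HDFS directory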

Does Spark support data Streaming and analysis?

Spark Streaming is an integral part of the Spark core API for performing real-time data analytics. It lets us build scalable, high-throughput, fault-tolerant streaming applications over live data streams.


2 Answers

Did you try moving text files from another directory into the directory that is being monitored? For the file stream to work, you have to atomically put the files into the monitored directory, so that as soon as a file becomes visible in the directory listing, Spark can read all of its data (which may not be the case if you are copying files into the directory).

This is well documented in the "Basic Sources" subsection of the Spark Streaming programming guide.
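For example, here is a minimal sketch of such an atomic put (the staging path and file name are hypothetical; the move must stay on the same filesystem for ATOMIC_MOVE to succeed):

import java.nio.file.{Files, Paths, StandardCopyOption}

val staged = Paths.get("/tmp/staging/data-001.txt")          // write the complete file here first
val target = Paths.get("dataTest/data-001.txt")              // the directory watched by textFileStream
Files.move(staged, target, StandardCopyOption.ATOMIC_MOVE)   // the file appears in the listing fully written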

answered Sep 21 '22 by Tathagata Das


Copying the file from the command line, or saving the file directly into the directory, worked for me. A normal copy (e.g. from an IDE or file manager) may not update the file's modification date, and the streaming context monitors files by their modification date.
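If a stale modification time is the culprit, one workaround (a sketch with a hypothetical file name, not something from the answer above) is to "touch" the file after copying so its timestamp falls inside the current batch window:

import java.io.File

val f = new File("dataTest/copied.txt")          // file that was copied into the monitored directory
f.setLastModified(System.currentTimeMillis())    // FileInputDStream selects files by modification time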

answered Sep 19 '22 by Zeeshan Abbas