Ways to buffer REST response

Tags:

There's a REST endpoint, which serves large (tens of gigabytes) chunks of data to my application.
Application processes the data in it's own pace, and as incoming data volumes grow, I'm starting to hit REST endpoint timeout.
Meaning, processing speed is less then network throughoutput.
Unfortunately, there's no way to raise processing speed enough, as there's no "enough" - incoming data volumes may grow indefinitely.

I'm thinking of a way to store incoming data locally before processing, in order to release REST endpoint connection before timeout occurs.

What I've came up so far, is downloading incoming data to a temporary file and reading (processing) said file simultaneously using OutputStream/InputStream.
Sort of buffering, using a file.

This brings it's own problems:

what if processing speed becomes faster then downloading speed for some time and I get EOF?
file parser operates with ObjectInputStream and it behaves weird in cases of empty file/EOF
and so on

Are there conventional ways to do such a thing?
Are there alternative solutions?
Please provide some guidance.

Upd:

I'd like to point out: http server is out of my control.
Consider it to be a vendor data provider. They have many consumers and refuse to alter anything for just one.
Looks like we're the only ones to use all of their data, as our client app processing speed is far greater than their sample client performance metrics. Still, we can not match our app performance with network throughoutput.

Server does not support http range requests or pagination.
There's no way to divide data in chunks to load, as there's no filtering attribute to guarantee that every chunk will be small enough.

Shortly: we can download all the data in a given time before timeout occurs, but can not process it.
Having an adapter between inputstream and outpustream, to pefrorm as a blocking queue, will help a ton.

389

asked Apr 02 '18 15:04

miracle_the_V

1 Answers

You're using something like new ObjectInputStream(new FileInputStream(..._) and the solution for EOF could be wrapping the FileInputStream first in an WriterAwareStream which would block when hitting EOF as long a the writer is writing.

Anyway, in case latency don't matter much, I would not bother start processing before the download finished. Oftentimes, there isn't much you can do with an incomplete list of objects.

Maybe some memory-mapped-file-based queue like Chronicle-Queue may help you. It's faster than dealing with files directly and may be even simpler to use.

You could also implement a HugeBufferingInputStream internally using a queue, which reads from its input stream, and, in case it has a lot of data, it spits them out to disk. This may be a nice abstraction, completely hiding the buffering.

There's also FileBackedOutputStream in Guava, automatically switching from using memory to using a file when getting big, but I'm afraid, it's optimized for small sizes (with tens of gigabytes expected, there's no point of trying to use memory).

179

answered Sep 22 '22 11:09

maaartinus

Related questions
                            
                                JSch 0.1.53 session.connect() throws "End of IO Stream Read"
                            
                                Bullets not getting shot out of the gun
                            
                                Hierarchical enum in Java
                            
                                What are the possible problems caused by adding elements to unsynchronized ArrayList's object by multiple threads simultaneously?
                            
                                Why does an empty lambda and constructor with an explicit return cause a compiler error (Java Bug?)
                            
                                Make Java GC logs show MB or GB instead of KB
                            
                                Cookie is not set on localhost in chrome or firefox
                            
                                Issue with type parameter: "cannot select from parameterized type"
                            
                                Jenkins slave went offline during build
                            
                                Spring Boot in memory database H2 doesn't load data from file on initialization
                            
                                Spring Boot Hibernate 5 Ignoring @Table and @Column
                            
                                Why is assertEquals(Object[], Object[]) from JUnit 4 deprecated?
                            
                                Understanding Example 12 All Permutations of a string from Big O notation - Cracking the Coding Interview
                            
                                How to protect @ConfigurationProperties classes from changes?
                            
                                Spring Boot / JUnit, run all unit-tests for multiple profiles
                            
                                Is there any way to detect month change in android calendar view(i.e. when user changes calendar to another month)
                            
                                AWS Lambda: ClassNotFoundException
                            
                                How to eliminate the "Eureka may be incorrectly claiming instances are up when they're not" warning on Eureka Dashboard?
                            
                                How do I customize the program name in a traybar notification in AWT?
                            
                                iterator() on parallel stream guarantee encounter order?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Ways to buffer REST response

Tags:

java

performance

rest

buffer

miracle_the_V

People also ask

1 Answers

maaartinus

Recent Activity

Donate For Us