I have a Java application that converts JSON messages to Parquet format. Is there any Parquet writer that writes to a buffer or byte stream in Java? Most of the examples I have seen write to files.
Parquet implements RLE in the RunLengthBitPackingHybridEncoder, which is used by the RunLengthBitPackingHybridValuesWriter for encoding and writing column values. This encoder only supports dictionary indices, boolean values, and repetition and definition levels in the data pages.
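To illustrate the run-length idea itself, here is a toy sketch of the concept in plain Java; note this is purely conceptual and is not the actual Parquet encoder API:

import java.util.ArrayList;
import java.util.List;

public class RleSketch {

    // Run-length encode a sequence of small integers (such as definition
    // levels): each run of identical values becomes a (count, value) pair.
    static List<int[]> encode(int[] levels) {
        List<int[]> runs = new ArrayList<>();
        int i = 0;
        while (i < levels.length) {
            int value = levels[i];
            int count = 0;
            while (i < levels.length && levels[i] == value) {
                count++;
                i++;
            }
            runs.add(new int[] { count, value });
        }
        return runs;
    }

    public static void main(String[] args) {
        // definition levels 1,1,1,0,0,1 -> (3 x 1), (2 x 0), (1 x 1)
        for (int[] run : encode(new int[] { 1, 1, 1, 0, 0, 1 })) {
            System.out.println(run[0] + " x " + run[1]);
        }
    }
}

The real encoder additionally bit-packs short runs and writes the result in Parquet's hybrid RLE/bit-packing format, but the savings on long runs of levels come from this same idea.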
Parquet has higher execution speed than other standard file formats such as Avro and JSON, and it also consumes less disk space than both. Parquet files are easier to work with because they are supported by so many different projects. Parquet stores the file schema in the file metadata, whereas CSV files store no metadata at all, so readers either need to be supplied with the schema or must infer it.
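You can see this for yourself by reading the schema straight out of the footer. A minimal sketch, assuming a local file path and the parquet-hadoop and hadoop-common artifacts on the classpath:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.parquet.hadoop.ParquetFileReader;
import org.apache.parquet.hadoop.util.HadoopInputFile;
import org.apache.parquet.schema.MessageType;

public class PrintSchema {
    public static void main(String[] args) throws Exception {
        // "data.parquet" is a placeholder path for this sketch
        try (ParquetFileReader reader = ParquetFileReader.open(
                HadoopInputFile.fromPath(new Path("data.parquet"), new Configuration()))) {
            // the schema is embedded in the file footer, so no external
            // metadata is needed to interpret the data
            MessageType schema = reader.getFileMetaData().getSchema();
            System.out.println(schema);
        }
    }
}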
TL;DR: you will need to implement OutputFile, e.g. something along the lines of:
import org.apache.parquet.io.OutputFile;
import org.apache.parquet.io.PositionOutputStream;

import java.io.BufferedOutputStream;
import java.io.IOException;

public class ParquetBufferedWriter implements OutputFile {

    private final BufferedOutputStream out;

    public ParquetBufferedWriter(BufferedOutputStream out) {
        this.out = out;
    }

    @Override
    public PositionOutputStream create(long blockSizeHint) throws IOException {
        return createPositionOutputStream();
    }

    private PositionOutputStream createPositionOutputStream() {
        return new PositionOutputStream() {

            @Override
            public long getPos() throws IOException {
                // NOTE: a real implementation must track the number of bytes
                // written, since Parquet records offsets in the file footer;
                // see the completed version further down
                return 0;
            }

            @Override
            public void write(int b) throws IOException {
                out.write(b);
            }
        };
    }

    @Override
    public PositionOutputStream createOrOverwrite(long blockSizeHint) throws IOException {
        return createPositionOutputStream();
    }

    @Override
    public boolean supportsBlockSize() {
        return false;
    }

    @Override
    public long defaultBlockSize() {
        return 0;
    }
}
And your writer would be something like:
// the constructor needs the target stream; targetStream here stands for
// whatever OutputStream the Parquet bytes should end up in
ParquetBufferedWriter out = new ParquetBufferedWriter(new BufferedOutputStream(targetStream));
try (ParquetWriter<Record> writer = AvroParquetWriter.<Record>builder(out)
        .withRowGroupSize(DEFAULT_BLOCK_SIZE)
        .withPageSize(DEFAULT_PAGE_SIZE)
        .withSchema(SCHEMA)
        .build()) {
    for (Record record : records) {
        writer.write(record);
    }
} catch (IOException e) {
    throw new IllegalStateException(e);
}
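Since the question asks for a byte stream specifically, one option is to back the writer with an in-memory stream and grab the bytes once writing is done. A sketch, assuming the ParquetBufferedWriter class above (note that the PositionOutputStream must forward flush() and close() to the underlying stream for this to work, as the completed class below does):

import java.io.BufferedOutputStream;
import java.io.ByteArrayOutputStream;

public class InMemoryExample {
    public static void main(String[] args) throws Exception {
        // capture the Parquet output entirely in memory
        ByteArrayOutputStream byteStream = new ByteArrayOutputStream();
        ParquetBufferedWriter out =
                new ParquetBufferedWriter(new BufferedOutputStream(byteStream));

        // ... build an AvroParquetWriter around "out" and write records,
        // exactly as in the snippet above, then close the writer ...

        // the complete Parquet file as a byte array; this is only valid
        // after the writer has been closed, since closing flushes the footer
        byte[] parquetBytes = byteStream.toByteArray();
        System.out.println(parquetBytes.length + " bytes of Parquet output");
    }
}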
I just also needed to write to a stream, so I completed the example given by naimdjon. The following works perfectly fine for me.
import org.apache.parquet.io.OutputFile;
import org.apache.parquet.io.PositionOutputStream;

import java.io.BufferedOutputStream;
import java.io.IOException;

class ParquetBufferedWriter implements OutputFile {

    private final BufferedOutputStream out;

    public ParquetBufferedWriter(BufferedOutputStream out) {
        this.out = out;
    }

    @Override
    public PositionOutputStream create(long blockSizeHint) throws IOException {
        return createPositionOutputStream();
    }

    private PositionOutputStream createPositionOutputStream() {
        return new PositionOutputStream() {

            // track how many bytes have been written; Parquet records these
            // offsets in the file footer, so getPos() must be accurate
            // (long rather than int, so files over 2 GiB don't overflow)
            long pos = 0;

            @Override
            public long getPos() throws IOException {
                return pos;
            }

            @Override
            public void flush() throws IOException {
                out.flush();
            }

            @Override
            public void close() throws IOException {
                out.close();
            }

            @Override
            public void write(int b) throws IOException {
                out.write(b);
                pos++;
            }

            @Override
            public void write(byte[] b, int off, int len) throws IOException {
                out.write(b, off, len);
                pos += len;
            }
        };
    }

    @Override
    public PositionOutputStream createOrOverwrite(long blockSizeHint) throws IOException {
        return createPositionOutputStream();
    }

    @Override
    public boolean supportsBlockSize() {
        return false;
    }

    @Override
    public long defaultBlockSize() {
        return 0;
    }
}
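If you also need to read those in-memory bytes back, the same trick works in reverse: implement InputFile over a byte array. A rough sketch, assuming DelegatingSeekableInputStream from parquet-common, which fills in most of the stream plumbing:

import org.apache.parquet.io.DelegatingSeekableInputStream;
import org.apache.parquet.io.InputFile;
import org.apache.parquet.io.SeekableInputStream;

import java.io.ByteArrayInputStream;
import java.io.IOException;

// a minimal in-memory InputFile; "data" is assumed to hold a complete
// Parquet file, e.g. the byte[] captured from the writer above
class ParquetBufferedReader implements InputFile {

    private final byte[] data;

    ParquetBufferedReader(byte[] data) {
        this.data = data;
    }

    @Override
    public long getLength() throws IOException {
        return data.length;
    }

    @Override
    public SeekableInputStream newStream() throws IOException {
        ByteArrayInputStream in = new ByteArrayInputStream(data);
        return new DelegatingSeekableInputStream(in) {
            @Override
            public long getPos() throws IOException {
                // position = bytes consumed so far
                return data.length - in.available();
            }

            @Override
            public void seek(long newPos) throws IOException {
                // ByteArrayInputStream can seek cheaply via reset + skip
                in.reset();
                in.skip(newPos);
            }
        };
    }
}

A reader that accepts an InputFile (for example, the newer AvroParquetReader.builder overloads) can then consume the bytes directly, without ever touching the file system.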