I am trying to write a parquet file as follow in Apache Beam using Snappy compression
records.apply(FileIO.<GenericRecord>write().via(ParquetIO.sink(schema)).to(options.getOutput()));
I see that is possible to set AUTO
,GZIP
,BZIP2
,ZIP
and DEFLATE
as compression but I am unable to find a way to set it as SNAPPY
, any ideas how to do this? As reference, it is possible to do using wrting to avro as follow
records.apply("writeAvro", AvroIO.writeGenericRecords(schema).withCodec(CodecFactory.snappyCodec()).to(options.getOutput()));