I have a parquet file stored in S3 bucket. I want to get the list of all columns of the parquet file. I am using s3 select
but it just give me list of all rows wihtout any column headers.
Is there anyway to get all column names from this parquet file without downloading it completely? Since parquet file can be very large, I would not want to download the entire parquet file which is why I am using s3 select
to pick first few rows using
select * from S3Object LIMIT 10
I tried to fetch column names explicitly by doing
SELECT COLUMN_NAME FROM INFORMATION_SCHEMA.COLUMNS WHERE TABLE_NAME = 'S3Object'
but it would not work as AWS S3 doesn't support this yet.
Is there any other way to achieve the same?