How do I obtain the number of rows of a ParquetDataset that is stored as a folder containing multiple Parquet files?
I tried:
from pyarrow.parquet import ParquetDataset
a = ParquetDataset(path)
a.metadata
a.schema
a.common_metadata
I want to get the total number of rows without reading the dataset, as it can be quite large.
What's the best way to do that?