I'm using R shiny and dplyr to connect to a database and query the data in Impala. I do the following.
con <- dbPool(odbc(),
Driver = [DIVER],
Host = [HOST],
Schema = [SCHEMA],
Port = [PORT],
UID = [USERNAME],
PWD = [PASSWORD])
table_foo <- tbl(con, [TABLE_NAME_FOO])
table_bar <- tbl(con, [TABLE_NAME_BAR])
When I run a query for instance:
table %>% filter(name=greg) %>% collect()
There are describes that are being run on the impala:
DESCRIBE TABLE 'table_foo'
DESCRIBE TABLE 'table_bar'
All the describes run before every query.(Every collect()) I have many tables all these describes are wasting substantial amount of time. Especially on impala where some describes can take a while to run.
Is there a way to turn these off? Is there anything I can do? I looked at the docs and could not find anything: https://db.rstudio.com/dplyr/
describe table
to get metadata about resultset so it is a substantial part of querying tables. – Vincentiatable %>% filter(name=greg) %>% explain()
? Thanks. – Neurophysiology