I have very simple problem that produces error. Example will clear this one.
library(odbc)
library(DBI)
library(dplyr)
library(dbplyr)
con <- dbConnect(odbc(), "myDSN")
tbl_test <- tibble(ID = c("A", "A", "A", "B", "B", "B"),
val = c(1, 2, 3, 4, 5, 6),
cond = c("H", "H", "A", "A", "A", "H"))
dbWriteTable(con, "tbl_test", tbl_test, overwrite = TRUE)
After writing simple table to DB I add link to table in db and try to use simple conditional sums that work normally. But will face an error.
db_tbl <- tbl(con, in_schema("dbo", "tbl_test"))
db_tbl %>%
group_by(ID) %>%
summarise(sum = sum(val, na.rm = TRUE),
count_cond = sum(cond == "H", na.rm=TRUE),
sum_cond = sum(val == "H", na.rm=TRUE))
Error: <SQL> 'SELECT TOP 10 "ID", SUM("val") AS "sum", SUM(CONVERT(BIT, IIF("cond" = 'H', 1.0, 0.0))) AS "count_cond", SUM(CONVERT(BIT, IIF("val" = 'H', 1.0, 0.0))) AS "sum_cond"
FROM dbo.tbl_test
GROUP BY "ID"'
nanodbc/nanodbc.cpp:1587: 42000: [Microsoft][ODBC Driver 13 for SQL Server][SQL Server]Operand data type bit is invalid for sum operator.
I'm no expert, but feels like SQL can't understand TRUE as 1 and for that reason can't calculate sums. Is there away around this, as alot of times I face some kind of conditions. Below is just code for normal tibble to show that they should work.
tbl_test %>%
group_by(ID) %>%
summarise(sum = sum(val),
count_cond = sum(cond == "H"),
sum_cond = sum(val[cond == "H"]))
# A tibble: 2 x 4
ID sum count_cond sum_cond
<chr> <dbl> <int> <dbl>
1 A 6. 2 3.
2 B 15. 1 6.
I Understand that this might not be reproducible example, as not everyone have DB connection available.
https://stackoverflow.com/questions/582637/sql-server-equivalent-of-a-countif-aggregate-function
, so useifelse
orcasewhen
, it seem you were right and you can't sum booleans withSQL Server
– Clubmansum_cond = sum(ifelse(cond == "H",1,0),na.rm=TRUE)
– Clubmancondition
byifelse(condition,1,0)
and i think it should work – Clubman