Generate rows with incrementing dates based on just a starting date in Redshift

I'm dealing with a table of user subscription info, where each row is a specific user along with the start date of their subscription and how many months they have pre-paid. I'm trying to break this table out so that there's one row per month. I'm on Redshift, and the only other answers I've found suggest generate_series, which doesn't always work on Redshift.

Starting data:

userid  |  amount_paid  |  start_date  |  months
------------------------------------------------
asdf    |  20.00        | 2020-01-01   |  1
------------------------------------------------
qwer    |  10.00        | 2021-06-01   |  3

Desired results (months column value doesn't matter but I'd like amount_paid to be 0 or null for new rows):

userid  |  amount_paid  |  start_date  |  months
------------------------------------------------
asdf    |  20.00        | 2020-01-01   |  1
------------------------------------------------
qwer    |  10.00        | 2021-06-01   |  3
------------------------------------------------
qwer    |  0            | 2021-07-01   |  3
------------------------------------------------
qwer    |  0            | 2021-08-01   |  3

On Redshift, as you have seen, generate_series can't be used to produce rows that are then joined against your table data: it runs only on the leader node, so it fails as soon as the query touches a regular table. A simple replacement is a recursive CTE that generates the numbers you are looking for.

with recursive numbers(n) as
( select 1 as n          -- anchor row
    union all
    select n + 1         -- recursive step
    from numbers
    where n < 500        -- stop once 500 has been produced
    )
select n from numbers;

The above produces the numbers between 1 and 500.
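To get from the number list to one row per prepaid month, you can join it to your table on n <= months and shift the start date by n - 1 months. The following is only a sketch: the table name subscriptions is my assumption (the question doesn't name it), and the upper bound of 36 is an arbitrary guess at the longest prepaid period.

```sql
-- Assumption: the source table is named "subscriptions" (hypothetical name).
with recursive numbers(n) as (
    select 1 as n
    union all
    select n + 1
    from numbers
    where n < 36            -- assumed maximum number of prepaid months
)
select s.userid,
       -- keep the payment on the first month only, 0 on the generated rows
       case when num.n = 1 then s.amount_paid else 0 end as amount_paid,
       -- dateadd returns a timestamp on Redshift, so cast back to date
       dateadd(month, num.n - 1, s.start_date)::date as start_date,
       s.months
from subscriptions s
join numbers num
  on num.n <= s.months      -- one row per prepaid month
order by s.userid, num.n;
```

With the sample data, asdf (months = 1) keeps its single row, and qwer (months = 3) expands to 2021-06-01, 2021-07-01 and 2021-08-01, with amount_paid set to 0 on the last two.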

If your tables are large and the performance of the resulting query matters, think about how this generated set of numbers is distributed, as that can affect the query plan. You can create a permanent numbers table on Redshift with DISTSTYLE ALL, which keeps a full copy on every node, so that the overall query plan can be better optimized, especially when performing a cross join.
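A sketch of such a numbers table, built once with CREATE TABLE AS from the same recursive CTE. The table name numbers, the CTE name seq, the sort key and the limit of 500 are illustrative choices of mine, not requirements.

```sql
-- One-time setup: a small, permanent numbers table copied to every node.
-- Names (numbers, seq) and the limit of 500 are illustrative assumptions.
create table numbers
diststyle all
sortkey (n)
as
with recursive seq(n) as (
    select 1 as n
    union all
    select n + 1
    from seq
    where n < 500
)
select n from seq;
```

Queries can then join to numbers directly instead of repeating the CTE each time.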
