I am new to NATS Jetstream and I have been reading their official documentation to understand its concepts and compare it with Kafka. One of the major use cases I have, is to solve for message/event ordering based on a particular id (like a partition key
in the Kafka world).
For example, there are several update events coming for an Order
entity and my system needs to consume the events for a particular Order
in the same order. In this case, I would use the order-id
as the partition key while publishing to the Kafka topic. How do I accomplish this in Jetstream?
I have come across a de-duplication key (Nats-Msg-Id
) in Jetstream, but I think this feature is more synonymous with topic compaction in Kafka. Am I right?
Nevertheless, I have written the following code in Golang for publishing:
order = Order{
OrderId: orderId,
Status: status,
}
orderJson, _ := json.Marshal(order)
dedupKey := nats.MsgId(order.OrderId)
_, err := js.Publish(subjectName, orderJson, dedupKey)
Am I doing this right? Will all orders for a particular orderId go to the same consumer within a consumer group in the Jetstream world, hence maintaining the sequence?
Edit 1
This is what I get from @tbeets' suggestion. For example, I have predefined 10 stream subjects like ORDER.1
, ORDER.2
,ORDER.3
.... ORDER.10
On the publishing side, I can do an order-id%10+1
to find the exact stream subject to which I would want to publish. So here, we have accomplished that all update events for the same orderId will go to the same stream subject every time.
Now, on the subscriber side, I have 10 consumer groups (there are 10 consumers within each consumer group) and each consume from a particular stream subject, like consumerGroup-1
consumes from ORDER.1
, consumerGroup-2
consumes from ORDER.2
and so on...
Say, 2 order update events came for order-id
111, which would get mapped to ORDER.1
stream subject, and correspondingly consumerGroup-1
will consume these 2 events. But within this consumerGroup, the 2 update events can go to different consumers and if one of the consumers is a bit busy or slow, then at an overall level, the order update events consumption maybe out-of-sync or out-of-order.
Kafka solves this using the concept of partition key as consumers of a consumer group are allocated to a particular partition. Hence, all events for the same orderId, are consumed by the same consumer, hence, maintaining the sequence of order update event consumption. How do I solve this issue in Jetstream?
consumerGroup-1
,consumerGroup-2
....) Check this out: github.com/nats-io/nats-architecture-and-design/pull/36/files Jetstream is planning to solve for this use-case in the future. – Dedrick