Thanks Marc for the response. If we created say 100 partitions, and had 10 queues to start with.
10 app instances would consume off the 10 queues and consume 10 partitions each. If we could run 10 threads within each instance and have each thread process a single partition only and maintain order it would be preferable than running 100 instances as it’s more difficult to manage and rollout and allocate resources in shared kubernetes environment for example.
Any thoughts much appreciated.