On dense tiers like warm and cold, the page cache can only cover a tiny fraction of the index, and disk access is expensive because these tiers often use spinning disks.
Could we reorder items in the search queue so that shard requests hitting the same shard are grouped together, in order to maximize the hit ratio of the page cache?
This is probably something we should avoid doing on the content tier, where it could trade latency for throughput in undesirable ways.
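
To make the idea concrete, here is a minimal sketch of the reordering in Python. It is not Elasticsearch code: the `(shard_id, payload)` request shape, the `group_by_shard` function, and the `window` parameter are all hypothetical names chosen for illustration. The sketch assumes that bounding the reordering to a fixed window is one possible way to limit how far any single request can be pushed back.

```python
from typing import Dict, Hashable, Iterable, List, Tuple

# Hypothetical request shape: (shard_id, payload). Not an Elasticsearch type.
ShardRequest = Tuple[Hashable, str]


def group_by_shard(queue: Iterable[ShardRequest], window: int = 64) -> List[ShardRequest]:
    """Reorder queued requests so requests for the same shard run back to back.

    Requests are regrouped only within consecutive chunks of `window` items,
    so a request never moves outside its own chunk. Within a chunk, shards
    are served in order of first arrival, and requests for one shard keep
    their relative order.
    """
    if window < 1:
        raise ValueError("window must be at least 1")

    items = list(queue)
    reordered: List[ShardRequest] = []
    for start in range(0, len(items), window):
        # dict preserves insertion order, so shards stay in first-arrival order.
        buckets: Dict[Hashable, List[ShardRequest]] = {}
        for request in items[start:start + window]:
            buckets.setdefault(request[0], []).append(request)
        for same_shard in buckets.values():
            reordered.extend(same_shard)
    return reordered


queue = [("s1", "a"), ("s2", "b"), ("s1", "c"), ("s3", "d"), ("s2", "e")]
grouped = group_by_shard(queue)
print(grouped)
```

With the default window, the two `s1` requests and the two `s2` requests end up adjacent. A small window keeps the order close to arrival order (favoring latency), while a large window groups more aggressively (favoring cache hits), which is the trade-off raised above for the content tier.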