Check out the new USENIX Web site. next up previous
Next: Related Work Up: Evaluation Previous: Pareto Query Arrivals


Zipf-like Key Distributions

A recent study has shown that queries for multiple keys in a peer-to-peer network follow a Zipf-like distribution, with a small portion of the keys getting the most queries [12]. That is, the number of queries received by the i'th most popular key is proportional to $ \frac{1} {i^\alpha}$ for constant $ \alpha$.

In this section we compare CUP with PCX in a network of 1024 nodes, where each node owns one key. The query distribution among the 1024 keys follows a Zipf-like distribution with parameter $ \alpha = 1.2$. Table 4 shows results for Poisson arrivals where the overall $ \lambda$ rates are 100, 1000, 10000, and 100000 queries per second. (We also ran experiments with $ \alpha$ = 0.80 and 2.40 and with Pareto arrivals, and the results were similar.)

From the table we see that CUP outperforms PCX with IR ranging from 6.57 to 30.02. The latency reduction ranges from 3.2 (for 100 q/s) to an order of magnitude reduction (for 100000 q/s, latency dropped from 1.53 to 0.13). The Zipf-like distribution causes some of the keys to get a large percentage of the queries, leaving others to be asked for quite rarely. For rare keys, caching does not help since the entry expires by the time the key is queried for again, and the query rate for these keys is not high enough to recover the update propagation. However, the IR for the very hot keys is high enough to by far offset the unrecovered cost of the unpopular keys. As a result, CUP achieves an overall IR of at least 6.57 for 100 q/s and as much as 30.02 for 100000 q/s.


Table: Cross-Key Comparison of CUP with PCX, for Poisson arrivals and Zipf-like key distribution
Overall AvgRate q/s 100 1000 10000 100000
CUP/PCX MissCost 0.45 0.23 0.10 0.08
PCX AvgLat ($ \sigma$) 10.6 (9.9) 6.9 (8.9) 3.4 (7.5) 1.53 (5.47)
CUP AvgLat ($ \sigma$) 7.4 (8.5) 2.6 (5.2) 0.4 (2.7) 0.13 (1.67)
IR 6.57 8.52 10.98 30.02


next up previous
Next: Related Work Up: Evaluation Previous: Pareto Query Arrivals
Mema Roussopoulos 2003-04-04