• Donate
  • Log In
Home
  • About
    • About
      • About Us
      • Our Board of Directors
      • Board Meeting Minutes
      • Board Elections
      • Updates & Announcements
      • Our Staff
      • Governance & Financials
      • Lifetime Achievement Award
  • Events
    • Events
      • Upcoming
      • Past
      • Conference FAQ
      • Conference Policies
      • Code of Conduct
      • Calls for Papers
      • Author Resources
      • Grant Opportunities
      • Best Papers
      • Test of Time Awards
  • Join & Support
    • Join & Support
      • Become a Member
      • Ways to Give
      • Our Supporters
      • Student Opportunities
      • Sponsorship Opportunities
  • Archive
    • Archive
      • Proceedings
      • Multimedia
      • ;login: Archive
      • Short Topics in System Administration Series
      • Journal of Education in System Administration (JESA)
      • Journal of Election Technology and Systems (JETS)
      • Computing Systems Journal
  • Search
Join the conversation
Back to ;login: Online

Analysis of USENIX Paper Downloads

July 30, 2024
Column
Authors: 
Rik Farrow

In September of 2022, I started receiving the web log files for usenix.org. I felt I needed to see what articles were being downloaded, and the current system didn't provide the insights I was looking for in a timely manner. I filtered out log entries for publications/login, counted the download frequencies, and sorted these with the most popular first.

While I might think that a particular topic will be interesting to lots of people, I am not always right. And as an editor who doesn't pay authors, I really, really, want people to read the articles that they have taken the time to write. I have a responsibility to authors, one I only get to exercise by carefully curating the authors of papers about topics I think will be very popular.

Over time, I noticed a couple of things. Sometimes an article might remain popular for over a year, while other articles would suddenly become popular for a period of just weeks. In both cases, the initial surge in popularity often involves promotion by people other than the authors in places like Slashdot and HackerNews. And sometimes an article just fills a particular need through its thorough explanation of some technical topic.

I noticed a prominent example of how promotion affects downloads the second time I took a look at papers. An OSDI paper had an unusually high number of downloads in the September-October 2023 timeframe: 1818. I asked Ding Yuan, the lead author, if he had any idea what might have caused this increase in activity. Ding discovered that a post on X by Kevlin Henney on September 6 had resulted in about 20% of his followers downloading the paper.

Besides papers and ;login: articles leaping in popularity because of promotion, some stay near the top for other reasons. I skimmed the top 20 papers from June 2024 trying to determine what it was about these papers that made them so popular. I first noticed that they were all well-written, but that's really not uncommon for highly-rated papers.

The other things all these papers had in common was that they weren't just introducing some new research or software: they did a great job of teaching about the issues involved. Usually, section two of a paper covers related-work, while section one provides motivation for why this particular work deserves to be published. Between these two sections, you can learn a lot about a topic. And that's why you find papers about Meta's Haystack, Google's Transformer, and Yahoo's ZooKeeper papers in the top 20 (Table 1).

Table 1: Top paper downloads during the month of June 2024

 

IndexTitle DescriptionDownloads
1The Multi Router Traffic Grapher and RRDtoolDescription of MRTG and RRDtool, a binary logging tool for time-sequence data939
2Scaling Memcache at FacebookHow Facebook scaled memcached to thousands of servers874
3Replication: No One Can Hack My MindSurvey of security advice from experts and non-experts845
4In the Compression Hornet’s NestDenial of service attacks when Deflate is used in Apache HTTPD Tomcat and other services772
5An Analysis of Private Browsing Modes in Modern BrowsersEvaluation of private browsing in four major browsers, inconsistencies and failures743
6How the Great Firewall of China Detects and Blocks Fully Encrypted TrafficBlocking encrypted traffic based on passive traffic analysis642
7REX: A Development Platform and Online Learning ApproachDana, a component-based programming language, an assembly and learning framework, and an online learning implementation that altogether allows for runtime optimization533
8Extracting Training Data from Large Language ModelsTraining data from LLMs (GPT2) can be recovered540
9TensorFlow: A System for Large-Scale Machine LearningOne of the foundational papers leading to LLMs: describes a dataflow graph to represent both the computation in an algorithm and the state533
10Is Real-time Phishing Eliminated with FIDO?Downgrade attack against the use of two-factor authentication that uses the FIDO protocol493
11Resilient Distributed Datasets: A Fault-Tolerant Abstraction for In-Memory Cluster ComputingImprovement to Hadoop by caching datasets, Spark475
12Remote Exploitation of Memory Corruptions in Cellular Protocol StacksDemonstration of attacks against the radio processor on smartphones462
13Orca: A Distributed Serving System for Transformer-Based Generative ModelsNew scheduling mechanism improves performance of LLM inference procedures459
14Spark: Cluster Computing with Working SetsAdding of working sets for MapReduce and interactive analytics using a Dryad-like interface451
15Scalability! But at what COST?Workshop paper that explains COST—the Configuration that Outperforms a Single Thread—showing that many data parallel systems are either slower than a single threaded solution or have high COST440
16Finding a needle in Haystack: Facebook’s photo storageFacebook's Haystack photo storage system keeps metadata in memory376
17Andromeda: Performance, Isolation, and Velocity at ScaleGoogle's Andromeda cloud network virtualization for isolation and performance373
18Fingerprinting Obfuscated Proxy Traffic with Encapsulated TLS HandshakesUncovering obfuscated proxy traffic such is done by Great Firewall of China373
19ZookeeperDescription of Yahoo's ZooKeeper system for coordinating distributed services353
20Amazon DynamoDBA fully distributed NoSQL database supporting multiple tenants, limitless tables, predictable and reliable performance343

At the end of this article, I've included Table 2 with the top 100 paper downloads from July 2024. If you attend to the downloads column, you'll notice that there is a quick dropoff in number of downloads. I graphed the top 3000 downloads against their index numbers, and you can see (Figure 1) just how steeply downloads drop off. When you consider that there are nearly 31 thousand papers represented in the logfiles, this might seem very unfair that a relative handful appear the very popular. I suggest keeping in mind a couple of things: one, that papers about famous software are going to be downloaded more often, and two, that promotion can briefly push a paper to the top of the list.

Figure 1: Download frequency vs paper index; relatively few papers get downloaded often.

I produced two lists of the top 100 paper downloads, one from the end of 2023 and the other from July 2024, and only 31 papers are in both lists. In other words, there is a fair amount of churn happening over time.

I didn't just look at the top papers either. I skimmed the paper with index number 3000, a workshop paper from HotPar'11 by Hans Boehm where he points out that there are no benign data races. A bit obscure, certainly, but still an interesting enough workshop paper.

One of the last entries in the list was the slide deck from a LEET'10 presentation about botnets. These days, it seems that no one talks about botnets, and looking at the web logs, only crawler bots actually visited this link. Still, I found the slides interesting as the botnot it described was very advanced compared to those from the 90s, with multiple tiers for command and control.

Papers with ten or less downloads, starting almost halfway down the list, are still being downloaded by other than bots. Unless the browser information has been falsified, that LEET presentation mentioned above was just downloaded by three bots, and nothing else.

Finally, there's the matter of conference popularity and how that affects the papers downloaded. If you go back to the early days of USENIX, there were just two conferences: USENIX Summer and USENIX Winter. All topics were included in those conferences, where the main difference between them, besides the season, is one happened near the East Coast and the other on the West Coast. Starting around 1990, conferences began appearing that covered a particular topic area, like system administration or security— the first two new conferences. Figure 2 shows the binning of downloads when separated into conference categories.

Figure 2: Downloads binned by conference.

Security dwarfs all other categories. If you wonder why this is, just consider that there were over 400 papers at Security'23, and Security'24 has even more. Some conferences, like SRE, have no papers at all, but they do have some presentation slides and all presentations appear on YouTube as videos, data not included in this analysis. LISA had few papers, but one from LISA'98 by Toby Oetiker about RRDTool is often in the top 100.

I'm closing this brief analysis with Table 2, the top 100 downloads during July 2024. When I compared this list to the one from June, 37 papers were the same. If you are wondering about the outlier, Inference of Error Specifications and Bug Detection Using Structural Similarities by Dossche and Coppens with over 10,000 downloads, I quickly found a posting on X by Winson Tang, referring to this paper.

Table 2: Top 100 downloads of papers the month of July 2024
IndexURLDownloads
1https://usenix.org/system/files/sec24fall-prepub-93-dossche.pdf10060
2https://usenix.org/system/files/osdi24-chen-haibo.pdf3225
3https://usenix.org/system/files/atc24-li-hongyu.pdf2168
4https://usenix.org/events/osdi10/tech/full_papers/Xiong.pdf841
5https://usenix.org/conference/usenixsecurity15/sec15-paper-pellegrino.pdf684
6https://usenix.org/conference/hotstorage18/hotstorage18-paper-raju.pdf665
7https://usenix.org/conference/usenixsecurity16/sec16_paper_saltaformaggio.pdf545
8https://usenix.org/system/files/osdi22-yu.pdf523
9https://usenix.org/system/files/osdi24-sun-xudong.pdf513
10https://usenix.org/conference/nsdi13/nsdi13-final170_update.pdf512
11https://usenix.org/system/files/osdi24-choudhury.pdf497
12https://usenix.org/conference/osdi16/osdi16-abadi.pdf426
13https://usenix.org/event/hotcloud10/tech/full_papers/Zaharia.pdf390
14https://usenix.org/conference/nsdi12/nsdi12-final138.pdf373
15https://usenix.org/system/files/sec21-carlini-extracting.pdf372
16https://usenix.org/event/osdi10/tech/full_papers/Beaver.pdf361
17https://usenix.org/system/files/osdi24_full_proceedings.pdf352
18https://usenix.org/system/files/osdi24-zhong-yuhong.pdf336
19https://usenix.org/system/files/osdi24-skiadopoulos.pdf320
20https://usenix.org/system/files/osdi24-chow.pdf305
21https://usenix.org/events/sec09/tech/full_papers/crosby.pdf302
22https://usenix.org/conference/usenixsecurity18/sec18-staicu.pdf300
23https://usenix.org/conference/nsdi18/nsdi18-dalton.pdf297
24https://usenix.org/conference/hotos15/hotos15-paper-mcsherry.pdf294
25https://usenix.org/system/files/osdi24-lee.pdf294
26https://usenix.org/events/sec03/tech/full_papers/crosby/crosby.pdf291
27https://usenix.org/conference/usenixsecurity14/sec14-paper-yarom.pdf290
28https://usenix.org/conference/usenixsecurity17/sec17-tang.pdf283
29https://usenix.org/events/sec10/tech/full_papers/Aggarwal.pdf277
30https://usenix.org/system/files/login/articles/login_spring17_08_bratus.pdf276
31https://usenix.org/events/sec10/tech/full_papers/Felmetsger.pdf270
32https://usenix.org/system/files/sec20-lee-hyeonmin.pdf269
33https://usenix.org/system/files/woot20-paper-obermaier.pdf266
34https://usenix.org/system/files/sec20fall_lee_prepub_0.pdf264
35https://usenix.org/system/files/atc22-elhemali.pdf263
36https://usenix.org/events/fast11/tech/full_papers/Wei.pdf259
37https://usenix.org/system/files/sec20fall_reichel_prepub.pdf256
38https://usenix.org/system/files/sec19-cidon.pdf254
39https://usenix.org/system/files/sec20-oleksenko.pdf254
40https://usenix.org/system/files/usenixsecurity23-moghimi.pdf254
41https://usenix.org/events/sec01/full_papers/gutmann/gutmann.pdf254
42https://usenix.org/system/files/usenixsecurity23-feng.pdf251
43https://usenix.org/system/files/osdi24-sun-hao.pdf248
44https://usenix.org/event/atc10/tech/full_papers/Hunt.pdf246
45https://usenix.org/conference/usenixsecurity18/sec18-van_schaik.pdf245
46https://usenix.org/system/files/sec21-ragab.pdf243
47https://usenix.org/system/files/usenixsecurity23-ayalon.pdf233
48https://usenix.org/conference/atc13/atc13-bronson.pdf232
49https://usenix.org/conference/usenixsecurity17/sec17-antonakakis.pdf230
50https://usenix.org/system/files/sec23fall-prepub-234-wu-mingshi.pdf228
51https://usenix.org/system/files/osdi24-shubha.pdf219
52https://usenix.org/system/files/osdi24-zhou.pdf213
53https://usenix.org/system/files/sec23winter-prepub-107-bouhoula.pdf208
54https://usenix.org/system/files/sec21-hoang.pdf205
55https://usenix.org/conference/woot15/woot15-paper-peles.pdf200
56https://usenix.org/event/osdi10/tech/full_papers/Peng.pdf198
57https://usenix.org/system/files/nsdi24-jiang-ziheng.pdf195
58https://usenix.org/system/files/osdi24-fu.pdf191
59https://usenix.org/system/files/sec23fall-prepub-285_kim-jiwon.pdf188
60https://usenix.org/event/atc10/tech/full_papers/Carroll.pdf184
61https://usenix.org/system/files/nsdi24-hu.pdf183
62https://usenix.org/system/files/atc24-xiong.pdf180
63https://usenix.org/system/files/soups2019-busse.pdf178
64https://usenix.org/events/usenix99/provos/provos.pdf178
65https://usenix.org/system/files/sec21-specter-keyforge.pdf176
66https://usenix.org/system/files/sec20-bouwman.pdf176
67https://usenix.org/system/files/osdi24-agrawal.pdf174
68https://usenix.org/system/files/atc24-ma.pdf174
69https://usenix.org/events/usenix05/tech/general/gray/gray.pdf172
70https://usenix.org/system/files/nsdi20-paper-agache.pdf166
71https://usenix.org/conference/usenixsecurity14/sec14-paper-marczak.pdf165
72https://usenix.org/system/files/sec23winter-prepub-490-jia.pdf164
73https://usenix.org/system/files/osdi18-chen.pdf162
74https://usenix.org/system/files/sec24fall-prepub-1998-bocovich.pdf162
75https://usenix.org/system/files/osdi24-ma-haoran.pdf159
76https://usenix.org/system/files/osdi18-moritz.pdf157
77https://usenix.org/conference/atc17/atc17-hahn.pdf157
78https://usenix.org/system/files/osdi24-xiang.pdf156
79https://usenix.org/conference/hotcloud16/hotcloud16_burns.pdf155
80https://usenix.org/system/files/woot19-paper_ovadia.pdf155
81https://usenix.org/events/lisa98/lisa98.pdf155
82https://usenix.org/system/files/usenixsecurity23-carlini.pdf155
83https://usenix.org/system/files/usenixsecurity23-deng-jiangyi-v-cloak.pdf152
84https://usenix.org/system/files/woot19-paper_zhao.pdf151
85https://usenix.org/conference/fast16/fast16-papers-lu.pdf149
86https://usenix.org/system/files/atc21-chen.pdf149
87https://usenix.org/system/files/fast21-pan.pdf148
88https://usenix.org/event/usenix99/provos/provos.pdf148
89https://usenix.org/conference/atc12/atc12-final39.pdf146
90https://usenix.org/conference/osdi14/osdi14-paper-yuan.pdf143
91https://usenix.org/system/files/osdi22-zheng-lianmin.pdf142
92https://usenix.org/system/files/osdi24-sun-biao.pdf141
93https://usenix.org/system/files/osdi24-lazarev_1.pdf141
94https://usenix.org/system/files/osdi24-leblanc.pdf140
95https://usenix.org/system/files/osdi24-zhong-yinmin.pdf138
96https://usenix.org/system/files/sec19-reardon.pdf137
97https://usenix.org/system/files/atc24-yuan.pdf137
98https://usenix.org/system/files/sec22fall_senol.pdf135
99https://usenix.org/system/files/sec24fall-prepub-2205-fang.pdf134
100https://usenix.org/system/files/sec24fall-prepub-1500-yu-zhiyuan.pdf134
Article Categories: 
SRE
Security
Operating Systems
Network
Sysadmin
Culture
Last updated July 31, 2024
Authors: 

Rik Farrow has been a consultant for 44 years. He has written two books, as well as worked as the technical editor for a Unix magazine and for two editions of a popular operating system book. He also taught Unix system administration and Internet security during the 90s internationally, and worked as a volunteer for USENIX program and steering committees. Rik has been the editor of ;login: since 2005.

[email protected]
  • Log in to post comments
USENIX logo
  • Contact USENIX
  • Privacy Policy

© USENIX 2025
EIN 13-3055038

Website designed and built by Giant Rabbit LLC
Powered by Backdrop CMS

We need contributions from individuals like you.

USENIX conferences directly influence the development of computing systems and products used worldwide. Contribute today to support this vital work for the next 50 years.

Secure the Future of USENIX

Donate
Close