Search results
-
SEeSAW- Similarity Exploiting Storage for Accelerating Analytics Workflows
Kalapriya Kannan, Suparna Bhattacharya, Kumar Raj, Muthukumar Murugan, and Doug Voigt, Hewlett Packard Enterprise The key to successful deployment of big data solutions lies in the timely distillation of meaningful information. This is made difficult by t ...arnold - December 11, 2021 - 1:39 am
-
Neutrino: Revisiting Memory Caching for Iterative Data Analytics
a distributed cluster. In this paper, we make the case that existing abtractions such as RDD are coarse-grained ...arnold - December 11, 2021 - 1:39 am
-
Feeding the Pelican: Using Archival Hard Drives for Cold Storage Racks
paper we present data gathered from a test and a production environment. A key design choice for Pelican ...arnold - December 11, 2021 - 1:39 am
-
ZEA, A Data Management Approach for SMR
Adam Manzanares, Western Digital Research; Noah Watkins, University of California, Santa Cruz; Cyril Guyot and Damien LeMoal, Western Digital Research; Carlos Maltzahn, University of California, Santa Cruz; Zvonimr Bandic, Western Digital Research Digital ...arnold - December 11, 2021 - 1:39 am
-
Evaluating Host Aware SMR Drives
paper, we carry out evaluation to understand the performance of HA-SMR drives with the objective of ...arnold - December 11, 2021 - 1:39 am
-
Avoiding the Streetlight Effect: I/O Workload Analysis with SSDs in Mind
Gala Yadgar and Moshe Gabel, Technion—Israel Institute of Technology Storage systems are designed and optimized relying on wisdom derived from analysis studies of file-system and block-level workloads. However, while SSDs are becoming a dominant building ...arnold - December 11, 2021 - 1:39 am
-
NVMeDirect: A User-space I/O Framework for Application-specific Optimization on NVMe SSDs
kernel should provide generality and fairness. In this paper, we propose a user-level I/O framework which ...arnold - December 11, 2021 - 1:39 am
-
Optimizing Flash-based Key-value Cache Systems
Zhaoyan Shen, Hong Kong Polytechnic University; Feng Chen and Yichen Jia, Louisiana State University; Zili Shao, Hong Kong Polytechnic University Flash-based key-value cache systems, such as Facebook’s McDipper [1] and Twitter’s Fatcache [2], provide a ...arnold - December 11, 2021 - 1:39 am
-
ClusterOn: Building Highly Configurable and Reusable Clustered Data Services Using Simple Data Nodes
features right in a distributed setting. In this paper, we argue that for most modern storage applications, ...arnold - December 11, 2021 - 2:10 am
-
Silver: A Scalable, Distributed, Multi-versioning, Always Growing (Ag) File System
maintaining scalability, strong consistency and performance remains a challenge. In this paper we introduce ...arnold - December 11, 2021 - 2:10 am
-
Exo-clones: Better Container Runtime Image Management across the Clouds
Richard P. Spillane, Wenguang Wang, Luke Lu, Maxime Austruy, Christos Karamanolis, and Rawlinson Rivera, VMware Our key innovation is to allow volume snapshots in VDFS (our native hyper-converged distributed file system) to be exported to a stand-alone re ...arnold - December 11, 2021 - 2:10 am
-
Finding Consistency in an Inconsistent World: Towards Deep Semantic Understanding of Scale-out Distributed Databases
Neville Carvalho, Hyojun Kim, Maohua Lu, Prasenjit Sarkar, Rohit Shekhar, Tarun Thakur, Pin Zhou, Datos IO; Remzi H. Arpaci-Dusseau, University of Wisconsin—Madison We present a new problem in data storage: how to build efficient backup and restore tools ...arnold - December 11, 2021 - 2:10 am
-
Why Do We Always Blame The Storage Stack?
spent evaluating mobile application performance from the user’s perspective. In this paper, we try to ...arnold - December 11, 2021 - 2:10 am
-
An Empirical Study of File-System Fragmentation in Mobile Storage Systems
experiencing sluggish response. In this paper, by conducting an empirical study of filesystem fragmentation on ...arnold - December 11, 2021 - 2:10 am
-
Pixelsior: Photo Management as a Platform Service for Mobile Apps
uneven and siloed user experience. In this paper, we motivate the need for a dedicated platform service ...arnold - December 11, 2021 - 2:10 am
-
99 Deduplication Problems
Philip Shilane, Ravi Chitloor, and Uday Kiran Jonnala, EMC Corporation Deduplication is a widely studied capacity optimization technique that replaces redundant regions of data with references. Not only is deduplication an ongoing area of academic researc ...arnold - December 11, 2021 - 2:10 am
-
A Simulation Result of Replicating Data with Another Layout for Reducing Media Exchange of Cold Storage
Satoshi Iwata and Kensuke Shiozawa, Fujitsu Laboratories Ltd. Cold storage devices such as tape and optical discs are a good solution for reducing the total cost of owner- ship for storing data. However, there is a drawback in that media and drives are se ...arnold - December 11, 2021 - 2:10 am
-
Deduplicating Compressed Contents in Cloud Storage Environment
Zhichao Yan and Hong Jiang, The University of Texas at Arlington; Yujuan Tan, Chongqing University; Hao Luo, University of Nebraska—Lincoln Data compression and deduplication are two common approaches to increasing storage efficiency in the cloud environ ...arnold - December 11, 2021 - 2:53 am
-
Non-volatile Memory through Customized Key-value Stores
Leonardo Mármol, Jorge Guerra, and Marcos K. Aguilera, VMware Non-volatile memory, or NVM, is coming. Several technologies are maturing (FeRAM, ReRAM, PCM, DWM, FJG RAM), and soon we expect products from Intel, Micron, HP, SanDisk, and/or Samsung. Some of ...arnold - December 11, 2021 - 2:53 am
-
Write Amplification Reduction in Flash-Based SSDs Through Extent-Based Temperature Identification
Mansour Shafaei and Peter Desnoyers, Northeastern University; Jim Fitzpatrick, SanDisk Corporation We apply an extent-based clustering technique to the problem of identifying “hot” or frequently-written data in an SSD, allowing such data to be segregated ...arnold - December 11, 2021 - 2:53 am
-
Improving I/O Resource Sharing of Linux Cgroup for NVMe SSDs on Multi-core Systems
Sungyong Ahn and Kwanghyun La, Samsung Electronics Co.; Jihong Kim, Seoul National University In container-based virtualization where multiple isolat-ed containers share I/O resources on top of a single operating system, efficient and proportional I/O re ...arnold - December 11, 2021 - 2:53 am
-
Unblinding the OS to Optimize User-Perceived Flash SSD Latency
Seoul National University In this paper, we present a flash solid-state drive (SSD) optimization that ...arnold - December 11, 2021 - 2:53 am
-
Fine-grained Provenance for Linear Algebra Operators
this paper, we study provenance information for matrix data and linear algebra operations. Our core ...arnold - December 11, 2021 - 2:53 am
-
Quantifying Causal Effects on Query Answering in Databases
Babak Salimi, University of Washington; Leopoldo Bertossi, Carleton University; Dan Suciu, University of Washington; Guy Van den Broeck, University of California, Los Angeles The notion of actual causation, as formalized by Halpern and Pearl, has been rec ...arnold - December 11, 2021 - 2:53 am
-
Refining SQL Queries based on Why-Not Polynomials
Nicole Bidoit, Université Paris Sud; Melanie Herschel, University of Stuttgart; and Katerina Tzompanaki, Télécom ParisTech Explaining why some data are not part of a query result has recently gained significant interest. One use of why-not explanations is ...arnold - December 11, 2021 - 2:53 am