Directory-Aware File System Backup to Object Storage for Fast On-Demand Restore

  • Jun Nemoto Hitachi, Ltd.
  • Atsushi Sutoh Hitachi, Ltd.
  • Masaaki Iwasaki Hitachi, Ltd.
Keywords: File system backup, On-demand restore, Object storage

Abstract

With emerging “pay-as-you-go object storage” services, file system backup to object storage is an attractive option in terms of total cost of ownership. However, if a failure or a disaster occurs, a conventional backup system with object storage requires a long time to recover the necessary data since it must restore the entire directory structure of the file system from object storage before accessing the data. In this paper, “directory-aware backup” (DAB), which achieves on-demand restore as part of file system backup to object storage, is proposed. On-demand restore is a function that restores only a certain directory or file according to the request from an end user. The proposed method backs up relationships between a file and an object on a directory basis in order to efficiently handle the relationships in the restore. It is experimentally shown that the proposed method reduces the response time of file access involving restore by over an order of magnitude.

References

R. H. Patterson, S. Manley, M. Federwisch, D. Hitz, S. Kleiman, and S. Owara, “SnapMirror: File-System-Based Asynchronous Mirroring for Disaster Recovery,” in FAST, 2002, pp. 117–130.

M. Vrable, S. Savage, and G. M. Voelker, “Cumulus: Filesystem Backup to the Cloud,” in FAST, 2009, pp. 225–238.

Amazon S3; http://aws.amazon.com/s3/

S. Quinlan and S. Dorward, “Venti: A New Approach to Archival Storage,” in FAST, 2002, pp. 89–101.

C. Ungureanu, B. Atkin, A. Aranya, S. Gokhale, S. Rago, G. Całkowski, C. Dubnicki, and A. Bohra, “HydraFS: a High-Throughput File System for the HYDRAstor Content-Addressable Storage System,” in FAST, 2010, pp. 225–238.

Y. Zhu and J. Masui, “Backing up your data to the cloud: Want to pay less?,” in ICPP, 2013, pp. 409–418.

H. Tang, F. Liu, G. Shen, Y. Jin, and C. Guo, “UniDrive: Synergize Multiple Consumer Cloud Storage Services,” in Middleware, 2015, pp. 137–148.

P. Anderson, “Fast and Secure Laptop Backups with Encrypted De-duplication,” in LISA, 2009.

S. Han, H. Shen, T. Kim, A. Krishnamurthy, T. Anderson, and D. Wetherall, “MetaSync: File Synchronization Across Multiple Untrusted Storage Services,” in ATC, 2015, pp. 83–95.

rsnapshot; http://www.rsnapshot.org/

rdiffbackup; http://www.nongnu.org/rdiff-backup/

A. Tridgell, “Efficient Algorithms for Sorting and Synchronization,” 1999.

M. Eshel, R. Haskin, and D. Hildebrand, “Panache: A Parallel File System Cache for Global File Access.,” in FAST, 2010, pp. 155–168.

B. Callaghan, B. Pawlowski, and P. Staubach, “RFC 1813: NFS version 3 Protocol Specification,” 1995.

P. J. Leach and D. Naik, “A common Internet file system (CIFS/1.0) protocol,” Internet-Draft, IETF, 1997.

R. Fielding, J. Gettys, J. Mogul, H. Frystyk, L. Masinter, P. Leach and T. Berners-Lee, “RFC 2068: Hypertext Transfer Protocol -- HTTP/1.1,” 1997.

XFS; http://oss.sgi.com/projects/xfs/

FUSE; https://github.com/libfuse/libfuse/

LeoFS; http://leo-project.net/

Published
2017-03-31