ub.xmlui.mirage2.page-structure.muninLogoub.xmlui.mirage2.page-structure.openResearchArchiveLogo
    • EnglishEnglish
    • norsknorsk
  • Velg spraaknorsk 
    • EnglishEnglish
    • norsknorsk
  • Administrasjon/UB
Vis innførsel 
  •   Hjem
  • Fakultet for naturvitenskap og teknologi
  • Institutt for informatikk
  • Artikler, rapporter og annet (informatikk)
  • Vis innførsel
  •   Hjem
  • Fakultet for naturvitenskap og teknologi
  • Institutt for informatikk
  • Artikler, rapporter og annet (informatikk)
  • Vis innførsel
JavaScript is disabled for your browser. Some features of this site may not work without it.

Scalability of Distributed Version Control Systems

Permanent lenke
https://hdl.handle.net/10037/19015
Thumbnail
Åpne
article.pdf (181.1Kb)
Publisert versjon (PDF)
Dato
2017-11-26
Type
Journal article
Tidsskriftartikkel

Forfatter
Murphy, Mike; Bjørndalen, John Markus; Anshus, Otto
Sammendrag

Distributed version control systems are popular for storing source code, but they are notoriously ill suited for storing large binary files.

We report on the results from a set of experiments designed to characterize the behavior of some widely used distributed version control systems with respect to scaling. The experiments measured commit times and repository sizes when storing single files of increasing size, and when storing increasing numbers of single-kilobyte files.

The goal is to build a distributed storage system with characteristics similar to version control but for much larger data sets. An early prototype of such a system, Distributed Media Versioning (DMV), is briefly described and compared with Git, Mercurial, and the Git-based backup tool Bup.

We find that processing large files without splitting them into smaller parts will limit maximum file size to what can fit in RAM. Storing millions of small files will result in inefficient use of disk space. And storing files with hash-based file and directory names will result in high-latency write operations, due to having to switch between directories rather than performing a sequential write.

The next-phase strategy for DMV will be to break files into chunks by content for de-duplication, then re-aggregating the chunks into append-only log files for low-latency write operations and efficient use of disk space.

Beskrivelse
Source at https://ojs.bibsys.no/index.php/NIK/article/view/434.
Forlag
Norsk informatikkonferanse
Sitering
Murphy, M. J., Bjørndalen, J. M. & Anshus, O. (2017). Scalability of Distributed Version Control Systems. NIK: Norsk Informatikkonferanse.
Metadata
Vis full innførsel
Samlinger
  • Artikler, rapporter og annet (informatikk) [477]
Copyright 2017 The Authors

Bla

Bla i hele MuninEnheter og samlingerForfatterlisteTittelDatoBla i denne samlingenForfatterlisteTittelDato
Logg inn

Statistikk

Antall visninger
UiT

Munin bygger på DSpace

UiT Norges Arktiske Universitet
Universitetsbiblioteket
uit.no/ub - munin@ub.uit.no

Tilgjengelighetserklæring