Practical and low-overhead masking of failures of TCP-based servers
This article describes an architecture that allows a replicated service to survive crashes without breaking its TCP connections. Our approach does not require modifications to the TCP protocol, to the operating system on the server, or to any of the software running on the clients. Furthermore, it runs on commodity hardware. We compare two implementations of this architecture – one based on primary/backup replication and another based on message logging – focusing on scalability, failover time, and application transparency. We evaluate three types of services: a file server, a web server, and a multimedia streaming server. Our experiments suggest that the approach incurs low overhead on throughput, scales well as the number of clients increases, and allows recovery of the service in near-optimal time.
PublisherUniversitetet i Tromsø
University of Tromsø
SeriesTekniske rapporter / Institutt for informatikk 57(2005)
The following license file are associated with this item: