META-pipe cloud setup and execution
Permanent lenke
https://hdl.handle.net/10037/14818Dato
2018-01-18Type
Journal articleTidsskriftartikkel
Peer reviewed
Forfatter
Agafonov, Aleksander; Mattila, Kimmo; Tuan, Cuong Duong; Tiede, Lars; Raknes, Inge Alexander; Bongo, Lars Ailo AslaksenSammendrag
META-pipe is a complete service for the analysis of marine metagenomic data. It provides assembly of high-throughput sequence data, functional annotation of predicted genes, and taxonomic profiling. The functional annotation is computationally demanding and is therefore currently run on a high-performance computing cluster in Norway. However, additional compute resources are necessary to open the service to all ELIXIR users. We describe our approach for setting up and executing the functional analysis of META-pipe on additional academic and commercial clouds. Our goal is to provide a powerful analysis service that is easy to use and to maintain. Our design therefore uses a distributed architecture where we combine central servers with multiple distributed backends that execute the computationally intensive jobs. We believe our experiences developing and operating META-pipe provides a useful model for others that plan to provide a portal based data analysis service in ELIXIR and other organizations with geographically distributed compute and storage resources.