High performance computing framework for tera-scale database search of mass spectrometry data Article

Haseeb, M, Saeed, F. (2021). High performance computing framework for tera-scale database search of mass spectrometry data . 1(8), 550-561. 10.1038/s43588-021-00113-z

cited authors

  • Haseeb, M; Saeed, F

fiu authors

abstract

  • Database peptide search algorithms deduce peptides from mass spectrometry data. There has been substantial effort in improving their computational efficiency to achieve larger and more complex systems biology studies. However, modern serial and high-performance computing (HPC) algorithms exhibit suboptimal performance mainly due to their ineffective parallel designs (low resource utilization) and high overhead costs. We present an HPC framework, called HiCOPS, for efficient acceleration of the database peptide search algorithms on distributed-memory supercomputers. HiCOPS provides, on average, more than tenfold improvement in speed and superior parallel performance over several existing HPC database search software. We also formulate a mathematical model for performance analysis and optimization, and report near-optimal results for several key metrics including strong-scale efficiency, hardware utilization, load-balance, inter-process communication and I/O overheads. The core parallel design, techniques and optimizations presented in HiCOPS are search-algorithm-independent and can be extended to efficiently accelerate the existing and future algorithms and software.

publication date

  • August 1, 2021

Digital Object Identifier (DOI)

start page

  • 550

end page

  • 561

volume

  • 1

issue

  • 8