z-logo
Premium
A data‐driven support strategy for a sustainable research software repository
Author(s) -
Belgin Mehmet,
Perini Tyler A.,
Liu Fang Cherry,
Zhang Nuyun,
Sarajlic Semir,
McNeill Andre,
Manno Paul,
Bright Neil C.
Publication year - 2019
Publication title -
concurrency and computation: practice and experience
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.309
H-Index - 67
eISSN - 1532-0634
pISSN - 1532-0626
DOI - 10.1002/cpe.5338
Subject(s) - computer science , software , pace , scheduling (production processes) , rank (graph theory) , software engineering , data science , operating system , engineering , operations management , mathematics , geodesy , combinatorics , geography
Summary We describe a sustainable strategy to support a large number of researchers with widely varying scientific software needs, which is a common problem for most centralized Research Computing Centers on university campuses. Changes in systems and hardware, coupled with aging software, often necessitates re‐compilation of existing software. The naive approach of re‐compiling all of the existing packages is not only counterproductive but may also become unrealistic, especially for small support teams such as Georgia Tech's PACE Team. Instead, we analyze job scheduling data to identify actively used software, then rank, and distribute them in three support tiers, which define the level of support we provide. The distribution of software into multiple tiers is a non‐trivial problem. We use a heuristic ranking algorithm that uses four metrics, namely the number of users, groups, jobs, and their collective runtimes. The results revealed a surprisingly small subset of software that is sufficient to support a very large portion of the overall research computing activity on campus. This approach allows us to make data‐driven strategic technical and policy decisions to provide high‐quality support for the software that really matters and sustain these services with a relatively small team in the long term.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here