Warning

There are known security vulnerabilities in Open XDMoD versions ≤11.0.2. We strongly encourage upgrading immediately to the latest version of Open XDMoD, 11.0.3, which contains fixes for these vulnerabilities.

If you cannot upgrade immediately, you can follow the instructions below to manually patch your installation as a temporary workaround before upgrading later.

Workaround instructions
  1. First, make sure you are on the correct web page for your version of Open XDMoD. The web page you are on now is for version 11.0. Other versions' pages are linked below:
  2. Download the patch file for your version:
  3. Copy the patch file to your Open XDMoD web server.
  4. Make sure you have the patch command installed; it can be installed with dnf install patch.
  5. Apply the patch by running the command below, replacing [PATH_TO_XDMOD_SHARE_DIR] with the path to the share directory (/usr/share/xdmod for RPM-based installs, /opt/xdmod/share or another location for source code installs) and replacing [PATH_TO_PATCH_FILE] with the path to the patch file.
    # patch -p1 -d [PATH_TO_XDMOD_SHARE_DIR] < [PATH_TO_PATCH_FILE]
  6. Upgrade to the latest version of Open XDMoD as soon as you can.

About → Overview

Open XDMoD is an open source tool to facilitate the management of high performance computing resources. It is widely deployed at academic, industrial and governmental HPC centers. Open XDMoD’s management capabilities include monitoring standard metrics such as utilization, providing quality of service metrics designed to proactively identify underperforming system hardware and software, and reporting job level performance data for every job running on the HPC system without the need to recompile applications. Open XDMoD is designed to meet the following objectives: (1) provide the user community with a tool to more effectively and efficiently use their allocations and optimize their use of HPC resources, (2) provide operational staff with the ability to monitor, diagnose, and tune system performance as well as measure the performance of all applications running on their system, (3) provide software developers with the ability to easily obtain detailed analysis of application performance to aid in optimizing code performance, (4) provide stakeholders with a diagnostic tool to facilitate HPC planning and analysis, and (5) provide metrics to help measure scientific impact. In addition, analyses of the operational characteristics of the HPC environment can be carried out at different levels of granularity, including job, user, or on a system-wide basis.

The Open XDMoD portal provides a rich set of features accessible through an intuitive graphical interface, which is tailored to the role of the user. Metrics provided include: number of jobs, CPU hours consumed, wait time, and wall time, with minimum, maximum and the average of these metrics, in addition to many others. Metrics are organized by a customizable hierarchy appropriate for your organization.

A version of Open XDMoD, ACCESS XDMoD is used to monitor the NSF-supported portfolio of advanced computing systems and services that are integrated with the NSF-funded ACCESS program.

This material is based upon work supported by the National Science Foundation under Grant Numbers OAC 2137603, ACI 1025159 and ACI 1445806.

For more information, questions, or feedback send email to ccr-xdmod-help at buffalo.edu.

Want to be notified about XDMoD releases and news? Subscribe to our mailing list.

Referencing XDMoD

When referencing XDMoD, please cite the following publication:

Jeffrey T. Palmer, Steven M. Gallo, Thomas R. Furlani, Matthew D. Jones, Robert L. DeLeon, Joseph P. White, Nikolay Simakov, Abani K. Patra, Jeanette Sperhac, Thomas Yearke, Ryan Rathsam, Martins Innus, Cynthia D. Cornelius, James C. Browne, William L. Barth, Richard T. Evans, “Open XDMoD: A Tool for the Comprehensive Management of High-Performance Computing Resources”, Computing in Science & Engineering, Vol 17, Issue 4, 2015, pp. 52-62. 10.1109/MCSE.2015.68

License

Open XDMoD is an open source project released under the GNU Lesser General Public License (“LGPL”) Version 3.0.