Intel® Cluster Checker verifies the configuration and performance of Linux based clusters and checks compliance with the Intel® Cluster Ready architecture specification. If issues are found, Intel® Cluster Checker diagnoses the problems and may provide recommendations on how to repair the cluster.
Intel® Cluster Checker has the following features:
- Dynamic detection and diagnosis of problems with cluster configuration/performance with severity and confidence levels.
- On-demand data collection.
Intel® Cluster Checker is installed as part of the following suites:
- Intel® Parallel Studio XE 2016 Cluster Edition.
- For Intel® Cluster Ready partners, as a stand-alone product.
The following flowchart represents the usage model for working with the Intel® Cluster Checker.
Prerequisites
- Install Intel® Cluster Checker using the bundled installer
- First, we recommend running the tool as a non-root user. Before using Intel® Cluster Checker for the first time, the runtime environment must be setup. Two files are included to setup the runtime environment, clckvars.sh for shells with Bourne* syntax and clckvars.csh for shells with csh syntax. Source the appropriate file from the command line, for example:
source /opt/intel/clck/<product-version>/bin/clckvars.sh
- Create a host file text file (named nodefile) that lists the compute nodes in the cluster using one host name per line. If the head node is also used as a compute node, it should also be included in the file, otherwise it should normally be omitted. Example for four compute nodes:
- node1
- node2
- node3
- node4
For detailed system requirements, see the System Requirements section in the Intel® Cluster Checker Release Notes.
Step 1: Collect Data
Run this from the console (nodefile should be in a shared & writeable location):
clck-collect -a -f nodefile
Step 2: Analyze the data
Run this from the console:
clck-analyze
Resolve any issues reported in step 2 and repeat steps 1 and 2 until you are satisfied with the results.
There will be occasions where modifications of the included xml configuration file are needed. This can happen when more output is desired, test parameters need to be modified, change the log level, etc. More data about this can be found in the User's Guide.
Troubleshooting/FAQ
Files will be installed into /opt/intel/clck/<product-version>.
- For help with the collector, run:
clck-collect --help
- For help with the analyzer, run:
clck-analyze --help
- To view collected data, use the database query tool:
clckdb --help
- To customize the analysis behavior:
cp /opt/intel/clck/<product-version>
/etc/clck.xml ~
Edit the xml file options.
It can be useful to change the value diagnosed_signs to “on”.
To include custom XML values in the analyzer run do the following:clck-analyze -c clck.xml
You can also include a “-f” to include a modified nodefile.
Documentation and Resources
Document | Description |
---|---|
Contains a breakdown of the following components: the knowledge base, the connector and the database schema. The document is located at: | |
Contains a description of the product including the following components and processes: the analyzer, knowledge base, connector, data collection, data providers as well as the database schema. The document is located at: | |
Contains a brief overview of the product, new features, system requirements, installation notes, documentation, known limitations and trouble shooting, technical support and the disclaimer and legal information. The document is located at: |