To point COLLECTLDIRECTORY to a directory named temp in your Slate-Scratch directory, make sure the temp directory already exists in your Slate-Scratch directory and, in your script, replace username with your IU username.If your job will run for less than a day, UITS recommends setting SAMPLEINTERVAL=10 if your job will run for multiple days, set SAMPLEINTERVAL=30 or SAMPLEINTERVAL=60. Adjust SAMPLEINTERVAL according to the expected runtime for your application.For example, after applying such changes, the my_script.sh shell script would look similar to this:ĬOLLECTLDIRECTORY=/N/scratch/username/temp/Ĭollectl -F1 -i$SAMPLEINTERVAL:$SAMPLEINTERVAL -sZl -procfilt u$UID -f $COLLECTLDIRECTORY & To have collectl record subsystem data as your binary runs, add lines to the shell script for loading the collectl module, including data collection instructions, and stopping data collection after your binary has finished running. ![]() ![]() ![]() The collectl utility runs a lightweight application alongside your binary, capturing its memory and CPU usage as it runs on a compute node.Ī simple way to launch collectl alongside your binary is to prepare a simple shell script, which you can later submit to Slurm using a job script.įor example, consider the following shell script ( my_script.sh) for launching the binary. Likewise, you can use collectl output to help determine the resources you need to request to run your batch job. Output from collectl can help you determine how many serial applications you can run on a single node. You can use collectl to monitor the memory and CPU usage of single-node batch jobs running on Carbonate. ![]() Data can be stored in compressed or uncompressed data files, which themselves can be in either raw format or in a space-delineated format that enables plotting using gnuplot or Microsoft Excel. Any set of subsystems (for example, CPU, disks, memory, or processes) can be included in or excluded from data collection. The collectl utility is a system-monitoring tool that records specific operating system data for one or more sets of subsystems.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |