Automate Cloud Scaling for Slurm
Create rules to automatically scale-up and scale-down deployment of nodes on the cloud. Use node configuration to customize a node associated with a Slurm cluster.
- Login to NavOps.
- Click Automations.
- Click Add Automation.
- Provide a name for the automation.
- Enter a description for the automation.
- Select a Slurm cluster.
-
In the IF condition menu :
- select slurm job filter to filter job for dynamic scale-up automations.
- select none to trigger actions without conditions. For example, if you want to set a rule to scale-up instances at 8 AM.
- select node-filter to filter on the state of the node in the inventory. For example, used to scale-down instances once they are idle.
-
Enter a query to filter a set of Slurm jobs.
Tip: Here is an example:
job_state=PENDING
This filter finds Pending jobs in SLURM and passes them to the automation. There is a limit of 25 jobs. You can extend this by adding a page_size to the request. For example,job_state=PENDING&page_size=100
This will find the top 100 Pending jobs in SLURM. If the page_size value exceeds a maximum limit of 200, the filter will not return jobs. - The Trigger type is defined as calender.
- In the When section, select the values from the drop down menus to build the required cron expression displayed in the textbox form field below the menus. You can also enter valid chron expression directly in the textbox form field. For example, * * * * * will run the automation every minute and is the recommended default.
-
In the Then select Jobs data driven
scaling and define the parameters:
-
Click Save.
The new automation is displayed in the automations table.
- Enable the Automation Engine and enable the automation.
Create Scale Down Automations
- Login to NavOps.
- Click Automation.
- Click Add Automation.
- Provide a name for the automation.
- Enter a description for the automation.
- Select a Grid Engine cluster.
- In the IF condition menu select node-filter.
-
In the Query field, click (
) to open the advanced condition editor.
-
Select a template to configure a condition.
The configuration fields are populated based on the template. You can modify them as per your requirements. The generated query is displayed.
- Click Save.
- Trigger type is defined as calender.
- In the When section, select the values from the drop down menus to build the required cron expression displayed in the textbox form field below the menus. You can also enter valid chron expression directly in the textbox form field.
- In the Then menu, select Scale down (Slurm).
- Enter the cool down time in seconds.
-
Click Save.
The new automation is displayed in the automations table.
- Enable the Automation Engine and enable the automation.