Suntory Azure Managed Service Standard Document
| Document ID | AZ-VM-OPS-001 |
| Version | 1.0 |
| Status | RELEASED |
| Created | 2026-05-18 |
| Revised | 2026-05-18 |
| Service Manager | Suntory Holdings LimitedοΌSHDοΌ |
| Operations Team | TCS (Responsible for procedure execution and proactive updates to this document) |
| Author | Tomoki Koyama |
This document defines the day-to-day operational procedures for Azure Virtual Machines, including procedures for incident and failure alert response.
TCS shall perform all operations in accordance with this document and proactively keep it up to date whenever changes occur.
| Ver. | Revision Date | Author | Description | Approver |
|---|---|---|---|---|
| 1.0 | 2026-05-18 | Tomoki Koyama (SHD) | Initial release | β |
| Document ID | Document Name | Type | Notes |
|---|---|---|---|
| AZ-VM-OVERVIEW-001 | Azure Virtual Machine Service Overview | Service Overview | β |
| AZ-VM-DESIGN-001 | Azure Virtual Machine Design Document | Design Document | Design rationale and standard value details |
| AZ-VM-PARAM-001 | Azure Virtual Machine Parameter Sheet | Parameter Sheet | Entry and approval form for build-time configuration |
| AZ-VM-BUILD-001 | Azure Virtual Machine Build Procedure | Build Procedure | Portal / Terraform build procedures |
| AZ-VM-OPS-001 | Azure Virtual Machine Operation Manual (this document) | Operation Manual | β |
R = ResponsibleγA = AccountableγC = ConsultedγI = Informed
| Task / Activity | SHD | TCS | BU Representative |
|---|---|---|---|
| Operation Manual management and final approval | A | R | I |
| ServiceNow request submission (Spec Upgrade / Disk Addition / Deletion / Restore) | I | I | R |
| ServiceNow CR / Incident closure | I | R | I |
| VM Restart approval (Normal Operations) | I | C | A |
| VM Restart approval (Emergency: Incident / Failure) | A | R | I |
| Pre-Work Backup execution | I | R | I |
| VM Restart execution | I | R | I |
| VM Spec Upgrade execution | I | R | I |
| Disk Addition execution | I | R | I |
| Disk Expansion execution (TCS decision based on monitoring alert) | I | A | I |
| VM Deletion execution | I | R | I |
| VM Restore execution | I | R | I |
| Work Completion Report | I | R | I |
| β | No. | Verification Item | Details / Reference |
|---|---|---|---|
| β | 1 | Verify target VM and Resource Group name | Confirm information stated in the ServiceNow ticket or alert |
| β | 2 | Obtain pre-work backup | Refer to "Procedure 1: Pre-Work Backup" in this document. Excluded for emergency restarts. |
| β | 3 | Log in to Azure Portal and confirm permissions | Verify that the Contributor role or higher is assigned for the target Subscription |
| β | 4 | Confirm maintenance window | As a rule, perform work within the agreed maintenance window to minimize business impact |
| β | 5 | Confirm ServiceNow ticket | Ensure the ticket includes work details, requester, and approval status (for request-triggered operations) |
| β | 6 | Confirm where to record completion | Record the work result in the ServiceNow ticket and ensure it is properly closed |
| Trigger Type | Source | Basic Flow |
|---|---|---|
| ServiceNow CR | BU Representative submits a request via ServiceNow | Review CR β Pre-work checks β Obtain backup β Execute work β Verify β Close CR |
| Monitoring Alert | Azure Monitor / NewRelic alert | Review and assess alert β Emergency decision β Notify BU (if possible) β Execute work β Record incident |
| Incident (Emergency) | Failure detected / Escalation | Assess situation β Immediate response (post-hoc BU notification) β Root cause investigation β Record incident |
| Trigger | TCS executes this as a rule before performing any CR (except emergency restarts) |
|---|---|
| Operator | TCS |
| Estimated Duration | Approx. 10β30 minutes (until the backup job completes) |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Open the Recovery Services Vault | Azure Portal β "Recovery Services vaults" β Select the target Vault |
| β | 2 | Confirm the backup target VM | "Backup items" β "Azure Virtual Machine" β Confirm the target VM is displayed |
| β | 3 | Execute "Backup Now" | Select the target VM β "Backup Now" β Set retention period (30 days or more recommended for work backups) β "OK" |
| β | 4 | Confirm backup job completion | In "Backup Jobs", wait until the job status shows "Completed". Proceed to the next step only after completion. |
| β | 5 | Record the backup point | Record the date and time of the completed backup point in the ServiceNow ticket |
| Trigger | When an incident or failure alert occurs, or when TCS determines that a restart is necessary for service recovery |
|---|---|
| Operator | TCS |
| Important Rules |
[Normal Operations] Before restarting, TCS must always contact the BU Representative for the target system, obtain approval for execution and the scheduled date/time, and then proceed. [Emergency] In the event of a complete VM outage where service continuation is impossible, an immediate restart without prior BU notification is permitted. However, TCS must provide a post-hoc report to the BU promptly after the restart. |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Prepare restart reason, target VM, and proposed date/time | Confirm and record the basis for the restart need (alert details, error logs, etc.) |
| β | 2 | Send a confirmation message to the BU Representative | Contact via Teams or Email using the confirmation template below |
| β | 3 | Obtain approval from BU | Receive the approval message and record the approval details (scheduled date/time) in the ServiceNow ticket. If approval cannot be obtained, escalate to the next level. |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Obtain pre-work backup | Execute "Procedure 1: Pre-Work Backup" |
| β | 2 | Open the target VM in the Portal | Azure Portal β "Virtual machines" β Select the target VM |
| β | 3 | Execute "Restart" | Click "Restart" in the top menu β Click "Yes" in the confirmation dialog |
| β | 4 | Confirm VM returns to Running state | Wait until the VM "Status" shows "Running" (typically 3β5 minutes) |
| β | 5 | Verify application and service operation | Confirm that the application is functioning normally via the BU Representative or system monitoring |
| β | 6 | Record results in ServiceNow ticket and close | Record the restart date/time, result, and verification details |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Confirm and record the failure situation | Record alert details, logs, and error messages. Register the incident in ServiceNow. |
| β | 2 | Execute "Restart" on the VM | Perform an immediate restart via Azure Portal or CLI (backup not required) |
| β | 3 | Confirm VM returns to Running state | Wait until Status shows "Running" |
| β | 4 | Confirm service recovery | Confirm the service is operating normally via system monitoring |
| β | 5 | Send post-hoc report to BU Representative | Promptly report the reason for restart, execution date/time, result, and service recovery status |
| β | 6 | Root cause investigation and incident recording | Investigate the root cause that necessitated the restart and record it in ServiceNow |
| Trigger | ServiceNow CR Performed upon receipt of a ServiceNow request from a BU Representative |
|---|---|
| Operator | TCS |
| Important Notes | VM resizing requires the VM to be stopped (deallocated). The VM will be down during the operation (service interruption). Always agree on a downtime window with the BU before proceeding. |
| Verification Item | Details |
|---|---|
| Availability Zone compatibility | Confirm that the new VM size is available in the current Availability Zone (Zone 1 / 2 / 3). Check "Zone availability" on the VM size selection screen in the Portal. |
| Trusted Launch (vTPM / Secure Boot) support | Confirm that the new VM size supports Trusted Launch (check via Basics tab β Security type) |
| Accelerated Networking support | Confirm that the new size supports SR-IOV. Re-verify that Accelerated Networking remains enabled in the NIC settings. |
| Ultra Disk compatibility | If Ultra Disk is in use, confirm that the new VM size supports Ultra Disk. |
| Data disk cache settings | Confirm that data disk cache settings are maintained after the family change. |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Review the ServiceNow CR | Confirm that the new VM size, scheduled date/time, requester, and BU approval status are all documented |
| β | 2 | Review Family Change notes | Check the "Notes on VM Family Change" section above |
| β | 3 | Obtain pre-work backup | Execute "Procedure 1: Pre-Work Backup" |
| β | 4 | Notify BU Representative of work start | Notify the planned start time and expected downtime duration |
| β | 5 | Stop (deallocate) the VM | Azure Portal β "Stop" β "Yes" β Wait until Status shows "Stopped (deallocated)" |
| β | 6 | Resize the VM | Portal β VM β "Settings" β "Size" β Select the new size β Click "Resize" |
| β | 7 | Start the VM | Click "Start" β Wait until Status shows "Running" |
| β | 8 | Verify the size change | Confirm the new size is displayed in VM "Overview" β "Size" |
| β | 9 | Re-verify Accelerated Networking | Confirm NIC "Overview" β "Accelerated networking" shows "Enabled" |
| β | 10 | Verify application operation | Coordinate with the BU Representative to confirm the service is operating normally |
| β | 11 | Complete and close the ServiceNow CR | Record the before/after sizes, execution date/time, and verification results, then close |
| Disk Addition Trigger | ServiceNow CR Performed upon receipt of a ServiceNow request from a BU Representative |
|---|---|
| Disk Expansion Trigger |
ServiceNow CR Request from BU, or Monitoring Alert When a disk capacity exhaustion alert is triggered and TCS determines an emergency expansion is needed (TCS may proceed at their own discretion) |
| Operator | TCS |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Review the ServiceNow CR | Confirm the disk size, storage type, and intended use |
| β | 2 | Obtain pre-work backup | Execute "Procedure 1: Pre-Work Backup" |
| β | 3 | Open the VM's Disks settings in the Portal | Azure Portal β Target VM β "Settings" β "Disks" β Click "+ Add data disk" |
| β | 4 | Create and configure the new disk |
Configure the following in "Create disk": Name: <hostname>_data<N> (e.g., JZJP1WAPSP001_data01)Size: Size (GiB) as per the request Storage type: Standard SSD LRS (general) / Premium SSD LRS (DB data) Source type: None (empty disk) Key management: Platform-managed key Shared disk: No Delete with VM: ON |
| β | 5 | Click "Save" to attach the disk | On the Disks screen, click "Save" β Confirm the disk appears under Data disks |
| β | 6 | Initialize and format the disk at the OS level | Follow the OS-specific procedures below (requires connection to the VM) |
| β | 7 | Verify operation and close the ServiceNow CR | Report disk addition completion to the BU Representative and close the CR |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Review the alert details and identify the target disk | Determine the target VM name, disk name, current utilization, and the size after expansion |
| β | 2 | Obtain pre-work backup | Execute "Procedure 1: Pre-Work Backup" (if possible) |
| β | 3 | Expand the Azure Managed Disk size | Portal β Target Disk (Managed Disk) β "Size + performance" β Enter the new size β "Save" Both OS disks and data disks support online expansion while the VM is running (no VM stop required) |
| β | 4 | Expand the partition and file system at the OS level | Follow the OS-specific procedures below (perform immediately after expansion) |
| β | 5 | Verify disk capacity after expansion | Confirm that the disk capacity has increased at the OS level (Windows: File Explorer / Linux: df -h) |
| β | 6 | Report completion to BU Representative and update ServiceNow | Record the before/after sizes, execution date/time, and verification results |
| Trigger | ServiceNow CR Performed upon receipt of a ServiceNow request from a BU Representative |
|---|---|
| Operator | TCS |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Review the ServiceNow CR | Confirm the target VM name, RG name, deletion reason, requester, and BU approval status |
| β | 2 | Double-check the target VM name and RG name | Open the VM in the Portal and confirm that the VM name, RG, and tags match the details in the CR |
| β | 3 | Obtain a final backup | Execute "Procedure 1: Pre-Work Backup" to preserve the final state before deletion |
| β | 4 | Notify the BU Representative of the deletion | Notify the deletion date/time and target VM via Teams / Email |
| β | 5 | Delete the VM | Portal β Target VM β "Delete" β Confirm that the target resources (VM, NIC, OS disk) are checked β Enter VM name β "Delete" |
| β | 6 | Confirm related resources are deleted | Verify in the Portal that the NIC, OS disk, and data disks (if Delete with VM is ON) have been deleted |
| β | 7 | Remove the VM from CyberArk | Delete the target VM entry from CyberArk |
| β | 8 | Update ServiceNow CMDB | Retire/delete the corresponding record in the ServiceNow CMDB |
| β | 9 | Complete and close the ServiceNow CR | Record the deletion completion date/time and verification results, then close |
| Trigger | ServiceNow CR Performed upon receipt of a ServiceNow request from a BU Representative |
|---|---|
| Operator | TCS |
| Restore Types |
β File / Folder Restore: Restore specific files only to the original VM (can be performed while the VM is running) β‘ Full VM Restore: Restore the entire VM by creating a new VM with a different (or the same) name |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Review the ServiceNow CR | Confirm the file path to restore, backup point (date/time), and restore destination |
| β | 2 | Open the Recovery Services Vault | Azure Portal β "Recovery Services vaults" β Select the target Vault |
| β | 3 | Select "File Recovery" | "Backup items" β "Azure Virtual Machine" β Target VM β "File Recovery" |
| β | 4 | Select the restore point | Select the backup point (date/time) specified in the CR |
| β | 5 | Download and execute the recovery script | Get the script via "Download Executable" β Run it on the target VM β The backup disk will be mounted |
| β | 6 | Copy the target files to the restore destination | Copy the target files from the mounted backup disk to the original path |
| β | 7 | Unmount the disk | After recovery is complete, click "Unmount disks" in the Portal to unmount the backup disk (auto-released after 12 hours) |
| β | 8 | Verify restore results and close the ServiceNow CR | Have the BU Representative confirm file restore completion, then close the CR |
| β | Step | Action | Details / Verification Points |
|---|---|---|---|
| β | 1 | Review the ServiceNow CR | Confirm the target VM, backup point (date/time), destination RG, and new VM name |
| β | 2 | Open the Recovery Services Vault | Portal β "Recovery Services vaults" β Target Vault β "Backup items" β "Azure Virtual Machine" |
| β | 3 | Select "Restore VM" for the target VM | Target VM β Click "Restore VM" |
| β | 4 | Select the restore point | Select the backup point (date/time) specified in the CR |
| β | 5 | Configure the restore settings |
Configure the following under "Create new": Restore type: "Create new virtual machine" Virtual machine name: New VM name (follow naming conventions) Resource group: Destination RG Virtual network / Subnet: Select from existing VNets / subnets Staging location: Specify a temporary storage account |
| β | 6 | Click "Restore" to start the restore | In "Backup Jobs", wait until the job shows "Completed" (typically 30β60 minutes) |
| β | 7 | Assign the Common NSG to the restored VM | Re-assign the Common NSG (si2-securitygroup-shd-cs-tokyo-cmn-01) to the NIC of the restored VM |
| β | 8 | Reconfigure the backup policy for the VM | Enable backup for the restored VM in the Recovery Services Vault |
| β | 9 | Reconfigure tags | Set Suntory standard tags (Subsidiary / ServiceName / Environment, etc.) on the restored VM |
| β | 10 | Re-register in CyberArk | Register the restored VM in CyberArk and configure/change the administrator password |
| β | 11 | Update the ServiceNow CMDB | Update the corresponding CMDB record with the new VM name and IP address |
| β | 12 | Verify operation and close the ServiceNow CR | Have the BU Representative confirm service recovery, then close the CR |
| Situation | Escalation Target | Response Method |
|---|---|---|
| An event occurs that is not covered by the documented procedures | TCS Azure Operation Team β SHD | Create a ServiceNow incident, record the situation, and escalate |
| Service does not recover after restart or restore | TCS Azure Operation Team β Azure Support | Follow the Azure support inquiry procedure (refer to separate document) |
| Cannot obtain restart approval from the BU Representative | TCS Azure Operation Team β SHD | Request SHD to coordinate with the BU |
| An error occurs during Azure Portal operations | TCS Azure Operation Team β Azure Support | Capture the error message and operation logs via screenshots and report |
| Disk capacity shortage continues after expansion | SHD β BU | Escalate to SHD from an architecture review perspective |
| Role | Person / Team | Contact Method |
|---|---|---|
| TCS Operations Team Lead | β | Teams channel / Phone |
| SHD Service Manager | Tomoki Koyama (SHD / Digital & AI Global ITG) | Teams / Email |
| Azure Technical Support | β | Azure Portal Support Request (refer to separate document) |