Discovery and Provisioning

Supportive docs and alternative spec

Definitions

| Name | Details |
| --- | --- |
| Server platform | The main computer unit (a.k.a. server), x86-based. Interconnects and controls the PCIe peripherals. |
| Server BMC | Independent controller embedded in the server platform. Controls the server's HW/FW configuration, initialization, health, lifecycle, and more. |
| Host OS | The general-purpose operating system running on the server platform. |
| Host BIOS/UEFI | Low-level software that runs after power-on, before the operating system boots. |
| DPU/IPU | A PCIe performance NIC with a programmable CPU complex. |
| DPU/IPU firmware | Low-level control drivers provided by the DPU/IPU vendor. Runs underneath the DPU/IPU OS (e.g. NIC firmware, microcode, UEFI). |
| DPU/IPU OS | The operating system running on the DPU/IPU's programmable CPUs. |
| DPU/IPU BMC | Independent controller embedded in the DPU/IPU. Controls the DPU/IPU's HW/FW configuration, initialization, health, lifecycle, and more. |

DPU HW Components

Figures: DPU HW Components (Full), DPU HW Components (no BMC), DPU HW Components (Standalone)

Inventory Query or Broadcast

See Inventory.

Adopted solution for RFC 8572 Secure Zero Touch Provisioning (SZTP)

See ZTP for diagrams and detailed explanation.

See Demo for running example.

Additional provisioning methods out of OPI scope

Some provisioning can be triggered by external actors either manually or automatically.

We call a provisioning process "Automatic Provisioning" when it is automated by the environment and/or control plane and does not require a human operator's direct involvement (e.g. plugging in a USB drive, or connecting a laptop via an Ethernet or console cable).

Examples include MAAS and Ironic.

These methods are currently out of OPI scope, but here are a few examples:

RSHIM custom Provisioning

NVIDIA has a manual provisioning process based on a virtual *-over-PCIe device set, called RSHIM. Among other things, RSHIM creates a virtual point-to-point Ethernet device and a virtual console device between the host and the DPU/IPU. See also usage of RSHIM. Many customers use this process to deploy their own OS image and initialize system configuration, since they trust the OS running on the x86 host.
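As a rough illustration of the host-side image push described above, the sketch below streams an image file into the RSHIM boot device. The default device path `/dev/rshim0/boot` and the function name `push_boot_image` are assumptions made for this example, not something defined in this document; check the RSHIM driver documentation for the path on a given system.

```python
import shutil
from pathlib import Path


def push_boot_image(image: Path, boot_device: Path = Path("/dev/rshim0/boot")) -> int:
    """Stream a boot image into the RSHIM boot device and return the image size.

    The default device path is an assumption for illustration only.
    """
    with image.open("rb") as src, boot_device.open("wb") as dst:
        # Copy in chunks so that large images are not loaded into memory at once.
        shutil.copyfileobj(src, dst)
    return image.stat().st_size
```

The destination is a parameter so that the function can be tried against an ordinary file before pointing it at a real device; writing to the device typically requires root privileges on the host.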

Provisioning via xPU BMC

An xPU can receive commands via its BMC (using IPMI, Redfish, etc.) to change the boot order to PXE boot, and then to boot.
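A minimal sketch of how the Redfish variant of this could look, assuming the BMC exposes the xPU as `Systems/1` (the system ID differs between vendors). The function name `build_pxe_boot_requests` and the example host name are hypothetical, and authentication is deliberately left out. The function only builds the two requests so that they can be inspected before anything is sent.

```python
import json
import urllib.request


def build_pxe_boot_requests(bmc_address: str, system_id: str = "1"):
    """Build (but do not send) the Redfish requests for a one-time PXE boot."""
    base = f"https://{bmc_address}/redfish/v1/Systems/{system_id}"
    headers = {"Content-Type": "application/json"}

    # Step 1: set a one-time boot override to PXE.
    boot_override = {
        "Boot": {
            "BootSourceOverrideTarget": "Pxe",
            "BootSourceOverrideEnabled": "Once",
        }
    }
    patch = urllib.request.Request(
        base, data=json.dumps(boot_override).encode(), headers=headers, method="PATCH"
    )

    # Step 2: reset the system so that the override takes effect.
    reset = urllib.request.Request(
        f"{base}/Actions/ComputerSystem.Reset",
        data=json.dumps({"ResetType": "ForceRestart"}).encode(),
        headers=headers,
        method="POST",
    )
    return patch, reset


if __name__ == "__main__":
    # "bmc.example.net" is a placeholder address.
    for req in build_pxe_boot_requests("bmc.example.net"):
        print(req.get_method(), req.full_url, req.data.decode())
```

Sending the requests would additionally need credentials and TLS settings appropriate for the BMC, for example via `urllib.request.urlopen` with a suitable SSL context.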

Provisioning via Platform-Server-Host BMC

The interaction from the previous section can also be applied using a trusted network connection between the platform BMC and the NIC BMC (using NC-SI). In this scenario, the platform BMC is able to interact with the NIC BMC without any involvement of, or placing trust in, an OS running on the host. One could send custom OEM commands over NC-SI to the NIC BMC, or have a Redfish client on the DPU receive commands from a Redfish server on the platform BMC.

USB/Virtual media Provisioning

Note: This is typically a manual method, enabled by a one-to-one interaction with a BMC.

Use case: small-scale, unique, specialized deployments?

Virtual Media Provisioning

  • Provisioning server contacts the xPU's BMC (e.g. via Redfish)
    • Question: how can we get the list of IPs/MACs and credentials? Manually?
    • Question: where do we get the credentials for Redfish over HTTPS?
    • Question: can we also do DHCP discovery of the BMC and initiate the provisioning from the BMC itself?
  • Provisioning server changes the boot order of the xPU
PATCH https://<bmc_ip_address>/redfish/v1/Systems/1
{
   "Boot" : {
      "BootSourceOverrideMode" : "Legacy",
      "BootSourceOverrideTarget" : "Hdd",
      "BootSourceOverrideEnabled" : "Once",
      "UefiTargetBootSourceOverride" : null
   },
   "HostWatchdogTimer" : {
      "FunctionEnabled" : true
   },
   "AssetTag": "asset tag",
   "IndicatorLED": "Lit"
}
POST https://<bmc_ip_address>/redfish/v1/Managers/bmc/VirtualMedia/CD/Actions/VirtualMedia.InsertMedia
{
   "Image": "http://<web_server>/<image_name>.iso"
}
  • Call a script to start the installation from the mounted image
    • Question: how do we know the script name? It is probably different for each image.
  • Provisioning server triggers a reboot afterwards, or the xPU reboots itself
POST https://<bmc_ip_address>/redfish/v1/Systems/1/Actions/ComputerSystem.Reset
{
    "ResetType" : "ForceRestart"
}
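The Redfish calls in this walkthrough can be collected into an ordered plan, sketched below in Python. The endpoint paths and Boot values are taken from the examples above; the names `Step`, `virtual_media_plan`, and `to_request` are hypothetical helpers introduced for illustration. The installation-script step is omitted because it is not a Redfish call, and authentication is left out because the credential questions above are still open.

```python
import json
import urllib.request
from typing import List, NamedTuple


class Step(NamedTuple):
    method: str
    url: str
    body: dict


def virtual_media_plan(bmc_address: str, image_url: str) -> List[Step]:
    """Return the three Redfish calls from the walkthrough, in order."""
    root = f"https://{bmc_address}/redfish/v1"
    return [
        # 1. One-time boot override. Boot values are taken from the PATCH
        #    example above; the other properties are omitted.
        Step("PATCH", f"{root}/Systems/1", {
            "Boot": {
                "BootSourceOverrideMode": "Legacy",
                "BootSourceOverrideTarget": "Hdd",
                "BootSourceOverrideEnabled": "Once",
            }
        }),
        # 2. Attach the installer image as virtual media.
        Step("POST",
             f"{root}/Managers/bmc/VirtualMedia/CD/Actions/VirtualMedia.InsertMedia",
             {"Image": image_url}),
        # 3. Restart the xPU.
        Step("POST", f"{root}/Systems/1/Actions/ComputerSystem.Reset",
             {"ResetType": "ForceRestart"}),
    ]


def to_request(step: Step) -> urllib.request.Request:
    """Convert a Step into a urllib request; authentication is not handled."""
    return urllib.request.Request(
        step.url,
        data=json.dumps(step.body).encode(),
        headers={"Content-Type": "application/json"},
        method=step.method,
    )


if __name__ == "__main__":
    # Both addresses are placeholders.
    plan = virtual_media_plan("bmc.example.net", "http://web.example.net/image.iso")
    for step in plan:
        print(step.method, step.url)
```

Keeping the plan as plain data makes it easy to log or review the sequence before sending anything to a BMC, and to swap in vendor-specific paths where `Systems/1` or `Managers/bmc` do not apply.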