This Python code parses VMware product release data from the KB article "Correlating build numbers and versions of VMware products (1014508)"
and transforms it into a machine-readable format.
It combines HTML parsing with Beautiful Soup and data handling with pandas.
A GitHub Actions workflow runs the process daily and pushes the results to the repo Machine-readable VMware release data.
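The parsing and data-handling steps described above can be sketched roughly as follows. This is a minimal illustration and not the repository's actual code: the HTML fragment, its column names and values, and the helper name `tables_to_frames` are all invented for the example. Only the use of Beautiful Soup for parsing and pandas for JSON output comes from the description.

```python
import json

import pandas as pd
from bs4 import BeautifulSoup

# Invented HTML fragment standing in for the KB article body (assumption).
SAMPLE_HTML = """
<table>
  <tr><th>Version</th><th>Release Date</th><th>Build Number</th></tr>
  <tr><td>Product 1.0</td><td>2020-01-01</td><td>100001</td></tr>
  <tr><td>Product 1.1</td><td>2020-06-01</td><td>100002</td></tr>
</table>
"""


def tables_to_frames(html):
    """Return one DataFrame per HTML table, using the first row as header."""
    soup = BeautifulSoup(html, "html.parser")
    frames = []
    for table in soup.find_all("table"):
        rows = [
            [cell.get_text(strip=True) for cell in tr.find_all(["th", "td"])]
            for tr in table.find_all("tr")
        ]
        if len(rows) < 2:
            continue  # skip tables without a header row plus data
        header, *body = rows
        frames.append(pd.DataFrame(body, columns=header))
    return frames


frames = tables_to_frames(SAMPLE_HTML)
# "records" is one of the orientations accepted by DataFrame.to_json.
records_json = frames[0].to_json(orient="records")
print(json.dumps(json.loads(records_json), indent=2))
```

The same DataFrame can be serialized with other `orient` values (for example `"split"` or `"index"`) to produce differently shaped JSON from the same table.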
The output is currently structured as follows:
- Directory: based on the pandas option that controls JSON data orientation
- Files: KB(a)_(b)_table(c)release_as(d)
  - a: knowledge base article id - the unique ID of the KB article
  - b: product name - the first product from the metadata, in lower case with spaces replaced by underscores
  - c: table id - an index that distinguishes multiple HTML tables in the section (starting at 0)
  - d: JSON data orientation - see "Directory" above
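As a rough illustration of the naming scheme listed above, the sketch below assembles an output path from the four parts. It makes several assumptions: the helper `output_path` is hypothetical, the separators are copied literally from the pattern as printed (the real file names may differ), the directory is assumed to be named after the orientation value, no file extension is added, and "VMware ESXi" is only an example product name.

```python
from pathlib import PurePosixPath


def output_path(kb_id, product, table_id, orient):
    """Build a path following the pattern KB(a)_(b)_table(c)release_as(d)."""
    # b: lower case, spaces replaced by underscores
    product_slug = product.lower().replace(" ", "_")
    # Separators follow the pattern exactly as printed (assumption).
    name = f"KB{kb_id}_{product_slug}_table{table_id}release_as{orient}"
    # Assumption: one directory per JSON orientation.
    return PurePosixPath(orient) / name


path = output_path(1014508, "VMware ESXi", 0, "records")
print(path)
```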
This is not an official VMware repository and is in no way linked in an official capacity to my employment at VMware.