From 86624bbae79b585c6c06120357d146c4f187d3fb Mon Sep 17 00:00:00 2001 From: Nathan Weinberg Date: Tue, 19 Nov 2024 15:20:12 -0500 Subject: [PATCH] Added document detailing org roadmap to CLI 1.0.0 Signed-off-by: Nathan Weinberg --- .markdownlint-cli2.yaml | 1 + .spellcheck-en-custom.txt | 4 ++ docs/instructlab-cli-1.0.0.md | 70 +++++++++++++++++++++++++++++++++++ 3 files changed, 75 insertions(+) create mode 100644 docs/instructlab-cli-1.0.0.md diff --git a/.markdownlint-cli2.yaml b/.markdownlint-cli2.yaml index 59f2fa6e..bb1ab78b 100644 --- a/.markdownlint-cli2.yaml +++ b/.markdownlint-cli2.yaml @@ -3,6 +3,7 @@ config: line-length: false no-emphasis-as-header: false + no-emphasis-as-heading: false first-line-heading: false code-block-style: false no-duplicate-header: false diff --git a/.spellcheck-en-custom.txt b/.spellcheck-en-custom.txt index 5b57c755..e9b83971 100644 --- a/.spellcheck-en-custom.txt +++ b/.spellcheck-en-custom.txt @@ -41,6 +41,7 @@ dev disambiguating ditaa docstring +downstreams dr Dropdown env @@ -130,6 +131,7 @@ Podman podman pre preprint +prereqs PR's pyenv PyPI @@ -146,6 +148,7 @@ Ren repo repos RHEL +roadmapping ROCm RTX runtime @@ -172,6 +175,7 @@ subcommand subcommands subdirectory Sudalairaj +supportability Taj tatsu TBD diff --git a/docs/instructlab-cli-1.0.0.md b/docs/instructlab-cli-1.0.0.md new file mode 100644 index 00000000..4464adfc --- /dev/null +++ b/docs/instructlab-cli-1.0.0.md @@ -0,0 +1,70 @@ +# The Road to 1.0.0 + +_Or: How I Learned to Stop Worrying and Love to GA_ + +## Context and Goals + +The `instructlab/instructlab` repo started off as `instructlab/cli` - a basic Python Click-based command-line interface designed to prototype an application capable of +running the LAB methodology created by IBM Research. As the project evolved and the organization looked into creating a proper PyPI package for it, the decision was made +to rename the repo to `instructlab/instructlab` to keep the repo name consistent with the PyPI package name. The rest of this documet will being using "InstructLab" to +refer to this repo and Python package. + +Today, InstructLab has gone from a scrappy research project to an upstream community serving as the basis for multiple downstreams, with the goal +to continuing to evolve the community to encourage more participation from additional stakeholders. To wit, it would behoove us to determine what exactly we should be +roadmapping between now and a proper 1.0.0 release, which demonstrates the following to existing and potential community members: + +1. An official goalpost for the community denoting the evolution of InstructLab from a pre-1.0 project lacking the stability and supportability typically seen from 1.0-and-beyond projects. +1. A dedicated set of V1 interfaces that can be counted on for continuous usage of InstructLab 1.0 with future provisions made for backwards compatibility for subsequent Y-Streams and Z-Streams. +1. A commitment from the Oversight Committee and Maintainer teams to continue to maintain InstructLab throughout a 1.y cycle and work towards an eventual 2.0. + +## MVP for an InstructLab 1.0.0 + +At a high-level, these are the items the Maintainer teams believe should serve as prereqs for releasing an InstructLab 1.0.0: + +### A consistent naming scheme + +As noted in the `Context and Goals` section, InstructLab started off as just as a CLI - however, we are planning for this package to serve as a more general "Engine" - +being a place where a future REST API can be defined that is used by both the CLI aspect as well as an official GUI for orchestrating the entire LAB workflow. Despite +this, the repo is often still referred to as "the CLI". We as an organization need a better term to refer to this repo as, and should adopt the relevant documentation +and meetings accordingly. + +### A fully-realized configuration scheme, centered around the usage of system profiles + +The InstructLab configuration scheme has transformed in many ways since the project's inception, from the `config.yaml` file the intially served as the user's config, to +the addition of code-based Pydantic defaults, to train profiles, to system profiles. We need to fully-decouple this config from the Click library, remove the need for a +`config.yaml` file, and have a consistent config scheme that can be easily extended. + +### An official v1 REST API schema + +We need to have a defined v1 REST API schema - while this does not preclude future updates, something mature enough to serve as a v1 API throughout subsequent Y-Streams +for an InstructLab 1.0 is a must for such a milestone. + +### Integration of InstructLab with RAG + +RAG is currently being planned on being integrated into InstructLab - that work should be in a stable state adhering to our v1 API standard. + +### An upgrade path to subsequent Y-Streams and an eventual 2.0 + +Any user wishing to install an InstructLab 1.0 must have an upgrade path to 1.1, 1.2, ..., 1.n. Upon being ready for an InstructLab 2.0, we should also be expecting to +provide a path for users wishing to upgrade from our final 1.y stream to 2.0. + +### Backwards compatibility across the 1.y stream + +Any user going down our upgrade path described above should expect that the release they upgrade to is backwards-compatible with the release they upgrade from. + +### An official hardware support matrix + +We need to have a document matrix of what hardware footprints we support and to what extent. + +## Q&A + +**Q. What about the libraries? Will they 1.0.0 as well?** + +A. It depends - we historically have not aligned the InstructLab and Library releases on a particular version numbering scheme, apart from matching Y-Streams to Y-Streams (e.g., InstructLab 0.20 used SDG 0.4, Training 0.5, and Eval 0.3). At this stage, this document scopes only the prereqs we want for the InstructLab package. + +## Conclusions and Decision Outcome + +This document will be debated and updated as part of the Pull Request review process. Upon reaching a lazy consensus by the Oversight Committee and Maintainer teams, the author of this document (Nathan Weinberg) will merge the document, denoting the following: + +1. The items in the above section `MVP for an InstructLab 1.0.0` will become official prerequisites for the InstructLab CLI Maintainer team to releasing a `1.0.0` of the InstructLab. +2. Any amendments to this list can only be made with a subsequent PR editing this document, subject to the same review process.