Add tests for error codes with local API server #3131
base: main
Conversation
Add tests for error codes with local API server and also install a model to test model info. In future commits will test the other endpoints with a model installed.
Signed-off-by: Adam Treat <treat.adam@gmail.com>
Signed-off-by: Jared Van Bortel <jared@nomic.ai>
```cmake
file(DOWNLOAD
    "${TEST_MODEL_URL}"
    "${TEST_MODEL_PATH}"
    EXPECTED_HASH "MD5=${TEST_MODEL_MD5}"
)
```
This is going to be downloaded by default for anyone building GPT4All with Python installed, at configure time. If we expect this to be shared by the C++ and Python tests, I would prefer if it was downloaded as the first step of the `check` target, before it runs ctest.
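Roughly what that could look like (sketch only; the `check` target is assumed to already exist and run ctest, and the helper script path is a placeholder that would hold the `file(DOWNLOAD)` call above):

```cmake
# Download the test model only when the tests are actually run, not at
# configure time.
add_custom_target(download_test_model
    COMMAND ${CMAKE_COMMAND}
        -DTEST_MODEL_URL=${TEST_MODEL_URL}
        -DTEST_MODEL_PATH=${TEST_MODEL_PATH}
        -DTEST_MODEL_MD5=${TEST_MODEL_MD5}
        -P "${CMAKE_CURRENT_SOURCE_DIR}/cmake/download_test_model.cmake"
)
add_dependencies(check download_test_model)
```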
```diff
 configure_file(python/config.py.in "${CMAKE_CURRENT_SOURCE_DIR}/python/config.py")

 add_test(NAME ChatPythonTests
-    COMMAND ${Python3_EXECUTABLE} -m pytest --color=yes "${CMAKE_CURRENT_SOURCE_DIR}/python"
+    COMMAND ${Python3_EXECUTABLE} -m pytest -s --color=yes "${CMAKE_CURRENT_SOURCE_DIR}/python"
```
I can imagine wanting to add -s via the PYTEST_ADDOPTS env var at the console if you are trying to debug a test that e.g. succeeds when it should fail, but why should the default be to disable capturing of output? This is going to make successful test runs quite verbose.
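(For a one-off debugging run, something like `PYTEST_ADDOPTS=-s ctest` at the console should have the same effect without changing the default, since pytest picks up extra options from that environment variable.)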
```python
@pytest.fixture
def chat_server_with_model(chat_server_with_model_config: dict[str, str]) -> Iterator[None]:
    chat_executable = Path(os.environ['CHAT_EXECUTABLE']).absolute()
    with subprocess.Popen(chat_executable, env=chat_server_with_model_config) as process:
        try:
            yield
        except:
            process.kill()
            raise
        process.send_signal(signal.SIGINT)
        if retcode := process.wait():
            raise CalledProcessError(retcode, process.args)
```
There is no need for the code duplication between chat_server_config/chat_server_with_model_config and chat_server/chat_server_with_model. There only really need to be two fixtures: chat_server and chat_server_with_model. They can each be a single-line function that calls some common function like `def prepare_chat_server(with_model: bool = False)`. You can always `return` a generator, or `yield from` one if it's called inside of a `with` block.
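A minimal sketch of that shape (`make_chat_server_config` is a placeholder for however the existing config dicts are built; the body otherwise reuses the logic from the fixture above):

```python
import os
import signal
import subprocess
from pathlib import Path
from subprocess import CalledProcessError
from typing import Iterator

import pytest


def prepare_chat_server(with_model: bool = False) -> Iterator[None]:
    # Shared implementation behind both fixtures.
    config = make_chat_server_config(with_model=with_model)  # placeholder
    chat_executable = Path(os.environ['CHAT_EXECUTABLE']).absolute()
    with subprocess.Popen(chat_executable, env=config) as process:
        try:
            yield
        except:
            process.kill()
            raise
        # Ask the app to exit cleanly, and fail if it reports an error.
        process.send_signal(signal.SIGINT)
        if retcode := process.wait():
            raise CalledProcessError(retcode, process.args)


@pytest.fixture
def chat_server() -> Iterator[None]:
    yield from prepare_chat_server()


@pytest.fixture
def chat_server_with_model() -> Iterator[None]:
    yield from prepare_chat_server(with_model=True)
```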
```python
with pytest.raises(HTTPError) as excinfo:
    request.get('foobarbaz', wait=True)
assert excinfo.value.response.status_code == 404
```
For checks like this I would prefer to pass an argument like `raise=False` that disables the raise_for_status call, and returns a `resp.status_code, resp.json()` tuple (typed as `tuple[int, Any]`).

Also, it wouldn't hurt to test the response body, even for non-200 responses.
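Something along these lines could work as a sketch (`raise` is a reserved word in Python, so the flag is spelled `check` here; the wrapper name, base URL, and retry handling are assumptions about the existing helper, not its actual code):

```python
from typing import Any

import requests

BASE_URL = 'http://localhost:4891/v1/'  # assumed local API server address


def get(path: str, *, wait: bool = False, check: bool = True) -> tuple[int, Any]:
    # `check=False` skips raise_for_status() so a test can assert on the
    # status code and body of an error response directly. The wait/retry
    # logic of the existing helper is elided here.
    resp = requests.get(BASE_URL + path)
    if check:
        resp.raise_for_status()
    return resp.status_code, resp.json()
```

The 404 check above would then read roughly:

```python
status, body = request.get('foobarbaz', wait=True, check=False)
assert status == 404
```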
```python
assert excinfo.value.response.status_code == 404

# empty model list
response = request.get('models', wait=True)
```
`wait=True` was intended for only the first request in a test, as the server is still starting. Subsequent requests should not need to retry.
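That is, roughly:

```python
# The first request in a test may retry while the server finishes starting...
response = request.get('models', wait=True)
# ...but subsequent requests should hit an already-running server.
response = request.get('models')
```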
```python
assert 'choices' in response
assert response['choices'][0]['text'] == ' jumps over the lazy dog.'
```
We can be more strict here, at least by checking the number of choices and the keys in a choice:
```diff
-assert 'choices' in response
-assert response['choices'][0]['text'] == ' jumps over the lazy dog.'
+assert len(response['choices']) == 1
+assert response['choices'][0].keys() == {'text', 'index', 'logprobs', 'finish_reason'}
+assert response['choices'][0]['text'] == ' jumps over the lazy dog.'
```
```python
pytest.xfail('This causes an assertion failure in the app. See https://github.com/nomic-ai/gpt4all/issues/3133')
with pytest.raises(HTTPError) as excinfo:
    response = request.post('completions', data=data, wait=True)
```
I believe this asserts that the entire test fails, not just this part. Checks that are expected to fail should be split into their own test functions.
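A sketch of the split, using the declarative marker so that only the known-bad check is expected to fail (the test name and payload here are placeholders, not the PR's actual code):

```python
@pytest.mark.xfail(reason='Causes an assertion failure in the app. '
                          'See https://github.com/nomic-ai/gpt4all/issues/3133')
def test_completions_bad_request(chat_server_with_model: None) -> None:
    data = {}  # payload elided; use the request body from the original test
    with pytest.raises(HTTPError):
        request.post('completions', data=data, wait=True)
```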
```python
local_file_path_env = os.getenv('TEST_MODEL_PATH')
if local_file_path_env:
    local_file_path = Path(local_file_path_env)
    if local_file_path.exists():
        shutil.copy(local_file_path, app_data_dir / local_file_path.name)
    else:
        pytest.fail(f'Model file specified in TEST_MODEL_PATH does not exist: {local_file_path}')
else:
    pytest.fail('Environment variable TEST_MODEL_PATH is not set')
```
It's better to ask for forgiveness than for permission (in Python, at least). This could be simply:
```diff
-local_file_path_env = os.getenv('TEST_MODEL_PATH')
-if local_file_path_env:
-    local_file_path = Path(local_file_path_env)
-    if local_file_path.exists():
-        shutil.copy(local_file_path, app_data_dir / local_file_path.name)
-    else:
-        pytest.fail(f'Model file specified in TEST_MODEL_PATH does not exist: {local_file_path}')
-else:
-    pytest.fail('Environment variable TEST_MODEL_PATH is not set')
+local_file_path = Path(os.environ['TEST_MODEL_PATH'])
+shutil.copy(local_file_path, app_data_dir / local_file_path.name)
```
And an informative KeyError or FileNotFoundError will be raised if there is a problem. This also clarifies that this is not something being tested (pass/fail), but an expectation about the environment the test itself runs in.
Add tests for error codes with local API server and also install a model to test model info. In future commits will test the other endpoints with a model installed.