
feat: use llama.cpp server #2128


Merged: 48 commits into dev on Apr 4, 2025
Conversation

@vansangpfiev (Contributor) commented Mar 17, 2025

Describe Your Changes

  • Ready for review; e2e tests pass on VMs, still investigating why they fail on CI

This pull request updates and improves multiple files. The most important changes are updated version strings, refactoring of hardcoded values into constants, and new engine support.

Version Updates:

  • Updated version strings in docs/static/openapi/cortex.json to reflect new versions and naming conventions.

Code Refactoring:

  • Removed the IsSupported method from the EngineI class in docs/docs/engines/engine-extension.mdx and engine/cortex-common/EngineI.h for a cleaner interface.
  • Replaced hardcoded organization and repository names with constants across several files for maintainability.
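The constants refactor in the second bullet can be sketched roughly as follows; the identifier names and values here are hypothetical placeholders, not the PR's actual code:

```cpp
#include <string>
#include <string_view>

// Hypothetical constants; the PR's actual identifiers and values may differ.
constexpr std::string_view kOrgName  = "example-org";
constexpr std::string_view kRepoName = "example-repo";

// Call sites compose URLs from the constants instead of repeating
// "https://github.com/<org>/<repo>/..." string literals everywhere.
inline std::string ReleaseUrl(std::string_view tag) {
  return "https://github.com/" + std::string(kOrgName) + "/" +
         std::string(kRepoName) + "/releases/tag/" + std::string(tag);
}
```

Centralizing the names this way means a rename of the organization or repository touches one definition rather than every call site.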

New Functionality:

  • Added support for local-engine in the CMakeLists.txt files to include the new engine extension.
  • Added a custom command for MSVC builds in engine/CMakeLists.txt to copy necessary files post-build.
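A post-build copy step of the kind described in the second bullet is typically expressed with `add_custom_command`; this is a minimal sketch, and the target and file names are placeholders rather than the PR's actual values:

```cmake
# Hypothetical sketch: copy an engine library next to the built executable
# on MSVC. Target and paths are assumptions, not the PR's actual code.
if(MSVC)
  add_custom_command(
    TARGET cortex-server POST_BUILD
    COMMAND ${CMAKE_COMMAND} -E copy_if_different
            "${CMAKE_BINARY_DIR}/engines/local-engine.dll"
            "$<TARGET_FILE_DIR:cortex-server>"
    COMMENT "Copying engine binaries next to the executable")
endif()
```

`copy_if_different` avoids needless copies on incremental builds, and the `$<TARGET_FILE_DIR:...>` generator expression resolves the per-configuration output directory that MSVC multi-config builds use.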

Logging and Debugging:

  • Added logging statements in engine/controllers/engines.cc for better traceability of engine versions and variants during installation.

Test Updates:

  • Updated test cases in engine/e2e-test/api/engines to use new version strings and variant names.

Fixes Issues

Self Checklist

  • Added relevant comments, esp in complex areas
  • Updated docs (for bug fixes / features)
  • Created issues for follow-up changes or refactoring needed

@vansangpfiev force-pushed the s/feat/spawn-llama-cpp branch from 8350e2d to 0968abe on March 17, 2025 07:50

github-actions bot commented Mar 17, 2025

Preview URL: https://be4cfec3.cortex-docs.pages.dev

* chore: release CIs

* fix: cuda urls

---------

Co-authored-by: sangjanai <sang@jan.ai>
@vansangpfiev merged commit b901e01 into dev on Apr 4, 2025
9 checks passed
@vansangpfiev deleted the s/feat/spawn-llama-cpp branch on April 4, 2025 02:08