
[Prototype] Self-hosted CodeLlama LLM for code autocompletion #576

Open
wants to merge 6 commits into base: main
Conversation

@senwang86 (Collaborator) commented Nov 4, 2023

Summary

This PR provides a solution for using CodeLlama for self-hosted code autocompletion.

  • A standalone LLM must run separately and expose a RESTful API. The most straightforward way is to use llama.cpp, which lets you host the LLM either on a Mac laptop or on a machine with a GPU (see the sketch after this list for how the API can be called).
    • Please follow the llama.cpp instructions to run the LLM locally or in the cloud; feel free to let me know if a detailed tutorial is needed.
  • A Sidebar setting is added to enable/disable the copilot.
  • tRPC does not seem to support multiple providers in React very well (ref), so I put the route in api/src/server; initially we talked about creating a standalone server for the copilot-related services.
  • There are a couple of follow-up patches to add, e.g., a Vite bug fix, monitoring the copilot service connection status (similar to the sync status at the top of the Sidebar), and adding an infilling mode in addition to the autocomplete mode. This PR also aims to collect early feedback on the overall design and architecture.
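For reference, a minimal sketch of how the backend route could call the model over REST, assuming the llama.cpp server example's /completion endpoint (the payload field names below follow that example and may differ across llama.cpp versions; this is not the PR's actual implementation):

// Minimal sketch (assumptions: llama.cpp server's /completion endpoint,
// Node 18+ global fetch). Not the PR's actual code.
async function fetchCompletion(
  copilotIpAddress: string,
  copilotPort: string,
  prompt: string
): Promise<string> {
  const res = await fetch(`http://${copilotIpAddress}:${copilotPort}/completion`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      prompt,         // code before the cursor
      n_predict: 128, // cap the completion length
      temperature: 0.1,
    }),
  });
  const data = await res.json();
  // The llama.cpp server returns the generated text in `content`.
  return data.content ?? "";
}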

Test

  1. First, enable the RESTful API of llama.cpp; assume the IP address is x.x.x.x and the port is 9090.
  2. On the local machine, open a terminal and run:
    • cd codepod/api/
    • pnpm dev --copilotIP x.x.x.x --copilotPort 9090
  • Note that the screenshot below is intended to demonstrate the functionality; the quality of the autocompletion may be low due to the 4-bit quantized llama-7b model.

[Screenshot: copilot autocompletion demo]

@senwang86 senwang86 marked this pull request as ready for review November 4, 2023 04:21
@senwang86 senwang86 requested a review from lihebi November 4, 2023 04:22
@lihebi (Collaborator) commented Nov 5, 2023

  • tRPC does not seem to support multiple providers in React very well (ref)

I also came across the multiple-providers question the other day, and it is well supported: trpc#3049. I have implemented multiple providers in https://github.com/codepod-io/codepod-cloud/pull/11. Related code:

https://github.com/codepod-io/codepod-cloud/blob/113f4f7ca3656d6db2296bb32a64dc8ae3ae3342/ui/src/lib/trpc.ts#L9-L16

@lihebi (Collaborator) commented Nov 5, 2023

With that said, it could actually be better and simpler to leave it in the api/ routers, so that the frontend always has a single API to talk to. We can let api/ forward the request to the actual LLM service internally through tRPC or gRPC.
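For illustration, a minimal sketch of that forwarding idea, assuming a tRPC v10-style router (names such as copilotRouter and complete are hypothetical, not the PR's code):

import { initTRPC } from "@trpc/server";
import { z } from "zod";

const t = initTRPC.create();

// Mirror the --copilotIP / --copilotPort CLI flags from the Test section.
const copilotIpAddress = "x.x.x.x";
const copilotPort = "9090";

export const copilotRouter = t.router({
  complete: t.procedure
    .input(z.object({ prompt: z.string() }))
    .mutation(async ({ input }) => {
      // Forward the request to the llama.cpp REST endpoint internally,
      // so the frontend only ever talks to the api/ server.
      const res = await fetch(
        `http://${copilotIpAddress}:${copilotPort}/completion`,
        {
          method: "POST",
          headers: { "Content-Type": "application/json" },
          body: JSON.stringify({ prompt: input.prompt, n_predict: 128 }),
        }
      );
      const data = await res.json();
      return { completion: data.content ?? "" };
    }),
});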

@lihebi (Collaborator) left a comment

Thanks, Sen! It works well. I left some minor comments in the code.

One issue is that the automatic completion from the InlineCompletionsProvider is not very responsive. Sometimes it fires, and sometimes it doesn't. How about using a shortcut to trigger it manually and disabling the automatic triggering?
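For example, a hedged sketch of such a manual trigger, using Monaco's built-in inline-suggest action (the keybinding and function name are illustrative):

import * as monaco from "monaco-editor";

// Bind Ctrl/Cmd+Space to explicitly run the registered InlineCompletionsProvider,
// instead of relying on automatic triggering while typing.
function registerManualCopilotTrigger(editor: monaco.editor.IStandaloneCodeEditor) {
  editor.addCommand(monaco.KeyMod.CtrlCmd | monaco.KeyCode.Space, () => {
    editor.trigger("keyboard", "editor.action.inlineSuggest.trigger", {});
  });
}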

@@ -59,6 +68,8 @@ export async function startServer({ port, repoDir }) {
   });

   http_server.listen({ port }, () => {
-    console.log(`🚀 Server ready at http://localhost:${port}`);
+    console.log(
+      `🚀 Server ready at http://localhost:${port}, LLM Copilot is hosted at ${copilotIpAddress}:${copilotPort}`

I'd revert this print change, because people may choose to run CodePod without the copilot server, and this info is misleading. Staying silent should be fine.

} else {
  remoteUrl = `${window.location.hostname}:${window.location.port}`;
}

export const trpcProxyClient = createTRPCProxyClient<AppRouter>({

We already have a trpc client in App.tsx. You can access the client in llamaInlineCompletionProvider like this:

// MyMonaco.tsx
function MyMonaco() {
  ...
  const { client } = trpc.useUtils();
  const llamaCompletionProvider = new llamaInlineCompletionProvider(
    id,
    editor,
    client
  );
}


A second thought: since the copilot is already a REST API, and we are not going to further customize it or add authentication in this Desktop app, let's call the REST API directly from the frontend.

tRPC is preferred in the cloud app.

@lihebi (Collaborator) commented Nov 7, 2023

Also, there's an uncaught exception in the console for a canceled API call. I'd like to catch it and display a cancellation message to keep the console clean.

[Screenshot: uncaught exception in the console from a canceled API call]
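One way to do that (a sketch assuming the provider issues the request with fetch and an AbortController tied to Monaco's cancellation token; names here are illustrative, not the PR's code):

import * as monaco from "monaco-editor";

// Sketch: abort the in-flight request when Monaco cancels the completion,
// and swallow the resulting AbortError instead of letting it hit the console.
async function completeWithCancel(
  url: string,
  prompt: string,
  token: monaco.CancellationToken
): Promise<string> {
  const controller = new AbortController();
  token.onCancellationRequested(() => controller.abort());
  try {
    const res = await fetch(url, {
      method: "POST",
      headers: { "Content-Type": "application/json" },
      body: JSON.stringify({ prompt, n_predict: 128 }),
      signal: controller.signal,
    });
    const data = await res.json();
    return data.content ?? "";
  } catch (err) {
    // fetch rejects with an AbortError DOMException when the request is canceled.
    if (err instanceof DOMException && err.name === "AbortError") {
      console.log("Copilot request canceled");
      return "";
    }
    throw err;
  }
}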

@lihebi (Collaborator) commented Nov 7, 2023

monitoring the copilot service connection status

This isn't that critical. We can assume that the service is up.

adding infilling mode in addition to autocomplete mode

This is quite important. We quite often edit code in the middle.
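For reference, a hedged sketch of what the infilling call could look like, assuming a llama.cpp build that exposes the /infill endpoint (the endpoint and its input_prefix/input_suffix fields exist in recent llama.cpp server versions, but should be verified against the build in use):

// Sketch only: send the code before and after the cursor separately so the
// model can fill in the middle, instead of plain left-to-right completion.
async function fetchInfill(
  baseUrl: string, // e.g. http://x.x.x.x:9090
  prefix: string,  // code before the cursor
  suffix: string   // code after the cursor
): Promise<string> {
  const res = await fetch(`${baseUrl}/infill`, {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      input_prefix: prefix,
      input_suffix: suffix,
      n_predict: 64,
    }),
  });
  const data = await res.json();
  return data.content ?? "";
}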

@senwang86 (Collaborator, Author) commented

After the discussion, we decided to leave this PR as a reference for integrating the self-hosted copilot, and to address the comments in the codepod-cloud repo.

@senwang86 senwang86 changed the title [Feature] Self-hosted CodeLlama LLM for code autocompletion [Prototype] Self-hosted CodeLlama LLM for code autocompletion Nov 17, 2023