ローカルのLLMサービスをOpenAI API互換の形に変換するには？

2025-09-10

1.9 K

背景

多くのアプリケーションはOpenAI APIを中心に設計されており、LlamaEdgeはネイティブLLMを標準のOpenAI APIにカプセル化する方法を提供します。

ステップ1：APIサーバーWasmのダウンロード
curl -LO https://github.com/second-state/LlamaEdge/releases/latest/download/llama-api-server.wasm
ステップ2：APIサービスの開始
コマンドを実行する：wasmedge --dir .:. --env API_KEY=your_api_key llama-api-server.wasm --model-name llama-3.2-1B --prompt-template llama-chat --reverse-prompt "[INST]" --ctx-size 32000
ステップ3：統合テスト
curlでテストできる：curl -X POST http://localhost:8080/v1/chat/completions -H 'accept:application/json' -H 'Content-Type: application/json' -d '{"messages":[{"role":"system", "content": "You are a helpful assistant."}, {"role":"user", "content": "What is the capital of France?"}], "model": "DeepSeek-R1-Distill-Llama-8B"}'

このAPIサービスは、1）既存のチャットボット・アプリケーションへのアクセス、2）LangChainの開発、3）RAGシステムのコア・コンポーネント、4）コスト削減のためのクラウド・サービスの代替、といった使い方が可能だ。