트레이닝된 모델 사용하기

Serverless RL로 모델을 트레이닝하면 자동으로 Inference에 사용할 수 있습니다. 트레이닝된 모델에 요청을 보내려면 다음이 필요합니다.

W&B API 키
Training API의 기본 URL https://api.training.wandb.ai/v1/
모델 엔드포인트

모델 엔드포인트는 다음 스키마를 따릅니다.

wandb-artifact:///<entity>/<project>/<model-name>:<step>

스키마는 다음으로 구성됩니다:

W&B entity(팀)의 이름
모델과 연결된 프로젝트 이름
트레이닝된 모델의 이름
배포하려는 모델의 트레이닝 step(일반적으로 모델이 평가에서 가장 좋은 성능을 낸 step입니다)

예를 들어, W&B 팀 이름이 email-specialists이고, 프로젝트 이름이 mail-search이며, 트레이닝된 모델 이름이 agent-001이고, step 25에 배포하려는 경우 엔드포인트는 다음과 같습니다:

wandb-artifact:///email-specialists/mail-search/agent-001:step25

엔드포인트가 준비되면 이를 일반적인 Inference 워크플로에 통합할 수 있습니다. 다음 예제에서는 cURL 요청 또는 Python OpenAI SDK를 사용해 트레이닝된 모델에 Inference 요청을 보내는 방법을 보여줍니다.

cURL

curl https://api.training.wandb.ai/v1/chat/completions \
    -H "Authorization: Bearer $WANDB_API_KEY" \
    -H "Content-Type: application/json" \
    -d '{
            "model": "wandb-artifact://<entity>/<project>/<model-name>:<step>",
            "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": "Summarize our training run."}
            ],
            "temperature": 0.7,
            "top_p": 0.95
        }'

OpenAI SDK

from openai import OpenAI

WANDB_API_KEY = "your-wandb-api-key"
ENTITY = "my-entity"
PROJECT = "my-project"

client = OpenAI(
    base_url="https://api.training.wandb.ai/v1",
    api_key=WANDB_API_KEY
)

response = client.chat.completions.create(
    model=f"wandb-artifact:///{ENTITY}/{PROJECT}/my-model:step100",
    messages=[
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Summarize our training run."},
    ],
    temperature=0.7,
    top_p=0.95,
)

print(response.choices[0].message.content)

서버리스 RL

서버리스 SFT

API 레퍼런스

cURL

OpenAI SDK

서버리스 RL

서버리스 SFT

API 레퍼런스

Documentation Index

​cURL

​OpenAI SDK

cURL

OpenAI SDK