Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

Base Model

Trendyol/Trendyol-LLM-8B-T1

Gorev

Ham lojistik mesajindan asagidaki JSON semasini uretmek:

json

{
"siparis_sayisi": 1,
"siparisler": [
{
"nereden": "Mersin",
"nereye": "Batman",
"yuk_tipi": "YUKUSTU",
"arac_tipi": "1360",
"kasa_tipi": "ACIK"
}
]
}

Egitim

  • Yontem: QLoRA / LoRA
  • LoRA r: 32
  • LoRA alpha: 64
  • Epoch: 3
  • Learning rate: 1e-4
  • Max sequence length: 2048

Test Sonucu

Test seti: 150 mesaj, 918 gold siparis objesi.

MetrikSonuc
JSON parse success100.00%
Schema valid rate100.00%
Siparis sayisi consistency92.00%
Order precision95.85%
Order recall93.03%
Order F194.42%
Strict field micro F188.14%
Strict field macro F188.14%
Relaxed field micro F189.33%
Relaxed field macro F189.33%
Strict exact order match70.37%
Relaxed exact order match74.84%

Kullanim

python

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
base_model_id = "Trendyol/Trendyol-LLM-8B-T1"
adapter_id = "AlBarann/lojistik-trendyol-llm-8b-lora-adapter"
tokenizer = AutoTokenizer.from_pretrained(adapter_id)
base_model = AutoModelForCausalLM.from_pretrained(base_model_id, device_map="auto")
model = PeftModel.from_pretrained(base_model, adapter_id)
model.eval()

Bu repo merge edilmis tam model degildir; yalnizca LoRA/PEFT adapter dosyalarini icerir.

Model provider

AlBarann

Model tree

Base

Trendyol/Trendyol-LLM-8B-T1

Adapter

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today