Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: other

Training Details

  • Base model: Qwen/Qwen2-0.5B-Instruct
  • Dataset: rjac/e-commerce-customer-support-qa
  • Training column: conversation
  • Method: LoRA / PEFT
  • Platform: Hugging Face AutoTrain
  • Examples: 1,000
  • Epochs: 1
  • Max sequence length: 1024
  • Runtime: about 4 minutes on Nvidia T4

Intended Use

  • customer-support response drafting
  • e-commerce support workflow automation
  • SME AI adoption prototype
  • lightweight domain adaptation experiment

Example Use Case

A small e-commerce business could use a lightweight adapted model to draft first-pass support replies, help triage customer issues, and support customer-service workflows without relying only on closed frontier models.

Example Prompt

text

Customer: I ordered a phone case last week and the tracking page still says pending. Can you help?
Agent:

Model provider

shubhbali

Model tree

Base

Qwen/Qwen2-0.5B-Instruct

Adapter

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today