Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Repository Contents

  • adapter_model.safetensors: LoRA adapter weights.
  • adapter_config.json: PEFT adapter configuration.
  • formal_eval_report.md: copied formal evaluation report for the released Round 3 adapter.
  • README.md: this model card.
  • LICENSE: Apache License 2.0 for this adapter repository.

Model Details

ItemValue
Base modelQwen/Qwen3.5-9B
Adapter typeLoRA via PEFT
Task typeCAUSAL_LM
PEFT version0.19.1
LoRA rank16
LoRA alpha32
LoRA dropout0.05
Target modulesq_proj, k_proj, v_proj, o_proj, gate_proj, up_proj, down_proj
Source checkpointstablecoin_phase1_pipeline/training/outputs/round3_conservative_repair/checkpoints/full_train_20260602T101206Z/adapter
LicenseApache License 2.0 for this adapter repository

Intended Use

This adapter is designed for domain-specific answer generation in stablecoin, virtual-asset, and compliance-oriented research settings. Suitable uses include:

  • material-grounded analysis of stablecoin and virtual-asset regulatory documents;
  • comparison of legal, academic, policy, judicial, and industry materials;
  • drafting structured compliance or risk-analysis responses based on supplied source passages;
  • identifying when the supplied materials are insufficient to support a legal, regulatory, or factual conclusion;
  • citation-sensitive workflows where each substantive claim should be traceable to provided source chunks.

The adapter is best used in a retrieval-augmented or otherwise source-controlled environment. It should receive the relevant source text, source identifiers, and explicit instructions about citation format, material boundaries, and refusal behavior.

Out-of-Scope Use

This adapter is not a substitute for qualified legal, financial, compliance, or professional advice. It should not be used as an autonomous decision-maker for high-stakes matters, licensing analysis, enforcement risk assessment, investment decisions, or client-facing legal opinions without human review.

The released evaluation supports the adapter's behavior under the tested prompt, citation, and material-boundary setup only. Performance outside that setup has not been established.

Formal Evaluation Summary

The released adapter was evaluated on a 300-sample formal evaluation set covering stablecoin and virtual-asset analysis tasks. The evaluation focused on citation discipline, material-boundary control, refusal behavior, and domain-relevant reasoning.

MetricValue
Sample count300
Pass count296
Pass rate0.9867
Mean score0.8290
Reasoning leak count0
Missing citation count0
Invalid citation ID count0
Empty answer count0
Material boundary error count0
API key leak count0
Format error count0
Eval set SHA2561b18137c7c1a363c5f5bed2bbdd5f83d11cbe83a590671d211c134056a86861f

Task-level results:

Task typeSamplesPass rateMean score
academic_literature_reasoning540.98150.8418
cross_text_comparison390.97440.7886
industry_report_analysis451.00000.8439
insufficient_information_refusal241.00000.8878
judicial_case_rule_extraction391.00000.8387
lawyer_practice_risk_analysis540.98150.8096
legal_regulation_interpretation450.97780.8171

See formal_eval_report.md for the full copied evaluation report, including run guardrails, failure samples, and error analysis.

Loading Example

python

from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel
base_model_id = "Qwen/Qwen3.5-9B"
adapter_id = "Karitasu/qwen35-stablecoin-round3-lora"
tokenizer = AutoTokenizer.from_pretrained(
base_model_id,
trust_remote_code=True,
)
model = AutoModelForCausalLM.from_pretrained(
base_model_id,
torch_dtype="auto",
device_map="auto",
trust_remote_code=True,
)
model = PeftModel.from_pretrained(model, adapter_id)
model.eval()

Prompting And Operational Guidance

For best results, use the adapter with prompts that make the evidence boundary explicit. A typical production prompt should:

  • provide the exact source passages or retrieved chunks to be used;
  • include stable source identifiers for citation;
  • instruct the model to cite every material factual, legal, or policy claim;
  • require separation between explicit source statements, legal rules, author opinions, and cautious inference;
  • require the model to state when the provided materials are insufficient;
  • prohibit unsupported references to statutes, cases, institutions, or market facts not present in the supplied context.

The formal evaluation suggests that this adapter is strongest when the expected output format and citation discipline are specified clearly.

Known Limitations

  • The adapter was evaluated on a controlled 300-sample set; the results should not be read as a guarantee of performance on all stablecoin, virtual-asset, or legal-compliance tasks.
  • The adapter can still miss parts of an analytical rubric, under-cover domain-specific criteria, or make overly broad legal/compliance inferences when the prompt does not strictly constrain the evidence boundary.
  • It inherits limitations from the base model and from the surrounding retrieval, prompting, decoding, and post-processing pipeline.
  • Outputs may be incomplete, jurisdictionally overbroad, outdated, or unsuitable for professional use unless reviewed by qualified humans.

License

This adapter repository is released under the Apache License 2.0. See LICENSE.

The base model Qwen/Qwen3.5-9B is not redistributed in this repository and remains subject to its own license and terms. Any third-party documents, evaluation materials, or source texts used with this adapter remain subject to their respective rights and restrictions.

Model provider

Karitasu

Model tree

Base

Qwen/Qwen3.5-9B

Adapter

this model

Modalities

Input

Video, Text, Image

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today