JoaoZaokk

Qwen3-4B-Thinking-2507-Heretic-CodeFeedback-Agentic-Tessa-8K-2048-Experimental-LoRA

Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: other

Status

This is an experimental adapter kept for transparency, comparison, and future analysis.

In a small local Python coding benchmark, this adapter regressed compared with the previous CodeFeedback checkpoint.

Table
ModelAdapterPassedPass rateAvg tokens/s
Beforeheretic_F_lora_python5000_codefeedback50009/1090.00%9.54
AfterNOITE_3090_TESSA_8000_20487/1070.00%9.28

Delta:

Table
MetricValue
Passes-2
Pass rate-20.00%
Avg tokens/s-0.26

Observed behavior

The adapter did not fail completely, but it became worse at strict executable-code output.

Observed regressions:

  • flatten failed with a type error.
  • valid_parentheses failed to output executable code.
  • lru_cache remained incomplete.
  • The model showed more explanatory / agentic behavior instead of always returning compact executable code.

This suggests that the larger Agentic Tessa continuation pushed the model toward a more verbose agentic-assistant style, which may be useful for some workflows but is worse for strict code-output benchmarks.

Training configuration

Table
ItemValue
Base modelJoaoZaokk/Qwen3-4B-Thinking-2507-Heretic-CodeFeedback
Input adapterheretic_F_lora_tessa_agentic_1000_test
Datasetsmirki/Agentic-Coding-Tessa
Samples used8,000
Sequence length2048
Epochs1
Learning rate7e-7
Training methodQLoRA / LoRA
Quantized loading during training4-bit NF4
Trainable parameters~33M
Trainable percentage~0.81%

Training result

Table
MetricValue
Train runtime9421 seconds
Runtime2h 37m 00s
Samples/second0.849
Steps/second0.106
Final train loss1.178
First logged loss1.509
Last logged loss1.072

Benchmark files

Benchmark artifacts are included under:

text

benchmark/

Files:

text

benchmark/before_summary.md
benchmark/after_summary.md
benchmark/COMPARISON.md
benchmark/comparison.json
benchmark/before_results.jsonl
benchmark/after_results.jsonl

Intended use

This adapter is intended for:

  • comparison against the previous CodeFeedback checkpoint
  • studying regression from larger agentic fine-tuning
  • analyzing output-style drift
  • future experiments with smaller learning rates or filtered datasets

It is not recommended for strict agentic coding workflows that require compact executable code output.

For the stronger current baseline, prefer:

JoaoZaokk/Qwen3-4B-Thinking-2507-Heretic-CodeFeedback

Loading example

python

from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import PeftModel
import torch
base_model = "JoaoZaokk/Qwen3-4B-Thinking-2507-Heretic-CodeFeedback"
adapter = "JoaoZaokk/Qwen3-4B-Thinking-2507-Heretic-CodeFeedback-Agentic-Tessa-8K-2048-Experimental-LoRA"
tokenizer = AutoTokenizer.from_pretrained(base_model, trust_remote_code=True)
bnb_config = BitsAndBytesConfig(
load_in_4bit=True,
bnb_4bit_quant_type="nf4",
bnb_4bit_compute_dtype=torch.float16,
bnb_4bit_use_double_quant=True,
)
model = AutoModelForCausalLM.from_pretrained(
base_model,
quantization_config=bnb_config,
device_map="auto",
trust_remote_code=True,
)
model = PeftModel.from_pretrained(model, adapter)
model.eval()

Important notes

This is an experimental LoRA adapter.

It should not be treated as a universal improvement over the previous CodeFeedback model.

The benchmark used here is small and should not be treated as a formal coding leaderboard. It is mainly useful for local before/after regression testing.

Model provider

JoaoZaokk

Model tree

Base

JoaoZaokk/Qwen3-4B-Thinking-2507-Heretic-CodeFeedback

Adapter

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today