Run this model inference on single tenant GPU with unmatched speed and reliability at scale.
Get help setting up a custom Dedicated Endpoints.
Talk with our engineer to get a quote for reserved GPU instances with discounts.
README
License: apache-2.0Model Description
This model is a fine-tuned version of Gemma 4 trained for retrieval and reranking tasks. Given a search query and a collection of candidate passages, the model selects and returns the most relevant passage from the provided corpus.
The model was fine-tuned using Unsloth and Hugging Face's TRL library for efficient training.
Input Format
The model was trained on chat-formatted examples:
python
[{"role": "system", "content": "<|GET|>"},{"role": "user","content": {"query": "most dependable affordable cars","corpus": ["Document 1...","Document 2...","If you can look past its bargain interior and anonymous exterior, the Suzuki SX4 is one of the most reliable and affordable all-wheel-drive cars."]}}]
Output Format
The model returns the most relevant passage from the corpus:
text
If you can look past its bargain interior and anonymous exterior, the Suzuki SX4 is one of the most reliable and affordable all-wheel-drive cars.
Training Objective
- Query-document relevance matching
- Passage retrieval and reranking
- Selection of the best matching document from a candidate set
Training Framework
- Unsloth
- Hugging Face Transformers
- Hugging Face TRL
Acknowledgements
This model was fine-tuned using Unsloth for fast and memory-efficient training.
Model provider
sapkotapraful
Model tree
Base
this model
Modalities
Input
Text, Image
Output
Text
Pricing
Dedicated Endpoints
View detailsSupported Functionality
Model APIs
Dedicated Endpoints
Container
More information