Dedicated Endpoints

Run this model inference on single tenant GPU with unmatched speed and reliability at scale.

Learn more
Container

Run this model inference with full control and performance in your environment.

Learn more

Get help setting up a custom Dedicated Endpoints.

Talk with our engineer to get a quote for reserved GPU instances with discounts.

README

License: apache-2.0

Training Specs

  • CPU:- Intel(R) Core(TM) i3-14100
  • Memory:- 16.0 GB DDR5
  • GPU:- None (Intel UHD Graphics 730)

Model Overview

  • Model Name: VedaX-0.7B-Base
  • Organization: VedaX Labs
  • Parameters: 0.7 Billion
  • Architecture: Transformer-based causal language model
  • Type: Base/Foundation Model
  • License: Apache-2.0

Capabilities

  • Text generation
  • Lightweight inference
  • Fine-tuning friendly

Intended Use

VedaX-0.7B-Base is intended for:

  • Research and experimentation
  • Educational purposes
  • Building lightweight AI assistants
  • Fine-tuning for downstream tasks
  • Local inference applications
  • Suitable for all devices

Context Length

  • Context Window: 4096 tokens

Training

The model was trained on a mixture of publicly available text datasets and curated instruction-style data.

Example Usage

python

from transformers import AutoTokenizer, AutoModelForCausalLM
model_id = "VedaX-Labs/VedaX-0.7B-Base"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
prompt = "The magic forest was"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(
**inputs,
max_new_tokens=128
)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))

Benchmarking(Higher number means higher score)

Screenshot 2026-05-30 130627

Limitations

  • May generate inaccurate information
  • Limited reasoning compared to larger models
  • Responses may contain hallucinations
  • Performance depends on prompting quality
  • May generate repettetive words(Chances is very low)

Future Plans

  • Instruction-tuned versions
  • Reasoning-enhanced variants
  • Multimodal support
  • Larger parameter models

Developed by VedaX Labs.

Model provider

VedaX-Labs

Model tree

Base

this model

Modalities

Input

Text

Output

Text

Pricing

Dedicated Endpoints

View details

Supported Functionality

Model APIs

Dedicated Endpoints

Container

More information

Explore FriendliAI today