Name: nemotron-3-nano-omni-30b-a3b-reasoning
Brand: NVIDIA
SKU: nvidia/nemotron-3-nano-omni-30b-a3b-reasoning
Availability: InStock

Question 1

How much does nemotron-3-nano-omni-30b-a3b-reasoning cost?

Accepted Answer

nemotron-3-nano-omni-30b-a3b-reasoning is priced at Free per million input tokens and Free per million output tokens when accessed via Requesty.  Requesty charges exactly what the upstream provider charges — we don't add markup.

Question 2

What is the context window of nemotron-3-nano-omni-30b-a3b-reasoning?

Accepted Answer

nemotron-3-nano-omni-30b-a3b-reasoning has a context window of 131K tokens, with a maximum output of 20K tokens per response. That's roughly 175 words of input you can fit in a single prompt.

Question 3

What can nemotron-3-nano-omni-30b-a3b-reasoning do?

Accepted Answer

nemotron-3-nano-omni-30b-a3b-reasoning supports vision input, tool calling, extended reasoning. You can call it through any OpenAI-compatible client by pointing base_url to Requesty.

Question 4

How do I use nemotron-3-nano-omni-30b-a3b-reasoning with the OpenAI SDK?

Accepted Answer

Install the OpenAI SDK, set base_url to "https://router.requesty.ai/v1", set your API key to your Requesty key, and set the model to "nvidia/nemotron-3-nano-omni-30b-a3b-reasoning". The Quickstart above shows Python, JavaScript and cURL snippets.

nemotron-3-nano-omni-30b-a3b-reasoning

Specifications

Benchmarks

Pricing

Quickstart

Other NVIDIA models

Frequently asked questions

Access nemotron-3-nano-omni-30b-a3b-reasoning through Requesty