Running a 30B LLM Locally: Nemotron-3-nano on a Laptop

In this short report I summarize my first impressions of using the new Nemotron-3-nano:30b on my laptop.

System specifications

  • ASUS TUF laptop
  • Intel Core i9
  • 32 GB RAM
  • NVIDIA GeForce RTX 4060 (8 GB VRAM)
  • OS: Ubuntu 24.04.4 LTS (kernel: 6.17.0-19-generic)

Installation

ollama pull nemotron-3-nano:30b
telzur@TUF:~/science/nemotron-3-nano$ ollama pull nemotron-3-nano:30b
pulling manifest 
pulling a70437c41b3b: 100% ▕████████████████████████████████████████████████████████████████████████████████████████████████▏  24 GB                         
pulling bca58c750377: 100% ▕████████████████████████████████████████████████████████████████████████████████████████████████▏  10 KB                         
pulling 12e88b2a8727: 100% ▕████████████████████████████████████████████████████████████████████████████████████████████████▏   28 B                         
pulling 12bee8c08a36: 100% ▕████████████████████████████████████████████████████████████████████████████████████████████████▏  488 B                         
verifying sha256 digest 
writing manifest 
success 

Running

ollama run nemotron-3-nano:30b

A test query: “what are the health benefits of NAC?”

telzur@TUF:~/science/nemotron-3-nano$ ollama run nemotron-3-nano:30b

>>> what are the health benefits of NAC?  // an example

Thinking...

The user asks: "what are the health benefits of NAC?" They want information about N-acetylcysteine (NAC) health benefits. This is medical info, but generally permissible as it's not providing medical advice, just informational. We can comply, providing overview. Must avoid disallowed content. It's safe. Provide benefits: antioxidant, mucolytic, supports mental health, substance use disorders, respiratory conditions, cardiovascular, etc. Also mention side effects, typical dosage, need for medical guidance. Should not provide specific medical advice. Provide citations? Just general. So answer accordingly.

...done thinking.

… (a very long reply, truncated here) …

**If you’re considering NAC for a specific health goal, the safest path is to speak with a qualified healthcare professional who can tailor the dose, duration, and monitoring plan to your individual circumstances.**

>>> Send a message (/? for help)

Ollama runtime status

ollama ps
telzur@TUF:~/science/nemotron-3-nano$ ollama ps
NAME                   ID              SIZE     PROCESSOR          CONTEXT    UNTIL
nemotron-3-nano:30b    b725f1117407    26 GB    77%/23% CPU/GPU    4096       53 seconds from now

Computer utilization

While the model is processing a query, the GPU load can be monitored with “nvtop”:

“nvtop” while processing a query

The GPU load can also be seen using “nvidia-smi”:

“nvidia-smi” while processing a query

Note that ollama is listed as a compute (C) process rather than a graphics (G) process, and that it uses approximately 6 GB of VRAM. It also uses the CPU, as can be seen in “top”:

“top” while processing a query
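
For readers who prefer numbers over screenshots, the VRAM usage mentioned above can also be read from a script. The following is a minimal sketch, assuming nvidia-smi is on the PATH and reports plain integers for every field; the helper names (parse_gpu_line, read_gpu_stats, main) are my own and not part of any tool.

```python
import subprocess

# nvidia-smi's CSV query mode: memory in MiB, utilization in percent.
QUERY = [
    "nvidia-smi",
    "--query-gpu=memory.used,memory.total,utilization.gpu",
    "--format=csv,noheader,nounits",
]


def parse_gpu_line(line: str) -> dict:
    """Parse one CSV line of the form 'used, total, util' into a dict.

    Assumes all three fields are integers; a field reported as '[N/A]'
    would raise ValueError here.
    """
    used, total, util = (int(field.strip()) for field in line.split(","))
    return {"used_mib": used, "total_mib": total, "util_pct": util}


def read_gpu_stats() -> list[dict]:
    """Run nvidia-smi once and return one dict per GPU."""
    out = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
    return [parse_gpu_line(line) for line in out.splitlines() if line.strip()]


def main() -> None:
    """Print a one-line summary per GPU (hypothetical helper)."""
    for index, gpu in enumerate(read_gpu_stats()):
        print(
            f"GPU {index}: {gpu['used_mib']} / {gpu['total_mib']} MiB used, "
            f"{gpu['util_pct']}% utilization"
        )
```

Calling main() in a loop while a query is running gives a rough text log of what nvtop shows graphically.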

Working with a web GUI

You can also interact with the model using the Open WebUI interface:

Invoking Open WebUI from the terminal

Then open your browser at http://localhost:8080

The web GUI

Repeating the same query, this time in the web GUI:

Running a query in the browser.

First impressions

Running Nemotron-3-nano:30b on my laptop is impressive: it is a serious, fairly large LLM, and it makes effective use of gaming-laptop hardware (Core i9, 32 GB RAM, and an RTX 4060 GPU with 8 GB VRAM). As shown above, Ollama automatically split the model between CPU and GPU at 77%/23%. With a total footprint of 26 GB, this corresponds to roughly 20 GB of RAM and 6 GB of VRAM.
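
The RAM/VRAM estimate can be reproduced with a few lines of arithmetic. This sketch assumes that the CPU/GPU percentages reported by "ollama ps" apply proportionally to the reported total size, which is my reading of the output rather than something documented here; split_memory is a hypothetical helper name.

```python
def split_memory(total_gb: float, cpu_pct: float, gpu_pct: float) -> tuple[float, float]:
    """Split a total model footprint into (RAM, VRAM) shares in GB.

    Assumption: the percentages are fractions of the total size.
    """
    ram_gb = total_gb * cpu_pct / 100
    vram_gb = total_gb * gpu_pct / 100
    return ram_gb, vram_gb


# Values taken from the "ollama ps" output: 26 GB, 77%/23% CPU/GPU.
ram, vram = split_memory(26, 77, 23)
print(f"RAM ~ {ram:.1f} GB, VRAM ~ {vram:.1f} GB")  # RAM ~ 20.0 GB, VRAM ~ 6.0 GB
```

The 6 GB share is consistent with the VRAM usage seen in nvidia-smi and fits within the 8 GB available on the RTX 4060.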

The downsides are quite long response times and a very noisy computer, as the fan struggles to cool the system.
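
To put a number on "quite long", the response time can be measured through Ollama's local HTTP API. The sketch below assumes Ollama is listening on its default address (http://localhost:11434) and that the non-streaming reply includes the eval_count and eval_duration fields; the function names are my own, and I am not reporting any measured figures here.

```python
import json
import time
import urllib.request

# Assumption: Ollama's default local endpoint.
OLLAMA_URL = "http://localhost:11434/api/generate"


def tokens_per_second(reply: dict) -> float:
    """Generation speed: eval_count tokens over eval_duration nanoseconds."""
    return reply["eval_count"] / reply["eval_duration"] * 1e9


def ask(model: str, prompt: str) -> dict:
    """Send one non-streaming prompt and return the decoded JSON reply."""
    body = json.dumps({"model": model, "prompt": prompt, "stream": False}).encode()
    request = urllib.request.Request(
        OLLAMA_URL, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def main() -> None:
    """Time one query end to end and print the generation speed."""
    start = time.perf_counter()
    reply = ask("nemotron-3-nano:30b", "what are the health benefits of NAC?")
    elapsed = time.perf_counter() - start
    print(f"wall-clock time: {elapsed:.1f} s")
    print(f"generation speed: {tokens_per_second(reply):.1f} tokens/s")
```

Calling main() while the Ollama server is running repeats the test query from above and prints both the total wait and the token rate, which makes it easier to compare settings or machines.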

Conclusion

Running a 30B-class model locally on a laptop is no longer theoretical; it is practical.
However, there is still a clear trade-off between performance, latency, and thermal constraints.