Get Started

Up and running in minutes.

Install the app, auto-download the Llama server files, then choose Llama.cpp or AirLLM and download a model. After that, Quietly runs fully offline — no accounts or API keys.

Install Quietly

Download and run the installer for your OS — Windows .exe, macOS .dmg, or Linux AppImage. One file, no extra prerequisites.

Quietly-Setup.exe / .dmg / .AppImage~180 MB

then

Auto-download Llama server

Inside the app, press Auto download to fetch the Llama server files Quietly needs. That pulls the runtime so you are not hunting for binaries by hand.

Auto downloadLlama server

then

Pick a backend & download a model

Choose Llama.cpp or AirLLM, select a model to download, and let it finish. For coding, Llama 3.1 8B or Code Llama in GGUF form are solid defaults.

Llama.cpp | AirLLMModel

then

Quietly is ready

After the model download completes, the app is fully working — local, private, and usable completely offline. Start a session whenever you like.

Offline · no API keysReady

Windows · macOS · Linux · No signup required