Up and running in minutes.
Install the app, auto-download the Llama server files, then choose Llama.cpp or AirLLM and download a model. After that, Quietly runs fully offline — no accounts or API keys.
Install Quietly
Download and run the installer for your OS — Windows .exe, macOS .dmg, or Linux AppImage. One file, no extra prerequisites.
Quietly-Setup.exe / .dmg / .AppImage~180 MBAuto-download Llama server
Inside the app, press Auto download to fetch the Llama server files Quietly needs. That pulls the runtime so you are not hunting for binaries by hand.
Auto downloadLlama serverPick a backend & download a model
Choose Llama.cpp or AirLLM, select a model to download, and let it finish. For coding, Llama 3.1 8B or Code Llama in GGUF form are solid defaults.
Llama.cpp | AirLLMModelQuietly is ready
After the model download completes, the app is fully working — local, private, and usable completely offline. Start a session whenever you like.
Offline · no API keysReadyWindows · macOS · Linux · No signup required