Run these locally with Ollama, LM Studio, llama.cpp, or text-generation-webui. Ranked by measured uncensored score — how often each model complies with adult or controversial requests without refusing.
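Several entries below quote VRAM requirements at a given quantization. As a rough rule of thumb (a heuristic sketch, not a figure from this list — `est_vram_gb`, the ~4.5 effective bits per weight for Q4-class quants, and the flat overhead allowance are all assumptions), weight memory scales linearly with parameter count and bits per weight:

```python
def est_vram_gb(params_b: float, bits_per_weight: float = 4.5,
                overhead_gb: float = 2.0) -> float:
    """Rough VRAM estimate: quantized weights plus a flat allowance
    for KV cache and runtime buffers (a heuristic, not a spec)."""
    weight_gb = params_b * bits_per_weight / 8  # billions of params -> GB
    return weight_gb + overhead_gb

# ~Q4 on a 70B dense model lands near 41 GB, consistent with the
# "Q4 on 48GB VRAM" figure quoted for the 70B fine-tunes below.
print(round(est_vram_gb(70), 1))
```

For MoE models, substitute the *total* parameter count, since all experts must be resident even though only the active subset runs per token.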
by cognitivecomputations
MoE 46.7B total / 12.9B active. Uncensored Mixtral fine-tune by Cognitive Computations. Fast inference thanks to MoE sparsity.
by Google
Google's open-weights 27B. Heavily aligned but strong baseline. Good for coding and reasoning, mediocre for RP unless jailbroken.
by cognitivecomputations
Uncensored 70B Llama 3 fine-tune. Best-in-class for uncensored long-form writing; runs at Q4 on 48GB VRAM.
by Meta
Meta's latest 70B. Claude 3 Sonnet-level quality with open weights. Moderately aligned — useful as a refusal baseline for comparisons.
by mistralai
A Mistral/NVIDIA collaboration. 12B with 128K context and good multilingual performance. Moderately aligned — usable with system-prompt jailbreaks.
by cognitivecomputations
Uncensored 8B Llama 3 fine-tune. Great for running on laptops or 6-8GB VRAM — no-filter creative writing and RP.
by Alibaba
Alibaba's 32B with 128K context. Strong multilingual, good coding, moderate alignment — easy to uncensor via system prompt.
by NousResearch
Nous Research's flagship. Strong reasoning and roleplay, lightly censored. Llama 3.1 70B backbone with tool calling.
by mistralai
Mistral AI's 24B dense model. Fits on a single RTX 4090 at Q4 with 32K context. Lightly filtered, with an easy system-prompt bypass.
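Entries described as easy to steer via system prompt are typically run with a custom prompt baked into an Ollama Modelfile. A minimal sketch — the base-model tag and prompt text here are placeholders, not taken from this list:

```
# Modelfile (build with: ollama create my-writer -f Modelfile)
FROM mistral-small:24b
PARAMETER temperature 0.8
SYSTEM """You are an unfiltered creative-writing assistant. Stay in character."""
```

`ollama create` bakes the system prompt and sampling parameters into a reusable local model tag, so every `ollama run my-writer` session starts with that persona.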
by NousResearch
Nous's compact Hermes. Excellent structured output and function calling, with a lightly aligned tone.
by anthracite-org
Anthracite's 22B uncensored creative/RP fine-tune on Mistral-Small base. Prose-focused, excellent for character roleplay.
by TheDrummer
Drummer's uncensored Mistral Nemo fine-tune for RP and creative writing. Strong prose, holds character well. Sweet spot for 12GB VRAM.
by Sao10K
Sao10K's 8B uncensored roleplay specialist. Llama 3 base. Beloved in the SillyTavern community for character consistency.
by Microsoft
Microsoft's 14B reasoning-focused model. Strong math/logic. Very aligned — low uncensored score but useful for benchmarks.
by CohereForAI
Cohere's 35B with strong tool-use + RAG. 128K context. Permissive but not uncensored by default.
by Gryphe
Gryphe's classic uncensored fiction/RP model. Llama 2 base — older but legendary for its creative output. Low VRAM footprint.
by Meta
Meta's compact flagship. 128K context, strong instruction following. Heavily aligned — pair with uncensored fine-tunes for RP.
by Sao10K
Sao10K's Solar-10.7B uncensored creative writing model. Classic RP workhorse, still very popular for 12GB rigs.
by Alibaba
Fast and capable 7B baseline. 32K context, strong coding and tool-use for size. Runs on 8GB VRAM.
by alpindale
A Llama 2 70B + 70B frankenmerge by alpindale. Legendary long-context RP model. Requires ~80GB VRAM at Q4, so it's one for enthusiasts with multi-GPU rigs.
by haotian-liu
Vision-language model: describes images, handles OCR and visual reasoning. Works with Ollama's multimodal API.
by DeepSeek
671B MoE (37B active). Matches frontier closed models on coding and reasoning. Requires ~400GB to host locally, so it's mostly consumed via API.