ZenCreator

Pro-grade AI content creation. Image, video, face-swap, lipsync, and upscaling behind one API.

14 tools

Up to 4K

4.4(288)

Visit

Loading…

ReleasesNSFW

Gemma-4-E4B uncensored GGUF quantizations land on HuggingFace

Two GGUF quantizations of felldude's gemma-4-e4b-uncensored model appeared on HuggingFace this week, packaged by mradermacher for local inference.

ByAlex Sokoloff·June 25, 2026

Gemma-4-E4B uncensored GGUF quantizations land on HuggingFace

Two GGUF quantizations of felldude's gemma-4-e4b-uncensored model appeared on HuggingFace on June 23, 2026, packaged by mradermacher for local inference. Both carry Apache 2.0 licenses and target English-language use.

GGUF is the format llama.cpp and its derivatives use to run large language models on consumer hardware. Quantization trades precision for smaller file sizes and faster inference, letting users run models that would otherwise require server-grade GPUs. The technique compresses floating-point weights into lower-bit representations—8-bit, 4-bit, or even 2-bit—without retraining the model. The i1 variant uses importance-matrix quantization, preserving accuracy on high-impact weights while aggressively compressing less-critical parameters. The result is a checkpoint that fits in consumer RAM and runs at speeds practical for local chat, code completion, and text generation.

The mradermacher namespace on HuggingFace has become a de facto clearinghouse for community quantizations, hosting hundreds of GGUF conversions and typically uploading new variants within hours of the original checkpoint's release. This mirrors the broader open-weight ecosystem, where a single base model often spawns dozens of derivative formats, fine-tunes, and merge experiments. The uncensored label signals that safety tuning has been removed or never applied—a common practice in the local-inference community where users want full control over model behavior.

ZenCreator

Gemma-4-E4B uncensored GGUF quantizations land on HuggingFace

More in Releases

Five uncensored Qwen3.6-35B fine-tunes surface on HuggingFace in 24 hours

NormGuard preserves image quality in flow-model RL fine-tuning by capping velocity inflation

PP-OCRv6 scales from 1.5M to 34.5M parameters across 50 languages

OpenAI previews GPT-5.6-sol reasoning model for Pro and Enterprise users

OpenAI previews GPT-5.6 Sol with stronger coding and cybersecurity