Gemma 4 26B uncensored weights quantized to GGUF for local inference

JMingo published quantized GGUF weights for a 26-billion-parameter Gemma 4 variant stripped of safety tuning, based on llmfan46's uncensored fine-tune.

By Alex Sokoloff · June 7, 2026

JMingo released GGUF-quantized weights for gemma-4-26B-A4B-it-ultra-uncensored-heretic-UD on Hugging Face this week. The release is a quantized build of llmfan46's Gemma 4 26B fine-tune, which removes safety guardrails from Google's instruction-tuned model. It is distributed under the Apache 2.0 license, which permits commercial use subject to the license's attribution and notice requirements.

GGUF quantization makes the 26-billion-parameter model runnable on consumer hardware. A Q4_K_M quant typically fits in about 16 GB of VRAM, putting it within reach of a single 24 GB RTX 4090 or similar card. Q5_K_M builds require roughly 20 GB, and Q8_0 builds push past 24 GB. GGUF is the native file format of llama.cpp and is also supported by Ollama, LM Studio, and other local inference tools.

Uncensored Gemma fine-tunes have proliferated on Hugging Face over the past two years, with dozens of variants targeting different use cases: creative writing, roleplay, technical Q&A, and general instruction-following without safety refusals. The "heretic" naming convention typically signals aggressive ablation of refusal behavior.
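As a rough sanity check on the memory figures quoted earlier, weight storage can be estimated from parameter count and average bits per weight. The sketch below is not taken from the release itself. The bits-per-weight values are approximate, commonly cited figures for llama.cpp quant types and are assumptions here, and the helper name `estimate_weight_gb` is hypothetical. The estimate covers weights only and leaves out KV cache and runtime overhead.

```python
# Approximate average bits per weight for some llama.cpp quant types.
# ASSUMPTION: ballpark values, not measured from this specific release.
APPROX_BITS_PER_WEIGHT = {
    "Q4_K_M": 4.85,
    "Q5_K_M": 5.7,
    "Q8_0": 8.5,
}


def estimate_weight_gb(n_params: float, quant: str) -> float:
    """Estimate weight storage in GB (1e9 bytes).

    Excludes KV cache, activations, and runtime overhead, so real
    VRAM use will be higher than the value returned here.
    """
    bits = APPROX_BITS_PER_WEIGHT[quant]
    return n_params * bits / 8 / 1e9


if __name__ == "__main__":
    n_params = 26e9  # total parameter count taken from the model name
    for quant in APPROX_BITS_PER_WEIGHT:
        print(f"{quant}: ~{estimate_weight_gb(n_params, quant):.1f} GB of weights")
```

Under these assumptions the estimates fall in the same range as the figures in the article. Context length and KV cache settings add to the total, so headroom beyond the weight size is still needed.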
