📅 Tue, 23 Jun, 2026

AI & Tech News

Ranking Local LLMs: Evaluating Cost per Accurate Response with GPU Energy

TL;DR: I priced 8 local Ollama models by € per 1,000 correct answers (metered GPU energy cost ÷ correct answers) on one RTX 3090. gemma4:26b won at 96.9% accuracy for €0.013 per 1,000 correct answers. The most expensive model (qwen3:8b-fp16) cost €0.239 per 1,000 correct answers and scored worse (66.7%). Reasoning tokens and full precision …
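The metric in the TL;DR can be sketched in a few lines. This is a minimal illustration, not the author's benchmark code: the function name `eur_per_1k_correct`, the electricity price, and the sample figures in the usage note are all hypothetical, and it assumes the metered GPU energy is already available in kWh.

```python
def eur_per_1k_correct(energy_kwh: float,
                       price_eur_per_kwh: float,
                       correct: int) -> float:
    """Cost in EUR per 1,000 correct answers.

    energy_kwh        -- GPU energy metered over the whole run (assumed input)
    price_eur_per_kwh -- electricity tariff (assumed; not given in the source)
    correct           -- number of answers graded as correct
    """
    if correct <= 0:
        # A model with no correct answers has no finite cost per correct answer.
        raise ValueError("need at least one correct answer")
    total_cost_eur = energy_kwh * price_eur_per_kwh
    return total_cost_eur / correct * 1000
```

For example, a hypothetical run that draws 0.02 kWh at €0.30/kWh and yields 96 correct answers would be scored with `eur_per_1k_correct(0.02, 0.30, 96)`. Dividing by correct answers rather than total answers is what lets a slower but more accurate model beat a faster one that is wrong more often.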

AiFeed24 Team·⏱ 1 min read·News

Tags: #cloud
