Low-Level Efficiency & Performance Benchmarks

Followers

May 10, 20251 yr

These leaderboards assess how well models perform with regard to latency, memory, throughput, and power consumption. Often used by ML engineers optimizing for deployment on edge devices, these tools are more technical and infrastructure-focused. They may also benchmark quantized models, model distillation, or fine-tuning effectiveness.

Tools:

Optimum LLM Performance Leaderboard – Measures throughput and latency of LLMs across hardware types and quantization schemes (e.g., INT8, FP16).
Sotabench – Tracks reproducible model benchmarks submitted by users, focusing on vision and NLP models across classic datasets like ImageNet and SQuAD.

Create an account or sign in to comment

Share on Facebook
Share on X
{lang="reddit_text"
Share via email
Share on Pinterest

Followers

Go to topic listing

Low-Level Efficiency & Performance Benchmarks

Featured Replies

Tools:

Create an account or sign in to comment

Who's Online (See full list)

Lead AI Transformation without coding

Most Solved

Forum Statistics

Member Statistics

Account

Navigation

Search

Configure browser push notifications

Chrome (Android)

Chrome (Desktop)

Safari (iOS 16.4+)

Safari (macOS)

Edge (Android)

Edge (Desktop)

Firefox (Android)

Firefox (Desktop)