Frontier hub

Frontier Models

A research hub for flagship reasoning, multimodal, and general-purpose AI models that product teams compare before standardizing a platform.

Built for: Product, research, platform, and enterprise AI teams

Latest briefing Briefing archive

Ranked entries

Verified repos

Decision pages

Top score

Does the model solve the workflow with less human repair than cheaper baselines?

Can the provider meet data, latency, billing, and deprecation requirements?

Is the model being used for genuinely hard reasoning rather than routine formatting?

Ranking

Shortlist the leading entries.

These entries come from the current SkillRank dataset. Scores help discovery; final decisions should use your own workflow tests.

Chat / Reasoning

GPT-5.5

OpenAI

Score

OpenAI’s current flagship for general reasoning, multimodal understanding, and agent-style tasks.

Text / MultimodalRank #1Editorial profile

Chat / Reasoning

Claude Sonnet 4.5

Anthropic

Score

Balanced frontier model with strong reasoning, long context, and tool use.

Text / MultimodalRank #2Editorial profile

Chat / Reasoning

Gemini 2.5 Pro

Google

Score

Google DeepMind multimodal model tuned for reasoning across text, images, and tools.

Text / MultimodalRank #3Editorial profile

Chat / Reasoning

Claude Opus 4.1

Anthropic

Score

Highest-capability Claude tier for demanding reasoning and structured outputs.

Text / MultimodalRank #4Editorial profile

Chat / Reasoning

Gemini 2.5 Flash

Google

Score

Fast, cost-efficient Gemini variant for high-volume chat and classification.

Text / MultimodalRank #5Editorial profile

Chat / Reasoning

DeepSeek R2

DeepSeek

Score

Latest DeepSeek reasoning line with improved chain-of-thought and tool use.

TextRank #6Editorial profile

Chat / Reasoning

Qwen 4

Alibaba

Score

Current-generation Qwen flagship for multilingual chat, tools, and multimodal use.

Text / MultimodalRank #7Editorial profile

Chat / Reasoning

Grok 4

xAI

Score

Latest xAI assistant with real-time web and X integration where available.

Text / MultimodalRank #8Editorial profile

Daily signals

What today's briefing says about this category.

Signals are generated from recorded snapshots and verified source metadata. They keep the hub connected to the daily crawler.

Codingverified

pi

pi changed this week: score +3, rank up 4, GitHub stars +4,564.

Codingverified

graphify

graphify changed this week: score +2, rank up 3, GitHub stars +4,322.

Codingverified

worldmonitor

worldmonitor changed this week: score +2, rank up 3, GitHub stars +2,055.

Codingverified

caveman

caveman changed this week: score +1, rank up 2, GitHub stars +2,175.

Codingverified

awesome-mcp-servers

awesome-mcp-servers changed this week: score -1, rank down 2, GitHub stars +429.

Codingverified

ponytail

ponytail changed this week: score +1, rank up 2, GitHub stars +3,377.

Verified repositories

Repository-backed projects to inspect.

GitHub metadata is useful for discovery, but production fit still depends on license, docs, security posture, and local maintainability.

OpenClaw

hermes-agent

NousResearch/hermes-agent

219,839 stars

The agent that grows with you

aiai-agentai-agentsanthropicchatgpt

Claude Code

ECC

affaan-m/ECC

232,741 stars

The agent harness performance optimization system. Skills, instincts, memory, security, and research-first development for Claude Code, Codex, Opencode, Cursor and beyond.

ai-agentsanthropicclaudeclaude-codedeveloper-tools

Claude Code

open-design

nexu-io/open-design

81,239 stars

🎨 The open-source Claude Design alternative. 🖥️ Local-first desktop app. 🖼️ Your coding agent becomes the design engine: prototypes, landing pages, dashboards, slides, images & video — real files, HTML/PDF/PPTX/MP4 export. 🤖 Claude Code / Codex / Cursor / Gemini / OpenCode / Qwen & 20+ CLIs via BYOK.

agent-skillsai-agentsai-designbyokclaude-code-for-design

Claude Code

graphify

Graphify-Labs/graphify

94,977 stars

Turn any codebase, with its docs, SQL schemas, configs, and PDFs, into a queryable knowledge graph. A /graphify skill for Claude Code, Cursor, Codex, and Gemini CLI: local deterministic AST parsing, every edge explained, no vector store.

ai-agentsantigravityastclaude-codecode-analysis

OpenClaw

graphify

Graphify-Labs/graphify

94,977 stars

ai-agentsantigravityastclaude-codecode-analysis

Claude Code

claude-mem

thedotmack/claude-mem

88,435 stars

Persistent Context Across Sessions for Every Agent – Captures everything your agent does during sessions, compresses it with AI, and injects relevant context back into future sessions. Works with Claude Code, OpenClaw, Codex, Gemini, Hermes, Copilot, OpenCode + More

aiai-agentsai-memoryanthropicartificial-intelligence

Guides

Read the operating playbooks.

Model strategy9 min

AI Model Selection Framework for Product Teams

A practical framework for choosing chat, reasoning, multimodal, coding, and retrieval models without relying on launch hype.

Retrieval8 min

RAG Evaluation Checklist

A grounded checklist for choosing embedding models, retrieval pipelines, rerankers, and document-agent workflows.

Media generation7 min

AI Image and Video Model Workflow

How creative teams can compare image and video generators using briefs, brand constraints, rights review, and repeatability.

Methodology8 min

SkillRank Data Methodology

How SkillRank separates editorial model profiles, GitHub-verified repository signals, daily picks, and recorded history.

Operations9 min

AI Model Cost and Latency Playbook

How to choose model tiers, route requests, set latency budgets, and avoid paying flagship prices for routine AI work.

Architecture8 min

Model Routing and Fallback Design

A practical guide to routing prompts across fast, cheap, specialist, and frontier models without creating brittle AI infrastructure.

Compare

Turn options into a decision.

Frontier reasoningUpdated 2026-06-04

Closed-provider model profiles are editorial unless an official source or verified public repository is attached.

Weekly movement appears only from recorded SkillRank snapshots, not invented launch commentary.

Use rankings as a shortlist, then run a fixed evaluation set on your own prompts and documents.

Frontier Models

Shortlist the leading entries.

GPT-5.5

Claude Sonnet 4.5

Gemini 2.5 Pro

Claude Opus 4.1

Gemini 2.5 Flash

DeepSeek R2

Qwen 4

Grok 4

What today's briefing says about this category.

pi

graphify

worldmonitor

caveman

awesome-mcp-servers

ponytail

Repository-backed projects to inspect.

Read the operating playbooks.

AI Model Selection Framework for Product Teams

RAG Evaluation Checklist

AI Image and Video Model Workflow

SkillRank Data Methodology

AI Model Cost and Latency Playbook

Model Routing and Fallback Design

Turn options into a decision.

GPT-5.5 vs Claude Opus for Professional Work

OpenAI vs Gemini for Product Teams

Embedding Models for RAG: OpenAI vs Gemini vs Cohere vs BGE