Inference-Time Compute Hackathon 2026: Frontier AI Hackathon - Devpost

Descend

Participants (128)

Which track are you competing in?

Applied AI (Mercor)
Agents (Cognition)
Build the Future (Etched)

FDVLA

the full-duplex vision-language-action model

Team Emoji

Emoji-to-emoji diffusion: a low-dimensional proxy for text that reasons in parallel, not word-by-word. A 328M model that tops the frontier at emoji infill—and scales with compute.

Walkie-Talkie -- Make Devin succeed in harder tasks

A multi-agent harness CLI that bumps Devin with Opus 4.8 on SWE-Bench Pro from 50% to 70%.

GRPO My Vending Machine

Vending machines that stay profitable: a bankruptcy-gated vending sim with in-world turn cost, plus a genetic outer loop that maintains diverse operating postures under selection pressure.

BioMatrix

Cell signaling is attention. We simulate tissue as transformer forward passes: a physics engine for biology where you can drop in cancer or a virus, running natively on transformer silicon.

bothub

Docker infrastructure for memory and context in AI agents

SecondGuess

A jailbreak detector that escalates compute only when unsure, beating always-on detection by 2 points of recall with 1.6x less compute on JailbreakBench.

$LLM

We introduce a hedge fund run by a fleet of small LLMs that write their own trading strategies as code and ship them in milliseconds. Tap to buy in and watch it beat typical buy-and-hold in real time.

BetterAhead Clinical Research

Making clinical trial recruitment easier.

CompoundWork

Never let an AI repeat a workflow twice: let a stronger coding agent solve the problem, then distill that knowledge overnight into a 10x cheaper and 10x faster open-source model.

Sea Otter

Orchestration for coding agents. Watch it. Checkpoint it. Retry it, smarter and cheaper.

Katchup

We give every student a real-time AI tutor that detects confusion during class and instantly provides personalized explanations before they fall behind.

FlashGrep

FlashGrep replaces RAG indexes with live LLM semantic scoring. It caches results so users/agents can refine instantly, add fresh data, and search large corpora faster as compute scales.

Qssessment: assessment by questions, not just answers.

Qssessment turns job requirements into interactive AI interviews that test how candidates ask questions and reason through ambiguity.

Lifeline

Verified, hands-free first aid: a frozen open model pushed to 98% with inference-time compute, never an unverified step.

Falcon

Project Falcon makes long-context AI cheaper and more reliable: same hardware, half the memory budget, better answers.

Hyperion

The cheapest roads through space already exist. Gravity built them. We just couldn't afford to find them... until now.

AlphaRoyale

An AlphaGo moment for Clash Royale.

Advertising Payment Via TTC to Improve Surplus

Ads are broken for AI chat. We let advertisers bid for test-time compute, not attention: relevant products buy the agent more reasoning to prove they fit your private context.

AutoReduce

AutoReduce is distributed autoresearch for ML systems, helping agents discover both the best algorithms and the GPU scale where they work best.

xLift

xLift is a pre-training data scout that predicts which cohorts will move a model before we spend compute. Our differentiator is that we measure whether a cohort creates learnable disagreement.

Rate The Robot

Let the consumers label the data.

25 – 46 of 46
