Private inference
Exploring model execution that keeps sensitive context close to the user.
AI research lab
PumaAI explores how useful AI can run closer to the user: private local inference, multi-agent harness flows, and a side of fun gaming experiments.
We prototype local model runtimes, agent orchestration loops, and game-like testbeds that make AI systems easier to inspect, steer, and enjoy.
A small AI lab testing what private, local-first systems can become.
Exploring model execution that keeps sensitive context close to the user.
Testing practical ways to run useful AI on personal hardware.
Building loops that make agent behavior observable, repeatable, and easier to debug.
Prototyping handoffs, tool calls, evaluations, and human-in-the-loop controls.
Using playful prototypes as a fast way to stress-test agents and interfaces.
Turning prototypes into concrete lessons about privacy, control, and agency.
We believe personal AI should be private, steerable, and genuinely enjoyable.
More intelligence should run locally, with cloud services used deliberately.
Multi-agent systems need harnesses that expose decisions, failures, and tradeoffs.
Games can make complex AI behavior legible, repeatable, and fun to improve.
Privacy, agency, curiosity, and play.
Angel and pre-seed support from leaders in crypto and AI.
Stefan Thomas (Interledger, BitcoinJS), Chris Larsen (Ripple), Anatoly Yakovenko (Solana), Illia Polosukhin (NEAR, Attention paper), Sridhar Ramaswamy (Snowflake, Neeva), Don Ho (Quantstamp, Orange DAO), Kartik Talwar (ETHGlobal, A.Capital), Jason Warner (poolside.ai, GitHub), Oleksandr Maksymets (Meta Superintelligence Labs), Evil Rabbit (Vercel), and many others.
We are researching private local inference, multi-agent harnesses, and game-like AI experiments. If that sounds like your kind of lab, we want to meet you.
We are focused on making local AI practical, inspectable, and fun.