Mithril’s cover photo
Mithril

Mithril

Technology, Information and Internet

Palo Alto, CA 5,251 followers

Foundry is now Mithril — the AI omnicloud

About us

Foundry is now Mithril — the AI omnicloud that brings transparent, multi‑provider compute and simple abstractions to every AI practitioner.

Website
https://mithril.ai
Industry
Technology, Information and Internet
Company size
11-50 employees
Headquarters
Palo Alto, CA
Type
Privately Held
Founded
2022

Locations

Employees at Mithril

Updates

  • Congrats to our partners at Sail Research for officially coming out of stealth 🚀. It's been great working with Neil Movva and team. Excited to see you build the best inference platform for long-horizon agents.

    View profile for Neil Movva

    Samir Menon and I are thrilled to announce Sail Research! We build infrastructure for long-horizon agents: inference served at unbeatable price-per-token for open models, plus sandboxes designed to run for days, weeks, or even longer. We've raised $80M, with our seed led by Sequoia Capital and series A led by Kleiner Perkins. What makes agents so different? Instead of racing to serve a human waiting at a keyboard, agents need scale, reliability, and sustainable cost. Sail finds this efficiency everywhere in the stack: we carefully choose our chips, write custom inference engines, and run a global controller that fully utilizes every computer in our fleet. Tight integration from silicon to API lets Sail open up the cost / latency frontier to our customers - the most patient agents can now access 10x more intelligence per dollar. We're excited to be working with great companies like Parallel Web Systems, Detail, Jack & Jill, and Quadrillion Labs to deploy long-horizon agents with trillions of tokens. Our team is thoughtful in our engineering craft and relentlessly ambitious in our pursuit of peak performance. We previously trained at companies like NVIDIA, OpenAI, Google, and so many trading firms. Now we're ready to do the work that will define our careers, in the most compute intensive market of all time. Welcome to the era of abundant intelligence. We can't wait to build with you!

  • Mithril reposted this

    The data processing revolution has been chiefly concerned with making structured data as query-able as possible. In parallel, tabular AI models have given us accurate predictions over this data (for example, Prior Labs, Kumo, Probabl, Neuralk, The Forecasting Company). Now we have generative AI, which can operationalize superhuman intelligence given sufficient context. Yet, AI agents operating over these model outputs and structured data cannot be reliable if they do not have the ability to generate and verify the reasoning embedded in the AI models. The scale of structured data has restricted enterprises to analytics agents that access information through simple aggregates.  With reasoning, we can move beyond this intelligence layer to build agents that provide reliable recommendations and execute actions. I founded OuterProduct, along with Adityanarayanan Radhakrishnan and Mikhail Belkin, to solve this problem. At OuterProduct, we have built the Unified Reasoning Engine as a culmination of our decades of combined research. In our Science publication (https://lnkd.in/g7Wb36FP), we discovered a key abstraction for how general AI models learn from data beyond classical methods. This led us (along with David Holzmüller) to develop a first-principles tabular model (https://lnkd.in/ghykqVNp, ICLR) that can outperform even tabular foundation models across benchmarks (https://lnkd.in/gzRab6Sy). In another follow-up Science publication (https://lnkd.in/gquRbeav), we showed this technology can be applied to govern and monitor LLMs. I am incredibly proud of the technology we have built and the team that has joined us. My co-founders have always pursued the most ambitious problems, including, for example, Misha Belkin’s discovery of the now well-known double descent phenomenon (https://lnkd.in/gCiKj7i5). See our website and join our waitlist to stay up to date on our product (www.outerproduct.com).

    View profile for Adityanarayanan Radhakrishnan

    Assistant Professor at MIT Math, Founder at OuterProduct, Associate Member at the Broad Institute of MIT and Harvard

    I’m excited to announce OuterProduct, a new company founded by me, Daniel Beaglehole, and Mikhail Belkin with the mission of building the next-generation of enterprise AI that can reason, iterate, and reliably execute actions across the structured data stack.  The most successful applications of AI have been in domains like coding where AI can reason over data. Coding agents can reason over a codebase to figure out where to modify code and check whether their code is correct. My co-founders and I believe AI should bring the same incredible transformation to enterprises as it brought to coding. The fundamental challenge lies in giving AI the means to reason over the structured data backbone powering many critical enterprise functions.  Overcoming this challenge requires more than building intelligence like black-box tabular foundation models and analytics agents. It requires new methods for (1) reasoning over AI models; (2) verifying correctness of this reasoning; and (3) packaging this reasoning into context that agents can act on.  At OuterProduct, we developed a new technology, the Unified Reasoning Engine, that unifies intelligence with interpretability and verification to give AI the ability to reason over structured data (read more at https://lnkd.in/g9QeMC8h).  The Engine is a culmination of novel methods built from our founding team’s decades of combined research experience in AI. Core to the Engine is our foundational research into AI interpretability for models on structured data published in Science (https://lnkd.in/ge9nqjKi), and for governance and control of generative AI, our second publication in Science (https://lnkd.in/ewgQNCWk) with coverage in MIT News (https://lnkd.in/ep--wm69).  I am thrilled about the incredible team and truly unique technology we are building at OuterProduct. Join our waitlist to stay up to date on upcoming news and product releases.

  • Cloud Next last month was a clear statement of where Google Cloud sees TPUs going: split between training and inference. TPU 8t scales to 9,600 chips for training; TPU 8i delivers ~80% better price-performance for inference vs. Ironwood. Both are expected to GA later this year. The chip Google still points to as the inference price-performance sweet spot today, Trillium (v6e), has been hard to actually try. Committing to a long Calendar reservation upfront doesn't fit a benchmarking workflow. We offer TPUs in preview, with the same flexibility (Spot and Reservations) as we offer on GPUs: - bid a max and pay market on spot - reserve a single chip (or 4, or 8) without a long commit - relist unused reservation capacity back for credit Reply or DM if you want to be allowlisted. #TPU #Trillium #GoogleCloud

    • No alternative text description for this image
  • Excited to see our partners at uRun officially come out of stealth 🚀. It’s been great working with Keegan McCallum and team as they build the future of real time video gen. Looking forward to what’s ahead. Big congrats to the whole team!

    Today uRun is coming out of stealth. 👋 In 1984, the Macintosh didn't just make computers friendlier. It required an entirely new compute layer: graphics cards, event loops, and windowing systems. Infrastructure that didn't exist for batch computing. Interactive AI is the same kind of shift. The bottleneck in generative video has moved. It's no longer intelligence per token, it's tokens per second. But today's inference stack still works like a slot machine: send a request, wait, get a result, lose the connection. No session. No state. No way to steer the model while it's still running. So we built it. uRun is the inference cloud for the interactive era. We started with video because it's the ceiling. If we can run a 14B parameter video model at 24fps in real time, the infrastructure has been stress tested for everything else. Audio, images, agents, game engines. Real-time video models are already running on uRun today. We're building this in public with Twitch streams, open-source demo repos, and honest recaps of what works and what doesn't. If you're building for the interactive era, we want to talk. The full story → blog.urun.sh Join the waitlist → urun.sh/waitlist

  • Mithril reposted this

    View organization page for Nebius

    104,524 followers

    Jared Quincy Davis, Founder and CEO of our partner Mithril, is a fascinating person to talk to. Through Mithril’s platform clients across sectors can access Nebius infrastructure through tools they already know and trust. We spoke with Jared about what inspired him at #NVIDIAGTC, how the industry is pushing itself to create new solutions and how the infrastructure layer needs to evolve to support emerging workloads. #GTC26

  • View organization page for Mithril

    5,251 followers

    We’re proud to share that Mithril has been named to Fast Company’s Most Innovative Companies in Artificial Intelligence for 2026. This recognition is a meaningful milestone for our team and our mission to build the AI omnicloud: a single platform for transparent, flexible access to the compute infrastructure powering modern AI. This includes our recent announcement that all reservations on Mithril are now "flexible" allowing you to resell compute that would otherwise sit idle. You can find our product announcement in the comments. Thank you to our customers, partners, and team for helping make this possible. See the full list here: https://lnkd.in/e_cM4yBC #FastCompany #AI #CloudInfrastructure #Innovation

  • Proud moment for the team: Mithril was highlighted by Oracle among its AI innovators in connection with its latest quarterly earnings materials: https://lnkd.in/e7QaNCTN We’re excited to keep building infrastructure that helps teams deploy and scale AI faster. Appreciate the recognition from Oracle and the partnership with OCI! #AI #CloudInfrastructure #OCI #GenAI #MLInfrastructure

  • Mithril reposted this

    Anthropic just launched a "fast mode" for Claude Opus 4.6 — 2.5x faster, same model, same intelligence, but at 6x the price. The developer reaction has been… polarized. Some are calling it usurious. Others are calling it one of the biggest productivity unlocks of the year. I think the visceral negative reaction from some devs is actually pretty interesting. Nobody is worse off for having more options — the standard mode is still there, same price, same quality. So why are people upset? I think ironically it's because they're worried fast mode will succeed. If it works well enough that others adopt it, you get a tragedy of the commons -- like everyone standing on their tippy-toes at a concert. Now you're forced to pay the 6x premium just to stay competitive. Zero-sum thinking. But here's what I think people are missing: even 6x greater cost for 2.5x speed can make some applications genuinely cheaper. If you're running a pipeline, or using Claude as an orchestrator in a compound system alongside customer or open-source models running on your own GPU servers, fast mode can unblock the entire system. The orchestrator is the bottleneck. Speed it up and you reduce idle GPU time across the whole pipeline. The bottleneck matters more than the line item. We ran an experiment at Mithril that illustrates this same principle at the GPU layer. We took some underutilized previous-generation GPUs (A100s) and listed them on our platform with a $0.50/hr floor price in an auction. That was already super profitable. Then we dropped the floor 50x — to $0.01 — and let the market purely determine the price. The results were fascinating (see chart). Earnings went UP by 2.5x. And the price dynamics were compelling: [1] during peak hours, preemptible GPU usage surges to $2-4.00/gpu/hr. [2] At night and on weekends/holidays, it drops to $0.04 or even $0.01. A 400x spread. The feedback from users has been that this is better for both sides: For users with high-priority workloads, it's better to have guaranteed availability at a variable price than variable availability at a guaranteed price (first-come, first-serve). For users with flexible workloads — say you need 4 hours of runtime within the next 6-14 hours — it's better to run at night and save 10-100x. Both cases — Anthropic's fast mode and our auction experiment — point to the same notion: there's enormous room beyond flat pricing for compute and its derivatives (like tokens). Better market design pushes the whole price-performance frontier forward. Price-sensitive customers save money. Performance-sensitive customers get guaranteed availability. High-value inference jobs can run anytime. Long-running training jobs can run flexibly with much better economics. It's a rare case where better mechanism / market design genuinely makes everyone better off. I hope we see more of it!

    • No alternative text description for this image
  • View organization page for Mithril

    5,251 followers

    Researchers at the Broad Institute of MIT and Harvard use Mithril’s GPU Omnicloud to better understand gene expression. “Through Mithril, computations that once took up to a week on traditional infrastructure can now be completed in hours.”   ~ Dr. Adit Radha, MIT Mathematics and Broad Institute Read how the Broad Institute is leveraging Mithril to accelerate biological discovery in our latest case study. https://lnkd.in/ggXgshEw

Similar pages

Browse jobs