Business Wire

Logical Intelligence Achieves 76 Percent on Putnam Benchmark, Highlighting Shift Beyond Large Language Models to Language-free, Mathematically Grounded Models

2.12.2025 15:15:00 CET | Business Wire | Press release

Share

Over the last decade, artificial intelligence (AI) has been largely built around large language models (LLMs). These systems are based on a language and guess words in a chain in the form of tokens. As a result, they frequently hallucinate and require vast compute and power infrastructure to solve tasks. The moment systems like public safety, national infrastructure, and industrial automation need logic, LLMs break and introduce safety risks. Token-free language independent models represent a new direction for AI. They do not predict words. They search for correct solutions and require less compute. Logical Intelligence is the first company building exclusively around mathematically derived, non autoregressive EBM (Energy Based Model) reasoning.

Today, Logical Intelligence announced that its Aleph tool achieved a 76 percent score on the Putnam Benchmark, one of the most demanding mathematical reasoning tests in artificial intelligence. The benchmark measures a model’s ability to solve formal mathematics problems by producing verified proofs rather than relying on text generation. While Aleph is an internal tool built on top of an LLM, its performance places it ahead of all publicly evaluated LLMs and the hybrid EBM systems that still depend on LLM scaffolding. The results are a strong signal that native EBM architectures offer a clear path to trustworthy AI.

“We built Aleph as an internal tool to test the mathematical rigor of the environment we are creating, not to be our core model,” said Eve Bodnia, founder and CEO of Logical Intelligence. “Aleph’s performance proves that our foundations are strong, even though Aleph itself was developed on top of an LLM. The tool represents a fraction of what we expect our core model to accomplish.”

Why Logical Intelligence Uses EBMs Instead of LLMs

Most AI systems reason the same way they write: one word at a time. This produces long, fragile chains of tokens that can fall apart with a single incorrect step. The model receives a “final grade” only at the end of the chain, with no idea where the reasoning failed. This makes LLMs unpredictable and unsuitable for environments that require guaranteed correctness.

Logical Intelligence uses EBMs because they operate on a different principle. An EBM does not think in words. It reasons in continuous mathematical states shaped by the structure of the problem. Instead of producing text token by token, the model updates its entire internal state at once. This allows it to correct course, explore alternatives, and converge on stable, verifiable answers. The system behaves closer to a trained mathematician than a predictive text engine.

EBMs are positioned to become the backbone of the systems where uncertainty is unacceptable. These include true self-driving vehicles, advanced aviation, automated manufacturing, power grids, defense systems, autonomous robotics, chip design, and national infrastructure. Any environment that depends on logic behaving the same way every time will require the type of deterministic reasoning that EBMs can provide.

“If you need certainty, you cannot rely on word prediction,” Bodnia said. “You need a system that works through the structure of a problem. EBMs give us the foundation for that.”

Why Aleph Matters

Aleph was created for one purpose. It is a tool that converts mathematical problems into formal statements and generates proofs that can be checked by a machine. This allows researchers to verify that an answer is mathematically correct. Even as an internal tool built on an LLM, Aleph’s ability to generate large volumes of verifiable proofs is a meaningful advancement. Most AI systems can describe mathematics. Very few can prove anything.

“Aleph gives us a new level of certainty in AI today,” Bodnia said. “It is the first signal of what is possible when you build systems around mathematical truth.”

Logical Intelligence is already working with a small group of organizations to test early applications of Aleph in controlled environments across key vertical industries. These pilots are designed to explore how mathematical verification can support real systems.

Logical Intelligence will release its general purpose model with formal machine verifiable reasoning in 2026. This system will go far beyond Aleph and demonstrate how mathematical reasoning can support complex, high-assurance environments at scale. The company will show how its approach can serve industries where perfect logic is the requirement.

“Aleph is our first milestone,” Bodnia said. “The full system is coming in 2026.”

For more information and to read the Aleph white paper, visit www.logicalintelligence.com/aleph-prover.html.

About Logical Intelligence

Logical Intelligence is an artificial intelligence research company building the first fully language-free, mathematically grounded Energy Based Models. These systems differ from LLMs and hybrid EBM approaches by reasoning directly in structured state space and generating proofs that can be checked for correctness. Logical Intelligence is designing its models to underpin critical infrastructure, advanced automation, and high-reliability computing. Its team includes researchers with advanced degrees in mathematics and computer science, ICPC and IMC medalists, contributors to major proof systems, a Fields Medalist, and a Turing Award laureate who guides the company’s long-term scientific direction. For more information, visit www.logicalintelligence.com or follow us on X at @logic_int and our founder & CEO at @EveLovesOlive.

View source version on businesswire.com: https://www.businesswire.com/news/home/20251202089385/en/

Contacts

media@logicalintelligence.com

About Business Wire

Business Wire
24 Martin Lane
EC4R 0DR London

+44 20 7626 1982http://www.businesswire.co.uk

(c) 2018 Business Wire, Inc., All rights reserved.

Business Wire, a Berkshire Hathaway company, is the global leader in multiplatform press release distribution.

Subscribe to releases from Business Wire

Subscribe to all the latest releases from Business Wire by registering your e-mail address below. You can unsubscribe at any time.

Latest releases from Business Wire

H.I.G. Capital Announces the Sale of DGS S.p.A.11.6.2024 12:00:00 CEST | Press release

H.I.G. Capital (“H.I.G.”), a leading global alternative investment firm with $62 billion of capital under management, is pleased to announce that an affiliate has signed a definitive agreement to sell its portfolio company, DGS S.p.A. (“DGS” or the “Group”), a leading firm in the Italian Information Technology market, to DGS Co-Founders and management team in partnership with ICG, a global alternative asset manager. Since its inception in 1997, DGShas supported blue-chip customers in the design, integration, and maintenance of complex IT systems, with a specialization in digital transformation and cybersecurity services. The Group currently has over 1,900 employees, revenues of approximately €300 million, and maintains a group of highly loyal clientele. During H.I.G.’s ownership, DGS has tripled in size and consolidated its position as a leading Italian firm in cybersecurity services and digital transformation. DGS offers its clients sophisticated and proprietary digital transformation

Evertas Names Nick Selby Head of European Underwriting11.6.2024 12:00:00 CEST | Press release

Evertas, the world’s first crypto insurance company, has named Nick Selby as its new Head of European Underwriting. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20240611141887/en/ Nick Selby, Executive Vice President and Head of European Underwriting at Evertas (Photo: Business Wire) Selby, an accomplished information and physical security professional, brings two decades of expertise in public and private sector information security, physical security, and complex incident handling, as well as seven years of experience leading teams securing billions of dollars in cryptoassets. Previously, his roles included VP of the Software Assurance Practice at Trail of Bits, Chief Security Officer at Paxos Trust Company, and Director of Cyber Intelligence and Investigations at the NYPD Intelligence Bureau. “Nick is an extremely valuable addition to our European team,” said Evertas CEO and Co-Founder J. Gdanski. “His public and private

Owlet utvider globalt fotavtrykk med lanseringen av medisinsk-sertifisert Dream Sock™ i Storbritannia og over hele Europa11.6.2024 11:00:00 CEST | Pressemelding

Owlet, Inc. («Owlet» or the «Company») (NYSE:OWLT), pioneren innen smart spedbarnsovervåking, kunngjør i dag den britiske og europeiske lanseringen av Dream Sock. Dette er en smart babymonitor med levende helseavlesninger og varsler for friske spedbarn mellom 0-18 måneder og 2,5-13,6 kg. Dette innovative medisinske utstyret gir foreldre helse og viktig informasjon i sanntid, noe som gir uovertruffen trygghet. Denne pressemeldingen inneholder multimedia. Se hele pressemeldingen her: https://www.businesswire.com/news/home/20240611820341/no/ (Photo: Business Wire) «Vi er svært stolte over å lansere Dream Sock til omsorgspersoner over hele Storbritannia og Europa og gi millioner av foreldre mer trygghet mens babyen sover,» sa Kurt Workman, Owlets administrerende direktør og medgründer. «Dream Sock er nå et globalt produkt som er anerkjent som medisinsk nøyaktig og trygt, etter å ha gjennomgått regulatoriske autorisasjoner og sertifiseringer innenfor flere geografier. I dag er misjonen vår

V-Nova Surpasses 1000 Patent Milestone in Media Technology Innovation11.6.2024 10:00:00 CEST | Press release

V-Nova, a leading provider of data compression solutions, video compression technology, XR technology, AI acceleration and parallel processing for a multitude of industries including media and entertainment, today announced its milestone achievement of 1000 active technology patents. This accomplishment underscores V-Nova’s dedication to research and development and its commitment to protecting its intellectual property globally. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20240611724561/en/ V-Nova’s patent portfolio spans more than 50 different jurisdictions. Including over 400 patents in Europe, over 200 in the Americas, over 100 in the United States specifically, and over 200 in Asia. V-Nova forged new directions in data processing to enhance digital experiences, maximize efficiency, reduce costs, and increase sustainability. The company leads the way with key international data compression standards for the video indust

Alipay+ Reveals Top Scorer Trophy Design for UEFA EURO 2024™11.6.2024 09:24:00 CEST | Press release

Alipay+, a suite of cross-border mobile payment and digitalization technology solutions operated by Ant International and an Official Partner of UEFA EURO 2024™, today revealed the trophy that will be awarded to the most prolific marksman at the UEFA EURO 2024™ finale on July 14 in Berlin, Germany. This press release features multimedia. View the full release here: https://www.businesswire.com/news/home/20240610328619/en/ The UEFA Top Scorer Trophy presented by Alipay+ is unveiled for UEFA EURO 2024™ (Photo: Business Wire) Sculpted in the shape of the Chinese character “支” (pronounced zhi, and meaning payment as well as support), the trophy reflects Alipay+’s dedication to supporting consumers to enjoy seamless payment and a broad choice of deals using their preferred payment methods while traveling abroad. The character also resembles the fleeting moment of a barefooted striker poised to shoot, evoking the original beauty and power of football – a game that united people across the wo

World GlobeA line styled icon from Orion Icon Library.HiddenA line styled icon from Orion Icon Library.Eye