Proof assistant

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Template:Short description Template:Hatnote group Template:Distinguish Template:Missing information Template:More footnotes

File:CoqProofOfDecidablityOfEqualityOnNaturalNumbers.png
An interactive proof session in RocqIDE, showing the proof script on the left and the proof state on the right

In computer science and mathematical logic, a proof assistant or interactive theorem prover is a software tool to assist with the development of formal proofs by human–machine collaboration. This involves some sort of interactive proof editor, or other interface, with which a human can guide the search for proofs, the details of which are stored in, and some steps provided by, a computer.

A recent effort within this field is making these tools use artificial intelligence to automate the formalization of ordinary mathematics.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Template:AnchorSystem comparison

Template:See also

Name Latest version Developer(s) Implementation language Features
Higher-order logic Dependent types Small
kernel
Proof automation Proof by
reflection
Code generation
ACL2 8.3 Matt Kaufmann, J Strother Moore Common Lisp Template:No Template:N/a Template:No Template:Yes Template:Yes<ref>Template:Cite book</ref> Template:N/a
Agda 2.6.4.3<ref name="github-agda"/> Ulf Norell, Nils Anders Danielsson, and Andreas Abel (Chalmers and Gothenburg)<ref name="github-agda"/> Haskell<ref name="github-agda"/> Template:Yes
Template:Cn
Template:Yes
<ref name="agdaWiki"/>
Template:Yes
Template:Cn
Template:No
Template:Cn
Template:Partial
Template:Cn
Template:N/a
Template:Cn
Albatross 0.4 Helmut Brandl OCaml Template:Yes Template:No Template:Yes Template:Yes Template:Unknown Template:Not yet implemented
F* repository Microsoft Research and INRIA F* Template:Yes Template:Yes Template:No Template:Yes Template:Yes<ref>Search for "proofs by reflection": Template:ArXiv</ref> Template:Yes
HOL Light repository John Harrison OCaml Template:Yes Template:No Template:Yes Template:Yes Template:No Template:No
HOL4 Kananaskis-13 (or repo) Michael Norrish, Konrad Slind, and others Standard ML Template:Yes Template:No Template:Yes Template:Yes Template:No Template:Yes
Idris 2 0.6.0 Edwin Brady Idris Template:Yes Template:Yes Template:Yes Template:Unknown Template:Partial Template:Yes
Isabelle Isabelle2025 (March 2025) Larry Paulson (Cambridge), Tobias Nipkow (München) and Makarius Wenzel Standard ML, Scala Template:Yes Template:No Template:Yes Template:Yes Template:Yes Template:Yes
Lean citation CitationClass=web

}}</ref>

Leonardo de Moura (Microsoft Research) C++, Lean Template:Yes Template:Yes Template:Yes Template:Yes Template:Yes Template:Yes
LEGO 1.3.1 Randy Pollack (Edinburgh) Standard ML Template:Yes Template:Yes Template:Yes Template:No Template:No Template:No
Metamath citation CitationClass=web

}}</ref> || Norman Megill || ANSI C || || || || || ||

Mizar 8.1.11 Białystok University Free Pascal Template:Partial Template:Yes Template:No Template:No Template:No Template:No
Nqthm
NuPRL 5 Cornell University Common Lisp Template:Yes Template:Yes Template:Yes Template:Yes Template:Unknown Template:Yes
PVS 6.0 SRI International Common Lisp Template:Yes Template:Yes Template:No Template:Yes Template:No Template:Unknown
Rocq 9.0 INRIA OCaml Template:Yes Template:Yes Template:Yes Template:Yes Template:Yes Template:Yes
Twelf 1.7.1 Frank Pfenning, Carsten Schürmann Standard ML Template:Yes Template:Yes Template:Unknown Template:No Template:No Template:Unknown
  • ACL2 – a programming language, a first-order logical theory, and a theorem prover (with both interactive and automatic modes) in the Boyer–Moore tradition.
  • Rocq (former name: Coq) – Allows the expression of mathematical assertions, mechanically checks proofs of these assertions, helps to find formal proofs, and extracts a certified program from the constructive proof of its formal specification.
  • HOL theorem provers – A family of tools ultimately derived from the LCF theorem prover. In these systems the logical core is a library of their programming language. Theorems represent new elements of the language and can only be introduced via "strategies" which guarantee logical correctness. Strategy composition gives users the ability to produce significant proofs with relatively few interactions with the system. Members of the family include:
  • IMPS, An Interactive Mathematical Proof System.<ref>Template:Cite journal</ref>
  • Isabelle is an interactive theorem prover, successor of HOL. The main code-base is BSD-licensed, but the Isabelle distribution bundles many add-on tools with different licenses.
  • Jape – Java based.
  • Lean
  • LEGO
  • Matita – A light system based on the Calculus of Inductive Constructions.
  • MINLOG – A proof assistant based on first-order minimal logic.
  • Mizar – A proof assistant based on first-order logic, in a natural deduction style, and Tarski–Grothendieck set theory.
  • PhoX – A proof assistant based on higher-order logic which is eXtensible.
  • Prototype Verification System (PVS) – a proof language and system based on higher-order logic.
  • Theorem Proving System (TPS) and ETPS – Interactive theorem provers also based on simply typed lambda calculus, but based on an independent formulation of the logical theory and independent implementation.

User interfaces

A popular front-end for proof assistants is the Emacs-based Proof General, developed at the University of Edinburgh.

Rocq includes RocqIDE, which is based on OCaml/Gtk. Isabelle includes Isabelle/jEdit, which is based on jEdit and the Isabelle/Scala infrastructure for document-oriented proof processing. More recently, Visual Studio Code extensions have been developed for Rocq,<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> Isabelle by Makarius Wenzel,<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> and for Lean 4 by the leanprover developers.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>

Formalization extent

Freek Wiedijk has been keeping a ranking of proof assistants by the amount of formalized theorems out of a list of 100 well-known theorems. As of September 2025, only six systems have formalized proofs of more than 70% of the theorems, namely Isabelle, HOL Light, Lean, Rocq, Metamath and Mizar.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref><ref>Template:Cite journal</ref>

Notable formalized proofs

Template:See also

The following is a list of notable proofs that have been formalized within proof assistants.

Theorem Proof assistant Year
Four color theorem<ref>Template:Citation</ref> Rocq 2005
Feit–Thompson theorem<ref>{{#invoke:citation/CS1|citation CitationClass=web

}}</ref> || Rocq || 2012

Fundamental group of the circle<ref>Template:Cite book</ref> Rocq 2013
Erdős–Graham problem<ref>{{#invoke:citation/CS1|citation CitationClass=web

}}</ref><ref>Template:Cite arXiv</ref>

Lean 2022
citation CitationClass=web

}}</ref> || Lean || 2023

BB(5) = 47,176,870<ref>{{#invoke:citation/CS1|citation CitationClass=web

}}</ref>

Rocq 2024

See also

Notes

Template:Reflist

References

|CitationClass=web }}

Template:External links

Catalogues

|CitationClass=web }} (By Frank Pfenning).

de:Maschinengestütztes Beweisen