Skip to content

yforster/coq-verified-extraction

Repository files navigation

Verified Extraction from Coq to OCaml

This repository contains the current development state of a new verified extraction from Coq to OCaml, based on MetaCoq. Technically, the extraction targets Malfunction, which is a specification of Lambda, the internal language of the OCaml compiler. We use Malfunction as target for extraction from Coq, and rely on the Malfunction and OCaml compilers to obtain .cmx files that will behave like .cmx files created by Coq's current extraction process and the Ocaml compiler. In particular, Coq programs extracted like this can interact with other OCaml programs and with Coq programs extracted using the existing extraction.

The implementation of extraction is fully functional and supports all of Coq's constructs including primitive integers, floats and arrays, but the cofixpoint to lazy/force translations is not verified yet. The article "Verified Extraction from Coq to OCaml" published and awarded at PLDI'24 describes this development.

Installation

opam switch create coq-malfunction --packages="ocaml-variants.4.14.1+options,ocaml-option-flambda"
eval $(opam env --switch=coq-malfunction)
opam repo add coq-released https://coq.inria.fr/opam/released
opam pin -n -y "https://github.com/MetaCoq/metacoq.git#v1.3.2-8.19"
opam pin -n -y "https://github.com/stedolan/malfunction.git#master"
opam install . --deps-only
make -j 4

Usage

After From VerifiedExtraction Require Import Extraction. the commands Verified Extraction <definition> and Verified Extraction <definition> "<file>.mlf" can be used to run the new extraction process. Multiple functions can be extracted at the same time with Verified Extraction (<d1>,<d2>,...). To add an mli file one can add the output of the (unverified) generator MetaCoq Run Print mli <definition>. to a .mli file. Note that this metaprogram does not (yet) take into account the reordering of constructors, besides the one for booleans (see Extract Inductives below). It won't output declarations for unit, bool, list, option and prod that have matching representations in OCaml.

Verified Extraction supports the following options:

  • -time: prints compilation and run times
  • -typed: uses typed extraction to perform more agressive argument removal
  • -bypass-qeds: by default, Qed's proofs are turned into opaque constants during reification in MetaCoq, so that the reified environment is smaller, this bypasses this behavior and reifies everything (can be uterly slow). Axioms are turned into external function calls, so if some remain in the extracted code they will have to be implemented (in the produced Axioms.ml file)
  • -compile-plugin: compile the code as a Coq plugin that can be loaded.
  • -compile-with-coq: compile the code as a standalone program that however links with Coq's codebase (c.f. the CoqMsgFFi.v file)
  • -compile: compile the code as a fully standalone program not depending on Coq (can be run from the shell)
  • -optimize: Use "malfunction -O2" to compile produced code.
  • -load: load the plugin into Coq
  • -run: Runs the compiled program or linked plugin and readback it's computed value. Plugins using the CoqMsgFFI.v can output information in Coq's info, notice and error channels while running.
  • -verbose: More verbose compiler output
  • -fmt: Use malfunctions automatic formatting to produce the .mlf file (for more readable generated code)
  • -unsafe: Use unsafe optimizations (all of them, or a subset by listing some of the following flags separated by spaces):
    • cofix-to-lazy: unverified cofixpoints to lazy/force translation
    • inlining: honor Verified Extract Inline directives.
    • unboxing: perform unboxing of types having a single constructor with a single computational argument This is run after typed erasure for more opportunities to unbox.
    • betared: perform apparent beta reductions (useful when combined with inlining)

Verified Extraction also supports the directives:

  • Verified Extract Inductives [ ind [ name [ tags ] ], .. ]: this declares a reordering of constructors to match existing ocaml datatype representations (tags are natural numbers). This reordering phase is verified. By default, Coq's booleans Inductive bool := true | false are mapped to OCaml's type bool = false | true by swapping their constructor tags.
  • Verified Extract Inline cst: declares cst to be inlined (expanded everywhere) during extraction. This phase is NOT verified at the moment.

Structure

  • the theories directory contains the Coq files with implementation and verification of the plugin
    • Malfunction.v contains the syntax definition of Malfunction. It is a direct, line-to-line port of the OCaml file malfunction.ml to Coq.
    • SemanticsSpec.v defines an inductive big-step evaluation predicate.
    • Compile.v defines a compilation function of lambda box to Malfunction.
    • CompileCorrect.v proves the correctness of this function, using the correctness proof of case analysis in Mcase.v
    • Interpreter.v contains an interpretation function, which is close to malfunction_interpreter.ml, and a proof that values according to the evaluation predicate are also found by the interpreter. Note that since the interpreter is not necessarily terminating we switch of Coq's termination checker, meaning this proof can only be seen as a sanity check.
    • Serialize.v contains seralization functions into s-expressions. There is also a parser in Deserialize.v, used for testing.
    • Pipeline.v composes the full extraction pipeline from Coq to Malfunction
    • PipelineCorrect.v composes the correctness proof of the extraction pipeline
    • Firstorder.v derives interoperability results for first-order functions
  • the plugin directory contains the code of the extraction plugin extracted using Coq's OCaml extraction via theories/Extraction.v. This is packaged into an intermediate extraction plugin.
  • the plugin/plugin-bootstrap directory contains the code of the extraction plugin extracted using the intermediate extraction plugin. This is packaged into the final verified extraction plugin.
  • the examples directory contains various examples how to use the new verified extraction plugin.
  • the benchmarks directory allows re-producing benchmarks from the paper.

Team & Credits

The project is developed by Yannick Forster, Matthieu Sozeau, Pierre-Marie Pédrot, and Nicolas Tabareau.

Copyright (c) 2022--2024 Yannick Forster, Matthieu Sozeau, Nicolas Tabareau