How to get all premises in a random Lean repo? #20
-
I'm working on translating natural language reasoning task into Lean code and solving it using theorem proving techniques (for examples, please check here). For this, I need a way to extract all premises from the file so I can select the relevant ones for reasoning tasks. I briefly looked over the code but it appears that a Furthermore, I'm curious to know if the existing ReProver model can handle this task? Although the problem I've outlined might seem outside its current scope, it could potentially be solvable given that the required tactics are fairly straightforward - such as |
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 3 replies
-
For |
Beta Was this translation helpful? Give feedback.
-
Thanks for guiding me to the demo. I've now sorted the issue of finding premises. Yet, I'm still wrestling with producing proofs for my natural language reasoning task. I'm using GPT4 to formalize natural language reasoning problems into Lean code and using ReProver to handle the theorem proving part. For my example, I give premises in the form of I'm unsure if I'm doing something wrong, or if the gap between math theorem training data and translated natural language reasoning training data is too wide for successful proofs. I would appreciate any suggestions! |
Beta Was this translation helpful? Give feedback.
For
corpus.jsonl
, you can look into the code for generating LeanDojo Benchmark. For the other question, maybe ReProver can be used, but I don't see a way to collect enough aligned data of natural language -> formal theorem.