-
Notifications
You must be signed in to change notification settings - Fork 130
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Problems related to the creation of a new chemical inverse synthesis planning environment in the LightZero framework #317
Comments
To adapt the molecular retrosynthesis planning problem into an environment compatible with the LightZero framework and leverage AlphaZero or MuZero to solve it, the design can be approached as follows: 1. Modeling the Problem as an MDPTo transform the retrosynthesis planning problem into a Markov Decision Process (MDP), the following key elements need to be defined: (1) State
(2) Action
(3) Reward
(4) State Transition
(5) Termination Condition
2. Designing the Environment for the LightZero FrameworkThe LightZero framework supports training with AlphaZero and MuZero. To integrate the retrosynthesis problem, the MDP must be implemented as an interactive environment. Below are the design suggestions: (1) Environment InterfaceThe environment should implement LightZero's standard interface(you can start from this env), with the following core functions:
(2) State Representation
(3) Dynamic Action Space
(4) State Transition Logic
(5) Reward Design
3. Generating Multiple Reference PathwaysThe LightZero framework can generate multiple reference pathways by:
Feel free to ask any detailed questions about adapting the environment and integrating algorithms in LightZero. We are eagerly looking forward to your contributions!(Part of the answer was assisted by chatgpt-4o.) |
Thank you very much for your enthusiastic answers! I am in the process of integrating my environment into the LightZero framework.LightZero is really a clean, efficient and very useful algorithmic framework that enhances my learning. When I am done I will be happy to share this environment with you. |
I am trying to translate a molecular inverse synthesis planning problem into the LightZero environment.First of all, the problem is overviewed as follows: starting from the target molecule$S_0$ , multiple reaction rules $[a_1,a_2,..., a_n]$ can be obtained to convert the target molecule into precursor molecules (one reaction rule may get one precursor molecule or more than one) by using a specialized chemical model B. It is judged for the obtained precursor molecules whether they can be purchased to get the molecules or not available using model B to get the reaction rule. The termination condition is that the target molecule finds a molecule that can be purchased through a series of reactions or reaches the maximum depth of search. ==It would also be nice for the model to output a few more reaction paths for reference.==$a_1$ is chosen, two molecules $S_1$ and $S_2$ may be obtained, if you want to get the synthesis path you must $S_1$ and $S_2$ both decomposed into molecules that can be purchased.
If this problem is transformed into a LightZero framework compatible environment can be solved using AlphaZero, Muzero, how should this problem be modeled by MDP, the difficulty is that in a Step a reaction rule
In response to this question, how do you think my environment should be designed to give a better fit with the LightZero framework. I look forward to discussing this with you, thank you!
The text was updated successfully, but these errors were encountered: