UnityRLBridge package contains an example of its usage.
There is an agent in the form of a cube moving on two axes, trying to reach a target position. The example environment was designed in Unity, and learning process was conducted using the stable-baselines3 library in Python.