Added termination to parameters for the gym. Added default rewards. Added observations to the simulation results.
Added termination to the parameters for the gym, including a termination reward. This replaces what was previously truncation. Added functions to the gym class for computing default rewards. Added observations to the simulation results (this is a bit unclean and is not configurable). Cleaned up a few things.