Combines reinforcement learning with DLITE (Delight), a new information measure used as a loss function, for training and fine-tuning machine learning and deep learning models.
