1. Q-LEARNING AND POLICY GRADIENT METHODS. Authors: HUNOR JAKAB, LEHEL CSATÓ.
2. GEODESIC DISTANCE-BASED KERNEL CONSTRUCTION FOR GAUSSIAN PROCESS VALUE FUNCTION APPROXIMATION. Authors: HUNOR JAKAB.