[1]
Elliot Harrington, “Improving Sample Efficiency with Policy Gradient Variants”, EDTECH, vol. 6, no. 1, pp. 11–15, Feb. 2026.