A prominent technology company in York Region, Canada, is seeking a Researcher for Reinforcement Learning. This role focuses on enhancing Large Language Models (LLMs) through innovative training techniques and evaluation methods. The ideal candidate will hold a PhD in Computer Science, have strong deep learning and reinforcement learning skills, and be proficient in Python with experience in tools like PyTorch. This entry-level position is a 12-month contract.
Researcher Agentic RL for LLMs Contract • Markham, York Region, CA