Jacob Hilton is an influential AI alignment researcher renowned for his significant contributions to making large language models safer and more aligned with human intentions. He was a key member of the alignment team at OpenAI, where he co-led the development of InstructGPT, the predecessor to ChatGPT, pioneering techniques in Reinforcement Learning from Human Feedback (RLHF). His work focuses on scalable oversight, aiming to ensure that AI systems, even those far more capable than humans, can be reliably supervised. After his tenure at OpenAI, he has continued his research independently and has been associated with the Alignment Research Center (ARC) Evals team, contributing to the evaluation of advanced AI models for potential risks and alignment challenges.
Jacob Hilton's work history includes a series of influential roles in various companies. Here is a detailed list of his professional journey:
Co-led the research and development of InstructGPT at OpenAI, demonstrating significant improvements in language model helpfulness, honesty, and harmlessness through Reinforcement Learning from Human Feedback (RLHF).
Made foundational contributions to the practical application and scaling of Reinforcement Learning from Human Feedback, a critical technique for aligning large language models like ChatGPT.
Conducted and published research on scalable oversight mechanisms, exploring how humans can effectively supervise and align AI systems that may eventually surpass human capabilities in many domains.
Actively contributes to the AI alignment research community through publications, talks, and engagement, fostering discussion and progress on critical AI safety problems.
Contributed to the Alignment Research Center's (ARC) Evals team, focusing on developing and applying evaluations to assess the capabilities and alignment of advanced AI systems, particularly in identifying potentially dangerous emergent behaviors.
State Technical College of Missouri - Year 2017
Kirkwood Sr. High School - Year 2013
Highperformr Signals uncover buying intent and give you clear insights to target the right people at the right time — helping your sales, marketing, and GTM teams close more deals, faster.
Concolor Software is a technology firm specializing in the development and delivery of innovative software solutions. While specific details about its market focus and product suite are not widely publicized, it likely aims to provide tools or platforms that address specific business challenges or consumer needs. As a software developer, its core activities would revolve around software design, coding, testing, and maintenance, potentially serving various industries or a niche market segment. Given the limited public information, it is presumed to be a private entity.
Get verified emails, phone numbers, and LinkedIn profile details
Discover contacts with similar roles, seniority, or companies
Uncover insights like skills, work history, social links, and more
Explore contacts in-depth — from verified emails and phone numbers to LinkedIn activity, job changes, and more — all in one powerful view.
Highperformr AI helps you surface the right people and enrich your CRM with live, accurate contact insights so your teams can connect faster and close smarter.
Thousands of contacts — including decision-makers, influencers, and ICP matches — are just a search away.
Thousands of companies, including, are just a search away.