Meaning Alignment Institute
Model Integrity
You may want a compliant assistant, but a co-founder with integrity. We propose ‘model integrity’ as an overlooked challenge in aligning LLM agents.
Dec 5 • Joe Edelman and Oliver Klingefjord
March 2024
[New paper] What are human values, and how do we align to them?
We are excited to release our new paper on values alignment! Co-authored with Ryan Lowe, and funded by OpenAI.
Mar 29 • Joe Edelman, Oliver Klingefjord, and Ryan Lowe
February 2024
David Shapiro Interview
And two other quick updates.
Feb 6 • Oliver Klingefjord and Joe Edelman
December 2023
Year End Bonus: a GPT to help with your New Year's Resolutions
This time of year, many reflect on their values. What are yours? How can you weave your life around them?
Dec 31, 2023 • Joe Edelman
Meaning Alignment Institute: Year in Review
And what's next for 2024
Dec 29, 2023 • Oliver Klingefjord and Joe Edelman
October 2023
OpenAI x DFT: The First Moral Graph
Beyond Constitutional AI; Our first trial with 500 Americans; How democratic processes can generate an LLM we can trust.
Oct 24, 2023 • Joe Edelman and Oliver Klingefjord
September 2023
Help us make ChatGPT wiser
Join our OpenAI-backed experiment to democratically fine-tune ChatGPT's values.
Sep 20, 2023 • Joe Edelman and Oliver Klingefjord
August 2023
We are now "The Institute for Meaning Alignment"
Hello everyone!
Aug 29, 2023 • Joe Edelman
Introducing Democratic Fine-Tuning
An alternative to Constitutional AI or simple RLHF-based approaches for fine-tuning LLMs based on moral information from diverse populations.
Aug 29, 2023 • Joe Edelman and Oliver Klingefjord