Guive’s Substack
Subscribe
Sign in
Home
Archive
About
Token and Taboo
Can language models correctly identify moral taboos?
Apr 24
•
Guive Assadi
3
Share this post
Guive’s Substack
Token and Taboo
Copy link
Facebook
Email
Notes
More
January 2025
Testing for Scheming with Model Deletion
There is a simple behavioral test that would provide significant evidence about whether AIs with a given rough set of characteristics develop subversive…
Jan 7
•
Guive Assadi
2
Share this post
Guive’s Substack
Testing for Scheming with Model Deletion
Copy link
Facebook
Email
Notes
More
December 2024
Updating on Bad Arguments
Here is an intuitively compelling principle: hearing a bad argument for a view shouldn’t change your degree of belief in the view.
Dec 21, 2024
•
Guive Assadi
5
Share this post
Guive’s Substack
Updating on Bad Arguments
Copy link
Facebook
Email
Notes
More
2
May 2024
Coming soon
This is Guive’s Substack.
May 7, 2024
•
Guive Assadi
Share this post
Guive’s Substack
Coming soon
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts