Structured, flexible, and robust: benchmarking and improving large language models towards more human-like behavior in out-of-distribution reasoning tasks
Details
The content you want is available to Zendy users.Already have an account? Click here. to sign in.