
Fair enough!

I've only briefly tried it and it did seem quite capable for what I was doing, but not that much better than the Chinese models I've been mostly using.

In any case, this [0] seems to paint a more reasonable picture than "it's much better than anything else at everything".

[0] https://www.aisi.gov.uk/blog/our-evaluation-of-claude-mythos...


