
Are there any public test data sets of name corner cases? Given the popular "falsehoods programmers believe" lists, someone could create a public data set of unsanitized name inputs, expected decompositions, and expected round-trip results. I think genealogy organizations have published de facto standards for name formatting.
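To make the idea concrete, here is a minimal sketch of what one record in such a data set and a harness around it might look like. Everything in it is an assumption for illustration: the `NameCase` fields, the given/family split, the sample records, and the deliberately naive `naive_parse` are all hypothetical and not taken from any published standard.

```python
import unicodedata
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class NameCase:
    """One hypothetical test record: raw input plus the expected outcomes."""
    raw: str                        # unsanitized input, as a user might type it
    expected_parts: Dict[str, str]  # expected decomposition
    expected_round_trip: str        # expected string after parse + format


# Illustrative records only; a real data set would need far more coverage.
CASES: List[NameCase] = [
    NameCase("  Ada   Lovelace ", {"given": "Ada", "family": "Lovelace"}, "Ada Lovelace"),
    NameCase("Madonna", {"given": "Madonna", "family": ""}, "Madonna"),
    # Decomposed accents in the input, precomposed (NFC) accents expected back.
    NameCase("Jose\u0301 Garci\u0301a",
             {"given": "Jos\u00e9", "family": "Garc\u00eda"},
             "Jos\u00e9 Garc\u00eda"),
    NameCase("Ludwig van Beethoven",
             {"given": "Ludwig", "family": "van Beethoven"},
             "Ludwig van Beethoven"),
]


def naive_parse(raw: str) -> Dict[str, str]:
    """Deliberately naive parser: the last token is the family name."""
    tokens = unicodedata.normalize("NFC", raw).split()
    if not tokens:
        return {"given": "", "family": ""}
    if len(tokens) == 1:
        return {"given": tokens[0], "family": ""}
    return {"given": " ".join(tokens[:-1]), "family": tokens[-1]}


def format_name(parts: Dict[str, str]) -> str:
    """Join the non-empty parts back into a display string."""
    return " ".join(p for p in (parts["given"], parts["family"]) if p)


def run_cases(
    parse: Callable[[str], Dict[str, str]],
    fmt: Callable[[Dict[str, str]], str],
    cases: List[NameCase],
) -> List[str]:
    """Return the raw inputs whose decomposition or round trip is wrong."""
    failures = []
    for case in cases:
        parts = parse(case.raw)
        if parts != case.expected_parts or fmt(parts) != case.expected_round_trip:
            failures.append(case.raw)
    return failures


if __name__ == "__main__":
    print(run_cases(naive_parse, format_name, CASES))
```

With these sample records the naive parser fails only the "Ludwig van Beethoven" case: the round trip looks fine, but the decomposition puts "van" in the given name, which is why the expected decomposition needs to be stored separately from the round-trip string.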


Not exactly what you asked for, but https://github.com/minimaxir/big-list-of-naughty-strings/ does have a lot of inputs which, if given as names, would turn up the bugs you're looking for.



