Using probabilities calculated from the Asian_Surnames database, along with regular expressions we've developed, I've written an XSLT file that parses the XML dump from the database, looking for owners with no assigned ethnicity. It sorts them into four lists: most probably Japanese, less probably Japanese, possibly Chinese, and probably neither. For each of the first three lists, it also generates SQL INSERT statements to assign the appropriate ethnicity. Results sent to SF.
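
For illustration only, here is a minimal Python sketch of this kind of pipeline. It is not the XSLT itself, and everything specific in it is an assumption made for the example: the XML element names (`owner`, `surname`, `ethnicity`), the table and column names in the SQL, the probability values, the 0.9 and 0.5 cut-offs, and the suffix regex. The real probabilities come from the Asian_Surnames database.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical surname probabilities; the real values come from the
# Asian_Surnames database and are not reproduced here.
SURNAME_PROBS = {
    "tanaka": {"Japanese": 0.98, "Chinese": 0.00},
    "hayashi": {"Japanese": 0.70, "Chinese": 0.05},
    "wong": {"Japanese": 0.00, "Chinese": 0.90},
}

# Hypothetical regex: a few common Japanese surname endings.
JAPANESE_HINT = re.compile(r"(moto|mura|yama|shima|kawa)$", re.IGNORECASE)

# Assumed cut-offs separating the confidence levels.
HIGH, LOW = 0.9, 0.5

# Buckets that lead to an ethnicity assignment (the first three lists).
ETHNICITY = {
    "most_probably_japanese": "Japanese",
    "less_probably_japanese": "Japanese",
    "possibly_chinese": "Chinese",
}
BUCKETS = list(ETHNICITY) + ["probably_neither"]


def classify(surname):
    """Return the name of the list a surname belongs in."""
    probs = SURNAME_PROBS.get(surname.lower(), {})
    p_japanese = probs.get("Japanese", 0.0)
    p_chinese = probs.get("Chinese", 0.0)
    if p_japanese >= HIGH:
        return "most_probably_japanese"
    if p_japanese >= LOW or JAPANESE_HINT.search(surname):
        return "less_probably_japanese"
    if p_chinese >= LOW:
        return "possibly_chinese"
    return "probably_neither"


def process(xml_text):
    """Sort owners without an ethnicity into lists and build SQL inserts."""
    root = ET.fromstring(xml_text)
    lists = {bucket: [] for bucket in BUCKETS}
    sql = []
    for owner in root.iter("owner"):
        if owner.findtext("ethnicity"):
            continue  # ethnicity already assigned, nothing to do
        surname = owner.findtext("surname", default="")
        bucket = classify(surname)
        lists[bucket].append(surname)
        if bucket in ETHNICITY:
            owner_id = int(owner.get("id"))  # int() rejects non-numeric ids
            sql.append(
                "INSERT INTO owner_ethnicity (owner_id, ethnicity) "
                f"VALUES ({owner_id}, '{ETHNICITY[bucket]}');"
            )
    return lists, sql


# Made-up sample dump, in the assumed layout.
SAMPLE = """<owners>
  <owner id="1"><surname>Tanaka</surname></owner>
  <owner id="2"><surname>Wong</surname></owner>
  <owner id="3"><surname>Smith</surname></owner>
  <owner id="4"><surname>Yamamoto</surname><ethnicity>Japanese</ethnicity></owner>
  <owner id="5"><surname>Sakamoto</surname></owner>
</owners>"""

if __name__ == "__main__":
    lists, sql = process(SAMPLE)
    for statement in sql:
        print(statement)
```

With the sample data, owner 4 is skipped because an ethnicity is already present, "Smith" lands in the last list and gets no SQL, and the other three each produce one INSERT statement. Keeping the less certain matches in their own list makes it easy to review them by hand before running the generated SQL.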