Utility to report and repair broken surrogate pairs in UTF-16 text

From: Jim Monty (jim.monty@yahoo.com)
Date: Wed Nov 03 2010 - 13:37:22 CST


    Is there a utility, preferably open source and written in C, that inspects
    UTF-16/UTF-16BE/UTF-16LE text and identifies broken surrogate pairs and illegal
    characters? Ideally, the utility would both report ill-formed code units and
    "repair" them by replacing each one with U+FFFD.
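
    To make the request concrete, here is a minimal sketch of the kind of
    check I have in mind. It is not an existing utility. The function name
    repair_utf16 and its interface are made up for illustration, and it
    assumes the code units have already been read into host byte order (BOM
    detection and UTF-16BE/UTF-16LE byte swapping are left to the caller).
    It handles only unpaired surrogates, not noncharacters such as U+FFFE.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdio.h>

/* Hypothetical helper (not from any real library): scan n UTF-16 code units,
 * assumed to be in host byte order, and replace every unpaired surrogate
 * with U+FFFD in place. Returns the number of replacements made.
 * If report is non-NULL, one line per offending code unit is written to it. */
static size_t repair_utf16(uint16_t *buf, size_t n, FILE *report)
{
    size_t fixed = 0;
    size_t i = 0;

    while (i < n) {
        uint16_t u = buf[i];

        if (u >= 0xD800 && u <= 0xDBFF) {
            /* High (lead) surrogate: valid only if a low surrogate follows. */
            if (i + 1 < n && buf[i + 1] >= 0xDC00 && buf[i + 1] <= 0xDFFF) {
                i += 2;             /* well-formed pair, skip both units */
                continue;
            }
        } else if (u < 0xDC00 || u > 0xDFFF) {
            i++;                    /* ordinary BMP code unit */
            continue;
        }

        /* Reached only for a lone high surrogate or a lone low surrogate. */
        if (report != NULL) {
            fprintf(report, "offset %zu: unpaired surrogate 0x%04X\n",
                    i, (unsigned)u);
        }
        buf[i] = 0xFFFD;            /* U+FFFD REPLACEMENT CHARACTER */
        fixed++;
        i++;
    }
    return fixed;
}
```

    Passing stderr as the last argument would give the "report" behavior, and
    passing NULL would repair silently. Offsets are counted in code units, not
    bytes.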

    Jim Monty


