The UTF8 program is a tool for converting from a UTF-8 byte sequence to the Unicode code point. The byte sequence can also be referred to as the binary representation of the code even though the byte sequence is considered text. The term "byte sequence" here is used to refer to a sequence of bytes intended to represent something, which in this case is a Unicode character. The Unicode code point is the Unicode designation uniquely identifying that particular sequence. The Unicode code point is not specific to UTF-8 and persists across different encodings, such as UTF-16.

The idea behind the UTF8 program is to provide the answer to the question of what some special UTF-8 character is or to provide a way to create the UTF-8 character given the Unicode code point.

This tool is intended to be scriptable, should handle both piped data and files, and can convert entire files.

This tool can be used to validate a given byte sequence or can be used to get the character width of some byte sequence or code point.

This tool can be used to store binary data in a text-friendly format and then restore the binary data.