To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | AåV¸ãTé\ | 10011110010000011110010101010110100110001011100011100011010101001110100101011100 | 9e41e55698b8e354e95c |
SJIS-WIN | ?A?V???T?\ | 00111111010000010011111101010110001111110011111100111111010101000011111101011100 | 3f413f563f3f3f543f5c |
EUC-JP | ?AåV?¸ãTé\ | 001111110100000110001111101010111010100101010110001111111000111110100010101100011000111110101011101010100101010010001111101010111011000101011100 | 3f418faba9563f8fa2b18fabaa548fabb15c |
UTF-8 | AåV¸ãTé\ | 11000010100111100100000111000011101001010101011011000010100110001100001010111000110000111010001101010100110000111010100101011100 | c29e41c3a556c298c2b8c3a354c3a95c |
UHC | ?A?V?¸?T?\ | 0011111101000001001111110101011000111111101000101010110000111111010101000011111101011100 | 3f413f563fa2ac3f543f5c |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)