What bit string is a character encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??h?? | 0011111100111111011010000011111100111111 | 3f3f683f3f |
SJIS-WIN | 煬?h煬? | 11100000100011000011111101101000111000001000110000111111 | e08c3f68e08c3f |
EUC-JP | 煬?h煬? | 11011111111011000011111101101000110111111110110000111111 | dfec3f68dfec3f |
UTF-8 | 煬먬h煬먬 | 11100111100001011010110011101011101010001010110001101000111001111000010110101100111010111010100010101100 | e785aceba8ac68e785aceba8ac |
UHC | 煬먬h煬먬 | 111001011100100110010000111010010110100011100101110010011001000011101001 | e5c990e968e5c990e9 |
SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
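
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, and the choice of Python codec for each label (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. `errors="replace"` is assumed to mirror the `?` (0x3f) substitution seen in the table for characters a charset cannot represent. Results for the legacy multi-byte charsets may differ from the table for some characters.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is the closest Python codec
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 is the closest Python codec
}


def encode_to_bits(text: str, codec: str) -> tuple[str, str]:
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "\u716c\uba2ch\u716c\uba2c"  # the string used in the table above
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(f"{label} | {binary} | {hexadecimal}")
```

Each byte is printed as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.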