What bit string does a character (or a short string) encode to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??±??^ | 001111110011111110110001001111110011111101011110 | 3f3fb13f3f5e |
SJIS-WIN | ??±?蔑^ | 0011111100111111100000010111110100111111100101011100110001011110 | 3f3f817d3f95cc5e |
EUC-JP | ??±勖蔑^ | 00111111001111111010000111011110100011111011001111101101110010101100111001011110 | 3f3fa1de8fb3edcace5e |
UTF-8 | 얘뤫±勖蔑^ | 111011001001011010011000111010111010010010101011110000101011000111100101100010111001011011101000100101001001000101011110 | ec9698eba4abc2b1e58b96e894915e |
UHC | 얘뤫±勖蔑^ | 1011111011101010100011111101100110100001101111101110100111101101110110011010000101011110 | beea8fd9a1bee9edd9a15e |
SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
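
The conversion shown in the table can be approximated with a short Python sketch. The `CODECS` mapping and the `to_bit_strings` helper are hypothetical names introduced here for illustration. The codec names are Python's, chosen on the assumption that `cp932` corresponds to SJIS-WIN and `cp949` to UHC. This is not the converter's own implementation, so its output may differ from the table for some characters. Characters a charset cannot represent are replaced with `?` (hex `3f`), which is how the `?` entries in the table arise.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("±^", codec)
        print(f"{label} | {binary} | {hexadecimal}")
```

Passing the table's input string `얘뤫±勖蔑^` instead of `±^` should give rows comparable to the ones above, subject to the codec assumptions stated.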