To what bitstring a character(s) is encoded in each character set?
Input one character or short letters and click "Convert."
Charset | Character | Bit string (binary) | Bit String (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
SJIS-WIN | 攸磐?頃?? | 100111011011111110010100110101100011111110001101101000000011111100111111 | 9dbf94d63f8da03f3f |
EUC-JP | 攸磐?頃?? | 110110101100000111001000110110000011111110111010101000100011111100111111 | dac1c8d83fbaa23f3f |
UTF-8 | 攸磐렱頃렱골 | 111001101001010010111000111001111010001110010000111010111010000010110001111010011010000010000011111010111010000010110001111010101011001110101000 | e694b8e7a390eba0b1e9a083eba0b1eab3a8 |
UHC | 攸磐렱頃렱골 | 111010101111001011011010111100011000111010111110110011001111000110001110101111101011000011110001 | eaf2daf18ebeccf18ebeb0f1 |
SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)