Which bit string is a character (or a short string) encoded to in each character set?
Enter a single character or a short string and click "Convert."
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ?????? | 001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f |
| SJIS-WIN | 紹ヤ鑅ヘ贊貽 | 10001111110100001101010011111011111001101100110111100110110100111110011011000100 | 8fd0d4fbe6cde6d3e6c4 |
| EUC-JP | 紹ヤ鑅ヘ贊貽 | 10111110110100101000111011010100100011111110010111101010100011101100110111101100110101011110110011000110 | bed28ed48fe5ea8ecdecd5ecc6 |
| UTF-8 | 紹ヤ鑅ヘ贊貽 | 111001111011010010111001111011111011111010010100111010011001000110000101111011111011111010001101111010001011010010001010111010001011001010111101 | e7b4b9efbe94e99185efbe8de8b48ae8b2bd |
| UHC | 紹???贊貽 | 111000011100100100111111001111110011111111110011110001111110110011000010 | e1c93f3f3ff3c7ecc2 |
SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f).
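
The conversion in the table can be approximated offline with a few lines of Python. This is only a minimal sketch, not the converter behind this page: the mapping from the charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC, and so on) and the helper name `to_bit_strings` are assumptions made for illustration, and Python's codec tables may differ from this page's for rare characters. Unencodable characters are replaced with "?", as in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Hypothetical helper: characters the codec cannot represent are
    replaced with '?' (byte 0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, so leading zeros are kept.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(f"{label}\t{sample}\t{binary}\t{hexadecimal}")
```

Padding every byte to eight digits keeps the binary column aligned with the hexadecimal one: two hex digits always correspond to eight bits.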