What bit string is a character (or short string) encoded to in each character set?
Enter one character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??????B | 00111111001111110011111100111111001111110011111101000010 | 3f3f3f3f3f3f42 |
SJIS-WIN | ニミ七裝軸B | 1111001010100110110001101101000010001110101101011110010111100100100011101011001001000010 | f2a6c6d08eb5e5e48eb242 |
EUC-JP | ?ニミ七裝軸B | 001111111000111011000110100011101101000010111100101101111110101011100110101111001011010001000010 | 3f8ec68ed0bcb7eae6bcb442 |
UTF-8 | ニミ七裝軸B | 11101110100001111001110111101111101111101000011011101111101111101001000011100100101110001000001111101000101000111001110111101000101110111011100001000010 | ee879defbe86efbe90e4b883e8a39de8bbb842 |
UHC | ???七裝軸B | 00111111001111110011111111110110110100101110110111111011111101011110111001000010 | 3f3f3ff6d2edfbf5ee42 |
SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
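
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `to_bit_strings` and the sample text are hypothetical, and the mapping of this page's charset names to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters a charset cannot represent are replaced with `?` (0x3f), mirroring the `3f` bytes in the table, but Python's conversion tables may differ from this page's for some characters.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hex) strings.

    Unencodable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Assumed mapping from the names used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "七B"  # hypothetical input: one kanji plus one ASCII letter
    for name, codec in CHARSETS.items():
        bits, hexa = to_bit_strings(sample, codec)
        print(f"{name} | {bits} | {hexa}")
```

For the sample above, the UTF-8 row comes out as `e4b88342`, which agrees with the `e4b883` (七) and `42` (B) byte sequences in the table.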