What bit string is a character (or string of characters) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | üGÛûMèÜâ^ | 111111000100011111011011111110110100110111101000110111001110001001011110 | fc47dbfb4de8dce25e |
SJIS-WIN | ?G??M???^ | 001111110100011100111111001111110100110100111111001111110011111101011110 | 3f473f3f4d3f3f3f5e |
EUC-JP | üGÛûMèÜâ^ | 100011111010101111100100010001111000111110101010111001011000111110101011111001010100110110001111101010111011001010001111101010101110010010001111101010111010010001011110 | 8fabe4478faae58fabe54d8fabb28faae48faba45e |
UTF-8 | üGÛûMèÜâ^ | 110000111011110001000111110000111001101111000011101110110100110111000011101010001100001110011100110000111010001001011110 | c3bc47c39bc3bb4dc3a8c39cc3a25e |
UHC | ?G??M???^ | 001111110100011100111111001111110100110100111111001111110011111101011110 | 3f473f3f4d3f3f3f5e |
SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.
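
The conversion shown in the table can be approximated offline with a few lines of Python. This is a minimal sketch, not the converter's actual implementation: the helper name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration, and Python's codec tables may differ from the converter's in edge cases. Unencodable characters are replaced with "?" to mirror the table.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings of `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "\u00fcG\u00db\u00fbM\u00e8\u00dc\u00e2^"  # the string used in the table
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(f"{label} | {binary} | {hexadecimal}")
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.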