What bit string is a character (or string of characters) encoded to in each character set?
Enter a single character or a short string and click "Convert."
Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
---|---|---|---|
ISO-8859-1 | ??????[ | 00111111001111110011111100111111001111110011111101011011 | 3f3f3f3f3f3f5b |
SJIS-WIN | 樹??諄??[ | 100011101111011100111111001111111110011001111000001111110011111101011011 | 8ef73f3fe6783f3f5b |
EUC-JP | 樹??諄??[ | 101111001111100100111111001111111110101111011001001111110011111101011011 | bcf93f3febd93f3f5b |
UTF-8 | 樹귚뙭諄뗤뙭[ | 11100110101010001011100111101010101101111001101011101011100110011010110111101000101010111000010011101011100101111010010011101011100110011010110101011011 | e6a8b9eab79aeb99ade8ab84eb97a4eb99ad5b |
UHC | 樹귚뙭諄뗤뙭[ | 11100010101001111000001011100100100011001011000011100010111101001000101111100100100011001011000001011011 | e2a782e48cb0e2f48be48cb05b |
SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
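
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The codec names (`cp932` for SJIS-WIN, `euc_jp`, `cp949` for UHC) are assumed to be the nearest Python equivalents, and the helper name `to_bit_strings` is hypothetical. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the `3f` bytes in the table rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are the nearest standard-library equivalents, not a guaranteed match.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for each unencodable character.
    raw = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits and concatenate them.
    binary = "".join(format(byte, "08b") for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("樹[", codec)
        print(f"{label} | {binary} | {hexadecimal}")
```

Each byte is printed as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.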