To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 楮級?億?楮級?億?B 1001111010111000100010111000100100111111100010011010110100111111100111101011100010001011100010010011111110001001101011010011111101000010 9eb88b893f89ad3f9eb88b893f89ad3f42
EUC-JP 楮級?億?楮級?億?B 1101110010111010101101011110100100111111101100101010111100111111110111001011101010110101111010010011111110110010101011110011111101000010 dcbab5e93fb2af3fdcbab5e93fb2af3f42
UTF-8 楮級꺽億렚楮級꺽億렚B 11100110101001011010111011100111101101001001101011101010101110101011110111100101100001001000010011101011101000001001101011100110101001011010111011100111101101001001101011101010101110101011110111100101100001001000010011101011101000001001101001000010 e6a5aee7b49aeababde58484eba09ae6a5aee7b49aeababde58484eba09a42
UHC 楮級꺽億렚楮級꺽億렚B 111011101011111111010000111001001011001010101001111001011110001010001110101011011110111010111111110100001110010010110010101010011110010111100010100011101010110101000010 eebfd0e4b2a9e5e28eadeebfd0e4b2a9e5e28ead42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)