To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN セュ胥辞軸セュ茱釈セュ胥辞軸セュ茱釈B 1011111010101101111000111110111110001110101010111000111010110010101111101010110111100100101000111000111011011111101111101010110111100011111011111000111010101011100011101011001010111110101011011110010010100011100011101101111101000010 beade3ef8eab8eb2beade4a38edfbeade3ef8eab8eb2beade4a38edf42
EUC-JP セュ胥辞軸セュ茱釈セュ胥辞軸セュ茱釈B 10001110101111101000111010101101111001101111000110111100101011011011110010110100100011101011111010001110101011011110100010100101101111001110000110001110101111101000111010101101111001101111000110111100101011011011110010110100100011101011111010001110101011011110100010100101101111001110000101000010 8ebe8eade6f1bcadbcb48ebe8eade8a5bce18ebe8eade6f1bcadbcb48ebe8eade8a5bce142
UTF-8 セュ胥辞軸セュ茱釈セュ胥辞軸セュ茱釈B 11101111101111011011111011101111101111011010110111101000100000111010010111101000101111101001111011101000101110111011100011101111101111011011111011101111101111011010110111101000100011001011000111101001100001111000100011101111101111011011111011101111101111011010110111101000100000111010010111101000101111101001111011101000101110111011100011101111101111011011111011101111101111011010110111101000100011001011000111101001100001111000100001000010 efbdbeefbdade883a5e8be9ee8bbb8efbdbeefbdade88cb1e98788efbdbeefbdade883a5e8be9ee8bbb8efbdbeefbdade88cb1e9878842
UHC ??胥?軸??茱???胥?軸??茱?B 00111111001111111110000010100001001111111111010111101110001111110011111111100010101111000011111100111111001111111110000010100001001111111111010111101110001111110011111111100010101111000011111101000010 3f3fe0a13ff5ee3f3fe2bc3f3f3fe0a13ff5ee3f3fe2bc3f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)