To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????_ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011111 3f3f3f3f3f3f3f3f3f3f3f3f5f
SJIS-WIN 竪束賊誰遜俗竪賊尊誰遜臓_ 10010010010001111001000110101001100100011010111110010010010011101001000110111011100100011010110110010010010001111001000110101111100100011011100010010010010011101001000110111011100100011001111101011111 924791a991af924e91bb91ad924791af91b8924e91bb919f5f
EUC-JP 竪束賊誰遜俗竪賊尊誰遜臓_ 11000011101010001100001010101011110000101011000111000011101011111100001010111101110000101010111111000011101010001100001010110001110000101011101011000011101011111100001010111101110000101010000101011111 c3a8c2abc2b1c3afc2bdc2afc3a8c2b1c2bac3afc2bdc2a15f
UTF-8 竪束賊誰遜俗竪賊尊誰遜臓_ 11100111101010111010101011100110100111011001111111101000101100111000101011101000101010101011000011101001100000011001110011100100101111111001011111100111101010111010101011101000101100111000101011100101101100001000101011101000101010101011000011101001100000011001110011101000100001111001001101011111 e7abaae69d9fe8b38ae8aab0e9819ce4bf97e7abaae8b38ae5b08ae8aab0e9819ce887935f
UHC 竪束賊誰遜俗竪賊尊誰遜?_ 111000101011010111100001110101101110111011100100111000101100000111100001111000011110000111010100111000101011010111101110111001001111000011101110111000101100000111100001111000010011111101011111 e2b5e1d6eee4e2c1e1e1e1d4e2b5eee4f0eee2c1e1e13f5f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)