To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪淡孫誰遜揃竪束賊誰遜俗竪測続辿 1001001001000111100100100101011110010001101101111001001001001110100100011011101110010001101101011001001001000111100100011010100110010001101011111001001001001110100100011011101110010001101011011001001001000111100100011010101010010001101100011001001001001000 9247925791b7924e91bb91b5924791a991af924e91bb91ad924791aa91b19248
EUC-JP 竪淡孫誰遜揃竪束賊誰遜俗竪測続辿 1100001110101000110000111011100011000010101110011100001110101111110000101011110111000010101101111100001110101000110000101010101111000010101100011100001110101111110000101011110111000010101011111100001110101000110000101010110011000010101100111100001110101001 c3a8c3b8c2b9c3afc2bdc2b7c3a8c2abc2b1c3afc2bdc2afc3a8c2acc2b3c3a9
UTF-8 竪淡孫誰遜揃竪束賊誰遜俗竪測続辿 111001111010101110101010111001101011011110100001111001011010110110101011111010001010101010110000111010011000000110011100111001101000111110000011111001111010101110101010111001101001110110011111111010001011001110001010111010001010101010110000111010011000000110011100111001001011111110010111111001111010101110101010111001101011100010101100111001111011011010011010111010001011111010111111 e7abaae6b7a1e5adabe8aab0e9819ce68f83e7abaae69d9fe8b38ae8aab0e9819ce4bf97e7abaae6b8ace7b69ae8bebf
UHC 竪淡孫誰遜?竪束賊誰遜俗竪測?? 1110001010110101110100111011111111100001110111011110001011000001111000011110000100111111111000101011010111100001110101101110111011100100111000101100000111100001111000011110000111010100111000101011010111110110101101000011111100111111 e2b5d3bfe1dde2c1e1e13fe2b5e1d6eee4e2c1e1e1e1d4e2b5f6b43f3f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)