To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 竪淡孫誰遜揃竪淡孫誰遜揃竪束賊誰遜俗B 10010010010001111001001001010111100100011011011110010010010011101001000110111011100100011011010110010010010001111001001001010111100100011011011110010010010011101001000110111011100100011011010110010010010001111001000110101001100100011010111110010010010011101001000110111011100100011010110101000010 9247925791b7924e91bb91b59247925791b7924e91bb91b5924791a991af924e91bb91ad42
EUC-JP 竪淡孫誰遜揃竪淡孫誰遜揃竪束賊誰遜俗B 11000011101010001100001110111000110000101011100111000011101011111100001010111101110000101011011111000011101010001100001110111000110000101011100111000011101011111100001010111101110000101011011111000011101010001100001010101011110000101011000111000011101011111100001010111101110000101010111101000010 c3a8c3b8c2b9c3afc2bdc2b7c3a8c3b8c2b9c3afc2bdc2b7c3a8c2abc2b1c3afc2bdc2af42
UTF-8 竪淡孫誰遜揃竪淡孫誰遜揃竪束賊誰遜俗B 11100111101010111010101011100110101101111010000111100101101011011010101111101000101010101011000011101001100000011001110011100110100011111000001111100111101010111010101011100110101101111010000111100101101011011010101111101000101010101011000011101001100000011001110011100110100011111000001111100111101010111010101011100110100111011001111111101000101100111000101011101000101010101011000011101001100000011001110011100100101111111001011101000010 e7abaae6b7a1e5adabe8aab0e9819ce68f83e7abaae6b7a1e5adabe8aab0e9819ce68f83e7abaae69d9fe8b38ae8aab0e9819ce4bf9742
UHC 竪淡孫誰遜?竪淡孫誰遜?竪束賊誰遜俗B 1110001010110101110100111011111111100001110111011110001011000001111000011110000100111111111000101011010111010011101111111110000111011101111000101100000111100001111000010011111111100010101101011110000111010110111011101110010011100010110000011110000111100001111000011101010001000010 e2b5d3bfe1dde2c1e1e13fe2b5d3bfe1dde2c1e1e13fe2b5e1d6eee4e2c1e1e1e1d442

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)