What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????noBF 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101110011011110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f6e6f4246
SJIS-WIN 竪担村誰遜賊竪淡賊誰遜属竪束賊誰遜俗noBF 10010010010001111001001001010011100100011011101010010010010011101001000110111011100100011010111110010010010001111001001001010111100100011010111110010010010011101001000110111011100100011010111010010010010001111001000110101001100100011010111110010010010011101001000110111011100100011010110101101110011011110100001001000110 9247925391ba924e91bb91af9247925791af924e91bb91ae924791a991af924e91bb91ad6e6f4246
EUC-JP 竪担村誰遜賊竪淡賊誰遜属竪束賊誰遜俗noBF 11000011101010001100001110110100110000101011110011000011101011111100001010111101110000101011000111000011101010001100001110111000110000101011000111000011101011111100001010111101110000101011000011000011101010001100001010101011110000101011000111000011101011111100001010111101110000101010111101101110011011110100001001000110 c3a8c3b4c2bcc3afc2bdc2b1c3a8c3b8c2b1c3afc2bdc2b0c3a8c2abc2b1c3afc2bdc2af6e6f4246
UTF-8 竪担村誰遜賊竪淡賊誰遜属竪束賊誰遜俗noBF 11100111101010111010101011100110100010111000010111100110100111011001000111101000101010101011000011101001100000011001110011101000101100111000101011100111101010111010101011100110101101111010000111101000101100111000101011101000101010101011000011101001100000011001110011100101101100011001111011100111101010111010101011100110100111011001111111101000101100111000101011101000101010101011000011101001100000011001110011100100101111111001011101101110011011110100001001000110 e7abaae68b85e69d91e8aab0e9819ce8b38ae7abaae6b7a1e8b38ae8aab0e9819ce5b19ee7abaae69d9fe8b38ae8aab0e9819ce4bf976e6f4246
UHC 竪?村誰遜賊竪淡賊誰遜?竪束賊誰遜俗noBF 1110001010110101001111111111010110111101111000101100000111100001111000011110111011100100111000101011010111010011101111111110111011100100111000101100000111100001111000010011111111100010101101011110000111010110111011101110010011100010110000011110000111100001111000011101010001101110011011110100001001000110 e2b53ff5bde2c1e1e1eee4e2b5d3bfeee4e2c1e1e13fe2b5e1d6eee4e2c1e1e1e1d46e6f4246

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
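
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names to_bit_strings and convert are hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row above appears to show.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "竪" is the first character of the sample input in the table above.
    for label, (binary, hexadecimal) in convert("竪no").items():
        print(label, binary, hexadecimal)
```

For the character 竪, the table above lists the leading bytes e7abaa (UTF-8), 9247 (SJIS-WIN) and c3a8 (EUC-JP), so the sketch's output can be checked against those rows. Results for other charsets may differ slightly from this page wherever Python's codec tables and the page's converter disagree.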