To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪属尊竪足揃誰遜他誰遜臓竪束賊誰遜測 100100100100011110010001101011101001000110111000100100100100011110010001101010111001000110110101100100100100111010010001101110111001000110111100100100100100111010010001101110111001000110011111100100100100011110010001101010011001000110101111100100100100111010010001101110111001000110101010 924791ae91b8924791ab91b5924e91bb91bc924e91bb919f924791a991af924e91bb91aa
EUC-JP 竪属尊竪足揃誰遜他誰遜臓竪束賊誰遜測 110000111010100011000010101100001100001010111010110000111010100011000010101011011100001010110111110000111010111111000010101111011100001010111110110000111010111111000010101111011100001010100001110000111010100011000010101010111100001010110001110000111010111111000010101111011100001010101100 c3a8c2b0c2bac3a8c2adc2b7c3afc2bdc2bec3afc2bdc2a1c3a8c2abc2b1c3afc2bdc2ac
UTF-8 竪属尊竪足揃誰遜他誰遜臓竪束賊誰遜測 111001111010101110101010111001011011000110011110111001011011000010001010111001111010101110101010111010001011011010110011111001101000111110000011111010001010101010110000111010011000000110011100111001001011101110010110111010001010101010110000111010011000000110011100111010001000011110010011111001111010101110101010111001101001110110011111111010001011001110001010111010001010101010110000111010011000000110011100111001101011100010101100 e7abaae5b19ee5b08ae7abaae8b6b3e68f83e8aab0e9819ce4bb96e8aab0e9819ce88793e7abaae69d9fe8b38ae8aab0e9819ce6b8ac
UHC 竪?尊竪足?誰遜他誰遜?竪束賊誰遜測 111000101011010100111111111100001110111011100010101101011111000011101011001111111110001011000001111000011110000111110110111000101110001011000001111000011110000100111111111000101011010111100001110101101110111011100100111000101100000111100001111000011111011010110100 e2b53ff0eee2b5f0eb3fe2c1e1e1f6e2e2c1e1e13fe2b5e1d6eee4e2c1e1e1f6b4

SJIS-Win, EUC-JP: Legacy Japanese charsets, used mainly on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
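
The conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the converter's own implementation: the function names `encode_row` and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("あ").items():
        print(label, bits, hexstr)
```

For the single character "あ", the UTF-8 row would read `111000111000000110000010 e38182`, and the ISO-8859-1 row would read `00111111 3f` because the character has no representation in that charset.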