To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 脱揃其竪族続辿捉属竪属尊誰遜尊辿村他 100100100100010110010001101101011001000110110100100100100100011110010001101100001001000110110001100100100100100010010001101010001001000110101110100100100100011110010001101011101001000110111000100100100100111010010001101110111001000110111000100100100100100010010001101110101001000110111100 924591b591b4924791b091b1924891a891ae924791ae91b8924e91bb91b8924891ba91bc
EUC-JP 脱揃其竪族続辿捉属竪属尊誰遜尊辿村他 110000111010011011000010101101111100001010110110110000111010100011000010101100101100001010110011110000111010100111000010101010101100001010110000110000111010100011000010101100001100001010111010110000111010111111000010101111011100001010111010110000111010100111000010101111001100001010111110 c3a6c2b7c2b6c3a8c2b2c2b3c3a9c2aac2b0c3a8c2b0c2bac3afc2bdc2bac3a9c2bcc2be
UTF-8 脱揃其竪族続辿捉属竪属尊誰遜尊辿村他 111010001000010010110001111001101000111110000011111001011000010110110110111001111010101110101010111001101001011110001111111001111011011010011010111010001011111010111111111001101000110110001001111001011011000110011110111001111010101110101010111001011011000110011110111001011011000010001010111010001010101010110000111010011000000110011100111001011011000010001010111010001011111010111111111001101001110110010001111001001011101110010110 e884b1e68f83e585b6e7abaae6978fe7b69ae8bebfe68d89e5b19ee7abaae5b19ee5b08ae8aab0e9819ce5b08ae8bebfe69d91e4bb96
UHC ??其竪族??捉?竪?尊誰遜尊?村他 0011111100111111110100001110110011100010101101011111000011101001001111110011111111110011101101010011111111100010101101010011111111110000111011101110001011000001111000011110000111110000111011100011111111110101101111011111011011100010 3f3fd0ece2b5f0e93f3ff3b53fe2b53ff0eee2c1e1e1f0ee3ff5bdf6e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)