What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??踰??筍ル┃嚥△?愉??純??堊 1110000110011111001111110011111111100110111110100011111100111111111000101010000110000011100010111000010010101011100110101000101110000001101000100011111110010110111110010011111100111111100011111000001100111111001111111001101010111111 e19f3f3fe6fa3f3fe2a1838b84ab9a8b81a23f96f93f3f8f833f3f9abf
EUC-JP 癲??踰??筍ル┃嚥△?愉??純??堊 1110001010100001001111110011111111101100111111000011111100111111111001001010001110100101111010111010100010101101110100111110101110100010101001000011111111001100111110110011111100111111101111011110001100111111001111111101010011000001 e2a13f3fecfc3f3fe4a3a5eba8add3eba2a43fccfb3f3fbde33f3fd4c1
UTF-8 癲녴굚踰곲씣筍ル┃嚥△뫗愉며뇖純껋쪑堊 111001111001100110110010111010111000010110110100111010101011010110011010111010001011100010110000111010101011001110110010111011001001010010100011111001111010110110001101111000111000001110101011111000101001010010000011111001011001101010100101111000101001011010110011111010111010101110010111111001101000010010001001111010111010100110110000111010111000011110010110111001111011010010010100111010101011101110001011111011001010101010010001111001011010000010001010 e799b2eb85b4eab59ae8b8b0eab3b2ec94a3e7ad8de383abe29483e59aa5e296b3ebab97e68489eba9b0eb8796e7b494eabb8becaa91e5a08a
UHC 癲녴굚踰곲씣筍ル┃嚥△뫗愉며뇖純껋쪑堊 1110111110100110100001101110001110000010100000101110101110110010100000011110100110011101101101111110001011101100101010111110101110100110101011011110011010111111101000011110001010010001101110011110101011110000101110001110011110000111100000011110001011101101100000111110110010100101100010111110010010111110 efa686e38282ebb281e99db7e2ecabeba6ade6bfa1e291b9eaf0b8e78781e2ed83eca58be4be

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
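
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page. The codec names in `CHARSETS` are assumptions about which Python codecs correspond to the charsets listed here (for example `cp932` for SJIS-Win and `cp949` for UHC), and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the ISO-8859-1 row, although individual byte values may differ between implementations.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Encode text with the given codec and return (binary string, hex string)."""
    # errors="replace" substitutes '?' (0x3f) for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, input, binary string, hex string.
    for name, (bits, hex_string) in convert("あ").items():
        print(name, "あ", bits, hex_string)
```

For the input `あ`, the UTF-8 row would be `e38182` in hexadecimal, while the ISO-8859-1 row falls back to `3f` because Latin-1 has no hiragana.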