Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル????轅??齬??異??純??鈺?? 11100001100111111000001110001011001111110011111100111111001111111110011101110110001111110011111111101010100101110011111100111111100010001101100100111111001111111000111110000011001111110011111111111011110001000011111100111111 e19f838b3f3f3f3fe7763f3fea973f3f88d93f3f8f833f3ffbc43f3f
EUC-JP 癲ル?佾??轅??齬??異??純??鈺?? 11100010101000011010010111101011001111111000111110110000111110110011111100111111111011011101011100111111001111111111001111110111001111110011111110110000110110110011111100111111101111011110001100111111001111111000111111100011110101010011111100111111 e2a1a5eb3f8fb0fb3f3fedd73f3ff3f73f3fb0db3f3fbde33f3f8fe3d53f3f
UTF-8 癲ル슡佾믭㎠轅⑷섭齬잙벊異룟윜純껊폏鈺곕깭 111001111001100110110010111000111000001110101011111011001000101010100001111001001011110110111110111010111010111110101101111000111000111010100000111010001011110110000101111000101001000110110111111011001000010010101101111010011011110110101100111011001001111010011001111010111011001010001010111001111001010110110000111010111010001110011111111011001001110010011100111001111011010010010100111010101011101110001010111011011000111110001111111010011000100010111010111010101011001110010101111010101011100110101101 e799b2e383abec8aa1e4bdbeebafade38ea0e8bd85e291b7ec84ade9bdacec9e99ebb28ae795b0eba39fec9c9ce7b494eabb8aed8f8fe988baeab395eab9ad
UHC 癲ル슡佾믭㎠轅⑷섭齬잙벊異룟윜純껊폏鈺곕깭 111011111010011010101011111010111001101010101101111011001110101110010010111011111010011110110010111010101011111110101001111010101011110010110111111001011110000110011111111010111001001110101101111011001011011010110111111001011001111110011111111000101110110110000011111010111011110010011010111010001010110110110000111010111000001110011100 efa6abeb9aadeceb92efa7b2eabfa9eabcb7e5e19feb93adecb6b7e59f9fe2ed83ebbc9ae8adb0eb839c

SJIS-Win, EUC-JP: Legacy character encodings for Japanese text, mainly used on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
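
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation: the names `CHARSETS`, `encode_row`, and `convert` are hypothetical, and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Python's codecs may differ from the tool's in a few vendor-specific mappings.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hexa) in convert("あ").items():
        print(name, bits, hexa)
```

With `errors="replace"`, a character that has no representation in the target charset is emitted as `?`, which is why rows for narrower charsets such as ISO-8859-1 can consist largely of `3f` bytes.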