To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?遊?キ擬??烏??違??純??墺 11100001100111111000001110001011001111111001011101010110001111111000001101001100100010110101101100111111001111111000100101000111001111110011111110001000111000010011111100111111100011111000001100111111001111111001101011010010 e19f838b3f97563f834c8b5b3f3f89473f3f88e13f3f8f833f3f9ad2
EUC-JP 癲ル?遊?キ擬??烏??違??純??墺 11100010101000011010010111101011001111111100110110110111001111111010010110101101101101011011110000111111001111111011000110101000001111110011111110110000111000110011111100111111101111011110001100111111001111111101010011010100 e2a1a5eb3fcdb73fa5adb5bc3f3fb1a83f3fb0e33f3fbde33f3fd4d4
UTF-8 癲ル슢遊얕キ擬쀫눛烏겸벀違뗥윜純껊폏墺 111001111001100110110010111000111000001110101011111011001000101010100010111010011000000110001010111011001001011010010101111000111000001010101101111001101001001110101100111011001000000010101011111010111000100010011011111001111000001110001111111010101011001010111000111010111011001010000000111010011000000110010101111010111001011110100101111011001001110010011100111001111011010010010100111010101011101110001010111011011000111110001111111001011010001010111010 e799b2e383abec8aa2e9818aec9695e382ade693acec80abeb889be7838feab2b8ebb280e98195eb97a5ec9c9ce7b494eabb8aed8f8fe5a2ba
UHC 癲ル슢遊얕キ擬쀫눛烏겸벀違뗥윜純껊폏墺 1110111110100110101010111110101110011010101011101110101110110100101111101110100010101011101011011110101111110100100101111110101110000111101100111110100010100001101100001110001010010011101001101110101011011110100010111110010110011111100111111110001011101101100000111110101110111100100110101110011111110010 efa6abeb9aaeebb4bee8abadebf497eb87b3e8a1b0e293a6eade8be59f9fe2ed83ebbc9ae7f2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)