What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?揖??韋??阿??游?┠釉??? 111000011001111110000011100010110011111110010111010010110011111100111111111010001110100000111111001111111000100010100010001111110011111110011111111000000011111110000100101101011110011111010110001111110011111100111111 e19f838b3f974b3f3fe8e83f3f88a23f3f9fe03f84b5e7d63f3f3f
EUC-JP 癲ル?揖??韋??阿??游?┠釉??? 111000101010000110100101111010110011111111001101101011000011111100111111111100001110101000111111001111111011000010100100001111110011111111011110111000100011111110101000101101111110111011011000001111110011111100111111 e2a1a5eb3fcdac3f3ff0ea3f3fb0a43f3fdee23fa8b7eed83f3f3f
UTF-8 癲ル슣揖띈굜韋얜짎阿숈뇠游띰┠釉띿뒗歷 111001111001100110110010111000111000001110101011111011001000101010100011111001101000111110010110111010111001110110001000111010101011010110011100111010011001111110001011111011001001011010011100111011001010011110001110111010011001100010111111111011001000100010001000111010111000011110100000111001101011100010111000111010111001110110110000111000101001010010100000111010011000011110001001111010111001110110111111111010111001001010010111111011111010011010001100 e799b2e383abec8aa3e68f96eb9d88eab59ce99f8bec969ceca78ee998bfec8888eb87a0e6b8b8eb9db0e294a0e98789eb9dbfeb9297efa68c
UHC 癲ル슣揖띈굜韋얜짎阿숈뇠游띰┠釉띿뒗歷 1110111110100110101010111110101110011010101011111110101111100111101101101110100010000010100001001110101011011111101111101110101110100011100110101110010010111001100110011110110010000111100010001110101011111101101101101110111110100110101101111110101110111000100011011110110010001010100101001110011010111000 efa6abeb9aafebe7b6e88284eadfbeeba39ae4b999ec8788eafdb6efa6b7ebb88dec8a94e6b8

SJIS-Win, EUC-JP: legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
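
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation, which is not shown here. The mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the table above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {charset label: (binary string, hex string)} for text.

    Hypothetical helper, written for illustration only.
    """
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For the single character "あ", this sketch yields `e38182` in UTF-8, `82a0` in SJIS-WIN, `a4a2` in EUC-JP, and `3f` in ISO-8859-1, since ISO-8859-1 has no hiragana.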