What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?肉ラ??с?嚴щ?異??惟??鈺?? 11100001100111111000001110001011001111111001001111110111100000111000100100111111001111111000010010000011001111111001101010001110100001001000101100111111100010001101100100111111001111111000100011010010001111110011111111111011110001000011111100111111 e19f838b3f93f783893f3f84833f9a8e848b3f88d93f3f88d23f3ffbc43f3f
EUC-JP 癲ル?肉ラ??с?嚴щ?異??惟??鈺?? 1110001010100001101001011110101100111111110001101111100110100101111010010011111100111111101001111110001100111111110100111110111010100111111010110011111110110000110110110011111100111111101100001101010000111111001111111000111111100011110101010011111100111111 e2a1a5eb3fc6f9a5e93f3fa7e33fd3eea7eb3fb0db3f3fb0d43f3f8fe3d53f3f
UTF-8 癲ル슢肉ラ씣戮с걶嚴щ벊異루뙠惟곗뵰鈺곕㈊ 11100111100110011011001011100011100000111010101111101100100010101010001011101000100000101000100111100011100000111010100111101100100101001010001111101111101001111001001011010001100000011110101010110001101101101110010110011010101101001101000110001001111010111011001010001010111001111001010110110000111010111010001110101000111010111001100110100000111001101000001110011111111010101011001110010111111010111011010110110000111010011000100010111010111010101011001110010101111000111000100010001010 e799b2e383abec8aa2e88289e383a9ec94a3efa792d181eab1b6e59ab4d189ebb28ae795b0eba3a8eb99a0e6839feab397ebb5b0e988baeab395e3888a
UHC 癲ル슢肉ラ씣戮с걶嚴щ벊異루뙠惟곗뵰鈺곕㈊ 111011111010011010101011111010111001101010101110111010111011111110101011111010011001110110110111111010111011110110101100111000111000000110011100111001011111000110101100111010111001001110101101111011001011011010110111111001111000110010100101111010101110111010110000111011001001010010101110111010001010110110110000111010111010100110111011 efa6abeb9aaeebbfabe99db7ebbdace3819ce5f1aceb93adecb6b7e78ca5eaeeb0ec94aee8adb0eba9bb

SJIS-WIN, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
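
The conversion shown in the table can be approximated outside this page. The following is a minimal Python sketch, not the code behind this tool. The mapping from charset names to Python codec names (for example "cp932" for SJIS-WIN and "cp949" for UHC) is an assumption, and the helpers encode_row and convert are hypothetical names chosen for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the ISO-8859-1 row above.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary, hexadecimal) bit strings for text in the given codec."""
    # errors="replace" emits "?" (0x3f) for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset; returns {charset: (binary, hex)}."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for name, (binary, hexadecimal) in convert("あ").items():
        print(name, "あ", binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always exactly four times as long as the hexadecimal column.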