To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル?肉ュ?諛ν?鴉??蹂??筌??誼 1110000110011111100000111000101100111111100100111111011110000011100001010011111111100110100001111000001111001011001111111110100111101011001111110011111111100110111110000011111100111111111000101010001100111111001111111000101101100010 e19f838b3f93f783853fe68783cb3fe9eb3f3fe6f83f3fe2a33f3f8b62
EUC-JP 癲ル?肉ュ?諛ν?鴉??蹂??筌??誼 1110001010100001101001011110101100111111110001101111100110100101111001010011111111101011111001111010011011001101001111111111001011101101001111110011111111101100111110100011111100111111111001001010010100111111001111111011010111000011 e2a1a5eb3fc6f9a5e53febe7a6cd3ff2ed3f3fecfa3f3fe4a53f3fb5c3
UTF-8 癲ル슢肉ュ츦諛ν떊鴉롥퐲蹂잕성筌뤾쑴誼 1110011110011001101100101110001110000011101010111110110010001010101000101110100010000010100010011110001110000011101001011110110010111000101001101110100010101011100110111100111010111101111010111001011010001010111010011011010010001001111010111010000110100101111011011001000010110010111010001011100110000010111011001001111010010101111011001000010010110001111001111010110110001100111010111010010010111110111011001001000110110100111010001010101010111100 e799b2e383abec8aa2e88289e383a5ecb8a6e8ab9bcebdeb968ae9b489eba1a5ed90b2e8b982ec9e95ec84b1e7ad8ceba4beec91b4e8aabc
UHC 癲ル슢肉ュ츦諛ν떊鴉롥퐲蹂잕성筌뤾쑴誼 1110111110100110101010111110101110011010101011101110101110111111101010111110010110101110100111001110101110110000101001011110110110001011101000001110010010111100100011101110010110111101100110111110101110110011100111111110101010111100101110101110111110100111100011111110101010111110101010011110101111111110 efa6abeb9aaeebbfabe5ae9cebb0a5ed8ba0e4bc8ee5bd9bebb39feabcbaefa78feabea9ebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)