To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル????湲??艶k?異わ?恂ル?沃?? 11100001100111111000001110001011001111110011111100111111001111111001111111010001001111110011111110001001100100001000001010001011001111111000100011011001100000101110110100111111100111001001011010000011100010110011111110010111100000000011111100111111 e19f838b3f3f3f3f9fd13f3f8990828b3f88d982ed3f9c96838b3f97803f3f
EUC-JP 癲ル?佾??湲??艶k?異わ?恂ル?沃?? 111000101010000110100101111010110011111110001111101100001111101100111111001111111101111011010011001111110011111110110001111100001010001111101011001111111011000011011011101001001110111100111111110101111111011010100101111010110011111111001101111000000011111100111111 e2a1a5eb3f8fb0fb3f3fded33f3fb1f0a3eb3fb0dba4ef3fd7f6a5eb3fcde03f3f
UTF-8 癲ル슡佾붺몭湲몄쒜艶k쵐異わ쫳恂ル역沃띿갉 111001111001100110110010111000111000001110101011111011001000101010100001111001001011110110111110111010111011011010111010111010111010101010101101111001101011100110110010111010111010101010000100111011001001001010011100111010001000100110110110111011111011110110001011111011001011010110010000111001111001010110110000111000111000001010001111111011001010101110110011111001101000000110000010111000111000001110101011111011001001011110101101111001101011001010000011111010111001110110111111111010101011000010001001 e799b2e383abec8aa1e4bdbeebb6baebaaade6b9b2ebaa84ec929ce889b6efbd8becb590e795b0e3828fecabb3e68182e383abec97ade6b283eb9dbfeab089
UHC 癲ル슡佾붺몭湲몄쒜艶k쵐異わ쫳恂ル역沃띿갉 111011111010011010101011111010111001101010101101111011001110101110010100111001111001000110010111111010101011100010111000111011001011111010101110111001101111110110100011111010111010110010010010111011001011011010101010111011111010011010001011111000101110000110101011111010111011111110101010111010001010101010001101111011001011000010100110 efa6abeb9aadeceb94e79197eab8b8ecbeaee6fda3ebac92ecb6aaefa68be2e1abebbfaae8aa8decb0a6

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
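
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the helper names `encode_row` and `convert` are hypothetical, and the Python codecs `cp932`, `euc_jp`, and `cp949` are assumed to be close equivalents of SJIS-WIN, EUC-JP, and UHC. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row above.

```python
# Map the table's charset labels to Python codec names.
# These mappings are assumptions; the page's own converter may differ in edge cases.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("ル").items():
        print(label, bits, hex_string)
```

For the katakana character "ル", this sketch produces the same byte sequences that appear inside the rows above: `838b` in SJIS-WIN, `a5eb` in EUC-JP, `e383ab` in UTF-8, and `abeb` in UHC.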