What bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???侑??湲η?幽?????B 0011111100111111001111111001100011010000001111110011111110011111110100011000001111000101001111111001011101001000001111110011111100111111001111110011111101000010 3f3f3f98d03f3f9fd183c53f97483f3f3f3f3f42
EUC-JP ???侑??湲η?幽?????B 0011111100111111001111111101000011010010001111110011111111011110110100111010011011000111001111111100110110101001001111110011111100111111001111110011111101000010 3f3f3fd0d23f3fded3a6c73fcda93f3f3f3f3f42
UTF-8 說뺣뛼侑뉐껙湲η뵓幽귣꼪略녠른B 111011111010011010100001111010111011101010100011111010111001101110111100111001001011111010010001111010111000100110010000111010101011101110011001111001101011100110110010110011101011011111101011101101011001001111100101101110011011110111101010101101111010001111101010101111001010101011101111101001011011011011101011100001011010000011101011101001011011100001000010 efa6a1ebbaa3eb9bbce4be91eb8990eabb99e6b9b2ceb7ebb593e5b9bdeab7a3eabcaaefa5b6eb85a0eba5b842
UHC 說뺣뛼侑뉐껙湲η뵓幽귣꼪略녠른B 11100110111100101001010111101011100011011000001011101010111000101000011111100101101100101011001111101010101110001010010111100111100101001001010111101010111010111000001011101011100001001000011111100101101100101011001111101010101110001010010101000010 e6f295eb8d82eae287e5b2b3eab8a5e79495eaeb82eb8487e5b2b3eab8a542

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
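
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation, which is unknown. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the table's ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows, and that characters a charset cannot represent are replaced with `?` (0x3f), as the `3f` bytes in the table suggest. The names `CHARSETS` and `encode_all` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_all("侑B"):
        print(label, bits, hex_string)
```

For the input `侑B`, this should give the same bytes for `侑` as the table does (`98d0` in SJIS-WIN, `d0d2` in EUC-JP, `e4be91` in UTF-8), followed by `42` for `B`.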