To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 臍???旭??基??臍???旭??基??^ 111001000110000000111111001111110011111110001000101011100011111100111111100010101110111000111111001111111110010001100000001111110011111100111111100010001010111000111111001111111000101011101110001111110011111101011110 e4603f3f3f88ae3f3f8aee3f3fe4603f3f3f88ae3f3f8aee3f3f5e
EUC-JP 臍???旭?嫄基??臍???旭?嫄基??^ 11100111110000010011111100111111001111111011000010110000001111111000111110111010101000011011010011110000001111110011111111100111110000010011111100111111001111111011000010110000001111111000111110111010101000011011010011110000001111110011111101011110 e7c13f3f3fb0b03f8fbaa1b4f03f3fe7c13f3f3fb0b03f8fbaa1b4f03f3f5e
UTF-8 臍陋렩렭旭렩嫄基렰렋臍陋렩렭旭렩嫄基렰렋^ 11101000100001111000110111101111101001011001000111101011101000001010100111101011101000001010110111100110100101111010110111101011101000001010100111100101101010111000010011100101100111111011101011101011101000001011000011101011101000001000101111101000100001111000110111101111101001011001000111101011101000001010100111101011101000001010110111100110100101111010110111101011101000001010100111100101101010111000010011100101100111111011101011101011101000001011000011101011101000001000101101011110 e8878defa591eba0a9eba0ade697adeba0a9e5ab84e59fbaeba0b0eba08be8878defa591eba0a9eba0ade697adeba0a9e5ab84e59fbaeba0b0eba08b5e
UHC 臍陋렩렭旭렩嫄基렰렋臍陋렩렭旭렩嫄基렰렋^ 1111000010110000110100101110101110001110101101111000111010111010111010011110111110001110101101111110101010110001110100001111000110001110101111011000111010100010111100001011000011010010111010111000111010110111100011101011101011101001111011111000111010110111111010101011000111010000111100011000111010111101100011101010001001011110 f0b0d2eb8eb78ebae9ef8eb7eab1d0f18ebd8ea2f0b0d2eb8eb78ebae9ef8eb7eab1d0f18ebd8ea25e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)