To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 乙?劍齬??????乙?劍齬??????^ 100010011011001100111111100110011001100011101010100101110011111100111111001111110011111100111111001111111000100110110011001111111001100110011000111010101001011100111111001111110011111100111111001111110011111101011110 89b33f9998ea973f3f3f3f3f3f89b33f9998ea973f3f3f3f3f3f5e
EUC-JP 乙?劍齬??????乙?劍齬??????^ 101100101011010100111111110100011111100011110011111101110011111100111111001111110011111100111111001111111011001010110101001111111101000111111000111100111111011100111111001111110011111100111111001111110011111101011110 b2b53fd1f8f3f73f3f3f3f3f3fb2b53fd1f8f3f73f3f3f3f3f3f5e
UTF-8 乙어劍齬뀜렒띤렗쇳석乙어劍齬뀜렒띤렗쇳석^ 11100100101110011001100111101100100101101011010011100101100010101000110111101001101111011010110011101011100000001001110011101011101000001001001011101011100111011010010011101011101000001001011111101100100001111011001111101100100001001001110111100100101110011001100111101100100101101011010011100101100010101000110111101001101111011010110011101011100000001001110011101011101000001001001011101011100111011010010011101011101000001001011111101100100001111011001111101100100001001001110101011110 e4b999ec96b4e58a8de9bdaceb809ceba092eb9da4eba097ec87b3ec849de4b999ec96b4e58a8de9bdaceb809ceba092eb9da4eba097ec87b3ec849d5e
UHC 乙어劍齬뀜렒띤렗쇳석乙어劍齬뀜렒띤렗쇳석^ 1110101111100000101111101110111011001011111111001110010111100001101100101111000110001110101001111011011011101101100011101010110010111100111011011011110010101110111010111110000010111110111011101100101111111100111001011110000110110010111100011000111010100111101101101110110110001110101011001011110011101101101111001010111001011110 ebe0beeecbfce5e1b2f18ea7b6ed8eacbcedbcaeebe0beeecbfce5e1b2f18ea7b6ed8eacbcedbcae5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)