To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?????捲┥閃?其騁娠漿?莎??????^ 001111110011111100111111001111110011111110001100100111101000010010111100100100010100110100111111100100011011010011101001011101001001000001010000100111111111011100111111111001001011001100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f8c9e84bc914d3f91b4e97490509ff73fe4b33f3f3f3f3f3f5e
EUC-JP ?莘???捲┥閃?其騁娠漿?莎??????^ 0011111110001111110110001010101100111111001111110011111110110111111111101010100010111110110000011010111000111111110000101011011011110001110101011011111110110001110111101111100100111111111010001011010100111111001111110011111100111111001111110011111101011110 3f8fd8ab3f3f3fb7fea8bec1ae3fc2b6f1d5bfb1def93fe8b53f3f3f3f3f3f5e
UTF-8 얇莘렋야뤃捲┥閃퉶其騁娠漿♥莎닉렑얀렫롋롘^ 11101100100101101000011111101000100011101001100011101011101000001000101111101100100101011011110011101011101001001000001111100110100011011011001011100010100101001010010111101001100101101000001111101101100010011011011011100101100001011011011011101001101010001000000111100101101010001010000011100110101111001011111111100010100110011010010111101000100011101000111011101011100010111000100111101011101000001001000111101100100101101000000011101011101000001010101111101011101000011000101111101011101000011001100001011110 ec9687e88e98eba08bec95bceba483e68db2e294a5e99683ed89b6e585b6e9a881e5a8a0e6bcbfe299a5e88e8eeb8b89eba091ec9680eba0abeba18beba1985e
UHC 얇莘렋야뤃捲┥閃퉶其騁娠漿♥莎닉렑얀렫롋롘^ 10111110111000111110001111101110100011101010001010111110110111111000111110110100110011111110110010100110101111101110000011101100101110011000111011010000111011001101111010111110111000111110001111101101111011001010001010111110110111101110110110110100110100001000111010100110101111101110000110001110101110011000111011010001100011101101110001011110 bee3e3ee8ea2bedf8fb4cfeca6bee0ecb98ed0ecdebee3e3edeca2bedeedb4d08ea6bee18eb98ed18edc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)