What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ??ゴ瑤?????n}??ゴ瑤?????n{^ 001111110011111110000011010100111110101010100010001111110011111100111111001111110011111101101110011111010011111100111111100000110101001111101010101000100011111100111111001111110011111100111111011011100111101101011110 3f3f8353eaa23f3f3f3f3f6e7d3f3f8353eaa23f3f3f3f3f6e7b5e
EUC-JP ??ゴ瑤?????n}??ゴ瑤?????n{^ 001111110011111110100101101101001111010010100100001111110011111100111111001111110011111101101110011111010011111100111111101001011011010011110100101001000011111100111111001111110011111100111111011011100111101101011110 3f3fa5b4f4a43f3f3f3f3f6e7d3f3fa5b4f4a43f3f3f3f3f6e7b5e
UTF-8 殮뽭ゴ瑤덄쑊念곭뿿n}殮뽭ゴ瑤덄쑊念곭뿿n{^ 1110111110100110101001011110101110111101101011011110001110000010101101001110011110010001101001001110101110001101100001001110110010010001100010101110111110100110101000111110101010110011101011011110101110111111101111110110111001111101111011111010011010100101111010111011110110101101111000111000001010110100111001111001000110100100111010111000110110000100111011001001000110001010111011111010011010100011111010101011001110101101111010111011111110111111011011100111101101011110 efa6a5ebbdade382b4e791a4eb8d84ec918aefa6a3eab3adebbfbf6e7defa6a5ebbdade382b4e791a4eb8d84ec918aefa6a3eab3adebbfbf6e7b5e
UHC 殮뽭ゴ瑤덄쑊念곭뿿n}殮뽭ゴ瑤덄쑊念곭뿿n{^ 1110011011111001100101101110100110101011101101001110100011111101100010001110011110011100101010011110011011110110100000011110011110010111101111110110111001111101111001101111100110010110111010011010101110110100111010001111110110001000111001111001110010101001111001101111011010000001111001111001011110111111011011100111101101011110 e6f996e9abb4e8fd88e79ca9e6f681e797bf6e7de6f996e9abb4e8fd88e79ca9e6f681e797bf6e7b5e
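The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation, which is not shown here. The function name to_bit_strings and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table does as well.

```python
def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent, instead of raising UnicodeEncodeError.
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("ゴn{^", codec)
        print(label, binary, hexadecimal)
```

For example, to_bit_strings("ゴ", "utf-8") gives the hexadecimal string e382b4, which matches the bytes for that character in the UTF-8 row above.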

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
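As a rough check that the two Japanese charsets assign different bytes to the same characters, the hexadecimal values from the table can be decoded back to text. This is only a sketch: decode_hex is a hypothetical helper, and it assumes that Python's cp932 and euc_jp codecs correspond to the tool's SJIS-WIN and EUC-JP.

```python
def decode_hex(hex_string, codec):
    """Decode a hexadecimal byte string back to text using the given codec."""
    return bytes.fromhex(hex_string).decode(codec)


# Byte sequences taken from the SJIS-WIN and EUC-JP rows of the table.
sjis_text = decode_hex("8353eaa2", "cp932")
euc_text = decode_hex("a5b4f4a4", "euc_jp")

# Different bytes, same characters.
print(sjis_text == euc_text)
```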