To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ???仰??與??n}???仰??與??n{^ 001111110011111100111111100010111100001000111111001111111110010001101111001111110011111101101110011111010011111100111111001111111000101111000010001111110011111111100100011011110011111100111111011011100111101101011110 3f3f3f8bc23f3fe46f3f3f6e7d3f3f3f8bc23f3fe46f3f3f6e7b5e
EUC-JP ???仰??與??n}???仰??與??n{^ 001111110011111100111111101101101100010000111111001111111110011111010000001111110011111101101110011111010011111100111111001111111011011011000100001111110011111111100111110100000011111100111111011011100111101101011110 3f3f3fb6c43f3fe7d03f3f6e7d3f3f3fb6c43f3fe7d03f3f6e7b5e
UTF-8 娛곈죺仰삭였與믣꽏n}娛곈죺仰삭였與믣꽏n{^ 1110010110101000100110111110101010110011100010001110110010100011101110101110010010111011101100001110110010000010101011011110110010011000100000001110100010001000100001111110101110101111101000111110101010111101100011110110111001111101111001011010100010011011111010101011001110001000111011001010001110111010111001001011101110110000111011001000001010101101111011001001100010000000111010001000100010000111111010111010111110100011111010101011110110001111011011100111101101011110 e5a89beab388eca3bae4bbb0ec82adec9880e88887ebafa3eabd8f6e7de5a89beab388eca3bae4bbb0ec82adec9880e88887ebafa3eabd8f6e7b5e
UHC 娛곈죺仰삭였與믣꽏n}娛곈죺仰삭였與믣꽏n{^ 1110011111110100101100001110100110100001100101001110010011100110101110111110100010111111101101001110011010101000100100101110010110000100100111110110111001111101111001111111010010110000111010011010000110010100111001001110011010111011111010001011111110110100111001101010100010010010111001011000010010011111011011100111101101011110 e7f4b0e9a194e4e6bbe8bfb4e6a892e5849f6e7de7f4b0e9a194e4e6bbe8bfb4e6a892e5849f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)