What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鴦??????徇??怨?????援?ぜ恂μ? 1110100111110001001111110011111100111111001111110011111100111111100111000110110100111111001111111000100110000101001111110011111100111111001111110011111110001001100001110011111110000010101110101001110010010110100000111100101000111111 e9f13f3f3f3f3f3f9c6d3f3f89853f3f3f3f3f89873f82ba9c9683ca3f
EUC-JP 鴦??????徇??怨??濚?Ŧ援?ぜ恂μ? 111100101111001100111111001111110011111100111111001111110011111111010111110011100011111100111111101100011110010100111111001111111000111111001001101000010011111110001111101010011010111110110001111001110011111110100100101111001101011111110110101001101100110000111111 f2f33f3f3f3f3f3fd7ce3f3fb1e53f3f8fc9a13f8fa9afb1e73fa4bcd7f6a6cc3f
UTF-8 鴦꾆뀀룱烈쀫쓧徇쒏뇻怨쀬떱濚밸Ŧ援욆ぜ恂μ뒙 11101001101101001010011011101010101111101000011011101011100000001000000011101011101000111011000111101111101001101001111111101100100000001010101111101100100100111010011111100101101111101000011111101100100100101000111111101011100001111011101111100110100000001010100011101100100000001010110011101011100101101011000111100110101111111001101011101011101100001011100011000101101001101110011010001111101101001110110010011010100001101110001110000001100111001110011010000001100000101100111010111100111010111001001010011001 e9b4a6eabe86eb8080eba3b1efa69fec80abec93a7e5be87ec928feb87bbe680a8ec80aceb96b1e6bf9aebb0b8c5a6e68fb4ec9a86e3819ce68182cebceb9299
UHC 鴦꾆뀀룱烈쀫쓧徇쒏뇻怨쀬떱濚밸Ŧ援욆ぜ恂μ뒙 1110010011101100100001001100111010110010111010111000111110100110111001101110111110010111111010111001110110001000111000101101111110011100111001101011010010100111111010101011001110010111111011001011011010110111111001111011100110111001111010111010100010101110111010101011010110011110111010001010101010111100111000101110000110100101111011001000101010010110 e4ec84ceb2eb8fa6e6ef97eb9d88e2df9ce6b4a7eab397ecb6b7e7b9b9eba8aeeab59ee8aabce2e1a5ec8a96

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
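
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset names to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_bits` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table, though results for individual characters may differ from this page's converter.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_bits(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # U+305C (HIRAGANA LETTER ZE), one of the characters in the table.
    for name, (binary, hexadecimal) in convert("\u305c").items():
        print(name, binary, hexadecimal)
```

For the character ぜ, this sketch produces `e3819c` for UTF-8, `82ba` for SJIS-WIN, and `a4bc` for EUC-JP, the same byte sequences that appear for that character in the table rows above.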