What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 怏??乙???γ?怏??熱??認???μ? 10011100100010010011111100111111100010011011001100111111001111110011111110000011110000010011111110011100100010010011111100111111100101000100110100111111001111111001010001000110001111110011111100111111100000111100101000111111 9c893f3f89b33f3f3f83c13f9c893f3f944d3f3f94463f3f3f83ca3f
EUC-JP 怏??乙??洹γ?怏??熱??認???μ? 110101111110100100111111001111111011001010110101001111110011111110001111110001111011101010100110110000110011111111010111111010010011111100111111110001111010111000111111001111111100011110100111001111110011111100111111101001101100110000111111 d7e93f3fb2b53f3f8fc7baa6c33fd7e93f3fc7ae3f3fc7a73f3f3fa6cc3f
UTF-8 怏얘랩乙ⓨ듋洹γ럻怏얘랜熱듭닠認귨쭪類μ돦 11100110100000001000111111101100100101101001100011101011100111101010100111100100101110011001100111100010100100111010100011101011100100111000101111100110101101001011100111001110101100111110101110011111101110111110011010000000100011111110110010010110100110001110101110011110100111001110011110000110101100011110101110010011101011011110101110001011101000001110100010101010100011011110101010110111101010001110110010101101101010101110111110100111100100001100111010111100111010111000111110100110 e6808fec9698eb9ea9e4b999e293a8eb938be6b4b9ceb3eb9fbbe6808fec9698eb9e9ce786b1eb93adeb8ba0e8aa8deab7a8ecadaaefa790cebceb8fa6
UHC 怏얘랩乙ⓨ듋洹γ럻怏얘랜熱듭닠認귨쭪類μ돦 111001001110100010111110111010101011011110100110111010111110000010101000111001011000101010111110111010101011011110100101111000111000111010011010111001001110100010111110111010101011011110100011111001101111000010110101111011001000100010100000111011001110001110000010111011111010011110011110111010111011101010100101111011001000100110101010 e4e8beeab7a6ebe0a8e58abeeab7a5e38e9ae4e8beeab7a3e6f0b5ec88a0ece382efa79eebbaa5ec89aa

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
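
A conversion like the one in the table above can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names are Python's own, and pairing them with the labels used here (cp932 for SJIS-Win, cp949 for UHC) is an assumption. The helper name encode_row is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table.

```python
# Map the labels used on this page to Python codec names.
# Assumption: Python's cp932 and cp949 stand in for SJIS-Win and UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become b'?' because of errors="replace".
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "あ"  # any short string works here
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string)
```

Each output line has the charset label, the binary bit string, and the hexadecimal bit string, in the same order as the table columns.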