To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 翁??鍮??柔??筌??援??柔れ?裔 100010011010010100111111001111111110100001001010001111110011111110001111010111110011111100111111111000101010001100111111001111111000100110000111001111110011111110001111010111111000001011101010001111111110010111100001 89a53f3fe84a3f3f8f5f3f3fe2a33f3f89873f3f8f5f82ea3fe5e1
EUC-JP 翁??鍮??柔??筌??援??柔れ?裔 101100101010011100111111001111111110111110101011001111110011111110111101110000000011111100111111111001001010010100111111001111111011000111100111001111110011111110111101110000001010010011101100001111111110101011100011 b2a73f3fefab3f3fbdc03f3fe4a53f3fb1e73f3fbdc0a4ec3feae3
UTF-8 翁띾끃鍮껈씣柔겸뵹筌롪퉭援밭솾柔れ뿼裔 111001111011111110000001111010111001110110111110111010111000000110000011111010011000110110101110111010101011101110001000111011001001010010100011111001101001111110010100111010101011001010111000111010111011010110111001111001111010110110001100111010111010000110101010111011011000100110101101111001101000111110110100111010111011000010101101111011001000011010111110111001101001111110010100111000111000001010001100111010111011111110111100111010001010001110010100 e7bf81eb9dbeeb8183e98daeeabb88ec94a3e69f94eab2b8ebb5b9e7ad8ceba1aaed89ade68fb4ebb0adec86bee69f94e3828cebbfbce8a394
UHC 翁띾끃鍮껈씣柔겸뵹筌롪퉭援밭솾柔れ뿼裔 1110100010111010100011011110101110000101101110011110101110111001100000111110100110011101101101111110101011110101101100001110001010010100101101111110111110100111100011101110101010111001100001011110101010110101101110011110011110011001101100101110101011110101101010101110110010010111101111001110011111100000 e8ba8deb85b9ebb983e99db7eaf5b0e294b7efa78eeab985eab5b9e799b2eaf5aaec97bce7e0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)