Which bit string is a character (or a short run of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 擾??攸??堰??????る?乙??循??B 10001111111011110011111100111111100111011011111100111111001111111000100110000001001111110011111100111111001111110011111100111111100000101110100100111111100010011011001100111111001111111000111101111010001111110011111101000010 8fef3f3f9dbf3f3f89813f3f3f3f3f3f82e93f89b33f3f8f7a3f3f42
EUC-JP 擾??攸??堰?????薏る?乙??循??B 101111101111000100111111001111111101101011000001001111110011111110110001111000010011111100111111001111110011111100111111100011111101100111011110101001001110101100111111101100101011010100111111001111111011110111011011001111110011111101000010 bef13f3fdac13f3fb1e13f3f3f3f3f8fd9dea4eb3fb2b53f3fbddb3f3f42
UTF-8 擾우씕攸놁궩堰쇈굥溜믧샒薏る봿乙삥듉循뗫젔B 11100110100100111011111011101100100110101011000011101100100101001001010111100110100101001011100011101011100001101000000111101010101101101010100111100101101000001011000011101100100001111000100011101010101101011010010111101111101001111000101111101011101011111010011111101100100000111001001011101000100101101000111111100011100000101000101111101011101101001011111111100100101110011001100111101100100000101010010111101011100100111000100111100101101111101010101011101011100101111010101111101100101000001001010001000010 e693beec9ab0ec9495e694b8eb8681eab6a9e5a0b0ec8788eab5a5efa78bebafa7ec8392e8968fe3828bebb4bfe4b999ec82a5eb9389e5beaaeb97abeca09442
UHC 擾우씕攸놁궩堰쇈굥溜믧샒薏る봿乙삥듉循뗫젔B 11101000111101101011111111101100100111011010101011101010111100101000011011101100100000101011101111100101111010001011110011100011100000101000101111101010111111101001001011101001100110001011111111101011111110111010101011101011100101001000011011101011111000001011101111100110100010101011110011100010111000001000101111101011101000001001001001000010 e8f6bfec9daaeaf286ec82bbe5e8bce3828beafe92e998bfebfbaaeb9486ebe0bbe68abce2e08beba09242

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
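
The kind of conversion shown in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the names `CHARSETS`, `encode_row` and `convert` are made up for the example, and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption that may differ from the page's own converter for some characters. Characters that a charset cannot represent are replaced with `?` (hex 3f), which is how the runs of `3f` in the table arise.

```python
# Minimal sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: these Python codec names approximate the charsets in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (bit string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in convert("B").items():
        print(label, "B", bits, hexa)
```

For the single letter B every charset listed here gives the same byte, 01000010 (hex 42), which matches the last byte of each row in the table. The charsets only start to differ once the input contains non-ASCII characters.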