To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 繹??哀??哀????????蹂???ゆ?? 111000111000100000111111001111111000100010100011001111110011111110001000101000110011111100111111001111110011111100111111001111110011111100111111111001101111100000111111001111110011111110000010111001000011111100111111 e3883f3f88a33f3f88a33f3f3f3f3f3f3f3fe6f83f3f3f82e43f3f
EUC-JP 繹??哀??哀??獒?????蹂???ゆ?? 1110010111101000001111110011111110110000101001010011111100111111101100001010010100111111001111111000111111001011101110110011111100111111001111110011111100111111111011001111101000111111001111110011111110100100111001100011111100111111 e5e83f3fb0a53f3fb0a53f3f8fcbbb3f3f3f3f3fecfa3f3f3fa4e63f3f
UTF-8 繹먮젾哀잙젦哀잙젦獒쎈젵緣욏렆蹂뺞쵊溜ゆ쵊溜 111001111011100110111001111010111010100010101110111011001010000010111110111001011001001110000000111011001001111010011001111011001010000010100110111001011001001110000000111011001001111010011001111011001010000010100110111001111000110110010010111011001000111010001000111011001010000010110101111001111011011110100011111011001001101010001111111010111010000010000110111010001011100110000010111010111011101010011110111011001011010110001010111011111010011110001011111000111000001010000110111011001011010110001010111011111010011110001011 e7b9b9eba8aeeca0bee59380ec9e99eca0a6e59380ec9e99eca0a6e78d92ec8e88eca0b5e7b7a3ec9a8feba086e8b982ebba9eecb58aefa78be38286ecb58aefa78b
UHC 繹먮젾哀잙젦哀잙젦獒쎈젵緣욏렆蹂뺞쵊溜ゆ쵊溜 1110011010111010100100001110101110100000101100001110010011101110100111111110101110100000100111101110010011101110100111111110101110100000100111101110100010100011101111011110101110100000101010011110011011011110100111101110110110001110101000001110101110110011100101011110011010101100100011001110101011111110101010101110011010101100100011001110101011111110 e6ba90eba0b0e4ee9feba09ee4ee9feba09ee8a3bdeba0a9e6de9eed8ea0ebb395e6ac8ceafeaae6ac8ceafe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)