To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嚥≪?誼??攸??嚥〓?議??飮??筌??誼 1001101010001011100000011110000100111111100010110110001000111111001111111001110110111111001111110011111110011010100010111000000110101100001111111000101101100011001111110011111110011111010110100011111100111111111000101010001100111111001111111000101101100010 9a8b81e13f8b623f3f9dbf3f3f9a8b81ac3f8b633f3f9f5a3f3fe2a33f3f8b62
EUC-JP 嚥≪?誼??攸??嚥〓Ŧ議??飮??筌??誼 11010011111010111010001011100011001111111011010111000011001111110011111111011010110000010011111100111111110100111110101110100010101011101000111110101001101011111011010111000100001111110011111111011101101110110011111100111111111001001010010100111111001111111011010111000011 d3eba2e33fb5c33f3fdac13f3fd3eba2ae8fa9afb5c43f3fddbb3f3fe4a53f3fb5c3
UTF-8 嚥≪늾誼띰쭓攸낆젶嚥〓Ŧ議뤸에飮뉗졅筌뗫봾誼 1110010110011010101001011110001010001001101010101110101110001010101111101110100010101010101111001110101110011101101100001110110010101101100100111110011010010100101110001110101110000010100001101110110010100000101101101110010110011010101001011110001110000000100100111100010110100110111010001010110110110000111010111010010010111000111011001001011110010000111010011010001110101110111010111000100110010111111011001010000110000101111001111010110110001100111010111001011110101011111010111011010010111110111010001010101010111100 e59aa5e289aaeb8abee8aabceb9db0ecad93e694b8eb8286eca0b6e59aa5e38093c5a6e8adb0eba4b8ec9790e9a3aeeb8997eca185e7ad8ceb97abebb4bee8aabc
UHC 嚥≪늾誼띰쭓攸낆젶嚥〓Ŧ議뤸에飮뉗졅筌뗫봾誼 1110011010111111101000011110110010001000100001111110101111111110101101101110111110100111100010111110101011110010100001011110110010100000101010101110011010111111101000011110101110101000101011101110110010100001100011111110011010111111101000011110101111100110100001111110110010100000101101101110111110100111100010111110101110010100100001011110101111111110 e6bfa1ec8887ebfeb6efa78beaf285eca0aae6bfa1eba8aeeca18fe6bfa1ebe687eca0b6efa78beb9485ebfe

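SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (byte 3f), as in the ISO-8859-1 row above.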
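
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The function names and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions, as is the use of errors="replace" to reproduce the "?" (3f) bytes for characters a charset cannot represent.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary, hexadecimal) strings."""
    # Unencodable characters become "?" (0x3f), assumed to match the page.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

Called with the first character of the table's input, to_bit_strings("嚥", "utf-8") yields the hexadecimal string e59aa5, which agrees with the first three bytes of the UTF-8 row. Results for the legacy charsets may differ slightly from the page, since codec tables vary between implementations.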