To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ゆ?墺?ⅹ???汝????オ???也ゆ?? 100110101100100010000010111001000011111110011010110100100011111111111010010010010011111100111111001111111001001111110000001111110011111100111111001111111000001101001001001111110011111100111111100101101110011110000010111001000011111100111111 9ac882e43f9ad23ffa493f3f3f93f03f3f3f3f83493f3f3f96e782e43f3f
EUC-JP 塋ゆ?墺??孼??汝????オ???也ゆ?? 11010100110010101010010011100110001111111101010011010100001111110011111110001111101110101100001100111111001111111100011011110010001111110011111100111111001111111010010110101010001111110011111100111111110011001110100110100100111001100011111100111111 d4caa4e63fd4d43f3f8fbac33f3fc6f23f3f3f3fa5aa3f3f3fcce9a4e63f3f
UTF-8 塋ゆ뿈墺드ⅹ孼껇뇮汝싧쩀呂잏オ嶪뤺뵷也ゆ룂銳 111001011010000110001011111000111000001010000110111010111011111110001000111001011010001010111010111010111001001110011100111000101000010110111001111001011010110110111100111010101011101110000111111010111000011110101110111001101011000110011101111011001000101110100111111011001010100110000000111011111010011010000000111011001001111010001111111000111000001010101010111001011011011010101010111010111010010010111010111010111011010110110111111001001011100110011111111000111000001010000110111010111010001110000010111010011000101010110011 e5a18be38286ebbf88e5a2baeb939ce285b9e5adbceabb87eb87aee6b19dec8ba7eca980efa680ec9e8fe382aae5b6aaeba4baebb5b7e4b99fe38286eba382e98ab3
UHC 塋ゆ뿈墺드ⅹ孼껇뇮汝싧쩀呂잏オ嶪뤺뵷也ゆ룂銳 1110011110101011101010101110011010010111100011111110011111110010101101011110010110100101101010101110010111101101100000111110100010000111100100111110011010100011100110101110010110100100100110101110010111111011100111111110011110101011101010101110010111110101100011111110100010010100101101011110010110100101101010101110011010001111100000111110011111100101 e7abaae6978fe7f2b5e5a5aae5ed83e88793e6a39ae5a49ae5fb9fe7abaae5f58fe894b5e5a5aae68f83e7e5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)