To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?血?閥??血?閥?^ 001111111000110010001100001111111001010010110100001111110011111110001100100011000011111110010100101101000011111101011110 3f8c8c3f94b43f3f8c8c3f94b43f5e
EUC-JP ?血?閥??血?閥?^ 001111111011011111101100001111111100100010110110001111110011111110110111111011000011111111001000101101100011111101011110 3fb7ec3fc8b63f3fb7ec3fc8b63f5e
UTF-8 뤿血ㆀ閥쩊뤿血ㆀ閥쩊^ 11101011101001001011111111101000101000011000000011100011100001101000000011101001100101101010010111101100101010011000101011101011101001001011111111101000101000011000000011100011100001101000000011101001100101101010010111101100101010011000101001011110 eba4bfe8a180e38680e996a5eca98aeba4bfe8a180e38680e996a5eca98a5e
UHC 뤿血ㆀ閥쩊뤿血ㆀ閥쩊^ 100011111110101111111010111011001010010011110000110110111110110010100101010001001000111111101011111110101110110010100100111100001101101111101100101001010100010001011110 8febfaeca4f0dbeca5448febfaeca4f0dbeca5445e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
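
A conversion like the one in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name encode_to_bits is hypothetical, and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, euc_jp for EUC-JP, cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (3f), which matches the ISO-8859-1 row above.

```python
def encode_to_bits(text, charset):
    """Encode text in the given charset; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent, instead of raising UnicodeEncodeError.
    raw = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = encode_to_bits("血^", codec)
    print(label, binary, hexadecimal)
```

Each output line has the same columns as the table, minus the character column: the charset label, the bit string in binary, and the same bytes in hexadecimal.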