To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ?暴????汗荊 0011111110010110010111000011111100111111001111110011111110001010101111101000110001110100 3f965c3f3f3f3f8abe8c74
EUC-JP ?暴????汗荊 0011111111001011101111010011111100111111001111110011111110110100110000001011011111010101 3fcbbd3f3f3f3fb4c0b7d5
UTF-8 뤋暴쫸샘렏뤋汗荊 111010111010010010001011111001101001101010110100111011001010101110111000111011001000001110011000111010111010000010001111111010111010010010001011111001101011000110010111111010001000110110001010 eba48be69ab4ecabb8ec8398eba08feba48be6b197e88d8a
UHC 뤋暴쫸샘렏뤋汗荊 10001111101110111111100011101100101001101000111110111011111110011000111010100101100011111011101111111001110100101111101110101010 8fbbf8eca68fbbf98ea58fbbf9d2fbaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)