What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???P????? 001111110011111100111111010100000011111100111111001111110011111100111111 3f3f3f503f3f3f3f3f
SJIS-WIN ??㎏P??㎏?? 0011111100111111100001110111001101010000001111110011111110000111011100110011111100111111 3f3f8773503f3f87733f3f
EUC-JP ???P????? 001111110011111100111111010100000011111100111111001111110011111100111111 3f3f3f503f3f3f3f3f
UTF-8 淋믪㎏P淋믪㎏淋숈 11101111101001111011010111101011101011111010101011100011100011101000111101010000111011111010011110110101111010111010111110101010111000111000111010001111111011111010011110110101111011001000100010001000 efa7b5ebafaae38e8f50efa7b5ebafaae38e8fefa7b5ec8888
UHC 淋믪㎏P淋믪㎏淋숈 1110110011111000100100101110110010100111101110000101000011101100111110001001001011101100101001111011100011101100111110001001100111101100 ecf892eca7b850ecf892eca7b8ecf899ec

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
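
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function name to_bit_strings is hypothetical, and the codec names (cp932 for SJIS-Win, cp949 for UHC, euc_jp for EUC-JP) are assumed to be the closest Python equivalents. Characters that a charset cannot represent are replaced with "?" (0x3f), which mirrors the 3f bytes in the table, but results for unmappable characters may still differ from what this page prints.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset and return (binary, hexadecimal) strings.

    Unmappable characters are replaced with '?' instead of raising an error.
    """
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte, zero-padded
    return binary, data.hex()


# Codec names below are assumptions about the nearest Python equivalents
# of the charsets listed in the table.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")

if __name__ == "__main__":
    sample = "㎏P"  # two characters that also appear in the table
    for charset in CHARSETS:
        binary, hexadecimal = to_bit_strings(sample, charset)
        print(charset, binary, hexadecimal)
```

For UTF-8, the sample "㎏P" encodes to the bytes e3 8e 8f 50, which matches the corresponding bytes in the UTF-8 row of the table.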