To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??〓?????八襁 00111111001111111000000110101100001111110011111100111111001111110011111110010100101010101110010111110100 3f3f81ac3f3f3f3f3f94aae5f4
EUC-JP ??〓?????八襁 00111111001111111010001010101110001111110011111100111111001111110011111111001000101011001110101011110110 3f3fa2ae3f3f3f3f3fc8aceaf6
UTF-8 룶웡〓룶웡∼룶점八襁 111010111010001110110110111011001001101110100001111000111000000010010011111010111010001110110110111011001001101110100001111000101000100010111100111010111010001110110110111011001010000010010000111001011000010110101011111010001010010110000001 eba3b6ec9ba1e38093eba3b6ec9ba1e288bceba3b6eca090e585abe8a581
UHC 룶웡〓룶웡∼룶점八襁 1000111110101011101111111111110110100001111010111000111110101011101111111111110110100001101011011000111110101011110000011010000111111000101000101100101110111010 8fabbffda1eb8fabbffda1ad8fabc1a1f8a2cbba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)