To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???艶??兀??鶯 00111111001111110011111110001001100100000011111100111111100110010101100100111111001111111110100111110010 3f3f3f89903f3f99593f3fe9f2
EUC-JP ???艶??兀??鶯 00111111001111110011111110110001111100000011111100111111110100011011101000111111001111111111001011110100 3f3f3fb1f03f3fd1ba3f3ff2f4
UTF-8 了싪궏艶쎾났兀뗰스鶯 111011111010011010111010111011001000101110101010111010101011011010001111111010001000100110110110111011001000111010111110111010111000001010101100111001011000010110000000111010111001011110110000111011001000101010100100111010011011011010101111 efa6baec8baaeab68fe889b6ec8ebeeb82ace58580eb97b0ec8aa4e9b6af
UHC 了싪궏艶쎾났兀뗰스鶯 1110100011100111100110101110100010000010101001011110011011111101100110111110010110110011101101011110100010110100100010111110111110111101101110101110010110100011 e8e79ae882a5e6fd9be5b3b5e8b48befbdbae5a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)